BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014761
(419 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 291/407 (71%), Positives = 337/407 (82%), Gaps = 3/407 (0%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
MN L F L++L+ P SDI++LFETWCK+HGK+Y+S++E+ RLK+FEDNY FV
Sbjct: 1 MNFLYIFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFV 60
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
T+HN+ GNSS++L+LNAFADLTH EFK S LG SAA ++ R +++ G + D+PAS
Sbjct: 61 TKHNSKGNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHR---NLEITGVVGDIPAS 117
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
IDWR KG VT VKDQ SCGACW+FSATGAIEGINKIVTGSLVSLSEQELI+CD+SYN GC
Sbjct: 118 IDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGC 177
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGGLMDYA+QFVI NHGIDTE+DYPYR + G CNK ++ R +VTID Y DVPENNEKQLL
Sbjct: 178 GGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLL 237
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWG 300
QAV AQPVSVGICGSERAFQ+YS GIFTGPCSTSLDHAVLIVGY SENGVDYWI+KNSWG
Sbjct: 238 QAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG 297
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGET 360
WGM GYMHMQRN+GNS G+CGINMLASYP KT NPPP PPPGPT+C+LLTYCAAGET
Sbjct: 298 TGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGET 357
Query: 361 CCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CCC GIC+SWKCCG SAVCC D +CCP +YP+CD+ ++ C
Sbjct: 358 CCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCF 404
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 284/387 (73%), Positives = 328/387 (84%), Gaps = 6/387 (1%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
+I LFETWC+QHGK Y+S++EK RLK+F+DNY FVT+HN+ GNSS+TLSLNAFADLTH
Sbjct: 25 EIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTH 84
Query: 84 QEFKASFLGFSAA---SIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
EFKAS LG S+A S++ DR ++ Q P + DVPAS+DWRK GAVT+VKDQ +CGA
Sbjct: 85 HEFKASRLGLSSAASASLNVDR---SNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGA 141
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CW+FSATGAIEGINKIVTGSLVSLSEQEL+DCD+SYN+GC GG+MDYA+QFVI NHGIDT
Sbjct: 142 CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDT 201
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY+G+ CNK+KL RH+VTIDGY DVP+NNEK+LL+AV QPVSVGICGSERAFQ
Sbjct: 202 EEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQ 261
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYS GIFTGPCSTSLDHAVLIVGY SENGVDYWI+KNSWG WGM+GYMHMQRN+G+S G
Sbjct: 262 LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRG 321
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380
+CGINMLASYP KT NPPP PPGPTRC L T+C GETCCC I GICLSWKCC
Sbjct: 322 LCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELD 381
Query: 381 SAVCCSDHRYCCPSNYPICDSVRHQCL 407
SAVCC D R+CCP +YP+CD+ R+ CL
Sbjct: 382 SAVCCKDGRHCCPRDYPVCDTTRNICL 408
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 288/394 (73%), Positives = 329/394 (83%), Gaps = 3/394 (0%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
S+++ELFE WC +HGK+YSS +EK RL +F DNY FVT HNN+ NSS+TLSLN++ADLT
Sbjct: 23 SNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLT 82
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
H EFK S LGFS A + R Q P RDVP S+DWRKKGAVT VKDQ SCGACW
Sbjct: 83 HHEFKVSRLGFSPALRNF---RPVLPQEPSLPRDVPDSLDWRKKGAVTAVKDQGSCGACW 139
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
+FSATGA+EGIN+I+TGSL+SLSEQELIDCDRSYNSGCGGGLMDYAYQFVI NHGIDTE
Sbjct: 140 SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY+ + G C K KL R++VTIDGY D+P N+E +LLQAV AQPVSVGICGSERAFQLY
Sbjct: 200 DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
S GIF+GPCSTSLDHAVLIVGY SENGVDYWI+KNSWG+SWGM+GYMHMQRN+GNS G+C
Sbjct: 260 SKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVC 319
Query: 323 GINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSA 382
GIN LASYPTKT NPPPSPPPGPT+CS+LT CAAGETCCC LG+CLSWKCCG SSA
Sbjct: 320 GINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSA 379
Query: 383 VCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFT 416
VCC D R+CCP +YPICD+ R+ CL ++ + T
Sbjct: 380 VCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRT 413
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 573 bits (1476), Expect = e-161, Method: Compositional matrix adjust.
Identities = 289/421 (68%), Positives = 339/421 (80%), Gaps = 6/421 (1%)
Query: 1 MNSL-AFFLLSILL--LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
MN L A FL+++L LS + SDI++LFE+W K+HGK Y+S+++K R KIFE+NY
Sbjct: 1 MNFLSALFLITLLFFNLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENY 60
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHD-RRRNASVQSPGNLRD 116
FV +HN+ GNSS+TLSLNAFADLTH EFKAS LG SA S RRN + + D
Sbjct: 61 EFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLHDF--VGD 118
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
VP SIDWRKKGAV++VKDQ +CGACW+FSATGAIEGINKIVTGSLVSLSEQEL+DCDRSY
Sbjct: 119 VPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSY 178
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N+GC GGLMDYAYQFVI+N+GIDTE+DYPY+ + CNK+KL RH+VTIDGY DVP+NNE
Sbjct: 179 NNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNE 238
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
K+LL+AV AQPVSVGICGSERAFQLYS GIFTGPCSTSLDHAVLIVGY SENGVDYWI+K
Sbjct: 239 KELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVK 298
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCA 356
NSWG WG+NGYM+M RN+GNS G+CGINMLAS+P KT NPPP PPGPT+C L T C
Sbjct: 299 NSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCG 358
Query: 357 AGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFT 416
GETCCC I G+C SWKCC SAVCC D +CCP +YP+CD+ R+ CL VS+ +F
Sbjct: 359 EGETCCCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLKVSIFSAFN 418
Query: 417 V 417
+
Sbjct: 419 L 419
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 559 bits (1441), Expect = e-157, Method: Compositional matrix adjust.
Identities = 267/391 (68%), Positives = 319/391 (81%), Gaps = 2/391 (0%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
DI+ELF+ WC++HGK Y SE+E+QQR++IF+DN+ FVTQHN + N++++LSLNAFADLTH
Sbjct: 27 DISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTH 86
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
EFKAS LG S ++ QS G VP S+DWRKKGAVT VKDQ SCGACW+
Sbjct: 87 HEFKASRLGLSVSAPSVIMASKG--QSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FSATGA+EGIN+IVTG L+SLSEQELIDCD+SYN+GC GGLMDYA++FVIKNHGIDTEKD
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 204
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY+ + G C K KL + +VTID Y V N+EK L++AV AQPVSVGICGSERAFQLYS
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
SGIF+GPCSTSLDHAVLIVGY S+NGVDYWI+KNSWG+SWGM+G+MHMQRNT NS G+CG
Sbjct: 265 SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCG 324
Query: 324 INMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAV 383
INMLASYP KT NPPP PPGPT+C+L TYC++GETCCC + G+C SWKCC SAV
Sbjct: 325 INMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAV 384
Query: 384 CCSDHRYCCPSNYPICDSVRHQCLTVSLKFS 414
CC D R+CCP +YP+CD+ R CL + F+
Sbjct: 385 CCKDGRHCCPHDYPVCDTTRSLCLKKTGNFT 415
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 266/391 (68%), Positives = 318/391 (81%), Gaps = 2/391 (0%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
DI+ELF+ WC++HGK Y SE+E+QQR++IF+DN+ FVTQHN + N++++LSLNAFADLTH
Sbjct: 27 DISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTH 86
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
EFKAS LG S ++ QS G VP S+DWRKKGAVT VKDQ SCGACW+
Sbjct: 87 HEFKASRLGLSVSAPSVIMASKG--QSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FSATGA+EGIN+IVTG L+SLSEQELIDCD+SYN+GC GGLMDYA++FVIKNHGIDTEKD
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 204
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY+ + G C K KL + +VTID Y V N+EK L++AV AQPVSVGICGSERAFQLYS
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
GIF+GPCSTSLDHAVLIVGY S+NGVDYWI+KNSWG+SWGM+G+MHMQRNT NS G+CG
Sbjct: 265 RGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCG 324
Query: 324 INMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAV 383
INMLASYP KT NPPP PPGPT+C+L TYC++GETCCC + G+C SWKCC SAV
Sbjct: 325 INMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAV 384
Query: 384 CCSDHRYCCPSNYPICDSVRHQCLTVSLKFS 414
CC D R+CCP +YP+CD+ R CL + F+
Sbjct: 385 CCKDGRHCCPHDYPVCDTTRSLCLKKTGNFT 415
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 277/414 (66%), Positives = 330/414 (79%), Gaps = 8/414 (1%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
SL FF L ++ S DI+ELF+ WC++HGK Y SE+E+QQR++IF+DN+ FVTQ
Sbjct: 10 SLTFFFLLLVSSPSSS----DDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQ 65
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
HN + N++++LSLNAFADLTH EFKAS LG S ++ + QS G VP S+D
Sbjct: 66 HNLITNATYSLSLNAFADLTHHEFKASRLGLSVSA--SSLIMASKGQSLGGNAKVPDSVD 123
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182
WRKKGAVT VKDQ SCGACW+FSATGA+EGIN+IVTG L+SLSEQELIDCD+SYN+GC G
Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242
GLMDYA++FVIKNHGIDTEKDYPY+ + G C K KL + +VTID Y V N+EK L +A
Sbjct: 184 GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREA 243
Query: 243 VVAQPVSVGICGSERAFQLYS--SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWG 300
V AQPVSVGICGSERAFQLYS SGIF+GPCSTSLDHAVLIVGY S+NGVDYWI+KNSWG
Sbjct: 244 VAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWG 303
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGET 360
+SWGM+G+MHMQRNTGNS GICGINMLASYP KT NPPP PPGPT+C+L TYC+AGET
Sbjct: 304 KSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGET 363
Query: 361 CCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFS 414
CCC ++ G+C SWKCC SAVCCSD R+CCP +YP+CD+ R CL + F+
Sbjct: 364 CCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFT 417
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 267/393 (67%), Positives = 317/393 (80%), Gaps = 9/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
DI+ELF+ WC++HGK Y SE+E+QQR++IF+DN+ FVTQHN + N++++LSLNAFADLTH
Sbjct: 25 DISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTH 84
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
EFKAS LG S ++ QS G VP S+DWRKKGAVT VKDQ SCGACW+
Sbjct: 85 HEFKASRLGLSVSAPSVIMASKG--QSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 142
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FSATGA+EGIN+IVTG L+SLSEQELIDCD+SYN+GC GGLMDYA++FVIKNHGIDTEKD
Sbjct: 143 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 202
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY+ + G C K KL + +VTID Y V N+EK L++AV AQPVSVGICGSERAFQLYS
Sbjct: 203 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 262
Query: 264 S-------GIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
S GIF+GPCSTSLDHAVLIVGY S+NGVDYWI+KNSWG+SWGM+G+MHMQRNT
Sbjct: 263 SKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 322
Query: 317 NSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKC 376
NS G+CGINMLASYP KT NPPP PPGPT+C+L TYC++GETCCC + G+C SWKC
Sbjct: 323 NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 382
Query: 377 CGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
C SAVCC D R+CCP +YP+CD+ R CL V
Sbjct: 383 CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKV 415
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 543 bits (1399), Expect = e-152, Method: Compositional matrix adjust.
Identities = 265/403 (65%), Positives = 317/403 (78%), Gaps = 5/403 (1%)
Query: 6 FFLLSILLLS-SLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
+ +SIL+L+ ++ S +LFE WC+Q+GK YSSE+EK RLK+FE+N+AFVTQHN
Sbjct: 5 LWAVSILILAVHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHN 64
Query: 65 NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWR 124
+M N+S+TL+LNAFADLTH EFKAS LGFS R SV +P VP ++DWR
Sbjct: 65 SMANASYTLALNAFADLTHHEFKASRLGFSPGRAQSIR----SVGTPVQELHVPPAVDWR 120
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 184
K GAVT VKDQ +CG CW+FS TGAIEGINKIVTGSLVSLSEQEL+DCDRSYNSGC GGL
Sbjct: 121 KSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGL 180
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 244
MDYAYQFVIKN GID+E DYPY G CNK+KL +HIVTIDGY D+P N+EKQLLQ V
Sbjct: 181 MDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVA 240
Query: 245 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
QPVSVGICGSE+ FQLYS G++TGPCS++LDHAVLIVGY +E+GVD+WI+KNSWG WG
Sbjct: 241 KQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWG 300
Query: 305 MNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCG 364
M GY+HM RN G + GICGINMLASYP KT NPPP P PGPT+C + C+ GETCCC
Sbjct: 301 MRGYIHMLRNNGTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCS 360
Query: 365 SSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
+G+CLSW CC SAVCC ++ YCCP+++PICD+ R++CL
Sbjct: 361 WRFIGVCLSWNCCTAKSAVCCDNNNYCCPASHPICDTKRNRCL 403
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 276/400 (69%), Positives = 313/400 (78%), Gaps = 9/400 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN-----SSFTLSLNA 77
SD +ELFE WCK+H K YSSE+EK RLK+FEDNYAFV QHN N SS+TLSLNA
Sbjct: 27 SDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNA 86
Query: 78 FADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQAS 137
FADLTH EFK + LG + R +N Q +L +P+ IDWR+ GAVT VKDQAS
Sbjct: 87 FADLTHHEFKTTRLGLPLTLLRFKRPQN---QQSRDLLHIPSQIDWRQSGAVTPVKDQAS 143
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD SYNSGCGGGLMD+AYQFVI N G
Sbjct: 144 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKG 203
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
IDTE DYPY+ + C+K KL R VTI+ Y DVP + E+++L+AV +QPVSVGICGSER
Sbjct: 204 IDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPS-EEEILKAVASQPVSVGICGSER 262
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQLYS GIFTGPCST LDHAVLIVGY SENGVDYWI+KNSWG+ WGMNGY+HM RN+GN
Sbjct: 263 EFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGN 322
Query: 318 SLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCC 377
S GICGIN LASYP KT NPP PPPGP RC+L T+C+ GETCCC S LGIC SWKCC
Sbjct: 323 SKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWKCC 382
Query: 378 GFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTV 417
G +SAVCC D R+CCP +YPICD+ R QCL + + T+
Sbjct: 383 GLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTI 422
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 264/403 (65%), Positives = 298/403 (73%), Gaps = 13/403 (3%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-------------SF 71
I F+ WC +HGKAY++ +E+ RL +F DN AFV HN + S+
Sbjct: 32 IEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSY 91
Query: 72 TLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTE 131
TL+LNAFADLTH+EF+A+ LG A R G VP ++DWRK GAVT+
Sbjct: 92 TLALNAFADLTHEEFRAARLGRIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTK 151
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191
VKDQ SCGACW+FSATGA+EGINKI TGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY+F
Sbjct: 152 VKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKF 211
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251
VIKN GIDTE+DYPYR G CNK KL + +VTIDGY DVP N E LLQAV QPVSVG
Sbjct: 212 VIKNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVG 271
Query: 252 ICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311
ICGS RAFQLY GIF GPC TSLDHAVLIVGY SE G DYWI+KNSWG SWGM GYMHM
Sbjct: 272 ICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHM 331
Query: 312 QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371
RNTG+S G+CGINM+AS+PTKT NPPPSP PGPT+CSLLTYC G TCCC +LG C
Sbjct: 332 HRNTGDSKGVCGINMMASFPTKTSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGFC 391
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFS 414
LSW CC +AVCC D+RYCCP +YP+CD+ R QCL S FS
Sbjct: 392 LSWSCCELDNAVCCKDNRYCCPHDYPVCDTGRGQCLKASGNFS 434
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 509 bits (1311), Expect = e-142, Method: Compositional matrix adjust.
Identities = 264/394 (67%), Positives = 300/394 (76%), Gaps = 11/394 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS--------SFTLSLNAFA 79
LF+ WC +HGKAY++ +E+ RL +F DN AFV HN N+ S+TL+LNAFA
Sbjct: 40 LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFA 99
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQS--PGNLRDVPASIDWRKKGAVTEVKDQAS 137
DLTH+EF+A+ LG AA R A V G L VP ++DWR+ GAVT+VKDQ S
Sbjct: 100 DLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGS 159
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CGACW+FSATGA+EGINKI TGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY+FV+KN G
Sbjct: 160 CGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGG 219
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
IDTE+DYPYR G CNK KL + IVTIDGY DVP N E LLQAV QPVSVGICGS R
Sbjct: 220 IDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSAR 279
Query: 258 AFQLYS-SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
AFQLYS GIF GPC TSLDHAVLIVGY SE G DYWI+KNSWG SWGM GYMHM RNTG
Sbjct: 280 AFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTG 339
Query: 317 NSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKC 376
+S G+CGINM+AS+PTK+ NPPPSP PGPT+CSLLTYC G TCCC ILG CLSW C
Sbjct: 340 DSKGVCGINMMASFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRILGFCLSWSC 399
Query: 377 CGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVS 410
C +AVCC D++ CCP +YP+CD+ R CL S
Sbjct: 400 CELDNAVCCKDNKSCCPHDYPVCDTDRGLCLKAS 433
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 509 bits (1311), Expect = e-141, Method: Compositional matrix adjust.
Identities = 257/399 (64%), Positives = 296/399 (74%), Gaps = 7/399 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM------GNSSFTLSLN 76
SD FE WC +HGKAY++ E+ RL F +N AFV HN+ G S+TL+LN
Sbjct: 33 SDYEAQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALN 92
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQS-PGNLRDVPASIDWRKKGAVTEVKDQ 135
AFADLTH EF+A+ LG A + S G + VP ++DWR+ GAVT+VKDQ
Sbjct: 93 AFADLTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQ 152
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
SCGACW+FSATGA+EGINKI TGSL+SLSEQELIDCDRSYN+GCGGGLM YAY+FVIKN
Sbjct: 153 GSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKN 212
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
GIDTE DYP+R G CNK KL +H+VTIDGYK+VP + E LLQAV QP+SVGICGS
Sbjct: 213 GGIDTEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGS 272
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
RAFQLYS GIF GPC TSLDHAVLIVGY SE G DYWI+KNSWG WGM GYMHM RNT
Sbjct: 273 ARAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNT 332
Query: 316 GNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWK 375
G+S GICGINM+AS+PTKT NPPPSP PGPT+CS+ T C G TCCC LG CLSW
Sbjct: 333 GSSSGICGINMMASFPTKTSPNPPPSPGPGPTKCSVFTSCPEGSTCCCSWRALGFCLSWS 392
Query: 376 CCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFS 414
CC +AVCCSD+R CCP +YPICD+ R +CL + FS
Sbjct: 393 CCELDNAVCCSDNRSCCPHDYPICDTARGRCLKGNGNFS 431
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 261/408 (63%), Positives = 291/408 (71%), Gaps = 9/408 (2%)
Query: 20 NYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS--------SF 71
N + LFE WC +HGKAY+S E+ RL F DN AFV HN G S+
Sbjct: 33 NLSAAYEPLFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSY 92
Query: 72 TLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTE 131
TL+LNAFADLTH EF+A+ LG A + VP ++DWR+ GAVT+
Sbjct: 93 TLALNAFADLTHAEFRAARLGRLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTK 152
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191
VKDQ SCGACW+FSATGAIEGINKI TGSL+SLSEQELIDCDRSYN+GCGGGLMDYAY+F
Sbjct: 153 VKDQGSCGACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRF 212
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251
VIKN GIDTE DYPYR G CNK KL RH+VTIDGY DVP N E LLQAV QP+SVG
Sbjct: 213 VIKNGGIDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVG 272
Query: 252 ICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311
ICGS RAFQLYS GIF GPC TSLDHAVLIVGY SE G DYWI+KNSWG WGM GYMHM
Sbjct: 273 ICGSARAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHM 332
Query: 312 QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371
RNTG+S GICGINM+AS+PTKT NPPPSP PGPT+CS T C G TCCC LG C
Sbjct: 333 HRNTGSSSGICGINMMASFPTKTSPNPPPSPGPGPTKCSAFTSCPEGSTCCCSWRALGFC 392
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVR-HQCLTVSLKFSFTVK 418
LSW CC +AVCC D+R CCP +YPICD+ R CL+ K + K
Sbjct: 393 LSWSCCELDNAVCCKDNRSCCPHDYPICDTDRGRTCLSSREKEAVLAK 440
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 499 bits (1285), Expect = e-139, Method: Compositional matrix adjust.
Identities = 257/399 (64%), Positives = 296/399 (74%), Gaps = 7/399 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM------GNSSFTLSLN 76
SD FE WC +HGKAY++ E+ RL F +N AFV HN+ G S+TL+LN
Sbjct: 33 SDYEAQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALN 92
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQS-PGNLRDVPASIDWRKKGAVTEVKDQ 135
AFADLTH EF+A+ LG A + S G + VP ++DWR+ GAVT+VKDQ
Sbjct: 93 AFADLTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQ 152
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
SCGACW+FSATGA+EGINKI TGSL+SLSEQELIDCDRSYN+GCGGGLM YAY+FVIKN
Sbjct: 153 GSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKN 212
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
GIDTE DYP+R G CNK KL +H+VTIDGYK+VP + E LLQAV QP+SVGICGS
Sbjct: 213 GGIDTEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGS 272
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
RAFQLYS GIF GPC TSLDHAVLIVGY SE G DYWI+KNSWG WGM GYMHM RNT
Sbjct: 273 ARAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNT 332
Query: 316 GNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWK 375
G+S GICGINM+AS+PTKT NPPPSP PGPT+CS+ T C G TCCC LG CLSW
Sbjct: 333 GSSSGICGINMMASFPTKTNPNPPPSPGPGPTKCSVFTSCPEGSTCCCSWRALGFCLSWS 392
Query: 376 CCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFS 414
CC +AVCCSD+R CCP +YPICD+ R +CL + FS
Sbjct: 393 CCELDNAVCCSDNRSCCPHDYPICDTARGRCLKGNGNFS 431
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 254/383 (66%), Positives = 295/383 (77%), Gaps = 2/383 (0%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
FE WC +HG++Y++ E+ RL F DN AFV HN +S+ L+LNAFADLTH EF+A
Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNG-APASYALALNAFADLTHDEFRA 96
Query: 89 SFLGFSAASIDHDRRRNAS-VQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
+ LG AA+ R A + G + VP ++DWR+ GAVT+VKDQ SCGACW+FSAT
Sbjct: 97 ARLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSAT 156
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
GA+EGINKI TGSL+SLSEQELIDCDRSYNSGCGGGLMDYAY+FV+KN GIDTE DYPYR
Sbjct: 157 GAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYR 216
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G CNK KL R +VTIDGYKDVP NNE LLQAV QPVSVGICGS RAFQLYS GIF
Sbjct: 217 ETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIF 276
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
GPC TSLDHA+LIVGY SE G DYWI+KNSWG SWGM GYM+M RNTGNS G+CGIN +
Sbjct: 277 DGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 336
Query: 328 ASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSD 387
S+PTK+ NPPPSP PGPT+CSLLTYC G TCCC +LG+CLSW CC +AVCC D
Sbjct: 337 PSFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKD 396
Query: 388 HRYCCPSNYPICDSVRHQCLTVS 410
+RYCCP +YP+CD+ +C +
Sbjct: 397 NRYCCPHDYPVCDTASQRCFKAN 419
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 494 bits (1272), Expect = e-137, Method: Compositional matrix adjust.
Identities = 252/382 (65%), Positives = 293/382 (76%), Gaps = 1/382 (0%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
FE WC +HG++Y++ E+ RL F DN AFV HN +S+ L+LNAFADLTH EF+A
Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNG-APASYALALNAFADLTHDEFRA 96
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+ LG AA+ + G + VP ++DWR+ GAVT+VKDQ SCGACW+FSATG
Sbjct: 97 ARLGRLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 156
Query: 149 AIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
A+EGINKI TGSL+SLSEQELIDCDRSYNSGCGGGLMDYAY+FV+KN GIDTE DYPYR
Sbjct: 157 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 216
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
G CNK KL R +VTIDGYKDVP NNE LLQAV QPVSVGICGS RAFQLYS GIF
Sbjct: 217 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 276
Query: 269 GPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLA 328
GPC TSLDHA+LIVGY SE G DYWI+KNSWG SWGM GYM+M RNTGNS G+CGIN +
Sbjct: 277 GPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMP 336
Query: 329 SYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDH 388
S+PTK+ NPPPSP PGPT+CSLLTYC G TCCC +LG+CLSW CC +AVCC D+
Sbjct: 337 SFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKDN 396
Query: 389 RYCCPSNYPICDSVRHQCLTVS 410
RYCCP +YP+CD+ +C +
Sbjct: 397 RYCCPHDYPVCDTASQRCFKAN 418
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 220/410 (53%), Positives = 271/410 (66%), Gaps = 15/410 (3%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L I EL+E W QH KAY+ EKQ R +F+DN+ ++ QHNN GN
Sbjct: 24 FSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGEKQNRFSVFKDNFLYIHQHNNQGN 83
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP----GNLRDVPASIDWR 124
S+ L LN FADL+H+EFKA++LG A +D +R + S SP + D+P SIDWR
Sbjct: 84 PSYKLGLNQFADLSHEEFKATYLG---AKLDTKKRLSNS-PSPRYQYSDGEDLPESIDWR 139
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 184
+KGAVT VKDQ SCG+CWAFS A+EGIN+IVTG+L SLSEQEL+DCD SYN GC GGL
Sbjct: 140 EKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGL 199
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 244
MDYA+QF+I N G+D+E DYPY+ G C+ + N H+VTID Y+DVPEN+EK L +A
Sbjct: 200 MDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAA 259
Query: 245 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
QP+SV I S RAFQ Y SG+FT C T LDH V +VGY SE+G DYWI+KNSWG+SWG
Sbjct: 260 NQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVTLVGYGSESGTDYWIVKNSWGKSWG 319
Query: 305 MNGYMHMQRN-TGNSLGICGINMLASYPTKTG------QNPPPSPPPGPTRCSLLTYCAA 357
G++ +QRN G S G+CGI M ASYP K G PPSP PT C C
Sbjct: 320 EKGFIRLQRNIEGVSTGMCGIAMEASYPLKKGANPPNPGPSPPSPVKPPTVCDNYYSCPE 379
Query: 358 GETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
TCCC G C +W CC +SA CC DH CCP+++P+CD CL
Sbjct: 380 SNTCCCMYDFGGYCYAWGCCPLNSATCCDDHYSCCPNDHPVCDLDAQTCL 429
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 220/420 (52%), Positives = 276/420 (65%), Gaps = 14/420 (3%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L I EL+E W QH KAY+ EKQ++ +F+DN+ ++ QHNN GN
Sbjct: 24 FSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDEKQKKFSVFKDNFLYIHQHNNQGN 83
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR--RNASVQSPGNL-RDVPASIDWRK 125
S+ L LN FADL+H+EFKA++LG +D +R R+ S + ++ D+P SIDWR+
Sbjct: 84 PSYKLGLNQFADLSHEEFKAAYLG---TKLDAKKRLSRSPSPRYQYSVGEDLPESIDWRE 140
Query: 126 KGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLM 185
KGAVT VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQEL+DCD SYN GC GGLM
Sbjct: 141 KGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLM 200
Query: 186 DYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA 245
DYA+QF+I N G+D+E DYPY+ G C+ + N H+VTID Y+DVPEN+EK L +A
Sbjct: 201 DYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAAN 260
Query: 246 QPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 305
QP+SV I S RAFQ Y SG+FT C T LDH V +VGY SE+G+DYW++KNSWG SWG
Sbjct: 261 QPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVTLVGYGSESGIDYWLVKNSWGNSWGE 320
Query: 306 NGYMHMQRN-TGNSLGICGINMLASYPTKTG------QNPPPSPPPGPTRCSLLTYCAAG 358
G++ +QRN G S G+CGI M ASYP K G PPSP PT C C
Sbjct: 321 KGFIKLQRNLEGASTGMCGIAMEASYPVKKGANPPNPGPSPPSPVKPPTVCDNYYSCPES 380
Query: 359 ETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
TCCC G C +W CC +SA CC DH CCPS++P+CD CL S K F K
Sbjct: 381 NTCCCMYDFGGYCYAWGCCPLNSATCCDDHYSCCPSDHPVCDLDAQTCLK-SRKDPFGTK 439
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 201/383 (52%), Positives = 260/383 (67%), Gaps = 9/383 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+++E W +HGK Y++ EK++R +IF+DN FV + N++ ++ L L FADLT++E+
Sbjct: 50 KMYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEY 109
Query: 87 KASFLGFSAASIDHDR--RRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
+A +LG + R R + GN D+P+ +DWR+KGAVTEVKDQ CG+CWAF
Sbjct: 110 RAMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAF 169
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
S G++EGIN+IVTG L+SLSEQEL+DCD++YN GC GGLMDYA++F+IKN GID+E DY
Sbjct: 170 STVGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADY 229
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PYR C+ + N H+VTIDGY+DVPEN+E+ L +AV QPVSV I R FQLY S
Sbjct: 230 PYRASDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQS 289
Query: 265 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS-LGICG 323
G+FTG C T+LDH V+ VGY +ENG+DYWI++NSWG WG +GY+ M+RN ++ G CG
Sbjct: 290 GVFTGRCGTNLDHGVVAVGYGTENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCG 349
Query: 324 INMLASYPTKTGQNPPPSPPPGPTRCSLLTYC------AAGETCCCGSSILGICLSWKCC 377
I M ASYPTK GQNPP P P+ T C TCCC G C W CC
Sbjct: 350 IAMEASYPTKKGQNPPKPGPSPPSPVRPPTVCDEYYSRPEATTCCCVYEYGGFCFGWGCC 409
Query: 378 GFSSAVCCSDHRYCCPSNYPICD 400
SA CC DH CCP +YPICD
Sbjct: 410 PLESATCCDDHYSCCPHDYPICD 432
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 215/440 (48%), Positives = 278/440 (63%), Gaps = 31/440 (7%)
Query: 6 FFLLSILLL------------SSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIF 53
F+ LS+ L +P ++ L+E W ++GKAY++ EK++R +IF
Sbjct: 14 FYFLSVCLAIDMSIIDYNLKHGQVPERTEAETLRLYEMWLVKYGKAYNALGEKERRFEIF 73
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGN 113
+DN FV QHN++GN S+ L LN FADL+++E++A++LG +D RR +S
Sbjct: 74 KDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLG---TRMDGKRRLLGGPKSARY 130
Query: 114 L----RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
L D+P S+DWR+KGAV VKDQ CG+CWAFS GA+EGIN+IVTG+L SLSEQEL
Sbjct: 131 LFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQEL 190
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
+DCD+ YN GC GGLMDYA++F++KN GIDTE+DYPY+ C+ + N +VTIDGY+
Sbjct: 191 VDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNRKNARVVTIDGYE 250
Query: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENG 289
DVP+N+EK L +AV QPVSV I RAFQLY SG+FTG C T LDH V+ VGY +ENG
Sbjct: 251 DVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGTQLDHGVVAVGYGTENG 310
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNS-LGICGINMLASYPTKTG----------QNP 338
VDYW+++NSWG +WG NGY+ M+RN ++ G CGI M ASYPTK G +P
Sbjct: 311 VDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYPTKKGANPPNPGPSPPSP 370
Query: 339 PPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPI 398
PP + C C AG TCCC C W CC SA CC DH CCP YP+
Sbjct: 371 VNPSPPPSSECDDYYSCPAGSTCCCIYPYGDYCFGWGCCPLESATCCDDHNSCCPHEYPV 430
Query: 399 CDSVRHQCLTVSLKFSFTVK 418
CD C +S F VK
Sbjct: 431 CDLEAGTC-RMSKNNPFGVK 449
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 212/413 (51%), Positives = 265/413 (64%), Gaps = 9/413 (2%)
Query: 13 LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFT 72
++SS L I EL+E W +H +AY+ EKQ+R +F+DN+ ++ +HN GN S+
Sbjct: 26 IISSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN-QGNRSYK 84
Query: 73 LSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEV 132
L LN FADL+H+EFKA++LG + R + + D+P SIDWR+KGAVT V
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKLDTKKRLSRPPSRRYQYSDGEDLPESIDWREKGAVTSV 144
Query: 133 KDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFV 192
KDQ SCG+CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+
Sbjct: 145 KDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFI 204
Query: 193 IKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGI 252
I N G+D+E+DYPY G C+ + N H+VTID Y+DVPEN+EK L +A QP+SV I
Sbjct: 205 INNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAI 264
Query: 253 CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
S R FQ Y SG+FT C T LDH V +VGY SE+G DYW +KNSWG+SWG G++ +Q
Sbjct: 265 EASGREFQFYDSGVFTSTCGTQLDHGVTLVGYGSESGTDYWTVKNSWGKSWGEEGFIRLQ 324
Query: 313 RNTG-NSLGICGINMLASYPTKTG------QNPPPSPPPGPTRCSLLTYCAAGETCCCGS 365
RN S G+CGI M ASYP K G PPSP PT C C TCCC
Sbjct: 325 RNIEVASTGMCGIAMEASYPVKKGANPPNPGPSPPSPIKPPTVCDNYYSCPESNTCCCMY 384
Query: 366 SILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
G C +W CC SA CC DH CCP+ YP+CD CL S K F VK
Sbjct: 385 DFGGYCYAWGCCPLDSATCCDDHYSCCPNEYPVCDLDGGTCLKSS-KDPFGVK 436
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/398 (52%), Positives = 261/398 (65%), Gaps = 15/398 (3%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+++ L++ W QH ++Y++ E +QRL+IF DN F+ QHN N G SF L L FAD
Sbjct: 42 EVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFAD 101
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPG----NLRDVPASIDWRKKGAVTEVKDQA 136
LT++E+++++LG A RRRN++V S + D+P SIDWR KGAV +VKDQ
Sbjct: 102 LTNEEYRSTYLGVRTAG--SRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQG 159
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
SCG+CWAFS A+EGIN IVTG L+SLSEQEL+DCD YN GC GGLMDYA++F+I N
Sbjct: 160 SCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNG 219
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
GIDT++DYPY G+ G C++ + N H+VTID Y+DVP N+EK L +AV QPVSV I
Sbjct: 220 GIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGG 279
Query: 257 RAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
RAFQLY SGIFTG C T LDH V +GY SENG YWI+KNSWG WG +GY+ M+RN
Sbjct: 280 RAFQLYESGIFTGYCGTELDHGVTAIGYGSENGKYYWIVKNSWGSDWGESGYIRMERNIN 339
Query: 317 NSLGICGINMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGI 370
++ G CGI M ASYP K GQN PPSP PT C C TCCC
Sbjct: 340 SATGKCGIAMEASYPIKNGQNPPNPGPSPPSPSKPPTVCDSYYSCPESMTCCCVYEFGSY 399
Query: 371 CLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408
C +W CC A CC DH CCP +YPIC+ CL
Sbjct: 400 CFAWGCCPLEGATCCEDHYSCCPHDYPICNVQEGTCLV 437
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 213/432 (49%), Positives = 274/432 (63%), Gaps = 30/432 (6%)
Query: 2 NSLAFFLLSIL-LLSSLPLNYC---------------SDINELFETWCKQHGKAYSSEQE 45
+S+A FL +L L S+L ++ D+ ++E W +HGK+Y++ E
Sbjct: 8 SSMAVFLFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGE 67
Query: 46 KQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRN 105
K++R +IF+DN F+ +HN N ++ + LN FADLT++E+++ +LG A+ RR +
Sbjct: 68 KERRFQIFKDNLRFIDEHN-AENRTYKVGLNRFADLTNEEYRSMYLGTRTAA---KRRSS 123
Query: 106 ASVQSPGNLR---DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLV 162
+ R +P S+DWRKKGAV EVKDQ SCG+CWAFS A+EGINKIVTG L+
Sbjct: 124 NKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLI 183
Query: 163 SLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHI 222
SLSEQEL+DCD SYN GC GGLMDYA++F+I N GID+E+DYPY+ G+C++ + N +
Sbjct: 184 SLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKV 243
Query: 223 VTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIV 282
VTIDGY+DVPEN+EK L +AV QPVSV I R FQLY SGIFTG C T+LDH V V
Sbjct: 244 VTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAV 303
Query: 283 GYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS-LGICGINMLASYPTKTGQ----- 336
GY +ENGVDYWI+KNSWG SWG GY+ M+R+ S G CGI M ASYP K GQ
Sbjct: 304 GYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKKGQNPPNP 363
Query: 337 -NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSN 395
PPSP PT C C TCCC C W CC +A CC DH CCP
Sbjct: 364 GPSPPSPIKPPTVCDNYYACPESSTCCCIFEYAKYCFQWGCCPLEAATCCEDHDSCCPQE 423
Query: 396 YPICDSVRHQCL 407
YP+C+ C+
Sbjct: 424 YPVCNVRAGTCM 435
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 207/435 (47%), Positives = 281/435 (64%), Gaps = 34/435 (7%)
Query: 1 MNSLAFFLLSILLLSSL-------------------PLNYCSDINELFETWCKQHGKAYS 41
M +L+FF L I ++S++ PL ++N L+E+W +HGK Y+
Sbjct: 6 MATLSFFAL-ISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYN 64
Query: 42 SEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHD 101
+ EK +R +IF+DN F+ +HN+ G+ ++ L LN FADLT++E++ ++ G +ID D
Sbjct: 65 ALGEKDRRFQIFKDNLRFIDEHNS-GDHTYKLGLNKFADLTNEEYRMTYTGIK--TID-D 120
Query: 102 RRRNASVQSPG----NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIV 157
+++ + ++S + +P +DWR++GAVT+VKDQ SCG+CWAFS TG++EG+NKIV
Sbjct: 121 KKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIV 180
Query: 158 TGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQK 217
TG L+S+SEQEL++CD SYN GC GGLMDYA++F+IKN GIDTE+DYPY G+ G+C+K K
Sbjct: 181 TGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNK 240
Query: 218 LNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDH 277
N +VTID Y+DVP N+E L +AV QPV+V I R FQ Y+SGIFTG C T+LDH
Sbjct: 241 KNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDH 300
Query: 278 AVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
VL GY +E+G DYW++KNSWG WG GY+ M+RN + G CGI M ASYP K G N
Sbjct: 301 GVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPIKNGDN 360
Query: 338 ------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYC 391
PPSP C + C TCCC G C +W CC A CC DH C
Sbjct: 361 PPNPGPTPPSPAAPEVVCDEYSTCPESTTCCCIYEYYGYCFAWGCCPLEGASCCDDHYSC 420
Query: 392 CPSNYPICDSVRHQC 406
CP +YPIC+ R C
Sbjct: 421 CPHDYPICNVRRGTC 435
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 213/434 (49%), Positives = 272/434 (62%), Gaps = 32/434 (7%)
Query: 2 NSLAFFLLSILLLSSLPLNYCS------------------DINELFETWCKQHGKAYSSE 43
+S+A FL +L L+S S D+ ++E W +HGK+Y++
Sbjct: 8 SSMAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNAL 67
Query: 44 QEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR 103
EK++R +IF+DN F+ +HN N ++ + LN FADLT++E+++ +LG A+ RR
Sbjct: 68 GEKERRFQIFKDNLRFIDEHN-AENRTYKVGLNRFADLTNEEYRSMYLGTRTAA---KRR 123
Query: 104 RNASVQSPGNLR---DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGS 160
+ + R +P S+DWRKKGAV EVKDQ SCG+CWAFS A+EGINKIVTG
Sbjct: 124 SSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGG 183
Query: 161 LVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N GID+E+DYPY+ G+C++ + N
Sbjct: 184 LISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNA 243
Query: 221 HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVL 280
+VTIDGY+DVPEN+EK L +AV QPVSV I R FQLY SGIFTG C T+LDH V
Sbjct: 244 XVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVT 303
Query: 281 IVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS-LGICGINMLASYPTKTGQ--- 336
VGY +ENGVDYWI+KNSWG SWG GY+ M+R+ S G CGI M ASYP K GQ
Sbjct: 304 AVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKKGQNPP 363
Query: 337 ---NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCP 393
PPSP PT C C TCCC C W CC +A CC DH CCP
Sbjct: 364 NPGPSPPSPIKPPTVCDNYYACPESSTCCCIFEYAKYCFQWGCCPLEAATCCEDHDSCCP 423
Query: 394 SNYPICDSVRHQCL 407
YP+C+ C+
Sbjct: 424 QEYPVCNVRAGTCM 437
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 208/405 (51%), Positives = 263/405 (64%), Gaps = 13/405 (3%)
Query: 14 LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTL 73
L+S PL + L+E+W +H K Y++ EK+ R IF+DN FV +HN+M N S+ L
Sbjct: 45 LNSPPLRTHDQLLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKL 104
Query: 74 SLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD----VPASIDWRKKGAV 129
LN FADLT+ E+++ +L S + +R+ +S + + +P S+DWR +GAV
Sbjct: 105 GLNKFADLTNDEYRSLYL--SGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAV 162
Query: 130 TEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 189
VKDQ CG+CWAFS GA+EGINKIVTG L+SLSEQEL+DCD YN GC GGLMDYA+
Sbjct: 163 APVKDQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAF 222
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVS 249
+F++KN GIDTE DYPY+G G C++ + N +VTI+GY+DVP N+EK L +AV QPVS
Sbjct: 223 EFIVKNGGIDTEDDYPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVS 282
Query: 250 VGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYM 309
V I RAFQLY SG+FTG C T LDH V+ VGY SENG DYWI++NSWG WG +GY+
Sbjct: 283 VAIEAGGRAFQLYESGVFTGQCGTELDHGVVAVGYGSENGKDYWIVRNSWGPDWGESGYI 342
Query: 310 HMQRNTGN-SLGICGINMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAGETCC 362
++RN + S G CGI M ASYPTKTG N PPSP T C C TCC
Sbjct: 343 RLERNVASTSTGKCGIAMQASYPTKTGDNPPKPGPSPPSPVKPQTVCDDYYSCPESTTCC 402
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
C I C W CC +SA CC DH CCP +P+CD CL
Sbjct: 403 CLYEIGQYCFGWGCCPLASATCCDDHYSCCPQEFPVCDLDAGTCL 447
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 212/430 (49%), Positives = 278/430 (64%), Gaps = 26/430 (6%)
Query: 3 SLAFFLLSILLLSSLPLNYCS--------------DINELFETWCKQHGKAYSSEQEKQQ 48
++ FL I++ S++ ++ S +++ L+E W +HGKA +S EK +
Sbjct: 2 TVILFLAMIVVSSAMDMSIISYDKNHHTVSSRSDVEVSRLYEEWVVKHGKAQNSLTEKDR 61
Query: 49 RLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASV 108
R +IF+DN F+ +HN N S+ L L FADLT+ E+++ +LG S + S+
Sbjct: 62 RFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLG----SRLKRKATKTSL 116
Query: 109 QSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
+ + D +P S+DWRK+GAV EVKDQ SCG+CWAFS GA+EGINKIVTG L+SLSEQ
Sbjct: 117 RYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQ 176
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
EL+DCD SYN GC GGLMDYA++F+IKN GIDTE+DYPY+G G+C++ + N +VTID
Sbjct: 177 ELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDS 236
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE 287
Y+DVP N+E+ L +A+ QP+SV I G RAFQLY SGIF G C T LDH V+ VGY +E
Sbjct: 237 YEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGTE 296
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQ------NPPPS 341
NG DYWI+KNSWG SWG +GY+ M+RN +S G CGI + SYP K GQ PPS
Sbjct: 297 NGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIKNGQNPPNPGPSPPS 356
Query: 342 PPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDS 401
P PT+C C TCCC CL+W CC +A CC D+ CCP YP+CD
Sbjct: 357 PVTPPTQCDSYYTCPESNTCCCLFDYGKYCLAWGCCPLEAATCCDDNYSCCPHEYPVCDL 416
Query: 402 VRHQCLTVSL 411
+ CL VS
Sbjct: 417 DQGTCLMVSF 426
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 197/390 (50%), Positives = 265/390 (67%), Gaps = 7/390 (1%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
+++ L+E+W +HGK+Y++ EK +R +IF+DN ++ + N++ N S+ L L FADLT+
Sbjct: 44 EVSALYESWLIEHGKSYNALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTN 103
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACW 142
+E+++ +LG ++ +N S + + D +P SIDWR+KG + VKDQ SCG+CW
Sbjct: 104 EEYRSIYLGTKSSGDRKKLSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCW 163
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFSA A+E IN IVTG+L+SLSEQEL+DCDRSYN GC GGLMDYA++FVIKN GIDTE+
Sbjct: 164 AFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEE 223
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY+ + G C++ + N +V ID Y+DVP NNEK L +AV QPVS+ + R FQ Y
Sbjct: 224 DYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHY 283
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SGIFTG C T++DH V+I GY +ENG+DYWI++NSWG +WG NGY+ +QRN +S G+C
Sbjct: 284 KSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLC 343
Query: 323 GINMLASYPTKTG------QNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKC 376
G+ + SYP KTG PPSP PT C + CA G TCCC C SW C
Sbjct: 344 GLAIEPSYPVKTGPNPPKPAPSPPSPVKPPTECDEYSQCAVGTTCCCILQFRRSCFSWGC 403
Query: 377 CGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
C A CC DH CCP +YPIC+ + C
Sbjct: 404 CPLEGATCCEDHYSCCPHDYPICNVRQGTC 433
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 199/390 (51%), Positives = 262/390 (67%), Gaps = 9/390 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E+W +HGK+Y++ EK++R +IF+DN F+ +HN N S+ + LN FADLT+
Sbjct: 45 EVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTN 104
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
+E+++++LG A S + + +P +P S+DWR KGAV +KDQ SCG+CWA
Sbjct: 105 EEYRSTYLG--AKSKPKLSKVKSDRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWA 162
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS A+EGIN+IVTG L++LSEQEL+DCD+SYN GC GGLMDY ++F+I N GIDT+KD
Sbjct: 163 FSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKD 222
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G+ +C++ + N +VTID Y+DVP NNE+ L +AV +QPVSVGI G RAFQ Y
Sbjct: 223 YPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYD 282
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN-TGNSLGIC 322
SGIFTG C T+LDH V +VGY +E G DYWI++NSWG SWG GY+ M+RN G S+G C
Sbjct: 283 SGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKC 342
Query: 323 GINMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKC 376
GI M SYP K GQN PP+P PT C C TCCC G C SW C
Sbjct: 343 GIAMEPSYPLKNGQNPPNPGPSPPTPVRPPTVCDDYYTCPESSTCCCVYEYYGYCFSWGC 402
Query: 377 CGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
C A CC DH CCP +YP+C+ C
Sbjct: 403 CPLDGATCCDDHYSCCPHDYPVCNVQAGTC 432
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 200/399 (50%), Positives = 258/399 (64%), Gaps = 14/399 (3%)
Query: 15 SSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLS 74
+ PL S + ++E W +HGKAY++ EK++R +IF+DN F+ +HN++ + S+ +
Sbjct: 37 TKYPLRTDSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV-DRSYKVG 95
Query: 75 LNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKD 134
LN FADLT++E+KA FLG + + + D+P ++DWR+KGAV VKD
Sbjct: 96 LNRFADLTNEEYKAMFLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKD 155
Query: 135 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIK 194
Q CG+CWAFS GA+EGIN+IVTG L+SLSEQEL+DCD+SYN GC GGLMDYA++F+I
Sbjct: 156 QGQCGSCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIIN 215
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
N GIDTE+DYPY+ C+ + N +VTIDGY+DVPEN+E L +AV QPVSV I
Sbjct: 216 NGGIDTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEA 275
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 314
RAFQLY SG+FTG C T LDH V+ VGY +ENGV+YWI++NSWG +WG +GY+ M+RN
Sbjct: 276 GGRAFQLYKSGVFTGRCGTELDHGVVAVGYGTENGVNYWIVRNSWGSAWGESGYIRMERN 335
Query: 315 TGNS-LGICGINMLASYPTKTG------------QNPPPSPPPGPTRCSLLTYCAAGETC 361
N+ G CGI + SYPTK G PP P T C C G TC
Sbjct: 336 VANTKTGKCGIAIQPSYPTKKGANPPNPGPSPPSPVNPPPPVSPSTVCDDYFSCPDGNTC 395
Query: 362 CCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICD 400
CC G C W CC SA CC DH CCP YP+CD
Sbjct: 396 CCIYEYSGYCFGWGCCPLESATCCDDHNSCCPHEYPVCD 434
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/430 (48%), Positives = 278/430 (64%), Gaps = 26/430 (6%)
Query: 3 SLAFFLLSILLLSSLPLNYCS--------------DINELFETWCKQHGKAYSSEQEKQQ 48
++ FL I++ S++ ++ S +++ L+E W +HGKA +S EK +
Sbjct: 2 TVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDR 61
Query: 49 RLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASV 108
R +IF+DN F+ +HN N S+ L L FADLT+ E+++ +LG S + +S+
Sbjct: 62 RFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLG----SRLKRKATKSSL 116
Query: 109 QSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
+ + D +P S+DWRK+GAV EVKDQ SCG+CWAFS GA+EGINKIVTG L++LSEQ
Sbjct: 117 RYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQ 176
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
EL+DCD SYN GC GGLMDYA++F+I N GIDTE+DYPY+G G+C++ + N +VTID
Sbjct: 177 ELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDL 236
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE 287
Y+DVP N+E+ L +A+ QP+SV I G RAFQLY SGIF G C T LDH V+ VGY +E
Sbjct: 237 YEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGTE 296
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQ------NPPPS 341
NG DYWI+KNSWG SWG +GY+ M+RN +S G CGI + SYP K GQ PPS
Sbjct: 297 NGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIKNGQNPPNPGPSPPS 356
Query: 342 PPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDS 401
P PT+C C TCCC CL+W CC +A CC D+ CCP YP+CD
Sbjct: 357 PVKPPTQCDSYYTCPESNTCCCLFDYGKYCLAWGCCPLEAATCCDDNYSCCPHEYPVCDL 416
Query: 402 VRHQCLTVSL 411
+ CL VS
Sbjct: 417 DQGTCLMVSF 426
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 198/395 (50%), Positives = 262/395 (66%), Gaps = 11/395 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
++ ++ W +HG Y++ E+++R + F DN ++ QHN + G SF L LN FAD
Sbjct: 38 EVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFAD 97
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E+++++LG + D +R+ +A Q+ N ++P S+DWRKKGAV VKDQ CG+
Sbjct: 98 LTNEEYRSTYLG-ARTKPDRERKLSARYQAADN-DELPESVDWRKKGAVGAVKDQGGCGS 155
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGIN+IVTG ++ LSEQEL+DCD SYN GC GGLMDYA++F+I N GID+
Sbjct: 156 CWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDS 215
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY+ + +C+ K N +VTIDGY+DVP N+EK L +AV QP+SV I RAFQ
Sbjct: 216 EEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 275
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LY SGIFTG C T+LDH V VGY +ENG DYW+++NSWG WG +GY+ M+RN S G
Sbjct: 276 LYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGSVWGEDGYIRMERNIKASSG 335
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYPTKTG+NPP P P+ C C A TCCC C +W
Sbjct: 336 KCGIAVEPSYPTKTGENPPNPGPTPPSPAPPSSVCDSYNECPASTTCCCIYEYGKECFAW 395
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
CC A CC DH CCP NYPIC++ + CL
Sbjct: 396 GCCPLEGATCCDDHYSCCPHNYPICNTKQGTCLAA 430
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 410 bits (1054), Expect = e-112, Method: Compositional matrix adjust.
Identities = 195/395 (49%), Positives = 263/395 (66%), Gaps = 11/395 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
++ ++ W +H + Y++ E+++R ++F DN ++ QHN + G SF L LN FAD
Sbjct: 36 EVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHSFRLGLNRFAD 95
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E+++++LG + D +R+ +A Q+ N ++P ++DWRKKGAV +KDQ CG+
Sbjct: 96 LTNEEYRSTYLG-ARTKPDRERKLSARYQADDN-EELPETVDWRKKGAVAAIKDQGGCGS 153
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGIN+IVTG ++ LSEQEL+DCD SYN GC GGLMDYA++F+I N GID+
Sbjct: 154 CWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDS 213
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY+ + +C+ K N +VTIDGY+DVP N+EK L +AV QP+SV I RAFQ
Sbjct: 214 EEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 273
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LY SGIFTG C T+LDH V VGY +ENG DYW+++NSWG WG +GY+ M+RN S G
Sbjct: 274 LYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGTVWGEDGYIRMERNIKASSG 333
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYPTKTG+NPP P P+ C C A TCCC C +W
Sbjct: 334 KCGIAVEPSYPTKTGENPPNPGPTPPSPAPPSSVCDSYNECPASTTCCCIYEYGKECFAW 393
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
CC A CC DH CCP NYPIC++ + CL
Sbjct: 394 GCCPLEGATCCDDHYSCCPHNYPICNTQQGTCLAA 428
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 410 bits (1053), Expect = e-112, Method: Compositional matrix adjust.
Identities = 197/390 (50%), Positives = 259/390 (66%), Gaps = 7/390 (1%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E W +HGKAY+S EK++R ++F+DN F+ +HN+ N ++ + LN FADLT+
Sbjct: 37 EVMAIYEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSE-NRTYRVGLNRFADLTN 95
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
+E+++ +LG + + R+ + +P +P S+DWRK+GAV VKDQ SCG+CWA
Sbjct: 96 EEYRSMYLGALSGIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWA 155
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FSA A+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDY ++F+I N GID+E+D
Sbjct: 156 FSAVAAVEGINKIVTGDLISLSEQELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEED 215
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY + G+C+ + N +V+ID Y+DVP NNE L +AV QPVSV I R FQLYS
Sbjct: 216 YPYLARDGRCDTYRKNARVVSIDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYS 275
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
SG+F+G C T+LDH V+ VGY +ENG DYWI++NSWG+SWG +GY+ M RN GICG
Sbjct: 276 SGVFSGRCGTALDHGVVAVGYGTENGQDYWIVRNSWGKSWGESGYLRMARNIRKPTGICG 335
Query: 324 INMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSWKCC 377
I M ASYP K GQNPP P P+ C C TCCC C W CC
Sbjct: 336 IAMEASYPIKKGQNPPNPGPSPPSPVKPPSVCDNYFSCPESNTCCCIFEYANFCFEWGCC 395
Query: 378 GFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
A CC DH CCP +YPIC+ + CL
Sbjct: 396 PLEGATCCDDHYSCCPHDYPICNVNQGTCL 425
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 409 bits (1052), Expect = e-112, Method: Compositional matrix adjust.
Identities = 199/392 (50%), Positives = 257/392 (65%), Gaps = 18/392 (4%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
S ++E W +HGKAY++ EK++R KIF+DN F+ +HN G+ S+ L LN FADLT
Sbjct: 42 SHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLT 101
Query: 83 HQEFKASFLGF-------SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
++E++A FLG AA + R A ++PA +DWR+KGAVT +KDQ
Sbjct: 102 NEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAG----EELPAMVDWREKGAVTPIKDQ 157
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
CG+CWAFS GA+EGIN+IVTG+L SLSEQEL+DCDR YN GC GGLMDYA++F+++N
Sbjct: 158 GQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQN 217
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
GIDTE+DYPY + C+ + N +VTIDGY+DVP N+EK L++AV QPVSV I
Sbjct: 218 GGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAG 277
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
FQLY SG+FTG C T+LDH V+ VGY +ENG DYW+++NSWG +WG NGY+ ++RN
Sbjct: 278 GMEFQLYQSGVFTGRCGTNLDHGVVAVGYGTENGTDYWLVRNSWGSAWGENGYIKLERNV 337
Query: 316 GNS-LGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSIL 368
N+ G CGI + ASYP K G NPP P P+ C C +G TCCC
Sbjct: 338 QNTETGKCGIAIEASYPIKNGANPPNPGPSPPSPATPSIVCDEYYSCNSGTTCCCLFEYR 397
Query: 369 GICLSWKCCGFSSAVCCSDHRYCCPSNYPICD 400
G C W CC SA CC D CCP ++P CD
Sbjct: 398 GFCFGWGCCPIESATCCPDQTSCCPPDFPFCD 429
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 220/421 (52%), Positives = 266/421 (63%), Gaps = 25/421 (5%)
Query: 10 SILLLSSLPLNYCSD--INELFETWCKQHGKAY--------SSEQEKQQRLKIFEDNYAF 59
SIL L P + S+ + LF++W QHGK+Y S EK R IF+DN F
Sbjct: 36 SILDLGYDPQDLSSEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRF 95
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLG----FSAASIDHDRRRNASVQSPGNLR 115
+ N N + L LNAFADLT++EF+A G S H+ R SVQ L+
Sbjct: 96 IHGENEK-NQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSHEEFRYGSVQ----LK 150
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
D+P SIDWR+KGAV VKDQ SCG+CWAFSA AIEG+NK+ TG LVSLSEQEL+DCD+
Sbjct: 151 DLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKG 210
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
+ GC GGLMDYA+ FVIKN G+DTE DYPY+G +C++ K+N +VTIDGY+DVP N+
Sbjct: 211 EDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVND 270
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E LL+AV QPVSV I + Q Y SGIFTG C T LDH V VGY E+G YWII
Sbjct: 271 ETALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWII 330
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN------PPPSPPPGPTRC 349
KNSWG +WG GY+ M RNTG + G+CGINM ASYPTKTG N PPSP P P C
Sbjct: 331 KNSWGSNWGEKGYVKMARNTGLAAGLCGINMEASYPTKTGANPPNPGPTPPSPAPPPNEC 390
Query: 350 SLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
C TCCC + C +W CC SA CC DH +CCPS++PIC+ + CL
Sbjct: 391 DDYYTCPESSTCCCLFNYGKYCFAWGCCPLQSATCCEDHYHCCPSDFPICNLQANTCLRS 450
Query: 410 S 410
S
Sbjct: 451 S 451
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 210/433 (48%), Positives = 278/433 (64%), Gaps = 26/433 (6%)
Query: 3 SLAFFLLSILLLSSLPLNYCS--------------DINELFETWCKQHGKAYSSEQEKQQ 48
++ FL I++ S++ ++ S +++ L+E W +HGKA +S EK +
Sbjct: 8 TVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDR 67
Query: 49 RLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASV 108
R +IF+DN F+ +HN N S+ L L FADLT+ E+++ +LG S + +S+
Sbjct: 68 RFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLG----SRLKRKATKSSL 122
Query: 109 QSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
+ + D +P S+DWRK+GAV EVKDQ SCG+CWAFS GA+EGINKIVTG L++LSEQ
Sbjct: 123 RYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQ 182
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
EL+DCD SYN GC GGLMDYA++F+I N GIDTE+DYPY+G G+C++ + N +VTID
Sbjct: 183 ELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDL 242
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE 287
Y+DVP N+E+ L +A+ QP+SV I G RAFQLY SGIF G C T LDH V+ VGY +E
Sbjct: 243 YEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGTE 302
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQ------NPPPS 341
NG DYWI+KNSWG SWG +GY+ M+RN +S G CGI + SYP K GQ PPS
Sbjct: 303 NGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIKNGQNPPNPGPSPPS 362
Query: 342 PPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDS 401
P PT+C C TCCC CL+W CC +A CC D+ CCP YP+CD
Sbjct: 363 PVKPPTQCDSYYTCPESNTCCCLFDYGKYCLAWGCCPLEAATCCDDNYSCCPHEYPVCDL 422
Query: 402 VRHQCLTVSLKFS 414
+ CL FS
Sbjct: 423 DQGTCLIGKFCFS 435
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 204/392 (52%), Positives = 255/392 (65%), Gaps = 17/392 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQ 84
L+E W +HG+AY++ EK++R +IF+DN F+ HN + G+ SF L LN FAD+T++
Sbjct: 49 LYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFADMTNE 108
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSP----GNLRDVPASIDWRKKGAVTEVKDQASCGA 140
E++A +LG A RR A V S D+P S+DWR KGAV VKDQ SCG+
Sbjct: 109 EYRAVYLGTRPAG----HRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVKDQGSCGS 164
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGINKIVTG L+SLSEQEL+DCD YN GC GGLMDY ++F+I N GIDT
Sbjct: 165 CWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFIINNGGIDT 224
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY + G+C++ + N +V+IDGY+DVP N+EK L +AV QPVSV I R FQ
Sbjct: 225 EEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQ 284
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LY SGIFTG C T LDH V+ VGY +ENG DYWI++NSWG WG +GY+ M+RN S G
Sbjct: 285 LYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSWGGDWGESGYIRMERNVNTSTG 344
Query: 321 ICGINMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYPTK GQN PPSP PT C C + TCCC C +W
Sbjct: 345 KCGIAIEPSYPTKKGQNPPKPAPSPPSPVSPPTVCDNYYSCPSSTTCCCVYEYGRYCFAW 404
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
CC A CC DH CCP +YP+C+ C
Sbjct: 405 GCCPLEGATCCEDHYSCCPHDYPVCNVKAGTC 436
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 197/390 (50%), Positives = 257/390 (65%), Gaps = 7/390 (1%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E W + GK Y++ E+++R ++F+DN F+ +HN+ N ++ L LN FADLT+
Sbjct: 47 EVMAIYEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNSE-NRTYKLGLNGFADLTN 105
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
+E+++++LG + R+ + +P +P S+DWRK+GAV EVKDQ SCG+CWA
Sbjct: 106 EEYRSTYLGARGGMKRNRLRKTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWA 165
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS A+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N GIDTE+D
Sbjct: 166 FSTIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEED 225
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY + G+C+ + N +VTID Y+DVP N+E L +AV QPVSV I R FQ Y+
Sbjct: 226 YPYLARDGRCDTYRKNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYA 285
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
SGIF+G C T LDH V VGY +ENG DYWI++NSWG+SWG NGY+ M R+ + GICG
Sbjct: 286 SGIFSGRCGTQLDHGVAAVGYGTENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICG 345
Query: 324 INMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCC 377
I M ASYP K GQN PPSP PT C C TCCC C W CC
Sbjct: 346 IAMEASYPIKKGQNPPNPAPLPPSPVTPPTVCDNYYSCPDNNTCCCLFEYGNFCFEWGCC 405
Query: 378 GFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
A CC DH CCP +YPIC+ + CL
Sbjct: 406 PLEGATCCEDHYSCCPHDYPICNINQGTCL 435
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 219/422 (51%), Positives = 266/422 (63%), Gaps = 25/422 (5%)
Query: 9 LSILLLSSLPLNYCSD--INELFETWCKQHGKAY--------SSEQEKQQRLKIFEDNYA 58
SIL L P + S+ + LF++W QHGK+Y S EK R IF+DN
Sbjct: 35 FSILDLGYDPQDLSSEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLR 94
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLG----FSAASIDHDRRRNASVQSPGNL 114
F+ N N + L LNAFADLT++EF+A G S ++ R SVQ L
Sbjct: 95 FIHGENEK-NQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSYEEFRYGSVQ----L 149
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
+D+P SIDWR+KGAV VKDQ SCG+CWAFSA AIEG+NK+ TG LVSLSEQEL+DCD+
Sbjct: 150 KDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDK 209
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
+ GC GGLMDYA+ FVIKN G+DTE DYPY+G +C++ K+N +VTIDGY+DVP N
Sbjct: 210 GEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVN 269
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
+E LL+AV QPVSV I + Q Y SGIFTG C T LDH V VGY E+G YWI
Sbjct: 270 DETALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWI 329
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN------PPPSPPPGPTR 348
IKNSWG +WG GY+ M RNTG + G+CGINM ASYPTKTG N PPSP P P
Sbjct: 330 IKNSWGSNWGEKGYIKMARNTGLAAGLCGINMEASYPTKTGANPPNPGPTPPSPVPPPNE 389
Query: 349 CSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408
C C TCCC + C +W CC SA CC DH +CCPS++PIC+ + CL
Sbjct: 390 CDDYYTCPESSTCCCLFNYGKYCFAWGCCPLQSATCCDDHYHCCPSDFPICNLKANTCLR 449
Query: 409 VS 410
S
Sbjct: 450 SS 451
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 206/415 (49%), Positives = 269/415 (64%), Gaps = 14/415 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS---------DINELFETWCKQHGKAYSSEQEKQQRLKIF 53
+LA + S+LL+S L L + + ++E W ++ K Y+ EK++R +IF
Sbjct: 9 TLALLIFSVLLIS-LSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIF 67
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGN 113
+DN FV +H+++ N ++ + L FADLT+ EF+A +L + + G+
Sbjct: 68 KDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGEKYLYKVGD 127
Query: 114 LRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD 173
+P +IDWR KGAV VKDQ SCG+CWAFSA GA+EGIN+I TG L+SLSEQEL+DCD
Sbjct: 128 --SLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185
Query: 174 RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG-QAGQCNKQKLNRHIVTIDGYKDVP 232
SYN GCGGGLMDYA++F+I+N GIDTE+DYPY CN K N +VTIDGY+DVP
Sbjct: 186 TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVP 245
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDY 292
+N+EK L +A+ QP+SV I RAFQLY+SG+FTG C TSLDH V+ VGY SE G DY
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGSEGGQDY 305
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK-TGQNPPPSPPPGPTRCSL 351
WI++NSWG +WG +GY ++RN S G CG+ M+ASYPTK +G NPP P P P C
Sbjct: 306 WIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTKSSGSNPPKPPAPSPVVCDK 365
Query: 352 LTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
C A TCCC G C SW CC + SA CC D CCP +YP+CD + C
Sbjct: 366 SNTCPAKSTCCCLYEYNGKCYSWGCCPYESATCCDDGSSCCPQSYPVCDLKANTC 420
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 200/412 (48%), Positives = 265/412 (64%), Gaps = 15/412 (3%)
Query: 23 SDINELFETWCKQHGKA-YSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADL 81
SD++ + +WC + GK SS +R + F++N+ ++ +HN G S+ L LN F+DL
Sbjct: 7 SDLSGEYASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDL 66
Query: 82 THQEFKASFLGFSAASIDH---DRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASC 138
T +EF+ FLG ID R++ ++ D+PAS+DWRK GAVT KDQ SC
Sbjct: 67 TSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPKDQGSC 126
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G CWAF+ TGAIEGIN+IVTG L+SLSEQELIDCD+ + GC GGLM+ AYQF+++N G+
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGL 186
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
DTE DYPY CN +KLN +V IDGY+ +P+ +E+ LL+AV QPVSV I G+ +
Sbjct: 187 DTETDYPYHASESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGASKD 246
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
FQ Y+SG+FTG C ++H VLIVGY +E+G+DYWI+KNSW +WG G++ MQRNTG
Sbjct: 247 FQHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKR 306
Query: 319 LGICGINMLASYPTKTGQN----------PPPSPPPGPTRCSLLTYCAAGETCCCGSSIL 368
G+C IN LASYP K+G N P P P +C C +G TCCC I
Sbjct: 307 GGLCSINTLASYPVKSGGNPPQPEPRPPSPEPPSPAPEQQCDKFNKCPSGTTCCCRFPIG 366
Query: 369 GICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV-SLKFSFTVKY 419
CL W CCG SAVCC DH++CCP +YP+C CL V ++ F F +
Sbjct: 367 PKCLLWGCCGVESAVCCPDHQHCCPHDYPVCHPKDGLCLKVLAMLFLFLFSW 418
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 204/395 (51%), Positives = 261/395 (66%), Gaps = 16/395 (4%)
Query: 24 DINELFETWCKQHGKAYSS-EQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
++ L+E+W +HGK+Y+ EK +R +IF+DN ++ + N+ G+ S+ L LN FADLT
Sbjct: 44 EVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLT 103
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQS-----PGNLRDVPASIDWRKKGAVTEVKDQAS 137
++E+++++LG + RRR A +S P +P SIDWR+KGAV EVKDQ S
Sbjct: 104 NEEYRSTYLGAKTDA----RRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQGS 159
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+IKN G
Sbjct: 160 CGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGG 219
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
IDTE DYPY G+ G+C++ + N +V+IDGY+DV +E L +AV QPVSV I R
Sbjct: 220 IDTEADYPYTGRYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGGR 279
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQLYSSGIFTG C T LDH V VGY +ENGVDYWI+KNSW SWG GY+ MQRN +
Sbjct: 280 DFQLYSSGIFTGSCGTDLDHGVTAVGYGTENGVDYWIVKNSWAASWGEKGYLRMQRNVKD 339
Query: 318 SLGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGIC 371
G+CGI + SYPTKTG+NPP P P+ C C TCCC C
Sbjct: 340 KNGLCGIAIEPSYPTKTGENPPNPGPSPPSPVSPPNMCDDYDECPTSTTCCCVFPYGEHC 399
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
+W C SAVCC DH CCP +YP+C + C
Sbjct: 400 FAWGCSPLESAVCCEDHYSCCPHDYPVCHVSQGTC 434
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 202/403 (50%), Positives = 263/403 (65%), Gaps = 13/403 (3%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E W +HGK Y++ EK++R +IF+DN F+ QHN+ N ++T+ LN FADLT+
Sbjct: 46 EVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSE-NRTYTVGLNRFADLTN 104
Query: 84 QEFKASFLGFSAASIDHDRR--RNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+EF++ +LG H +R + + +P +P S+DWRK+GAV EVKDQ CG+C
Sbjct: 105 EEFRSMYLGTRTG---HKKRLPKTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSC 161
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGINKIVTG L++LSEQEL+DCD SYN GC GGLMDYA++F+I N GIDTE
Sbjct: 162 WAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTE 221
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DYPY G+ G+C+ + N +V+ID Y+DVPEN+E L +AV QPVSV I G R FQL
Sbjct: 222 DDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQL 281
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
Y+SG+FTG C TSLDH V VGY +E G DYWI++NSWG+SWG +GY+ M+RN + G
Sbjct: 282 YNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGK 341
Query: 322 CGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSWK 375
CGI + SYP K GQNPP P P+ C C TCCC C +W
Sbjct: 342 CGIAIEPSYPIKKGQNPPNPGPSPPSPVKPPSVCDNYFSCPDSSTCCCIFEYGKYCFAWG 401
Query: 376 CCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
CC A CC DH CCP YP+C+ CL +S F VK
Sbjct: 402 CCPLEGATCCDDHYSCCPHEYPVCNVNEGTCL-ISKGNPFGVK 443
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 202/403 (50%), Positives = 263/403 (65%), Gaps = 13/403 (3%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E W +HGK Y++ EK++R +IF+DN F+ QHN+ N ++T+ LN FADLT+
Sbjct: 37 EVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSE-NRTYTVGLNRFADLTN 95
Query: 84 QEFKASFLGFSAASIDHDRR--RNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+EF++ +LG H +R + + +P +P S+DWRK+GAV EVKDQ CG+C
Sbjct: 96 EEFRSMYLGTRTG---HKKRLPKTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGINKIVTG L++LSEQEL+DCD SYN GC GGLMDYA++F+I N GIDTE
Sbjct: 153 WAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DYPY G+ G+C+ + N +V+ID Y+DVPEN+E L +AV QPVSV I G R FQL
Sbjct: 213 DDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQL 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
Y+SG+FTG C TSLDH V VGY +E G DYWI++NSWG+SWG +GY+ M+RN + G
Sbjct: 273 YNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGK 332
Query: 322 CGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSWK 375
CGI + SYP K GQNPP P P+ C C TCCC C +W
Sbjct: 333 CGIAIEPSYPIKKGQNPPNPGPSPPSPVKPPSVCDNYFSCPDSSTCCCIFEYGKYCFAWG 392
Query: 376 CCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
CC A CC DH CCP YP+C+ CL +S F VK
Sbjct: 393 CCPLEGATCCDDHYSCCPHEYPVCNVNEGTCL-ISKGNPFGVK 434
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 198/387 (51%), Positives = 258/387 (66%), Gaps = 12/387 (3%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
++ L+E+W HGKAY++ EK++R +IF+DN F+ +HN + ++ + L FADLT
Sbjct: 56 EEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRE-SRTYKVGLTRFADLT 114
Query: 83 HQEFKASFLG--FSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
++E++A FLG FS + + G+ D+P +DWRKKGAV VKDQ CG+
Sbjct: 115 NEEYRARFLGGRFSRKPRLSAAKSGRYAAALGD--DLPDDVDWRKKGAVATVKDQGQCGS 172
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS+ A+EGIN+IVTG L+ LSEQEL+DCD+S+N GC GGLMDYA+QF+I N GIDT
Sbjct: 173 CWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDT 232
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY+G+ C+ + N +VTIDGY+DVPEN+E L +AV QPVSV I RAFQ
Sbjct: 233 EEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQ 292
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN-SL 319
LY SG+FTG C T LDH V+ VGY ++NG DYWI++NSWG+ WG +GY+ ++RN N +
Sbjct: 293 LYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANITT 352
Query: 320 GICGINMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYPTK+G N PPSP PT C C G TCCC C +
Sbjct: 353 GKCGIAVQPSYPTKSGANPPKPSASPPSPVKPPTECDEYFSCEEGSTCCCIYQFGSTCFA 412
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICD 400
W CC SA CC DH CCP YP+CD
Sbjct: 413 WGCCPLESATCCDDHYSCCPHEYPVCD 439
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 200/402 (49%), Positives = 258/402 (64%), Gaps = 14/402 (3%)
Query: 23 SDINELFETWCKQHGKA-YSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADL 81
SD++ + +WC + GK SS R + F++N+ ++ +HN G S+ L LN F+DL
Sbjct: 7 SDLSGEYASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDL 66
Query: 82 THQEFKASFLGFSAASIDH---DRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASC 138
T +EF+ FLG ID R++ ++ D+PAS+DWR+ GAVT KDQ SC
Sbjct: 67 TSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPKDQGSC 126
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G CWAF+ TGAIEGIN+IVTG LVSLSEQELIDCD+ + GC GGLM+ AYQF+++N G+
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGL 186
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
DTE DYPY CN +KLN +V IDGYK +PE +E+ LL AV QPVSV I G+ +
Sbjct: 187 DTETDYPYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKD 246
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
FQ Y+SG+FTG C ++H VLIVGY +E+G+DYWI+KNSW +WG G++ MQRNTG
Sbjct: 247 FQHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKR 306
Query: 319 LGICGINMLASYPTKTGQN----------PPPSPPPGPTRCSLLTYCAAGETCCCGSSIL 368
G+C IN LASYP K+G N P P P +C C +G TCCC I
Sbjct: 307 GGLCSINTLASYPVKSGGNPPQPEPRPPSPEPPSPAPEQQCDKFNKCPSGTTCCCRFPIG 366
Query: 369 GICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVS 410
CL W CCG SAVCC DH++CCP +YP+C CL S
Sbjct: 367 PKCLLWGCCGVESAVCCPDHQHCCPHDYPVCHPKDGLCLKSS 408
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 199/393 (50%), Positives = 253/393 (64%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ ++ W HG+ Y++ E+++R ++F DN +V HN + G SF L LN FAD
Sbjct: 41 EARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFAD 100
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT+ E++A++LG S RR G+ D+P S+DWR KGAV EVKDQ SCG+
Sbjct: 101 LTNDEYRATYLGVR--SRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQGSCGS 158
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGIN+IVTG ++SLSEQEL+DCD SYN GC GGLMDYA++F+I N GIDT
Sbjct: 159 CWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDT 218
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY+G G+C+ + N +VTID Y+DVP N+EK L +AV QP+SV I RAFQ
Sbjct: 219 EEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQ 278
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LY+SGIFTG C T+LDH V VGY +ENG DYWI+KNSWG SWG +GY+ M+RN S G
Sbjct: 279 LYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 338
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G NPP P P+ C C TCCC C +W
Sbjct: 339 KCGIAVEPSYPLKKGANPPNPGPTPPSPTPPPTVCDNYYSCPDSTTCCCIYEYGKYCFAW 398
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP +YP+C+ + CL
Sbjct: 399 GCCPLEGATCCDDHYSCCPHDYPVCNVKQGTCL 431
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 198/393 (50%), Positives = 253/393 (64%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ ++ W HG+ Y++ E+++R ++F DN +V HN + G SF L LN FAD
Sbjct: 41 EARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFAD 100
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT+ E++A++LG S RR G+ D+P S+DWR KGAV E+KDQ SCG+
Sbjct: 101 LTNDEYRATYLGVR--SRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEIKDQGSCGS 158
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGIN+IVTG ++SLSEQEL+DCD SYN GC GGLMDYA++F+I N GIDT
Sbjct: 159 CWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDT 218
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY+G G+C+ + N +VTID Y+DVP N+EK L +AV QP+SV I RAFQ
Sbjct: 219 EEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQ 278
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LY+SGIFTG C T+LDH V VGY +ENG DYWI+KNSWG SWG +GY+ M+RN S G
Sbjct: 279 LYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 338
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G NPP P P+ C C TCCC C +W
Sbjct: 339 KCGIAVEPSYPLKKGANPPNPGPTPPSPTPPPTVCDNYYSCPDSTTCCCIYEYGKYCFAW 398
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP +YP+C+ + CL
Sbjct: 399 GCCPLEGATCCDDHYSCCPHDYPVCNVKQGTCL 431
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 197/395 (49%), Positives = 260/395 (65%), Gaps = 11/395 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
++ ++ W +H Y+ E+++R + F +N ++ QHN + G SF L LN FAD
Sbjct: 37 EVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRFAD 96
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E+++++LG + D +R+ +A Q+ N ++P S+DWRKKGAV VKDQ CG+
Sbjct: 97 LTNEEYRSTYLG-ARTKPDRERKLSARYQAADN-DELPESVDWRKKGAVGAVKDQGGCGS 154
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGIN+IVTG ++ LSEQEL+DCD SYN GC GGLMDYA++F+I N GID+
Sbjct: 155 CWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDS 214
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY+ + +C+ K N +VTIDGY+DVP N+EK L +AV QP+SV I RAFQ
Sbjct: 215 EEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 274
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LY SGIFTG C T+LDH V VGY +ENG DYW+++NSWG WG NGY+ M+RN S G
Sbjct: 275 LYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGSVWGENGYIRMERNIKASSG 334
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYPTKTG+NPP P P+ C C A TCCC C +W
Sbjct: 335 KCGIAVEPSYPTKTGENPPNPGPTPPSPAPTSSVCYSHNECPASTTCCCIYEYGKECFAW 394
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
CC A CC DH CCP NYPIC++ + CL
Sbjct: 395 GCCPLEGATCCDDHYSCCPHNYPICNTKQGTCLAA 429
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/405 (50%), Positives = 265/405 (65%), Gaps = 14/405 (3%)
Query: 23 SDINELFETWCKQHGKAYSSEQ--EKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80
+++ ++E W +HGKA S EK +R +IF+DN FV +HN N S+ L L FAD
Sbjct: 44 AEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK-NLSYRLGLTRFAD 102
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCG 139
LT+ E+++ +LG A ++ R S++ + D +P SIDWRKKGAV EVKDQ CG
Sbjct: 103 LTNDEYRSKYLG---AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCG 159
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS GA+EGIN+IVTG L++LSEQEL+DCD SYN GC GGLMDYA++F+IKN GID
Sbjct: 160 SCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGID 219
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
T+KDYPY+G G C++ + N +VTID Y+DVP +E+ L +AV QP+S+ I RAF
Sbjct: 220 TDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAF 279
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SGIF G C T LDH V+ VGY +ENG DYWI++NSWG+SWG +GY+ M RN +S
Sbjct: 280 QLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSS 339
Query: 320 GICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYP K G+ PPSP PT+C C TCCC C +
Sbjct: 340 GKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFA 399
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
W CC +A CC D+ CCP YP+CD + CL +S F+VK
Sbjct: 400 WGCCPLEAATCCDDNYSCCPHEYPVCDLDQGTCL-LSKNSPFSVK 443
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/405 (50%), Positives = 265/405 (65%), Gaps = 14/405 (3%)
Query: 23 SDINELFETWCKQHGKAYSSEQ--EKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80
+++ ++E W +HGKA S EK +R +IF+DN FV +HN N S+ L L FAD
Sbjct: 44 AEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK-NLSYRLGLTRFAD 102
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCG 139
LT+ E+++ +LG A ++ R S++ + D +P SIDWRKKGAV EVKDQ CG
Sbjct: 103 LTNDEYRSKYLG---AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCG 159
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS GA+EGIN+IVTG L++LSEQEL+DCD SYN GC GGLMDYA++F+IKN GID
Sbjct: 160 SCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGID 219
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
T+KDYPY+G G C++ + N +VTID Y+DVP +E+ L +AV QP+S+ I RAF
Sbjct: 220 TDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAF 279
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SGIF G C T LDH V+ VGY +ENG DYWI++NSWG+SWG +GY+ M RN +S
Sbjct: 280 QLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSS 339
Query: 320 GICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYP K G+ PPSP PT+C C TCCC C +
Sbjct: 340 GKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFA 399
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
W CC +A CC D+ CCP YP+CD + CL +S F+VK
Sbjct: 400 WGCCPLEAATCCDDNYSCCPHEYPVCDLDQGTCL-LSKNSPFSVK 443
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 197/389 (50%), Positives = 254/389 (65%), Gaps = 13/389 (3%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ E+FE+W +HGK+Y++ EK +R KIF DN ++ + N++ N S+ L LN FAD+T+
Sbjct: 45 EVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITN 104
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
+E++ +LG + + + + +P +P SIDWR+KGAVT VKDQ SCG+CWA
Sbjct: 105 EEYRTGYLGAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWA 164
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS A+EG+N++ TG+L+SLSEQEL+DCDR N GC GG M YA+QF+IKN GID+E+D
Sbjct: 165 FSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKNGGIDSEED 224
Query: 204 YPYRGQAGQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
YPY G+ G+C+ + N + +IDGY++VP NNEK L +AV QPVSV I FQLY
Sbjct: 225 YPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDFQLY 284
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SSGIFTG C T LDH V VGY +ENGVDYWI+KNSWG WG GY+ MQRN G+C
Sbjct: 285 SSGIFTGSCGTDLDHGVAAVGYGTENGVDYWIVKNSWGDYWGEKGYVRMQRNVKAKTGLC 344
Query: 323 GINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGETCCCGSSILGI 370
GI M ASYPTK G + PP PP P C C A TCCC
Sbjct: 345 GIAMEASYPTKKGGDNPPPSPPSPPSPTPTPPSPSPSVCDKFNACPASTTCCCVFPFGNY 404
Query: 371 CLSWKCCGFSSAVCCSDHRYCCPSNYPIC 399
C +W CC SAVCC DH CCP +YP+C
Sbjct: 405 CFAWGCCPLDSAVCCDDHYSCCPHDYPVC 433
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 197/401 (49%), Positives = 260/401 (64%), Gaps = 20/401 (4%)
Query: 17 LPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76
+P ++ ++E W +HG+AY++ EK++R +IF+DN F+ +HN++GN S+ L LN
Sbjct: 13 VPERTEAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLN 72
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL----RDVPASIDWRKKGAVTEV 132
FADL++ E+++ +LG +D R +S L D+P ++DWR+KGAV V
Sbjct: 73 KFADLSNDEYRSVYLG---TRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPV 129
Query: 133 KDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFV 192
KDQ CG+CWAFS GA+EGIN+IVTG+L SLSEQEL+DCD++YN GC GGLMDYA+ F+
Sbjct: 130 KDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFI 189
Query: 193 IKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGI 252
I+N GIDTE+DYPY+ C+ + N +VTIDGY+DVP+N+EK L +AV QPVSV I
Sbjct: 190 IENGGIDTEEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAI 249
Query: 253 CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
R FQLY SG+FTG C T LDH V+ VGY +E+GVDYWI++NSWG +WG NGY+ M+
Sbjct: 250 EAGGRGFQLYQSGVFTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGPAWGENGYIRME 309
Query: 313 RNTGNS-LGICGINMLASYPTKTG------------QNPPPSPPPGPTRCSLLTYCAAGE 359
R+ ++ G CGI M ASYPTK PP P + C C AG
Sbjct: 310 RDVASTETGKCGIAMEASYPTKKSANPPNPGPSPPSPVNPPPPEKPSSECDDYYSCPAGS 369
Query: 360 TCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICD 400
TCCC C W CC SA CC DH CCP YP+CD
Sbjct: 370 TCCCIYQYGDYCFGWGCCPLESATCCDDHNSCCPHEYPVCD 410
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 205/395 (51%), Positives = 264/395 (66%), Gaps = 14/395 (3%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
++N L+E W +HGK Y++ EK +R +IF+DN F+ Q N N ++ L LN FADLT
Sbjct: 34 EEVNSLYEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQQN-AENRTYKLGLNRFADLT 92
Query: 83 HQEFKASFLGFSAASIDHDRR--RNASVQ-SPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
++E++A +LG ID +RR R S + +P +P S+DWRK+GAV VKDQASCG
Sbjct: 93 NEEYRARYLG---TKIDPNRRLGRTPSNRYAPRVGETLPDSVDWRKEGAVVPVKDQASCG 149
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFSA GA+EGINKIVTG L+SLSEQEL+DCD YN GC GGLMDYA++F+IKN GID
Sbjct: 150 SCWAFSAIGAVEGINKIVTGDLISLSEQELVDCDTGYNMGCNGGLMDYAFEFIIKNGGID 209
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
+E+DYPY+G G+C++ + N +V+IDGY+DV +E L +AV QPVSV + G R F
Sbjct: 210 SEEDYPYKGVDGRCDEYRKNAKVVSIDGYEDVNTYDELALKKAVANQPVSVAVEGGGREF 269
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLYSSG+FTG C T+LDH V+ VGY ++NG D+WI++NSWG WG GY+ ++RN GNS
Sbjct: 270 QLYSSGVFTGRCGTALDHGVVAVGYGTDNGHDFWIVRNSWGADWGEEGYIRLERNLGNSR 329
Query: 320 -GICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICL 372
G CGI + SYP KTGQ PPSP P C C+ TCCC C
Sbjct: 330 SGKCGIAIEPSYPIKTGQNPPNPGPSPPSPVKPPNVCDNYYSCSDSATCCCIFEFGKTCF 389
Query: 373 SWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
W CC A CC DH CCP +YPIC++ CL
Sbjct: 390 EWGCCPLEGATCCDDHYSCCPHDYPICNTYAGTCL 424
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/393 (51%), Positives = 255/393 (64%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ ++ W HG+ Y++ E+++R ++F DN ++ HN + G SF L LN FAD
Sbjct: 36 EARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFAD 95
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT+ E++A++LG + +R+ A + N D+P S+DWR KGAV EVKDQ SCG+
Sbjct: 96 LTNDEYRATYLG-ARTRPQRERKLGARYHAADN-EDLPESVDWRAKGAVAEVKDQGSCGS 153
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N GIDT
Sbjct: 154 CWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDT 213
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
EKDYPY+G G+C+ + N +VTID Y+DVP N+EK L +AV QPVSV I + AFQ
Sbjct: 214 EKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQ 273
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYSSGIFTG C T+LDH V VGY +ENG DYWI+KNSWG SWG +GY+ M+RN S G
Sbjct: 274 LYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 333
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G NPP P P+ C C TCCC C +W
Sbjct: 334 KCGIAVEPSYPLKEGANPPNPGPSPPSPTPAPAVCDNYYSCPDSTTCCCIYEYGKYCFAW 393
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP +YPIC+ + CL
Sbjct: 394 GCCPLEGATCCDDHYSCCPHDYPICNVRQGTCL 426
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/393 (51%), Positives = 255/393 (64%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ ++ W HG+ Y++ E+++R ++F DN ++ HN + G SF L LN FAD
Sbjct: 41 EARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFAD 100
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT+ E++A++LG + +R+ A + N D+P S+DWR KGAV EVKDQ SCG+
Sbjct: 101 LTNDEYRATYLG-ARTRPQRERKLGARYHAADN-EDLPESVDWRAKGAVAEVKDQGSCGS 158
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N GIDT
Sbjct: 159 CWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDT 218
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
EKDYPY+G G+C+ + N +VTID Y+DVP N+EK L +AV QPVSV I + AFQ
Sbjct: 219 EKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQ 278
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYSSGIFTG C T+LDH V VGY +ENG DYWI+KNSWG SWG +GY+ M+RN S G
Sbjct: 279 LYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 338
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G NPP P P+ C C TCCC C +W
Sbjct: 339 KCGIAVEPSYPLKEGANPPNPGPSPPSPTPAPAVCDNYYSCPDSTTCCCIYEYGKYCFAW 398
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP +YPIC+ + CL
Sbjct: 399 GCCPLEGATCCDDHYSCCPHDYPICNVRQGTCL 431
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 205/405 (50%), Positives = 264/405 (65%), Gaps = 14/405 (3%)
Query: 23 SDINELFETWCKQHGKAYSSEQ--EKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80
+++ ++E W +HGKA + EK +R +IF+DN F+ HN N S+ L L FAD
Sbjct: 37 AEVMSIYEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFIDDHNKK-NLSYRLGLTRFAD 95
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCG 139
LT+ E+++ +LG A ++ R S + + D +P SIDWRKKGAV EVKDQ SCG
Sbjct: 96 LTNDEYRSKYLG---AKMEKKGERRTSQRYEARVGDELPESIDWRKKGAVAEVKDQGSCG 152
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS GA+EGIN+IVTG L++LSEQEL+DCD SYN GC GGLMDYA++F+IKN GID
Sbjct: 153 SCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGID 212
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
T+KDYPY+G G C++ + N +VTID Y+DVP +E+ L +AV QPVSV I RAF
Sbjct: 213 TDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAF 272
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SGIF G C T LDH V+ VGY +ENG DYWI++NSWG+SWG +GY+ M RN +S
Sbjct: 273 QLYDSGIFDGTCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLKMARNIASSS 332
Query: 320 GICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYP K G+ PPSP PT+C C TCCC C +
Sbjct: 333 GKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFA 392
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
W CC +A CC D+ CCP YP+CD + CL +S F+VK
Sbjct: 393 WGCCPLEAATCCDDNYSCCPHEYPVCDLDQGTCL-LSKNSPFSVK 436
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 198/410 (48%), Positives = 267/410 (65%), Gaps = 15/410 (3%)
Query: 9 LSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM 66
+SI+ +++ SD ++ L+E+W +HGK+Y++ EK +R +IF+DN ++ + N++
Sbjct: 27 MSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNALGEKDKRFQIFKDNLKYIDEQNSV 86
Query: 67 GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV----PASID 122
N S+ L L FADLT++E+++ +LG ++ DRR+ + +S L V P S+D
Sbjct: 87 PNQSYKLGLTKFADLTNEEYRSIYLGTKSSG---DRRKLSKNKSDRYLPKVGDSLPESVD 143
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182
WR KG + VKDQ SCG+CWAFSA A+E IN IVTG+L+SLSEQEL+DCD+SYN GC G
Sbjct: 144 WRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEGCDG 203
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242
GLMDYA++FVI N GIDTE+DYPY+ + C++ + N +V ID Y+DVP NNEK L +A
Sbjct: 204 GLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEKALQKA 263
Query: 243 VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRS 302
V QPVS+ I R Q Y SGIFTG C T++DH V+ GY SENG+DYWI++NSWG
Sbjct: 264 VAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAAGYGSENGMDYWIVRNSWGAK 323
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCA 356
WG GY+ +QRN +S G+CG+ SYP KTG N PPSP PT C + C
Sbjct: 324 WGEKGYLRVQRNVASSSGLCGLATEPSYPVKTGANPPKPAPSPPSPVKPPTECDEYSQCP 383
Query: 357 AGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
G TCCC C SW CC A CC DH CCP +YP+C+ + C
Sbjct: 384 VGTTCCCVLEFRRSCFSWGCCPLEGATCCEDHSSCCPHDYPVCNVRQGTC 433
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 196/393 (49%), Positives = 258/393 (65%), Gaps = 10/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQ---EKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80
++ ++E W ++GKA+S+ EK++R ++F+DN F+ +HN+ N S+ + LN FAD
Sbjct: 46 EVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSE-NRSYKVGLNRFAD 104
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E+++ +LG + + + R+++ P +P S+DWRK+GAV EVKDQ SCG+
Sbjct: 105 LTNEEYRSMYLGARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGS 164
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGINKIVTG L+SLSEQEL+DCDRSYN GC GGLMDYA+QF+I N GID+
Sbjct: 165 CWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCNGGLMDYAFQFIINNGGIDS 224
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY + G C+ + N +VTID Y+DVP N+EK L +AV QPVSV I R FQ
Sbjct: 225 EEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQ 284
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y SGIFTG C T+LDH V VGY +ENG DYWI++NSWG+SWG +GY+ M+RN + G
Sbjct: 285 FYQSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYIRMERNIATATG 344
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K GQNPP P P+ C C TCCC C W
Sbjct: 345 KCGIAIEPSYPIKKGQNPPNPGPSPPSPIKPPSVCDSYFSCPESTTCCCIFEYAKYCFEW 404
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP +YP+C+ CL
Sbjct: 405 GCCPLEGATCCDDHYSCCPHDYPVCNINEGTCL 437
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/396 (50%), Positives = 252/396 (63%), Gaps = 16/396 (4%)
Query: 23 SDINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAF 78
+++ ++E W +HGK ++ EK QR +IF+DN ++ +HN N S+ L L F
Sbjct: 44 AEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTK-NLSYKLGLTRF 102
Query: 79 ADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQAS 137
ADLT+ E+++ +LG R S + + D +P S+DWRK+GAV +VKDQ S
Sbjct: 103 ADLTNDEYRSMYLGAKPVK----RVLKTSDRYEARVGDALPDSVDWRKEGAVADVKDQGS 158
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS GA+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+IKN G
Sbjct: 159 CGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGG 218
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
IDTE DYPY+ G+C++ + N +VTID Y+DVPEN+E L +A+ QP+SV I R
Sbjct: 219 IDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGR 278
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
AFQLYSSG+F G C T LDH V+ VGY +ENG DYWI++NSWG WG +GY+ M RN
Sbjct: 279 AFQLYSSGVFDGICGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIAE 338
Query: 318 SLGICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371
G CGI M ASYP K GQ PPSP PT C C TCCC C
Sbjct: 339 PTGKCGIAMEASYPIKKGQNPPNPGPSPPSPIKPPTTCDKYFSCPESNTCCCLYKYGKYC 398
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
W CC SA CC DH CCP YP+CD R CL
Sbjct: 399 FGWGCCPLESATCCDDHSSCCPHEYPVCDINRGTCL 434
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/390 (51%), Positives = 258/390 (66%), Gaps = 15/390 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++E W HGKAY++ EK++R +IF+DN FV +HN + S+ + LN FADLT++E++
Sbjct: 46 IYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVA-GSYRVGLNRFADLTNEEYR 104
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNL----RDVPASIDWRKKGAVTEVKDQASCGACWA 143
+ FLG + + R+AS +S +P S+DWR+KGAV+ VKDQ CG+CWA
Sbjct: 105 SMFLGGNMEM----KERSASTKSDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWA 160
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS A+EGIN+IVTG L+SLSEQEL+DCD+SYN GC GGLMDY +QF+I N GIDTE+D
Sbjct: 161 FSTISAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMDYGFQFIINNGGIDTEED 220
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPYR G C++ + N +V+I+GY+DVPE++E L +AV QPVSV I RAFQLY
Sbjct: 221 YPYRAVDGTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYE 280
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
SG+FTG C T+LDH V+ VGY +ENGVDYW ++NSWG WG NGY+ ++RN + G CG
Sbjct: 281 SGVFTGHCGTNLDHGVVAVGYGTENGVDYWTVRNSWGPKWGENGYIKLERNINATSGKCG 340
Query: 324 INMLASYPTKT------GQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCC 377
I +ASYPTKT PP+P PT C C G TCCC C+ W CC
Sbjct: 341 IASMASYPTKTGSNPPNPGPSPPTPVNPPTVCDDYYSCPEGSTCCCVYQYGDFCIGWGCC 400
Query: 378 GFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
SA CC DH CCP YPICD CL
Sbjct: 401 PLESATCCDDHSSCCPHEYPICDLDGGTCL 430
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 202/396 (51%), Positives = 259/396 (65%), Gaps = 19/396 (4%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E+W +HGK+Y++ EK++R +IF+DN F+ +HN + ++ + LN FADLT+
Sbjct: 41 EVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHN-AESRTYKVGLNRFADLTN 99
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQS------PGNLRDVPASIDWRKKGAVTEVKDQAS 137
E+++ +LG S RR S Q P +P S+DWR+KGAV VKDQ S
Sbjct: 100 DEYRSMYLGARTGS-----RRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGVKDQGS 154
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+IKN G
Sbjct: 155 CGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGG 214
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
IDTE+DYPY + G+C++ + N +VTID Y+DVP NNE+ L +AV QPVSV I S
Sbjct: 215 IDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEASGM 274
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
AFQ Y SG+FTG C T+LDH V VGY +EN VDYWI+KNSWG SWG +GY+ M+RNTG
Sbjct: 275 AFQFYESGVFTGNCGTALDHGVTAVGYGTENSVDYWIVKNSWGSSWGESGYIRMERNTG- 333
Query: 318 SLGICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371
+ G CGI + SYP KT Q PPSP PT C C TCCC C
Sbjct: 334 ATGKCGIAVEPSYPIKTSQNPPNPGPSPPSPIKPPTVCDDYYTCPESSTCCCVYEYGKYC 393
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
+W CC A CC DH CCP +YPIC+ CL
Sbjct: 394 FAWGCCPLEGATCCDDHYSCCPHDYPICNVYAGTCL 429
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 205/407 (50%), Positives = 259/407 (63%), Gaps = 17/407 (4%)
Query: 23 SDINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAF 78
S++ ++E W +HGK ++ EK QR +IF+DN F+ +HN N S+ L L F
Sbjct: 44 SEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-NLSYKLGLTRF 102
Query: 79 ADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQAS 137
ADLT++E+++ +LG R S + + D +P S+DWRK+GAV +VKDQ S
Sbjct: 103 ADLTNEEYRSMYLGAKPTK----RVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGS 158
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS GA+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+IKN G
Sbjct: 159 CGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGG 218
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
IDTE DYPY+ G+C++ + N +VTID Y+DVPEN+E L +A+ QP+SV I R
Sbjct: 219 IDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGR 278
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
AFQLYSSG+F G C T LDH V+ VGY +ENG DYWI++NSWG WG +GY+ M RN
Sbjct: 279 AFQLYSSGVFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIEA 338
Query: 318 SLGICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371
G CGI M ASYP K GQ PPSP PT C C TCCC C
Sbjct: 339 PTGKCGIAMEASYPIKKGQNPPNPGPSPPSPIKPPTTCDKYFSCPESNTCCCLYKYGKYC 398
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
W CC +A CC D+ CCP YP+CD R CL +S F+VK
Sbjct: 399 FGWGCCPLEAATCCDDNSSCCPHEYPVCDVNRGTCL-MSKNSPFSVK 444
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 204/406 (50%), Positives = 260/406 (64%), Gaps = 17/406 (4%)
Query: 23 SDINELFETWCKQHGKAYSSE----QEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAF 78
+++ ++E W ++HGK S +EK QR +IF+DN F+ +HNN N S+ L L F
Sbjct: 43 AEVARIYEAWMEKHGKKAQSNGLVGEEKDQRFEIFKDNLRFIDEHNNK-NLSYKLGLTRF 101
Query: 79 ADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASC 138
ADLT++E+++ +LG A + + P +P S+DWRK+GAV VKDQ SC
Sbjct: 102 ADLTNEEYRSIYLG---AKSKKRVLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSC 158
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFS GA+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+IKN GI
Sbjct: 159 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGI 218
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
DTE+DYPY+ G+C++ + N +VTID Y+DVPENNE L + + QP+SV I RA
Sbjct: 219 DTEEDYPYKAADGRCDQTRKNAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRA 278
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
FQLYSSG+F G C T LDH V+ VGY +ENG DYWI++NSWG SWG +GY+ M RN
Sbjct: 279 FQLYSSGVFDGICGTELDHGVVAVGYGTENGKDYWIVRNSWGGSWGESGYIKMARNIAEP 338
Query: 319 LGICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICL 372
G CGI M ASYP K GQ PPSP PT+C C TCCC C
Sbjct: 339 TGKCGIAMEASYPIKKGQNPPNPGPSPPSPIKPPTQCDKYYSCPESNTCCCLFKYGKYCF 398
Query: 373 SWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
W CC +A CC D+ CCP YP+C+ CL +S F+VK
Sbjct: 399 GWGCCPLEAATCCDDNTSCCPHEYPVCNG--DTCL-MSKNSPFSVK 441
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/399 (50%), Positives = 258/399 (64%), Gaps = 17/399 (4%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E W +HGK+Y+ EK +R +IF+DN F+ +HN + NS++ L L FADLT+
Sbjct: 50 EVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTN 108
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNL------RDVPASIDWRKKGAVTEVKDQAS 137
+E+++ FLG ID +RR S N +P S+DWRK+GAV VKDQAS
Sbjct: 109 EEYRSKFLG---TKIDPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQAS 165
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFSA A+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N G
Sbjct: 166 CGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGG 225
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
ID+E DYPY+ G+C++ + N +VTID Y+DVP +E L +AV QP++V + G R
Sbjct: 226 IDSEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGR 285
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQLY G+FTG C T+LDH V VGY +ENG DYWI++NSWG SWG GY+ ++RN +
Sbjct: 286 EFQLYEYGVFTGRCGTALDHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLAS 345
Query: 318 S-LGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGI 370
S G CGI + SYP K GQNPP P P+ C CA G TCCC
Sbjct: 346 SRAGKCGIAIEPSYPIKNGQNPPNPGPSPPSPIKPPSVCDSYYSCAEGSTCCCIYEYGRS 405
Query: 371 CLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
C W CC SA CC DH CCP YP+CD+ CL V
Sbjct: 406 CFEWGCCPLESATCCDDHYSCCPHEYPVCDTRAGLCLKV 444
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/407 (49%), Positives = 261/407 (64%), Gaps = 20/407 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN-MGNSSFTLSLNAFADLTHQEF 86
L+E W +HG+AY++ E+ +R ++F DN FV HN F L +N FADLT+ EF
Sbjct: 108 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEF 167
Query: 87 KASFLGFSAASIDHDRRRNASV----QSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
+A++LG A I RRR +V + G ++P S+DWR+KGAV VK+Q CG+CW
Sbjct: 168 RAAYLG---ARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 224
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA ++E +N+IVTG +V+LSEQEL++C NSGC GGLMD A+ F+IKN GIDTE
Sbjct: 225 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTE 284
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DYPY+ G+C+ + N +V+IDG++DVPEN+EK L +AV QPVSV I R FQL
Sbjct: 285 GDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQL 344
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
Y +G+FTG C+T+LDH V+ VGY +ENG DYWI++NSWG WG +GY+ M+RN + G
Sbjct: 345 YKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGK 404
Query: 322 CGINMLASYPTKTGQNPPPSPPPGPTR----------CSLLTYCAAGETCCCGSSILGIC 371
CGI M+ASYPTK G NPP P PT C CAAG TCCC +C
Sbjct: 405 CGIAMMASYPTKKGANPPKPSPTPPTPPPPPVAPDNVCDENFSCAAGSTCCCAFGFRNVC 464
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
L W CC A CC DH CCP YP+C+ VR +VS +VK
Sbjct: 465 LVWGCCPMEGATCCKDHASCCPPGYPVCN-VRAGTCSVSKNSPLSVK 510
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 204/394 (51%), Positives = 253/394 (64%), Gaps = 16/394 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F W +HGK YS+ +E+ R +++DN ++ +H+ N S+ L L FADLT++EF+
Sbjct: 45 FAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEK-NLSYWLGLTKFADLTNEEFRR 103
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLR----DVPASIDWRKKGAVTEVKDQASCGACWAF 144
+ G ID RR + G+ R + P SIDWR+KGAVT VKDQ SCG+CWAF
Sbjct: 104 QYTG---TRIDRSRRLKKGRNATGSFRYANSEAPKSIDWREKGAVTSVKDQGSCGSCWAF 160
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
SA G++EGIN I TG +SLS QEL+DCD+ YN GC GGLMDYA+ FVI+N GIDTEKDY
Sbjct: 161 SAVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQNGGIDTEKDY 220
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PY+G G+C+ K+N +VTID Y+DVPEN+E+ L +AV QPVSV I R FQLYS
Sbjct: 221 PYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSG 280
Query: 265 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN--TGNSLGIC 322
G+FTG C T LDH VL VGY SE G+DYWI+KNSWG WG +GY+ MQRN N G+C
Sbjct: 281 GVFTGRCGTDLDHGVLAVGYGSEKGLDYWIVKNSWGEYWGESGYLRMQRNLKDDNGYGLC 340
Query: 323 GINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSWKC 376
GIN+ SY KT NPP P P+ C C A TCCC + CL+W C
Sbjct: 341 GINIEPSYAVKTSPNPPNPGPTPPSPPPPEVICDKWRTCPAENTCCCTFPVGKSCLAWGC 400
Query: 377 CGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVS 410
C SA CC DH +CCP YPIC+ CL S
Sbjct: 401 CALDSATCCDDHYHCCPHEYPICNLDAGLCLKGS 434
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 200/397 (50%), Positives = 257/397 (64%), Gaps = 17/397 (4%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E W +HGK+Y+ EK +R +IF+DN F+ +HN + NS++ L L FADLT+
Sbjct: 50 EVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTN 108
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNL------RDVPASIDWRKKGAVTEVKDQAS 137
+E+++ FLG ID +RR S N +P S+DWRK+GAV VKDQAS
Sbjct: 109 EEYRSKFLG---TKIDPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQAS 165
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFSA A+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N G
Sbjct: 166 CGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGG 225
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
ID+E DYPY+ G+C++ + N +VTID Y+DVP +E L +AV QP++V + G R
Sbjct: 226 IDSEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGR 285
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQLY G+FTG C T+LDH V VGY +ENG DYWI++NSWG SWG GY+ ++RN +
Sbjct: 286 EFQLYEYGVFTGRCGTALDHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLAS 345
Query: 318 S-LGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGI 370
S G CGI + SYP K GQNPP P P+ C CA G TCCC
Sbjct: 346 SRAGKCGIAIEPSYPIKNGQNPPNPGPSPPSPIKPPSVCDSYYSCAEGSTCCCIYEYGRS 405
Query: 371 CLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
C W CC SA CC DH CCP YP+CD+ CL
Sbjct: 406 CFEWGCCPLESATCCDDHYSCCPHEYPVCDTRAGLCL 442
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/407 (49%), Positives = 261/407 (64%), Gaps = 20/407 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN-MGNSSFTLSLNAFADLTHQEF 86
L+E W +HG+AY++ E+ +R ++F DN FV HN F L +N FADLT+ EF
Sbjct: 51 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEF 110
Query: 87 KASFLGFSAASIDHDRRRNASV----QSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
+A++LG A I RRR +V + G ++P S+DWR+KGAV VK+Q CG+CW
Sbjct: 111 RAAYLG---ARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 167
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA ++E +N+IVTG +V+LSEQEL++C NSGC GGLMD A+ F+IKN GIDTE
Sbjct: 168 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTE 227
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DYPY+ G+C+ + N +V+IDG++DVPEN+EK L +AV QPVSV I R FQL
Sbjct: 228 GDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQL 287
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
Y +G+FTG C+T+LDH V+ VGY +ENG DYWI++NSWG WG +GY+ M+RN + G
Sbjct: 288 YKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGK 347
Query: 322 CGINMLASYPTKTGQNPPPSPPPGPTR----------CSLLTYCAAGETCCCGSSILGIC 371
CGI M+ASYPTK G NPP P PT C CAAG TCCC +C
Sbjct: 348 CGIAMMASYPTKKGANPPKPSPTPPTPPPPPVAPDNVCDENFSCAAGSTCCCAFGFRNVC 407
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
L W CC A CC DH CCP YP+C+ VR +VS +VK
Sbjct: 408 LVWGCCPMEGATCCKDHASCCPPGYPVCN-VRAGTCSVSKNSPLSVK 453
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 209/423 (49%), Positives = 271/423 (64%), Gaps = 20/423 (4%)
Query: 3 SLAFFLLSILLL---SSLPLNYCS--------DINELFETWCKQHGKAYSSEQEKQQRLK 51
S F L SI+ + S+L L+ +I L+ETW +HGK Y+ EKQ R
Sbjct: 6 STIFLLFSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEKQLRFN 65
Query: 52 IFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR-RNASVQS 110
IF+DN FV + N+ N SF L LN FADLT++E+++ +LG S+ R R+ S +
Sbjct: 66 IFKDNLRFVDERNSE-NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRSKSDRY 124
Query: 111 PGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
D +P S+DWRKKGAV +KDQ SCG+CWAFSA A+EG+N+IVTG L+SLSEQEL
Sbjct: 125 AFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISLSEQEL 184
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
++CD SYN GC GGLMDYA++F+IKN GID+++DYPY G+ G+C+ + N +VTID Y+
Sbjct: 185 VECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRKNAKVVTIDDYE 244
Query: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENG 289
D P +EK L +AV QPVSV I G R FQLY SG+FTG C T+LDH V +VGY +E+G
Sbjct: 245 DSPVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVGYGTEDG 304
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR- 348
+DYWI++NSWG +WG GY+ MQRNT GICGI + SYP K+G NPP P P+
Sbjct: 305 LDYWIVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPIKSGLNPPNPGPSPPSPV 364
Query: 349 -----CSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVR 403
C CA TCCC C SW CC +A CC D+ CCP +YP+C+
Sbjct: 365 QPPSVCDDNYSCAERTTCCCLFEYAHYCYSWGCCPLEAATCCEDNYSCCPHDYPVCNIYA 424
Query: 404 HQC 406
C
Sbjct: 425 GTC 427
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 200/407 (49%), Positives = 261/407 (64%), Gaps = 20/407 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN-MGNSSFTLSLNAFADLTHQEF 86
L+E W +HG+AY++ E+ +R ++F DN FV HN F L +N FADLT+ EF
Sbjct: 48 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEF 107
Query: 87 KASFLGFSAASIDHDRRRNASV----QSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
+A++LG A I RRR +V + G ++P S+DWR+KGAV VK+Q CG+CW
Sbjct: 108 RAAYLG---ARIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 164
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA ++E +N+IVTG +V+LSEQEL++C NSGC GGLMD A+ F+IKN GIDTE
Sbjct: 165 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTE 224
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DYPY+ G+C+ + N +V+IDG++DVPEN+EK L +AV QPVSV I R FQL
Sbjct: 225 GDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQL 284
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
Y +G+F+G C+T+LDH V+ VGY +ENG DYWI++NSWG WG +GY+ M+RN + G
Sbjct: 285 YKAGVFSGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGK 344
Query: 322 CGINMLASYPTKTGQNPPPSPPPGPTR----------CSLLTYCAAGETCCCGSSILGIC 371
CGI M+ASYPTK G NPP P PT C CAAG TCCC +C
Sbjct: 345 CGIAMMASYPTKKGANPPKPSPTPPTPPPPPVAPDNVCDENFSCAAGSTCCCAFGFRNVC 404
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
L W CC A CC DH CCP YP+C+ VR +VS +VK
Sbjct: 405 LVWGCCPMEGATCCKDHASCCPPGYPVCN-VRAGTCSVSKNSPLSVK 450
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 193/390 (49%), Positives = 258/390 (66%), Gaps = 17/390 (4%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+++ ++E W +HGK+Y++ E+++R +IF+DN F+ +HN + N ++ + LN FADLT
Sbjct: 48 AEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVGLNRFADLT 106
Query: 83 HQEFKASFLGFSAASIDHDRR--RNASVQSPGNLR---DVPASIDWRKKGAVTEVKDQAS 137
++E+++ +LG D RR R + V + R D+P S+DWR+KGAV VKDQ +
Sbjct: 107 NEEYRSRYLGRR----DETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGN 162
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS A+EGIN+I TG L+SLSEQEL+DCD+SYN GC GGLMDYA++F+I N G
Sbjct: 163 CGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGG 222
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
ID+E+DYPYR C+ + N +V+IDGY+DVP+N+E+ L +AV QPVSV I R
Sbjct: 223 IDSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGR 282
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN-TG 316
AFQLY SG+FTG C T LDH V+ VGY +EN VDYWI++NSWG +WG +GY+ ++RN G
Sbjct: 283 AFQLYQSGVFTGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAG 342
Query: 317 NSLGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGI 370
G CGI + SYP K GQNPP P P+ C C TCCC G
Sbjct: 343 TETGKCGIAIEPSYPIKNGQNPPNPGPSPPSPSKPSVVCDEYYTCPEESTCCCIYEYAGF 402
Query: 371 CLSWKCCGFSSAVCCSDHRYCCPSNYPICD 400
C W CC A CC DH CCP YP+CD
Sbjct: 403 CFEWGCCPLEGATCCDDHYSCCPHEYPVCD 432
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 204/377 (54%), Positives = 253/377 (67%), Gaps = 7/377 (1%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
++FE W ++ K Y+ EK +R +IF DN FV +HN++ N S+ L L FADLT++EF
Sbjct: 35 KMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADLTNEEF 94
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFS 145
+A +L + ++ R S + N+ D +P +DWR KGAV VKDQ SCG+CWAFS
Sbjct: 95 RAIYL---RSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSCWAFS 151
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A GA+EGIN+I TG LVSLSEQEL+DCD SYN+GCGGGLMDYA+QF+I N GIDTE+DYP
Sbjct: 152 AIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFIISNGGIDTEEDYP 211
Query: 206 YRGQAGQ-CNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
Y CN K N +VTIDGY+DVPE NE L +A+ QP+SV I R FQLY S
Sbjct: 212 YTATDDNICNTDKKNTRVVTIDGYEDVPE-NENSLKKALANQPISVAIEAGGRGFQLYKS 270
Query: 265 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
G+FTG C T+LDH V+ VGY + G DYWII+NSWG +WG +GY+ +QRN +S G CG+
Sbjct: 271 GVFTGTCGTALDHGVVAVGYGTSEGQDYWIIRNSWGSNWGESGYIKLQRNIKDSSGKCGV 330
Query: 325 NMLASYPTK-TGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAV 383
M+ASYPTK +G NPP PPP P C C A TCCC G C SW CC SA
Sbjct: 331 AMMASYPTKSSGSNPPKPPPPAPVVCDKSYTCPAKSTCCCLYEYKGKCYSWGCCPLESAT 390
Query: 384 CCSDHRYCCPSNYPICD 400
CC D CCP YP+CD
Sbjct: 391 CCEDGSSCCPQAYPVCD 407
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 193/395 (48%), Positives = 259/395 (65%), Gaps = 11/395 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
++ ++ W ++G+ Y++ E+++R ++F DN +V QHN + G SF L LN FA
Sbjct: 36 EEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHSFRLGLNRFA 95
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
DLT++E++ ++LG + +RR + Q+ N ++P S+DWR+KGAV +VKDQ CG
Sbjct: 96 DLTNEEYRDTYLGVRTKPV-RERRLSGRYQAADN-EELPESVDWREKGAVAKVKDQGGCG 153
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFSA A+EGIN+IVTG +++LSEQEL+DCD SYN GC GGLMDYA++F+I N GID
Sbjct: 154 SCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGID 213
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
+E+DYPY+ + +C+ K N +VTIDGY+DVP N+E L +AV QP+SV I RAF
Sbjct: 214 SEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPISVAIEAGGRAF 273
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SGIFTG C T+LDH V VGY SENG DYWI+KNSWG WG +GY+ ++RN +
Sbjct: 274 QLYKSGIFTGRCGTALDHGVTAVGYGSENGKDYWIVKNSWGTVWGEDGYVRLERNIKATS 333
Query: 320 GICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYP K G NPP P P+ C C A TCCC + C +
Sbjct: 334 GKCGIAIEPSYPLKKGANPPNPGPTPPSPAPPSTVCDSYNECPASTTCCCIYTYGKECFA 393
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408
W CC A CC DH CCP +YPIC+ + CL
Sbjct: 394 WGCCPLEGATCCDDHYSCCPHSYPICNVQQGTCLA 428
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 199/386 (51%), Positives = 248/386 (64%), Gaps = 11/386 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ ++ W HG+ Y++ +++R ++F DN ++ HN + G SF L LN FAD
Sbjct: 39 EARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFAD 98
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT+ E+ A++LG + DR+ A + N D+P S+DWR KGAV EVKDQ SCG
Sbjct: 99 LTNDEYPATYLG-ARTRPQRDRKLGARYHAADN-EDLPESVDWRAKGAVAEVKDQGSCGT 156
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N GIDT
Sbjct: 157 CWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDT 216
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
EKDYPY+G G+C+ + N +VTID Y+DVP N+EK L +AV QPVSV I + AFQ
Sbjct: 217 EKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQ 276
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYSSGIFTG C T LDH V VGY +ENG DYWI+KNSWG SWG +GY+ M+RN S G
Sbjct: 277 LYSSGIFTGSCGTRLDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 336
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G NPP P P+ C C TCCC C +W
Sbjct: 337 KCGIAVEPSYPLKEGANPPNPGPSPPSPTPAPAVCDNYYSCPDSTTCCCIYEYGKYCFAW 396
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICD 400
CC A CC DH CCP +YPIC+
Sbjct: 397 GCCPLEGATCCDDHYSCCPHDYPICN 422
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 207/414 (50%), Positives = 266/414 (64%), Gaps = 12/414 (2%)
Query: 3 SLAFFLLSILLLS-SLPLNYCSDINE-------LFETWCKQHGKAYSSEQEKQQRLKIFE 54
+LA + S+LL+S SL +D ++E W ++ K Y+ EK+ R +IF
Sbjct: 9 TLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFT 68
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL 114
DN ++ +HN++ N +F + L FADLT+ EF+A +L + + G+
Sbjct: 69 DNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGERYLYKVGDT 128
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
+P IDWR KGAV VKDQ +CG+CWAFSA GA+EGIN+I TG L+SLSEQEL+DCD
Sbjct: 129 --LPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDT 186
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQ-CNKQKLNRHIVTIDGYKDVPE 233
SYN GCGGGLMDYA++F+I+N GIDTE+DYPY CN K N +VTIDGY+DVP+
Sbjct: 187 SYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVPQ 246
Query: 234 NNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYW 293
N+EK L +A+ QP+SV I RAFQLY SG+FTG C TSLDH V+ VGY SE G DYW
Sbjct: 247 NDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYGSEGGQDYW 306
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK-TGQNPPPSPPPGPTRCSLL 352
I++NSWG +WG +GY ++RN S G CG+ M+ASYPTK +G NPP PPP P C
Sbjct: 307 IVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTKSSGSNPPKPPPPSPVVCDKS 366
Query: 353 TYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
C A TCCC G C SW CC + SA CC D CCP +YP+CD + C
Sbjct: 367 NTCPAKSTCCCLYEYNGKCYSWGCCPYESATCCDDGSSCCPQSYPVCDLKANTC 420
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/405 (49%), Positives = 253/405 (62%), Gaps = 13/405 (3%)
Query: 12 LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSF 71
LL + L ++E F W +HGK YSS +E R +++DN ++ +H+ N S+
Sbjct: 29 LLRMTTDLGNERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEK-NRSY 87
Query: 72 TLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTE 131
L L FAD+T+ EF+ + G ID +R + P S+DWRKKGAVT
Sbjct: 88 WLGLTKFADITNDEFRRQYTG---TRIDRSKRSKRKTGFRYADSEAPESVDWRKKGAVTT 144
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191
VKDQ SCG+CWAFSA G++EGIN I TG VSLSEQEL+DCD YN GC GGLMDYA+ F
Sbjct: 145 VKDQGSCGSCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDF 204
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251
+++N GIDTE DYPY+G G+C+ K N H+VTIDGY+DVPEN+E+ L +AV QPVSV
Sbjct: 205 ILENGGIDTENDYPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVA 264
Query: 252 ICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311
I R FQLYS G+FTG C T LDH VL VGY SE +DYWI+KNSWG WG +GY+ M
Sbjct: 265 IEAGGRDFQLYSGGVFTGECGTDLDHGVLAVGYGSEGSLDYWIVKNSWGEYWGESGYLRM 324
Query: 312 QRNTGNS---LGICGINMLASYPTK------TGQNPPPSPPPGPTRCSLLTYCAAGETCC 362
QRN +S G+CGIN+ SY K PPSP P C C + TCC
Sbjct: 325 QRNIKDSNHQFGLCGINIEPSYAVKTSPNPPNPGPTPPSPSPPEVVCDKWRTCPSENTCC 384
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
C + +CL+W CC SA CC DH +CCP +YP+C+ CL
Sbjct: 385 CTFPVGKMCLAWGCCSLDSATCCDDHYHCCPHDYPVCNLAAGLCL 429
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 199/393 (50%), Positives = 253/393 (64%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ ++ W HG+ Y++ E+++R ++F DN ++ HN + G SF L LN FAD
Sbjct: 39 EARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFAD 98
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT+ E++A++LG + +R+ A + N D+P S+DWR KGAV EVKDQ S G+
Sbjct: 99 LTNDEYRATYLG-ARTRPQRERKLGARYHAADN-EDLPESVDWRAKGAVAEVKDQGSYGS 156
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N GIDT
Sbjct: 157 CWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDT 216
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
EKDYPY+G G+C+ + N +VTID Y+DVP N+EK L +AV QPVSV I + FQ
Sbjct: 217 EKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQ 276
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYSSGIFTG C T+LDH V VGY +ENG DYWI+KNSWG SWG +GY+ M+RN S G
Sbjct: 277 LYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 336
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G NPP P P+ C C TCCC C +W
Sbjct: 337 KCGIAVEPSYPLKEGANPPNPGPSPPSPTPAPAVCDNYYSCPDSTTCCCIYEYGKYCFAW 396
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP +YPIC+ + CL
Sbjct: 397 GCCPLEGATCCDDHYSCCPHDYPICNVRQGTCL 429
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 198/396 (50%), Positives = 263/396 (66%), Gaps = 13/396 (3%)
Query: 24 DINELFETWCKQHGKAYSS--EQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADL 81
++ ++E W +HGK ++ EK +R +IF+DN F+ +HN N ++ + LN FADL
Sbjct: 48 EVKNIYEEWRVKHGKLNNNIDGSEKDKRFEIFKDNLKFIDEHN-AENRTYKVGLNRFADL 106
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQ---SPGNLRDVPASIDWRKKGAVTEVKDQASC 138
+++E+++ +LG I R + +P +P S+DWR +GAV +VKDQ SC
Sbjct: 107 SNEEYRSRYLGTKIDPIGMMMARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSC 166
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFS A+EGINKIVTG LVSLSEQEL+DCDR+ N+GC GGLM+YA++F+I N GI
Sbjct: 167 GSCWAFSTIAAVEGINKIVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGI 226
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
D+++DYPYRG G+C++ K N +V+ID Y+ VP +E L +AV QP+SV I R
Sbjct: 227 DSDEDYPYRGVDGKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGRE 286
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
FQLY SGIFTG C T+LDH V VGY +ENGVDYWI++NSWG+SWG +GY+ M+RN S
Sbjct: 287 FQLYVSGIFTGKCGTALDHGVTAVGYGTENGVDYWIVRNSWGKSWGESGYVRMERNLAAS 346
Query: 319 L-GICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371
+ G CGI M +SYP K GQ PPSP P CS CA+ TCCC I +C
Sbjct: 347 VAGKCGIVMQSSYPIKKGQNPPNPGPSPPSPVNPPNVCSRYHSCASSTTCCCVFGIGKLC 406
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
SW CC +AVCC DH CCP NYPIC++ + CL
Sbjct: 407 FSWGCCPLEAAVCCKDHSSCCPHNYPICNTRQGTCL 442
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 193/386 (50%), Positives = 254/386 (65%), Gaps = 10/386 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++ +W +HGK+Y++ EK+ R +IF+DN ++ HN + S+ L LN FADLT+
Sbjct: 44 EVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTN 103
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
+E++A +LG + + S + +P ++P SIDWR+KGAV VKDQ SCG+CW
Sbjct: 104 EEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELPDSIDWREKGAVAAVKDQGSCGSCW 163
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFSA GA+EGIN+I TG L++LSEQEL+DCDRSYN GC GGLMDYA+ F+IKN GID++
Sbjct: 164 AFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKNGGIDSDL 223
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY G+ G CN+ K N +VTID Y+DVP +EK L +A QP+SV I FQLY
Sbjct: 224 DYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQLY 283
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SGIFTG C T++DH V++VGY SE G+DYWI++NSWG +WG GY+ MQRN G S G+C
Sbjct: 284 VSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKSSGLC 343
Query: 323 GINMLASYPTKTGQNPPPSPPPGPTR---------CSLLTYCAAGETCCCGSSILGICLS 373
GI + SYP K G NPP P P+ C T C A TCCC + C
Sbjct: 344 GITIEPSYPVKNGDNPPNPGPTPPSPPSPSLPDNVCDAYTSCPAHTTCCCLYTFGKQCFY 403
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPIC 399
W CC +A CC D CCP +YP+C
Sbjct: 404 WGCCPLEAASCCDDGYSCCPHDYPVC 429
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 197/391 (50%), Positives = 257/391 (65%), Gaps = 13/391 (3%)
Query: 23 SDINELFETWCKQHGKAYS--SEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80
+++ ++E W +HGKA S S EK +R +IF+DN FV +HN N S+ L L FAD
Sbjct: 44 AEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK-NLSYRLGLTRFAD 102
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCG 139
LT+ E+++ +LG A ++ R S++ + D +P SIDWRKKGAV EVKDQ CG
Sbjct: 103 LTNDEYRSKYLG---AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCG 159
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS GA+EGIN+IVTG L++LSEQEL+DCD SYN GC GGLMDYA++F+IKN GID
Sbjct: 160 SCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGID 219
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
T+KDYPY+G G C++ + N +VTID Y+DVP +E+ L +AV QP+S+ I RAF
Sbjct: 220 TDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAF 279
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SGIF G C T LDH V+ VGY +ENG DYWI++NSWG+SWG +GY+ M RN +S
Sbjct: 280 QLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSS 339
Query: 320 GICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYP K G+ PPSP PT+C C TCCC C +
Sbjct: 340 GKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFA 399
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRH 404
W CC +A CC D+ CCP YP+ ++
Sbjct: 400 WGCCPLEAATCCDDNYSCCPHEYPLVTLIKE 430
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 212/427 (49%), Positives = 268/427 (62%), Gaps = 14/427 (3%)
Query: 3 SLAFFLLSILLLSS----LPLNYCSDINELFETWCKQHGKAYSS-EQEKQQRLKIFEDNY 57
+L FFL L +S +P ++ L++ W +HGK +++ E + R IF+DN
Sbjct: 11 ALLFFLFIALSAASPSSIIPQRTDDEVMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNL 70
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
F+ + N N + L LN FADLT++E+++ +LG AS R R ++ P D+
Sbjct: 71 KFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSR-RNRTSNRYLPRLGDDL 128
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P SIDWR KGAV VKDQ SCG+CWAFS ++E IN+IVTG L++LSEQEL+DCDRSYN
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRSYN 188
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMDYA++F+I+N G+DTE+DYPY G C + K N +V ID Y+DVP NNEK
Sbjct: 189 EGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDSYEDVPVNNEK 248
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L +AV Q VSV I G R+FQLY SGIFTG C T LDH V +VGY SE GVDYWI++N
Sbjct: 249 ALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGSEGGVDYWIVRN 308
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK------TGQNPPPSPPPGPTRCSL 351
SWG SWG +GY+ MQRN + G+CGI M SYPTK PPSP P+ C
Sbjct: 309 SWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPTKTGPNPPNPGPTPPSPVKPPSVCDE 368
Query: 352 LTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSL 411
C A ETCCC +CL W CC SA CC DH CCP +YP+C+ VR + S
Sbjct: 369 YYTCPAAETCCCIFQFSNLCLEWGCCPLESATCCDDHYSCCPHDYPVCN-VRAGTCSKSK 427
Query: 412 KFSFTVK 418
F VK
Sbjct: 428 NDIFGVK 434
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 196/394 (49%), Positives = 255/394 (64%), Gaps = 13/394 (3%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E W +HGK Y++ EK++R +IF+DN F+ HN+ + ++ L LN FADLT+
Sbjct: 74 ELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTN 133
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQ---SPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
+E++A +LG ID +RR + +P +P S+DWRK+GAV VKDQ CG+
Sbjct: 134 EEYRAKYLG---TKIDPNRRLGKTPSNRYAPRVGDKLPESVDWRKEGAVPPVKDQGGCGS 190
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA GA+EGINKIVTG L+SLSEQEL+DCD YN GC GGLMDYA++F+I N GID+
Sbjct: 191 CWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGLMDYAFEFIINNGGIDS 250
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPYRG G+C+ + N +V+ID Y+DVP +E L +AV QPVSV I G R FQ
Sbjct: 251 EEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQ 310
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL- 319
LY SG+FTG C T+LDH V+ VGY + NG DYWI++NSWG SWG +GY+ ++RN NS
Sbjct: 311 LYVSGVFTGRCGTALDHGVVAVGYGTANGHDYWIVRNSWGPSWGEDGYIRLERNLANSRS 370
Query: 320 GICGINMLASYP------TKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYP PPSP P C CA TCCC C
Sbjct: 371 GKCGIAIEPSYPLKNGPNPPNPGPSPPSPVKPPNVCDNYYSCADSATCCCIFEFGNACFE 430
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
W CC A CC DH CCP++YPIC++ CL
Sbjct: 431 WGCCPLEGATCCDDHYSCCPNDYPICNTYAGTCL 464
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 207/431 (48%), Positives = 272/431 (63%), Gaps = 26/431 (6%)
Query: 2 NSLAFFLLSILLLSS-LPLNYCS---------------DINELFETWCKQHGKAYSSEQE 45
+SL+ FLL I SS + ++ S ++ ++E W +HGKAY++ E
Sbjct: 6 SSLSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGE 65
Query: 46 KQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR-R 104
K++R IF+DN F+ +HN+ N ++ L LN FADLT++E+++ +LG + R+
Sbjct: 66 KEKRFGIFKDNLRFIDEHNSQ-NLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVS 124
Query: 105 NASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVS 163
S + + D +P IDWRK+GAV VKDQ SCG+CWAFS A+EGIN+IVTG L+S
Sbjct: 125 RKSDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLIS 184
Query: 164 LSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV 223
LSEQEL+DCD SYN GC GGLMDYA++F+I N GID+E+DYPYR +C++ + N ++V
Sbjct: 185 LSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVV 244
Query: 224 TIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVG 283
+IDGY+DVPEN+E L +AV QPVSV I RAFQLY SG+FTG C TSLDH V VG
Sbjct: 245 SIDGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVG 304
Query: 284 YDSENGVDYWIIKNSWGRSWGMNGYMHMQRN-TGNSLGICGINMLASYPTK------TGQ 336
Y +ENG DYWI+ NSWG++WG +GY+ M+RN G+S G CGI + SYP K
Sbjct: 305 YGTENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPIKNGPNPPNPG 364
Query: 337 NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNY 396
PPSP PT C C TCCC C +W CC A CC DH CCP +Y
Sbjct: 365 PSPPSPVQPPTVCDNYYSCPERTTCCCIYEYGKYCFAWGCCPLEGATCCEDHYSCCPHDY 424
Query: 397 PICDSVRHQCL 407
PIC+ CL
Sbjct: 425 PICNVKDGTCL 435
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/412 (48%), Positives = 261/412 (63%), Gaps = 22/412 (5%)
Query: 24 DINELFETWCKQHGKAYSS----EQEKQQRLKIFEDNYAFVTQHNN-MGNSSFTLSLNAF 78
++ +++ W +HG+AY++ E E+ +R +F DN FV HN G F L +N F
Sbjct: 52 EVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFRLGMNQF 111
Query: 79 ADLTHQEFKASFLGFSAASIDHDRRRNASV----QSPGNLRDVPASIDWRKKGAVTEVKD 134
ADLT+ EF+A++LG + RR A V + G ++P S+DWR+KGAV VK+
Sbjct: 112 ADLTNDEFRAAYLGAMVPAA----RRGAVVGERYRHDGAAEELPESVDWREKGAVAPVKN 167
Query: 135 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVI 193
Q CG+CWAFSA ++E +N+IVTG +V+LSEQEL++C NSGC GGLMD A+ F+I
Sbjct: 168 QGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFII 227
Query: 194 KNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGIC 253
KN GIDTE DYPYR G+C+ + N +V+IDG++DVPEN+EK L +AV QPVSV I
Sbjct: 228 KNGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIE 287
Query: 254 GSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQR 313
R FQLY SG+F+G C+T+LDH V+ VGY +ENG DYWI++NSWG WG GY+ M+R
Sbjct: 288 AGGREFQLYKSGVFSGSCTTNLDHGVVAVGYGAENGKDYWIVRNSWGPKWGEAGYIRMER 347
Query: 314 NTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR-------CSLLTYCAAGETCCCGSS 366
N S G CGI M+ASYPTK G NPP P PT C C+AG TCCC
Sbjct: 348 NVNASTGKCGIAMMASYPTKKGANPPRPSPTPPTPPAAPDNVCDENFSCSAGSTCCCAFG 407
Query: 367 ILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
+CL W CC A CC DH CCP YP+C+ VR +VS +VK
Sbjct: 408 FRNVCLVWGCCPVEGATCCKDHASCCPPGYPVCN-VRAGTCSVSKNSPLSVK 458
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/390 (48%), Positives = 248/390 (63%), Gaps = 10/390 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
I +E+W +HGK+Y++ EK+QR +IF+DN+ ++ + N + SF L LN FADLT++
Sbjct: 40 IMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNE 99
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL--RDVPASIDWRKKGAVTEVKDQASCGACW 142
E+++ + G D ++ + Q +L +P S+DWR+ GAV VKDQ CG+CW
Sbjct: 100 EYRSKYTGIRTK--DSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCW 157
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFS A+EGIN+I TG L++LSEQEL+DCDRSYN GC GGLMD A+QF+I N GID++
Sbjct: 158 AFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDA 217
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY G+ GQC++ + N +VTID Y+DVPE +EK L +A QP+SV I S R FQ Y
Sbjct: 218 DYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFY 277
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SGIFTG C T LDH V++VGY +ENG DYWI++NSWG WG GY+ M+R + GIC
Sbjct: 278 DSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERGISSKAGIC 337
Query: 323 GINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSWKC 376
GI SYP K+G NPP P P+ C C TCCC G C +W C
Sbjct: 338 GITSEPSYPVKSGVNPPNPGPSPPSPKSPESVCDEYYTCPMSTTCCCMYEYYGYCFAWGC 397
Query: 377 CGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
C A CC D CCP +YP+C+ C
Sbjct: 398 CPLEGASCCDDGYSCCPHDYPVCNVRAGTC 427
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 193/380 (50%), Positives = 250/380 (65%), Gaps = 11/380 (2%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFS 94
+H K Y++ K++R +IF+DN F+ +HN N SF L LN FADL+++E+K+ FLG
Sbjct: 13 KHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG-- 70
Query: 95 AASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGI 153
+ DR+ S + + D +P S+DWR+KGAV VKDQ CG+CWAFS A+EGI
Sbjct: 71 -GRMVRDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGI 129
Query: 154 NKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQC 213
N+I TG L+SLSEQEL+DCD+ +N GC GG MDYA++F++KN GIDTE DYPY+G GQC
Sbjct: 130 NQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDGQC 189
Query: 214 NKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCST 273
++ + N +VTI+G++DVP+N+EK L +AV QPVSV I RAFQLY SGIF G C T
Sbjct: 190 DQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLCGT 249
Query: 274 SLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS-LGICGINMLASYPT 332
LDH V+ VGY +E+G DYWI++NSWG +WG NGY+ ++RN ++ G CGI M SYPT
Sbjct: 250 DLDHGVVAVGYGTEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPSYPT 309
Query: 333 KTGQN------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCS 386
KTG N PPSP + C C A TCCC C W CC +A CC
Sbjct: 310 KTGVNPPKPGPSPPSPVKPQSVCDDYYTCPASTTCCCVYEYGKYCFGWGCCPLEAATCCD 369
Query: 387 DHRYCCPSNYPICDSVRHQC 406
DH CCP YP+CD C
Sbjct: 370 DHSSCCPQEYPVCDINAQTC 389
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/393 (49%), Positives = 249/393 (63%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ L+ W +HGK+Y++ E+++R F DN ++ +HN + G SF L LN FAD
Sbjct: 36 EARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFAD 95
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E++ ++LG R+ + + +P S+DWR KGAV E+KDQ CG+
Sbjct: 96 LTNEEYRDTYLGLRNKP--RRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGS 153
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA+ F+I N GIDT
Sbjct: 154 CWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDT 213
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E DYPY+G+ +C+ + N +VTID Y+DV N+E L +AV QPVSV I RAFQ
Sbjct: 214 EDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQ 273
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYSSGIFTG C T+LDH V VGY +ENG DYWI++NSWG+SWG +GY+ M+RN S G
Sbjct: 274 LYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSG 333
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G+NPP P P+ C C TCCC C +W
Sbjct: 334 KCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCYAW 393
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP YPIC+ + CL
Sbjct: 394 GCCPLEGATCCDDHYSCCPHEYPICNVQQGTCL 426
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/393 (49%), Positives = 250/393 (63%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ L+ W +HGK+Y++ E+++R F DN ++ +HN + G SF L LN FAD
Sbjct: 35 EARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFAD 94
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E++ ++LG + R+ + + +P S+DWR KGAV E+KDQ CG+
Sbjct: 95 LTNEEYRDTYLGLR--NKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGS 152
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA+ F+I N GIDT
Sbjct: 153 CWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDT 212
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E DYPY+G+ +C+ + N +VTID Y+DV N+E L +AV QPVSV I RAFQ
Sbjct: 213 EDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQ 272
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYSSGIFTG C T+LDH V VGY +ENG DYWI++NSWG+SWG +GY+ M+RN S G
Sbjct: 273 LYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G+NPP P P+ C C TCCC C +W
Sbjct: 333 KCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCYAW 392
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP YPIC+ + CL
Sbjct: 393 GCCPLEGATCCDDHYSCCPHEYPICNVQQGTCL 425
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/394 (49%), Positives = 254/394 (64%), Gaps = 13/394 (3%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E W +HGK Y++ EK++R +IF+DN F+ HN+ + ++ L LN FADLT+
Sbjct: 54 ELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTN 113
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQ---SPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
+E++A +LG ID +RR + +P +P S+DWRK+GAV VKDQ CG+
Sbjct: 114 EEYRAKYLG---TKIDPNRRLGKTPSNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGS 170
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA GA+EGINKIVTG L+SLSEQEL+DCD YN GC GGLMDYA++F+I N GID+
Sbjct: 171 CWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDS 230
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
++DYPYRG G+C+ + N +V+ID Y+DVP +E L +AV QPVSV I G R FQ
Sbjct: 231 DEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQ 290
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL- 319
LY SG+FTG C T+LDH V+ VGY + G DYWI++NSWG SWG +GY+ ++RN NS
Sbjct: 291 LYVSGVFTGRCGTALDHGVVAVGYGTAKGHDYWIVRNSWGSSWGEDGYIRLERNLANSRS 350
Query: 320 GICGINMLASYP------TKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYP PPSP P C CA TCCC C
Sbjct: 351 GKCGIAIEPSYPLKNGPNPPNPGPSPPSPVKPPNVCDNYYSCADSATCCCIFEFGNACFE 410
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
W CC A CC DH CCP++YPIC++ CL
Sbjct: 411 WGCCPLEGASCCDDHYSCCPADYPICNTYAGTCL 444
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 183/393 (46%), Positives = 251/393 (63%), Gaps = 9/393 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
LFE+W HGK+Y++ E+++R +IF++N ++ + N + + F L LN FADLT++E++
Sbjct: 44 LFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYR 103
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
+ + G + + + + + +P S+DWR+ GAV VKDQ SCG+CWAFS
Sbjct: 104 SKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFSTI 163
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EGIN+I TG L++LSEQEL+DCDRSYN GC GGLMDYA++F+I N GIDT+ DYPY
Sbjct: 164 SAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGIDTDVDYPYT 223
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G+ G+C++ + N +VTID Y+DVP +E L +A QP+SV I S R FQ Y SGIF
Sbjct: 224 GRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGIF 283
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
TG C +LDH V++VGY +ENG DYWI++NSWG WG NGY+ M+R + GICGI +
Sbjct: 284 TGKCGIALDHGVVVVGYGTENGKDYWIVRNSWGADWGENGYLRMERGISSKTGICGIAIE 343
Query: 328 ASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSS 381
SYP KTG N PP+P + C C TCCC G C +W CC
Sbjct: 344 PSYPVKTGVNPPNPGPSPPTPKTPESVCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLEG 403
Query: 382 AVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFS 414
A CC D CCP +YP+C+ C S+K++
Sbjct: 404 ASCCDDGYSCCPHDYPVCNVRAGTC---SMKYN 433
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/394 (49%), Positives = 248/394 (62%), Gaps = 11/394 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
+ L+ W +HGK Y++ E+++R F DN ++ +HN + G SF L LN FA
Sbjct: 34 EEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFA 93
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
DLT++E++ ++LG R+ + + +P S+DWR KGAV E+KDQ CG
Sbjct: 94 DLTNEEYRDTYLGLRNKP--RRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCG 151
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFSA A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA+ F+I N GID
Sbjct: 152 SCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGID 211
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TE DYPY+G+ +C+ + N +VTID Y+DV N+E L +AV QPVSV I RAF
Sbjct: 212 TEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAF 271
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLYSSGIFTG C T+LDH V VGY +ENG DYWI++NSWG+SWG +GY+ M+RN S
Sbjct: 272 QLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASS 331
Query: 320 GICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYP K G+NPP P P+ C C TCCC C +
Sbjct: 332 GKCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCYA 391
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
W CC A CC DH CCP YPIC+ + CL
Sbjct: 392 WGCCPLEGATCCDDHYSCCPHEYPICNVQQGTCL 425
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 197/405 (48%), Positives = 255/405 (62%), Gaps = 14/405 (3%)
Query: 12 LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSF 71
L + L + + + E F W +HGKAY ++ R +++DN A++ N ++
Sbjct: 37 FLHMTTDLEHENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSET--NRTY 94
Query: 72 TLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTE 131
+L L FADLT++EF+ + G ID RR + P S+DWRK GAVT
Sbjct: 95 SLGLTKFADLTNEEFRRMYTG---TRIDRSRRAKRRTGFRYADSEAPESVDWRKNGAVTS 151
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191
VKDQ SCG+CWAFSA G++EGIN I G VSLSEQEL+DCD YN GC GGLMDYA+ F
Sbjct: 152 VKDQGSCGSCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDF 211
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251
+I+N GIDTEKDYPY+G G+C+ K N H+VTIDGY+DVPEN+E+ L +AV QPVSV
Sbjct: 212 IIQNGGIDTEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVA 271
Query: 252 ICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311
I R FQLY+ G+F+G C T LDH VL VGY +E+GVDYWI+KNSWG WG +GY+ M
Sbjct: 272 IEAGGRDFQLYAQGVFSGECGTDLDHGVLAVGYGTEDGVDYWIVKNSWGEYWGESGYLRM 331
Query: 312 QRNTGNS---LGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCC 362
+RN +S G+CGIN+ SY KT NPP P P+ C C + TCC
Sbjct: 332 KRNMKDSNDGPGLCGINIEPSYAVKTSPNPPNPGPTPPSPTPPEVICDKWRTCPSENTCC 391
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
C + +CL+W CC SA CC DH +CCP +YP+C+ C+
Sbjct: 392 CTFPMGKMCLAWGCCSMDSATCCDDHYHCCPHDYPVCNLAAGLCV 436
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 201/409 (49%), Positives = 260/409 (63%), Gaps = 16/409 (3%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
+I+ + L+ + ++F W ++H + Y S EKQ+R +IF+DN ++ HN
Sbjct: 33 AIMDYEAHELHSDDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ-EK 91
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS--IDWRKKG 127
S+ L LN F+DLTH EF+A +LG A H R DV A +DWRKKG
Sbjct: 92 SYWLGLNKFSDLTHDEFRALYLGIRPAGRAHGLRNGDRFI----YEDVVAEEMVDWRKKG 147
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 187
AV++VKDQ SCG+CWAFSA G++EG+N IVTG L+SLSEQEL+DCDR N GC GGLMDY
Sbjct: 148 AVSDVKDQGSCGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDY 207
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNK-QKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
A+ F+IKN GIDTE+DYPY+ GQC++ +K +V ID Y+DVP +E LL+AV
Sbjct: 208 AFDFIIKNGGIDTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKN 267
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGM 305
PVSV I R FQ Y G+FTGPC T LDH VL VGY + ++GV+YWI+KNSWG SWG
Sbjct: 268 PVSVAIEAGGRDFQHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGE 327
Query: 306 NGYMHMQRNTGNSL-GICGINMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAG 358
GY+ M+R NS G CGIN+ S+P K G N PP+P P++C C A
Sbjct: 328 KGYIRMERMGSNSTSGKCGINIEPSFPIKKGANPPPAPPSPPTPVKPPSQCDSSHSCPAS 387
Query: 359 ETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
TCCC +I CL W CC SA CC DH +CCPS++P+C+ QC+
Sbjct: 388 STCCCAFNIGKYCLQWGCCPMESATCCEDHYHCCPSDFPVCNLRAGQCV 436
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 193/393 (49%), Positives = 248/393 (63%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ L+ W +HGK+Y++ E+++R F DN ++ +HN + G SF L LN FAD
Sbjct: 35 EARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFAD 94
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E++ ++LG R+ + + +P S+DWR KGAV E+KDQ G+
Sbjct: 95 LTNEEYRDTYLGLRNKP--RRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQEVAGS 152
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA+ F+I N GIDT
Sbjct: 153 CWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDT 212
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E DYPY+G+ +C+ + N +VTID Y+DV N+E L +AV QPVSV I RAFQ
Sbjct: 213 EDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQ 272
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYSSGIFTG C T+LDH V VGY +ENG DYWI++NSWG+SWG +GY+ M+RN S G
Sbjct: 273 LYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G+NPP P P+ C C TCCC C +W
Sbjct: 333 KCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCYAW 392
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP YPIC+ + CL
Sbjct: 393 GCCPLEGATCCDDHYSCCPHEYPICNVQQGTCL 425
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 193/393 (49%), Positives = 248/393 (63%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ L+ W +HGK+Y++ E+++R F DN ++ +HN + G SF L LN FAD
Sbjct: 35 EARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFAD 94
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E++ ++LG R+ + + +P S+DWR KGAV E+KDQ CG+
Sbjct: 95 LTNEEYRDTYLGLRNKP--RRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGS 152
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+E IN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA+ F+I N GIDT
Sbjct: 153 CWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDT 212
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E DYPY+G+ +C+ + N +VTID Y+DV N+E L +AV QPVSV I RAFQ
Sbjct: 213 EDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQ 272
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYSSGIFTG C T+LDH V VGY +ENG DYWI++NSWG+SWG +GY+ M+RN S G
Sbjct: 273 LYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G+NPP P P+ C C TCCC C +W
Sbjct: 333 KCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCYAW 392
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP YPIC+ + CL
Sbjct: 393 GCCPLEGATCCDDHYSCCPHEYPICNVQQGTCL 425
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 199/392 (50%), Positives = 250/392 (63%), Gaps = 16/392 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W +HGKAY++ EK +R IF+DN F+ HN N ++ L LN FADLT++E++
Sbjct: 3 LYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHN-ADNRTYKLGLNRFADLTNEEYR 61
Query: 88 ASFLGFSAASIDHDRR-----RNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
A +LG ID +RR ++ +P ++P S+DWR + AV VKDQ +CG+CW
Sbjct: 62 ARYLG---TRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCW 118
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFS GA+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYAY+F+I N GID+E+
Sbjct: 119 AFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEE 178
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPYR G C++ + N +VTID Y+DVP N+E L +AV QPVSV I G R FQLY
Sbjct: 179 DYPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLY 238
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL-GI 321
SG+FTG C T+LDH V+ VGY S G DYWI++NSWG SWG GY+ ++RN S G
Sbjct: 239 VSGVFTGRCGTALDHGVVAVGYGSVKGHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGK 298
Query: 322 CGINMLASYPTKTG------QNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWK 375
CGI + SYP K G PPSP P C C+ TCCC C+ W
Sbjct: 299 CGIAIEPSYPIKNGANPPNPGPSPPSPVKPPNVCDNSYSCSDSATCCCIFEFQKYCMVWG 358
Query: 376 CCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC +A CC DH CCP YPIC+ CL
Sbjct: 359 CCPLEAATCCDDHYSCCPHEYPICNVRAGTCL 390
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 197/401 (49%), Positives = 257/401 (64%), Gaps = 19/401 (4%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADL 81
++ L+E W +GKAY+ EK++R +IF DN ++ HN N+ S+TL L FADL
Sbjct: 32 EEVRLLYEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADL 91
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV-------PASIDWRKKGAVTEVKD 134
T++E+++++LG + RR N ++PG RD+ P +DWR+KGAV +KD
Sbjct: 92 TNEEYRSTYLGVKPGQV-RPRRAN---RAPGRGRDLSANGDDLPQKVDWREKGAVAPIKD 147
Query: 135 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIK 194
Q CG+CWAFS A+EGIN+IVTG L+ LSEQEL+DCD +YN GC GGLMDYA+QF+I
Sbjct: 148 QGGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAFQFIIS 207
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
N GIDTE+DYPY+ + G C+ + N +V+ID Y+DV EN+E L AV QPVSV I G
Sbjct: 208 NGGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEG 267
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 314
R+FQLY SGIF G C LDH V+ VGY +E+G DYWI++NSWG+SWG GY+ M+RN
Sbjct: 268 GGRSFQLYKSGIFDGRCGIDLDHGVVAVGYGTESGKDYWIVRNSWGKSWGEAGYIRMERN 327
Query: 315 -TGNSLGICGINMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAGETCCCGSSI 367
+S G CGI + SYP K GQN PPSP PT C C TCCC
Sbjct: 328 LPSSSSGKCGIAIEPSYPIKKGQNPPKPAPSPPSPVKPPTECDNYYSCPESTTCCCVYEY 387
Query: 368 LGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408
C +W CC +AVCC DH CCP +YP+C+ + CL
Sbjct: 388 GKYCFAWGCCPLVNAVCCDDHSSCCPHDYPVCNVKQGICLA 428
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 195/411 (47%), Positives = 261/411 (63%), Gaps = 17/411 (4%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E+W QH K Y++ EK++R IF+DN F+ QHN+ + +F + LN FADLT+
Sbjct: 48 EVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGLNKFADLTN 107
Query: 84 QEFKASFLG--FSAASIDHDRRRNASVQSPGNL----RDVPASIDWRKKGAVTEVKDQAS 137
+EF++ +LG S++S + V+S L ++P ++DWRK GAV +VKDQ
Sbjct: 108 EEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAVAKVKDQGQ 167
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYNSGC GGLMDYAY+F+I N G
Sbjct: 168 CGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGGLMDYAYEFIINNGG 227
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
IDT+ DYPY + G+C++ + N +VTID ++DVPEN+EK L +AV QPVSV I
Sbjct: 228 IDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQPVSVAIEAGGS 287
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQ Y SG+FTG C LDH V+ VGY S++G DYWI++NSWG WG +GY+ M+RN
Sbjct: 288 TFQFYQSGVFTGKCGADLDHGVVAVGYGSDDGKDYWIVRNSWGADWGESGYIRMERNLET 347
Query: 318 -SLGICGINMLASYPTKTGQ---------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSI 367
G CGI + SYP K Q PPSP C C + TCCC
Sbjct: 348 VKTGKCGIAIEPSYPIKNSQNPPNPGPTPPSPPSPASADVTCDEYYTCPSSTTCCCVYEY 407
Query: 368 LGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
C +W CC SAVCC+DH CCP +YP+C++ + C S F+VK
Sbjct: 408 GPYCFAWGCCPLESAVCCADHSSCCPHDYPVCNARKGTC-NASKNSPFSVK 457
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 196/410 (47%), Positives = 258/410 (62%), Gaps = 18/410 (4%)
Query: 23 SDINELFETWCKQHGKAYSSE-QEKQQRLKIFEDNYAFVTQHNN-MGNSSFTLSLNAFAD 80
+++ ++E W +HG+ S+ E R ++F DN FV HN G F L +N FAD
Sbjct: 50 AEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFAD 109
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNA--SVQSPGNLRDVPASIDWRKKGAVTEVKDQASC 138
LT+ EF+A++LG A I R NA + ++P S+DWR+KGAV VK+Q C
Sbjct: 110 LTNDEFRAAYLG---ARIPAARSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQC 166
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 197
G+CWAFSA ++E IN+IVTG +V+LSEQEL++C NSGC GGLMD A+ F+IKN G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
IDTE DYPY+ G+C+ + N +V+ID ++DVPEN+EK L +AV QPVSV I R
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGR 286
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQLY SG+F+G C+T+LDH V+ VGY +ENG DYWI++NSWG WG GY+ M+RN
Sbjct: 287 QFQLYKSGVFSGSCTTNLDHGVVAVGYGTENGKDYWIVRNSWGPKWGEAGYIRMERNINA 346
Query: 318 SLGICGINMLASYPTKTGQNPPPSPPPGPTR---------CSLLTYCAAGETCCCGSSIL 368
+ G CGI M+ASYPTK G NPP P PT C C+AG TCCC
Sbjct: 347 TTGKCGIAMMASYPTKKGANPPKPSPTPPTPPPPVAPDHVCDENFVCSAGSTCCCAFGFR 406
Query: 369 GICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
+CL W CC A CC DH CCP +YP+C+ +R + +VS +VK
Sbjct: 407 NVCLVWGCCPIEGATCCKDHASCCPPDYPVCN-IRARTCSVSKNSPLSVK 455
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 194/393 (49%), Positives = 255/393 (64%), Gaps = 18/393 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
I ++F W + H + Y S EK R +IF++N+ ++ HN S+ L LN F+DLTHQ
Sbjct: 45 ILDVFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQ-QKSYWLGLNKFSDLTHQ 103
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS--IDWRKKGAVTEVKDQASCGACW 142
EF+A +LG + +R+ A+ DV A +DWR KGAVT+VKDQ +CG+CW
Sbjct: 104 EFRAQYLGTKPVN---RQRKEANFM----YEDVEAEPKVDWRLKGAVTDVKDQGACGSCW 156
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFSA G++EG+N I TG LVSLSEQEL+DCDR N GC GGLMDYA++F+IKN GIDTEK
Sbjct: 157 AFSAVGSVEGVNAIKTGELVSLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKNGGIDTEK 216
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY+ + G+C++ + N +V ID Y+DVP +E L++A+ PVSV I R FQ Y
Sbjct: 217 DYPYKARDGRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRDFQHY 276
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL-G 320
G+FTGPC + LDH VL VGY + ++GV+YWI+KNSWG WG GY+ M+R +S G
Sbjct: 277 QGGVFTGPCGSELDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSDSTDG 336
Query: 321 ICGINMLASYPTKTG------QNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSW 374
CGIN+ AS+P K G PPSP P++C C A TCCC +I CL W
Sbjct: 337 KCGINIEASFPIKKGPNPPPSPPSPPSPIKPPSQCDNSHSCPASSTCCCAFNIGKYCLQW 396
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC SA CC DH +CCPS++P+C+ QCL
Sbjct: 397 GCCPMESATCCEDHYHCCPSDFPVCNLRAGQCL 429
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 194/406 (47%), Positives = 249/406 (61%), Gaps = 23/406 (5%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
+ L+ W +HGK Y++ E+++R F DN ++ +HN + G SF L LN FA
Sbjct: 34 EEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFA 93
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
DLT++E++ ++LG R+ + + +P S+DWR KGAV E+KDQ CG
Sbjct: 94 DLTNEEYRDTYLGLRNKP--RRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCG 151
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFSA A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA+ F+I N GID
Sbjct: 152 SCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGID 211
Query: 200 TEKDYPYRGQAGQCNKQKL------------NRHIVTIDGYKDVPENNEKQLLQAVVAQP 247
TE DYPY+G+ +C+ ++ N +VTID Y+DV N+E L +AV QP
Sbjct: 212 TEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSETSLQKAVANQP 271
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 307
VSV I RAFQLYSSGIFTG C T+LDH V VGY +ENG DYWI++NSWG+SWG +G
Sbjct: 272 VSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESG 331
Query: 308 YMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETC 361
Y+ M+RN S G CGI + SYP K G+NPP P P+ C C TC
Sbjct: 332 YVRMERNIKASSGKCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTC 391
Query: 362 CCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC C +W CC A CC DH CCP YPIC+ + CL
Sbjct: 392 CCIYEYGKYCYAWGCCPLEGATCCDDHYSCCPHEYPICNVQQGTCL 437
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 197/435 (45%), Positives = 260/435 (59%), Gaps = 34/435 (7%)
Query: 1 MNSLAFFLLSILLLSSLPLNYC-----------------SDINELFETWCKQHGKAYSSE 43
++ L +++ SL L+ C + ++E W +HGK Y++
Sbjct: 2 LSKLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNAL 61
Query: 44 QEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR 103
EK++R +IF+DN F+ +HN+ N SF L LN FADLT++E++ FLG + R
Sbjct: 62 GEKEKRFEIFKDNLGFIDEHNSK-NLSFRLGLNRFADLTNEEYRTRFLGTRI----NPNR 116
Query: 104 RNASVQSPGNL------RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIV 157
RN V S N +P S+DWRK+GAV VKDQ SCG+CWAFSA A+EG+NK+
Sbjct: 117 RNRKVNSQTNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLA 176
Query: 158 TGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQK 217
TG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I + E+DYPYR G+C++ +
Sbjct: 177 TGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNR 236
Query: 218 LNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDH 277
N +V+ID Y+DVP +E L +AV Q ++V + G R FQLY SG+FTG C T+LDH
Sbjct: 237 KNAKVVSIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDH 296
Query: 278 AVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL-GICGINMLASYPTKTGQ 336
V VGY +ENG DYWI++NSWG SWG GY+ ++RN S G CGI + SYP K G
Sbjct: 297 GVAAVGYGTENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPIKNGL 356
Query: 337 NPPPSPPPGPTRCSLLTY-----CAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYC 391
NPP P P+ + CA G TCCC G C W CC SA CC DH C
Sbjct: 357 NPPKPAPSPPSPVKPPSVCDSYSCAEGSTCCCIFDYGGSCFEWGCCPLESATCCDDHYSC 416
Query: 392 CPSNYPICDSVRHQC 406
CP YP+CD+ C
Sbjct: 417 CPHEYPVCDTYAGLC 431
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/395 (47%), Positives = 253/395 (64%), Gaps = 17/395 (4%)
Query: 23 SDINELFETWCKQHGKAYSSE-QEKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAFAD 80
+ + ++E W +HGKA S+ E +R + F DN FV HN G + L +N FAD
Sbjct: 46 AQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINRFAD 105
Query: 81 LTHQEFKASFLGF-----SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
LT+ EF+A++L +A + +R R+ V++ +P +DWR+KGAV VK+Q
Sbjct: 106 LTNAEFRAAYLSAGARNGTATAATGERYRHDGVEA------LPEFVDWRQKGAVAPVKNQ 159
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIK 194
CG+CWAFSA GA+EGIN+IVTG LV+LSEQEL+DC ++ N GC GG+MD A+ F++
Sbjct: 160 GQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVG 219
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
N GIDT+KDYPY + G+C+ K +RH+V+IDG++ VP N+EK L +AV QPV+V I
Sbjct: 220 NGGIDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEA 279
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE--NGVDYWIIKNSWGRSWGMNGYMHMQ 312
R FQLY SG+FTG C TSLDH V+ VGY +E G DYW+++NSWG WG GY+ M+
Sbjct: 280 GGREFQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRME 339
Query: 313 RNTGNSLGICGINMLASYPTKTGQN-PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371
RN G G CGI M ASYP K+G N P PP P C + C AG TCCC + +C
Sbjct: 340 RNVGARAGKCGIAMEASYPVKSGANPDPSPSPPTPVTCDRYSACPAGSTCCCTYGVRNVC 399
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
L W CC A CC D CCP+++P+CD+ C
Sbjct: 400 LVWGCCPAEGATCCKDRATCCPADHPVCDARTRTC 434
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/393 (47%), Positives = 254/393 (64%), Gaps = 18/393 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQEF 86
++ W ++G++Y++ E+++R ++F DN FV HN + F L +N FADLT+ EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
+++FLG A ++ R + G + ++P S+DWR+KGAV VK+Q CG+CWAFSA
Sbjct: 109 RSTFLG--AKVVERSRAAGERYRHDG-VEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 165
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
+E IN++VTG +++LSEQEL++C + NSGC GGLMD A+ F+IKN GIDTE DYP
Sbjct: 166 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 225
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y+ G+C+ + N +V+IDG++DVP+N+EK L +AV QPVSV I R FQLY SG
Sbjct: 226 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 285
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
+F+G C TSLDH V+ VGY ++NG DYWI++NSWG WG +GY+ M+RN + G CGI
Sbjct: 286 VFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCGIA 345
Query: 326 MLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGETCCCGSSILGICLS 373
M+ASYPTK+G NPP P PT C C AG TCCC +CL
Sbjct: 346 MMASYPTKSGANPPKPSPAPPTPPTPPPPAAPDHVCDDNFSCPAGSTCCCAFGFRNLCLV 405
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
W CC A CC DH CCP +YPIC++ C
Sbjct: 406 WGCCPVEGATCCKDHASCCPPDYPICNTRAGTC 438
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 192/394 (48%), Positives = 240/394 (60%), Gaps = 15/394 (3%)
Query: 29 FETWCKQHGKAYSSE-QEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
F W + KAY +E +++ ++ DN FV HN +S+F L L FADLTH E++
Sbjct: 48 FSDWVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHNEK-DSTFKLGLTNFADLTHDEYR 106
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
LG+ S + P SIDWRKKGAVT+VK+Q CG+CWAFS T
Sbjct: 107 QHALGYRPELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTT 166
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
G++EG N I +G LVSLSEQEL+DCD + + GC GGLMD+A+ F+I+N GIDTEKDY Y+
Sbjct: 167 GSVEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYK 226
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
Q G CN K RH+VTID Y+DVP N+E L +A QP+SV I +R FQLY+ G+F
Sbjct: 227 AQDGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVF 286
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
PC T+LDH VL+VGY S+NG DYWI+KNSWG WG +GY+ + R NS G CGI M
Sbjct: 287 DAPCGTALDHGVLVVGYGSDNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAMQ 346
Query: 328 ASYPTKTGQNPPPSPPPGPTR-------------CSLLTYCAAGETCCCGSSILGICLSW 374
ASYP K NPP PP P C T C TCCC G C +W
Sbjct: 347 ASYPIKKTPNPPTPPPVPPPTPGPPSPPSPKPEVCDTATSCPPASTCCCMREFFGYCFTW 406
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408
CC A CC DH +CCPSN P+CD+V +CL+
Sbjct: 407 ACCPLKEATCCDDHEHCCPSNLPVCDTVAGRCLS 440
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 178/338 (52%), Positives = 233/338 (68%), Gaps = 11/338 (3%)
Query: 7 FLLSILLLSSLPLNYC-SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
F LS + +S +NY +++ ++E W +H K Y+ +K +R ++F+DN F+ +HNN
Sbjct: 15 FTLSYAIKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNLGFIQEHNN 74
Query: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGN-----LRD-VPA 119
N+++ L LN FAD+T++E++A +LG + + +RR +S G+ RD +P
Sbjct: 75 NLNNTYKLGLNKFADMTNEEYRAMYLGTKSNA----KRRLMKTKSTGHRYAFSARDRLPV 130
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179
+DWR KGAV +KDQ SCG+CWAFS +E INKIVTG VSLSEQEL+DCDR+YN G
Sbjct: 131 HVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEG 190
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMDYA++F+I+N GIDT+KDYPYRG G C+ K N +V IDGY+DVP +E L
Sbjct: 191 CNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYDENAL 250
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSW 299
+AV QPVSV I S RA QLY SG+FTG C TSLDH V++VGY SENGVDYW+++NSW
Sbjct: 251 KKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYGSENGVDYWLVRNSW 310
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
G WG +GY MQRN S G CGI M ASYP K G N
Sbjct: 311 GTGWGEDGYFKMQRNVRTSTGKCGITMEASYPVKNGLN 348
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 195/418 (46%), Positives = 256/418 (61%), Gaps = 27/418 (6%)
Query: 23 SDINELFETWCKQHGKAYS-SEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAF 78
++ ++ W +HG S S E+++R + F DN FV HN G F L +N F
Sbjct: 46 AEARAIYGLWRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRF 105
Query: 79 ADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVK 133
ADLT+ EF+A++LG A +RR+A R ++P ++DWR+KGAV VK
Sbjct: 106 ADLTNDEFRAAYLGVKGAG----QRRSARAGVGERYRHDGVEELPEAVDWREKGAVAPVK 161
Query: 134 DQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFV 192
+Q CG+CWAFSA A+E IN++VTG LV+LSEQEL++CD ++GC GGLMD A+ F+
Sbjct: 162 NQGQCGSCWAFSAVSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFI 221
Query: 193 IKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGI 252
I N GIDTE DYPY+ G+C+ + N +V+IDG++DVPEN+EK L +AV QPVSV I
Sbjct: 222 INNGGIDTEDDYPYKALDGKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAI 281
Query: 253 CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
R FQLY SG+FTG C T LDH V+ VGY +ENG DYWI++NSWG WG GY+ M+
Sbjct: 282 EAGGREFQLYHSGVFTGRCGTELDHGVVAVGYGTENGKDYWIVRNSWGPKWGEAGYLRME 341
Query: 313 RNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGET 360
RN + G CGI M++SYPTK G NPP P PT C CAAG T
Sbjct: 342 RNINATTGKCGIAMMSSYPTKKGANPPKPSPTPPTPPTPPPPVAPDHVCDENVSCAAGST 401
Query: 361 CCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
CCC +CL W CC A CC DH CCP +YP+C+ ++ + S + TVK
Sbjct: 402 CCCAFGFRNMCLVWGCCPVEGATCCKDHASCCPPDYPVCN-IKAGTCSASKNRTLTVK 458
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 184/392 (46%), Positives = 252/392 (64%), Gaps = 17/392 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLTHQEFK 87
++ W ++G++Y++ E ++R ++F DN F HN + F L +N FADLT++EF+
Sbjct: 54 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 113
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A+FLG A ++ R + G + ++P S+DWR+KGAV VK+Q CG+CWAFSA
Sbjct: 114 ATFLG--AKVVERSRAAGERYRHDG-VEELPESVDWREKGAVAPVKNQGQCGSCWAFSAV 170
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
+E IN++VTG +++LSEQEL++C + NSGC GGLMD A+ F+IKN GIDTE DYPY
Sbjct: 171 STVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPY 230
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ G+C+ + N +V+IDG++DVP+N+EK L +AV QPVSV I R FQLY SG+
Sbjct: 231 KAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGV 290
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
F+G C TSLDH V+ VGY ++NG DYWI++NSWG WG +GY+ M+RN + G CGI M
Sbjct: 291 FSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAM 350
Query: 327 LASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGETCCCGSSILGICLSW 374
+ASYPTK+G NPP P PT C C G TCCC +CL W
Sbjct: 351 MASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVCDDNFSCPVGSTCCCAFGFRNLCLVW 410
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
CC A CC DH CCP +YP+C++ C
Sbjct: 411 GCCPVEGATCCKDHASCCPPDYPVCNTRAGTC 442
>gi|222632170|gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
Length = 1105
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/308 (60%), Positives = 215/308 (69%), Gaps = 3/308 (0%)
Query: 29 FETWCKQHGKAYSSEQEKQQR-LKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
FE WC +HG++Y++ E R + F + L+L +
Sbjct: 38 FEAWCAEHGRSYATPGELVGRGSRRFAGTTRRSWRRTTARPRRTPLALQRLRGPYARRVP 97
Query: 88 ASFLGFSAASIDHDRRRNAS--VQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
A A+ R + + G + VP ++DWR+ GAVT+VKDQ SCGACW+FS
Sbjct: 98 APRRSGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFS 157
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
ATGA+EGINKI TGSL+SLSEQELIDCDRSYNSGCGGGLMDYAY+FV+KN GIDTE DYP
Sbjct: 158 ATGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYP 217
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
YR G CNK KL R +VTIDGYKDVP NNE LLQAV QPVSVGICGS RAFQLYS G
Sbjct: 218 YRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKG 277
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
IF GPC TSLDHA+LIVGY SE G DYWI+KNSWG SWGM GYM+M RNTGNS G+CGIN
Sbjct: 278 IFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGIN 337
Query: 326 MLASYPTK 333
+ S+PTK
Sbjct: 338 QMPSFPTK 345
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 194/387 (50%), Positives = 241/387 (62%), Gaps = 40/387 (10%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++E W +HGK+Y++ EK++R +IF+DN F+ +HN N ++ +S +
Sbjct: 3 VYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHN-AENRTYKIS----------DRY 51
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A +G S +P S+DWRKKGAV EVKDQ SCG+CWAFS
Sbjct: 52 AFRVGDS----------------------LPESVDWRKKGAVVEVKDQGSCGSCWAFSTI 89
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N GID+E+DYPY+
Sbjct: 90 AAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYK 149
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G+C++ + N +VTIDGY+DVPEN+EK L +AV QPVSV I R FQLY SGIF
Sbjct: 150 ASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIF 209
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS-LGICGINM 326
TG C T+LDH V VGY +ENGVDYWI+KNSWG SWG GY+ M+R+ S G CGI M
Sbjct: 210 TGRCGTALDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAM 269
Query: 327 LASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380
ASYP K GQ PPSP PT C C TCCC C W CC
Sbjct: 270 EASYPIKKGQNPPNPGPSPPSPIKPPTVCDNYYACPESSTCCCIFEYAKYCFQWGCCPLE 329
Query: 381 SAVCCSDHRYCCPSNYPICDSVRHQCL 407
+A CC DH CCP YP+C+ C+
Sbjct: 330 AATCCEDHDSCCPQEYPVCNVRAGTCM 356
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 211/430 (49%), Positives = 266/430 (61%), Gaps = 21/430 (4%)
Query: 3 SLAFFLLSILLLSS----LPLNYCSDINELFETWCKQHGKAYSS-EQEKQQRLKIFEDNY 57
+L FFL L +S +P ++ L++ W +HGK +++ E + R IF+DN
Sbjct: 11 ALLFFLFIALSAASPSSIIPQRTDDEVMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNL 70
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
F+ + N N + L LN FADLT++E+++ +LG AS R R ++ P D+
Sbjct: 71 KFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSR-RNRTSNRYLPRLGDDL 128
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P SIDWR KGAV VKDQ SCG+CWAFS ++E IN+IVTG L++LSEQEL+DCDRSYN
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRSYN 188
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMDYA++F+I+N G+DTE+DYPY G C + K N IDGY+DVP NNEK
Sbjct: 189 EGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKN----AIDGYEDVPVNNEK 244
Query: 238 QLLQA---VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
L +A V VSV I G R+FQLY SGIFTG C T LDH V +VGY SE GVDYWI
Sbjct: 245 ALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGSEGGVDYWI 304
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK------TGQNPPPSPPPGPTR 348
++NSWG SWG +GY+ MQRN + G+CGI M SYPTK PPSP P+
Sbjct: 305 VRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPTKTGPNPPNPGPTPPSPVKPPSV 364
Query: 349 CSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408
C C A ETCCC +CL W CC SA CC DH CCP +YP+C+ VR +
Sbjct: 365 CDEYYTCPAAETCCCIFQFSNLCLEWGCCPLESATCCDDHYSCCPHDYPVCN-VRAGTCS 423
Query: 409 VSLKFSFTVK 418
S F VK
Sbjct: 424 KSKNDIFGVK 433
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 189/416 (45%), Positives = 255/416 (61%), Gaps = 22/416 (5%)
Query: 23 SDINELFETWCKQHGKA----YSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSL 75
++ +++ W +HG +S E+++R + F DN FV HN G F L++
Sbjct: 44 AEARAVYDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAM 103
Query: 76 NAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
N FADLT+ EF+A++LG R + G ++P ++DWR+KGAV VK+Q
Sbjct: 104 NRFADLTNDEFRAAYLGVKGQRARPGRVVGERYRHDG-AEELPEAVDWREKGAVAPVKNQ 162
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIK 194
CG+CWAFSA +E IN+IVTG +V+LSEQEL++CD + +SGC GGLMD A++F+IK
Sbjct: 163 GQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIK 222
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
N GIDTE DYPY+ G+C+ + N +V+IDG++DVPEN+EK L +AV QPVSV I
Sbjct: 223 NGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 282
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 314
R FQLY SG+F+G C T LDH V+ VGY +ENG DYWI++NSWG +WG GY+ M+RN
Sbjct: 283 GGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIVRNSWGPNWGEAGYLRMERN 342
Query: 315 TGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGETCC 362
+ G CGI M++SYPTK G NPP P P+ C C AG TCC
Sbjct: 343 INVTSGKCGIAMMSSYPTKKGANPPKPAPTPPSPPTPPPPVAPDHVCDENFSCPAGSTCC 402
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
C +CL W CC A CC DH CCP +YP+C+ VR + + +VK
Sbjct: 403 CSFGFRNLCLVWGCCPAEGATCCKDHSSCCPPDYPVCN-VRAGTCSATKNSPLSVK 457
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 177/358 (49%), Positives = 231/358 (64%), Gaps = 21/358 (5%)
Query: 1 MNSLAFFLLSILLLSSLPL----------NYC-SDINELFETWCKQHGKAYSSEQEKQQR 49
M S+ ++S LL S L NY +++ ++E W +H K Y+ EK +R
Sbjct: 1 MASIMTLMISTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKR 60
Query: 50 LKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ 109
++F+DN F+ +HNN N+++ L LN FAD+T++E++ + G + + +RR +
Sbjct: 61 FQVFKDNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDA----KRRLMKTK 116
Query: 110 SPGNL------RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVS 163
S G+ +P +DWR KGAV +KDQ SCG+CWAFS +E INKIVTG VS
Sbjct: 117 STGHRYAYSAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVS 176
Query: 164 LSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV 223
LSEQEL+DCDR+YN GC GGLMDYA++F+I+N GIDT+KDYPYRG G C+ K N V
Sbjct: 177 LSEQELVDCDRAYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAV 236
Query: 224 TIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVG 283
IDGY+DVP +E L +AV QPVS+ I S RA QLY SG+FTG C TSLDH V++VG
Sbjct: 237 NIDGYEDVPPYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVG 296
Query: 284 YDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPS 341
Y SENGVDYW+++NSWG WG +GY MQRN G CGI M ASYP K G N S
Sbjct: 297 YGSENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPVKNGLNSANS 354
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 190/420 (45%), Positives = 255/420 (60%), Gaps = 28/420 (6%)
Query: 23 SDINELFETWCKQHGKAYS----SEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSL 75
++ +++ W +HG S S ++++R F DN FV HN G F L++
Sbjct: 46 AEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAM 105
Query: 76 NAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD----VPASIDWRKKGAVTE 131
N FADLT+ EF+A++LG A+ +R R V D +P ++DWR+KGAV
Sbjct: 106 NRFADLTNDEFRAAYLGVKGAA---ERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAP 162
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQ 190
VK+Q CG+CWAFSA +E IN+IVTG +V+LSEQEL++CD +SGC GGLMD A++
Sbjct: 163 VKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFE 222
Query: 191 FVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSV 250
F+IKN GIDTE DYPY+ G+C+ + N +V+IDG++DVPEN+EK L +AV PVSV
Sbjct: 223 FIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSV 282
Query: 251 GICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMH 310
I R FQLY SG+F+G C T LDH V+ VGY +ENG DYWI++NSWG +WG GY+
Sbjct: 283 AIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIVRNSWGPNWGEAGYLR 342
Query: 311 MQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAG 358
M+RN + G CGI M++SYPTK G NPP P P+ C C AG
Sbjct: 343 MERNINVTSGKCGIAMMSSYPTKKGANPPKPAPTPPSPPTPPPPVAPDHVCDENFSCPAG 402
Query: 359 ETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
TCCC +CL W CC A CC DH CCP +YP+C+ +R + + +VK
Sbjct: 403 STCCCSFGFRNLCLVWGCCPAEGATCCKDHSSCCPPDYPVCN-IRAGTCSATKNSPLSVK 461
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 190/420 (45%), Positives = 255/420 (60%), Gaps = 28/420 (6%)
Query: 23 SDINELFETWCKQHGKAYS----SEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSL 75
++ +++ W +HG S S ++++R F DN FV HN G F L++
Sbjct: 46 AEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAM 105
Query: 76 NAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD----VPASIDWRKKGAVTE 131
N FADLT+ EF+A++LG A+ +R R V D +P ++DWR+KGAV
Sbjct: 106 NRFADLTNDEFRAAYLGVKGAA---ERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAP 162
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQ 190
VK+Q CG+CWAFSA +E IN+IVTG +V+LSEQEL++CD +SGC GGLMD A++
Sbjct: 163 VKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFE 222
Query: 191 FVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSV 250
F+IKN GIDTE DYPY+ G+C+ + N +V+IDG++DVPEN+EK L +AV PVSV
Sbjct: 223 FIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSV 282
Query: 251 GICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMH 310
I R FQLY SG+F+G C T LDH V+ VGY +ENG DYWI++NSWG +WG GY+
Sbjct: 283 AIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIVRNSWGPNWGEAGYLR 342
Query: 311 MQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAG 358
M+RN + G CGI M++SYPTK G NPP P P+ C C AG
Sbjct: 343 MERNINVTSGKCGIAMMSSYPTKKGANPPKPAPTPPSPPTPPPPVAPDHVCDENFSCPAG 402
Query: 359 ETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
TCCC +CL W CC A CC DH CCP +YP+C+ +R + + +VK
Sbjct: 403 STCCCSFGFRNLCLVWGCCPAEGATCCKDHSSCCPPDYPVCN-IRAGTCSATKNSPLSVK 461
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 177/360 (49%), Positives = 238/360 (66%), Gaps = 16/360 (4%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
++ ++ W +HG Y++ E+++R + F DN ++ QHN + G SF L LN FAD
Sbjct: 38 EVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFAD 97
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E+++++LG + D +R+ +A Q+ N ++P S+DWRKKGAV VKDQ CG+
Sbjct: 98 LTNEEYRSTYLG-ARTKPDRERKLSARYQAADN-DELPESVDWRKKGAVGAVKDQGGCGS 155
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGIN+IVTG ++ LSEQEL+DCD SYN GC GGLMDYA++F+I N GID+
Sbjct: 156 CWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDS 215
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYPY+ + +C+ K N +VTIDGY+DVP N+EK L +AV QP+SV I RAFQ
Sbjct: 216 EEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 275
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LY SGIFTG C T+LDH V VGY +ENG DYW+++NSWG WG +GY+ M+RN S G
Sbjct: 276 LYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGSVWGEDGYIRMERNIKASSG 335
Query: 321 ICGINMLASYPTKTGQNP---------PPS--PPPGPTRCSLLTYCAAGETCCCGSSILG 369
CGI + SYPTKT + P PP P T +L AA T S+ G
Sbjct: 336 KCGIAVEPSYPTKTARTPLTPAQLHRLPPHRLPSVTATTSALRARPAAASTSTARSASPG 395
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 189/411 (45%), Positives = 250/411 (60%), Gaps = 27/411 (6%)
Query: 23 SDINELFETWCKQHGKAYS----SEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSL 75
++ +++ W +HG S S ++++R F DN FV HN G F L++
Sbjct: 46 AEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAM 105
Query: 76 NAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD----VPASIDWRKKGAVTE 131
N FADLT+ EF+A++LG A+ +R R V D +P ++DWR+KGAV
Sbjct: 106 NRFADLTNDEFRAAYLGVKGAA---ERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAP 162
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQ 190
VK+Q CG+CWAFSA +E IN+IVTG +V+LSEQEL++CD +SGC GGLMD A++
Sbjct: 163 VKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFE 222
Query: 191 FVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSV 250
F+IKN GIDTE DYPY+ G+C+ + N +V+IDG++DVPEN+EK L +AV PVSV
Sbjct: 223 FIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSV 282
Query: 251 GICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMH 310
I R FQLY SG+F+G C T LDH V+ VGY +ENG DYWI++NSWG +WG GY+
Sbjct: 283 AIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIVRNSWGPNWGEAGYLR 342
Query: 311 MQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAG 358
M+RN + G CGI M++SYPTK G NPP P P+ C C AG
Sbjct: 343 MERNINVTSGKCGIAMMSSYPTKKGANPPKPAPTPPSPPTPPPPVAPDHVCDENFSCPAG 402
Query: 359 ETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
TCCC +CL W CC A CC DH CCP +YP+C+ C V
Sbjct: 403 STCCCSFGFRNLCLVWGCCPAEGATCCKDHSSCCPPDYPVCNIRAGTCSAV 453
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 186/396 (46%), Positives = 249/396 (62%), Gaps = 22/396 (5%)
Query: 29 FETWCKQHGKAYSSE--QEKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQ 84
++ W ++G + E ++R +F DN FV HN + F L +N FADLT++
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 85 EFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
EF+A+FLG A +R R A + + ++P S+DWR+KGAV VK+Q CG+CWA
Sbjct: 112 EFRATFLGAKVA----ERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEK 202
FSA +E IN++VTG +++LSEQEL++C + NSGC GGLMD A+ F+IKN GIDTE
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY+ G+C+ + N +V+IDG++DVP+N+EK L +AV QPVSV I R FQLY
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SG+F+G C TSLDH V+ VGY ++NG DYWI++NSWG WG +GY+ M+RN + G C
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 347
Query: 323 GINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGETCCCGSSILGI 370
GI M+ASYPTK+G NPP P PT C C AG TCCC +
Sbjct: 348 GIAMMASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVCDDNFSCPAGSTCCCAFGFRNL 407
Query: 371 CLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
CL W CC A CC DH CCP +YP+C++ C
Sbjct: 408 CLVWGCCPVEGATCCKDHASCCPPDYPVCNTRAGTC 443
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 368 bits (945), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 188/416 (45%), Positives = 256/416 (61%), Gaps = 22/416 (5%)
Query: 23 SDINELFETWCKQHGKAYS----SEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSL 75
++ +++ W ++G S S E+++R + F DN FV HN G + L +
Sbjct: 47 AEARAVYDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGM 106
Query: 76 NAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
N FADLT+ EF+A++LG A R + G ++P ++DWR+KGAV VK+Q
Sbjct: 107 NRFADLTNDEFRAAYLGVKAQRARPGRMVGERYRHDG-AEELPEAVDWREKGAVAPVKNQ 165
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIK 194
CG+CWAFSA +E IN+IVTG +V+LSEQEL++CD + +SGC GGLMD A++F+IK
Sbjct: 166 GQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIK 225
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
N GIDTE DYPY+ G+C+ + N +V+IDG++DVPEN+EK L +AV QPVSV I
Sbjct: 226 NGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 285
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 314
R FQLY SG+F+G C T LDH V+ VGY +ENG DYWI++NSWG +WG +GY+ M+RN
Sbjct: 286 GGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIVRNSWGPNWGESGYLRMERN 345
Query: 315 TGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGETCC 362
+ G CGI M++SYPTK G NPP P P+ C C AG TCC
Sbjct: 346 INVTSGKCGIAMMSSYPTKKGANPPKPAPTPPSPPTPPPPVAPDHVCDENFSCPAGSTCC 405
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
C +CL W CC A CC DH CCP +YP+C+ +R + + +VK
Sbjct: 406 CSFGFRNLCLVWGCCPAEGATCCKDHSSCCPPDYPVCN-IRAGTCSATKNSPLSVK 460
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 367 bits (941), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 183/350 (52%), Positives = 236/350 (67%), Gaps = 9/350 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
++ L+E W +HG+A ++ EK++R +IF+DN F+ HN + G+ SF L LN FA
Sbjct: 44 EEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRFA 103
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
D+T++E++ +LG AS R + ++P S+DWR KGAVT VKDQ SCG
Sbjct: 104 DMTNEEYRTVYLGTRPASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVTTVKDQGSCG 163
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS A+EGINKIVTG L+SLSEQEL+DCD N GC GGLMDYA++F+I N GID
Sbjct: 164 SCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFEFIINNGGID 223
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TE+DYPY+ + G+C++ + N +V+IDGY+DVP N+EK L +AV QPVSV I R F
Sbjct: 224 TEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREF 283
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SGIFTG C T LDH V+ VGY +ENG DYWI++NSWG WG +GY+ M+RN S
Sbjct: 284 QLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSWGGDWGESGYIRMERNVNAST 343
Query: 320 GICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCC 363
G CGI M +SYPTK GQNPP P P+ C C +G TCCC
Sbjct: 344 GKCGIAMESSYPTKKGQNPPNPGPSPPSPVNPPAVCDNYYSCPSGTTCCC 393
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 46/89 (51%), Gaps = 6/89 (6%)
Query: 318 SLGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGIC 371
S G CGI M +SYPTK GQNPP P P+ C C +G TCCC C
Sbjct: 402 STGKCGIAMESSYPTKKGQNPPNPGPSPPSPVNPPAVCDNYYSCPSGTTCCCVYEFGRRC 461
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICD 400
+W CC A CC D CCP +YP+C+
Sbjct: 462 FAWGCCPLEGATCCEDRYSCCPHDYPVCN 490
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 194/401 (48%), Positives = 255/401 (63%), Gaps = 37/401 (9%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFT--LSLNAFADLT 82
+ ELF+ W K+H K Y +E RL+ F+ N ++ + N M NS L LN FAD++
Sbjct: 47 VVELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFADMS 106
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++EFK F+ + V+S D P S+DWRKKG VT VKDQ +CG+CW
Sbjct: 107 NEEFKNKFI--------------SKVES---CDDAPYSLDWRKKGVVTGVKDQGNCGSCW 149
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
+FS+TGAIEG+N IVTG L+SLSEQEL+DCD + N GC GG MDYA+++VI N GIDTE
Sbjct: 150 SFSSTGAIEGVNAIVTGDLISLSEQELVDCDTT-NDGCEGGYMDYAFEWVINNGGIDTEA 208
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY G G CN K +VTIDGY DV + ++ L A V QP+SVGI GS FQLY
Sbjct: 209 DYPYIGVGGTCNVTKEETKVVTIDGYTDVTQ-SDSALFCATVKQPISVGIDGSTLDFQLY 267
Query: 263 SSGIFTGPCSTS---LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
+ GI+ G CS++ +DHAVLIVGY S+ DYWI+KNSWG SWG+ G+++++RNT
Sbjct: 268 TGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGIEGFIYIRRNTNLKY 327
Query: 320 GICGINMLASYPTK-------------TGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSS 366
G+C IN +AS+PTK PP P P P++C +YC ETCCC
Sbjct: 328 GVCAINYMASFPTKESTSISPTSPPSPPSPPPPTPPSPTPSKCGDFSYCTTEETCCCLYE 387
Query: 367 ILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
+ CL++ CC + +AVCC+ +YCCPS+YPICD+ CL
Sbjct: 388 LFDFCLAYGCCEYENAVCCTGTKYCCPSDYPICDTEDGLCL 428
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 170/336 (50%), Positives = 226/336 (67%), Gaps = 11/336 (3%)
Query: 7 FLLSILLLSSLPLNYC-SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
F LS + +S NY +++ ++E W +H K Y+ +EK +R ++F+DN F+ +HNN
Sbjct: 17 FTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNN 76
Query: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL------RDVPA 119
N+++ L LN FAD+T++E++ + G + + +RR +S G+ +P
Sbjct: 77 NQNNTYKLGLNQFADMTNEEYRVMYFGTKSDA----KRRLMKTKSTGHRYAYSAGDRLPV 132
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179
+DWR KGAV +KDQ SCG+CWAFS +E INKIVTG VSLSEQEL+DCDR+YN G
Sbjct: 133 HVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEG 192
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMDYA++F+I+N GIDT+KDYPYRG G C+ K N +V IDG++DVP +E L
Sbjct: 193 CNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVPPYDENAL 252
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSW 299
+AV QPVS+ I S R QLY SG+FTG C TSLDH V++VGY SENGVDYW+++NSW
Sbjct: 253 KKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYGSENGVDYWLVRNSW 312
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTG 335
G WG +GY MQRN G CGI M ASYP K G
Sbjct: 313 GTGWGEDGYFKMQRNVRTPTGKCGITMEASYPVKNG 348
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 192/404 (47%), Positives = 249/404 (61%), Gaps = 23/404 (5%)
Query: 23 SDINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLN 76
++ +++ W +H S E ++R ++F DN FV HN + F L +N
Sbjct: 59 AEARAVYDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMN 118
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTE-VKDQ 135
FADLT+ EF+A++LG + A R + + G + +P S+DWR KGAV VK+Q
Sbjct: 119 RFADLTNDEFRAAYLGTTPAG--RGRHVGEAYRHDG-VEALPDSVDWRDKGAVVAPVKNQ 175
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIK 194
CG+CWAFSA A+EGINKIVTG LVSLSEQEL++C R+ NSGC GG+MD A+ F+ +
Sbjct: 176 GQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIAR 235
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
N G+DTE+DYPY G+CN K +R +V+IDG++DVPEN+E L +AV QPVSV I
Sbjct: 236 NGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDA 295
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
R FQLY SG+FTG C TSLDH V+ VGY D+ G DYW ++NSWG WG NGY+ M+
Sbjct: 296 GGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRME 355
Query: 313 RNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPT----------RCSLLTYCAAGETCC 362
RN G CGI M+ASYP K G NP PSP P P +C + C AG TCC
Sbjct: 356 RNVTARTGKCGIAMMASYPIKKGPNPKPSPSPAPAPLSPAPSPPQQCDRYSKCPAGTTCC 415
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
C I C+ W CC A CC DH CCP +YP+C++ C
Sbjct: 416 CNYGIRNHCIVWGCCPAKGATCCKDHSTCCPKDYPVCNAKARTC 459
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 185/396 (46%), Positives = 247/396 (62%), Gaps = 22/396 (5%)
Query: 29 FETWCKQHGKAYSSE--QEKQQRLKIFEDNYAFVTQHNNMGNSS--FTLSLNAFADLTHQ 84
++ W ++G + E ++R +F DN FV HN + F L +N FADLT++
Sbjct: 51 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110
Query: 85 EFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
EF+A+FLG A +R R A + + ++P S+DWR+KGAV VK+Q CG+CWA
Sbjct: 111 EFRATFLGAKVA----ERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
FSA +E IN++VTG +++LSEQEL++C NSGC GGLM A+ F+IKN GIDTE
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTED 226
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY+ G+C+ + N +V+IDG++DVP+N+EK L +AV QPVSV I R FQLY
Sbjct: 227 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SG+F+G C TSLDH V+ VGY ++NG DYWI++NSWG WG +GY+ M+RN + G C
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 346
Query: 323 GINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGETCCCGSSILGI 370
GI M+ASYPTK+G NPP P PT C C AG TCCC +
Sbjct: 347 GIAMMASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVCDDNFSCPAGSTCCCAFGFRNL 406
Query: 371 CLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
CL W CC A CC DH CCP +YP+C++ C
Sbjct: 407 CLVWGCCPVEGATCCKDHASCCPPDYPVCNTRAGTC 442
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 192/404 (47%), Positives = 249/404 (61%), Gaps = 23/404 (5%)
Query: 23 SDINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLN 76
++ +++ W +H S E ++R ++F DN FV HN + F L +N
Sbjct: 59 AEARAVYDLWVARHRHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMN 118
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTE-VKDQ 135
FADLT+ EF+A++LG + A R + + G + +P S+DWR KGAV VK+Q
Sbjct: 119 RFADLTNDEFRAAYLGTTPAG--RGRHVGEAYRHDG-VEVLPDSVDWRDKGAVVAPVKNQ 175
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIK 194
CG+CWAFSA A+EGINKIVTG LVSLSEQEL++C R+ NSGC GG+MD A+ F+ +
Sbjct: 176 GQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIAR 235
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
N G+DTE+DYPY G+CN K +R +V+IDG++DVPEN+E L +AV QPVSV I
Sbjct: 236 NGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDA 295
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
R FQLY SG+FTG C TSLDH V+ VGY D+ G DYW ++NSWG WG NGY+ M+
Sbjct: 296 GGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRME 355
Query: 313 RNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPT----------RCSLLTYCAAGETCC 362
RN G CGI M+ASYP K G NP PSP P P +C + C AG TCC
Sbjct: 356 RNVTARTGKCGIAMMASYPIKKGPNPKPSPSPAPAPPSPAPSPPQQCDRYSKCPAGTTCC 415
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
C I C+ W CC A CC DH CCP +YP+C++ C
Sbjct: 416 CNYGIRNHCIVWGCCPAKGATCCKDHSTCCPKDYPVCNAKARTC 459
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 171/307 (55%), Positives = 220/307 (71%), Gaps = 4/307 (1%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
ELFE+W +H KAY S +EK R +IF DN + + N SS+ L LN FADL+H+EF
Sbjct: 45 ELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDE-TNKKVSSYWLGLNEFADLSHEEF 103
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K+ +LG ++ R+R++ S G++ D+P S+DWR KGAVT VK+Q SCG+CWAFS
Sbjct: 104 KSKYLGLR---VEFPRKRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFST 160
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EGIN+IVTG+L SLSEQELIDCDRS+N+GC GGLMDYA+Q+++ N G+ E+DYPY
Sbjct: 161 VAAVEGINQIVTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPY 220
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ G+C ++K +VTI GY+DVP N+E+ LL+A+ QPVSV I S R FQ Y GI
Sbjct: 221 LMEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGI 280
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
FTG C T +DH V VGY S G DY I+KNSWG WG NGY+ M+RNTG G+CGIN
Sbjct: 281 FTGRCGTQMDHGVTAVGYGSSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQ 340
Query: 327 LASYPTK 333
+ASYPTK
Sbjct: 341 MASYPTK 347
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 187/425 (44%), Positives = 253/425 (59%), Gaps = 50/425 (11%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQEF 86
++ W ++G++Y++ E+++R ++F DN FV HN + F L +N FADLT+ EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASC-------- 138
+A+FLG A ++ R + G + ++P S+DWR+KGAV VK+Q C
Sbjct: 109 RATFLG--AKFVERSRAAGERYRHDG-VEELPESVDWREKGAVAPVKNQGQCVDRIIVWN 165
Query: 139 ------------------------GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
G+CWAFSA +E IN++VTG +++LSEQEL++C
Sbjct: 166 SMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECST 225
Query: 175 S-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
+ NSGC GGLMD A+ F+IKN GIDTE DYPY+ G+C+ + N +V+IDG++DVP+
Sbjct: 226 NGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQ 285
Query: 234 NNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYW 293
N+EK L +AV QPVSV I R FQLY SG+F+G C TSLDH V+ VGY ++NG DYW
Sbjct: 286 NDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYW 345
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR----- 348
I++NSWG WG +GY+ M+RN + G CGI M+ASYPTK+G NPP P PT
Sbjct: 346 IVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYPTKSGANPPKPSPTPPTPPTPPP 405
Query: 349 -------CSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDS 401
C C AG TCCC +CL W CC A CC DH CCP YPIC++
Sbjct: 406 PAAPDHVCDDNFSCPAGSTCCCAFGFRNLCLVWGCCPVEGATCCKDHASCCPPEYPICNT 465
Query: 402 VRHQC 406
C
Sbjct: 466 RAGTC 470
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 364 bits (935), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 175/317 (55%), Positives = 225/317 (70%), Gaps = 12/317 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+++W QHGKAY+ E+++R +IF+DN F+ +HN+ N+++ L LN FADLT+QE++
Sbjct: 45 LYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYR 104
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNL------RDVPASIDWRKKGAVTEVKDQASCGAC 141
A FLG RRR + P + ++P S++WR GAV+ VKDQ SCG+C
Sbjct: 105 AKFLGTRTDP----RRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSCGSC 160
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA A+EGINKIV+G L+SLSEQEL+DCDRSY++GC GGLMDYA+QF+I N GIDTE
Sbjct: 161 WAFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGIDTE 220
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
KDYPY G QC+ K N +V+IDGY+DVP NNE L +AV QPVS+ I RAFQL
Sbjct: 221 KDYPYLGFNNQCDPTKKNAKVVSIDGYEDVP-NNENALKKAVAHQPVSIAIEAGGRAFQL 279
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y SG+F G C +LDH V+ VGY S +NG DYWI++NSWG +WG NGY+ M+RN + G
Sbjct: 280 YESGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYIRMERNINANTG 339
Query: 321 ICGINMLASYPTKTGQN 337
CGI M ASYP K G N
Sbjct: 340 KCGIAMEASYPVKNGAN 356
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 197/402 (49%), Positives = 253/402 (62%), Gaps = 15/402 (3%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS----SFTLSLNAFADLTHQ 84
++W +H K Y++ EK++R IF DN F+ QHNN N F L LN FADLT+
Sbjct: 5 LQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTND 64
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EF+ + G + + G+ ++P S+DWRKKGAV+ VKDQ CG+CWAF
Sbjct: 65 EFRRIYFGVKRPEKAESVKSDRYAVKEGD--ELPESVDWRKKGAVSHVKDQGQCGSCWAF 122
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
SA GA+EGINKIVTG L++LSEQEL+DCD SYNSGC GGLMDYA++F+I N GIDT+KDY
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PY+ G C+ + N +VTIDG +DVP NNEK L +AV QPV + I R FQLY S
Sbjct: 183 PYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKS 242
Query: 265 GIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
G+FTG C TSLDH V+ VGY +++G DYWI++NSWG WG +GY+ M+RNT + G CG
Sbjct: 243 GVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSGKCG 302
Query: 324 INMLASYPTKT-------GQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKC 376
I + SYP KT G +PP PP C + C + TCCC C W C
Sbjct: 303 IAIEPSYPVKTSPNPPNPGPSPPSPPPAPKVVCDSYSSCPSATTCCCVYEYGPYCYMWGC 362
Query: 377 CGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
C +A CC D CCP +YP+C++ + C + S FTVK
Sbjct: 363 CPLEAASCCDDDSSCCPHDYPVCNTQQGTC-SKSKNNPFTVK 403
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 191/407 (46%), Positives = 250/407 (61%), Gaps = 23/407 (5%)
Query: 23 SDINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLN 76
++ +++ W +H S E ++R ++F DN FV HN + F L +N
Sbjct: 60 AEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMN 119
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAV-TEVKDQ 135
FADLT+ EF+A++LG + A R + + +P S+DWR KGAV + VK+Q
Sbjct: 120 RFADLTNDEFRAAYLGTTPAGRG---RHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQ 176
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIK 194
CG+CWAFSA A+EGINKIVTG LVSLSEQEL++C R+ NSGC GG+MD A+ F+ +
Sbjct: 177 GQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITR 236
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
N G+DTE+DYPY G+C+ K +R +V+IDG++DVPEN+E L +AV QPVSV I
Sbjct: 237 NGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDA 296
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
R FQLY SG+FTG C TSLDH V+ VGY D+ G DYW ++NSWG WG NGY+ M+
Sbjct: 297 GGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRME 356
Query: 313 RNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPT----------RCSLLTYCAAGETCC 362
RN G CGI M+ASYP K G NP PSP P P+ +C + C AG TCC
Sbjct: 357 RNVTARTGKCGIAMMASYPIKKGPNPKPSPSPKPSPPSPAPSPPQQCDRYSKCPAGTTCC 416
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
C I C+ W CC A CC DH CCP +YP+C++ C V
Sbjct: 417 CNYGIRNHCIVWGCCPVEGATCCKDHSTCCPKDYPVCNAKARTCSKV 463
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 170/307 (55%), Positives = 219/307 (71%), Gaps = 4/307 (1%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
ELFE+W +H K Y S +EK R +IF DN + + N SS+ L LN FADL+H+EF
Sbjct: 45 ELFESWMSKHSKTYRSIEEKLHRFEIFLDNLKHIDE-TNKKVSSYWLGLNEFADLSHEEF 103
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K+ +LG ++ R+R++ S G++ D+P S+DWR KGAVT VK+Q SCG+CWAFS
Sbjct: 104 KSKYLGLR---VEFPRKRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFST 160
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EGIN+IVTG+L SLSEQELIDCDRS+N+GC GGLMDYA+Q+++ N G+ E+DYPY
Sbjct: 161 VAAVEGINQIVTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPY 220
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ G+C ++K +VTI GY+DVP N+E+ LL+A+ QPVSV I S R FQ Y GI
Sbjct: 221 LMEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGI 280
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
FTG C T +DH V VGY S G DY I+KNSWG WG NGY+ M+RNTG G+CGIN
Sbjct: 281 FTGRCGTQMDHGVTAVGYGSSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQ 340
Query: 327 LASYPTK 333
+ASYPTK
Sbjct: 341 MASYPTK 347
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 179/380 (47%), Positives = 234/380 (61%), Gaps = 40/380 (10%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++E W +HGK+Y++ E+++R +IF+DN F+ +HN + N ++ + F+
Sbjct: 3 VYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVG-------DRYSFR 54
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A D+P S+DWR+KGAV VKDQ +CG+CWAFS
Sbjct: 55 AG-------------------------EDLPESVDWREKGAVVPVKDQGNCGSCWAFSTI 89
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EGIN+I TG L+SLSEQEL+DCD+SYN GC GGLMDYA++F+I N GID+E+DYPYR
Sbjct: 90 AAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYR 149
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
C+ + N +V+IDGY+DVP+N+E+ L +AV QPVSV I RAFQLY SG+F
Sbjct: 150 AADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVF 209
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN-TGNSLGICGINM 326
TG C T LDH V+ VGY +EN VDYWI++NSWG +WG +GY+ ++RN G G CGI +
Sbjct: 210 TGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGIAI 269
Query: 327 LASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380
SYP K GQNPP P P+ C C TCCC G C W CC
Sbjct: 270 EPSYPIKNGQNPPNPGPSPPSPSKPSVVCDEYYTCPEESTCCCIYEYAGFCFEWGCCPLE 329
Query: 381 SAVCCSDHRYCCPSNYPICD 400
A CC DH CCP YP+CD
Sbjct: 330 GATCCDDHYSCCPHEYPVCD 349
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 177/344 (51%), Positives = 236/344 (68%), Gaps = 23/344 (6%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ +++ W +HGKAY+ EK++R +IF+DN F+ +HN N ++ + LN FADLT+
Sbjct: 41 EVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQ-NRTYKVGLNRFADLTN 99
Query: 84 QEFKASFLGFSAASIDHDRR----RNASVQ---SPGNLRDVPASIDWRKKGAVTEVKDQA 136
+E++A +LG + D RR +NAS + PG + +P S+DWR+ GAV VKDQ
Sbjct: 100 EEYRAIYLGTRS---DPKRRFAKLKNASPRYAVMPGEV--LPESVDWRETGAVNPVKDQR 154
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
SCG+CWAFS A+EGIN+IVTG L+SLSEQEL+DCD Y+ GC GGLMDYA+ F+IKN
Sbjct: 155 SCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNG 214
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
G+DTEKDYPY G G+CN + +V+IDGY+DVP +EK L +AV QPVSV +
Sbjct: 215 GLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGG 274
Query: 257 RAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
RA QLY SGIFTG C T+LDH ++ VGY +ENG DYWI++NSWG SWG NGY+ M+RN
Sbjct: 275 RALQLYVSGIFTGECGTALDHGIVAVGYGTENGTDYWIVRNSWGSSWGENGYIRMERNMA 334
Query: 317 NSL-GICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGE 359
++ G CGI M ASYP K G+NP + L++ AGE
Sbjct: 335 DAFSGKCGIAMEASYPIKNGENPSK---------TYLSFGTAGE 369
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 181/398 (45%), Positives = 243/398 (61%), Gaps = 19/398 (4%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQE 85
E F+ W + +AY+S +E ++R ++ DN FV ++N G++S LS+ +ADL+ E
Sbjct: 37 REAFDFWVQTLKRAYASAEEYERRFDVWLDNLRFVHEYN-AGHTSHWLSMGVYADLSQDE 95
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
+++ LG++A + R A G + P +DW KGAVT VK+Q CG+CWAFS
Sbjct: 96 YRSKALGYNADLHEERPLRAAPFLYEGTVP--PKEVDWVAKGAVTPVKNQLLCGSCWAFS 153
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
TGA+EG + I TG L SLSEQ L+DCDR ++GC GGLMD+A++F++KN GIDTE DYP
Sbjct: 154 TTGAVEGASAIATGKLASLSEQMLVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYP 213
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y + G C K+ RH+VTID Y+DVP N+E L++AV QPVSV I +RAFQLY G
Sbjct: 214 YTAEEGMCQDNKMRRHVVTIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGG 273
Query: 266 IFTGPCSTSLDHAVLIVGY-DSENG---VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
+F C T+LDH VL+VGY + NG + YW++KNSWG WG GY+ + RN G G
Sbjct: 274 VFDAECGTALDHGVLVVGYGTASNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGEE-GQ 332
Query: 322 CGINMLASYPTKTGQN-----------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGI 370
CG+ M AS+P K G N P P P P C T C TCCC G
Sbjct: 333 CGVAMQASFPIKKGANPPEPPPTPPGPGPEPPEPQPVSCDDTTQCPPDNTCCCMREFFGF 392
Query: 371 CLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408
C +W CC A CC D ++CCP + P+CD+V +CL
Sbjct: 393 CFTWACCPLPKATCCDDQQHCCPEDLPVCDTVAGRCLA 430
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 169/327 (51%), Positives = 226/327 (69%), Gaps = 17/327 (5%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ +E W +HGK Y++ EK+ R +IF DN F+ +HN GN S+ + LN FADLT+
Sbjct: 31 EVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGLNQFADLTN 90
Query: 84 QEFKASFLGFSAASIDHDRR----------RNASVQSPGNLRDVPASIDWRKKGAVTEVK 133
+E+++ +LG +D RR R +VQ PA +DWR++GAV+ VK
Sbjct: 91 EEYRSMYLG---TKVDPYRRIAKMQRGEISRRYAVQENEMF---PAKVDWRERGAVSPVK 144
Query: 134 DQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVI 193
+Q CG+CWAFS ++EGINKIVTG L+SLSEQEL+DCD YNSGC GG MDYA+QF++
Sbjct: 145 NQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQFIV 204
Query: 194 KNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGIC 253
N GID+E DYPY+G C+ + IV+IDGY+DVP NEK L++AV QPVSVGI
Sbjct: 205 SNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSVGIE 264
Query: 254 GSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQR 313
S RAFQLY+SG+ TG C T+LDH V++VGY SENG DYWI++NSWG WG +GY+ M+R
Sbjct: 265 ASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGSENGKDYWIVRNSWGPEWGEDGYIRMER 324
Query: 314 NTGNS-LGICGINMLASYPTKTGQNPP 339
N ++ +G+CGI ++ASYP K G P
Sbjct: 325 NMVDTPVGMCGITLMASYPIKYGNKNP 351
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 196/413 (47%), Positives = 251/413 (60%), Gaps = 33/413 (7%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSF--TLSLNAFADLT 82
I E+F+ W +H K Y E ++R + F+ N ++ + ++ ++ LN FADL+
Sbjct: 46 IIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFADLS 105
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLR--DVPASIDWRKKGAVTEVKDQASCGA 140
++EFK +L I+ +R A NL+ D P+S+DWRKKG VT VKDQ CG+
Sbjct: 106 NEEFKELYLSKVKKPINI-KRSTARDWRQRNLQTCDAPSSLDWRKKGVVTAVKDQGDCGS 164
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CW+FS TGAIEGIN IVTG L+SLSEQEL+DCD + N GC GG MDYA+++VI N GIDT
Sbjct: 165 CWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT-NYGCEGGYMDYAFEWVINNGGIDT 223
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E +YPY G G CN K +V+IDGY DV E + LL A V QP+SVG+ GS FQ
Sbjct: 224 EANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETD-SALLCATVQQPISVGMDGSALDFQ 282
Query: 261 LYSSGIFTGPCS---TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
LY+ GI+ G CS +DHAVLIVGY SENG DYWI+KNSWG WGM GY +++RNT
Sbjct: 283 LYTGGIYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSWGTEWGMEGYFYIKRNTDL 342
Query: 318 SLGICGINMLASYPTKTGQNPPPSPPPGPTR-----------------------CSLLTY 354
G+C IN ASYPTK +P P+ PP P C Y
Sbjct: 343 PYGVCAINAEASYPTKESSSPSPTSPPSPPSPLSPPPPPPPTPVPPPPCPQPSDCGDFAY 402
Query: 355 CAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
C + ETCCC + C+ + CC + +AVCC+D YCCPS+YPICD CL
Sbjct: 403 CPSDETCCCILKVFDYCIVYGCCQYENAVCCADSVYCCPSDYPICDVEEGLCL 455
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 183/392 (46%), Positives = 250/392 (63%), Gaps = 17/392 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLTHQEFK 87
++ W ++G++Y++ E ++R ++F DN F HN + F L +N FADLT++EF+
Sbjct: 53 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 112
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A+FLG A ++ R + G + ++P S+DWR+KGAV VK+Q CG+CWAFSA
Sbjct: 113 ATFLG--AKVVERSRAAGERYRHDG-VEELPESVDWREKGAVAPVKNQGQCGSCWAFSAV 169
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
+E IN++VTG +++LSEQEL++C N GC GGLMD A+ F+IKN GIDTE DYPY
Sbjct: 170 STVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTEDDYPY 229
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ G+C+ + N +V+IDG++DVP+N+EK L +AV QPVSV I R FQLY SG+
Sbjct: 230 KAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGV 289
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
F+G C TSLDH V+ VGY ++NG DYWI++NSWG WG +GY+ M+RN + G CGI M
Sbjct: 290 FSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAM 349
Query: 327 LASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGETCCCGSSILGICLSW 374
+ASYPTK+G NPP P PT C C G TCCC +CL W
Sbjct: 350 MASYPTKSGANPPKPSPTPPTPPTPPPPSATDHVCDDNFSCPVGSTCCCAFGFRNLCLVW 409
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
CC A CC DH CCP +YP+C++ C
Sbjct: 410 GCCPVEGATCCKDHASCCPPDYPVCNTRAGTC 441
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 173/317 (54%), Positives = 223/317 (70%), Gaps = 12/317 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+++W QHGKAY+ E+++R +IF+DN F+ +HN+ N+++ L LN FADLT+QE++
Sbjct: 44 LYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYR 103
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNL------RDVPASIDWRKKGAVTEVKDQASCGAC 141
A FLG RRR + P + ++P S+DWR GAV+ VKDQ SCG+C
Sbjct: 104 AKFLGTRTDP----RRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGSCGSC 159
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS +EGINKIV+G LVSLSEQEL+DCDRSY++GC GGLMDYA+QF++ N GIDTE
Sbjct: 160 WAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDNGGIDTE 219
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
KDYPY G QC+ K N +V+IDGY+DVP NNE L +AV QPVS+ I RAFQL
Sbjct: 220 KDYPYLGFNNQCDPTKKNAKVVSIDGYEDVP-NNENALKKAVAHQPVSIAIEAGGRAFQL 278
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y SG+F G C +LDH V+ VGY + +NG DYWI++NSWG +WG NGY+ M+RN + G
Sbjct: 279 YESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIVRNSWGSNWGENGYIRMERNINANTG 338
Query: 321 ICGINMLASYPTKTGQN 337
CGI M ASYP K G N
Sbjct: 339 KCGIAMEASYPVKNGAN 355
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 170/341 (49%), Positives = 234/341 (68%), Gaps = 5/341 (1%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L+FF LSI S+L ++ E+++ W +HGKAY+ E+++R +IF++N F+ H
Sbjct: 11 LSFFFLSISA-SALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKFIDDH 69
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ--SPGNLRDVPASI 121
N+ N ++ + LN FADLT++E++A +LG + + + + + NL +P S+
Sbjct: 70 NSE-NRTYKVGLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNNLDRLPESM 128
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWR +GAV VK+Q SCG+CWAFS A+EGIN+IVTG L+SLSEQEL+ CD+ YNSGC
Sbjct: 129 DWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCN 188
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMDYA+QF+I N G+DTE+DYPY GQC+ + N +V+ID Y+DVP N+E+ L +
Sbjct: 189 GGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPANDEESLKK 248
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
AV QPVSV I S A QLY SG+FTG C ++LDH V+ VGY ENGVDYW+++NSWG
Sbjct: 249 AVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGKENGVDYWLVRNSWGT 308
Query: 302 SWGMNGYMHMQRNTGN-SLGICGINMLASYPTKTGQNPPPS 341
SWG +GY ++RN + + G CGI M ASYP K NP S
Sbjct: 309 SWGEDGYFKLERNVKHITEGKCGIAMQASYPVKNDNNPTKS 349
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 168/316 (53%), Positives = 224/316 (70%), Gaps = 12/316 (3%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ +E W +HG+AY++ EK++R +IF+DN F+ HNN GN ++ + LN FADLT++
Sbjct: 46 VKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNE 105
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL------RDVPASIDWRKKGAVTEVKDQASC 138
E++ +LG + + RRR ++P +P S+DWRK+GAV +K+Q SC
Sbjct: 106 EYRTMYLGTKSDA----RRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSC 161
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFS A+EGIN+IVTG +++LSEQEL+DCDR NSGC GGLMDYA++F+I N G+
Sbjct: 162 GSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGM 221
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
DTEK YPYRG G+C+ + N +V+IDGY+DVP NE+ L +AV QPV V I S RA
Sbjct: 222 DTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEASGRA 280
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
FQLYSSG+FTG C +DH V++VGY SE+GVDYWI++NSWG WG NGY+ M+RN S
Sbjct: 281 FQLYSSGVFTGECGEEVDHGVVVVGYGSEDGVDYWIVRNSWGTKWGENGYVKMERNVKKS 340
Query: 319 -LGICGINMLASYPTK 333
LG CGI ASYPTK
Sbjct: 341 HLGKCGIMTEASYPTK 356
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 361 bits (926), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 173/325 (53%), Positives = 222/325 (68%), Gaps = 4/325 (1%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ SS L + ELFE+W +HGK Y S +EK R IF+DN + + N +
Sbjct: 27 FSIVGYSSEDLKSMDKLIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKV-V 85
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
S++ L LN FADL+HQEFK +LG +D+ RRR + + ++P S+DWRKKGA
Sbjct: 86 SNYWLGLNEFADLSHQEFKNKYLGLK---VDYSRRRESPEEFTYKDFELPKSVDWRKKGA 142
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT+VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQELIDCDR+YN+GC GGLMDYA
Sbjct: 143 VTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYA 202
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+ F+++N G+ E+DYPY + G C K +VTI GY DVP+NNE+ LL+A+V QP+
Sbjct: 203 FSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPL 262
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SV I S R FQ YS G+F G C + LDH V VGY + GV+Y I+KNSWG WG GY
Sbjct: 263 SVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTSKGVNYIIVKNSWGSKWGEKGY 322
Query: 309 MHMQRNTGNSLGICGINMLASYPTK 333
+ M+RN G GICGI +ASYPTK
Sbjct: 323 IRMRRNIGKPEGICGIYKMASYPTK 347
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 361 bits (926), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 199/410 (48%), Positives = 255/410 (62%), Gaps = 31/410 (7%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG----NSSFTLSLNAFADL 81
ELFE W ++H K Y+ EK +R F N AFV + N G +S + +N FADL
Sbjct: 48 QELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADL 107
Query: 82 THQEFKASF----LGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQAS 137
+++EF+ + L AA RRR + D PAS+DWRK+GAVT VK+Q
Sbjct: 108 SNEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGC-DAPASLDWRKRGAVTAVKNQGD 166
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS+TGA+EGIN I TG L+SLSEQEL+DCD + N GC GG MDYA+++VI N G
Sbjct: 167 CGSCWAFSSTGAMEGINAITTGELISLSEQELVDCDTT-NEGCDGGYMDYAFEWVINNGG 225
Query: 198 IDTEKDYPYRGQAGQ-CNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
ID+E +YPY GQA CN K +V+IDGY+DV +E LL A V QPVSVGI GS
Sbjct: 226 IDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVA-TSESALLCAAVQQPVSVGIDGSS 284
Query: 257 RAFQLYSSGIFTGPCS---TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQR 313
FQLY+ GI+ G CS +DHAVL+VGY + G DYWI+KNSWG WGM GY++++R
Sbjct: 285 LDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGGTDYWIVKNSWGTDWGMQGYIYIRR 344
Query: 314 NTGNSLGICGINMLASYPTK----------------TGQNPPPSPPPGPTRCSLLTYCAA 357
NTG G+C I+ +ASYPTK + PP P P P++C +YC +
Sbjct: 345 NTGLPYGVCAIDAMASYPTKQFAPAATPPSPAPPPPSPPPPPTPPSPSPSQCGDYSYCPS 404
Query: 358 GETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
ETCCC + G CL + CC + +AVCC+ YCCP +YPICD CL
Sbjct: 405 DETCCCLVELGGFCLIYGCCAYQNAVCCTGTVYCCPQDYPICDVPDGLCL 454
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 360 bits (925), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 172/325 (52%), Positives = 221/325 (68%), Gaps = 4/325 (1%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ SS L + ELFE+W +HGK Y S +EK R +IF+DN + + N +
Sbjct: 28 FSIVGYSSEDLKSMDKLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKV-V 86
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
S++ L LN FADL+HQEFK +LG +D+ RRR + + ++P S+DWRKKGA
Sbjct: 87 SNYWLGLNEFADLSHQEFKNKYLGLK---VDYSRRRESPEEFTYKDVELPKSVDWRKKGA 143
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT+VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQELIDCDR+YN+GC GGLMDYA
Sbjct: 144 VTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYA 203
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+ F+++N G+ E+DYPY + G C K +VTI GY DVP+NNE+ LL+A+ QP+
Sbjct: 204 FSFIVENDGLHKEEDYPYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPL 263
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SV I S R FQ YS G+F G C + LDH V VGY + GVDY +KNSWG WG GY
Sbjct: 264 SVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYITVKNSWGSKWGEKGY 323
Query: 309 MHMQRNTGNSLGICGINMLASYPTK 333
+ M+RN G GICGI +ASYPTK
Sbjct: 324 IRMRRNIGKPEGICGIYKMASYPTK 348
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 360 bits (925), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 178/370 (48%), Positives = 240/370 (64%), Gaps = 19/370 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNY-------CSDINELFETWCKQHGKAYSSEQEKQQRLKIF 53
M L FFL L+ SL L+ ++ ++E W +H K Y+ +EK QR +IF
Sbjct: 4 MTILPFFLFFSLITFSLALDIQLPTGRSNDEVMTMYEEWLVKHQKVYNGLREKDQRFQIF 63
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGN 113
+DN F+ +HN N ++ + LN FAD+T++E++ +LG + + I +RR + G+
Sbjct: 64 KDNLNFIDEHNAQ-NYTYIVGLNKFADMTNEEYRDMYLG-TRSDI---KRRIMKNKITGH 118
Query: 114 L------RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
+P +DWR KGA+T +KDQ SCG+CWAFS +E INKIVTG LVSLSEQ
Sbjct: 119 RYAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQ 178
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
EL+DCDR++N GC GGLMDYA++F+I N GIDT++ YPY+G G+C+ + IV+IDG
Sbjct: 179 ELVDCDRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDG 238
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE 287
Y+DVP NNE L +AV QPVSV I S RA QLY SG+FTG C TSLDHAV+IVGY SE
Sbjct: 239 YEDVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGSE 298
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNT-GNSLGICGINMLASYPTKTGQNPPPSPPPGP 346
NG+DYW+++NSWG +WG +GY M+RN G G CGI + ASYP K G+N +
Sbjct: 299 NGLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPVKYGKNSAVTTNSAY 358
Query: 347 TRCSLLTYCA 356
+ +L A
Sbjct: 359 EKTEVLVSSA 368
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 360 bits (925), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 205/449 (45%), Positives = 267/449 (59%), Gaps = 47/449 (10%)
Query: 3 SLAFFLLSIL--LLSSLPLNYC---------SDINELFETWCKQHGKAYSSEQEKQQRLK 51
+L F+ + L L SSLP + + ELF W ++H + Y +E +R +
Sbjct: 9 ALVLFIWASLACLSSSLPTEFYITGEEFASEERVRELFHLWKERHKRVYKHAEETAKRFE 68
Query: 52 IFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR--RNASVQ 109
IF++N +V + N+ G+ TL +N FAD++++EFK +L I+ R + Q
Sbjct: 69 IFKENLKYVIERNSKGHRH-TLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRSMQQ 127
Query: 110 SPGNLR-DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQE 168
G + P+S+DWRKKG VT +KDQ CG+CWAFS+TGA+EGIN IVTG L+SLSEQE
Sbjct: 128 KKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLISLSEQE 187
Query: 169 LIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
L+DCD + N GC GG MDYA+++VI N GID+E DYPY G G CN K + +V+IDGY
Sbjct: 188 LVDCDTT-NYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTKVVSIDGY 246
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTG---PCSTSLDHAVLIVGYD 285
KDV E++ LL A V QP+SVG+ GS FQLY+SGI+ G +DHAVLIVGY
Sbjct: 247 KDVDESD-SALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHAVLIVGYG 305
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK------------ 333
SE+ DYWI KNSWG SWGM GY +++RNT G C IN +ASYPTK
Sbjct: 306 SEDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKESSSPSPYPSPA 365
Query: 334 ---------------TGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCG 378
PPPSP P P+ C +YC + ETCCC CL + CC
Sbjct: 366 VPPPPPPPPSPPPPPPPSPPPPSPGPSPSECGDFSYCPSDETCCCIYEFYDFCLIYGCCE 425
Query: 379 FSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
+ +AVCC+ YCCPS+YPICD CL
Sbjct: 426 YENAVCCTGTEYCCPSDYPICDVEEGLCL 454
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 360 bits (924), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 176/363 (48%), Positives = 235/363 (64%), Gaps = 23/363 (6%)
Query: 7 FLLSILLLSSLPLNYCS----------DINELFETWCKQHGKAYSSEQEKQQRLKIFEDN 56
L+ LLL S ++ + ++ +++E W +H K Y+ EK++R ++F+DN
Sbjct: 4 MLIPTLLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDN 63
Query: 57 YAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL-- 114
F+ HN N+++TL LN FAD+T++E++A +LG + +RR Q+ G+
Sbjct: 64 LGFIQDHNAQ-NNTYTLGLNKFADITNEEYRAMYLGTRTDA----KRRVMKTQNTGHRYA 118
Query: 115 ----RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
+P +DWR KGAV +KDQ +CG+CWAFS A+EGIN IVTG VSLSEQEL+
Sbjct: 119 YNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELV 178
Query: 171 DCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
DCDR Y+ GC GGLMDYA+QF+I+N GIDTE+DYPY+G G C++ K +V IDGY+D
Sbjct: 179 DCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYED 238
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGV 290
VP NNE L +AV QPVSV I S RA QLY SG+FTG C T+LDH V++VGY +ENGV
Sbjct: 239 VPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGTENGV 298
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNT-GNSLGICGINMLASYPTKTGQNPP-PSPPPGPTR 348
DYW+++NSWG WG +GY M+RN S G CGI M SYP K G N PS T
Sbjct: 299 DYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVPSSVYESTE 358
Query: 349 CSL 351
S+
Sbjct: 359 ASI 361
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 360 bits (924), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 170/325 (52%), Positives = 227/325 (69%), Gaps = 4/325 (1%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L + ELFE+W HGKAY+S +EK R ++F++N + Q N
Sbjct: 27 FSIVGYSPEHLTSVDKLVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKE-V 85
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
+S+ L LN FADL+H+EFK+ FLG + R++++ S ++ D+P SIDWRKKGA
Sbjct: 86 TSYWLGLNEFADLSHEEFKSKFLGLYP---EFPRKKSSEDFSYRDVVDLPKSIDWRKKGA 142
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT VK+Q SCG+CWAFS A+EGIN+IV G+L SLSEQ+LIDCD S+N+GC GGLMDYA
Sbjct: 143 VTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNLTSLSEQQLIDCDTSFNNGCNGGLMDYA 202
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
++F++ N G+ E+DYPY + G C++++ +VTI GY DVP N+E+ LL+A+ QP+
Sbjct: 203 FEFIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPL 262
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SV I S R FQ YS G+F+GPC T LDH V VGY S +G+DY I+KNSWG WG GY
Sbjct: 263 SVAIDASGRDFQFYSGGVFSGPCGTDLDHGVAAVGYGSSSGIDYIIVKNSWGPKWGERGY 322
Query: 309 MHMQRNTGNSLGICGINMLASYPTK 333
+ M+RNTG G+CGIN +ASYPTK
Sbjct: 323 LRMKRNTGKPEGLCGINKMASYPTK 347
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 360 bits (924), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 167/316 (52%), Positives = 224/316 (70%), Gaps = 12/316 (3%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ +E W +HG+AY++ EK++R +IF+DN F+ +HNN GN ++ + LN FADLT++
Sbjct: 46 VKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADLTNE 105
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL------RDVPASIDWRKKGAVTEVKDQASC 138
E++ +LG + + RRR ++P +P S+DWRK+GAV +K+Q SC
Sbjct: 106 EYRTMYLGTKSDA----RRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSC 161
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFS A+ GIN+IVTG +++LSEQEL+DCDR NSGC GGLMDYA++F+I N G+
Sbjct: 162 GSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGM 221
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
DTEK YPYRG G+C+ + N +V+IDGY+DVP NE+ L +AV QPV V I S RA
Sbjct: 222 DTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEASGRA 280
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
FQLYSSG+FTG C +DH V++VGY SE+GVDYWI++NSWG WG NGY+ M+RN S
Sbjct: 281 FQLYSSGVFTGECGEEVDHGVVVVGYGSEDGVDYWIVRNSWGTKWGENGYVKMERNVKKS 340
Query: 319 -LGICGINMLASYPTK 333
LG CGI ASYPTK
Sbjct: 341 HLGKCGIMTEASYPTK 356
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 360 bits (924), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 176/363 (48%), Positives = 235/363 (64%), Gaps = 23/363 (6%)
Query: 7 FLLSILLLSSLPLNYCS----------DINELFETWCKQHGKAYSSEQEKQQRLKIFEDN 56
L+ LLL S ++ + ++ +++E W +H K Y+ EK++R ++F+DN
Sbjct: 4 MLIPTLLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDN 63
Query: 57 YAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL-- 114
F+ HN N+++TL LN FAD+T++E++A +LG + +RR Q+ G+
Sbjct: 64 LGFIQDHNAQ-NNTYTLGLNKFADITNKEYRAMYLGTRTDA----KRRVMKTQNTGHRYA 118
Query: 115 ----RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
+P +DWR KGAV +KDQ +CG+CWAFS A+EGIN IVTG VSLSEQEL+
Sbjct: 119 YNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELV 178
Query: 171 DCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
DCDR Y+ GC GGLMDYA+QF+I+N GIDTE+DYPY+G G C++ K +V IDGY+D
Sbjct: 179 DCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYED 238
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGV 290
VP NNE L +AV QPVSV I S RA QLY SG+FTG C T+LDH V++VGY +ENGV
Sbjct: 239 VPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGTENGV 298
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNT-GNSLGICGINMLASYPTKTGQNPP-PSPPPGPTR 348
DYW+++NSWG WG +GY M+RN S G CGI M SYP K G N PS T
Sbjct: 299 DYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVPSSVYESTE 358
Query: 349 CSL 351
S+
Sbjct: 359 ASI 361
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 167/324 (51%), Positives = 226/324 (69%), Gaps = 16/324 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++ W +HGKAY+ E+++R +IF+DN FV +HN+ N S+ + LN FADLT++E++
Sbjct: 46 IYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE-NRSYKVGLNRFADLTNEEYR 104
Query: 88 ASFLGFSAASIDHDRR--------RNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
+ FLG D RR R +VQ L P S+DWR+ GAV +KDQ SCG
Sbjct: 105 SMFLG---TKTDSKRRFMKSKSASRRYAVQDSDML---PESVDWRESGAVAPIKDQGSCG 158
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS A+EG+N+I TG ++ LSEQEL+DCDR+Y++GC GGLMDYA++F+I N GID
Sbjct: 159 SCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIINNGGID 218
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TE+DYPYRG G C+ ++ N +V+I+ Y+DVP +E L +AV QPVSV I S RAF
Sbjct: 219 TEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAF 278
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SG+FTG C +LDH V++VGY ++NG D+WI++NSWG SWG NGY+ M+RN ++
Sbjct: 279 QLYLSGVFTGECGRALDHGVVVVGYGTDNGADHWIVRNSWGTSWGENGYIRMERNVVDNF 338
Query: 320 -GICGINMLASYPTKTGQNPPPSP 342
G CGI M ASYP K G+NP P
Sbjct: 339 GGKCGIAMQASYPIKNGENPANKP 362
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 176/332 (53%), Positives = 224/332 (67%), Gaps = 5/332 (1%)
Query: 3 SLAFFL-LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
SLAF SI+ SS L + ELFE+W +HGK Y S +EK R +IF+DN +
Sbjct: 20 SLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHID 79
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
+ N + S++ L LN FADL+HQEFK +LG +D+ RRR + + ++P S+
Sbjct: 80 ERNKV-VSNYWLGLNEFADLSHQEFKNKYLGLK---VDYSRRRESPEEFTYKDVELPKSV 135
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWRKKGAV VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQELIDCDR+YN+GC
Sbjct: 136 DWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCN 195
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMDYA+ F+++N G+ E+DYPY + G C K +VTI GY DVP+NNE+ LL+
Sbjct: 196 GGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLK 255
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
A+ QP+SV I S R FQ YS G+F G C + LDH V VGY + GVDY I+KNSWG
Sbjct: 256 ALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYIIVKNSWGS 315
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
WG GY+ M+RN G GICGI +ASYPTK
Sbjct: 316 KWGEKGYIRMRRNIGKPEGICGIYKMASYPTK 347
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 358 bits (919), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 187/372 (50%), Positives = 238/372 (63%), Gaps = 13/372 (3%)
Query: 45 EKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQEFKASFLGFSAASIDHDR 102
E ++R ++F DN FV HN + F L +N FADLT+ EF+A++LG + A R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAG--RGR 141
Query: 103 RRNASVQSPGNLRDVPASIDWRKKGAVTE-VKDQASCGACWAFSATGAIEGINKIVTGSL 161
R + + G + +P S+DWR KGAV VK+Q CG+CWAFSA A+EGINKIVTG L
Sbjct: 142 RVGEAYRHDG-VEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200
Query: 162 VSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
VSLSEQEL++C R+ NSGC GG+MD A+ F+ +N G+DTE+DYPY G+CN K +R
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSR 260
Query: 221 HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVL 280
+V+IDG++DVPEN+E L +AV QPVSV I R FQLY SG+FTG C T+LDH V+
Sbjct: 261 KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVV 320
Query: 281 IVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
VGY D+ G YW ++NSWG WG NGY+ M+RN G CGI M+ASYP K G NP
Sbjct: 321 AVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKKGPNP 380
Query: 339 PPSPPPGPT----RCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPS 394
PSPP +C + C AG TCCC I C+ W CC A CC DH CCP
Sbjct: 381 KPSPPSPAPSPPQQCDRYSKCPAGTTCCCNYGIRNHCIVWGCCPVEGATCCKDHSTCCPK 440
Query: 395 NYPICDSVRHQC 406
YP+C++ C
Sbjct: 441 EYPVCNAKARTC 452
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 358 bits (919), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 187/372 (50%), Positives = 238/372 (63%), Gaps = 13/372 (3%)
Query: 45 EKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQEFKASFLGFSAASIDHDR 102
E ++R ++F DN FV HN + F L +N FADLT+ EF+A++LG + A R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAG--RGR 141
Query: 103 RRNASVQSPGNLRDVPASIDWRKKGAVTE-VKDQASCGACWAFSATGAIEGINKIVTGSL 161
R + + G + +P S+DWR KGAV VK+Q CG+CWAFSA A+EGINKIVTG L
Sbjct: 142 RVGEAYRHDG-VEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200
Query: 162 VSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
VSLSEQEL++C R+ NSGC GG+MD A+ F+ +N G+DTE+DYPY G+CN K +R
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSR 260
Query: 221 HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVL 280
+V+IDG++DVPEN+E L +AV QPVSV I R FQLY SG+FTG C T+LDH V+
Sbjct: 261 KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVV 320
Query: 281 IVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
VGY D+ G YW ++NSWG WG NGY+ M+RN G CGI M+ASYP K G NP
Sbjct: 321 AVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKKGPNP 380
Query: 339 PPSPPPGPT----RCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPS 394
PSPP +C + C AG TCCC I C+ W CC A CC DH CCP
Sbjct: 381 KPSPPSPAPSPPQQCDRYSKCPAGTTCCCNYGIRNHCIVWGCCPVEGATCCKDHSTCCPK 440
Query: 395 NYPICDSVRHQC 406
YP+C++ C
Sbjct: 441 EYPVCNAKARTC 452
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 358 bits (919), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 202/422 (47%), Positives = 257/422 (60%), Gaps = 33/422 (7%)
Query: 14 LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFT 72
S LP + I E+F+ W +H KAY +E ++R F+ N ++ + +
Sbjct: 30 FSELPPD--ESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHR 87
Query: 73 LSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR--DVPASIDWRKKGAVT 130
+ LN FADL+++EFK +L I+ R +A +S NL+ D P+S+DWRKKG VT
Sbjct: 88 VGLNKFADLSNEEFKQLYLSKVKKPINK-TRIDAEDRSRRNLQSCDAPSSLDWRKKGVVT 146
Query: 131 EVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQ 190
VKDQ CG+CW+FS TGAIEGIN IVT L+SLSEQEL+DCD + N GC GG MDYA++
Sbjct: 147 AVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT-NYGCEGGYMDYAFE 205
Query: 191 FVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSV 250
+VI N GIDTE +YPY G G CN K +V+IDGYKDV E + LL A QP+SV
Sbjct: 206 WVINNGGIDTEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETD-SALLCAAAQQPISV 264
Query: 251 GICGSERAFQLYSSGIFTGPCSTSLD---HAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 307
GI GS FQLY+ GI+ G CS D HAVLIVGY SENG DYWI+KNSWG SWG+ G
Sbjct: 265 GIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEG 324
Query: 308 YMHMQRNTGNSLGICGINMLASYPTKTGQ----------------------NPPPSPPPG 345
Y +++RNT G+C IN +ASYPTK PP P P
Sbjct: 325 YFYIKRNTDLPYGVCAINAMASYPTKEASAQSPTSPPSPPSPPPPPPPPPTPVPPPPSPQ 384
Query: 346 PTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQ 405
P+ C +YC + ETCCC ++ CL + CC + +AVCC+D YCCPS+YPICD
Sbjct: 385 PSDCGDFSYCPSDETCCCILNVFDYCLVYGCCAYENAVCCADSVYCCPSDYPICDVEEGL 444
Query: 406 CL 407
CL
Sbjct: 445 CL 446
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 356 bits (914), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 199/436 (45%), Positives = 248/436 (56%), Gaps = 32/436 (7%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQH----------GKAYSSEQEKQQRLKIFEDNY 57
L + + ++ P ++ L+E W +H G E + +RL++F N
Sbjct: 32 LAAAVTVTPPPERTDEEVRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNL 91
Query: 58 AFVTQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL 114
++ HN + G F L L FADLT +E++A L S R +V G+
Sbjct: 92 RYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRG------RNGTAVGVVGSR 145
Query: 115 R-------DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
R +P ++DWR++GAV EVKDQ CGACWAFSA A+EGINKIVTGSL+SLSEQ
Sbjct: 146 RYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQ 205
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
ELIDCD+ + GC GGLMD A+ F+IKN GIDTE DYP+ G G C+ + N +V+ID
Sbjct: 206 ELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDS 265
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE 287
++ VP N E+ L +AV QPVS I S RAFQLYSSGIF G C T LDH V +VGY SE
Sbjct: 266 FERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGSE 325
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPT 347
G DYWI+KNSWG WG GY+ M RN G CGI M YP K G NPPP P P
Sbjct: 326 GGKDYWIVKNSWGTQWGEAGYVRMARNVRVRAGKCGIAMEPLYPVKEGPNPPPGPTPPSP 385
Query: 348 R-----CSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSV 402
C+ C TCCC S G CL++ CC +A CC DH CCP +YP+C SV
Sbjct: 386 VKPPNVCNAEYSCPEATTCCCVSEYRGKCLAYGCCELENATCCEDHSSCCPHDYPVC-SV 444
Query: 403 RHQCLTVSLKFSFTVK 418
R S VK
Sbjct: 445 RDGTCRKSANSPMMVK 460
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 188/406 (46%), Positives = 250/406 (61%), Gaps = 28/406 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSF--TLSLNAFADLT 82
I E+F+ W ++H K Y +E ++R+ F+ N ++ + N S + LN FADL+
Sbjct: 46 ITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFADLS 105
Query: 83 HQEFKASFLGFSAASID-HDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
++EF+ +L I ++R++ +Q+ D P+S+DWR KG VT VKDQ CG+C
Sbjct: 106 NEEFREMYLSKVKKPITIEEKRKHRHLQTC----DAPSSLDWRNKGVVTAVKDQGDCGSC 161
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
W+FS TGAIE IN IVTG L+SLSEQEL+DCD + N GC GG MD A+Q+VI N GIDTE
Sbjct: 162 WSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGIDTE 221
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DYPY G G CN K + +V+I+GY DV + ++ LL A V QP+SVG+ GS FQL
Sbjct: 222 ADYPYTGVDGTCNTAKEEKKVVSIEGYVDV-DPSDSALLCATVQQPISVGMDGSALDFQL 280
Query: 262 YSSGIFTGPCS---TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
Y+ GI+ G CS +DHA+LIVGY SEN DYWI+KNSWG WGM GY +++RNT
Sbjct: 281 YTGGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGMEGYFYIRRNTSKP 340
Query: 319 LGICGINMLASYPTKTGQNPPPSPPPGPTR-----------------CSLLTYCAAGETC 361
G+C IN ASYPTK P P PP P C ++C + ETC
Sbjct: 341 YGVCAINADASYPTKVPSPPSPPSPPPPPSPPPPPPSPPPPCPQPSDCGDSSFCPSDETC 400
Query: 362 CCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC + C+ + CC + +AVCC++ YCCPS+YPICD CL
Sbjct: 401 CCILKLFSSCIIYGCCPYENAVCCAESTYCCPSDYPICDVDDGLCL 446
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 355 bits (910), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 166/306 (54%), Positives = 215/306 (70%), Gaps = 5/306 (1%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
FE+W +HGK Y S +EK R ++F +N + + N SS+ L LN FADL+H+EFK+
Sbjct: 404 FESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNK-EVSSYWLGLNEFADLSHEEFKS 462
Query: 89 SFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
+LG A + R R+ S + ++ D+P S+DWRKKGAVT VK+Q +CG+CWAFS
Sbjct: 463 KYLGLRA---EFPRSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWAFSTV 519
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EGIN+IVTG+L +LSEQELIDCD ++NSGC GGLMDYA+ F+ N G+ E DYPY
Sbjct: 520 AAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDDYPYL 579
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
+ G C +QK + IVTI GY+DVPE +E+ LL+A+ QP+SV I S R FQ YS G+F
Sbjct: 580 MEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSGGVF 639
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
GPC T LDH V VGY S G+DY I+KNSWG WG GY+ M+RNTG + G+CGIN +
Sbjct: 640 NGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCGINKM 699
Query: 328 ASYPTK 333
ASYPTK
Sbjct: 700 ASYPTK 705
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 354 bits (908), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 166/322 (51%), Positives = 225/322 (69%), Gaps = 9/322 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+++ ++ W +H K Y+ E+++R +IF++N F+ +HNN N ++ + L FADLT
Sbjct: 42 NEVISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLT 101
Query: 83 HQEFKASFLGFSAASIDHDRR----RNASVQSPGNLRDV-PASIDWRKKGAVTEVKDQAS 137
++E++A FLG + D RR +N S + DV P SIDWR+ GAV+ +KDQ S
Sbjct: 102 NEEYRAKFLGTKS---DPKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGS 158
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS A+EG+NKIVTG L+SLSEQEL+DCDRSYN+GC GGLMD A+QF+I N G
Sbjct: 159 CGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQFIINNGG 218
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
IDT+KDYPY+ G+C+ K+ VTIDG++DV +E L +AV QPVSV I S
Sbjct: 219 IDTDKDYPYQAVDGKCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVAIEASGM 278
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
A Q Y SG+FTG C ++LDH V+IVGY +E+G+DYW+++NSWGR WG NGY+ MQRN +
Sbjct: 279 ALQFYQSGVFTGECGSALDHGVVIVGYGTEDGIDYWLVRNSWGRDWGENGYIKMQRNVVD 338
Query: 318 SL-GICGINMLASYPTKTGQNP 338
+ G CGI M +SYP K QNP
Sbjct: 339 TFTGKCGIAMESSYPIKNTQNP 360
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 354 bits (908), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 169/326 (51%), Positives = 218/326 (66%), Gaps = 2/326 (0%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ + L + ELFE+W +H KAY S +EK R ++F +N + Q NN N
Sbjct: 31 FSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN 90
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
S + L LN FADLTH+EFK +LG + R+ +A+ + ++ D+P S+DWRKKGA
Sbjct: 91 S-YWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYR-DITDLPKSVDWRKKGA 148
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
V VKDQ CG+CWAFS A+EGIN+I TG+L SLSEQELIDCD ++NSGC GGLMDYA
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+Q++I G+ E DYPY + G C +QK + VTI GY+DVPEN+++ L++A+ QPV
Sbjct: 209 FQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPV 268
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SV I S R FQ Y G+F G C T LDH V VGY S G DY I+KNSWG WG G+
Sbjct: 269 SVAIEASGRDFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGF 328
Query: 309 MHMQRNTGNSLGICGINMLASYPTKT 334
+ M+RNTG G+CGIN +ASYPTKT
Sbjct: 329 IRMKRNTGKPEGLCGINKMASYPTKT 354
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 353 bits (907), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 164/319 (51%), Positives = 227/319 (71%), Gaps = 7/319 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
++ ++E W +HGK Y++ +EK++R +IF+DN F+ +HN + N ++ + LN F+DL+
Sbjct: 46 EEVMSIYEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHNAV-NRTYKVGLNRFSDLS 104
Query: 83 HQEFKASFLGFSAASIDHDRR--RNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
++E+++ +LG ID R R + SP ++P S+DWRK+GAV VK+Q+ C
Sbjct: 105 NEEYRSKYLG---TKIDPSRMMARPSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEG 161
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGINKIVTG+L +LSEQEL+DCDR+ N+GC GGL+DYA++F+I N GIDT
Sbjct: 162 CWAFSAIAAVEGINKIVTGNLTALSEQELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDT 221
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E+DYP++G G C++ K+N VTIDGY+ VP +E L +AV QPVSV I + FQ
Sbjct: 222 EEDYPFQGADGICDQYKINARAVTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQ 281
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG-NSL 319
LY SGIFTG C TS+DH V VGY +ENG+DYWI+KNSWG +WG GY+ M+RN ++
Sbjct: 282 LYESGIFTGTCGTSIDHGVTAVGYGTENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTA 341
Query: 320 GICGINMLASYPTKTGQNP 338
G CGI +L YP K GQNP
Sbjct: 342 GKCGIAILTLYPIKIGQNP 360
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 353 bits (907), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 167/319 (52%), Positives = 219/319 (68%), Gaps = 8/319 (2%)
Query: 28 LFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAFADLT 82
++ W +HGK+ S+ ++ +R IF+DN F+ HN N N+++ L L FA+LT
Sbjct: 3 IYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLT 62
Query: 83 HQEFKASFLGFSAA---SIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
+ E+++ +LG I + N + N+ +VP ++DWR+KGAV +KDQ +CG
Sbjct: 63 NDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCG 122
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS A+EGINKIVTG LVSLSEQEL+DCD+SYN GC GGLMDYA+QF++KN G++
Sbjct: 123 SCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLN 182
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TEKDYPY G G+CN N +VTIDGY+DVP +E L +AV QPVSV I RAF
Sbjct: 183 TEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAF 242
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Q Y SGIFTG C T++DHAV+ VGY SENGVDYWI++NSWG WG +GY+ M+RN +
Sbjct: 243 QHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKS 302
Query: 320 GICGINMLASYPTKTGQNP 338
G CGI + ASYP K NP
Sbjct: 303 GKCGIAIEASYPVKYSPNP 321
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 353 bits (907), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 168/308 (54%), Positives = 211/308 (68%), Gaps = 3/308 (0%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
ELFE W HGK Y + +EK R ++F+DN + + N +S+ L +N FADLTHQEF
Sbjct: 43 ELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKK-VTSYWLGVNEFADLTHQEF 101
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K +LG S R++ + ++ D+P S+DWRKKGAVT VK+Q SCG+CWAFS
Sbjct: 102 KNMYLGLKVES--SRTRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFST 159
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EGINKIV G+L SLSEQELIDCDR YN+GC GGLMDYA+ F++ + G+ E+DYPY
Sbjct: 160 VAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPY 219
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
C+ +K +VTI GYKDVPENNE L++A+ QP+SV I S R FQ YS G+
Sbjct: 220 LEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGV 279
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
F GPC T LDH V VGY S GVDY I+KNSWG WG GY+ M+RNTG G+CGIN
Sbjct: 280 FDGPCGTQLDHGVTAVGYGSSKGVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCGINK 339
Query: 327 LASYPTKT 334
+ASYPTK+
Sbjct: 340 MASYPTKS 347
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 353 bits (906), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 169/347 (48%), Positives = 230/347 (66%), Gaps = 16/347 (4%)
Query: 1 MNSLAFF-LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
+ SL FF L+++ L + ++ ++E W +H K Y+ EK QR +IF+DN F
Sbjct: 6 ITSLLFFSLITLSLAMDTSMRSNEEVMTMYEEWLVKHHKVYNGLGEKDQRFEIFKDNLGF 65
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGNLR--- 115
+ +HN N ++ + LN FAD T++E++ +LG +D +RN ++ R
Sbjct: 66 IDEHNAQ-NYTYKVGLNKFADTTNEEYRNMYLG-----TKNDAKRNVMKIKITTGHRYAF 119
Query: 116 ----DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
+P +DWR KGAV +KDQ SCG+CWAFS +E INKIVTG LVSLSEQEL+D
Sbjct: 120 NSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVD 179
Query: 172 CDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231
CDR++N GC GGLMDYA++F+++N GIDTE+DYPY+G G+C+ + N +V+IDGY+DV
Sbjct: 180 CDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSIDGYEDV 239
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVD 291
P NE L +AV QPVSV I RA QLY SG+FTG C T+LDH V++VGY ENGVD
Sbjct: 240 PAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYGFENGVD 299
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGN-SLGICGINMLASYPTKTGQN 337
YW+++NSWG +WG +GY ++RN + G CGI M ASYP K GQN
Sbjct: 300 YWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPVKYGQN 346
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 353 bits (906), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 168/308 (54%), Positives = 211/308 (68%), Gaps = 3/308 (0%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
ELFE W HGK Y + +EK R ++F+DN + + N +S+ L +N FADLTHQEF
Sbjct: 46 ELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDE-TNKKVTSYWLGVNEFADLTHQEF 104
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K +LG S R++ + ++ D+P S+DWRKKGAVT VK+Q SCG+CWAFS
Sbjct: 105 KNMYLGLKVES--SRTRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFST 162
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EGINKIV G+L SLSEQELIDCDR YN+GC GGLMDYA+ F++ + G+ E+DYPY
Sbjct: 163 VAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPY 222
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
C+ +K +VTI GYKDVPENNE L++A+ QP+SV I S R FQ YS G+
Sbjct: 223 LEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGV 282
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
F GPC T LDH V VGY S GVDY I+KNSWG WG GY+ M+RNTG G+CGIN
Sbjct: 283 FDGPCGTQLDHGVTAVGYGSSKGVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCGINK 342
Query: 327 LASYPTKT 334
+ASYPTK+
Sbjct: 343 MASYPTKS 350
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 353 bits (906), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 168/326 (51%), Positives = 217/326 (66%), Gaps = 2/326 (0%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ + L + ELFE+W +H K Y S +EK R ++F +N + Q NN N
Sbjct: 31 FSIVGYTPEQLTSTEKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEIN 90
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
S + L LN FADLTH+EFK +LG + R+ +A+ + ++ D+P S+DWRKKGA
Sbjct: 91 S-YWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYR-DITDLPKSVDWRKKGA 148
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
V VKDQ CG+CWAFS A+EGIN+I TG+L SLSEQELIDCD ++NSGC GGLMDYA
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+Q++I G+ E DYPY + G C +QK + VTI GY+DVPEN+++ L++A+ QPV
Sbjct: 209 FQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPV 268
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SV I S R FQ Y G+F G C T LDH V VGY S G DY I+KNSWG WG G+
Sbjct: 269 SVAIEASGRDFQFYKGGVFNGQCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGF 328
Query: 309 MHMQRNTGNSLGICGINMLASYPTKT 334
+ M+RNTG G+CGIN +ASYPTKT
Sbjct: 329 IRMKRNTGKPEGLCGINKMASYPTKT 354
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 172/332 (51%), Positives = 222/332 (66%), Gaps = 5/332 (1%)
Query: 3 SLAFFL-LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
SLAF SI+ SS L + ELFE+W +HGK Y + +EK R +IF+DN +
Sbjct: 21 SLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHID 80
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
+ N + S++ L LN FADL+H+EF +LG +D+ RRR + + ++P S+
Sbjct: 81 ERNKV-VSNYWLGLNEFADLSHREFNNKYLGLK---VDYSRRRESPEEFTYKDVELPKSV 136
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWRKKGAV VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQELIDCDR+YN+GC
Sbjct: 137 DWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCN 196
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMDYA+ F+++N G+ E+DYPY + G C K +VTI GY DVP+NNE+ LL+
Sbjct: 197 GGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLK 256
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
A+ QP+SV I S R FQ YS G+F G C + LDH V VGY + GVDY +KNSWG
Sbjct: 257 ALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYITVKNSWGS 316
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
WG GY+ M+RN G GICGI +ASYPTK
Sbjct: 317 KWGEKGYIRMRRNIGKPEGICGIYKMASYPTK 348
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 166/319 (52%), Positives = 221/319 (69%), Gaps = 8/319 (2%)
Query: 28 LFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAFADLT 82
++ W +HGK+ S+ ++ +R IF+DN F+ HN N N+++ L L FA+LT
Sbjct: 3 IYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLT 62
Query: 83 HQEFKASFLGFSAASIDH-DRRRNASVQSPGNLRDV--PASIDWRKKGAVTEVKDQASCG 139
+ E+++ +LG + + +N +++ + DV P ++DWR+KGAV +KDQ +CG
Sbjct: 63 NDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCG 122
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS A+EGINKIVTG LVSLSEQEL+DCD+SYN GC GGLMDYA+QF++KN G++
Sbjct: 123 SCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLN 182
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TEKDYPY G G+CN N +VTIDGY+DVP +E L +AV QPVSV I RAF
Sbjct: 183 TEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAF 242
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Q Y SGIFTG C T++DHAV+ VGY SENGVDYWI++NSWG WG +GY+ M+RN +
Sbjct: 243 QHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKS 302
Query: 320 GICGINMLASYPTKTGQNP 338
G CGI + ASYP K NP
Sbjct: 303 GKCGIAIEASYPVKYSPNP 321
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 352 bits (902), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 167/325 (51%), Positives = 217/325 (66%), Gaps = 1/325 (0%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SI+ S L + ELFE W KAY + +EK R ++F+DN + + N G S
Sbjct: 32 SIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS 91
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAV 129
+ L LN FADL+H+EFK +LG + D R+ + + ++ VP S+DWRKKGAV
Sbjct: 92 -YWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAV 150
Query: 130 TEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 189
EVK+Q SCG+CWAFS A+EGINKIVTG+L +LSEQELIDCD +YN+GC GGLMDYA+
Sbjct: 151 AEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAF 210
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVS 249
++++KN G+ E+DYPY + G C QK VTI+G++DVP N+EK LL+A+ QP+S
Sbjct: 211 EYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLS 270
Query: 250 VGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYM 309
V I S R FQ YS G+F G C LDH V VGY S G DY I+KNSWG WG GY+
Sbjct: 271 VAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYI 330
Query: 310 HMQRNTGNSLGICGINMLASYPTKT 334
++RNTG G+CGIN +AS+PTKT
Sbjct: 331 RLKRNTGKPEGLCGINKMASFPTKT 355
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 351 bits (900), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 177/405 (43%), Positives = 246/405 (60%), Gaps = 27/405 (6%)
Query: 29 FETWCKQHGKAYSSE-QEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
F+ W + H ++Y ++ E + R K++ +N +V +N S + L+LN ADL+ E+K
Sbjct: 13 FKEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNARTTSHW-LTLNHLADLSTPEYK 71
Query: 88 ASFLGF-SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
+ LGF + A + ++ + + +P +IDWRKK AV EVK+Q CG+CWAF+
Sbjct: 72 SKLLGFDNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFAT 131
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
TG++EGIN IVTGSLVSLSEQEL+DCD + GC GGLMDYAY ++IKN GI+TE+DYPY
Sbjct: 132 TGSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYPY 191
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
GQC+ K+ R +VTID Y+DVPEN+E L +A QPV+V I ++FQLY G+
Sbjct: 192 TAMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGGV 251
Query: 267 FTGP-CSTSLDHAVLIVGYDSE---NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
+ P C TSL+H VL+VGY + +G +YWI+KNSWG WG GY+ ++ + ++ G+C
Sbjct: 252 YDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAEGLC 311
Query: 323 GINMLASYPTK--------------------TGQNPPPSPPPGPTRCSLLTYCAAGETCC 362
GI M SYP K P PPGP +C C G TCC
Sbjct: 312 GIAMAPSYPVKTGPNPPTPGPTPGPSPKPGPKPGPKPGPTPPGPVKCDDDNECPNGSTCC 371
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
C + I +C W CC A CC DH +CCP++ P+CD+ +CL
Sbjct: 372 CVNEIFNMCFQWGCCPMPKATCCDDHEHCCPADLPVCDTDAGRCL 416
>gi|357439999|ref|XP_003590277.1| Cysteine protease [Medicago truncatula]
gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula]
Length = 514
Score = 351 bits (900), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 196/445 (44%), Positives = 257/445 (57%), Gaps = 66/445 (14%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSF--TLSLNAFADLT 82
+ ELF+ W K+H K Y +E RL+ F+ N ++ + N M NS L LN FAD++
Sbjct: 48 VVELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFADMS 107
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG--- 139
++EFK F+ I R N V+ + D P S+DWRKKG VT VKDQ +CG
Sbjct: 108 NEEFKNKFISKVKKPISK-RASNLHVKVE-SCDDAPYSLDWRKKGVVTGVKDQGNCGKLL 165
Query: 140 -----------------------------------------ACWAFSATGAIEGINKIVT 158
+CW+FS+TGAIEG+N IVT
Sbjct: 166 YFMHFKSFLVIYILELTTNFPLYSFESQFCILEKKKLDFVGSCWSFSSTGAIEGVNAIVT 225
Query: 159 GSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKL 218
G L+SLSEQEL+DCD + N GC GG MDYA+++VI N GIDTE DYPY G G CN K
Sbjct: 226 GDLISLSEQELVDCDTT-NDGCEGGYMDYAFEWVINNGGIDTEADYPYIGVGGTCNVTKE 284
Query: 219 NRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTS---L 275
+VTIDGY DV ++ + L A V QP+SVGI GS FQLY+ GI+ G CS++ +
Sbjct: 285 ETKVVTIDGYTDVTQS-DSALFCATVKQPISVGIDGSTLDFQLYTGGIYDGDCSSNPDDI 343
Query: 276 DHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK-- 333
DHAVLIVGY S+ DYWI+KNSWG SWG+ G+++++RNT G+C IN +AS+PTK
Sbjct: 344 DHAVLIVGYGSDGNQDYWIVKNSWGTSWGIEGFIYIRRNTNLKYGVCAINYMASFPTKES 403
Query: 334 -----------TGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSA 382
PP P P P++C +YC ETCCC + CL++ CC + +A
Sbjct: 404 TSISPTSPPSPPSPPPPTPPSPTPSKCGDFSYCTTEETCCCLYELFDFCLAYGCCEYENA 463
Query: 383 VCCSDHRYCCPSNYPICDSVRHQCL 407
VCC+ +YCCPS+YPICD+ CL
Sbjct: 464 VCCTGTKYCCPSDYPICDTEDGLCL 488
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 351 bits (900), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 220/326 (67%), Gaps = 5/326 (1%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L + +LFE+W +HGK+Y S +EK R ++F+DN + + N
Sbjct: 28 FSIVGYSPDDLTSMDKLTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDE-TNKKV 86
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKG 127
SS+ L LN FADL+H+EFK +LG I+ +RR++ + S ++ D+P S+DWRKKG
Sbjct: 87 SSYWLGLNEFADLSHEEFKRKYLGLK---IELPKRRDSPEEFSYKDVADLPKSVDWRKKG 143
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 187
AV VK+Q +CG+CWAFS A+EGIN+IVTG+L +LSEQELIDCD+ +N+GC GGLMDY
Sbjct: 144 AVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDY 203
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 247
A+ F+I N G+ E+DYPY + G C ++K +VTI GY DVPE+NE+ L+A+ QP
Sbjct: 204 AFAFIISNGGLRKEEDYPYVMEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQP 263
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 307
+SV I S R FQ YS GIF G C T LDH V VGY + GVDY +KNSWG WG G
Sbjct: 264 LSVAIEASSRGFQFYSGGIFNGHCGTELDHGVAAVGYGTSKGVDYITVKNSWGSKWGEKG 323
Query: 308 YMHMQRNTGNSLGICGINMLASYPTK 333
Y+ M+RN G GICGI +ASYPTK
Sbjct: 324 YIRMKRNVGKPEGICGIYKMASYPTK 349
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 350 bits (898), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 194/418 (46%), Positives = 253/418 (60%), Gaps = 37/418 (8%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSS--FTLSLNAFADLT 82
+ ELF+ W ++HGK Y QE +++ + F DN +V + N +S + LN FAD++
Sbjct: 47 VVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMS 106
Query: 83 HQEFKASFLGF----SAASIDHDRRRNASVQSPGNLR--DVPASIDWRKKGAVTEVKDQA 136
++EF+ ++ ++ + +RRR + + D P S+DWRK G VT VKDQ
Sbjct: 107 NEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQG 166
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
CG+CWAFS+TGAIEGIN + G L+SLSEQEL+DCD S N GC GG MDYA+++V+ N
Sbjct: 167 DCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCD-STNDGCEGGYMDYAFEWVMSNG 225
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
GIDTE DYPY G+ G CN K V+IDGY+DV E E L AV+ QP+SVGI G
Sbjct: 226 GIDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAEE-ESALFCAVLKQPISVGIDGGA 284
Query: 257 RAFQLYSSGIFTGPCSTSLD---HAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQR 313
FQLY+ GI+ G CS D HAVL+VGY +E+G +YWIIKNSWG WGM GY +++R
Sbjct: 285 IDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYGAESGEEYWIIKNSWGTDWGMKGYAYIKR 344
Query: 314 NTGNSLGICGINMLASYPTK------------------------TGQNPPPSPPPGPTRC 349
NT G+C IN +ASYPTK + PPP P P PT+C
Sbjct: 345 NTSKDYGVCAINAMASYPTKESSAPSPYPSPAVPPPPPPPPPPPSPPPPPPPPSPSPTQC 404
Query: 350 SLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
+YCAA ETCCC CL + CC ++ AVCC+ YCCP +YPICD CL
Sbjct: 405 GDFSYCAATETCCCIFEFFDYCLIYGCCDYTDAVCCTGTEYCCPHDYPICDIEEGLCL 462
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 350 bits (897), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 171/332 (51%), Positives = 222/332 (66%), Gaps = 5/332 (1%)
Query: 3 SLAFFL-LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
SLAF SI+ SS L + ELFE+W +HGK Y + +EK R +IF+DN +
Sbjct: 21 SLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHID 80
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
+ N + S++ L L+ FADL+H+EF +LG +D+ RRR + + ++P S+
Sbjct: 81 ERNKV-VSNYWLGLSEFADLSHREFNNKYLGLK---VDYSRRRESPEEFTYKDVELPKSV 136
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWRKKGAV VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQELIDCDR+YN+GC
Sbjct: 137 DWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCN 196
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMDYA+ F+++N G+ E+DYPY + G C K +VTI GY DVP+NNE+ LL+
Sbjct: 197 GGLMDYAFSFIVENGGLHKEEDYPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLK 256
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
A+ QP+SV I S R FQ YS G+F G C + LDH V VGY + GVDY +KNSWG
Sbjct: 257 ALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYITVKNSWGS 316
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
WG GY+ M+RN G GICGI +ASYPTK
Sbjct: 317 KWGEKGYIRMRRNIGKPEGICGIYKMASYPTK 348
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 350 bits (897), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 187/407 (45%), Positives = 247/407 (60%), Gaps = 23/407 (5%)
Query: 23 SDINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHNNM--GNSSFTLSLN 76
++ +++ W +H S E ++R ++F DN FV HN G+ F L +N
Sbjct: 60 AEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMN 119
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAV-TEVKDQ 135
FADLT+ EF+A++LG + A R + + +P S+DWR KGAV + VK+Q
Sbjct: 120 RFADLTNDEFRAAYLGTTPAGRG---RHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQ 176
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGG-LMDYAYQFVIK 194
CG+CWAFSA A+EGINKIVTG LVSLSEQEL++C R+ + G +MD A+ F+ +
Sbjct: 177 GQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGGNSGCNGGIMDDAFAFITR 236
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
N G+DTE+DYPY G+C+ K +R +V+IDG++DVPEN+E L +AV QPVSV I
Sbjct: 237 NGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDA 296
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
R FQLY SG+FTG C TSLDH V+ VGY D+ G DYW ++NSWG WG NGY+ M+
Sbjct: 297 GGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRME 356
Query: 313 RNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPT----------RCSLLTYCAAGETCC 362
RN G CGI M+ASYP K G NP PSP P P+ +C + C AG TCC
Sbjct: 357 RNVTARTGKCGIAMMASYPIKKGPNPKPSPSPKPSPPSPAPSPPQQCDRYSKCPAGTTCC 416
Query: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
C I C+ W CC A CC DH CCP +YP+C++ C V
Sbjct: 417 CNYGIRNHCIVWGCCPVEGATCCKDHSTCCPKDYPVCNAKARTCSKV 463
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 180/400 (45%), Positives = 244/400 (61%), Gaps = 23/400 (5%)
Query: 29 FETWCKQHGKAYSSE-QEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
F+ W Q+ KAY+++ +E + R ++ +N ++ +N S + L LNAFADLT EF+
Sbjct: 45 FQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTSHW-LHLNAFADLTTDEFR 103
Query: 88 ASFLGFSAASIDHDRRRNAS--VQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
+ LG+ + R +S + + +P IDWRKKGAVTEVK+Q CG+CWAF+
Sbjct: 104 -NRLGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFA 162
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
TG++EGIN IVTG L SLSEQEL+DCD + GC GGLMDYAYQ++IKN G+DTE DYP
Sbjct: 163 TTGSVEGINAIVTGELASLSEQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDDYP 222
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y + G C K NR +VTIDGY D+PEN+E L +A QP++V I ++FQLY G
Sbjct: 223 YTAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGG 282
Query: 266 IFTGP-CSTSLDHAVLIVGYDSENGV-DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
++ P C TSL+H VL+VGY + +YWI+KNSWG WG NGY+ ++ + G+CG
Sbjct: 283 VYDDPTCGTSLNHGVLVVGYGKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCG 342
Query: 324 INMLASYPTK----------------TGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSI 367
I M S+PTK P P P P +C C AG TCCC
Sbjct: 343 IAMAPSFPTKKGPNPPTPGPTPGPGPKPSPSPKPPSPQPVKCDDDNECPAGSTCCCVMEF 402
Query: 368 LGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
+C W CC A CCSD+++CCP++ P+CD+V +CL
Sbjct: 403 FNMCFQWGCCPMPKATCCSDNQHCCPADLPVCDTVGGRCL 442
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 174/334 (52%), Positives = 225/334 (67%), Gaps = 8/334 (2%)
Query: 3 SLAFFL-LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
SLAF SI+ SS L + ELFE+W +HGK Y + +EK R ++F+DN +
Sbjct: 20 SLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHID 79
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV--PA 119
N + S++ L LN FADL+HQEFK +LG +D +RR +S + RDV P
Sbjct: 80 DRNKV-VSNYWLGLNEFADLSHQEFKNKYLGLK---VDLSQRRESS-EEEFTYRDVDLPK 134
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179
S+DWRKKGAVT VK+Q CG+CWAFS A+EGIN+IVTG+L SLSEQELIDCD +YN+G
Sbjct: 135 SVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNG 194
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMDYA+ F++KN G+ E+DYPY + C +K +VTI+GY DVP+NNE+ L
Sbjct: 195 CNGGLMDYAFSFIVKNGGLHKEEDYPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSL 254
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSW 299
L+A+ QP+SV I S R FQ YS G+F G C + LDH V VGY + G+DY I+KNSW
Sbjct: 255 LKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSELDHGVSAVGYGTSKGLDYIIVKNSW 314
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
G WG G++ M+RN G S GICG+ +ASYPTK
Sbjct: 315 GAKWGEKGFIRMKRNIGKSEGICGLYKMASYPTK 348
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 348 bits (892), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 168/287 (58%), Positives = 200/287 (69%), Gaps = 6/287 (2%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P S+DWRK+GAV VKDQ SCG+CWAFS GA+EGINKIVTG L+SLSEQEL+DCD SY
Sbjct: 3 IPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSY 62
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMDYA++F+IKN GIDTE+DYPY+ G+C++ + N +VTID Y+DVPENNE
Sbjct: 63 NQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPENNE 122
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
L +A+ QP+SV I RAFQLYSSG+F G C T LDH V+ VGY +ENG DYWI++
Sbjct: 123 AALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGTENGKDYWIVR 182
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN------PPPSPPPGPTRCS 350
NSWG SWG +GY+ M RN + G CGI M ASYP K GQN PPSP PT+C
Sbjct: 183 NSWGGSWGESGYIKMARNIAEATGKCGIAMEASYPIKKGQNPPQPGPSPPSPIKPPTQCD 242
Query: 351 LLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYP 397
C G TCCC C W CC +A CC D+ CCP YP
Sbjct: 243 KYYSCPEGNTCCCLFKYGKYCFGWGCCPLEAATCCDDNTSCCPHEYP 289
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 347 bits (890), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 166/320 (51%), Positives = 224/320 (70%), Gaps = 10/320 (3%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ +++ W ++HGKAY+ EK +R +IF++N F+ +HN+ N ++ + L FADLT+
Sbjct: 23 EVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ-NRTYKVGLTKFADLTN 81
Query: 84 QEFKASFLGFSAASIDHDRR----RNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASC 138
QE++A FLG + D RR +N S + D +P S+DWR KGAV +KDQ SC
Sbjct: 82 QEYRAMFLGTRS---DPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQGSC 138
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFS A+EGIN+IVTG L+SLSEQEL+DCDR YN+GC GGLMDYA+QF+I N G+
Sbjct: 139 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGGLMDYAFQFIINNGGL 198
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
DTEKDYPY G C++ K+ V+IDG++DV +EK L +AV QPVSV I S A
Sbjct: 199 DTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVSVAIEASGMA 258
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
Q Y SG+FTG C T+LDH V++VGY +E G+DYW+++NSWG WG +GY+ MQRN ++
Sbjct: 259 LQFYQSGVFTGECGTALDHGVVVVGYGTEKGLDYWLVRNSWGTEWGEHGYIKMQRNVRDT 318
Query: 319 L-GICGINMLASYPTKTGQN 337
G CGI M +SYP K GQN
Sbjct: 319 YTGRCGIAMESSYPVKNGQN 338
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 347 bits (889), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 170/334 (50%), Positives = 223/334 (66%), Gaps = 7/334 (2%)
Query: 3 SLAFFL-LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
SLAF SI+ SS L + ELFE+W +HGK Y + +EK R ++F+DN +
Sbjct: 20 SLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHID 79
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV--PA 119
+ N + S++ L LN FADL+HQEFK +LG ++ +RR +S + RDV P
Sbjct: 80 ERNKI-VSNYWLGLNEFADLSHQEFKNKYLGLK---VNLSQRRESSNEEEFTYRDVDLPK 135
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179
S+DWRKKGAVT VK+Q CG+CWAFS A+EGIN+IVTG+L SLSEQELIDCD +YN+G
Sbjct: 136 SVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNG 195
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMDYA+ F+++N G+ E DYPY + C +K +VTI+GY DVP+NNE+ L
Sbjct: 196 CNGGLMDYAFSFIVQNGGLHKEDDYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSL 255
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSW 299
L+A+ QP+SV I S R FQ YS G+F G C + LDH V VGY + +DY I+KNSW
Sbjct: 256 LKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLDHGVSAVGYGTSKNLDYIIVKNSW 315
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
G WG G++ M+RN G GICG+ +ASYPTK
Sbjct: 316 GAKWGEKGFIRMKRNIGKPEGICGLYKMASYPTK 349
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 347 bits (889), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 167/324 (51%), Positives = 219/324 (67%), Gaps = 9/324 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAF 78
++ ++ W +HGK ++ ++ +R IF+DN F+ HN N N+++ L L F
Sbjct: 44 EVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKF 103
Query: 79 ADLTHQEFKASFLGFS---AASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
DLT+ E++ +LG A I + N + N ++VP ++DWR+KGAV +KDQ
Sbjct: 104 TDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQ 163
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
+CG+CWAFS T A+EGINKIVTG L+SLSEQEL+DCD+SYN GC GGLMDYA+QF++KN
Sbjct: 164 GTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKN 223
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
G++TEKDYPYRG G+CN N +V+IDGY+DVP +E L +A+ QPVSV I
Sbjct: 224 GGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAG 283
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
R FQ Y SGIFTG C T+LDHAV+ VGY SENGVDYWI++NSWG WG GY+ M+RN
Sbjct: 284 GRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNL 343
Query: 316 GNSL-GICGINMLASYPTKTGQNP 338
S G CGI + ASYP K NP
Sbjct: 344 AASKSGKCGIAVEASYPVKYSPNP 367
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 166/326 (50%), Positives = 215/326 (65%), Gaps = 2/326 (0%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SI+ S L + ELFE W KAY + +EK R ++F+DN + + N
Sbjct: 32 SIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDE-TNKKVK 90
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAV 129
S+ L LN FADL+H+EFK +LG + D R+ + + ++ VP S+DWRKKGAV
Sbjct: 91 SYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAV 150
Query: 130 TEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 189
EVK+Q SCG+CWAFS A+EGINKIVTG+L +LSEQELIDCD +YN+GC GGLMDYA+
Sbjct: 151 AEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAF 210
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVS 249
++++KN G+ E+DYPY + G C QK VTIDG++DVP N+EK LL+A+ QP+S
Sbjct: 211 EYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTIDGHQDVPTNDEKSLLKALAHQPLS 270
Query: 250 VGICGSERAFQLYSS-GIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
V I S R FQ YS +F G C LDH V VGY S G DY I+KNSWG WG GY
Sbjct: 271 VAIDASGREFQFYSGVSVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGY 330
Query: 309 MHMQRNTGNSLGICGINMLASYPTKT 334
+ ++RNTG G+CGIN +AS+PTKT
Sbjct: 331 IRLKRNTGKPEGLCGINKMASFPTKT 356
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 166/324 (51%), Positives = 219/324 (67%), Gaps = 9/324 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAF 78
++ ++ W +HGK ++ ++ +R IF+DN F+ HN + N+++ L L F
Sbjct: 44 EVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKF 103
Query: 79 ADLTHQEFKASFLGFS---AASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
DLT+ E++ +LG A I + N + N ++VP ++DWR+KGAV +KDQ
Sbjct: 104 TDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQ 163
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
+CG+CWAFS T A+EGINKIVTG L+SLSEQEL+DCD+SYN GC GGLMDYA+QF++KN
Sbjct: 164 GTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKN 223
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
G++TEKDYPYRG G+CN N +V+IDGY+DVP +E L +A+ QPVSV I
Sbjct: 224 GGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAG 283
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
R FQ Y SGIFTG C T+LDHAV+ VGY SENGVDYWI++NSWG WG GY+ M+RN
Sbjct: 284 GRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNL 343
Query: 316 GNSL-GICGINMLASYPTKTGQNP 338
S G CGI + ASYP K NP
Sbjct: 344 AASKSGKCGIAVEASYPVKYSPNP 367
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 345 bits (885), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 166/324 (51%), Positives = 218/324 (67%), Gaps = 9/324 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAF 78
++ ++ W +HGK ++ ++ +R IF+DN F+ HN N N+++ L L F
Sbjct: 44 EVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKF 103
Query: 79 ADLTHQEFKASFLGFS---AASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
DLT+ E++ +LG A I + N + N ++VP ++DWR+KGAV +KDQ
Sbjct: 104 TDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQ 163
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
+CG+CWAFS T A+EGINKIVTG L+SLSEQEL+DCD+SYN GC GGLMDYA+QF++KN
Sbjct: 164 GTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKN 223
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
G++TEKDYPYRG G+CN N +V+IDGY+DVP +E L +A+ QPV V I
Sbjct: 224 GGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAIEAG 283
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
R FQ Y SGIFTG C T+LDHAV+ VGY SENGVDYWI++NSWG WG GY+ M+RN
Sbjct: 284 GRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNL 343
Query: 316 GNSL-GICGINMLASYPTKTGQNP 338
S G CGI + ASYP K NP
Sbjct: 344 AASKSGKCGIAVEASYPVKYSPNP 367
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 344 bits (883), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 170/334 (50%), Positives = 222/334 (66%), Gaps = 7/334 (2%)
Query: 3 SLAFFL-LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
SLAF SI+ SS L + ELFE+W +HGK Y + +EK R ++F+DN +
Sbjct: 20 SLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHID 79
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV--PA 119
N + S++ L LN FADL+HQEFK +LG +D +RR +S + RDV P
Sbjct: 80 DRNKI-VSNYWLGLNEFADLSHQEFKNKYLGLK---VDLSQRRESSNEEEFTYRDVDLPK 135
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179
S+DWRKKGAVT VK+Q CG+CWAFS A+EGIN+IVTG+L SLSEQELIDCD +YN+G
Sbjct: 136 SVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNG 195
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMDYA+ F+ +N G+ E+DYPY + C +K +VTI+GY DVP+NNE+ L
Sbjct: 196 CNGGLMDYAFSFIGQNGGLHKEEDYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSL 255
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSW 299
L+A+ QP+SV I S R FQ YS G+F G C + LDH V VGY + +DY I+KNSW
Sbjct: 256 LKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLDHGVSAVGYGTSKNLDYIIVKNSW 315
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
G WG G++ M+R+ G GICG+ +ASYPTK
Sbjct: 316 GAKWGEKGFIRMKRDIGKPEGICGLYKMASYPTK 349
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 344 bits (882), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 165/317 (52%), Positives = 224/317 (70%), Gaps = 12/317 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+++ W +HGKAY+ E+ +R +IF++N F+ +HN+ N ++ + L FADLT++E++
Sbjct: 3 MYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ-NHTYKVGLTKFADLTNEEYR 61
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNL------RDVPASIDWRKKGAVTEVKDQASCGAC 141
A FLG + + +RR +SP +P S+DWR KGAV +KDQ SCG+C
Sbjct: 62 AMFLGTRSDA----KRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSC 117
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+IVTG L+SLSEQEL+DCDR+YN+GC GGLMDYA+QF+I N G+DTE
Sbjct: 118 WAFSTVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTE 177
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
KDYPY G +C+K K+ V+IDG++DV +EK L +AV QPVSV I S A Q
Sbjct: 178 KDYPYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQF 237
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL-G 320
Y SG+FTG C T+LDH V++VGY SENG+DYW+++NSWG WG +GY+ MQRN G++ G
Sbjct: 238 YQSGVFTGECGTALDHGVVVVGYASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTG 297
Query: 321 ICGINMLASYPTKTGQN 337
CGI M +SYP K G+N
Sbjct: 298 RCGIAMESSYPVKNGEN 314
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 171/328 (52%), Positives = 217/328 (66%), Gaps = 9/328 (2%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ + L I +LFE+W +HGK Y S +EK R +IF+DN F N
Sbjct: 13 FSIVGYTPEDLTSGDKIIDLFESWISKHGKIYESIEEKWLRFEIFKDN-LFHIDETNKKV 71
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV---PASIDWRK 125
++ L LN F+DL+H+EFK +LG +D RR S + N +DV P S+DWRK
Sbjct: 72 VNYWLGLNEFSDLSHEEFKNKYLGLK---VDMSERRECSQEF--NYKDVMSIPKSVDWRK 126
Query: 126 KGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLM 185
KGAVT+VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQEL+DCD + N GC GGLM
Sbjct: 127 KGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLM 186
Query: 186 DYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA 245
DYA+ ++I N G+ E DYPY + G C +K +VTI GY DVP+N+E+ LL+A+
Sbjct: 187 DYAFSYIISNGGLHKEVDYPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALAN 246
Query: 246 QPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 305
QP+SV I S R FQ YS G+F G C T LDH V VGY S NG+DY I+KNSWG WG
Sbjct: 247 QPLSVAIEASGRDFQFYSGGVFDGHCGTQLDHGVAAVGYGSTNGLDYIIVKNSWGSKWGE 306
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPTK 333
GY+ M+RNTG G+CGIN +ASYPTK
Sbjct: 307 KGYIRMKRNTGKPAGLCGINKMASYPTK 334
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 188/383 (49%), Positives = 229/383 (59%), Gaps = 16/383 (4%)
Query: 48 QRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQEFKASFL----GFSAASIDH 100
+RL++F DN ++ HN + G F L L FADLT +E++A L G + ++
Sbjct: 91 RRLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGV 150
Query: 101 DRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGS 160
RR P +P ++DWR++GAV EVKDQ CG CWAFSA A+EGINKIVTGS
Sbjct: 151 VGRRR---YLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGS 207
Query: 161 LVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
L+SLSEQELIDCD+ + GC GGLMD A+ F+IKN GIDTE DYP+ G G C+ + N
Sbjct: 208 LISLSEQELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNT 267
Query: 221 HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVL 280
+V+ID ++ VP N E+ L +AV QPVS I S RAFQLYSSGIF G C T LDH V
Sbjct: 268 RVVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVT 327
Query: 281 IVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPP 340
+VGY SE G DYWI+KNSWG WG GY+ M RN GI M YP K G NPPP
Sbjct: 328 VVGYGSEGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPVKEGPNPPP 387
Query: 341 SPPPGPTR-----CSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSN 395
P P C+ C TCCC S G CL++ CC +A CC DH CCP +
Sbjct: 388 GPTPPSPVKPPNVCNAEYSCPEATTCCCVSEYRGKCLAYGCCELENATCCEDHSSCCPHD 447
Query: 396 YPICDSVRHQCLTVSLKFSFTVK 418
YP+C SVR S VK
Sbjct: 448 YPVC-SVRDGTCRKSANSPMMVK 469
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 342 bits (878), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 165/326 (50%), Positives = 219/326 (67%), Gaps = 5/326 (1%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ + L I +LFE+W +H K Y S +EK R +IF+DN F N
Sbjct: 13 FSIVGYAPEDLTSRDRIIDLFESWISKHQKIYESIEEKWHRFEIFKDNL-FHIDETNKKV 71
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKG 127
++ L LN FADL+H+EFK +LG + +D RR S + + ++ +P S+DWRKKG
Sbjct: 72 VNYWLGLNEFADLSHEEFKNKYLGLN---VDLSNRRECSEEFTYKDVSSIPKSVDWRKKG 128
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 187
AVT+VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQEL+DCD +YN+GC GGLMDY
Sbjct: 129 AVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLMDY 188
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 247
A+ ++I N G+ E+DYPY + G C +K +VTI GY DVP+N+E+ LL+A+ QP
Sbjct: 189 AFAYIISNGGLHKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQP 248
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 307
+SV I S R FQ YS G+F G C T LDH V VGY S G+D+ ++KNSWG WG G
Sbjct: 249 LSVAIDASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGSAKGLDFIVVKNSWGSKWGEKG 308
Query: 308 YMHMQRNTGNSLGICGINMLASYPTK 333
++ M+RNTG G+CGIN +ASYPTK
Sbjct: 309 FIRMKRNTGKPAGLCGINKMASYPTK 334
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 169/319 (52%), Positives = 211/319 (66%), Gaps = 15/319 (4%)
Query: 104 RNASVQSPGNLRD---------VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGIN 154
R A ++PG D +P S+DWR+KGAV +KDQ CG+CWAFS ++EGIN
Sbjct: 19 RGAGRRTPGLASDRYRYRAGDALPDSVDWREKGAVVPIKDQGGCGSCWAFSTIASVEGIN 78
Query: 155 KIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCN 214
KIVTG L+SLSEQEL+DCD++YN GC GGLMDYA+QF+I N GIDTEKDYPY Q G+C+
Sbjct: 79 KIVTGDLISLSEQELVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTEKDYPYTEQDGRCD 138
Query: 215 KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTS 274
+ N +V+I+ Y+DVP N+E+ L +A +QP++V I G R+FQLY+SGIFTG C TS
Sbjct: 139 SYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQLYNSGIFTGKCGTS 198
Query: 275 LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
LDH V +VGY SE+G DYWI++NSWG SWG GY+ M RN + GICGI M ASYP K
Sbjct: 199 LDHGVTVVGYGSESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSGICGIAMEASYPIKK 258
Query: 335 GQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDH 388
GQNPP P P+ C C TCCC C +W CC A CC DH
Sbjct: 259 GQNPPNPGPSPPSPVKPPSVCDNYYSCPESSTCCCLFQYGRSCFAWGCCPLEGATCCDDH 318
Query: 389 RYCCPSNYPICDSVRHQCL 407
CCP ++PIC+ + CL
Sbjct: 319 SSCCPHDFPICNVQQGLCL 337
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 179/400 (44%), Positives = 243/400 (60%), Gaps = 20/400 (5%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
++ +++ W +H A + + RL++F++N FV +HN + G ++ L +N FAD
Sbjct: 47 EVRIIYQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFAD 106
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD---VPASIDWRKKGAVTEVKDQAS 137
LT++E++A FL + R + + + LR+ +P SIDWR+KGAV VK+Q
Sbjct: 107 LTNEEYRARFLRDLSRL---GRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKNQGR 163
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAF+A A+EGIN+IVTG L+SLSEQ+L+DC + N GC GG A+Q++I N G
Sbjct: 164 CGSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCS-TRNYGCEGGWPYRAFQYIINNGG 222
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
+++E+ YPY G G CN K N H+V+ID Y++VP N+EK L +A QP+SVGI S R
Sbjct: 223 VNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRNVPSNDEKSLQKAAANQPISVGIDASGR 282
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQLY SGIFTG C+TSL+H V +VGY +ENG DYWI+KNSWG +WG +GY+ M+RN
Sbjct: 283 NFQLYHSGIFTGSCNTSLNHGVTVVGYGTENGNDYWIVKNSWGENWGNSGYILMERNIAE 342
Query: 318 SLGICGINMLASYPTKTGQNPPPSPPPGP----------TRCSLLTYCAAGETCCCGSSI 367
S G CGI + SYP K G +P T C C+ TCCC
Sbjct: 343 SSGKCGIAISPSYPIKVGATNLRNPTTSSSSVPSLVESLTACDNYYTCSGSTTCCCMHER 402
Query: 368 LGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
C +W CC A CC DH CCP NYPIC CL
Sbjct: 403 GNRCFAWGCCPLEGATCCKDHYSCCPFNYPICSVADDNCL 442
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 160/305 (52%), Positives = 205/305 (67%), Gaps = 24/305 (7%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
FE+W +HGK Y S +EK R ++F +N + + N SS+ L LN FADL+H+EFK+
Sbjct: 49 FESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKE-VSSYWLGLNEFADLSHEEFKS 107
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
++ D+P S+DWRKKGAVT VK+Q +CG+CWAFS
Sbjct: 108 K-----------------------DVADLPESVDWRKKGAVTHVKNQGACGSCWAFSTVA 144
Query: 149 AIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
A+EGIN+IVTG+L +LSEQELIDCD ++NSGC GGLMDYA+ F+ N G+ E DYPY
Sbjct: 145 AVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDDYPYLM 204
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
+ G C +QK + IVTI GY+DVPE +E+ LL+A+ QP+SV I S R FQ YS G+F
Sbjct: 205 EEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSGGVFN 264
Query: 269 GPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLA 328
GPC T LDH V VGY S G+DY I+KNSWG WG GY+ M+RNTG + G+CGIN +A
Sbjct: 265 GPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCGINKMA 324
Query: 329 SYPTK 333
SYPTK
Sbjct: 325 SYPTK 329
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 180/400 (45%), Positives = 241/400 (60%), Gaps = 20/400 (5%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
++ +++ W +H A + + RL++F++N FV +HN + G ++ L +N FAD
Sbjct: 38 EVRIIYQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFAD 97
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD---VPASIDWRKKGAVTEVKDQAS 137
LT++E++A FL + R + + + LR+ +P SIDWR+KGAV VK Q
Sbjct: 98 LTNEEYRARFLRDLSRL---GRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQGR 154
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAF+A +EGIN+IVTG L+SLSEQ+L+DC + N GC GG A+Q++I N G
Sbjct: 155 CGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCS-TRNHGCEGGWPYRAFQYIINNGG 213
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
+++E+ YPY G G CN K N H+V+ID Y++VP N+EK L +AV QP+SVGI S R
Sbjct: 214 VNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASGR 273
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQLY SGIFTG C+TSL+H V +VGY + NG DYWI+KNSWG SWG +GY+ M+RN
Sbjct: 274 NFQLYHSGIFTGSCNTSLNHGVTVVGYGTVNGNDYWIVKNSWGESWGDSGYILMERNIAE 333
Query: 318 SLGICGINMLASYPTKTGQNPPPSPPPGP----------TRCSLLTYCAAGETCCCGSSI 367
S G CGI + SYP K G +P T C CA TCCC
Sbjct: 334 SSGKCGIAISPSYPIKEGATNLRNPTTSSSSVPSLVESLTACDNYYTCAGSTTCCCMYER 393
Query: 368 LGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
C +W CC A CC DH CCP NYPIC CL
Sbjct: 394 GNRCFAWGCCPVEGATCCKDHYSCCPFNYPICSVADDNCL 433
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 338 bits (867), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 158/315 (50%), Positives = 220/315 (69%), Gaps = 5/315 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+++ ++E W ++ K Y+ EK++R KIF+DN FV +HN++ + +F + L FADLT
Sbjct: 38 TEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLT 97
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++EF+A +L + + G++ +P +DWR GAV VKDQ +CG+CW
Sbjct: 98 NEEFRAIYLRKKMERTKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCGSCW 155
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA GA+EGIN+I TG L+SLSEQEL+DCDR + N+GC GG+M+YA++F++KN GI+T+
Sbjct: 156 AFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETD 215
Query: 202 KDYPYRG-QAGQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
+DYPY G CN K N +VTIDGY+DVP ++EK L +AV QPVSV I S +AF
Sbjct: 216 QDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAF 275
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SG+ TG C SLDH V++VGY S +G DYWII+NSWG +WG +GY+ +QRN +
Sbjct: 276 QLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPF 335
Query: 320 GICGINMLASYPTKT 334
G CGI M+ SYPTK+
Sbjct: 336 GKCGIAMMPSYPTKS 350
>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
Precursor
gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
Length = 346
Score = 337 bits (865), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 169/308 (54%), Positives = 209/308 (67%), Gaps = 7/308 (2%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P SIDWR+KG + VKDQ SCG+CWAFSA A+E IN IVTG+L+SLSEQEL+DCDRSY
Sbjct: 18 LPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSY 77
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMDYA++FVIKN GIDTE+DYPY+ + G C++ + N +V ID Y+DVP NNE
Sbjct: 78 NEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNE 137
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
K L +AV QPVS+ + R FQ Y SGIFTG C T++DH V+I GY +ENG+DYWI++
Sbjct: 138 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIVR 197
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTG------QNPPPSPPPGPTRCS 350
NSWG + NGY+ +QRN +S G+CG+ + SYP KTG PPSP PT C
Sbjct: 198 NSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPVKTGPNPPKPAPSPPSPVKPPTECD 257
Query: 351 LLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVS 410
+ CA G TCCC C SW CC A CC DH CCP +YPIC+ VR ++S
Sbjct: 258 EYSQCAVGTTCCCILQFRRSCFSWGCCPLEGATCCEDHYSCCPHDYPICN-VRQGTCSMS 316
Query: 411 LKFSFTVK 418
VK
Sbjct: 317 KGNPLGVK 324
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 337 bits (864), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 157/311 (50%), Positives = 212/311 (68%), Gaps = 9/311 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W HG+ Y+ EK++R +IF DN ++ +HN N ++ L LN FAD+TH EFK
Sbjct: 33 LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A + G + + + + + NL P DWR KGAV VK+Q +CG+CWAFS
Sbjct: 93 ALYFG-TKVPLSNTIKSGFRYEDATNL---PLDTDWRSKGAVATVKNQGACGSCWAFSTV 148
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EG+N+IVTG LVSLSEQEL+DCD+ N GC GGLMD A++F+I+N G+D+E DYPY+
Sbjct: 149 AAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYK 208
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
+G C++ + N H+VTIDG++DVP +E LL+AV QPVSV I S R FQLYS G++
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVY 268
Query: 268 TGPCSTSLDHAVLIVGYDSE---NGV--DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
TG C LDH V+ VGY + +GV DYWI++NSWG +WG +GY+ +QRN +S G C
Sbjct: 269 TGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSRGKC 328
Query: 323 GINMLASYPTK 333
GI M+ASYP K
Sbjct: 329 GIAMMASYPVK 339
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 337 bits (864), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 162/324 (50%), Positives = 215/324 (66%), Gaps = 9/324 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHNNMG-NSSFTLSLNAF 78
++ ++ W HGK ++ ++ +R IF+DN F+ HN N+++ L L F
Sbjct: 44 EVRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLTKF 103
Query: 79 ADLTHQEFKASFLGFSAA---SIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
DLT++E+++ +LG I + N + + ++VP ++DWR KGAV +KDQ
Sbjct: 104 TDLTNEEYRSLYLGARTEPVRRIAKAKNVNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQ 163
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
+CG+CWAFS A+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA+QF++KN
Sbjct: 164 GTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFIMKN 223
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
G+ TEKDYPYRG G+CN N +V+IDGY+DVP +E L +A+ QPVSV I
Sbjct: 224 GGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAIEAG 283
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
R FQ Y +GIFTG C T+LDHAV+ VGY SENGVDYWI++NSWG WG GY+ M+RN
Sbjct: 284 GRIFQHYQTGIFTGNCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNL 343
Query: 316 GNSL-GICGINMLASYPTKTGQNP 338
+S G CGI + ASYP K NP
Sbjct: 344 ASSKSGKCGIAVEASYPVKYSPNP 367
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 337 bits (863), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 158/315 (50%), Positives = 220/315 (69%), Gaps = 5/315 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+++ ++E W ++ K Y+ EK++R KIF+DN FV +HN++ + +F + L FADLT
Sbjct: 38 TEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLT 97
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++EF+A +L + + G++ +P +DWR GAV VKDQ +CG+CW
Sbjct: 98 NEEFRAIYLRKKMERNKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCGSCW 155
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA GA+EGIN+I TG L+SLSEQEL+DCDR + N+GC GG+M+YA++F++KN GI+T+
Sbjct: 156 AFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETD 215
Query: 202 KDYPYRG-QAGQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
+DYPY G CN K N +VTIDGY+DVP ++EK L +AV QPVSV I S +AF
Sbjct: 216 QDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAF 275
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SG+ TG C SLDH V++VGY S +G DYWII+NSWG +WG +GY+ +QRN +
Sbjct: 276 QLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPF 335
Query: 320 GICGINMLASYPTKT 334
G CGI M+ SYPTK+
Sbjct: 336 GKCGIAMMPSYPTKS 350
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 159/300 (53%), Positives = 208/300 (69%), Gaps = 5/300 (1%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFS 94
+HGK+Y S +EK R ++F+DN + + N SS+ L LN FADL+H+EFK +LG
Sbjct: 3 KHGKSYRSFEEKLHRFEVFQDNLKHIDETNKK-VSSYWLGLNEFADLSHEEFKRKYLGLK 61
Query: 95 AASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGI 153
I+ +RR++ + S ++ D+P S+DWRKKGAV VK+Q +CG+CWAFS A+EGI
Sbjct: 62 ---IELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGI 118
Query: 154 NKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQC 213
N+IVTG+L +LSEQELIDCD+ +N+GC GGLMDYA+ F+I N G+ E+DYPY + G C
Sbjct: 119 NQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEGTC 178
Query: 214 NKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCST 273
++K +VTI GY DVPE+NE+ L+A+ QP+SV I S R FQ YS GIF G C T
Sbjct: 179 GEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHCGT 238
Query: 274 SLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
LDH V VGY + GVDY +KNSWG WG GY+ M+RN G GICGI +ASYPTK
Sbjct: 239 ELDHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASYPTK 298
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 335 bits (859), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 156/311 (50%), Positives = 211/311 (67%), Gaps = 9/311 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W HG+ Y+ EK++R +IF DN ++ +HN N ++ L LN FAD+TH EFK
Sbjct: 33 LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A + G + + + + + NL P DWR KGAV VK+Q +CG+CWAFS
Sbjct: 93 ALYFG-TKVPLSNTIKSGFRYKDATNL---PLDTDWRSKGAVATVKNQGACGSCWAFSTV 148
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EG+N+IVTG LVSLSEQEL+DCD+ N GC GGLMD A++F+I+N G+D+E DYPY+
Sbjct: 149 AAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYK 208
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
+G C++ + N H+VTIDG++DVP +E LL+AV QPVSV I S R FQLYS G++
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVY 268
Query: 268 TGPCSTSLDHAVLIVGYDSE---NGV--DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
TG C LDH V+ VGY + +GV DYWI++NSWG +WG +GY+ +QRN + G C
Sbjct: 269 TGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPRGKC 328
Query: 323 GINMLASYPTK 333
GI M+ASYP K
Sbjct: 329 GIAMMASYPVK 339
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 228/348 (65%), Gaps = 12/348 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS-------DINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL N + ++ ++E+W ++GK+Y+S E ++R +IF++
Sbjct: 9 SMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
F+ +HN N S+ + LN FADLT +EF++++LGF++ S ++ + ++ P +
Sbjct: 69 TLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS---NKTKVSNRYEPRVGQ 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELIDC R+
Sbjct: 126 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N+ GC GG + +QF+I N GI+TE++YPY Q G+CN + N VTID Y++VP N
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTIDTYENVPYN 245
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+KNSW +WG GYM + RN G + G CGI + SYP K P P
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNYPEP 352
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 161/344 (46%), Positives = 230/344 (66%), Gaps = 8/344 (2%)
Query: 3 SLAFFLLSILLLSSLPLN---YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
S++ S LL+ SL L+ ++ ++E+W +HGK+Y+S E+++R +IF++ F
Sbjct: 9 SMSLLFFSTLLILSLALDAKRTNDEVKAMYESWLIKHGKSYNSLGERERRFEIFKETLRF 68
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
+ +HN + S+ + LN FADLT++EF++++LGF+ S ++ + ++ P + +P
Sbjct: 69 IDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRGS---NKTKVSNRYEPRVGQVLPD 125
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS- 178
+DWR +GAV ++K+Q CG+CWAFSA A+EGINKIVTG+L+SLSEQEL+DC R+ ++
Sbjct: 126 YVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQELVDCGRTQSTK 185
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG M ++F+I N GI+TE++YPY Q GQC+ N VTID Y++VP NE
Sbjct: 186 GCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQCDLNLQNEKYVTIDNYENVPYYNEWA 245
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNS 298
L AV QPVSV + + AFQ YSSGIFTGPC T+ DHAV IVGY +E G+DYWI+KNS
Sbjct: 246 LQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYGTEGGIDYWIVKNS 305
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
W +WG GYM + RN G + G CGI + SYP K P P
Sbjct: 306 WDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 348
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 162/322 (50%), Positives = 213/322 (66%), Gaps = 12/322 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++E W +H K Y+ EK R +IF+DN F+ +HN N S+ + LN FAD+ ++E++
Sbjct: 3 MYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ-NYSYKVGLNKFADINNEEYR 61
Query: 88 ASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGACW 142
+LG + + +RR + G N V +DWR KGAVT +KDQ SCG+CW
Sbjct: 62 DMYLGTKSDA----KRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCW 117
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFS +E INKIVTG VSLSEQEL+DCDR++N GC GGLMDYA++F+I+N GIDT++
Sbjct: 118 AFSTIATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQ 177
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY G +C+ K N +V+IDGY+DVP + L +AV QPVSV I G RA QLY
Sbjct: 178 DYPYNGFERKCDPTKKNAKVVSIDGYEDVP-SYMNALKKAVAHQPVSVAIAGLGRALQLY 236
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM-QRNTGNSLGI 321
SG+FTG C T LDH V++VGY SENGVDYW+++NSWG +WG +GY + RN +
Sbjct: 237 QSGVFTGKCGTDLDHGVVVVGYGSENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRK 296
Query: 322 CGINMLASYPTKTGQNPPPSPP 343
CGI M ASYP K GQN + P
Sbjct: 297 CGIAMEASYPVKYGQNTNSAAP 318
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 334 bits (856), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 167/347 (48%), Positives = 234/347 (67%), Gaps = 18/347 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINE---------LFETWCKQHGKAYSSEQEKQQRLKIFE 54
+A L S++L + L+ D++ ++E W +H K Y EK QR +IF+
Sbjct: 1 MASILYSLILFGLITLSLSLDMSSGRSNKEVMTMYEKWLVKHQKVYYGLGEKNQRFQIFK 60
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ---SP 111
DN F+ +HN N S+ + LN F+D+T++E++ ++L S S ++ + + SV+
Sbjct: 61 DNLIFIDEHN-APNHSYRVGLNEFSDITNKEYRDTYL--SRWSNNNIKNKITSVRYAYKA 117
Query: 112 GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
G+ +P S+DWR GA+T +K+Q SCGACWAFSA A+E INKIVTGSLVSLSEQEL+D
Sbjct: 118 GHNNKLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVD 175
Query: 172 CDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231
CDR+ N GC GG AY+F+++N G+D++ DYPY G+ CN+ K N +V+I+GYK+V
Sbjct: 176 CDRTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKVVSINGYKNV 235
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVD 291
N+E L++AV QPVSVGI + FQLY SG+FTG C TSLDHAV++VGY SENG D
Sbjct: 236 QRNSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYGSENGKD 295
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNS-LGICGINMLASYPTKTGQN 337
YW++KNSWG +WG GY+ ++RN N+ G CGI M A+YPTK +N
Sbjct: 296 YWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPTKLREN 342
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 333 bits (855), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 164/298 (55%), Positives = 200/298 (67%), Gaps = 7/298 (2%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P S+DWRK+GAV VKDQASCG+CWAFSA A+EGINKIVTG L+SLSEQEL+DCD SY
Sbjct: 24 LPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSY 83
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMDYA++F+I N GID+E DYPY+ G+C++ + N +VTID Y+DVP +E
Sbjct: 84 NEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDE 143
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
L +AV QP++V + G R FQLY G+ TG C T+LDH V VGY +ENG DYWI++
Sbjct: 144 LALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTALDHGVAAVGYGTENGKDYWIVR 203
Query: 297 NSWGRSWGMNGYMHMQRNTGNS-LGICGINMLASYPTKTGQNPPPSPPPGPTR------C 349
NSWG SWG GY+ ++RN +S G CGI + SYP K GQNPP P P+ C
Sbjct: 204 NSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKNGQNPPNPGPSPPSPIKPPSVC 263
Query: 350 SLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CA G TCCC C W CC SA CC DH CCP YP+CD+ CL
Sbjct: 264 DSYYSCAEGSTCCCIYEYGRSCFEWGCCPLESATCCDDHYSCCPHEYPVCDTRAGLCL 321
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 333 bits (854), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 164/348 (47%), Positives = 229/348 (65%), Gaps = 13/348 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS-------DINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL N + ++ ++E+W ++GK+Y+S E ++R +IF++
Sbjct: 9 SMSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
F+ +HN N S+ + LN FADLT +EF++++LGF++ S ++ + ++ P +
Sbjct: 69 TLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS---NKTKVSNRYEPRVGQ 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELIDC R+
Sbjct: 126 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N+ GC GG + +QF+I N GI+TE++YPY Q G+CN N VTID Y++VP N
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYN 245
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK-TGQNPPPS 341
+KNSW +WG GYM + RN G + G CGI + SYP K QN P S
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKS 352
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 333 bits (854), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 227/348 (65%), Gaps = 12/348 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS-------DINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL N + ++ ++E+W ++GK+Y+S E ++R +IF++
Sbjct: 9 SMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
F+ +HN N S+ + LN FADLT +EF++++LGF++ S ++ + ++ P +
Sbjct: 69 TLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS---NKTKVSNRYEPRVGQ 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELIDC R+
Sbjct: 126 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N+ GC GG + +QF+I N GI+TE++YPY Q G+CN N VTID Y++VP N
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYN 245
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+KNSW +WG GYM + RN G + G CGI + SYP K P P
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 352
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 227/348 (65%), Gaps = 12/348 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS-------DINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL N + ++ ++E+W ++GK+Y+S E ++R +IF++
Sbjct: 9 SMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
F+ +HN N S+ + LN FADLT +EF++++LGF++ S ++ + ++ P +
Sbjct: 69 TLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS---NKTKVSNRYEPRFGQ 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELIDC R+
Sbjct: 126 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N+ GC GG + +QF+I N GI+TE++YPY Q G+CN N VTID Y++VP N
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYN 245
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+KNSW +WG GYM + RN G + G CGI + SYP K P P
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 352
>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
Length = 501
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 187/417 (44%), Positives = 254/417 (60%), Gaps = 33/417 (7%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSF--TLSLNAFADLT 82
+++LF W + HGK Y E+E+ RL+ F+ + FV + N+ S T+ LN FADL+
Sbjct: 46 VSDLFGKWKELHGKTYQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLS 105
Query: 83 HQEFKASFLGFSAASIDHDRR-----RNASVQSPGNLRDVPASIDWRKKGAVTEVKDQAS 137
++EFK ++ S ++ + RN SV S D P S+DWR KG VT +KDQ
Sbjct: 106 NEEFKEMYMSKVKGSRSNELKMGGVKRNMSVSS--RTCDAPTSLDWRDKGVVTPMKDQGQ 163
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS +G+IE N I TG L+ LSEQEL+DCD +Y+ GC GG MD AY+++IKN G
Sbjct: 164 CGSCWAFSVSGSIESANAIATGDLIRLSEQELVDCD-TYDYGCDGGNMDTAYRWIIKNGG 222
Query: 198 IDTEKDYPY---RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
+D+E DYPY G+ G+C+K K + +V++D Y +V E+NE +L AV PV++GI G
Sbjct: 223 LDSEDDYPYTSSNGRDGKCDKTKSAKSVVSLDSYVEV-ESNEDAVLCAVATTPVTIGIVG 281
Query: 255 SERAFQLYSSGIFTGPCST---SLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311
S FQLY+ G++ G CS+ +DHAVLIVGY S++G DYWI+KNSWG WG+ GY+ M
Sbjct: 282 SAYDFQLYTGGVYNGQCSSKPYDIDHAVLIVGYGSQDGKDYWIVKNSWGTYWGLEGYILM 341
Query: 312 QRNTGNSLGICGINMLASYP----------------TKTGQNPPPSPPPGPTRCSLLTYC 355
+RNT G+CG+ + YP PPP PP P++C YC
Sbjct: 342 ERNTDIKNGVCGMYLEPVYPITAAPTPPGPPPPPAPPSPPHPPPPPTPPAPSKCGDFHYC 401
Query: 356 AAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLK 412
AA +TCCC CL + CCG+S AVCC + CCPS+YPICD C S K
Sbjct: 402 AADQTCCCIFEFYNYCLIYGCCGYSDAVCCKNSAACCPSDYPICDVQAGYCYKNSAK 458
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 331 bits (849), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 168/347 (48%), Positives = 228/347 (65%), Gaps = 12/347 (3%)
Query: 3 SLAFFLLSILLLSSLPL-----NYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
SL FF ++L S+L + + +++E+W + GK+Y+S EK+ R +IF+DN
Sbjct: 11 SLLFFSTLLILSSALDIVNSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFKDNL 70
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
+ HN N SF+L LN FADLT +E+++++LGF + + N V G++ +
Sbjct: 71 RIIDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFKSGP--KAKVSNRYVPKVGDV--L 126
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P +DWR GAV VK+Q C +CWAFSA A+EGINKI+TG+L+SLSEQEL+DC R+ +
Sbjct: 127 PNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCGRTQS 186
Query: 178 S-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC G M A+QF+I N GI+TE +YPY Q GQCN+ N+ VTID Y++VP NNE
Sbjct: 187 TRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQNQKYVTIDDYENVPSNNE 246
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
L AV QPVSVG+ F+LY+SGIFT C T++DH V IVGY +E G+DYWI+K
Sbjct: 247 WALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYGTERGLDYWIVK 306
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP-PPSP 342
NSWG +WG NGY+ +QRN G + G CGI +ASYP K NP P P
Sbjct: 307 NSWGTNWGENGYIRIQRNIGGA-GKCGIARMASYPVKYNSNPLKPYP 352
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 331 bits (849), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 166/328 (50%), Positives = 214/328 (65%), Gaps = 5/328 (1%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L+ + ELFE W +H KAY+S +EK R ++F+DN + + N
Sbjct: 29 FSIVGYSEEDLSSNERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINRE-V 87
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
+S+ L LN FADLTH EFKA++LG AA R+ + + D+P S+DWRKKGA
Sbjct: 88 TSYWLGLNEFADLTHDEFKAAYLGLDAAPARRGSSRSFRYEDV-SASDLPKSVDWRKKGA 146
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VTEVK+Q CG+CWAFS A+EGIN IVTG+L +LSEQELIDC NSGC GGLMDYA
Sbjct: 147 VTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYA 206
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQC-NKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 247
+ ++ + G+ TE+ YPY + G C + +K VTI GY+DVP N+E+ L++A+ QP
Sbjct: 207 FSYIASSGGLHTEEAYPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQP 266
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGV--DYWIIKNSWGRSWGM 305
VSV I S R FQ YS G+F GPC LDH V VGY S+ G DY I++NSWG WG
Sbjct: 267 VSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAQWGE 326
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPTK 333
GY+ M+R T N G+CGIN +ASYPTK
Sbjct: 327 KGYIRMKRGTSNGEGLCGINKMASYPTK 354
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 330 bits (847), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 172/352 (48%), Positives = 220/352 (62%), Gaps = 18/352 (5%)
Query: 3 SLAFFLLSIL-LLSSLPLNYCSDINE-----LFETWCKQHGKAYSSEQEKQQRLKIFEDN 56
+LA LS L + S+P +E L+E W H A + EK +R +F++N
Sbjct: 8 ALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVARDLD-EKNRRFNVFKEN 66
Query: 57 YAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG---- 112
F+ + N ++ + L+LN F D+T+QEF++ + G + I H R + ++ G
Sbjct: 67 VKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAG---SKIQHHRSQRGIQKNTGSFMY 123
Query: 113 -NLRDVPA-SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
N+ +PA SIDWR KGAVT VKDQ CG+CWAFS ++EGIN+I TG LVSLSEQEL+
Sbjct: 124 ENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELV 183
Query: 171 DCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
DCD SYN GC GGLMDYA++F+ KN GI TE YPY Q G C LN +V+IDG++D
Sbjct: 184 DCDTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQD 242
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENG 289
VP NNE L+QAV QP+SV I S FQ YS G+FTG C T LDH V IVGY + +G
Sbjct: 243 VPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDG 302
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPS 341
YWI+KNSWG WG +GY+ MQR + G CGI M ASYP KT NP S
Sbjct: 303 TKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPIKTSANPKNS 354
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 330 bits (847), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 167/330 (50%), Positives = 216/330 (65%), Gaps = 6/330 (1%)
Query: 3 SLAFFL-LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
SLAF SI+ SS L + ELFE+W +HGK Y S +EK R +IF+DN +
Sbjct: 20 SLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHID 79
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
+ N + S++ L LN FADL+HQEFK +LG +D+ RRR + + ++P S+
Sbjct: 80 ERNKV-VSNYWLGLNEFADLSHQEFKNKYLGLK---VDYSRRRESPEEFTYKDVELPKSV 135
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWRKKGAV VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQELIDCDR+Y++GC
Sbjct: 136 DWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCN 195
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMDYA+ F+++N G+ E+DYPY + G C K +VTI GY DVP+NNE+ LL+
Sbjct: 196 GGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLK 255
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
A+ Q +SV I S R FQ YS G+F G C + LDH V VGY + GVDY I+KNSWG
Sbjct: 256 ALANQSLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYIIVKNSWGS 315
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WG GY+ M R T + G +ASYP
Sbjct: 316 KWGEKGYIRM-RGTLETRGNLRYLQMASYP 344
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 330 bits (846), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 157/335 (46%), Positives = 225/335 (67%), Gaps = 13/335 (3%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
+L I LL+ + ++ + ++++ W ++HGKAY+S E ++R +IF++N ++ HN
Sbjct: 15 LWLKPIHLLTRISWHFIDPLWQVYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNA 74
Query: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR---DVPASID 122
N+S +L LN FADLT+ EF+ ++G +R A G++ D S+D
Sbjct: 75 RRNNSHSLGLNKFADLTNSEFRGLYVG--------RLQRPAPFHEVGDIALVADTATSVD 126
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182
WRKKG VTE+KDQ CG+CWAFSA A+EG+ + TG+LVSLSEQEL+DCD + N GC G
Sbjct: 127 WRKKGGVTEIKDQGDCGSCWAFSAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDG 186
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242
G+MDYA+Q++I+N GI ++ +YPYR G C+K K+ H TI+G++ +P +E+ LL+A
Sbjct: 187 GIMDYAFQYMIRNGGITSQSNYPYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRA 246
Query: 243 VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGR 301
V QPVSV I + FQLYSSG+FTG C ++LDH V IVGY ++ G YW++KNSWG
Sbjct: 247 VANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGS 306
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQ 336
WG +GY+ M+R G G+CGIN+ ASYPTK Q
Sbjct: 307 GWGESGYVRMERQ-GPGAGVCGINLDASYPTKIQQ 340
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c 1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 330 bits (846), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 161/348 (46%), Positives = 226/348 (64%), Gaps = 12/348 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS-------DINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL N + ++ ++E+W ++GK+Y+S E ++R +IF++
Sbjct: 9 SMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
F+ +HN N S+ + LN FADLT +EF++++L F++ S ++ + ++ P +
Sbjct: 69 TLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGS---NKTKVSNRYEPRVGQ 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELIDC R+
Sbjct: 126 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N+ GC GG + +QF+I N GI+TE++YPY Q G+CN N VTID Y++VP N
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYN 245
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWI 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+KNSW +WG GYM + RN G + G CGI + SYP K P P
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 352
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 205/309 (66%), Gaps = 8/309 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+LFE+W + G+ Y S +EK +R +IF+DN F N ++ L LN FADL+H+EF
Sbjct: 45 DLFESWISRFGRVYESAEEKLERFEIFKDN-LFHIDDTNKKVRNYWLGLNEFADLSHEEF 103
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDV--PASIDWRKKGAVTEVKDQASCGACWAF 144
K +LG D + A +DV P S+DWRKKGAVT VK+Q SCG+CWAF
Sbjct: 104 KNKYLGLKP-----DLSKRAQCPEEFTYKDVAIPKSVDWRKKGAVTPVKNQGSCGSCWAF 158
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
S A+EGIN+IVTG+L SLSEQELIDCD +YN+GC GGLMDYA+ +++ N G+ E+DY
Sbjct: 159 STVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGLHKEEDY 218
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PY + G C+ +K VTI GY DVP+N+E+ LL+A+ QP+S+ I S R FQ YS
Sbjct: 219 PYIMEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYSG 278
Query: 265 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
G+F G C T LDH V VGY + G+DY I+KNSWG WG GY+ M+R T GICGI
Sbjct: 279 GVFDGHCGTELDHGVAAVGYGTSKGLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEGICGI 338
Query: 325 NMLASYPTK 333
+ASYPTK
Sbjct: 339 YKMASYPTK 347
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 156/305 (51%), Positives = 204/305 (66%), Gaps = 2/305 (0%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
+TW Q+G+ Y EK++R KIF++N F+ NN GN + L +NAF DLT++EF+AS
Sbjct: 39 KTWMTQYGRVYKGNVEKEKRFKIFKENVEFIESFNNNGNKPYKLGINAFTDLTNEEFRAS 98
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
G++ + H N+ VP S+DWR KGAVT +KDQ CG CWAFSA A
Sbjct: 99 HNGYTMSMSSHQSSYRTKSFRYENVTAVPPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAA 158
Query: 150 IEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
+EGI K+ TG+L+SLSEQEL+DCD S + GC GGLMD A++F+I+N+G+ TE +YPY G
Sbjct: 159 MEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIENNGLTTEANYPYEG 218
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
G CN +K H I GY++VP +E+ L +AV QPVSV I E AFQ YSSGIFT
Sbjct: 219 VDGSCNTRKAANHAAKITGYENVPAYDEEALRKAVANQPVSVAIDAGESAFQHYSSGIFT 278
Query: 269 GPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
G C T LDH V +VGY S++G YW++KNSWG SWG +GY+ M+R+ G+CGI M
Sbjct: 279 GDCGTELDHGVTVVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERDIDAKEGLCGIAME 338
Query: 328 ASYPT 332
SYPT
Sbjct: 339 PSYPT 343
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 165/349 (47%), Positives = 224/349 (64%), Gaps = 14/349 (4%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINE-------LFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL L+ + + ++E+W + GK+Y+S EK+ R +IF++
Sbjct: 9 SMSLLFFSTLLILSLALDIENSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
N + HN N S++L LN FADLT +E+++++LG + ++ P
Sbjct: 69 NLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGP----KTDVSNEYMPKVGE 124
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P +DWR GAV VK+Q C +CWAFSA A+EGINKIVTG+L+SLSEQEL+DC R+
Sbjct: 125 ALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCGRT 184
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
+ GC GLM A+QF+I N GI+TE +YPY + GQCN N+ VTID YK+VP N
Sbjct: 185 QRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYKNVPSN 244
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L +AV QPVSVG+ F+LY+SGIFTG C T++DH V IVGY +E G+DYWI
Sbjct: 245 NEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYGTERGMDYWI 304
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP-PPSP 342
+KNSWG +WG NGY+ +QRN G + G CGI + SYP K NP P P
Sbjct: 305 VKNSWGTNWGENGYIRIQRNIGGA-GKCGIARMPSYPVKYTTNPLKPYP 352
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium distachyon]
Length = 457
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 164/328 (50%), Positives = 216/328 (65%), Gaps = 5/328 (1%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L+ I ELFE W +H KAY+S +EK R ++F+DN + + N
Sbjct: 130 FSIVGYSEEDLSSNDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNRE-V 188
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
+S+ L LN FADLTH+EFKA++LG + + + R + + + D+P S+DWR KGA
Sbjct: 189 TSYWLGLNEFADLTHEEFKATYLGLAPPAPARESRGSFKYEDV-SADDLPKSVDWRTKGA 247
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VTEVK+Q CG+CWAFS A+EGIN IVTG+L +LSEQELIDC N+GC GGLMDYA
Sbjct: 248 VTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDYA 307
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQC-NKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 247
+ ++ + G+ TE+ YPY + G C + +K VTI GY+DVP +NE+ L++A+ QP
Sbjct: 308 FSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQP 367
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGV--DYWIIKNSWGRSWGM 305
VSV I S R FQ YS G+F GPC T LDH V VGY S+ G DY I++NSWG WG
Sbjct: 368 VSVAIEASGRHFQFYSGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWGE 427
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPTK 333
GY+ M+R TG G+CGIN +ASYPTK
Sbjct: 428 KGYIRMKRGTGKGEGLCGINKMASYPTK 455
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 227/348 (65%), Gaps = 13/348 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS-------DINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL N + ++ ++E+W ++GK+Y+S E ++R +IF++
Sbjct: 9 SMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
F+ +HN N S+ + LN FADLT +EF++++LGF++ S ++ + ++ P +
Sbjct: 69 TLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS---NKTKVSNRYEPRVGQ 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELIDC R+
Sbjct: 126 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N+ GC G + + F+I N GI+TE++YPY Q G+CN N VTID Y++VP N
Sbjct: 186 QNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYN 245
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK-TGQNPPPS 341
+KNSW +WG GYM + RN G + G CGI + SYP K QN P S
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKS 352
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 328 bits (840), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 174/331 (52%), Positives = 218/331 (65%), Gaps = 18/331 (5%)
Query: 19 LNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAF 78
+N + + LF+ W +HGK Y S +EK +RL+IF N ++ HN NSSF L LN F
Sbjct: 33 INSGNGLVRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKF 92
Query: 79 ADLTHQEFKASFLGFSAASIDHDRRRNA------------SVQSPGNLRDVPASIDWRKK 126
ADLT++EFK + G ++ DRRR +V S + + +S+DWRKK
Sbjct: 93 ADLTNEEFKTRYFGKNSKQW-RDRRRTELEGAELRPVLKQTVGSQSSSCSIASSLDWRKK 151
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
GAVT VKDQA CG+CWAFS TGAIEG+N I TG LVSLSEQEL+ CD + N GC GG MD
Sbjct: 152 GAVTGVKDQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACDAT-NYGCEGGDMD 210
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
YA+ +VI+N GIDTEKDY Y G CN K + IV+IDGY DV ++ LL A +Q
Sbjct: 211 YAFTWVIQNGGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGYTDVSP-DDSALLCAAGSQ 269
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCS---TSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 303
PVSVGI GS FQLY+ GI+ G CS +DHAVL+VGY ++NG DYWI+KNSWG W
Sbjct: 270 PVSVGIDGSAIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGYSAKNGKDYWIVKNSWGTDW 329
Query: 304 GMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
G+ GY ++ RNT G+C IN +ASYPTKT
Sbjct: 330 GLEGYFYILRNTELPYGVCAINAMASYPTKT 360
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 328 bits (840), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 167/348 (47%), Positives = 227/348 (65%), Gaps = 14/348 (4%)
Query: 3 SLAFFLLSILLLSSLPL-NYCSDINE----LFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
SL FF ++L S++ + N N+ ++E+W +HGK+Y+S EK+ R +IF++N
Sbjct: 11 SLLFFSTLLILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFKENL 70
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD- 116
+ HN N S++L LN FADLT +E+++++LG + + S Q + D
Sbjct: 71 RIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKRGP-----KTDVSNQYMPKVGDA 125
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS- 175
+P +DWR GAV VK+Q C +CWAFSA A+EGINKIVTG+L+SLSEQEL+DC R+
Sbjct: 126 LPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQ 185
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
GC GLM A++F+I N GI+TE +YPY + GQCN N+ VTID YK+VP NN
Sbjct: 186 ITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTIDSYKNVPSNN 245
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E L +AV QPVSVG+ F+LY+SGIFTG C T++DH V IVGY +E G+DYWI+
Sbjct: 246 EMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGTERGMDYWIV 305
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP-PPSP 342
KNSWG +WG +GY+ +QRN G + G CGI + SYP K NP P P
Sbjct: 306 KNSWGTNWGESGYIRIQRNIGGA-GKCGIAKMPSYPVKYTSNPLKPYP 352
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 328 bits (840), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 159/325 (48%), Positives = 207/325 (63%), Gaps = 29/325 (8%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L + ELFE+W +HGK Y S +EK RL++F+DN + + N
Sbjct: 27 FSIVGYSPEHLTSMHKLTELFESWMSKHGKTYESIEEKLHRLEVFKDNLMHIDRRNR-DV 85
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
+++ L+LN FADL+H+EFK+ A I +KGA
Sbjct: 86 TTYWLALNEFADLSHEEFKSKL----------------------------AQIRRLEKGA 117
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
V VK+Q SCG+CWAFS A+EGIN+IVTG+L SLSEQELIDCD S+NSGC GGLMDYA
Sbjct: 118 VAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTSFNSGCNGGLMDYA 177
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+ +++ N G+ E+DYPY + G C++++ +VTI GY DVPENNE+ LL+A+ QP+
Sbjct: 178 FDYIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTISGYHDVPENNEESLLKALAHQPL 237
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
S+ I S R FQ Y G+F GPC T LDH V VGY S G+DY I+KNSWG WG GY
Sbjct: 238 SIAIEASGRDFQFYGRGVFNGPCGTDLDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGY 297
Query: 309 MHMQRNTGNSLGICGINMLASYPTK 333
+ M+RNTG G+CGIN +ASYPTK
Sbjct: 298 IRMKRNTGKPEGLCGINKMASYPTK 322
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 327 bits (839), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 164/337 (48%), Positives = 218/337 (64%), Gaps = 9/337 (2%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M+S + SI+ S L + + +LFE W ++ KAY+S +EK R ++F+DN +
Sbjct: 38 MDSDSDDFFSIVGYSPEDLVHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHI 97
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGF---SAASIDHDRRRNASVQSPGNLRDV 117
+ N +++ L LNAFADLTH EFKA++LG R R V DV
Sbjct: 98 DEANKK-VTTYWLGLNAFADLTHDEFKATYLGLRQPETKKTTDSRFRYGGVADD----DV 152
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
PAS+DWRKKGAVT+VK+Q CG+CWAFS A+EGIN+IVTG+L SLSEQEL+DC N
Sbjct: 153 PASVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGN 212
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVPENNE 236
+GC GG+MD A+ ++ + G+ TE+ YPY + G C+ K + +VTI GY+DVP N+E
Sbjct: 213 NGCNGGVMDNAFSYIASSGGLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDE 272
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
+ L++A+ QP+SV I S R FQ YS G+F GPC + LDH V VGY S G DY I+K
Sbjct: 273 QALVKALAHQPLSVAIEASGRHFQFYSGGVFNGPCGSELDHGVAAVGYGSSKGQDYIIVK 332
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG WG GY+ M+R TG G+CGIN +ASYPTK
Sbjct: 333 NSWGSHWGEKGYIRMKRGTGKPEGLCGINKMASYPTK 369
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 327 bits (838), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 165/324 (50%), Positives = 213/324 (65%), Gaps = 8/324 (2%)
Query: 15 SSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSF 71
SS + + ++ W QHG ++E+E R + F DN ++ +HN + G SF
Sbjct: 29 SSGQIRSEEETRRMYAEWTAQHGSPITNEEEG--RYEAFRDNLRYIDEHNAAADAGIHSF 86
Query: 72 TLSLNAFADLTHQEFKASFLGFSAAS-IDHDRRRNASVQSPGNLRDVPASIDWRKKGAVT 130
L LN FA LT++E++A++LG S D R+ ++ + +P S+DWR+KGAV
Sbjct: 87 RLGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVDWREKGAVG 146
Query: 131 EVKDQA-SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 189
+VKDQ SCG+ WAFSA A+E IN+IVTG L+SLSEQEL+DCD SYN+GC GGLMD A+
Sbjct: 147 KVKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCDGGLMDDAF 206
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVS 249
+F+I N GIDT++DYPY+ + C+ K NR VTID Y+D+ NEK L +AV QPVS
Sbjct: 207 EFIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDL-RMNEKSLQKAVSNQPVS 265
Query: 250 VGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYM 309
V I R FQLY SGIFTG C T LDHA IVGY SENG DYWI+K S+G SWG +GY
Sbjct: 266 VAIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYGSENGTDYWIVKESYGTSWGESGYA 325
Query: 310 HMQRNTGNSLGICGINMLASYPTK 333
M+RN + G CGI ML SYP K
Sbjct: 326 RMERNIKETSGKCGIAMLPSYPVK 349
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 327 bits (837), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 166/345 (48%), Positives = 219/345 (63%), Gaps = 19/345 (5%)
Query: 3 SLAFFLLSILLLSSLP-----LNYCSD-------INELFETWCKQHGKAYSSEQEKQQRL 50
SL F +SIL S+L L Y + + LFE+W +H K Y S EK R
Sbjct: 11 SLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFYESLDEKLHRF 70
Query: 51 KIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS 110
+IF DN + + N S++ L LN FADLTH+EFK FLGF + R++ S +
Sbjct: 71 EIFMDNLKHIDE-TNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAE---RKDESSKE 126
Query: 111 PG--NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQE 168
G + D+P S+DWRKKGAV VK+Q CG+CWAFS A+EGIN+IVTG+L LSEQE
Sbjct: 127 FGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQE 186
Query: 169 LIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
LIDCD ++N+GC GGLMDYA+ +V+++ G+ E++YPY G C+++K VTI GY
Sbjct: 187 LIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSEKVTISGY 245
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN 288
DVP N+E L+A+ QP+SV I S R FQ YS G+F G C T LDH V VGY +
Sbjct: 246 HDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTK 305
Query: 289 GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
G+DY I++NSWG WG GY+ M+R +G G+CG+ M+ASYPTK
Sbjct: 306 GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYPTK 350
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 161/327 (49%), Positives = 210/327 (64%), Gaps = 7/327 (2%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SIL + L + LFE+W +H K Y S EK R +IF DN + N
Sbjct: 29 FSILGYAPEDLTSIHKVIHLFESWLAKHSKIYESLDEKLHRFEIFMDNLKHIDD-TNKKV 87
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ--SPGNLRDVPASIDWRKK 126
S++ L LN FADLTH+EFK FLG + R++ S++ S + D+P S+DWRKK
Sbjct: 88 SNYWLGLNEFADLTHEEFKNKFLGLKG---ELPERKDESIEEFSYRDFVDLPKSVDWRKK 144
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
GAV VK+Q CG+CWAFS A+EGIN+IVTG+L LSEQELIDCD ++N+GC GGLMD
Sbjct: 145 GAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMD 204
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
YA+ +V+++ G+ E++YPY G C+++K VTI GY DVP NNE L+A+ Q
Sbjct: 205 YAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSETVTISGYHDVPRNNEDSFLKALANQ 263
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMN 306
P+SV I S R FQ YS G+F G C T LDH V VGY + G+DY I++NSWG WG
Sbjct: 264 PISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTKGLDYVIVRNSWGPKWGEK 323
Query: 307 GYMHMQRNTGNSLGICGINMLASYPTK 333
GY+ M+R TG G+CG+ M+ASYPTK
Sbjct: 324 GYIRMKRKTGKPHGMCGLYMMASYPTK 350
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 162/341 (47%), Positives = 225/341 (65%), Gaps = 16/341 (4%)
Query: 6 FFLLSILLLSS------LPLNYC------SDINELFETWCKQHGKAYSSEQ-EKQQRLKI 52
FLL + +LS+ LP ++ +F+ W +HGK Y++ EK++R +
Sbjct: 12 LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQN 71
Query: 53 FEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG 112
F+DN F+ QHN N S+ L L FADLT QE++ F G + + V G
Sbjct: 72 FKDNLRFIDQHN-AKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLKTSRRYVPLAG 130
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
+ +P S+DWR++GAV+E+KDQ +C +CWAFS A+EG+NKIVTG L+SLSEQEL+DC
Sbjct: 131 D--QLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDC 188
Query: 173 DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVP 232
+ N G GLMD A+QF+I N+G+D+EKDYPY+G G CN+++++ ++TID Y+DVP
Sbjct: 189 NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLLVITIDSYEDVP 248
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDY 292
N+E L +AV QPVSVG+ + F LY S I+ GPC T+LDHA++IVGY SENG DY
Sbjct: 249 ANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGSENGQDY 308
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
WI++NSWG +WG GY+ + RN + G+CGI MLASYP K
Sbjct: 309 WIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIK 349
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 158/325 (48%), Positives = 213/325 (65%), Gaps = 2/325 (0%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
S++ S L + + LF +W +H K Y+S +EK +R +IF+ N + + N N
Sbjct: 27 SVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRR-NG 85
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKGA 128
S+ L LN FAD+ H+EFKAS+LG D + + S N ++P ++DWRKKGA
Sbjct: 86 SYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGA 145
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT VK+Q CG+CWAFS A+EGIN+IVTG LVSLSEQEL+DCD ++N GC GGLMD+A
Sbjct: 146 VTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDNTFNHGCRGGLMDFA 205
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+ +++ N GI TE+DYPY + G C +++ + ++TI GY+DVPEN+E LL+A+ QPV
Sbjct: 206 FAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSKVITITGYEDVPENSETSLLKALAHQPV 265
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SVGI R FQ Y GIF G C DHA+ VGY S G DY I+KNSWG++WG GY
Sbjct: 266 SVGIAAGSRDFQFYKGGIFDGECGIQPDHALTAVGYGSYYGQDYIIMKNSWGKNWGEQGY 325
Query: 309 MHMQRNTGNSLGICGINMLASYPTK 333
++R TG G+C I +ASYPTK
Sbjct: 326 FRIRRGTGKPEGVCDIYKIASYPTK 350
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 160/313 (51%), Positives = 214/313 (68%), Gaps = 7/313 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE+W ++GK+Y++ EK++R +IF+DN FV +HN N S+ + LN F+DLT E+
Sbjct: 47 MFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDAEYS 106
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSA 146
+ +LG + R N S + + D +P S+DWRKKGAV VK+Q +CG+CW F++
Sbjct: 107 SIYLGTKF----NIRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWTFAS 162
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A+EGINKIVTG+L+SLSEQE++DC R Y N+GC GG + AYQF+I N GI+TE +YP
Sbjct: 163 IAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINTEANYP 222
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y G+ G C++ K N+ VTID Y++VP NNEK L +AV QPVSV I + AF+ Y SG
Sbjct: 223 YTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFKSYKSG 282
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
IF GPC +DH V IVGY +E G DYWI++NSWG +WG +GY+ MQRN G S G C I
Sbjct: 283 IFNGPCGPRIDHGVTIVGYGTEGGKDYWIVRNSWGPNWGESGYVRMQRNVGGS-GKCFIA 341
Query: 326 MLASYPTKTGQNP 338
YP K G NP
Sbjct: 342 RAPVYPVKYGPNP 354
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 169/332 (50%), Positives = 210/332 (63%), Gaps = 20/332 (6%)
Query: 18 PLNYCSD-INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTL 73
P+ D + ++E W +HG + S+ + RL++F DN ++ HN + G +F L
Sbjct: 40 PVERADDEVRRMYEAWKSEHGHGHGSDD--RLRLEVFRDNLRYIDAHNAEADAGLHTFRL 97
Query: 74 SLNAFADLTHQEFKASFLGFSAASIDHDRRRNAS-VQSPGNLR------DVPASIDWRKK 126
L FADLT +E++ LGF A RR AS V S + R D+P +IDWR+
Sbjct: 98 GLTPFADLTLEEYRGRALGFRA------RRGGASRVGSGSSYRPRPRGGDLPDAIDWREL 151
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
GAVT VK+Q CG CWAFSA AIEGIN+IVTG+LVSLSEQE+IDCD + + GC GG M
Sbjct: 152 GAVTGVKNQEQCGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCD-TQDGGCNGGEMQ 210
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
A+QFVI N GIDTE DYPY G C+ ++N +VTIDG+ V NE L +AV Q
Sbjct: 211 NAFQFVINNGGIDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQ 270
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMN 306
PVSV I S R FQ Y+SGIF GPC T LDH V VGY SENG DYWI+KNSW SWG
Sbjct: 271 PVSVAIDASGRKFQHYTSGIFNGPCGTQLDHGVTAVGYGSENGKDYWIVKNSWSSSWGEA 330
Query: 307 GYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
GY+ ++RN + G CGI M ASYP K+ NP
Sbjct: 331 GYIRIRRNVAAATGKCGIAMDASYPVKSSSNP 362
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 325 bits (832), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 166/345 (48%), Positives = 217/345 (62%), Gaps = 19/345 (5%)
Query: 3 SLAFFLLSILLLSSLP-----LNYCSD-------INELFETWCKQHGKAYSSEQEKQQRL 50
SL F +SIL S L L Y + + LFE+W +H K Y S EK R
Sbjct: 11 SLLFLFVSILACSPLAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFYESLDEKLHRF 70
Query: 51 KIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS 110
+IF DN + + N S++ L LN FADLTH+EFK FLGF + R++ S +
Sbjct: 71 EIFMDNLKHIDE-TNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAE---RKDESSKE 126
Query: 111 PG--NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQE 168
G + D+P S+DWRKKGAV VK+Q CG CWAFS A+EGIN+IVTG+L LSEQE
Sbjct: 127 FGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVAAVEGINQIVTGNLTMLSEQE 186
Query: 169 LIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
LIDCD ++N+GC GGLMDYA+ +V+++ G+ E++YPY G C+++K VTI GY
Sbjct: 187 LIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSEKVTISGY 245
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN 288
DVP N+E L+A+ QP+SV I S R FQ YS G+F G C T LDH V VGY +
Sbjct: 246 HDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTK 305
Query: 289 GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
G+DY I++NSWG WG GY+ M+R +G G+CG+ M+ASYPTK
Sbjct: 306 GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYPTK 350
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 325 bits (832), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 166/328 (50%), Positives = 209/328 (63%), Gaps = 14/328 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAFADLTHQE 85
EL+E W + H S EK +R +F+ N +V HN N + + L LN FAD+T+ E
Sbjct: 36 ELYERW-RSHHTVSRSLDEKDKRFNVFKANVHYV--HNFNKKDKPYKLKLNKFADMTNHE 92
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGA 140
F+ + G + I H R + ++ G N+ DVP S+DWRKKGAVT VKDQ CG+
Sbjct: 93 FRHHYAG---SKIKHHRSFLGASRANGTFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGS 149
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGIN+I T LVSLSEQEL+DCD S N GC GGLMD A++F+ K GI+T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E++YPY + G+C+ QK N +V+IDGY+DVP N+E LL+AV QPVSV I S FQ
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQ 269
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G+FTG C T LDH V IVGY + +G YWI++NSWG WG GY+ MQR
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEE 329
Query: 320 GICGINMLASYPTKT-GQNPPPSPPPGP 346
G+CGI M SYP KT NP SP P
Sbjct: 330 GLCGIAMQPSYPIKTSSSNPTGSPATAP 357
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 325 bits (832), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 165/343 (48%), Positives = 228/343 (66%), Gaps = 12/343 (3%)
Query: 3 SLAFFLLSILLLSSLPL-NYCSDINE----LFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
SL FF ++L S+L + N N+ ++E+W + GK+Y+S EK+ R +IF++N
Sbjct: 13 SLLFFSTLLILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENL 72
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
+ HN N S++L LN FADLT +E+++++LGF + + N V G + +
Sbjct: 73 RIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGP--KAKVSNRYVPKVGVV--L 128
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P +DWR GAV VKDQ C +CWAFSA A+EGINKIVTG+L+SLSEQEL+DC R+
Sbjct: 129 PNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQR 188
Query: 178 S-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC G M+ A+QF+I N GI+TE +YPY Q GQC+ + N+ VTID Y+ +P NNE
Sbjct: 189 TRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTIDNYEQLPANNE 248
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
L AV QP++VG+ F+LY+SGI+TG C T++DH V IVGY +E G+DYWI+K
Sbjct: 249 WVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYGTERGLDYWIVK 308
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK-TGQNP 338
NSWG +WG NGY+ +QRN G + G CGI M+ SYP K + QNP
Sbjct: 309 NSWGTNWGENGYIRIQRNIGGA-GKCGIAMVPSYPVKYSYQNP 350
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 156/309 (50%), Positives = 203/309 (65%), Gaps = 3/309 (0%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+NE E W ++G+ Y EK++R +IF +N F+ N GN + L +N FADLT++
Sbjct: 34 MNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDINEFADLTNE 93
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EFKAS G+ +S + S GN+ VP S+DWR+KGAVT +KDQ CG CWAF
Sbjct: 94 EFKASRNGYKRSS--NVGLSEKSSFRYGNVTAVPTSMDWRQKGAVTPIKDQGQCGCCWAF 151
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
SA A+EGI K+ TG L+SLSEQEL+DCD S + GC GGLMD A++F+ +N G+ TE +
Sbjct: 152 SAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEAN 211
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY+G G CN K I GY+DVP N+E LL+AV +QPVSV I S AFQ YS
Sbjct: 212 YPYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYS 271
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
G+FTG C T LDH V VGY + +G YW++KNSWG SWG +GY+ M+R+ G+CG
Sbjct: 272 GGVFTGDCGTELDHGVTAVGYGTSDGTKYWLVKNSWGTSWGEDGYIRMERDIEAKEGLCG 331
Query: 324 INMLASYPT 332
I M +SYPT
Sbjct: 332 IAMQSSYPT 340
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 324 bits (831), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 154/325 (47%), Positives = 213/325 (65%), Gaps = 3/325 (0%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
S++ S L + +LF +W +H K Y S +EK +R ++F+ N + + N N
Sbjct: 29 SVVGYSQEDLALPYKLVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR-NG 87
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAV 129
S+ L LN FAD+ H+EFK+++LG +D R + + N ++P S+DWRKKGAV
Sbjct: 88 SYWLGLNQFADVAHEEFKSTYLGLKTG-MDGPARAPTAFRYE-NSVNLPWSVDWRKKGAV 145
Query: 130 TEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 189
T VK+Q CG+CWAFS A+EGIN+I TG L SLSEQEL+DCD +++ GCGGG MD+A+
Sbjct: 146 TPVKNQGECGSCWAFSTVAAVEGINQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAF 205
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVS 249
+++ N GI T+ DYPY + G C +++ +VTI GY+DVPEN+E LL+A+ QP+S
Sbjct: 206 AYIMGNLGIHTDDDYPYLMEEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPIS 265
Query: 250 VGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYM 309
VGI + FQ Y G+F G C T LDHA+ VGY S +G DY I+KNSWG+SWG GY
Sbjct: 266 VGIAAGSKDFQFYKRGVFEGSCGTELDHALTAVGYGSSDGQDYIIMKNSWGKSWGEQGYF 325
Query: 310 HMQRNTGNSLGICGINMLASYPTKT 334
++R TG G+C I +ASYPTKT
Sbjct: 326 RIKRGTGKPEGVCSIYSMASYPTKT 350
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 324 bits (831), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 162/327 (49%), Positives = 211/327 (64%), Gaps = 3/327 (0%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
SI+ S L + LFE W ++ KAY S +EK +R ++F+DN + + N
Sbjct: 51 FFSIVGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKE 110
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
+S+ L LNAFADLTH EFKA++LG R R V +VPAS+DWRKKG
Sbjct: 111 VTSYWLGLNAFADLTHDEFKATYLGLLPKRTSGGRFRYGGVGD--GGDEVPASVDWRKKG 168
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 187
AVTEVK+Q CG+CWAFS A+EGIN+IVTG+L SLSEQ+L+DC N+GC GG+MD
Sbjct: 169 AVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDN 228
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHI-VTIDGYKDVPENNEKQLLQAVVAQ 246
A+ F+ G+ +E+ YPY + G C+ + + + VTI GY+DVP N+E+ L++A+ Q
Sbjct: 229 AFSFIATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQ 288
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMN 306
PVSV I S R FQ YS G+F GPC + LDH V VGY S G DY I+KNSWG WG
Sbjct: 289 PVSVAIEASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGEK 348
Query: 307 GYMHMQRNTGNSLGICGINMLASYPTK 333
GY+ M+R TG G+CGIN +ASYPTK
Sbjct: 349 GYIRMKRGTGKPEGLCGINKMASYPTK 375
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 166/348 (47%), Positives = 220/348 (63%), Gaps = 30/348 (8%)
Query: 8 LLSILLLSSLPLNYCSDI-------------NELFETWCKQHGKAYSSEQE--KQQRLKI 52
LL I L +L L++C I + E W QHG+ Y+ EQE K +R +
Sbjct: 3 LLQIFLFVALVLSFCFSIQLAGLSRPLLDEDSMRHEEWMSQHGRVYADEQEDHKNKRFNV 62
Query: 53 FEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG 112
F++N + + N+ +F L++N FADLT++EF+AS+ GF + ++ + P
Sbjct: 63 FKENVERIEEFND--GKTFKLAINQFADLTNEEFRASYNGFKGPMV-----LSSQITKPT 115
Query: 113 NLR------DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSE 166
R +P S+DWRKKGAVT VK+Q CG CWAFSA AIEGI +I TG L+SLSE
Sbjct: 116 PFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSE 175
Query: 167 QELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTI 225
QEL+DCD + + GC GGLMD A++F+I N G+ TE +YPY+G+ G CN K N V+I
Sbjct: 176 QELVDCDTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAVSI 235
Query: 226 DGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY- 284
GY+DVP N+E+ L++AV QPVSV I FQ YSSG+FTG C T LDHAV VGY
Sbjct: 236 TGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYG 295
Query: 285 DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+SE+G YWI+KNSWG WG +GY+ MQ++ G+CGI M ASYPT
Sbjct: 296 ESEDGSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPT 343
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 162/327 (49%), Positives = 211/327 (64%), Gaps = 3/327 (0%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
SI+ S L + LFE W ++ KAY S +EK +R ++F+DN + + N
Sbjct: 65 FFSIVGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKE 124
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
+S+ L LNAFADLTH EFKA++LG R R V +VPAS+DWRKKG
Sbjct: 125 VTSYWLGLNAFADLTHDEFKATYLGLLPKRTSGGRFRYGGVGD--GGDEVPASVDWRKKG 182
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 187
AVTEVK+Q CG+CWAFS A+EGIN+IVTG+L SLSEQ+L+DC N+GC GG+MD
Sbjct: 183 AVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDN 242
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHI-VTIDGYKDVPENNEKQLLQAVVAQ 246
A+ F+ G+ +E+ YPY + G C+ + + + VTI GY+DVP N+E+ L++A+ Q
Sbjct: 243 AFSFIATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQ 302
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMN 306
PVSV I S R FQ YS G+F GPC + LDH V VGY S G DY I+KNSWG WG
Sbjct: 303 PVSVAIEASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGEK 362
Query: 307 GYMHMQRNTGNSLGICGINMLASYPTK 333
GY+ M+R TG G+CGIN +ASYPTK
Sbjct: 363 GYIRMKRGTGKPEGLCGINKMASYPTK 389
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 324 bits (830), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 164/342 (47%), Positives = 223/342 (65%), Gaps = 17/342 (4%)
Query: 6 FFLLSILLLSS------LPLNYC------SDINELFETWCKQHGKAYSSEQ-EKQQRLKI 52
FLL + +LS+ LP ++ +F+ W +HGK Y++ EK++R +
Sbjct: 12 LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQN 71
Query: 53 FEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG 112
F+DN F+ QHN N S+ L L FADLT QE++ F G + + V G
Sbjct: 72 FKDNLRFIDQHN-AKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLKTSRRYVPLAG 130
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
+ +P S+DWR++GAV+E+KDQ +C +CWAFS A+EG+NKIVTG L+SLSEQEL+DC
Sbjct: 131 D--QLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDC 188
Query: 173 DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCN-KQKLNRHIVTIDGYKDV 231
+ N G GLMD A+QF+I N+G+D+EKDYPY+G G CN KQ + ++TID Y+DV
Sbjct: 189 NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYEDV 248
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVD 291
P N+E L +AV QPVSVG+ + F LY S I+ GPC T+LDHA++IVGY SENG D
Sbjct: 249 PANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGSENGQD 308
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
YWI++NSWG +WG GY+ + RN + G+CGI MLASYP K
Sbjct: 309 YWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIK 350
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 323 bits (828), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 157/325 (48%), Positives = 212/325 (65%), Gaps = 2/325 (0%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
S++ S L + + LF +W +H K Y+S +EK +R +IF+ N + + N N
Sbjct: 36 SVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRR-NG 94
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKGA 128
S+ L LN FAD+ H+EFKAS+LG D + + S N ++P ++DWRKKGA
Sbjct: 95 SYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGA 154
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT VK+Q CG+CWAFS A+EGIN+IVTG LVSLSEQEL+DCD ++N GC GGLMD+A
Sbjct: 155 VTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDNTFNHGCRGGLMDFA 214
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+ +++ N GI TE+DYPY + G C +++ + ++TI GY+DVP N+E LL+A+ QPV
Sbjct: 215 FAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSKVITITGYEDVPANSETSLLKALAHQPV 274
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SVGI R FQ Y GIF G C DHA+ VGY S G DY I+KNSWG++WG GY
Sbjct: 275 SVGIAAGSRDFQFYKGGIFDGECGIQPDHALTAVGYGSYYGQDYIIMKNSWGKNWGEQGY 334
Query: 309 MHMQRNTGNSLGICGINMLASYPTK 333
++R TG G+C I +ASYPTK
Sbjct: 335 FRIRRGTGKPEGVCDIYKIASYPTK 359
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 172/342 (50%), Positives = 217/342 (63%), Gaps = 21/342 (6%)
Query: 3 SLAFFL---LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
SLA F L + ++S L S I E E W +GK Y QE++ RLKIF++N +
Sbjct: 12 SLALFFCLGLFAIQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNY 71
Query: 60 VTQHNNMGNSS-FTLSLNAFADLTHQEFKAS---FLGFSAASIDHD---RRRNASVQSPG 112
+ NN GN+ + L +N FADLT++EF AS F G +SI + NASV
Sbjct: 72 IEASNNAGNNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSITKTSTFKYENASV---- 127
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
P+++DWRKKGAVT VK+Q CG CWAFSA A EGI+K+ TG LVSLSEQEL+DC
Sbjct: 128 -----PSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDC 182
Query: 173 D-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231
D + + GC GGLMD A++F+I+NHG++TE YPY+G G C+ K + H VTI GY+DV
Sbjct: 183 DTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDV 242
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN-GV 290
P NNE+ L +AV QP+SV I S FQ Y SG+FTG C T LDH V VGY N G
Sbjct: 243 PANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGT 302
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG WG GY+ MQR + G+CGI M ASYPT
Sbjct: 303 KYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCGIAMEASYPT 344
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 162/337 (48%), Positives = 215/337 (63%), Gaps = 12/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA L L + D + E E W ++GK Y QE+++R +IF++N ++
Sbjct: 558 SLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYI 617
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKA---SFLGFSAASIDHDRRRNASVQSPGNLRDV 117
NN N + L++N FADLT++EF A F G +SI R + + N+ V
Sbjct: 618 EAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSII----RTTTFKYE-NVTAV 672
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P+++DWR+KGAVT +KDQ CG CWAFSA A EGI+ + +G L+SLSEQEL+DCD +
Sbjct: 673 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGV 732
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GGLMD A++FVI+NHG++TE +YPY+G G+CN + +VTI GY+DVP NNE
Sbjct: 733 DQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNE 792
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWII 295
K L +AV QPVSV I S FQ Y SG+FTG C T LDH V VGY S +G +YW++
Sbjct: 793 KALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLV 852
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG WG GY+ MQR + G+CGI M ASYPT
Sbjct: 853 KNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPT 889
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 162/337 (48%), Positives = 215/337 (63%), Gaps = 12/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA L L + D + E E W ++GK Y QE+++R +IF++N ++
Sbjct: 29 SLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYI 88
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKA---SFLGFSAASIDHDRRRNASVQSPGNLRDV 117
NN N + L++N FADLT++EF A F G +SI R + + N+ V
Sbjct: 89 EAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSI----IRTTTFKYE-NVTAV 143
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P+++DWR+KGAVT +KDQ CG CWAFSA A EGI+ + +G L+SLSEQEL+DCD +
Sbjct: 144 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGV 203
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GGLMD A++FVI+NHG++TE +YPY+G G+CN + +VTI GY+DVP NNE
Sbjct: 204 DQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNE 263
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWII 295
K L +AV QPVSV I S FQ Y SG+FTG C T LDH V VGY S +G +YW++
Sbjct: 264 KALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLV 323
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG WG GY+ MQR + G+CGI M ASYPT
Sbjct: 324 KNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPT 360
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 158/348 (45%), Positives = 224/348 (64%), Gaps = 12/348 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS-------DINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL N + ++ ++E+W ++GK+Y+S E ++R +IF++
Sbjct: 9 SMSLLFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
F+ +HN N S+ + LN FAD T++EF++++LGF++ S ++ + ++ P +
Sbjct: 69 TLRFIDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFTSGS---NKMKVSNRYEPRVGQ 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P +DWR GAV ++K Q CG+CWAFSA +EGINKIVTG L+SLSEQEL+DC R+
Sbjct: 126 VLPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRT 185
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N+ GC GG + +QF+I N GI+TE +YPY + GQCN N +ID Y++VP N
Sbjct: 186 QNTRGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPYN 245
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L AV QPVSV + + AFQ YSSGIFTGPC T++DHAV IVGY +E G+DYWI
Sbjct: 246 NEWALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWI 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+KNSW +WG GY+ + RN G + G CGI SYP K P P
Sbjct: 306 VKNSWDTTWGEEGYIRILRNVGGA-GTCGIATKPSYPVKYNNQNHPKP 352
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 160/332 (48%), Positives = 213/332 (64%), Gaps = 6/332 (1%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L + ++ + + + E E W Q+G+ Y EK +R KIF+DN A +
Sbjct: 13 ALLFVLAAWASQATARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIES 72
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
N + S+ LS+N FADLT++EF+AS F A H A+ N+ VP+++D
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFKA----HICSTEATSFKYENVTAVPSTVD 128
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCG 181
WRKKGAVT +KDQ CG+CWAFSA A+EGI ++ TG L+SLSEQEL+DCD S + GC
Sbjct: 129 WRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCS 188
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMD A++F+ +NHG+ TE +YPY G G CN++K I+GY+DVP NNEK L +
Sbjct: 189 GGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQK 248
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWG 300
AV QP++V I S FQ YSSG+FTG C T LDH V VGY S++G+ YW++KNSW
Sbjct: 249 AVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWS 308
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 309 TGWGEEGYIRMQRDVTVKEGLCGIAMQASYPT 340
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 160/332 (48%), Positives = 213/332 (64%), Gaps = 6/332 (1%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L + ++ + + + E E W Q+G+ Y EK +R KIF+DN A +
Sbjct: 13 ALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIES 72
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
N + S+ LS+N FADLT++EF+AS F A H A+ N+ VP+++D
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFKA----HICSTEATSFKYENVTAVPSTVD 128
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCG 181
WRKKGAVT +KDQ CG+CWAFSA A+EGI ++ TG L+SLSEQEL+DCD S + GC
Sbjct: 129 WRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCS 188
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMD A++F+ +NHG+ TE +YPY G G CN++K I+GY+DVP NNEK L +
Sbjct: 189 GGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQK 248
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWG 300
AV QP++V I S FQ YSSG+FTG C T LDH V VGY S++G+ YW++KNSW
Sbjct: 249 AVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWS 308
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 309 TGWGEEGYIRMQRDVTAKEGLCGIAMQASYPT 340
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 160/330 (48%), Positives = 212/330 (64%), Gaps = 7/330 (2%)
Query: 6 FFLLSILLLSSLPLN-YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
F+L+ + N + + + E E W Q+G+ Y EK +R KIF+DN A + N
Sbjct: 15 LFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFN 74
Query: 65 NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWR 124
+ S+ LS+N FADLT++EF+AS F A H A+ N+ VP+++DWR
Sbjct: 75 KAMDKSYKLSINEFADLTNEEFRASRNRFKA----HICSTEATSFKYENVTAVPSTVDWR 130
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGG 183
KKGAVT +KDQ CG+CWAFSA A+EGI ++ TG L+SLSEQEL+DCD S + GC GG
Sbjct: 131 KKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGG 190
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 243
LMD A++F+ +NHG+ TE +YPY G G CN++K I+GY+DVP NNEK L +AV
Sbjct: 191 LMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAV 250
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRS 302
QP++V I FQ YSSG+FTG C T LDH V VGY S++G+ YW++KNSWG
Sbjct: 251 AHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKNSWGTG 310
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 311 WGEEGYIRMQRDVTAKEGLCGIAMQASYPT 340
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/317 (50%), Positives = 219/317 (69%), Gaps = 8/317 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ +FE+W ++GK+Y++ EK++R +IF+DN FV +HN N S+ + LN F+DLT
Sbjct: 43 EVMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTL 102
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACW 142
+E+ + +LG D R N S + + D +P SIDWRKKGAV VK+Q +CG+CW
Sbjct: 103 EEYSSIYLG---TKFDM-RMTNVSDRYEPRVGDQLPNSIDWRKKGAVLGVKNQGNCGSCW 158
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHGIDTE 201
F+ A+E IN+IVTG+L+SLSEQ+++DC R S N+GC GG AYQF+I N GI+TE
Sbjct: 159 TFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGGINTE 218
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+ Q G+C++QK N+ VTID Y++VP NEK L +AV Q VSVGI + F+
Sbjct: 219 ANYPYKAQDGECDEQK-NQKYVTIDRYENVPRKNEKALQKAVSNQLVSVGIASNSSEFKA 277
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
Y SGIFTGPC +DHAV IVGY +E G+DYWI++NSWG +WG NGY+ MQRN GN+ G
Sbjct: 278 YKSGIFTGPCGAKIDHAVTIVGYGTEGGMDYWIVRNSWGSNWGENGYVRMQRNVGNA-GT 336
Query: 322 CGINMLASYPTKTGQNP 338
C I +YP K G NP
Sbjct: 337 CFIATSPNYPVKYGPNP 353
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 164/332 (49%), Positives = 210/332 (63%), Gaps = 13/332 (3%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L + ELFE W ++ KAY+S +EK +R ++F+DN + N
Sbjct: 31 FSIVGYSEEDLASHDRLIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-V 89
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR-------DVPASI 121
+S+ L LN FADLTH EFKA++LG + R N+ S R +VP +
Sbjct: 90 TSYWLGLNEFADLTHDEFKATYLGLTPPPT----RSNSKHYSSEEFRYGKMSNGEVPKEM 145
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWRKK AVTEVK+Q CG+CWAFS A+EGIN IVTG+L SLSEQELIDC N+GC
Sbjct: 146 DWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCN 205
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMDYA+ ++ G+ TE+ YPY + G C++ K +VTI GY+DVP N+E+ L++
Sbjct: 206 GGLMDYAFSYIASTGGLRTEEAYPYAMEEGDCDEGK-GAAVVTISGYEDVPANDEQALVK 264
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
A+ QPVSV I S R FQ YS G+F GPC LDH V VGY + G DY I+KNSWG
Sbjct: 265 ALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSKGQDYIIVKNSWGP 324
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
WG GY+ M+R TG G+CGIN +ASYPTK
Sbjct: 325 HWGEKGYIRMKRGTGKGEGLCGINKMASYPTK 356
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 167/337 (49%), Positives = 211/337 (62%), Gaps = 17/337 (5%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
LSI+ S L + ELFE + ++ KAYSS +EK +R ++F+DN + + N
Sbjct: 32 LSIVGYSEEDLASHERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK-I 90
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ----SPGNLRDVPASIDWR 124
+ + L LN FADLTH EFKA++LG + RRN++ Q +P +DWR
Sbjct: 91 TGYWLGLNEFADLTHDEFKAAYLGLTLTPA----RRNSNDQLFRYEEVEAASLPKEVDWR 146
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 184
KKGAVTEVK+Q CG+CWAFS A+EGIN IVTG+L LSEQELIDCD N+GC GGL
Sbjct: 147 KKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGL 206
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLN-------RHIVTIDGYKDVPENNEK 237
MDYA+ ++ N G+ TE+ YPY + G C + VTI GY+DVP NNE+
Sbjct: 207 MDYAFSYIAANGGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQ 266
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIK 296
LL+A+ QPVSV I S R FQ YS G+F GPC T LDH V VGY + G DY I+K
Sbjct: 267 ALLKALAHQPVSVAIEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASKGHDYIIVK 326
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG WG GY+ M+R TG G+CGIN +ASYPTK
Sbjct: 327 NSWGSHWGEKGYIRMRRGTGKHDGLCGINKMASYPTK 363
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 167/330 (50%), Positives = 213/330 (64%), Gaps = 18/330 (5%)
Query: 12 LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSS- 70
+ ++S L S+I E E W +GK Y QE++ RLKIF++N ++ NN GN+
Sbjct: 24 IQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKL 83
Query: 71 FTLSLNAFADLTHQEFKAS---FLGFSAASIDHD---RRRNASVQSPGNLRDVPASIDWR 124
+ L +N FADLT++EF AS F G +SI + NASV P+++DWR
Sbjct: 84 YKLGINQFADLTNEEFIASRNKFKGHMCSSITKTSTFKYENASV---------PSTVDWR 134
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGG 183
KKGAVT VK+Q CG CWAFSA A EGI+K+ TG LVSLSEQEL+DCD + + GC GG
Sbjct: 135 KKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGG 194
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 243
LMD A++F+I+NHG++TE YPY+G G C+ K + H VTI GY+DVP NNE+ L +AV
Sbjct: 195 LMDDAFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAV 254
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRS 302
QP+SV I S FQ Y SG+FTG C T LDH V VGY N G YW++KNSWG
Sbjct: 255 ANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTD 314
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR + G+CGI M ASYPT
Sbjct: 315 WGEEGYIKMQRGVDAAEGLCGIAMEASYPT 344
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 161/330 (48%), Positives = 211/330 (63%), Gaps = 7/330 (2%)
Query: 6 FFLLSILLLSSLPLN-YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
F+L+ + N + + + E E W Q+G+ Y EK +R KIF+DN A + N
Sbjct: 15 LFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFN 74
Query: 65 NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWR 124
+ S+ LS+N FADLT++EF S F A H A+ N+ VP++IDWR
Sbjct: 75 KAMDKSYKLSINEFADLTNEEFGTSRNRFKA----HICSTEATSFKYENVTAVPSTIDWR 130
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGG 183
KKGAVT +KDQ CG+CWAFSA A+EGI ++ TG L+SLSEQEL+DCD S + GC GG
Sbjct: 131 KKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGG 190
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 243
LMD A++F+ +NHG+ TE +YPY G G CN++K I+GY+DVP NNEK L +AV
Sbjct: 191 LMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAV 250
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRS 302
V QP++V I FQ YSSG+FTG C T LDH V VGY S++G+ YW++KNSWG
Sbjct: 251 VHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWGTG 310
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 311 WGEEGYIRMQRDVTAKEGLCGIAMQASYPT 340
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 320 bits (821), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 157/308 (50%), Positives = 203/308 (65%), Gaps = 6/308 (1%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W Q+G+ Y EK +R KIF+DN A + N N S+ LS+N FADLT++EF
Sbjct: 37 ERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVARIESFNKAMNKSYKLSINEFADLTNEEF 96
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
+AS F A H A+ ++ VP+++DWRKKGAVT +KDQ CG+CWAFSA
Sbjct: 97 RASRNRFKA----HICSTEATSFKYEHVXAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSA 152
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A+EGI ++ TG L+SLSEQEL+DCD S + GC GGLMD A++F+ +NHG+ TE +YP
Sbjct: 153 VAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYP 212
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y G G CN++K I+GY+DVP NNEK L +AV QP++V I FQ YSSG
Sbjct: 213 YAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSG 272
Query: 266 IFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+FTG C T LDH V VGY S++G+ YW++KNSWG WG GY+ MQR+ G+CGI
Sbjct: 273 VFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTEKEGLCGI 332
Query: 325 NMLASYPT 332
M ASYPT
Sbjct: 333 AMQASYPT 340
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 320 bits (821), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 159/308 (51%), Positives = 204/308 (66%), Gaps = 7/308 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E ETW Q+G+AY EK++RL IF++N F+ N +G + LS+N FADLT++EF
Sbjct: 2 ERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEEF 61
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
+AS G+ ++ H + N+ VP+++DWRKKGAVT +KDQ CG CWAFSA
Sbjct: 62 QASRNGYKMSA--HLSSSSTKPFRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAFSA 119
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A EGI ++ TG L+SLSEQEL+DCD S + GC GGLMD A+ F+I+N G+ TE +YP
Sbjct: 120 VAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEANYP 179
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y+G G CN K I GY+DVP N+E LL+AV QPVSV I AFQ YSSG
Sbjct: 180 YQGADGACNSGKA---AAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYSSG 236
Query: 266 IFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+FTG C T LDH V VGY S++G YW++KNSWG SWG NGY+ M+R+ G+CGI
Sbjct: 237 VFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEGLCGI 296
Query: 325 NMLASYPT 332
M ASYPT
Sbjct: 297 AMEASYPT 304
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 320 bits (820), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 160/332 (48%), Positives = 213/332 (64%), Gaps = 6/332 (1%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L FFL + ++ + + E E W Q+G+ Y EK +R KIF+DN A +
Sbjct: 13 ALLFFLAAWASQATARNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
N + S+ LS+N FADLT++EF+AS F A H A+ ++ VP+++D
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFKA----HICSTEATSFKYEHVAAVPSTVD 128
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCG 181
WRKKGAVT +KDQ CG+CWAFSA A+EGI ++ TG L+SLSEQEL+DCD S + GC
Sbjct: 129 WRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCN 188
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMD A++F+ +NHG+ TE +YPY G G CN++K I+GY+DVP NNEK L +
Sbjct: 189 GGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQK 248
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWG 300
AV QP++V I FQ YSSG+FTG C T LDH V VGY S++G+ YW++KNSWG
Sbjct: 249 AVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWG 308
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 309 TGWGEVGYIRMQRDVTAKEGLCGIAMQASYPT 340
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 166/347 (47%), Positives = 214/347 (61%), Gaps = 23/347 (6%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L+ + ELFE W +H +AY+S +EK +R ++F+DN + + N
Sbjct: 39 FSIVGYSEEDLSSHESLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDE-TNRKV 97
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAA-------SIDHDRRRNASVQSPGNLRDVPASI 121
SS+ L LN FADLTH EFKA++LG ++ D D + +P S+
Sbjct: 98 SSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSV 157
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWR KGAVT VK+Q CG+CWAFS A+EGIN+IVTG+L +LSEQELIDCD N+GC
Sbjct: 158 DWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCN 217
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRH--------------IVTIDG 227
GGLMDYA+ ++ N G+ TE+ YPY + G C + + +VTI G
Sbjct: 218 GGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISG 277
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDS- 286
Y+DVP NNE+ LL+A+ QPVSV I S R FQ YS G+F GPC T LDH V VGY +
Sbjct: 278 YEDVPRNNEQALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTA 337
Query: 287 ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
G DY I+KNSWG SWG GY+ M+R TG G+CGIN +ASYPTK
Sbjct: 338 AKGHDYIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYPTK 384
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 161/337 (47%), Positives = 213/337 (63%), Gaps = 12/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA L L + D + E E W ++GK Y QE+++R +IF++N ++
Sbjct: 11 SLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYI 70
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKA---SFLGFSAASIDHDRRRNASVQSPGNLRDV 117
NN N + L++N FADLT++EF A F G +SI R + + N+ V
Sbjct: 71 EAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSI----IRTTTFKYE-NVTAV 125
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P+++DWR+KGAVT +KDQ CG CWAFSA A EGI+ + +G L+SLSEQEL+DCD +
Sbjct: 126 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGV 185
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GGLMD A++FVI+NHG++TE +YPY+G G+CN + TI GY+DVP NNE
Sbjct: 186 DQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDVPANNE 245
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWII 295
K L +AV QPVSV I S FQ Y SG+FTG C T LDH V VGY S +G +YW++
Sbjct: 246 KALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLV 305
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG WG GY+ MQR + G+CGI M ASYPT
Sbjct: 306 KNSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYPT 342
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 158/339 (46%), Positives = 218/339 (64%), Gaps = 21/339 (6%)
Query: 6 FFLLSILLLSSL-------PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
FL +L+L++ PL+ + + E W QHG+ Y +EK++R IF++N
Sbjct: 10 IFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIE 69
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG----NL 114
+ NN + + L +N FADLT++EF+A + G+ +R+++ + S NL
Sbjct: 70 RIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYHGY--------KRQSSKLMSSSFRYENL 121
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
D+P S+DWR GAVT VKDQ +CG CWAFS AIEGI K+ TG+L+SLSEQ+L+DC
Sbjct: 122 SDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDCTA 181
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N GC GGLMD A+Q++I+N G+ +E +YPY+G G C+ +K I GY+DVP+N
Sbjct: 182 G-NKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGTCSSEKAASTEAQITGYEDVPQN 240
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYW 293
NE LLQAV QPVSVG+ G FQ Y SG+F G C T +HAV +GY ++ +G DYW
Sbjct: 241 NENALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGTDIDGTDYW 300
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
++KNSWG SWG NGYM M+R G+S G+CG+ M ASYPT
Sbjct: 301 LVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPT 339
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 168/357 (47%), Positives = 213/357 (59%), Gaps = 20/357 (5%)
Query: 4 LAFFLLSILL-------LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDN 56
L F L+++L L EL+E W + H S EK +R +F+ N
Sbjct: 6 LVLFTLALVLRLGESFDFHEKELETEEKFWELYERW-RSHHTVSRSLDEKHKRFNVFKAN 64
Query: 57 YAFVTQHN-NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG--- 112
+V HN N + + L LN FAD+T+ EF+ + G + I H R + ++ G
Sbjct: 65 VHYV--HNFNKKDKPYKLKLNKFADMTNHEFRQHYAG---SKIKHHRTLLGASRANGTFM 119
Query: 113 --NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
N +VP SIDWRKKGAVT VKDQ CG+CWAFS A+EGIN+I T LVSLSEQEL+
Sbjct: 120 YANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQELV 179
Query: 171 DCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
DCD + N GC GGLMD A+ F+ K GI TE+ YPY+ + +C+ QK N +V+IDG++D
Sbjct: 180 DCDTTENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDKCDIQKRNTPVVSIDGHED 239
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NG 289
VP N+E LL+AV QP+SV I S FQ YS G+FTG C T LDH V IVGY + +G
Sbjct: 240 VPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAIVGYGTTVDG 299
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGP 346
YWI+KNSWG WG GY+ MQR G+CGI M SYP KT NP SP P
Sbjct: 300 TKYWIVKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYPIKTSSNPTGSPAATP 356
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 152/305 (49%), Positives = 199/305 (65%), Gaps = 4/305 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W + GK Y+ EK++R +IF+DN ++ N GN + LS+N FADLT++E K +
Sbjct: 39 EQWMETFGKVYADAAEKERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFADLTNEELKVA 98
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
G+ R + N+ VPA++DWRKKGAVT +KDQ CG+CWAFS A
Sbjct: 99 RNGYRRPL--QTRPMKVTSFKYENVTAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAA 156
Query: 150 IEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
EGIN++ TG LVSLSEQEL+DCD + + GC GGLM+ ++F+IKNHGI TE +YPY+
Sbjct: 157 TEGINQLTTGKLVSLSEQELVDCDTQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQA 216
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
G CN +K I I GY+ VP N+E LL+AV +QP+SV I FQ YSSG+FT
Sbjct: 217 ADGTCNSKKEASRIAKITGYESVPANSEAALLKAVASQPISVSIDAGGSDFQFYSSGVFT 276
Query: 269 GPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
G C T LDH V VGY ++ +G YW++KNSWG SWG GY+ MQR+T G+CGI M
Sbjct: 277 GQCGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRDTEAEEGLCGIAMD 336
Query: 328 ASYPT 332
+SYPT
Sbjct: 337 SSYPT 341
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 167/333 (50%), Positives = 208/333 (62%), Gaps = 21/333 (6%)
Query: 24 DINELFETWCKQHGKAYSS--------------EQEKQQRLKIFEDNYAFVTQHN---NM 66
++ ++E W +HG+ SS E++++ RL++F DN ++ HN +
Sbjct: 49 EVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAEADA 108
Query: 67 GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKK 126
G +F L L FADLT +E++ LGF A R + G D+P +IDWR+
Sbjct: 109 GLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRGG--DLPDAIDWRQL 166
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
GAVTEVKDQ CG CWAFSA AIEG+N I TG+LVSLSEQE+IDCD + +SGC GG M+
Sbjct: 167 GAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCD-AQDSGCDGGQME 225
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQK-LNRHIVTIDGYKDVPENNEKQLLQAVVA 245
A++FVI N GIDTE DYP+ G G C+ K N + TIDG +V NNE L +AV
Sbjct: 226 NAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVEVASNNETALQEAVAI 285
Query: 246 QPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 305
QPVSV I S RAFQ YSSGIF GPC TSLDH V VGY SE+G DYWI+KNSW SWG
Sbjct: 286 QPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVKNSWSASWGE 345
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
GY+ M+RN G CGI M ASYP K +P
Sbjct: 346 AGYIRMRRNVPRPTGKCGIAMDASYPVKDTYHP 378
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 161/328 (49%), Positives = 212/328 (64%), Gaps = 5/328 (1%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L+ + ELFE W +H KAY+S +EK R ++F+DN + + N
Sbjct: 24 FSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINRE-V 82
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
+S+ L LN FADLTH EFK ++LG S R+ ++ D+P ++DWRKKGA
Sbjct: 83 TSYWLGLNEFADLTHDEFKTTYLGLSPPPARRSSSRSFRYENVA-AHDLPKAVDWRKKGA 141
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT+VK+Q CG+CWAFS A+EGIN IVTG+L +LSEQELIDC NSGC GG+MDYA
Sbjct: 142 VTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGMMDYA 201
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQC-NKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 247
+ ++ + G+ TE+ YPY + G C + +K V+I GY+DVP +E+ L++A+ QP
Sbjct: 202 FSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVSISGYEDVPTKDEQALIKALAHQP 261
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGV--DYWIIKNSWGRSWGM 305
VSV I S R FQ YS G+F GPC LDH V VGY S+ G DY I+KNSWG WG
Sbjct: 262 VSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVKNSWGGKWGE 321
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPTK 333
GY+ M+R TG S G+CGIN +ASYPTK
Sbjct: 322 KGYIRMKRGTGKSEGLCGINKMASYPTK 349
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 153/305 (50%), Positives = 212/305 (69%), Gaps = 7/305 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE W +HGK+YSS+ EK +RL IF D A++ +HN N++FTL LN F+DLT+ EF+
Sbjct: 1 MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A+++G + DRR V ++ +P S+DWR++GAVT +KDQ CG+CWAFSA
Sbjct: 61 ANYVGKFKSPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
+IE + + T LVSLSEQ+LIDCD + + GC GG + A++FV++N G+ TE+ YPY
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYT 177
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G AG CN K +V I GYKDV +++ L++AV PV+VGICGS++ FQ Y SGI
Sbjct: 178 GFAGSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
+G CS S DHAVL++GY +E G+ YWIIKNSWG SWG NG+M +++ G G+CG+N
Sbjct: 236 SGQCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGENGFMKIKKKDGE--GMCGMNGQ 293
Query: 328 ASYPT 332
+SYPT
Sbjct: 294 SSYPT 298
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 160/326 (49%), Positives = 200/326 (61%), Gaps = 12/326 (3%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
++S L SL L E E W +HGK Y EK++R IF+DN F+ N
Sbjct: 25 VMSRKLYESLSLQ------ERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAAD 78
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
N + LS+N ADLT EFKAS G+ DR + N+ +PA++DWR KG
Sbjct: 79 NQPYKLSVNHLADLTLDEFKASRNGYKKI----DREFTTTSFKYENVTAIPAAVDWRVKG 134
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMD 186
AVT +KDQ CG+CWAFS A EGIN+I TG LVSLSEQEL+DCD + + GC GGLM+
Sbjct: 135 AVTPIKDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLME 194
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
++F+IKN GI +E +YPY+ G CN + I GY+ VP N+EK LL+AV Q
Sbjct: 195 DGFEFIIKNGGITSETNYPYKAADGSCN-TATTTPVAKITGYEKVPVNSEKSLLKAVANQ 253
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMN 306
P+SV I S+ +F YSSGI+TG C T LDH V VGY S NG DYWI+KNSWG WG
Sbjct: 254 PISVSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEK 313
Query: 307 GYMHMQRNTGNSLGICGINMLASYPT 332
GY+ MQR G+CGI M +SYPT
Sbjct: 314 GYIRMQRGIAAKEGLCGIAMDSSYPT 339
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 158/315 (50%), Positives = 208/315 (66%), Gaps = 11/315 (3%)
Query: 24 DINELFETWCKQHGKAYSSE-QEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
++ +F+ W +HGK Y++ EK++R + F+DN F+ QHN N S+ L L FADLT
Sbjct: 43 EVGFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHN-AKNLSYQLGLTRFADLT 101
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQS---PGNLRDVPASIDWRKKGAVTEVKDQASCG 139
QE++ F G ++RN + P + +P S+DWR +GAV+ +KDQ +C
Sbjct: 102 VQEYRDLFPGSPKP-----KQRNLRISRRYVPLDGDQLPESVDWRNEGAVSAIKDQGTCN 156
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS A+EGINKIVTG LVSLSEQEL+DC+ N G G MD A+QF+I N G+D
Sbjct: 157 SCWAFSTVAAVEGINKIVTGELVSLSEQELVDCNLVNNGCYGSGTMDAAFQFLINNGGLD 216
Query: 200 TEKDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
++ DYPY+G G CN K+ + I+TID Y+DVP N+E L +AV QPVSVG+ +
Sbjct: 217 SDTDYPYQGSQGYCNRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQE 276
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
F LY SGI+ GPC T LDHA++IVGY SENG DYWI++NSWG +WG GY M RN
Sbjct: 277 FMLYRSGIYNGPCGTDLDHALVIVGYGSENGQDYWIVRNSWGTTWGDAGYAKMARNFEYP 336
Query: 319 LGICGINMLASYPTK 333
G+CGI MLASYP K
Sbjct: 337 SGVCGIAMLASYPVK 351
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 156/329 (47%), Positives = 206/329 (62%), Gaps = 18/329 (5%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W H + EK +R F+ N F+ HN G+ + L LN F D++ EF
Sbjct: 44 DLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDMSQAEF 102
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG---------NLRDVPASIDWRKKGAVTEVKDQAS 137
+A+F G + DRRR+ P N+ D+P S+DWR+KGAVT VK+Q
Sbjct: 103 RATFAGSRVS----DRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQGK 158
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS ++EGIN I TG LVSLSEQELIDCD + N GC GGLMD A++++ KN G
Sbjct: 159 CGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKNGG 218
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
+ TE YPYR G C K+ + +V IDG++DVP N+E+ L +AV QPVSVGI
Sbjct: 219 LTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGIDA 278
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQR 313
S +AF YS G+FTG C T LDH V +VGY +E+G YW +KNSWG SWG GY+ +++
Sbjct: 279 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYIRVEK 338
Query: 314 NTGNSLGICGINMLASYPTKTGQNPPPSP 342
++G G+CGI M ASY KT P P+P
Sbjct: 339 DSGAEGGLCGIAMEASYAVKTDSKPKPTP 367
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 162/341 (47%), Positives = 215/341 (63%), Gaps = 16/341 (4%)
Query: 2 NSLAFF-LLSILLLSSLPLN---YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
N L F LL + L +S + + + +NE E W ++G+ Y EK++R +IF +N
Sbjct: 7 NKLMFVALLVVGLWASQAWSRSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNV 66
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAAS----IDHDRRRNASVQSPGN 113
F+ N +GN + L +N FADLT++EFK S G+ +S + R A+V +
Sbjct: 67 EFIESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKRSSGVGLTEKSSFRYANVTA--- 123
Query: 114 LRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD 173
VP S+DWR+ GAVT +KDQ CG CWAFSA A+EGI K+ TG L+SLSEQEL+DCD
Sbjct: 124 ---VPTSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCD 180
Query: 174 RS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVP 232
S + GC GGLMD A++F+ +N G+ TE +YPY+G G CN K I GY+DVP
Sbjct: 181 TSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVP 240
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVD 291
N+E LL+AV +QPVSV I S AFQ YS G+FTG C T LDH V VGY S++G
Sbjct: 241 ANSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDGTK 300
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG SWG +GY+ M+R+ G+CGI M SYPT
Sbjct: 301 YWLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPT 341
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 157/351 (44%), Positives = 226/351 (64%), Gaps = 14/351 (3%)
Query: 3 SLAFFLLSILLLSSL-------PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S L+ S PL ++ L+E+W ++GK+Y+S E++ R++IF++
Sbjct: 9 SMSLLFFSTFLIFSFAIDAKISPLRTNDEVMALYESWLVKYGKSYNSLGEREMRIEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
N F+ +HN N S+T+ LN FADLT +E+++++LGF ++ + N + G +
Sbjct: 69 NLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSL--KSKVSNRYMPQVGEV- 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P +DWR GAV +VK+Q C +CWAF+ +E IN+I+TG L+SLSEQEL+DC+R+
Sbjct: 126 -LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCNRT 184
Query: 176 -YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N GC GG MD AY+F+I N GI+TE++YPY GQ QC++ K N++ VTID Y+ VP N
Sbjct: 185 PINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVTIDSYEQVPPN 244
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT-GPCSTSLDHAVLIVGYDSENGVDYW 293
+E + +AV QPVSV I F+ Y SGIFT G C T+L+HAV I+GY +ENG+DYW
Sbjct: 245 DELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGYGTENGIDYW 304
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPP 344
I+KNS+G WG +GY +QRN G G CGI YP K + P P P
Sbjct: 305 IVKNSYGTQWGESGYGKVQRNVGGE-GRCGIASYPFYPVKNYTSKPAKPHP 354
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 317 bits (812), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 157/337 (46%), Positives = 210/337 (62%), Gaps = 12/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA S L + D + E E W ++ K Y QE+++R KIF++N ++
Sbjct: 11 SLALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYI 70
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKA---SFLGFSAASIDHDRRRNASVQSPGNLRDV 117
NN N +TL +N FADLT++EF A F G +SI R + + N+ +
Sbjct: 71 EAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSI----TRTTTFKYE-NVTAI 125
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P+++DWR+KGAVT +KDQ CG CWAFSA A EGI+ + G L+SLSEQE++DCD +
Sbjct: 126 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGE 185
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GG MD A++F+I+NHG++ E +YPY+ G+CN + H+ TI GY+DVP NNE
Sbjct: 186 DQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNE 245
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWII 295
K L +AV QPVSV I S FQ Y SG+FTG C T LDH V VGY S +G +YW++
Sbjct: 246 KALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLV 305
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG WG GY+ MQR G+CGI M+ASYPT
Sbjct: 306 KNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPT 342
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 317 bits (812), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 157/337 (46%), Positives = 210/337 (62%), Gaps = 12/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA S L + D + E E W ++ K Y QE+++R KIF++N ++
Sbjct: 11 SLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYI 70
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKA---SFLGFSAASIDHDRRRNASVQSPGNLRDV 117
NN N +TL +N FADLT++EF A F G +SI R + + N+ +
Sbjct: 71 EAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSIT----RTTTFKYE-NVTAI 125
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P+++DWR+KGAVT +KDQ CG CWAFSA A EGI+ + G L+SLSEQE++DCD +
Sbjct: 126 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGE 185
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GG MD A++F+I+NHG++ E +YPY+ G+CN + H+ TI GY+DVP NNE
Sbjct: 186 DQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNE 245
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWII 295
K L +AV QPVSV I S FQ Y SG+FTG C T LDH V VGY S +G +YW++
Sbjct: 246 KALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLV 305
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG WG GY+ MQR G+CGI M+ASYPT
Sbjct: 306 KNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPT 342
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 156/329 (47%), Positives = 209/329 (63%), Gaps = 21/329 (6%)
Query: 25 INELFETWCKQH--------GKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76
+ L+E W ++ G + + E ++R +F +N ++ + N G F L+LN
Sbjct: 38 LRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRLALN 97
Query: 77 AFADLTHQEFKASFLGFSAASIDHDR-------RRNASVQSPGNLRD-VPASIDWRKKGA 128
FAD+T EF+ ++ G A H R S + G+ D +P ++DWR++GA
Sbjct: 98 KFADMTTDEFRRTYAGSRAR---HHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWRERGA 154
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT +KDQ CG+CWAFSA A+EG+NKI TG LV+LSEQEL+DCD N GC GGLMDYA
Sbjct: 155 VTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDYA 214
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+QF+ +N GI TE +YPYR + G+CNK K + H VTIDGY+DVP N+E L +AV QPV
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNG 307
+V + S + FQ YS G+FTG C T LDH V VGY + +G YWI+KNSWG WG G
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERG 334
Query: 308 YMHMQRN-TGNSLGICGINMLASYPTKTG 335
Y+ MQR + +S G+CGI M ASYP K+G
Sbjct: 335 YIRMQRGVSSDSNGLCGIAMEASYPVKSG 363
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 159/348 (45%), Positives = 213/348 (61%), Gaps = 22/348 (6%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
L S + + L + +L+E W H + EK +R F+ N F+ HN G
Sbjct: 25 LCSAIPMEDKDLESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRG 83
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG---------NLRDVP 118
+ + L LN F D+ EF+A+F+G D RR+ + P N+ D+P
Sbjct: 84 DHPYRLHLNRFGDMDQAEFRATFVG--------DLRRDTPSKPPSVPGFMYAALNVSDLP 135
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
S+DWR+KGAVT VKDQ CG+CWAFS ++EGIN I TGSLVSLSEQELIDCD + N
Sbjct: 136 PSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND 195
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKDVPENN 235
GC GGLMD A++++ N G+ TE YPYR G CN + ++ +V IDG++DVP N+
Sbjct: 196 GCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANS 255
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWI 294
E+ L +AV QPVSV + S +AF YS G+FTG C T LDH V +VGY +E+G YW
Sbjct: 256 EEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWT 315
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+KNSWG SWG GY+ +++++G S G+CGI M ASYP KT P P+P
Sbjct: 316 VKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYSKPKPTP 363
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 209/332 (62%), Gaps = 11/332 (3%)
Query: 7 FLLSILLLSSLP----LNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
L +I +L+SL LN S + E + W ++G+ Y + EK +R IF++N ++
Sbjct: 14 LLFTIGVLASLAAARSLNEAS-MTETHDQWMARYGRVYKTANEKNRRSTIFQENLKYIQT 72
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
N N + L +N FADLT++EF S F + H +V N+ VPA++D
Sbjct: 73 FNKANNKPYKLGVNEFADLTNEEFTTSRNKFKS----HVCATVTNVFRYENVTAVPATMD 128
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCG 181
WRKKGAVT +K+Q CG CWAFSA A+EGI ++ TG L+SLSEQEL+DCD + + GC
Sbjct: 129 WRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQGCE 188
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMDYA+ F+ +NHG+ TE +YPY G G CN K H TI G++DVP N+E LL+
Sbjct: 189 GGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATITGHEDVPANSESALLK 248
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWG 300
AV QP+SV I S FQ YSSG+FTG C T LDH V VGY + +G YW++KNSWG
Sbjct: 249 AVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGTAADGTKYWLVKNSWG 308
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG GY+ MQR + G+CGI M ASYPT
Sbjct: 309 TSWGEEGYIQMQRGVAAAEGLCGIAMQASYPT 340
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 156/312 (50%), Positives = 196/312 (62%), Gaps = 6/312 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+ + E E W +G+ Y EKQ+R KIFE+N A + N N + LS+N FADLT
Sbjct: 32 ASMRERHEEWMASYGRVYKDINEKQKRYKIFEENVALIESSNKDANKPYKLSVNQFADLT 91
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++EFKAS F H ++ GN+ VP+++DWR KGAVT VKDQ CG CW
Sbjct: 92 NEEFKASRNRFKG----HICSTKSTSFKYGNVSAVPSAMDWRMKGAVTPVKDQGQCGCCW 147
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA A EGI K+ TG L+SLSEQEL+DCD S + GC GGLMD A+ F+ NHG+ +E
Sbjct: 148 AFSAVAATEGITKLTTGELISLSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHNHGLASE 207
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+G G CN K H I+G++DVP N+E+ LL AV QPVSV I FQ
Sbjct: 208 ANYPYKGVDGTCNTNKQAIHAAEINGFEDVPANSEEALLNAVAHQPVSVAIDAGGSGFQF 267
Query: 262 YSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+F G C T LDH V VGY S++G YW++KNSWG WG GY+ MQR+ G
Sbjct: 268 YSKGVFIGACGTQLDHGVTAVGYGTSDDGTKYWLVKNSWGTQWGEEGYIRMQRDVDAKEG 327
Query: 321 ICGINMLASYPT 332
+CGI M ASYPT
Sbjct: 328 LCGIAMKASYPT 339
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 150/309 (48%), Positives = 207/309 (66%), Gaps = 5/309 (1%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
+I +FE W +HGK+YSS+ EK +RL IF D A++ +HN N++FTL LN F+DLT+
Sbjct: 32 EIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTN 91
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
EF+A +G DR + ++ +P S+DWR+KGAVT +KDQ CG+CWA
Sbjct: 92 AEFRAMHVGKFKRPRYQDRL--PAEDEDVDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWA 149
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FSA +IE + + T LVSLSEQ+L+DCD + ++GC GGLM+ A++FV+KN G+ TE
Sbjct: 150 FSAIASIESAHFLATKELVSLSEQQLMDCD-TVDAGCDGGLMETAFKFVVKNGGVTTEAA 208
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G G CN K + I G+K V E++ L++AV PV+V ICGS+ FQ Y
Sbjct: 209 YPYTGSVGSCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYK 268
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
SGI +G C SLDH VL++GY +E G+ YWIIKNSWG SWG +G+M ++R G+ G+CG
Sbjct: 269 SGILSGKCDDSLDHGVLLIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIERKDGD--GMCG 326
Query: 324 INMLASYPT 332
+N +SYPT
Sbjct: 327 MNGDSSYPT 335
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 207/334 (61%), Gaps = 10/334 (2%)
Query: 7 FLLSILLLSSLPLNYCS-DINELF-----ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
F IL+L S ++ E + E W +GK Y EK++R KIF++N ++
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N GN + LS+N FAD T+++FK + G+ R + N+ VPA+
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPF--QTRPMKVTSFKYENVTAVPAT 127
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSG 179
+DWRKKGAVT +KDQ CG+CWAFS A EGIN++ TG LVSLSEQEL+DCD + + G
Sbjct: 128 MDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQG 187
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLM+ ++F+IKNHGI TE +YPY+ G CN +K HI I GY+ VP N+E +L
Sbjct: 188 CEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAEL 247
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNS 298
L+ V QP+SV I FQ YSSG+FTG C T LDH V VGY ++ +G YW++KNS
Sbjct: 248 LKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWLVKNS 307
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG SWG GY+ MQR+ G+CGI M +SYPT
Sbjct: 308 WGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPT 341
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 152/310 (49%), Positives = 208/310 (67%), Gaps = 8/310 (2%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
FE W +HG+AY++ EKQ+R +++++N A + + N+ G +TL+ N FADLT++EF+A
Sbjct: 119 FEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS-GGHGYTLTDNKFADLTNEEFRA 177
Query: 89 SFLGFSAASIDHDR---RRNASVQSPGN--LRDVPASIDWRKKGAVTEVKDQASCGACWA 143
LG A D R + +++ PGN D+P +DWRKKGAV EVK+Q SCG+CWA
Sbjct: 178 KMLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCWA 237
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FSA A+EG+N+I G LVSLSEQEL+DCD + GC GG M +A++FV+ NHG+ TE
Sbjct: 238 FSAVAAMEGLNQIKNGKLVSLSEQELVDCD-AEAVGCAGGFMSWAFEFVMANHGLTTEAS 296
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY+G G C KLN V+I GY +V N+E +LL+ QPVSV + FQLY+
Sbjct: 297 YPYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYA 356
Query: 264 SGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
G+F+GPC+ ++H V +VGY +++ YWI+KNSWG WG GYM MQR+ G G+C
Sbjct: 357 GGVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAGVPTGLC 416
Query: 323 GINMLASYPT 332
GI MLASYP
Sbjct: 417 GIAMLASYPV 426
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 152/307 (49%), Positives = 213/307 (69%), Gaps = 7/307 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE W +HGK+YSS+ EK +RL IF D A++ +HN + N++FTL LN F+DLT+ EF+
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A+++G DRR V ++ +P S+DWR++GAVT +KDQ CG+CWAFSA
Sbjct: 61 ANYVGKFKPPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
+IE + + T LVSLSEQ+LIDCD + + GC GG + A++FV++N G+ TE+ YPY
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYT 177
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G AG CN K +V I GYKDV +++ L++AV PV+VGICGS++ FQ Y SGI
Sbjct: 178 GFAGSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
+G CS S DHAVL++GY +E G+ YWIIKNSWG SWG +G+M +++ G G+CG+N
Sbjct: 236 SGHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKKDGE--GMCGMNGQ 293
Query: 328 ASYPTKT 334
+SYPT +
Sbjct: 294 SSYPTTS 300
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 152/307 (49%), Positives = 213/307 (69%), Gaps = 7/307 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE W +HGK+YSS+ EK +RL IF D A++ +HN + N++FTL LN F+DLT+ EF+
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A+++G DRR V ++ +P S+DWR++GAVT +KDQ CG+CWAFSA
Sbjct: 61 ANYVGKFKPPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
+IE + + T LVSLSEQ+LIDCD + + GC GG + A++FV++N G+ TE+ YPY
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYT 177
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G AG CN K +V I GYKDV +++ L++AV PV+VGICGS++ FQ Y SGI
Sbjct: 178 GFAGSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
+G CS S DHAVL++GY +E G+ YWIIKNSWG SWG +G+M +++ G G+CG+N
Sbjct: 236 SGHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKEDGE--GMCGMNGQ 293
Query: 328 ASYPTKT 334
+SYPT +
Sbjct: 294 SSYPTTS 300
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 202/317 (63%), Gaps = 16/317 (5%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+ ++E E W Q+GK Y EK+ R KIF++N + NN GN S+ L +N FADLT
Sbjct: 33 ASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIEAFNNAGNKSYKLGINQFADLT 92
Query: 83 HQEFKAS--FLGFSAASIDHDRRRNASVQSPG----NLRDVPASIDWRKKGAVTEVKDQA 136
++EFKA F G ++ S ++P ++ VPAS+DWR+KGAVT +KDQ
Sbjct: 93 NEEFKARNRFKGHMCSN---------STRTPTFKYEHVTSVPASLDWRQKGAVTPIKDQG 143
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKN 195
CG CWAFSA A EGI K+ TG L+SLSEQEL+DCD + + GC GGLMD A++F+++N
Sbjct: 144 QCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQN 203
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
G++TE YPY+G CN + +I G++DVP N+E LL+AV QP+SV I S
Sbjct: 204 KGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDAS 263
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
FQ YSSG+FTG C T LDH V VGY S+ G YW++KNSWG WG GY+ MQR+
Sbjct: 264 GSEFQFYSSGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDV 323
Query: 316 GNSLGICGINMLASYPT 332
G+CG M ASYPT
Sbjct: 324 AAEEGLCGFAMQASYPT 340
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 159/314 (50%), Positives = 205/314 (65%), Gaps = 11/314 (3%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM-GNSSFTLSLNAFADLTH 83
++E E W +GK Y QE+++R KIF +N ++ NN N S+ L +N FADLT+
Sbjct: 35 MHERHERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTN 94
Query: 84 QEFKAS---FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
+EF AS F G +SI R + + N+ +P+++DWRKKGAVT VK+Q CG
Sbjct: 95 EEFVASRNKFKGHMCSSI----IRTTTFKYE-NVSAIPSTVDWRKKGAVTPVKNQGQCGC 149
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGID 199
CWAFSA A EGI+K+ TG LVSLSEQEL+DCD + + GC GGLMD A++F+I+NHG++
Sbjct: 150 CWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLN 209
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TE YPY+G G CN K + TI GY+DVP NNE+ L +AV QP+SV I S F
Sbjct: 210 TEAQYPYQGVDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPISVAIDASGSDF 269
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
Q Y SG+FTG C T LDH V VGY S +G YW++KNSWG WG GY+ MQR +
Sbjct: 270 QFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAA 329
Query: 319 LGICGINMLASYPT 332
G+CGI M ASYPT
Sbjct: 330 EGLCGIAMQASYPT 343
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 315 bits (807), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 168/364 (46%), Positives = 217/364 (59%), Gaps = 23/364 (6%)
Query: 4 LAFFLLSILLLSSL--------PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
L FL S+++L + + ++ L++ W + H S E+++R +F
Sbjct: 5 LLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRW-RSHHSVPRSLNEREKRFNVFRH 63
Query: 56 NYAFVTQHN-NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR----RNASVQ- 109
N V HN N N S+ L LN FADLT EFK ++ G ++I H R + S Q
Sbjct: 64 NVMHV--HNTNKKNRSYKLKLNKFADLTINEFKNAYTG---SNIKHHRMLQGPKRGSKQF 118
Query: 110 --SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
NL +P+S+DWRKKGAVTE+K+Q CG+CWAFS A+EGINKI T LVSLSEQ
Sbjct: 119 MYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQ 178
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
EL+DCD N GC GGLM+ A++F+ KN GI TE YPY G G+C+ K N +VTIDG
Sbjct: 179 ELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDG 238
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE 287
++DVPEN+E LL+AV QPVSV I FQ YS G+FTG C T L+H V VGY SE
Sbjct: 239 HEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGSE 298
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPT 347
G YWI++NSWG WG GY+ ++R G CGI M ASYP K + P+P G
Sbjct: 299 RGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKL-SSSNPTPKDGDV 357
Query: 348 RCSL 351
+ L
Sbjct: 358 KDEL 361
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 315 bits (807), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 159/315 (50%), Positives = 207/315 (65%), Gaps = 11/315 (3%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSS-FTLSLNAFADLT 82
D+ E W Q+GK Y QE+++R KIF +N ++ N N+ +TL +N FADLT
Sbjct: 33 DMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADLT 92
Query: 83 HQEFKAS---FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
+ EF +S F G +SI R ++ + N +P+S+DWRKKGAVT VK+Q CG
Sbjct: 93 NDEFTSSRNKFKGHMCSSI----TRTSTFKYE-NASAIPSSVDWRKKGAVTPVKNQGQCG 147
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGI 198
CWAFSA A EGI+K+ TG L+SLSEQEL+DCD + + GC GGLMD A++F+I+NHG+
Sbjct: 148 CCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGL 207
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
+TE +YPY+G G CN K + + VTI GY+DVP NNE+ L +AV QP+SV I S
Sbjct: 208 NTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPISVAIDASGSD 267
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQ Y SG+FTG C T LDH V VGY S +G YW++KNSWG WG GY+ MQR
Sbjct: 268 FQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEGYIMMQRGVDA 327
Query: 318 SLGICGINMLASYPT 332
+ G+CGI M ASYPT
Sbjct: 328 AEGLCGIAMQASYPT 342
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 163/330 (49%), Positives = 211/330 (63%), Gaps = 18/330 (5%)
Query: 12 LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSS- 70
+ ++S L S I E E W +GK Y QE++ RLKIF++N ++ NN GN+
Sbjct: 24 IQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKL 83
Query: 71 FTLSLNAFADLTHQEFKAS---FLGFSAASIDHD---RRRNASVQSPGNLRDVPASIDWR 124
+ L +N FAD+T++EF AS F G +SI + NASV P+++DWR
Sbjct: 84 YKLGINQFADITNEEFIASRNKFKGHMCSSITKTSTFKYENASV---------PSTVDWR 134
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGG 183
KKGAVT VK+Q CG CWAFSA A EGI+K+ TG LVSLSEQEL+DCD + + GC GG
Sbjct: 135 KKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGG 194
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 243
LMD A++F+I+NHG+ TE YPY+G G C+ + + TI GY+DVP NNE L +AV
Sbjct: 195 LMDDAFKFIIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAV 254
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRS 302
QP+SV I S FQ Y SG+FTG C T LDH V VGY S +G YW++KNSWG
Sbjct: 255 ANQPISVAIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGND 314
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ + G+CGI M+ASYPT
Sbjct: 315 WGEEGYIRMQRSVDAAQGLCGIAMMASYPT 344
>gi|449532567|ref|XP_004173252.1| PREDICTED: oryzain alpha chain-like [Cucumis sativus]
Length = 321
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 151/269 (56%), Positives = 186/269 (69%), Gaps = 7/269 (2%)
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFS+ A+EGIN+IVTG L+ LSEQEL+DCD+S+N GC GGLMDYA+QF+I N GI
Sbjct: 13 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGI 72
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
DTE+DYPY+G+ C+ + N +VTIDGY+DVPEN+E L +AV QPVSV I RA
Sbjct: 73 DTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRA 132
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN- 317
FQLY SG+FTG C T LDH V+ VGY ++NG DYWI++NSWG+ WG +GY+ ++RN N
Sbjct: 133 FQLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANI 192
Query: 318 SLGICGINMLASYPTKTGQN------PPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371
+ G CGI + SYPTK+G N PPSP PT C C G TCCC C
Sbjct: 193 TTGKCGIAVQPSYPTKSGANPPKPSASPPSPVKPPTECDEYFSCEEGSTCCCIYQFGSTC 252
Query: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICD 400
+W CC SA CC DH CCP YP+CD
Sbjct: 253 FAWGCCPLESATCCDDHYSCCPHEYPVCD 281
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 164/364 (45%), Positives = 210/364 (57%), Gaps = 26/364 (7%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINE-----------LFETWCKQHGKAYSSEQEKQQR 49
M F LS+ L+ L + D +E L+E W + H +S EK +R
Sbjct: 3 MKKFLFVALSLALV--LGITESLDFHEKDLESEESLWDLYERW-RSHHTVSTSLDEKHKR 59
Query: 50 LKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDR------R 103
+F++N V + N MG + L LN FAD+T+ EF++ + G + + H R R
Sbjct: 60 FNVFKENVMHVHKTNKMGKP-YKLKLNKFADMTNHEFRSVYAG---SKVKHHRMFRGTTR 115
Query: 104 RNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVS 163
N S G + VP S+DWRKKGAVT VKDQ CG+CWAFS A+EGIN I T LVS
Sbjct: 116 GNGSFMY-GKVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVS 174
Query: 164 LSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV 223
LSEQEL+DCD + N GC GGLM+YA++F+ K GI TE YPY+ + G C+ K N V
Sbjct: 175 LSEQELVDCDTTENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAV 234
Query: 224 TIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVG 283
+IDGY+ VPEN+E LL+A QPVSV I FQ YS G+F G C T LDH V +VG
Sbjct: 235 SIDGYEKVPENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVG 294
Query: 284 YDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
Y + +G YWI++NSWG WG GY+ MQR + G+CGI M ASYP K P
Sbjct: 295 YGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYPIKNSSTNPSGT 354
Query: 343 PPGP 346
P
Sbjct: 355 KSSP 358
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 315 bits (806), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 155/329 (47%), Positives = 208/329 (63%), Gaps = 21/329 (6%)
Query: 25 INELFETWCKQH--------GKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76
+ L+E W ++ G + + E ++R +F +N ++ + N G F L+LN
Sbjct: 38 LRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRLALN 97
Query: 77 AFADLTHQEFKASFLGFSAASIDHDR-------RRNASVQSPGNLRD-VPASIDWRKKGA 128
FAD+T EF+ ++ G A H R S + G+ D +P ++DWR++GA
Sbjct: 98 KFADMTTDEFRRTYAGSRAR---HHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRERGA 154
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT +KDQ CG+CWAFS A+EG+NKI TG LV+LSEQEL+DCD N GC GGLMDYA
Sbjct: 155 VTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDYA 214
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+QF+ +N GI TE +YPYR + G+CNK K + H VTIDGY+DVP N+E L +AV QPV
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNG 307
+V + S + FQ YS G+FTG C T LDH V VGY + +G YWI+KNSWG WG G
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERG 334
Query: 308 YMHMQRN-TGNSLGICGINMLASYPTKTG 335
Y+ MQR + +S G+CGI M ASYP K+G
Sbjct: 335 YIRMQRGVSSDSNGLCGIAMEASYPVKSG 363
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 315 bits (806), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 169/339 (49%), Positives = 211/339 (62%), Gaps = 27/339 (7%)
Query: 24 DINELFETWCKQHGKAYSS-------------EQEKQQRLKIFEDNYAFVTQHN---NMG 67
++ ++E W +HG+ SS E++++ RL++F DN ++ +HN + G
Sbjct: 79 EVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADAG 138
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASID------HDRRRNASVQSPGNLRDVPASI 121
+F L L FADLT E++ LGF A + H A + G+L +P +I
Sbjct: 139 LHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRG-GDL--LPDAI 195
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWR+ GAVTEVKDQ CG CWAFSA AIEGIN I TG+LVSLSEQE+IDCD + +SGC
Sbjct: 196 DWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCD-AQDSGCD 254
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLN-RHIVTIDGYKDVPENNEKQLL 240
GG M+ A++FVI N GIDTE DYP+ G G C+ K N + TIDG +V NNE L
Sbjct: 255 GGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETALQ 314
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWG 300
+AV QPVSV I S RAFQ YSSGIF GPC TSLDH V VGY SE+G DYWI+KNSW
Sbjct: 315 EAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVKNSWS 374
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPP 339
SWG GY+ M+RN G CGI M ASYP K + P
Sbjct: 375 ASWGEAGYIRMRRNVPRPTGKCGIAMDASYPVKDTYHDP 413
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 157/333 (47%), Positives = 203/333 (60%), Gaps = 7/333 (2%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA L L+S D ++E E W + G+ Y+ EK+ R KIF++N +
Sbjct: 11 SLALIFLLGALVSQAMARTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRI 70
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N S+ L +N FADLT++EFK S F H A NL P+S
Sbjct: 71 ESFNKASGKSYKLGINQFADLTNEEFKTSRNRFKG----HMCSSQAGPFRYENLTAAPSS 126
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSG 179
+DWRKKGAVT +KDQ CG+CWAFSA A+EGI ++ T L+SLSEQEL+DCD + + G
Sbjct: 127 MDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A++F+ +N G+ TE +YPY G G CN ++ H I+G++DVP NNE L
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGAL 246
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSW 299
++AV QPVSV I FQ YSSGIFTG C T LDH V VGY NG++YW++KNSW
Sbjct: 247 MKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYGESNGMNYWLVKNSW 306
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G WG GY+ MQ++ G+CGI M ASYPT
Sbjct: 307 GTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPT 339
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 314 bits (805), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 155/308 (50%), Positives = 207/308 (67%), Gaps = 4/308 (1%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE+W +HGK Y S EK++RL IFEDN F+T N N S+ L LN FADL+ E+
Sbjct: 55 MFESWMVKHGKVYESVAEKERRLTIFEDNLRFITNRN-AENLSYRLGLNRFADLSLHEYA 113
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVTEVKDQASCGACWAFSA 146
G + +S + + DV P S+DWR +GAVTEVKDQ C +CWAFS
Sbjct: 114 QICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFST 173
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++ N G+ T+ DYPY
Sbjct: 174 VGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPY 232
Query: 207 RGQAGQCNKQ-KLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
+ G CN + K N V IDGY+++P N+E L++AV QPV+ + S R FQLY+SG
Sbjct: 233 KALNGVCNDRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASG 292
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
+F G C T+L+H V++VGY +ENG DYWI++NS G +WG GYM M RN N G+CGI
Sbjct: 293 VFDGTCGTNLNHGVVVVGYGTENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIA 352
Query: 326 MLASYPTK 333
M ASYP K
Sbjct: 353 MRASYPLK 360
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 314 bits (804), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 159/325 (48%), Positives = 200/325 (61%), Gaps = 9/325 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H +S EK++R +F N V N M + + L LN FAD+T+ EF
Sbjct: 36 DLYEKW-RSHHTVSTSLDEKRKRFNVFRANVLHVHNTNKM-DKPYKLKLNKFADMTNHEF 93
Query: 87 KASFLGFSAASIDHDRRRNASVQSP----GNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
+ ++ S+ H R A + + GN+ VPASIDWRKKGAVT VKDQ CG+CW
Sbjct: 94 RTAYA--SSKVKHHTMFRGAPLGNGSFMYGNIDKVPASIDWRKKGAVTPVKDQGKCGSCW 151
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFS A+EGIN I T L+SLSEQEL+DC+ N GC GGLMDYA++F+ K GI TE
Sbjct: 152 AFSTIVAVEGINFIKTNKLISLSEQELVDCNTGENHGCNGGLMDYAFEFITKQKGITTEA 211
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
+YPYR Q G C+ K N+ V+IDG++DV NNE LL+AV QPVSV I FQ Y
Sbjct: 212 NYPYRAQDGHCDANKANQPAVSIDGHEDVLHNNENALLKAVANQPVSVAIDAGGSDFQFY 271
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
S G+FTG C LDH V IVGY + +G YWI++NSWG WG GY+ MQR + G+
Sbjct: 272 SEGVFTGECGKELDHGVAIVGYGTTVDGTKYWIVRNSWGPEWGERGYIRMQRGISDRRGL 331
Query: 322 CGINMLASYPTKTGQNPPPSPPPGP 346
CGI M ASYP K P P P
Sbjct: 332 CGIAMEASYPIKKSSTNPIGPADSP 356
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 155/345 (44%), Positives = 220/345 (63%), Gaps = 17/345 (4%)
Query: 2 NSLAFFLLSILLLSSLPLNYCS----------DINELFETWCKQHGKAYSSEQEKQQRLK 51
N +A L+ ++++ + P +I +FE W +HGK+YSS+ EK +RL
Sbjct: 4 NMIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLM 63
Query: 52 IFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP 111
IF D A++ +HN N++FTL LN F+DLT+ EF+A +G DR +
Sbjct: 64 IFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRL--PAEDED 121
Query: 112 GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
++ +P S+DWR+KGAVT +KDQ CG+CWAFSA +IE + + T LVSLSEQ+L+D
Sbjct: 122 VDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMD 181
Query: 172 CDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLN--RHIVTIDGYK 229
CD + ++GC GGLM+ A++FV+KN G+ TE YPY G G CN K+ + I G+K
Sbjct: 182 CD-TVDAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANKVAIINKVAEITGFK 240
Query: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENG 289
V E++ L++AV PV+V ICGS+ FQ Y SGI +G C SLDH VL++GY +E G
Sbjct: 241 VVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYGTEGG 300
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
+ YWIIKNSWG SWG +G+M ++R G+ GICG+N +SYPT +
Sbjct: 301 MPYWIIKNSWGTSWGEDGFMKIERKDGD--GICGMNGDSSYPTTS 343
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 313 bits (803), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 153/310 (49%), Positives = 213/310 (68%), Gaps = 17/310 (5%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++E W ++ K Y+ EK++R KIF++N F+ +HN++ N +F + L FADLT+ E K
Sbjct: 1 MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDEPK 60
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
F+ + + + G++ +P IDWR KGAV VKDQ +CG+CWAFSA
Sbjct: 61 -DFM-----------KADRYLYKEGDI--LPDEIDWRAKGAVVPVKDQGNCGSCWAFSAV 106
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
GA+EGIN+I TG L+SLS+QELIDCDR + N+GC GG+M+YA++F+I N GI++++DYPY
Sbjct: 107 GAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYPY 166
Query: 207 RG-QAGQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
G CN K N +V IDGY+ V +N+EK L +AV QPV V I S +AF+LY S
Sbjct: 167 TATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKS 226
Query: 265 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
G+FTG C LDH V++VGY + +G DYWII+NSWG +WG NGY+ +QRN +S G CG+
Sbjct: 227 GVFTGTCGIYLDHGVVVVGYGTSSGEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGKCGV 286
Query: 325 NMLASYPTKT 334
M+ SYPTK+
Sbjct: 287 AMMPSYPTKS 296
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 313 bits (803), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 156/337 (46%), Positives = 209/337 (62%), Gaps = 12/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA S L + D + E E W ++ K Y QE+++R KIF++N ++
Sbjct: 11 SLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYI 70
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKA---SFLGFSAASIDHDRRRNASVQSPGNLRDV 117
NN N +TL +N FADLT++EF A F G +SI R + + N+ +
Sbjct: 71 EAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSI----TRTTTFKYE-NVTAI 125
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P+++DWR+KGAVT +KDQ CG CWAFSA A EGI+ + G L+SLSEQE++DCD +
Sbjct: 126 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGE 185
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GG MD A++F+I+NHG++ E +YPY+ G+CN + H+ TI GY+DVP NNE
Sbjct: 186 DQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNE 245
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWII 295
K L +AV QPVSV I S FQ Y SG+FTG C T LDH V VGY S +G +YW++
Sbjct: 246 KALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLV 305
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG WG GY+ MQR G+ GI M+ASYPT
Sbjct: 306 KNSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYPT 342
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 313 bits (803), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 158/307 (51%), Positives = 202/307 (65%), Gaps = 11/307 (3%)
Query: 32 WCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLTHQEFKAS- 89
W Q+GK Y QE++ R KIF++N ++ NN ++ S+ L +N FADLT++EF AS
Sbjct: 42 WMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFADLTNEEFIASR 101
Query: 90 --FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
F G +SI R S + N+ +P+++DWRKKGAVT VK+Q CG CWAFSA
Sbjct: 102 NKFKGHMCSSI----MRTTSFKYE-NVSGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAV 156
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A EGI+K+ TG L+SLSEQEL+DCD + + GC GGLMD A++F+I+NHG+ TE YPY
Sbjct: 157 AATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPY 216
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
G G CN K + VTI GY+DVP N+E+ L +AV QP+SV I S FQ Y SG+
Sbjct: 217 EGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGV 276
Query: 267 FTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
FTG C T LDH V VGY S +G YW++KNSWG WG GY+ MQR + GICGI
Sbjct: 277 FTGACGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGIEAAEGICGIA 336
Query: 326 MLASYPT 332
M ASYPT
Sbjct: 337 MQASYPT 343
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 313 bits (803), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 153/309 (49%), Positives = 201/309 (65%), Gaps = 14/309 (4%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W QHG+ Y +EK++R IF++N + NN + + L +N FADLT++EF+A
Sbjct: 6 EEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAM 65
Query: 90 FLGFSAASIDHDRRRNASVQSPG----NLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
G+ +R+++ + S NL +P S+DWRK GAVT VKDQ +CG CWAFS
Sbjct: 66 HHGY--------KRQSSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCWAFS 117
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
A AIEGI K+ TG L+SLSEQ+L+DCD + + GCGGGLMD A+QF+++N G+ +E Y
Sbjct: 118 AVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSEATY 177
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PY+G G C +K I GY+DVP NNE LLQAV QPVSV + G FQ Y S
Sbjct: 178 PYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFYKS 237
Query: 265 GIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
G+F G C T LDHAV +GY + +G +YW++KNSWG SWG +GYM MQR G G+CG
Sbjct: 238 GVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREGLCG 297
Query: 324 INMLASYPT 332
+ M ASYPT
Sbjct: 298 VAMDASYPT 306
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 313 bits (802), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/329 (47%), Positives = 210/329 (63%), Gaps = 10/329 (3%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
S++ S L + + LF++W +H K Y S +EK +R IF+ N + + N N
Sbjct: 26 SVVGYSQEDLALPNRLVNLFKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAE-TNRKNG 84
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWR 124
S+ L LN FAD+TH+EFKA+ LG R A ++P R ++P S+DWR
Sbjct: 85 SYWLGLNQFADITHEEFKANHLGLKQGL----SRMGAQTRTPTTFRYAAAANLPWSVDWR 140
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 184
KGAVT VK+Q CG+CWAFS+ A+EGIN+IVTG LVSLSEQEL+DCD + GC GGL
Sbjct: 141 YKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQELMDCDTMLDHGCEGGL 200
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 244
MD+A+ +++ + GI E DYPY + G C +++ ++VTI GY+DVPEN+E LL+A+
Sbjct: 201 MDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTITGYEDVPENSEISLLKALA 260
Query: 245 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
QPVSVGI R FQ Y G+F G CS LDHA+ VGY S G +Y +KNSWG++WG
Sbjct: 261 HQPVSVGIAAGSRDFQFYKGGVFDGSCSDELDHALTAVGYGSSYGQNYITMKNSWGKNWG 320
Query: 305 MNGYMHMQRNTGNSLGICGINMLASYPTK 333
GY+ ++ TG G+CGI +ASYP K
Sbjct: 321 EQGYVRIKMGTGKPEGVCGIYTMASYPVK 349
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 313 bits (802), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/308 (50%), Positives = 206/308 (66%), Gaps = 4/308 (1%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE+W +HGK Y S EK++RL IFEDN F+ N N S+ L L FADL+ E+K
Sbjct: 48 IFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRN-AENLSYRLGLTGFADLSLHEYK 106
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVTEVKDQASCGACWAFSA 146
G + +S + + DV P S+DWR +GAVTEVKDQ C +CWAFS
Sbjct: 107 EVCHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 166
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++KN G+ T+ DYPY
Sbjct: 167 VGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPY 225
Query: 207 RGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
+ G C+ + K N V IDGY+++P N+E L++AV QPV+ I S R FQLY SG
Sbjct: 226 KAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESG 285
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
+F G C T+L+H V++VGY +ENG DYW++KNS G +WG GYM M RN N G+CGI
Sbjct: 286 VFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIA 345
Query: 326 MLASYPTK 333
M ASYP K
Sbjct: 346 MRASYPLK 353
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 313 bits (802), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/334 (46%), Positives = 206/334 (61%), Gaps = 10/334 (2%)
Query: 7 FLLSILLLSSLPLNYCS-DINELF-----ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
F IL+L S ++ E + E W +GK Y EK++R KIF++N ++
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N GN + LS+N FAD T+++FK + G+ R + N+ VPA+
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPF--QTRPMKVTSFKYENVTAVPAT 127
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSG 179
+DWRKKGAVT +KDQ CG+CWAFS A EGIN++ TG LVSLSEQEL+DCD + + G
Sbjct: 128 MDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQG 187
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLM+ ++F+IKNHGI TE +YPY+ G CN +K HI I GY+ VP N+E +L
Sbjct: 188 CEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAEL 247
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNS 298
L+ V QP+SV I FQ YSSG+FTG C T LDH V VGY ++ +G YW++KNS
Sbjct: 248 LKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWLVKNS 307
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
W SWG GY+ MQR+ G+CGI M +SYPT
Sbjct: 308 WXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPT 341
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 313 bits (802), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 162/328 (49%), Positives = 205/328 (62%), Gaps = 14/328 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAFADLTHQE 85
EL+E W + H S EK +R +F+ N +V HN N + + L LN FAD+T+ E
Sbjct: 36 ELYERW-RSHHTVSRSLDEKDKRFNVFKANVHYV--HNFNKKDKPYKLKLNKFADMTNHE 92
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVKDQASCGA 140
F+ + G + I H R + ++ G VP ++DWRKKGAVT VKDQ CG+
Sbjct: 93 FRHHYAG---SKIKHHRTFLGASRANGTFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGS 149
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGIN+I T LVSLSEQEL+DCD S N GC GGLMD A++F+ K GI+T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E++YPY + G+C+ QK N +V+IDG++DVP N+E LL+AV QPVSV I S FQ
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQ 269
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G+FTG C T LDH V IVGY + + YWI+KNSWG WG GY+ MQR
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDAEE 329
Query: 320 GICGINMLASYPTKT-GQNPPPSPPPGP 346
G+CGI M SYP KT NP SP P
Sbjct: 330 GLCGIAMQPSYPIKTSSSNPTGSPATAP 357
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/308 (50%), Positives = 206/308 (66%), Gaps = 4/308 (1%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE+W +HGK Y S EK++RL IFEDN F+ N N S+ L L FADL+ E+K
Sbjct: 41 IFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRN-AENLSYRLGLTGFADLSLHEYK 99
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVTEVKDQASCGACWAFSA 146
G + +S + + DV P S+DWR +GAVTEVKDQ C +CWAFS
Sbjct: 100 EVCHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 159
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++KN G+ T+ DYPY
Sbjct: 160 VGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPY 218
Query: 207 RGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
+ G C+ + K N V IDGY+++P N+E L++AV QPV+ I S R FQLY SG
Sbjct: 219 KAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESG 278
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
+F G C T+L+H V++VGY +ENG DYW++KNS G +WG GYM M RN N G+CGI
Sbjct: 279 VFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIA 338
Query: 326 MLASYPTK 333
M ASYP K
Sbjct: 339 MRASYPLK 346
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 154/331 (46%), Positives = 205/331 (61%), Gaps = 5/331 (1%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L FFL ++ + + I+E E W + + YS +EK+ R KIF++N +
Sbjct: 13 ALIFFLGALASQAIARTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIES 72
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
N S+ L +N FADLT++EFK S F H A N+ VP+S+D
Sbjct: 73 FNKASEKSYKLGINQFADLTNEEFKTSRNRFKG----HMCSSQAGPFRYENITAVPSSMD 128
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCG 181
WRK+GAVT +KDQ CG+CWAFSA A+EGI ++ T L+SLSEQEL+DCD + + GC
Sbjct: 129 WRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQ 188
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMD A++F+ +N G+ TE +YPY G G CN ++ H I+G++DVP NNE L++
Sbjct: 189 GGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGALMK 248
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
AV QPVSV I FQ YSSGIFTG C T LDH V VGY NG++YW++KNSWG
Sbjct: 249 AVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGESNGMNYWLVKNSWGT 308
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQ++ G+CGI M ASYPT
Sbjct: 309 QWGEEGYIRMQKDIDAKEGLCGIAMQASYPT 339
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 166/335 (49%), Positives = 211/335 (62%), Gaps = 8/335 (2%)
Query: 3 SLA-FFLLSILL--LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
SLA FF L L ++S L S + E E W ++GK Y +EK++R ++F++N +
Sbjct: 11 SLALFFCLGFLAFQVASRTLQDAS-MYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNY 69
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
+ NN N + L +N FADLT +EF F+ + + R N+ +P
Sbjct: 70 IEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHTRSSNTRTTTFKYE--NVTVLPD 127
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNS 178
SIDWR+KGAVT +K+Q SCG CWAFSA A EGI+KI TG LVSLSEQE++DCD + +
Sbjct: 128 SIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDH 187
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG MD A++F+I+NHGI+TE YPY+G G+CN ++ H TI GY+DVP NNEK
Sbjct: 188 GCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHAATITGYEDVPINNEKA 247
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN-GVDYWIIKN 297
L +AV QPVSV I S FQ Y SGIFTG C T LDH V VGY N G YW++KN
Sbjct: 248 LQKAVANQPVSVAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGTKYWLVKN 307
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG WG GY+ MQR GICGI M+ASYPT
Sbjct: 308 SWGTEWGEEGYIMMQRGVKAVEGICGIAMMASYPT 342
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/346 (45%), Positives = 211/346 (60%), Gaps = 22/346 (6%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
L S + + L + +L+E W H + EK +R F+ N F+ HN G
Sbjct: 25 LCSAIPMEDKDLESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRG 83
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG---------NLRDVP 118
+ + L LN F D+ EF+A+F+G D RR+ + P N+ D+P
Sbjct: 84 DHPYRLHLNRFGDMDQAEFRATFVG--------DLRRDTPAKPPSVPGFMYAALNVSDLP 135
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
S+DWR+KGAVT VKDQ CG+CWAFS ++EGIN I TGSLVSLSEQELIDCD + N
Sbjct: 136 PSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND 195
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKDVPENN 235
GC GGLMD A++++ N G+ TE YPYR G CN + ++ +V IDG++DVP N+
Sbjct: 196 GCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANS 255
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWI 294
E+ L +AV QPVSV + S +AF YS G+FTG C T LDH V +VGY +E+G YW
Sbjct: 256 EEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWT 315
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPP 340
+KNSWG SWG GY+ +++++G S G+CGI M ASYP KT P P
Sbjct: 316 VKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYNKPMP 361
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 153/326 (46%), Positives = 206/326 (63%), Gaps = 11/326 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S EK +R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H + S G VPAS+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 RSTYAG---SKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LVSLSEQEL+DCD+ N GC GGLM+ A++F+ + GI TE
Sbjct: 153 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+ Q G C++ K+N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C+T L+H V IVGY + +G +YWI++NSWG WG GY+ MQRN G
Sbjct: 273 YSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGP 346
+CGI M+ASYP K + P P
Sbjct: 333 LCGIAMMASYPIKNSSDNPTGSLSSP 358
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 155/339 (45%), Positives = 213/339 (62%), Gaps = 12/339 (3%)
Query: 3 SLAFFLLSIL---LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
+L F L + + SS P+NY + + + W H K Y EK+ R KIF++N
Sbjct: 13 ALFFIFLGVWRSQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVER 72
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP----GNLR 115
+ N + + L +N F+DLT+++F+ G+ + H + ++S N+
Sbjct: 73 IEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRS---HPKVMSSSKPKTHFRYANVT 129
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-R 174
D+P ++DWRKKGAVT +KDQ CG CWAFSA A EG++++ TG L+ LSEQEL+DCD
Sbjct: 130 DIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVE 189
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
+ GC GGL+D A+ F++KN G+ TE +YPY+G+ G CNK+K I GY+DVP N
Sbjct: 190 GEDEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVPAN 249
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYW 293
+EK LLQAV QPVSV I GS FQ YSSG+F+G CST L+HAV VGY + +G YW
Sbjct: 250 SEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKYW 309
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
IIKNSWG WG +GYM ++R+ G+CG+ M ASYPT
Sbjct: 310 IIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPT 348
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 159/342 (46%), Positives = 211/342 (61%), Gaps = 19/342 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDIN--ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
++SLA L+ L D++ E E W Q+GK Y+ EK+ R IF++N
Sbjct: 9 ISSLALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQ 68
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKAS--FLGFSAASIDHDRRRNASVQSPG---- 112
+ NN GN + L +N FADLT++EFKA F G ++ S ++P
Sbjct: 69 RIEAFNNAGNKPYKLGINQFADLTNEEFKARNRFKGHMCSN---------STRTPTFKYE 119
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
++ VPAS+DWR+KGAVT +KDQ CG CWAFSA A EGI K+ TG L+SLSEQEL+DC
Sbjct: 120 DVSSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDC 179
Query: 173 D-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231
D + + GC GGLMD A++F+++N G++TE YPY+G CN + +I G++DV
Sbjct: 180 DTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDV 239
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGV 290
P N+E LL+AV QP+SV I S FQ YSSG+FTG C T LDH V VGY S++G
Sbjct: 240 PANSESALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSDDGT 299
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 300 KYWLVKNSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPT 341
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 167/338 (49%), Positives = 213/338 (63%), Gaps = 14/338 (4%)
Query: 3 SLA-FFLLSILLLSSLPLNYCSD-INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA FF L +L + D I E E W +GK Y + QE+++RL+IF +N ++
Sbjct: 11 SLALFFCLGLLAIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYI 70
Query: 61 TQHNNMGNSS-FTLSLNAFADLTHQEFKAS---FLGFSAASIDHDRRRNASVQSPGNLRD 116
NN GN+ + L +N FADLT++EF AS F G +SI R + +
Sbjct: 71 EASNNAGNNKPYKLGINQFADLTNEEFIASRNKFKGHMCSSI----IRTTTFKYENT--S 124
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS- 175
VP+++DWRKKGAVT VK+Q CG CWAFSA A EGI+KI TG LVSLSEQEL+DCD +
Sbjct: 125 VPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNG 184
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
+ GC GGLMD A++F+I+N+GI TE YPY+G G C + + TI GY+DVP NN
Sbjct: 185 VDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANN 244
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWI 294
E L +AV QP+SV I S FQ Y SG+FTG C T LDH V VGY S +G YW+
Sbjct: 245 ENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWL 304
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+KNSWG WG GY+ MQR+ + G+CGI M ASYPT
Sbjct: 305 VKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPT 342
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 156/306 (50%), Positives = 198/306 (64%), Gaps = 10/306 (3%)
Query: 32 WCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS-- 89
W Q+GK Y QE++ R KIF +N +V N S+ L +N FADLT++EF AS
Sbjct: 42 WMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLGINQFADLTNEEFVASRN 101
Query: 90 -FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
F G +SI R + + N+ +P+++DWRKKGAVT VK+Q CG CWAFSA
Sbjct: 102 KFKGHMCSSI----TRTTTFKYE-NVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVA 156
Query: 149 AIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A EGI+K+ TG L+SLSEQEL+DCD + + GC GGLMD A++F+I+NHG+ TE YPY
Sbjct: 157 ATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYE 216
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G G CN K + VTI GY+DVP N+E+ L +AV QP+SV I S FQ Y SG+F
Sbjct: 217 GVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGVF 276
Query: 268 TGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
TG C T LDH V VGY S +G YW++KNSWG WG GY+ MQR + G+CGI M
Sbjct: 277 TGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAM 336
Query: 327 LASYPT 332
ASYPT
Sbjct: 337 QASYPT 342
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 153/308 (49%), Positives = 208/308 (67%), Gaps = 4/308 (1%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE+W +HGK Y S EK++RL IF+DN F+T N+ N + L LN FADL+ E+K
Sbjct: 63 IFESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSE-NLGYRLGLNRFADLSLHEYK 121
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVTEVKDQASCGACWAFSA 146
G + ++S + + DV P S+DWR +GAVTEVKDQ C +CWAFS
Sbjct: 122 EICHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 181
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++ N G+ T+ DYPY
Sbjct: 182 VGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIVSNGGLGTDNDYPY 240
Query: 207 RGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
+ G C+ + K N V IDGY+++P N+E L++AV QPV+ I S R FQLY SG
Sbjct: 241 KAVNGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESG 300
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
+F G C T+L+H V++VGY +ENG +YWI++NSWG +WG GYM M RN N G+CGI
Sbjct: 301 VFDGRCGTNLNHGVVVVGYGTENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLCGIA 360
Query: 326 MLASYPTK 333
M SYP K
Sbjct: 361 MRVSYPLK 368
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 155/312 (49%), Positives = 208/312 (66%), Gaps = 8/312 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +G+ Y EK++R KIF++N ++ N+ GN + LS+N FAD T++
Sbjct: 32 MSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIESVNSAGNRRYKLSINEFADQTNE 91
Query: 85 EFKASFLGFSAASIDHDRRRNASVQS--PGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
EFKAS G++ +S R R++ + S N+ VP+S+DWRKKGAVT +KDQ CG CW
Sbjct: 92 EFKASRNGYNMSS----RPRSSEITSFRYENVAAVPSSMDWRKKGAVTPIKDQGQCGCCW 147
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA A+EG+ ++ TG L+SLSEQEL+DCD S + GCGGGLMD A++F+I N G+ TE
Sbjct: 148 AFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTE 207
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+G CNK+K I Y+DVP N+E LL+AV PVSV I FQ
Sbjct: 208 ANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQF 267
Query: 262 YSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YSSG+FTG C T LDH V VGY +++G YW++KNSWG WG +GY+ M+R+ G G
Sbjct: 268 YSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERDIGADEG 327
Query: 321 ICGINMLASYPT 332
+CGI M ASYPT
Sbjct: 328 LCGIAMEASYPT 339
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 209/334 (62%), Gaps = 6/334 (1%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA L L + D + E E W ++GK Y QE+++R ++F++N ++
Sbjct: 11 SLAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYI 70
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
NN N S+ L +N FADLT++EF A GF R + N+ P++
Sbjct: 71 EAFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIR--TTTFKFENVTATPST 128
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSG 179
+DWR+KGAVT +KDQ CG CWAFSA A EGI+ + G L+SLSEQEL+DCD + + G
Sbjct: 129 VDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDQG 188
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A++F+I+NHG++TE +YPY+G G+CN + ++ TI GY+DVP NNE L
Sbjct: 189 CEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYEDVPANNEMAL 248
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNS 298
+AV QPVSV I S FQ Y SG+FTG C T LDH V VGY S++G +YW++KNS
Sbjct: 249 QKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNS 308
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG WG GY+ MQR + G+CGI M ASYPT
Sbjct: 309 WGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPT 342
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 163/363 (44%), Positives = 218/363 (60%), Gaps = 21/363 (5%)
Query: 4 LAFFLLSILLLSSL--------PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
L FL S+++L + + +++L++ W + H S E+++R +F
Sbjct: 5 LLIFLFSLVILETACGFDYEDKEIESEEGLSKLYDRW-RSHHSVPRSLHEREKRFNVFRH 63
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR----RNASVQ-- 109
N V ++N N S+ L LN FADLT EFK ++ G + I H R + S Q
Sbjct: 64 NVMHV-HNSNKKNRSYKLKLNKFADLTIHEFKNAYTG---SKIKHHRMLQGPKRGSKQFM 119
Query: 110 -SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQE 168
N+ +P+S+DWRKKGAVTE+K+Q CG+CWAFS A+EGINKI T LVSLSEQE
Sbjct: 120 YDHENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQE 179
Query: 169 LIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
L+DCD + N GC GGLM+ A++F+ KN GI TE YPY G G+C+ K N +VTIDG+
Sbjct: 180 LVDCDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGH 239
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN 288
++VPEN+E LL+AV QPVSV I FQ YS G+FTG C T L+H V VGY S+
Sbjct: 240 ENVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYGSQG 299
Query: 289 GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR 348
G YWI++NSWG WG GY+ ++R G CGI M ASYP K + P+P G +
Sbjct: 300 GKKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPIKL-SSSNPTPKDGDVK 358
Query: 349 CSL 351
L
Sbjct: 359 DEL 361
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 167/338 (49%), Positives = 212/338 (62%), Gaps = 14/338 (4%)
Query: 3 SLA-FFLLSILLLSSLPLNYCSD-INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA FF L +L + D I E E W +GK Y + QE+++RL+IF +N ++
Sbjct: 11 SLALFFCLGLLAIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYI 70
Query: 61 TQHNNMGNSS-FTLSLNAFADLTHQEFKAS---FLGFSAASIDHDRRRNASVQSPGNLRD 116
NN GN + L +N FADLT++EF AS F G +SI R + +
Sbjct: 71 EASNNAGNKKPYKLGINQFADLTNEEFIASRNKFKGHMCSSI----IRTTTFKYENT--S 124
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS- 175
VP+++DWRKKGAVT VK+Q CG CWAFSA A EGI+KI TG LVSLSEQEL+DCD +
Sbjct: 125 VPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNG 184
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
+ GC GGLMD A++F+I+N+GI TE YPY+G G C + + TI GY+DVP NN
Sbjct: 185 VDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANN 244
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWI 294
E L +AV QP+SV I S FQ Y SG+FTG C T LDH V VGY S +G YW+
Sbjct: 245 ENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWL 304
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+KNSWG WG GY+ MQR+ + G+CGI M ASYPT
Sbjct: 305 VKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPT 342
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 312 bits (799), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 212/332 (63%), Gaps = 18/332 (5%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
L +SS L S ++E E W ++GK Y QEK++R IF++N ++ NN GN
Sbjct: 20 LWAFQVSSRTLQDAS-MHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYIEASNNAGN 78
Query: 69 SSFTLSLNAFADLTHQEFKAS---FLGFSAASIDHD---RRRNASVQSPGNLRDVPASID 122
+ L +N F DLT++EF A+ F G ++SI + N + P+++D
Sbjct: 79 KPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTTFKYENVTA---------PSTVD 129
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCG 181
WR++GAVT VK+Q +CG CWAFSA A EGI+K+ TG+LVSLSEQEL+DCD S + GC
Sbjct: 130 WRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQ 189
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMD A++F+I+N G++TE YPY+G G CN + H+ TI GY+DVP NNE+ L Q
Sbjct: 190 GGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNEQALQQ 249
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWG 300
AV QP+SV I S FQ Y SG+FTG C T LDH V +VGY S++G YW++KNSWG
Sbjct: 250 AVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLVKNSWG 309
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M SYPT
Sbjct: 310 EDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPT 341
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 152/309 (49%), Positives = 194/309 (62%), Gaps = 6/309 (1%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ E E W ++GK Y EK++R IF+DN F+ N N + LS+N ADLT
Sbjct: 36 LQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLD 95
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EFKAS G+ DR + N+ +P ++DWR KGAVT +KDQ CG+CWAF
Sbjct: 96 EFKASRNGYKKI----DREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAF 151
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
S AIEGIN+I TG L+SLSEQEL+DCD + + GC GGLM+ ++F+IKN GI +E +
Sbjct: 152 STVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETN 211
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY+ G CN + I GY+ VP N+E LL+AV QP+SV I S+ +F YS
Sbjct: 212 YPYKAADGSCN-TATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYS 270
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
SGI+TG C T LDH V VGY S NG DYWI+KNSWG WG GY+ MQR + G+CG
Sbjct: 271 SGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKEGLCG 330
Query: 324 INMLASYPT 332
I M +SYPT
Sbjct: 331 IAMDSSYPT 339
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 215/330 (65%), Gaps = 6/330 (1%)
Query: 6 FFLLSILL-LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
FF+L++ +S + S + E E W +HGK Y ++EK +R +IF++N F+ N
Sbjct: 15 FFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEFIESSN 74
Query: 65 NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWR 124
GN+S+ L +N FADLT++EF+AS+ G+ D R + N+ +P S+DWR
Sbjct: 75 AAGNNSYMLGINRFADLTNEEFRASWNGYKRP---LDASRIVTPFKYENVTALPYSMDWR 131
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGG 183
+KGAVT +KDQ CG+CWAFSA A EG++K+ TG LVSLSEQEL+DCD + + GC GG
Sbjct: 132 RKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGEDKGCQGG 191
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 243
LM+ A++F+ +N GI TE +Y YRG+ G+C+ +K H+ I GY+ VPEN+E LL+AV
Sbjct: 192 LMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSEAALLKAV 251
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRS 302
QPVSV I +FQ Y SGI+ G C + L+H V VGY S +G YWI+KNSWG
Sbjct: 252 AHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTSSSGSKYWIVKNSWGPE 311
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ M+R+ + G+CGI M SYPT
Sbjct: 312 WGERGYVRMKRDITSRKGLCGIAMDCSYPT 341
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 154/305 (50%), Positives = 198/305 (64%), Gaps = 6/305 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W Q+G+ Y +E EK +R IF++N ++ N G + L +NAFADLT+QEFKAS
Sbjct: 38 EQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADLTNQEFKAS 97
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
G+ + HD N + N+ VP ++DWR KGAVT VKDQ CG CWAFSA A
Sbjct: 98 RNGYK---LPHDCSSNTPFRYE-NVSSVPTTVDWRTKGAVTPVKDQGQCGCCWAFSAVAA 153
Query: 150 IEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
+EGI K+ TG+L+SLSEQEL+DCD + + GC GGLMD A+ F+I N G+ TE +YPY+G
Sbjct: 154 MEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSFIINNKGLTTESNYPYQG 213
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
G C K K + I GY+DVP N+E L +AV QPVSV I FQ YSSG+FT
Sbjct: 214 TDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSDFQFYSSGVFT 273
Query: 269 GPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
G C T LDH V VGY +E+G YW++KNSWG SWG GY+ MQ++ G+CGI M
Sbjct: 274 GECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQ 333
Query: 328 ASYPT 332
+SYP+
Sbjct: 334 SSYPS 338
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 152/307 (49%), Positives = 195/307 (63%), Gaps = 5/307 (1%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W Q+G+ Y + EK+ R IF++N A + N+ S+ L +N FADL+++EF
Sbjct: 37 ERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEF 96
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
KAS F H A N+ VPA++DWRKKGAVT VKDQ CG CWAFSA
Sbjct: 97 KASRNRFKG----HMCSPQAGPFRYENVSAVPATMDWRKKGAVTPVKDQGQCGCCWAFSA 152
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A+EGIN++ TG L+SLSEQE++DCD + + GC GGLMD A++F+ +N G+ TE +YP
Sbjct: 153 VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 212
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y G G CN QK H I G++DVP N+E L++AV QPVSV I FQ YSSG
Sbjct: 213 YTGTDGTCNTQKEATHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSG 272
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
IFTG C T LDH V VGY +G YW++KNSWG WG GY+ MQ++ G+CGI
Sbjct: 273 IFTGSCGTQLDHGVTAVGYGISDGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIA 332
Query: 326 MLASYPT 332
M ASYP+
Sbjct: 333 MQASYPS 339
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 163/337 (48%), Positives = 209/337 (62%), Gaps = 12/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA L + L + D + E E W +HGK Y +E+++R +IF +N +V
Sbjct: 107 SLAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDPREREKRFRIFNENVNYV 166
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKA---SFLGFSAASIDHDRRRNASVQSPGNLRDV 117
NN N + L +N F DLT+QEF A F G +SI R + + N+ V
Sbjct: 167 EAFNNAANKPYKLGINQFXDLTNQEFIAPRNRFKGHMCSSI----IRTTTFKYE-NVTTV 221
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P+++DWR+ GAVT VKDQ CG CWAFSA A EGI+ + G L+SLSEQEL+DCD +
Sbjct: 222 PSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGKLISLSEQELVDCDTKGV 281
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GGLMD AY+F+I+NHG++TE +YPY+G G+CN + H TI GY+DVP NNE
Sbjct: 282 DQGCEGGLMDDAYKFIIQNHGLNTEANYPYKGVDGKCNANEAANHAATITGYEDVPANNE 341
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWII 295
K L +AV QPVSV I S FQ Y SG FTG C T LDH V VGY S++G YW++
Sbjct: 342 KALQKAVANQPVSVAIDASSSDFQFYKSGAFTGSCGTELDHGVTAVGYGVSDHGTKYWLV 401
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG WG GY+ MQR + G+CGI M ASYPT
Sbjct: 402 KNSWGTEWGEEGYIRMQRGVDSEEGVCGIAMQASYPT 438
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 311 bits (797), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 159/358 (44%), Positives = 221/358 (61%), Gaps = 23/358 (6%)
Query: 1 MNSLAFFLLSILLL-------SSLPLNYCSDINE-----LFETWCKQHGKAYSSEQEKQQ 48
M L++ LLS++L+ S+P + +E L+E W H + + + +
Sbjct: 1 MAKLSYALLSVVLVLGSVALAQSIPFDEKDLASEESLWSLYEKWRAHHAVSRDLD-DTDK 59
Query: 49 RLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR----R 104
R +F++N F+ + N ++++ L+LN F D+T+QEF++++ G + IDH +
Sbjct: 60 RFNVFKENVKFIHEFNQKKDATYKLALNKFGDMTNQEFRSTYAG---SKIDHHMTLRGVK 116
Query: 105 NASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSL 164
+A S D+P S+DWR+KGAVT VKDQ CG+CWAFS A+EGIN+I T LVSL
Sbjct: 117 DAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTNELVSL 176
Query: 165 SEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVT 224
SEQ+L+DCD + NSGC GGLMDYA+ F+ N G+ +E YPY + C + N +VT
Sbjct: 177 SEQQLVDCD-TKNSGCNGGLMDYAFDFIKNNGGLSSEDSYPYLAEQKSCGSE-ANSAVVT 234
Query: 225 IDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY 284
IDGY+DVP NNE L++AV QPVSV I S AFQ YS G+F+G C T LDH V VGY
Sbjct: 235 IDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQFYSQGVFSGHCGTELDHGVAAVGY 294
Query: 285 D-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPS 341
++G YWI+KNSWG WG +GY+ M+R + G CGI M ASYP K+ NP +
Sbjct: 295 GVDDDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRGKCGIAMEASYPIKSSPNPKKA 352
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 311 bits (797), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 153/326 (46%), Positives = 205/326 (62%), Gaps = 11/326 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S EK +R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H + S G VPAS+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 RSTYAG---SKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LVSLSEQEL+DCD+ N GC GGLM+ A++F+ + GI TE
Sbjct: 153 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY Q G C++ K+N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C+T L+H V IVGY + +G +YWI++NSWG WG GY+ MQRN G
Sbjct: 273 YSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGP 346
+CGI M+ASYP K + P P
Sbjct: 333 LCGIAMMASYPIKNSSDNPTGSLSSP 358
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 311 bits (796), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 157/326 (48%), Positives = 202/326 (61%), Gaps = 11/326 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S +K +R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRSLGDKHKRFNVFKANMMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H R + G VPAS+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 RSTYAG---SKVNHHRMFRDMPRGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGHCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LVSLSEQEL+DCD N+GC GGLM+ A+QF+ + GI TE
Sbjct: 153 WAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTEENAGCNGGLMESAFQFIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY Q G C+ K N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SYYPYTAQDGTCDASKANDLAVSIDGHENVPGNDENALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG CST L+H V IVGY + +G YWI++NSWG WG GY+ MQRN G
Sbjct: 273 YSEGVFTGDCSTELNHGVAIVGYGATVDGTSYWIVRNSWGPEWGELGYIRMQRNISKKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGP 346
+CGI MLASYP K N P P P
Sbjct: 333 LCGIAMLASYPIKNSSNNPTGPSSSP 358
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 311 bits (796), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 160/334 (47%), Positives = 201/334 (60%), Gaps = 10/334 (2%)
Query: 4 LAFFLLSILLLSSLPLNYCSDIN----ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
LA FLL + +S + + E E W ++ K Y EK++R IF+DN F
Sbjct: 12 LALFLLLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEF 71
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
+ N GN + L +N ADLT +EFKAS G + +D + N+ +PA
Sbjct: 72 IESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRS---YDYEVGTTSFKYENVTAIPA 128
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNS 178
S+DWRKKGAVT +KDQ CG+CWAFS A EGI+KI TG LVSLSEQEL+DCDR +
Sbjct: 129 SVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQ 188
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG M+ ++F+IKN GI TE +YPY+ G C + I GY+ VP N+EK
Sbjct: 189 GCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSC--KNATAPAAQIKGYEKVPVNSEKA 246
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNS 298
LL+AV QPVSV I ++ +F YSSGIFTG C T LDH V VGY NG DYWI+KNS
Sbjct: 247 LLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRANGTDYWIVKNS 306
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG WG GY+ MQR G+CGI M +SYPT
Sbjct: 307 WGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPT 340
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 311 bits (796), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 155/312 (49%), Positives = 200/312 (64%), Gaps = 5/312 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
SD+ + +E W QHG+ Y + E Q+ I++ N F+ + N N SFTL+ N FAD+T
Sbjct: 39 SDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFIN-YINAQNFSFTLTDNQFADMT 97
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++E+KA ++G + R+N S + +P S+DWRK GAVT V++Q CG+CW
Sbjct: 98 NEEYKALYMGLGTSETS---RKNQSSFKRERSKVLPISVDWRKMGAVTPVRNQGECGSCW 154
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFS A+EGINKI TG LVSLSEQEL+DCD S N GC GG M A++F+ +N GI T
Sbjct: 155 AFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTA 214
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
++YPY G+ G CNK K H+V I GY+ VP NNEK L AV QPVSV I FQL
Sbjct: 215 RNYPYIGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQL 274
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
YS GIF G C L+HAV ++GY +NG YW++KNSWG WG GY M R++ + GI
Sbjct: 275 YSKGIFNGFCGKQLNHAVTVIGYGEDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGI 334
Query: 322 CGINMLASYPTK 333
CGI M ASYP K
Sbjct: 335 CGIAMEASYPIK 346
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 311 bits (796), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 150/305 (49%), Positives = 209/305 (68%), Gaps = 7/305 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE W +H K+YSS+ EK +RL +F D A++ +HN N++FTL LN F+DLT+ EF+
Sbjct: 1 MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
A+++G DRR V ++ +P S+DWR++GAVT +KDQ CG+CWAFSA
Sbjct: 61 ANYVGKFKPPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
+IE + + T LVSLSEQ+LIDCD + + GC GG D A++FV++N G+ TE+ YPY
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPDDAFKFVVENGGVTTEEAYPYT 177
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G AG CN K +V I GYKDV +++ L++AV PV+VGICGS++ FQ Y SGI
Sbjct: 178 GFAGSCNTNK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
+G C S DHAVL++GY +E G+ YWIIKNSWG SWG +G+M +++ G G+CG+N
Sbjct: 236 SGQCCNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIKKKDGE--GMCGMNGQ 293
Query: 328 ASYPT 332
+SYPT
Sbjct: 294 SSYPT 298
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 311 bits (796), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 155/313 (49%), Positives = 200/313 (63%), Gaps = 5/313 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
SD+ + +E W QHG+ Y + E Q+ I++ N F+ + N N SFTL+ N FAD+T
Sbjct: 35 SDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFIN-YINAQNFSFTLTDNQFADMT 93
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++E+KA ++G + R+N S + +P S+DWRK GAVT V++Q CG+CW
Sbjct: 94 NEEYKALYMGLGTSETS---RKNQSSFKRERSKVLPISVDWRKMGAVTPVRNQGECGSCW 150
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFS A+EGINKI TG LVSLSEQEL+DCD S N GC GG M A++F+ +N GI T
Sbjct: 151 AFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTA 210
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
++YPY G+ G CNK K H+V I GY+ VP NNEK L AV QPVSV I FQL
Sbjct: 211 RNYPYIGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQL 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
YS GIF G C L+HAV ++GY +NG YW++KNSWG WG GY M R++ + GI
Sbjct: 271 YSKGIFNGFCGKQLNHAVTVIGYGEDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGI 330
Query: 322 CGINMLASYPTKT 334
CGI M ASYP K
Sbjct: 331 CGIAMEASYPIKA 343
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 311 bits (796), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 157/335 (46%), Positives = 205/335 (61%), Gaps = 10/335 (2%)
Query: 6 FFLLSILLLSSLPLNYCS------DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
F + +++LL + S + E E W Q+G+ Y E EK R +IF DN F
Sbjct: 28 FMIAALILLGAWACQATSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
+ + N G S+ L++N FAD T++EF+AS G+ A R ++ N+ VP+
Sbjct: 88 IEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKMAV--SSRPSQTTLFRYENVTAVPS 145
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNS 178
S+DWRKKGAVT VKDQ CG+CWAFS A EGI K+ TG L+SLSEQEL+DCD++ +
Sbjct: 146 SMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGEDQ 205
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG M+ ++F++KN GI E YPY G CN ++ I GY+ VP N+E
Sbjct: 206 GCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKISGYEKVPANSETA 265
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKN 297
LL+AV QPVSV I S AFQ YSSG+FTG C T LDH V VGY + +G YW++KN
Sbjct: 266 LLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYGKTSDGTKYWLVKN 325
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG SWG +GY+ MQR G+CGI M ASYPT
Sbjct: 326 SWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPT 360
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 311 bits (796), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 149/308 (48%), Positives = 203/308 (65%), Gaps = 14/308 (4%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W QHG+ Y +EK++R IF++N + NN + + L +N FADLT++EF+A
Sbjct: 6 EEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAM 65
Query: 90 FLGFSAASIDHDRRRNASVQSPG----NLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
+ G+ +R+++ + S NL D+P S+DWR GAVT VKDQ +CG CWAFS
Sbjct: 66 YHGY--------KRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFS 117
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
AIEGI K+ TG+L+SLSEQ+L+DC N GC GGLMD A+Q++I+N G+ +E +YP
Sbjct: 118 TVAAIEGIIKLQTGNLISLSEQQLVDCTAG-NKGCQGGLMDTAFQYIIRNGGLTSEDNYP 176
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y+G G C+ +K I GY+DVP+NNE LLQAV QPVSV + G F+ Y SG
Sbjct: 177 YQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYKSG 236
Query: 266 IFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+F G C T+L+H V +GY ++ +G DYW++KNSWG SWG +GY MQR G S G+CG+
Sbjct: 237 VFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEGLCGV 296
Query: 325 NMLASYPT 332
M ASYPT
Sbjct: 297 AMDASYPT 304
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 311 bits (796), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 154/305 (50%), Positives = 198/305 (64%), Gaps = 6/305 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W Q+G+ Y +E EK +R IF++N ++ N G + L +NAFADLT+QEFKAS
Sbjct: 40 EQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADLTNQEFKAS 99
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
G+ + HD N + N+ VP ++DWR KGAVT VKDQ CG CWAFSA A
Sbjct: 100 RNGYK---LPHDCSSNTPFRYE-NVSSVPTTVDWRTKGAVTPVKDQGQCGCCWAFSAVAA 155
Query: 150 IEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
+EGI K+ TG+L+SLSEQEL+DCD + + GC GGLMD A+ F+I N G+ TE +YPY+G
Sbjct: 156 MEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSFIINNKGLTTESNYPYQG 215
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
G C K K + I GY+DVP N+E L +AV QPVSV I FQ YSSG+FT
Sbjct: 216 TDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSDFQFYSSGVFT 275
Query: 269 GPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
G C T LDH V VGY +E+G YW++KNSWG SWG GY+ MQ++ G+CGI M
Sbjct: 276 GECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQ 335
Query: 328 ASYPT 332
+SYP+
Sbjct: 336 SSYPS 340
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 310 bits (795), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 153/319 (47%), Positives = 199/319 (62%), Gaps = 15/319 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W H + +Q KQ+R +F++N F+ + N + +F L+LN F D+T+QEF+
Sbjct: 37 LYERWRSHHAVSRDLDQ-KQKRFNVFKENVKFIHEFNKNKDVTFKLALNKFGDMTNQEFR 95
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRD-------VPASIDWRKKGAVTEVKDQASCGA 140
A + G + + H R S G+ P SIDWR++GAV VK+Q CG+
Sbjct: 96 AKYAG---SKVHHHRTMKGSRHGSGSGAKFMYENAVAPPSIDWRERGAVAAVKNQGQCGS 152
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGIN+IVT LV LSEQELIDCD N GC GGLMDYA++F+ N GI T
Sbjct: 153 CWAFSAIAAVEGINQIVTKELVPLSEQELIDCDTDQNQGCSGGLMDYAFEFIKNNGGITT 212
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E YPY+ + C K N V IDGY+DVP N+E L++AV QPV+V I S FQ
Sbjct: 213 EDVYPYQAEDATCKK---NSPAVVIDGYEDVPTNDEDALMKAVANQPVAVAIEASGYVFQ 269
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G+FTG C T LDH V +VGY +++G YW ++NSWG WG +GY+ MQR +
Sbjct: 270 FYSEGVFTGRCGTELDHGVAVVGYGTTQDGTKYWTVRNSWGADWGESGYVRMQRGIKATH 329
Query: 320 GICGINMLASYPTKTGQNP 338
G+CGI M ASYP KT NP
Sbjct: 330 GLCGIAMQASYPIKTSLNP 348
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 310 bits (795), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 158/334 (47%), Positives = 204/334 (61%), Gaps = 7/334 (2%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
+LA FL+ D + E E W HGK Y EK+Q+ +IF +N +
Sbjct: 10 TLALFLIFAFCAFEANARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQIFMENVQRI 69
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
NN G + L +N FADLT++EFKA + + R R + + N+ VPAS
Sbjct: 70 EAFNNAGXKPYKLGINHFADLTNEEFKA--INRFKGHVCSKRTRTTTFRYE-NVTAVPAS 126
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSG 179
+DWR+KGAVT +KDQ CG CWAFSA A EGI K+ TG L+SLSEQEL+DCD + + G
Sbjct: 127 LDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTKGVDQG 186
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A++F+++N G+ TE YPY G G CN + H +I GY+DVP N+E L
Sbjct: 187 CEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKGYEDVPANSESAL 246
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNS 298
L+AV QPVSV I S FQ YS G+FTG C T+LDH V VGY ++G YW++KNS
Sbjct: 247 LKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKYWLVKNS 306
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG WG GY+ MQR+ G+CGI MLASYP+
Sbjct: 307 WGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYPS 340
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 310 bits (794), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 153/305 (50%), Positives = 196/305 (64%), Gaps = 3/305 (0%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W HG+ Y+ E EKQ R +IF++N A++ HN + S+TL +N FADLT+ EF+AS
Sbjct: 56 EQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLEVNKFADLTNDEFRAS 115
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
G+ D D + + N+ VP +DWRK+GAVT VKDQ CG CWAFSA A
Sbjct: 116 RNGYKKQP-DSDSHVVSGLFRYANVSAVPDEVDWRKEGAVTPVKDQGDCGCCWAFSAVAA 174
Query: 150 IEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
+EGINK+ G LVSLSEQEL+DCD + GC GGLM+ A+QF+ K G+ E YPY G
Sbjct: 175 MEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFIEKRKGLAAESVYPYTG 234
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
+ G CN +K I G++ VP NNEK LLQAV QPVS+ I S FQ YS G+FT
Sbjct: 235 EDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAIDASGYEFQFYSGGVFT 294
Query: 269 GPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
G C T LDHA+ VGY + +G YW++KNSWG SWG NGY+ ++R++ G+CGI M
Sbjct: 295 GSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRIKRDSLAKEGLCGIAMD 354
Query: 328 ASYPT 332
SYP
Sbjct: 355 PSYPV 359
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 310 bits (794), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 152/322 (47%), Positives = 201/322 (62%), Gaps = 11/322 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W + H S EK +R +F++N FV + N + + L LN FAD+T+ EF+
Sbjct: 37 LYERW-RSHHTVSRSLDEKHKRFNVFKENVNFVHEFNKK-DEPYKLKLNKFADMTNHEFR 94
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGN-----LRDVPASIDWRKKGAVTEVKDQASCGACW 142
+++ G + ++H R S + G+ ++ VP S+DWRKKGAVT +KDQ CG+CW
Sbjct: 95 STYAG---SKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQCGSCW 151
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFS A+EGIN I T LVSLSEQEL+DCD S N GC GGLM YA++F+ + GI TE+
Sbjct: 152 AFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGITTEQ 211
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
YPY + G C+ K+N +V+IDG++ VP NNE LL+A QP+SV I AFQ Y
Sbjct: 212 SYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSAFQFY 271
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
S G+F G C T LDH V IVGY + +G YWI+KNSWG WG NGY+ M+R G+
Sbjct: 272 SEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRGISAKEGL 331
Query: 322 CGINMLASYPTKTGQNPPPSPP 343
CGI + ASYP K P P
Sbjct: 332 CGIAVEASYPIKNSSTNPVGAP 353
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 310 bits (794), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 151/307 (49%), Positives = 195/307 (63%), Gaps = 5/307 (1%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W Q+G+ Y + E+ R IF++N A + N+ S+ L +N FADLT++EF
Sbjct: 3 ERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEF 62
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
KAS F H A N+ VP+++DWRK+GAVT VKDQ CG CWAFSA
Sbjct: 63 KASRNRFKG----HMCSPQAGPFRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSA 118
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A+EGINK+ TG L+SLSEQE++DCD + + GC GGLMD A++F+ +N G+ TE +YP
Sbjct: 119 VAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 178
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y+G G CN +K H I G++DVP N+E L++AV QPVSV I FQ YSSG
Sbjct: 179 YKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSG 238
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
IFTG C T LDH V VGY +G YW++KNSWG WG GY+ MQ++ G+CGI
Sbjct: 239 IFTGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIA 298
Query: 326 MLASYPT 332
M ASYPT
Sbjct: 299 MQASYPT 305
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 151/307 (49%), Positives = 194/307 (63%), Gaps = 5/307 (1%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W Q+G+ Y + E+ R IF++N A + N+ S+ L +N FADLT++EF
Sbjct: 37 ERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEF 96
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
KAS F H A N+ VP+++DWRK+GAVT VKDQ CG CWAFSA
Sbjct: 97 KASRNRFKG----HMCSPQAGPFRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSA 152
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A+EGINK+ TG L+SLSEQE++DCD + + GC GGLMD A++F+ +N G+ TE +YP
Sbjct: 153 VAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 212
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y+G G CN K H I G++DVP N+E L++AV QPVSV I FQ YSSG
Sbjct: 213 YKGTDGTCNTNKAAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSG 272
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
IFTG C T LDH V VGY +G YW++KNSWG WG GY+ MQ++ G+CGI
Sbjct: 273 IFTGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIA 332
Query: 326 MLASYPT 332
M ASYPT
Sbjct: 333 MQASYPT 339
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 154/336 (45%), Positives = 218/336 (64%), Gaps = 21/336 (6%)
Query: 7 FLLSILLLSSL-------PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
+L ++L+ +L PL+ + LF+ + + K Y S +E+ +R +F N F
Sbjct: 1 MMLKLVLVCALVGAAMAEPLSLTVNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDF 60
Query: 60 VTQHNN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD 116
+ +HN G + T+ +N FADLT++E++ +L + R+ + P
Sbjct: 61 INRHNAEAARGVHTHTVDVNQFADLTNEEYRQLYLRPYPTELLGRERQEVWLDGPN---- 116
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
S+DWR+KGAVT +K+Q CG+CW+FS TG++EG + I TG+LVSLSEQ+L+DC S+
Sbjct: 117 -AGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSF 175
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N GC GGLMD A++++I N G+DTE+DYPY + G C+K K ++H V+I GYKDVP+NN
Sbjct: 176 GNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNN 235
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E QL AV PVSV I +++FQ+YSSG+F+GPC T+LDH VL+VGY S DYWI+
Sbjct: 236 EDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGYTS----DYWIV 291
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSWG SWG GY+ M+R +S GICGI M SYP
Sbjct: 292 KNSWGASWGDQGYIMMKRGV-SSAGICGIAMQPSYP 326
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 151/309 (48%), Positives = 194/309 (62%), Gaps = 6/309 (1%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ E E W ++GK Y EK++R IF+DN F+ N N + LS+N ADLT
Sbjct: 36 LQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLD 95
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EFKAS G+ DR + N+ +P ++DWR KGAVT +KDQ CG+CWAF
Sbjct: 96 EFKASRNGYKKI----DREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAF 151
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
S AIEGIN+I TG L+SLSEQEL+DCD + + GC GGLM+ ++F+IKN GI +E +
Sbjct: 152 STVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETN 211
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY+ G C+ + I GY+ VP N+E LL+AV QP+SV I S+ +F YS
Sbjct: 212 YPYKAADGSCSAA-TTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYS 270
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
SGI+TG C T LDH V VGY S NG DYWI+KNSWG WG GY+ MQR + G+CG
Sbjct: 271 SGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKEGLCG 330
Query: 324 INMLASYPT 332
I M +SYPT
Sbjct: 331 IAMDSSYPT 339
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 160/344 (46%), Positives = 210/344 (61%), Gaps = 18/344 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCS---------DINELFETWCKQHGKAYSSEQEKQQRLK 51
M SL +++L +L L CS + E W +HG+ Y EK+QRL
Sbjct: 1 MASLVCLWMALL---ALGLGACSPAAAELGDASMAERHVEWMARHGRTYKDAAEKEQRLG 57
Query: 52 IFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP 111
IF+ N ++ + N G + L+ N FADLTH+EFKA GF + + N
Sbjct: 58 IFKSNVEYI-ESFNAGKRKYQLAANQFADLTHEEFKAMHTGFKPSGTGAKKAGNGFRH-- 114
Query: 112 GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
G+L VP S+DWR KGAVT VKDQ CG+CWAF+ A+EGI KIVTG L+SLSEQ+L+D
Sbjct: 115 GSLSSVPDSVDWRSKGAVTPVKDQGLCGSCWAFTVVAAVEGITKIVTGKLISLSEQQLVD 174
Query: 172 CD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
CD + GC GG MD A++F++ N GI +E +YPY CN + + TI+ ++D
Sbjct: 175 CDVHGKDQGCQGGDMDAAFEFIVNNGGITSEANYPYEEVQRLCNAHNASFVVATIESHED 234
Query: 231 VPENNEKQLLQAVVAQPVSVGI-CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSEN 288
VP N+EK L +AV QPVSVGI GS FQLYS G+F+G C T LDHAV +VGY + +
Sbjct: 235 VPTNDEKALRKAVANQPVSVGIDAGSSLDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSD 294
Query: 289 GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G YW+ KNSWG +WG NGY+ M+R+ G+CGI M ASYPT
Sbjct: 295 GTKYWLAKNSWGETWGENGYIRMERDVAAKEGLCGIAMQASYPT 338
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 152/326 (46%), Positives = 205/326 (62%), Gaps = 11/326 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S EK +R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 37 DLYERW-RSHHTVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEF 94
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H R + G + VP S+DWRKKGAVT+VKDQ CG+C
Sbjct: 95 RSTYAG---SKVNHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSC 151
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LV+LSEQEL+DCD+ N GC GGLM+ A++F+ + GI TE
Sbjct: 152 WAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTE 211
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+ Q G C+ K+N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 212 SNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQF 271
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG CST L+H V IVGY + +G +YWI++NSWG WG +GY+ MQRN G
Sbjct: 272 YSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEG 331
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGP 346
+CGI ML SYP K + P P
Sbjct: 332 LCGIAMLPSYPIKNSSDNPTGSFSSP 357
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 151/319 (47%), Positives = 204/319 (63%), Gaps = 11/319 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S EK +R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H R + G + VP S+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 RSTYAG---SKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LV+LSEQEL+DCD+ N GC GGLM+ A++F+ + GI TE
Sbjct: 153 WAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+ Q G C+ K+N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG CST L+H V IVGY + +G +YWI++NSWG WG +GY+ MQRN G
Sbjct: 273 YSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEG 332
Query: 321 ICGINMLASYPTKTGQNPP 339
+CGI ML SYP K + P
Sbjct: 333 LCGIAMLPSYPIKNSSDNP 351
>gi|297740510|emb|CBI30692.3| unnamed protein product [Vitis vinifera]
Length = 377
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 169/322 (52%), Positives = 206/322 (63%), Gaps = 32/322 (9%)
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+ P+S+DWRKKG VT +KDQ CG+CWAFS+TGA+EGIN IVTG L+SLSEQEL+DCD +
Sbjct: 11 EAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTT 70
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N GC GG MDYA+++VI N GID+E DYPY G G CN K + +V+IDGYKDV E++
Sbjct: 71 -NYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTKVVSIDGYKDVDESD 129
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTG---PCSTSLDHAVLIVGYDSENGVDY 292
LL A V QP+SVG+ GS FQLY+SGI+ G +DHAVLIVGY SE+ DY
Sbjct: 130 -SALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSEDY 188
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK------------------- 333
WI KNSWG SWGM GY +++RNT G C IN +ASYPTK
Sbjct: 189 WICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKESSSPSPYPSPAVPPPPPP 248
Query: 334 --------TGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCC 385
PPPSP P P+ C +YC + ETCCC CL + CC + +AVCC
Sbjct: 249 PPSPPPPPPPSPPPPSPGPSPSECGDFSYCPSDETCCCIYEFYDFCLIYGCCEYENAVCC 308
Query: 386 SDHRYCCPSNYPICDSVRHQCL 407
+ YCCPS+YPICD CL
Sbjct: 309 TGTEYCCPSDYPICDVEEGLCL 330
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 157/312 (50%), Positives = 198/312 (63%), Gaps = 11/312 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSS-FTLSLNAFADLTHQE 85
E E W Q+ K Y QE+++R KIF N ++ NN N+ + L +N FADLT++E
Sbjct: 38 ERHEQWMSQYSKVYKDPQEREERHKIFTANVNYIEVFNNDANNKLYKLGINQFADLTNEE 97
Query: 86 FKAS---FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
F AS F G +SI + N+ +P+++DWRKKGAVT VK+Q CG CW
Sbjct: 98 FIASRNKFKGHMCSSI-----AKTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCW 152
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA A EGI K+ TG LVSLSEQEL+DCD + + GC GGLMD A++F+I+NHG+ TE
Sbjct: 153 AFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY+G G CN K + H TI GY+DVP NNE+ L +AV QP+SV I S FQ
Sbjct: 213 AAYPYQGVDGTCNANKASIHAATITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y SG+F+G C T LDH V VGY N G YW++KNSWG WG GY+ MQR + G
Sbjct: 273 YKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIRMQRGVDAAEG 332
Query: 321 ICGINMLASYPT 332
+CGI M ASYPT
Sbjct: 333 LCGIAMQASYPT 344
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 309 bits (792), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 156/340 (45%), Positives = 212/340 (62%), Gaps = 14/340 (4%)
Query: 4 LAFFLLSILLLSS-----LPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
LA F + + L SS P+NY + + + W H K Y EK+ R +IF++N
Sbjct: 12 LALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQIFKENVE 71
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP----GNL 114
+ N + + L N F+DLT++EF+ G+ + H + +S N+
Sbjct: 72 RIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRS---HPKVMTSSKGKTHFRYTNV 128
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD- 173
D+P ++DWRKKGAVT +KDQ CG CWAFSA A+EG++++ TG L+ LSEQEL+DCD
Sbjct: 129 TDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDV 188
Query: 174 RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
+ GC GGL+D A+ F++KN G+ TE +YPY+G+ G CNK+K I GY+DVP
Sbjct: 189 EGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYEDVPA 248
Query: 234 NNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDY 292
N+EK LLQAV QPVSV I GS FQ YSSG+F+G CST L+HAV VGY + +G Y
Sbjct: 249 NSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKY 308
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WIIKNSWG WG +GYM ++R+ G+CG+ M ASYPT
Sbjct: 309 WIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPT 348
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 309 bits (792), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 151/319 (47%), Positives = 205/319 (64%), Gaps = 13/319 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W ++H EK +R F+DN ++ +HN G + L LN F D+ +EF
Sbjct: 44 DLYERW-QEHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRGGRGYRLRLNRFGDMGREEF 102
Query: 87 KASFLGFSAASIDHDRRRNASVQSP------GNLRDVPASIDWRKKGAVTEVKDQASCGA 140
+A+F G A +D RR+ P +RD+P ++DWR+KGAVT VKDQ CG+
Sbjct: 103 RATFAGSHA----NDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGS 158
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS ++EGIN I TG LVSLSEQELIDCD + NSGC GGLM+ A++++ + GI T
Sbjct: 159 CWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGITT 218
Query: 201 EKDYPYRGQAGQCNKQKLNRH-IVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
E YPYR G C+ + R +V IDG+++VP N+E L +AV QPVSV I +++F
Sbjct: 219 ESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSF 278
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
Q YS G+F G C T LDH V +VGY ++ +G +YWI+KNSWG +WG GY+ MQR++G
Sbjct: 279 QFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDSGYD 338
Query: 319 LGICGINMLASYPTKTGQN 337
G+CGI M ASYP K N
Sbjct: 339 GGLCGIAMEASYPVKFSPN 357
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 159/348 (45%), Positives = 213/348 (61%), Gaps = 20/348 (5%)
Query: 3 SLAFFLLSILLLSSLP-----LNYCSDINELFETWCKQH---GKAYSSEQEKQQR-LKIF 53
SLA +L+ + +P L + L+E W + A EQ+ + R +F
Sbjct: 11 SLALLVLAPPARAGIPFTEKDLASEESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVF 70
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGN 113
++N ++ + N G S F L+LN FAD+T EF+ ++ + + H R ++ ++ G+
Sbjct: 71 KENVRYIHEANKKGRS-FRLALNKFADMTTDEFRRAYA--AGSRTRHHRALSSGIRRHGD 127
Query: 114 -------LRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSE 166
++P ++DWR++GAVT +KDQ CG+CWAFS A+EGINKI TG LVSLSE
Sbjct: 128 GSFMYAQAGNLPLAVDWRQRGAVTGIKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSE 187
Query: 167 QELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTID 226
QEL+DCD N GC GGLMDYA+Q++ +N GI TE +YPY + CNK K H VTID
Sbjct: 188 QELVDCDDVDNQGCNGGLMDYAFQYIKRNGGITTESNYPYLAEQRSCNKAKERSHDVTID 247
Query: 227 GYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD- 285
GY+DVP NNE L +AV QPVS+ I S + FQ YS G+FTG C T LDH V VGY
Sbjct: 248 GYEDVPANNEDALQKAVANQPVSIAIEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGI 307
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
+ +G YWI+KNSWG WG GY+ MQR +S G+CGI M SYPTK
Sbjct: 308 TRDGTKYWIVKNSWGEDWGERGYIRMQRGISDSQGLCGIAMEPSYPTK 355
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 156/332 (46%), Positives = 212/332 (63%), Gaps = 18/332 (5%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
L +SS L S + E E W ++G+ Y QEK++R IF++N ++ NN G+
Sbjct: 20 LWAFQVSSRTLQDAS-MQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNYIEASNNAGD 78
Query: 69 SSFTLSLNAFADLTHQEFKAS---FLGFSAASIDHD---RRRNASVQSPGNLRDVPASID 122
+ L +N FADLT++EF A+ F G ++SI + N + P+++D
Sbjct: 79 KPYKLGVNQFADLTNEEFIATRNKFKGHMSSSITRTTTFKYENVTA---------PSTVD 129
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCG 181
WR++GAVT VK+Q +CG CWAFSA A EGI+K+ TG+LVSLSEQEL+DCD S + GC
Sbjct: 130 WRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQ 189
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMD A++F+I+N G++TE YPY+G G CN + H+ TI GY+DVP NNE+ L Q
Sbjct: 190 GGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPSNNEQALQQ 249
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWG 300
AV QP+S+ I S FQ Y SG+FTG C T LDH V +VGY S++G YW++KNSWG
Sbjct: 250 AVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLVKNSWG 309
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CG+ M SYPT
Sbjct: 310 ADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPT 341
>gi|413945959|gb|AFW78608.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 289
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 157/251 (62%), Positives = 179/251 (71%), Gaps = 13/251 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-------------SF 71
I F+ WC +HGKAY++ +E+ RL +F DN AFV HN + S+
Sbjct: 32 IEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSY 91
Query: 72 TLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTE 131
TL+LNAFADLTH+EF+A+ LG A R G VP ++DWRK GAVT+
Sbjct: 92 TLALNAFADLTHEEFRAARLGRIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTK 151
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191
VKDQ SCGACW+FSATGA+EGINKI TGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY+F
Sbjct: 152 VKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKF 211
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251
VIKN GIDTE+DYPYR G CNK KL + +VTIDGY DVP N E LLQAV QPVSVG
Sbjct: 212 VIKNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVG 271
Query: 252 ICGSERAFQLY 262
ICGS RAFQLY
Sbjct: 272 ICGSARAFQLY 282
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 155/313 (49%), Positives = 204/313 (65%), Gaps = 14/313 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE+W +HGK Y S EK++RL IFEDN F+T N N S+ L LN FADL+ E+
Sbjct: 55 MFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRN-AENLSYRLGLNRFADLSLHEY- 112
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRD------VPASIDWRKKGAVTEVKDQASCGAC 141
G D RN + N +P S+DWR +GAVTEVKDQ C +C
Sbjct: 113 ----GEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSC 168
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++ N G+ T+
Sbjct: 169 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTD 227
Query: 202 KDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
DYPY+ G C + K + V IDGY+++P N+E L++AV QPV+ + S R FQ
Sbjct: 228 NDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQ 287
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LY SG+F G C T+L+H V++VGY +ENG DYWI+KNS G +WG GYM M RN N G
Sbjct: 288 LYESGVFDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRG 347
Query: 321 ICGINMLASYPTK 333
+CGI M ASYP K
Sbjct: 348 LCGIAMRASYPLK 360
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 151/326 (46%), Positives = 204/326 (62%), Gaps = 11/326 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S EK +R +F++N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRSLTEKHKRFNVFKENVMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H + + G VPAS+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 RSTYAG---SKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LVSLSEQEL+DCD+ N GC GGLM+ A++F+ + GI TE
Sbjct: 153 WAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY Q G C+ K+N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+ TG C+T L+H V IVGY + +G +YWI++NSWG WG GY+ MQRN G
Sbjct: 273 YSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGP 346
+CGI M+ASYP K + P P
Sbjct: 333 LCGIAMMASYPIKNSSDNPTGSFSSP 358
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 155/303 (51%), Positives = 197/303 (65%), Gaps = 13/303 (4%)
Query: 38 KAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAAS 97
KAY+S +EK +R ++F+DN + N +S+ L LN FADLTH EFKA++LG +
Sbjct: 38 KAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEFADLTHDEFKATYLGLTPPP 96
Query: 98 IDHDRRRNASVQSPGNLR-------DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAI 150
R N+ S R +VP +DWRKK AVTEVK+Q CG+CWAFS A+
Sbjct: 97 T----RSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAV 152
Query: 151 EGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQA 210
EGIN IVTG+L SLSEQELIDC N+GC GGLMDYA+ ++ G+ TE+ YPY +
Sbjct: 153 EGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEE 212
Query: 211 GQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGP 270
G C++ K +VTI GY+DVP N+E+ L++A+ QPVSV I S R FQ YS G+F GP
Sbjct: 213 GDCDEGK-GAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGP 271
Query: 271 CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASY 330
C LDH V VGY + G DY I+KNSWG WG GY+ M+R TG G+CGIN +ASY
Sbjct: 272 CGEQLDHGVTAVGYGTSKGQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKMASY 331
Query: 331 PTK 333
PTK
Sbjct: 332 PTK 334
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 308 bits (790), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 155/319 (48%), Positives = 212/319 (66%), Gaps = 20/319 (6%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSS---FTLSLNAFADLTH 83
E+F+ W ++H K Y +E ++R + F+ N ++ + N ++ + LN FAD+++
Sbjct: 47 EIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKFADMSN 106
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLR------DVPASIDWRKKGAVTEVKDQAS 137
+EF+ ++L I N + N+R D P+S+DWR G VT VKDQ S
Sbjct: 107 EEFRKAYLSKVKKPI------NKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGS 160
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG+CWAFS+TGA+EGIN +VTG L+SLSEQEL++CD S N GC GG MDYA+++VI N G
Sbjct: 161 CGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS-NYGCEGGYMDYAFEWVINNGG 219
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
ID+E DYPY G G CN K +V+IDGY+DV E ++ LL AV QPVSVGI GS
Sbjct: 220 IDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDV-EQSDSALLCAVAQQPVSVGIDGSAI 278
Query: 258 AFQLYSSGIFTGPCS---TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 314
FQLY+ GI+ G CS +DHAVLIVGY SE+ +YWI+KNSWG SWG++GY +++R+
Sbjct: 279 DFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGSEDSEEYWIVKNSWGTSWGIDGYFYLKRD 338
Query: 315 TGNSLGICGINMLASYPTK 333
T G+C +N +ASYPTK
Sbjct: 339 TDLPYGVCAVNAMASYPTK 357
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 159/336 (47%), Positives = 214/336 (63%), Gaps = 14/336 (4%)
Query: 4 LAFFL-LSILLLSSLPLNYCSD-INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
A FL L +L + +D + E+ E W QHGK Y + EKQ+R IF++N ++
Sbjct: 12 FALFLCLGLLSFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIE 71
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKAS---FLGFSAASIDHDRRRNASVQSPGNLRDVP 118
NN+GN S+ L LN FADLT+ EF A+ F G+ SI + N+ DVP
Sbjct: 72 AFNNVGNKSYKLGLNHFADLTNHEFIAARNKFNGYLHGSIITTFKYK-------NVSDVP 124
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YN 177
+++DWR++GAVT VK+Q CG CWAFSA + EGI+K+ TG+LVSLSEQEL+DCD + +
Sbjct: 125 SAVDWRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGED 184
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A++F+I+N+G+ TE +YPY+G G CNK ++ TI GY++VP N+E+
Sbjct: 185 QGCEGGLMDDAFEFIIQNNGLSTEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVNDEQ 244
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDH-AVLIVGYDSENGVDYWIIK 296
L +AV QPVSV I S FQ Y SG+FTG C T LDH ++ E+ +YW++K
Sbjct: 245 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGEDETEYWLVK 304
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
NSWG WG GY+ MQR S G+CGI M SYPT
Sbjct: 305 NSWGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPT 340
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 152/336 (45%), Positives = 205/336 (61%), Gaps = 17/336 (5%)
Query: 12 LLLSSLPLNYCSDINELFETWCKQHGKAY----SSEQEKQQRLKIFEDNYAFVTQHNNMG 67
+ S L + L+E W + + +Q++ +R +F++N +V + N
Sbjct: 24 IPFSERDLASEESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKD 83
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR---------DVP 118
F L+LN FAD+T EF+ ++ G + H R + +S + + ++P
Sbjct: 84 GRPFRLALNKFADMTTDEFRRTYAG---SRTRHHRAQLGEARSFAHAQHGRGGSGTTNLP 140
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
++DWR +GAVT VKDQ CG+CWAFSA A+EG+NKI+TG LVSLSEQEL+DCD N
Sbjct: 141 PAVDWRLRGAVTGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQ 200
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMDYA+Q++ +N G+ TE +YPY + CNK K H VTIDGY+DVP NNE
Sbjct: 201 GCDGGLMDYAFQYIQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDA 260
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKN 297
L +AV +QPV+V I S + FQ YS G+FTG C T LDH V VGY + +G YW +KN
Sbjct: 261 LQKAVASQPVAVAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKN 320
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
SWG WG GY+ MQR +S G+CGI M SYPTK
Sbjct: 321 SWGEDWGERGYIRMQRGVPDSRGLCGIAMEPSYPTK 356
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 159/339 (46%), Positives = 208/339 (61%), Gaps = 15/339 (4%)
Query: 7 FLLSILLLSSLPLN-YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
LLSI + N + + ++E E W K++GK Y EKQ+RL IF+DN F+ N
Sbjct: 15 LLLSICTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNA 74
Query: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP---GNLRDVPASID 122
GN + LS+N AD T++EF AS G+ + + + Q+P GN+ D+P ++D
Sbjct: 75 AGNKPYKLSINHLADQTNEEFVASHNGY--------KYKGSHSQTPFKYGNVTDIPTAVD 126
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182
WR+ GAVT VKDQ CG+CWAFS A EGI +I TG L+SLSEQEL+DCD S + GC G
Sbjct: 127 WRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCD-SVDHGCDG 185
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242
GLM+ ++F+IKN GI +E +YPY G C+ K I GY+ VP N+E+ L QA
Sbjct: 186 GLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEALQQA 245
Query: 243 VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGV-DYWIIKNSWG 300
V QPVSV I FQ YSSG+FTG C T LDH V +VGY +++G +YWI+KNSWG
Sbjct: 246 VANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVKNSWG 305
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPP 339
WG GY+ MQR G+CGI M ASYP + P
Sbjct: 306 TQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMGKSSDSP 344
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 155/341 (45%), Positives = 211/341 (61%), Gaps = 9/341 (2%)
Query: 1 MNSLAFFLLSILLLSS---LPLNYCSDINELF-----ETWCKQHGKAYSSEQEKQQRLKI 52
M S F++ + L+ + LP S + E + E W Q GK+Y EK++R +I
Sbjct: 1 MTSPNNFIIPMFLIFTTWMLPYVMSSRVLEPYLSNKHEKWMTQFGKSYKDAAEKEKRFQI 60
Query: 53 FEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG 112
F++N F+ N +GN F LS+N FADLT++EFKAS G D +
Sbjct: 61 FKNNVEFIELFNAVGNKPFNLSINHFADLTNEEFKASLNGNKKLHDKFDILNETTSFRYH 120
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
N+ VPAS+DWRK+GAVT +K+Q SCG+CWAFS +IEGI++I TG LVSLSEQELIDC
Sbjct: 121 NVTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDC 180
Query: 173 DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVP 232
R +SGC GG ++ A++F+ K G+ +E +YPY+ +C +K ++H+ I GY+ VP
Sbjct: 181 VRGNSSGCSGGYLEDAFKFIAKKGGMASETNYPYKETDEKCKFKKESKHVAEIKGYEKVP 240
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVD 291
N+E LL+AV QPVSV + + FQ YS GIFTG C T DH V IVGY S + +
Sbjct: 241 SNSENDLLKAVANQPVSVYVDAGDYVFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDYTE 300
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG WG GYM ++RN + G+CGI SYP
Sbjct: 301 YWLVKNSWGTGWGEKGYMKLKRNVDSKKGLCGIATNPSYPV 341
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 156/319 (48%), Positives = 206/319 (64%), Gaps = 11/319 (3%)
Query: 23 SDINELFETWCKQHG---KAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFA 79
++ L+E W + + ++ E ++R +F++N ++ + N + F L+LN FA
Sbjct: 34 ENLRGLYERWRSHYTVSRRGLGADAE-ERRFNVFKENARYIHEGNKK-DRPFRLALNKFA 91
Query: 80 DLTHQEFKASFLGFSAA---SIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
D+T EF+ ++ G S+ RR + S + G+ ++P ++DWR+KGAVT +KDQ
Sbjct: 92 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGSFRY-GDADNLPPAVDWRQKGAVTAIKDQG 150
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
CG+CWAFS A+EGINKI TG LVSLSEQEL+DCD N GC GGLMDYA+QF+ KN
Sbjct: 151 QCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIHKN- 209
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
GI TE +YPY+G+ G C+ K H VTIDGY+DVP N+E L +AV QPVSV I S
Sbjct: 210 GITTESNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASG 269
Query: 257 RAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
FQ YS G+FTG CST LDH V VGY + +G YWI+KNSWG WG GY+ MQR
Sbjct: 270 NDFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGV 329
Query: 316 GNSLGICGINMLASYPTKT 334
+ G CGI M ASYPTK+
Sbjct: 330 SQAEGQCGIAMQASYPTKS 348
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 152/331 (45%), Positives = 202/331 (61%), Gaps = 12/331 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+++E W +H K ++ EK +R +F+ N V + N M + + L LN FAD+T+ EF
Sbjct: 38 DMYERW--RH-KVATNHGEKLRRFNVFKSNVLHVHETNKM-DKPYKLKLNKFADMTNHEF 93
Query: 87 KASFLGFSAASIDHDR-----RRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
++ + G HDR R + N+ VP S+DWRKKGAV VKDQ CG+C
Sbjct: 94 RSVYAGSKIHH--HDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPVKDQGQCGSC 151
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGINKI T LVSLSEQEL+DCD N GC GGLMD A+ F+ K G+ E
Sbjct: 152 WAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFIKKTGGLTRE 211
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY + G+C+ K+N +V+IDG++DVP+N+E+ L++AV QPV+V I FQ
Sbjct: 212 DAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDAGSSDFQF 271
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C T LDH V VGY + +G YWI++NSWG WG GY+ M+R + G
Sbjct: 272 YSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIRMERGISDKRG 331
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSL 351
+CGI M ASYP K N P S P + L
Sbjct: 332 LCGIAMEASYPIKNSSNNPKSSPTSSLKDEL 362
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 159/329 (48%), Positives = 212/329 (64%), Gaps = 10/329 (3%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SIL L + ELF+ W +++ K Y S +++ R + F+ N ++ + N+ S
Sbjct: 31 SILALEIDKFPSEEGVIELFQRWKEENKKIYRSPDQEKLRFENFKRNLKYIAEKNSKRIS 90
Query: 70 SF--TLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
+ +L LN FAD++++EFK+ F +RN + D P S+DWRKKG
Sbjct: 91 PYGQSLGLNRFADMSNEEFKSKFTSKVKKPFS---KRNGLSGKDHSCEDAPYSLDWRKKG 147
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 187
VT VKDQ CG CWAFS+TGAIEGIN IV+G L+SLSE EL+DCDR+ N GC GG MDY
Sbjct: 148 VVTAVKDQGYCGCCWAFSSTGAIEGINAIVSGDLISLSEPELVDCDRT-NDGCDGGHMDY 206
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 247
A+++V+ N GIDTE +YPY G G CN K ++ IDGY +V E +++ LL A V QP
Sbjct: 207 AFEWVMHNGGIDTETNYPYSGADGTCNVAKEETKVIGIDGYYNV-EQSDRSLLCATVKQP 265
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTS---LDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
+S GI GS FQLY GI+ G CS+ +DHA+L+VGY SE DYWI+KNSWG SWG
Sbjct: 266 ISAGIDGSSWDFQLYIGGIYDGDCSSDPDDIDHAILVVGYGSEGDEDYWIVKNSWGTSWG 325
Query: 305 MNGYMHMQRNTGNSLGICGINMLASYPTK 333
M GY++++RNT G+C IN +ASYPTK
Sbjct: 326 MEGYIYIRRNTNLKYGVCAINYMASYPTK 354
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 307 bits (787), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 162/337 (48%), Positives = 206/337 (61%), Gaps = 23/337 (6%)
Query: 24 DINELFETWCKQHGKAYS----SEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLN 76
++ ++E W +HG+ + E + RL++F DN ++ HN + G +F L L
Sbjct: 49 EVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLT 108
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-SPGNLR-------------DVPASID 122
FADLT +E++ LGF A R A+ + G R D+P +ID
Sbjct: 109 PFADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDAID 168
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182
WR+ GAVT+VK+Q CG CWAFSA AIEGIN IVTG+LVSLSEQE+IDCD + +SGC G
Sbjct: 169 WRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCD-TQDSGCNG 227
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLN-RHIVTIDGYKDVPENNEKQLLQ 241
G M+ A+QFVI N GID+E DYP+ G C+ K N + IDG+ +V NNE L +
Sbjct: 228 GQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVASNNETALQE 287
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
AV QPVSV I RAFQ YSSGIF GPC T+LDH V +VGY SENG YWI+KNSW
Sbjct: 288 AVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYGSENGKAYWIVKNSWSD 347
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
SWG GY+ ++RN +G CGI M ASYP K P
Sbjct: 348 SWGEAGYIRIRRNVFLPVGKCGIAMDASYPVKDTYGP 384
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 161/345 (46%), Positives = 224/345 (64%), Gaps = 21/345 (6%)
Query: 11 ILLLS----SLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-- 64
+L LS ++P+ ++ L+ W ++ A + RL++F++N FV +HN
Sbjct: 29 VLTLSKQGGAVPVRSDEEVRMLYLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNAA 88
Query: 65 -NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDR-RRNAS--VQSPGNLR---DV 117
+ G +F L +N FADLT++E++ FL D R RR+AS + S LR D+
Sbjct: 89 ADRGEHTFRLGMNRFADLTNEEYRTRFLR------DFSRLRRSASGKISSRYRLREGDDL 142
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P SIDWR+KGAV VK+Q CG+CWAFS A+EGIN+IVTG L+SLSEQ+L+DC + N
Sbjct: 143 PDSIDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCT-TAN 201
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GG M+ A+QF++ N GI++E+ YPYRGQ G CN +N +V+ID Y++VP +NE+
Sbjct: 202 HGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICN-STVNAPVVSIDSYENVPSHNEQ 260
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L +AV QPVSV + + R FQLY SGIFTG C+ S +HA+ +VGY +EN DY +KN
Sbjct: 261 SLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDYRTVKN 320
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
SWG++WG +GY+ ++RN GN G CGI ASYP K G N P
Sbjct: 321 SWGKNWGESGYIRVERNIGNPNGKCGITRFASYPVKKGTNTAAIP 365
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 152/311 (48%), Positives = 199/311 (63%), Gaps = 9/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ E E W HGK Y+ EK+Q+ + F++N + N+ GN + L +N FADLT++
Sbjct: 36 MRERHEQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNE 95
Query: 85 EFKA--SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
EFKA F G + I R + + N+ VPA++DWR++GAVT +KDQ CG CW
Sbjct: 96 EFKAINRFKGHVCSKI----TRTPTFRYE-NMTAVPATLDWRQEGAVTPIKDQGQCGCCW 150
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA A EGI K+ TG L+SLSEQEL+DCD + + GC GGLMD A++F+++N G+ E
Sbjct: 151 AFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAE 210
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY G G CN + H +I GY+DVP N+E LL+AV QPVSV I S FQ
Sbjct: 211 AIYPYEGVDGTCNAKAEGNHATSIKGYEDVPANSESALLKAVANQPVSVAIEASGFEFQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C T+LDH V VGY S++G YW++KNSWG WG GY+ MQR+ G
Sbjct: 271 YSGGVFTGSCGTNLDHGVTAVGYGVSDDGTKYWLVKNSWGVKWGDKGYIRMQRDVAAKEG 330
Query: 321 ICGINMLASYP 331
+CGI MLASYP
Sbjct: 331 LCGIAMLASYP 341
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 154/322 (47%), Positives = 206/322 (63%), Gaps = 7/322 (2%)
Query: 14 LSSLPLNYC-SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFT 72
+ SLP++ + + ++ W +Q+G+ Y ++ E R I+ N F+ ++ N N SF
Sbjct: 30 IHSLPIDSAPTAMKVRYDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFI-EYINSQNLSFK 88
Query: 73 LSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEV 132
L+ N FADLT+ EF + +LG+ S +RRN S N D+P ++DWR+ GAVT +
Sbjct: 89 LTDNKFADLTNDEFNSIYLGYQIRSY---KRRNLSHMHE-NSTDLPDAVDWRENGAVTPI 144
Query: 133 KDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQF 191
KDQ CG+CWAFSA A+EGINKI TG+LVSLSEQEL+DCD N GC GG M+ A+ F
Sbjct: 145 KDQGQCGSCWAFSAVAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTF 204
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251
+ G+ TE DYPY+G G C K K + H V I GY+ VP NNE L AV QPVSV
Sbjct: 205 IKSIGGLTTENDYPYKGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVA 264
Query: 252 ICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311
I S FQLYS G+F+G C L+H V IVGY NG YW++KNSWG+ WG +GY+ M
Sbjct: 265 IDASGYEFQLYSEGVFSGYCGIQLNHGVTIVGYGDNNGQKYWLVKNSWGKGWGESGYIRM 324
Query: 312 QRNTGNSLGICGINMLASYPTK 333
+R++ ++ G+CGI M SYP K
Sbjct: 325 KRDSSDTKGMCGIAMEPSYPIK 346
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 307 bits (786), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 156/325 (48%), Positives = 197/325 (60%), Gaps = 11/325 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W + H S EKQ+R +F+ N V N M + + L LN FAD+T+ EF+
Sbjct: 37 LYERW-RSHHTVSRSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94
Query: 88 ASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++ S + + H R + G + VPAS+DWRKKGAVT VKDQ CG+CW
Sbjct: 95 NTY---SGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCW 151
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFS A+EGIN+I T LVSLSEQEL+DCD N GC GGLMDYA++F+ + GI TE
Sbjct: 152 AFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEA 211
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
+YPY G C+ K N V+IDG+++VPEN+E LL+AV QPVSV I FQ Y
Sbjct: 212 NYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFY 271
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
S G+FTG C T LDH V IVGY + +G YW +KNSWG WG GY+ M+R + G+
Sbjct: 272 SEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGL 331
Query: 322 CGINMLASYPTKTGQNPPPSPPPGP 346
CGI M ASYP K N P P
Sbjct: 332 CGIAMEASYPIKKSSNNPSGIKSSP 356
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 307 bits (786), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 205/338 (60%), Gaps = 7/338 (2%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELF----ETWCKQHGKAYSSEQEKQQRLKIFEDN 56
M L ++ L +L +D + L E W ++G+ YS EK +RL++F+ N
Sbjct: 1 MGFLFALVVCTFALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKAN 60
Query: 57 YAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD 116
F+ + N GN F L N FAD+T EF+A G+ I R + ++ D
Sbjct: 61 VGFI-ESVNAGNHKFWLEANQFADITKDEFRAMHKGYKMQVIGSKARATGFRYANVSIDD 119
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+PAS+DWR GAVT VKDQ CG CWAFS ++EGI K+ TG L+SLSEQEL+DCD
Sbjct: 120 LPASVDWRANGAVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGM 179
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N GCGGGLMD A++F++ N G+DTE DYPY G G CN K + +I GY+DVP N+
Sbjct: 180 QNKGCGGGLMDNAFEFIVNNGGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDVPAND 239
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWI 294
E L +AV AQPVS+ + G + F+ Y G+ TG C T LDH V VGY + +G YW+
Sbjct: 240 EASLQKAVAAQPVSIAVDGGDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGDGTKYWL 299
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+KNSWG SWG +G++ ++R+ + G+CG+ M SYPT
Sbjct: 300 VKNSWGTSWGEDGFIRLERDVADEAGMCGLAMKPSYPT 337
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 307 bits (786), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 152/308 (49%), Positives = 206/308 (66%), Gaps = 4/308 (1%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+F++W +HGK Y S EK++RL IFEDN F++ N N S+ L L FADL+ E+
Sbjct: 55 IFDSWMVKHGKVYGSVAEKERRLTIFEDNLRFISNRN-AENLSYRLGLTQFADLSLHEYG 113
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVTEVKDQASCGACWAFSA 146
G + +S + + DV P S+DWR +GAVTEVKDQ C +CWAFS
Sbjct: 114 EVCHGADPRPPRNHVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 173
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++KN G+ T+ DYPY
Sbjct: 174 VGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMKNGGLGTDNDYPY 232
Query: 207 RGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
+ G C+ + K N V IDG++++P N+E L++AV QPV+ I S R FQLY SG
Sbjct: 233 KAVNGVCDGRLKENNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESG 292
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
+F G C T+L+H V++VGY +ENG DYW++KNS G +WG GYM M RN N G+CGI
Sbjct: 293 VFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRGLCGIA 352
Query: 326 MLASYPTK 333
M ASYP K
Sbjct: 353 MRASYPLK 360
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/322 (47%), Positives = 197/322 (61%), Gaps = 11/322 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H + EKQ+R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRNLNEKQKRFNVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGAC 141
K ++ G + ++H R + + G N PAS+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 KTTYAG---SKVNHHRMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LV LSEQELIDCD N GC GGLM+YA++++ + GI TE
Sbjct: 153 WAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY G C+ K N V+IDG++ VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SYYPYTANDGSCDATKENVPAVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C L+H V IVGY + +G +YWI++NSWG WG GY+ M+RN N G
Sbjct: 273 YSEGVFTGDCGKELNHGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSP 342
+CGI M ASYP K P P
Sbjct: 333 LCGIAMEASYPVKNSSKNPAGP 354
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 160/345 (46%), Positives = 223/345 (64%), Gaps = 21/345 (6%)
Query: 11 ILLLS----SLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-- 64
+L LS ++P+ ++ L+ W ++ A + RL++F++N FV +HN
Sbjct: 31 VLTLSKQGGAVPVRSDEEVRMLYLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHNAA 90
Query: 65 -NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDR-RRNAS--VQSPGNLR---DV 117
+ G +F L +N FADLT++E++ FL D R RR+AS + S LR D+
Sbjct: 91 ADRGEHTFLLGMNRFADLTNEEYRTRFLR------DFSRLRRSASGKISSRYRLREGDDL 144
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P SIDWR+ GAV VK+Q CG+CWAFS A+EGIN+IVTG L+SLSEQ+L+DC + N
Sbjct: 145 PDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCT-TAN 203
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GG M+ A+QF++ N GI++E+ YPYRGQ G CN +N +V+ID Y++VP +NE+
Sbjct: 204 HGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNS-TVNAPVVSIDSYENVPSHNEQ 262
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L +AV QPVSV + + R FQLY SGIFTG C+ S +HA+ +VGY +EN D+WI+KN
Sbjct: 263 SLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVKN 322
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
SWG++WG +GY+ +RN N G CGI ASYP K G N P
Sbjct: 323 SWGKNWGESGYIRAERNIENPNGKCGITRFASYPVKKGANTAAIP 367
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 151/312 (48%), Positives = 205/312 (65%), Gaps = 9/312 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
SDI + ++ W ++G+ Y S +E ++R I++ N ++ N+M N S TL+ N FADLT
Sbjct: 13 SDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNFADLT 71
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++EFKA++LG+ SI R GN+ ++P ++DWR++GAVT +K+Q CG+CW
Sbjct: 72 NEEFKATYLGYKTVSIPDTCFR------YGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA A+EGINKI G L+SLSEQEL+DCD S N GC GG M A++F IK G+ TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRTGLTTE 184
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+G CN+QK V+I GY+ VP N+EK L AV QPVSV I FQ
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
YS GIF+G C L+H V IVGY + YW++KNSWG WG +GY+ M+R++ + G
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDRQGT 304
Query: 322 CGINMLASYPTK 333
CGI M+ASYPTK
Sbjct: 305 CGIAMMASYPTK 316
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 170/361 (47%), Positives = 219/361 (60%), Gaps = 37/361 (10%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGK-AYSSEQEKQQRLKIFEDNYAFVTQ 62
LA SI+ S L+ + ELFE W +H K AY+S +EK +R ++F+DN + +
Sbjct: 23 LARGDFSIVGYSEEDLSSHESLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDE 82
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHD--------------------- 101
N SS+ L LN FADLTH EFKA++LG S + D
Sbjct: 83 -TNRKVSSYWLGLNEFADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSS 141
Query: 102 ---RRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVT 158
R R V + +P S+DWR KGAVT VK+Q CG+CWAFS A+EGIN+IVT
Sbjct: 142 SSFRFRYEGVDAA----RLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVT 197
Query: 159 GSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKL 218
G+L +LSEQEL+DCD N+GC GGLMDYA+ ++ N G+ TE+ YPY + G C++
Sbjct: 198 GNLTALSEQELVDCDTDGNNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRGS- 256
Query: 219 NRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHA 278
+ +VTI GY+DVP NNE+ LL+A+ QPVSV I S R Q YS G+F GPC T LDH
Sbjct: 257 SAAVVTISGYEDVPRNNEQALLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHG 316
Query: 279 VLIVGYDS---ENG---VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
V VGY + +NG DY I+KNSWG SWG GY+ M+R TG G+CGIN + SYPT
Sbjct: 317 VAAVGYGTAGKDNGHVVADYIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYPT 376
Query: 333 K 333
K
Sbjct: 377 K 377
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 156/321 (48%), Positives = 211/321 (65%), Gaps = 15/321 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQ-QRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
+ L++ W QH + S + E+ +R +IF++N ++ N +S + L LN FADL++
Sbjct: 42 LRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK-DSPYKLGLNKFADLSN 100
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPG----NLRDVPASIDWRKKGAVTEVKDQASCG 139
+EFKA ++G D R + VQS N +PASIDWR+KGAV VK+Q CG
Sbjct: 101 EEFKAIYMG-----TKMDLRGDREVQSGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHCG 155
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS ++EGIN I TG+LVSLSEQ+L+DC + NSGC GGLMD A+Q++I N GI
Sbjct: 156 SCWAFSTVASVEGINYITTGNLVSLSEQQLVDC-STENSGCNGGLMDTAFQYIINNGGIV 214
Query: 200 TEKDYPYRGQAGQCNKQKLNRHI--VTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
TE +YPY +A +C+ K+N V IDG++DVP NNE+ L +AV QPVSV I S +
Sbjct: 215 TEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQ 274
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
FQ YS+G+FTG C T+LDH V+ VGY S G++YWI++NSWG WG GY+ MQ+
Sbjct: 275 DFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVRNSWGPKWGEEGYIRMQQGIE 334
Query: 317 NSLGICGINMLASYPTKTGQN 337
+ G CGI M ASYPTK Q+
Sbjct: 335 AAEGKCGIAMQASYPTKKTQD 355
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 151/326 (46%), Positives = 201/326 (61%), Gaps = 11/326 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S +K +R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRSLGDKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H R + + G VP S+DWRK GAVT VKDQ CG+C
Sbjct: 96 RSTYAG---SKVNHHRMFQGTPRGNGTFMYEKVGSVPPSVDWRKNGAVTGVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LVSLSEQEL+DCD N+GC GGLM+ A++F+ + GI TE
Sbjct: 153 WAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY Q G C+ K N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG CST L+H V IVGY + +G +YW ++NSWG WG GY+ MQR+ G
Sbjct: 273 YSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRSISKKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGP 346
+CGI M+ASYP K N P P P
Sbjct: 333 LCGIAMMASYPIKNSSNNPTGPSSSP 358
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 305 bits (781), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 152/305 (49%), Positives = 198/305 (64%), Gaps = 6/305 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W Q+G+ Y +E EK +R IF++N ++ N G + L +NAFADLT++EF AS
Sbjct: 38 EQWMAQYGRVYKNEVEKTKRYNIFKENVEYIESFNKAGTKPYKLGINAFADLTNKEFIAS 97
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
G+ + H+ N + N+ VP ++DWRKKGAVT VKDQ CG CWAFSA A
Sbjct: 98 RNGYI---LPHECSSNTPFRYE-NVSAVPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAA 153
Query: 150 IEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
+EGI K+ TG+L+SLSEQEL+DCD + + GC GGLMD A+ F+I N G+ TE +YPY+G
Sbjct: 154 MEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFTFIINNKGLTTESNYPYQG 213
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
G C K K + I GY+DVP N+E L +AV QPVSV I FQ YSSG+FT
Sbjct: 214 TDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSDFQFYSSGVFT 273
Query: 269 GPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
G C T LDH V VGY +E+G YW++KNSWG SWG GY+ MQ++ G+CGI M
Sbjct: 274 GECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQ 333
Query: 328 ASYPT 332
+SYP+
Sbjct: 334 SSYPS 338
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 305 bits (781), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 152/314 (48%), Positives = 200/314 (63%), Gaps = 7/314 (2%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SIL + L + LFE+ +H K Y S EK R +IF DN + + N
Sbjct: 29 FSILGYAPEDLTSIHKVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDE-TNKKV 87
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS--PGNLRDVPASIDWRKK 126
S++ L LN FADLTH+EFK FLGF + R++ S++ + D+P S+DWRKK
Sbjct: 88 SNYWLGLNEFADLTHEEFKNKFLGFKGELAE---RKDESIEQFRYRDFVDLPKSVDWRKK 144
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
GAV+ VK+Q CG+CWAFS A+EGIN+IVTG+L LSEQELIDCD ++N+GC GGLMD
Sbjct: 145 GAVSPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMD 204
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
YA+ +V +N G+ E++YPY G C++++ VTI GY DVP NNE L+A+ Q
Sbjct: 205 YAFAYVTRN-GLHKEEEYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQ 263
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMN 306
P+SV I S R FQ YS G+F G C T LDH V VGY + G+DY I++NSWG WG
Sbjct: 264 PISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTSKGLDYVIVRNSWGPKWGEK 323
Query: 307 GYMHMQRNTGNSLG 320
GY+ M+RNTG +G
Sbjct: 324 GYIRMKRNTGKPMG 337
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 305 bits (780), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 159/351 (45%), Positives = 210/351 (59%), Gaps = 23/351 (6%)
Query: 6 FFLLSILLLSSLPLNYCSDINE-----------LFETWCKQHGKAYSSEQEKQQRLKIFE 54
FF++ I LS L + D +E L+E W H + +S E +R +F
Sbjct: 4 FFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVSRAS-HEAIKRFNVFR 62
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG-- 112
N V N N + L +N FAD+TH EF++S+ G +++ H R + G
Sbjct: 63 HNVLHV-HRTNKKNKPYKLKINRFADITHHEFRSSYAG---SNVKHHRMLRGPKRGSGGF 118
Query: 113 ---NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
N+ VP+S+DWR+KGAVTEVK+Q CG+CWAFS A+EGINKI T LVSLSEQEL
Sbjct: 119 MYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQEL 178
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQ-CNKQKLNRHIVTIDGY 228
+DCD N GC GGLM+ A++F+ N GI TE+ YPY Q C + VTIDG+
Sbjct: 179 VDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGH 238
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSE 287
+ VPEN+E++LL+AV QPVSV I FQLYS G+F G C T L+H V+IVGY +++
Sbjct: 239 EHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETK 298
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
NG YWI++NSWG WG GY+ ++R + G CGI M ASYPTK P
Sbjct: 299 NGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKLSSTP 349
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 145/298 (48%), Positives = 195/298 (65%), Gaps = 10/298 (3%)
Query: 42 SEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHD 101
++ + +R +F++N ++ + N + F L+LN FAD+T E + S+ G + + H
Sbjct: 61 ADHDPARRFNVFKENVKYIHEANKK-DRPFRLALNKFADMTTDELRHSYAG---SRVRHH 116
Query: 102 RRRNASVQSPGNL-----RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKI 156
R + ++ GN ++P ++DWR+KGAVT +KDQ CG+CWAFS A+E INKI
Sbjct: 117 RALSGGRRAQGNFTYSDAENLPPAVDWREKGAVTGIKDQGQCGSCWAFSTIAAVESINKI 176
Query: 157 VTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQ 216
TG LVSLSEQEL+DCD + GC GGLMDYA+QF+ KN G+ +E +YPY+GQ C++
Sbjct: 177 RTGKLVSLSEQELMDCDNVNDQGCDGGLMDYAFQFIQKNGGVTSEANYPYQGQQNTCDQA 236
Query: 217 KLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLD 276
K N H V IDGY+DVP N+E L +AV QPVSV I S + FQ YS G+FTG C+T LD
Sbjct: 237 KENTHDVAIDGYEDVPANDESALQKAVAYQPVSVAIEASGQDFQFYSEGVFTGQCTTDLD 296
Query: 277 HAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
H V VGY + +G YWI+KNSWG WG GY+ MQR + G+CGI M ASYP K
Sbjct: 297 HGVAAVGYGTARDGTKYWIVKNSWGLDWGEKGYIRMQRGVSQAEGLCGIAMQASYPIK 354
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 156/332 (46%), Positives = 204/332 (61%), Gaps = 8/332 (2%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L + ++ + + + E E W ++G+ Y EK++R KIF+DN A +
Sbjct: 13 ALLFILAAWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIES 72
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
N + ++ LS+N FADLT++EF++ F A A+ N+ VP++ID
Sbjct: 73 FNKAMDKTYKLSINEFADLTNEEFRSLRNRFKAHICSE-----ATTFKYENVTAVPSTID 127
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCG 181
WRKKGAVT +KDQ CG CWAFSA A EGI +I TG L+SLSEQEL+DCD N GC
Sbjct: 128 WRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCS 187
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMD A++F IK HG+ +E YPY G G CN +K I GY+DVP NNEK L +
Sbjct: 188 GGLMDDAFRF-IKIHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQK 246
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWG 300
AV QPV+V I FQ Y+SG+FTG C T LDH V VGY ++G+ YW++KNSWG
Sbjct: 247 AVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMYWLVKNSWG 306
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 307 TGWGEEGYIRMQRDVTAKEGLCGIAMQASYPT 338
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 167/364 (45%), Positives = 215/364 (59%), Gaps = 29/364 (7%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDIN-------------ELFETWCKQHGKAYSSEQEKQ 47
M LA LL + L++ + C I +L+E W + H + EK
Sbjct: 1 MAQLAKTLLLVALVAMSAVELCRAIEFDERDLASDEALWDLYERW-QTHHHVHRHHGEKG 59
Query: 48 QRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNAS 107
+R F++N F+ HN G+ + LSLN F D+ +EF+++F A S +D RR S
Sbjct: 60 RRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTF----ADSRINDLRRAES 115
Query: 108 VQSPG-------NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGS 160
+P + D+P S+DWRK+GAVT VKDQ CG+CWAFS ++EGIN I TGS
Sbjct: 116 PAAPAVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGS 175
Query: 161 LVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
LVSLSEQELIDCD N GC GGLM+ A++F+ G+ TE YPYR G C+ + R
Sbjct: 176 LVSLSEQELIDCDTDEN-GCQGGLMENAFEFIKSYGGVTTESAYPYRASNGTCDSVRSRR 234
Query: 221 -HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAV 279
IV+IDG++ VP +E L +AV QPVSV I +AFQ YS G+FTG C T LDH V
Sbjct: 235 GQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGV 294
Query: 280 LIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
VGY S++G YWI+KNSWG SWG GY+ MQR GN G+CGI M AS+P KT NP
Sbjct: 295 AAVGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRGAGNG-GLCGIAMEASFPIKTSPNP 353
Query: 339 PPSP 342
P
Sbjct: 354 ARKP 357
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 153/324 (47%), Positives = 201/324 (62%), Gaps = 14/324 (4%)
Query: 27 ELFETWCKQ----HGKAYSSEQE-KQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADL 81
E F+ W +AY+S E ++R I+ DN F ++N ++S LS+ +ADL
Sbjct: 44 EAFDFWVHTVKPPSNRAYASSAEVYERRFNIWLDNLRFAHEYNAR-HTSHWLSMGVYADL 102
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+ E+++ LG++A R A G + P +DW GAVT VKDQ CG+C
Sbjct: 103 SQDEYRSKALGYNAHLHKKRPLRAAPFLYKGTVP--PEEVDWVAGGAVTPVKDQLLCGSC 160
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS TGA+EG N I TG LVSLSEQ L+DCDR Y++GC GG MD A+ F++ N GIDTE
Sbjct: 161 WAFSTTGAVEGANAIATGKLVSLSEQMLVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTE 220
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DYPYR + G C + RH+VTIDGY+DVP N+E L++AV QPVSV I + AFQL
Sbjct: 221 DDYPYRAEDGICQDNRTRRHVVTIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQL 280
Query: 262 YSSGIFTGPCSTSLDHAVLIVGY----DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
Y G+F C T+LDHAVL+VGY + + + YW++KNSWG WG GY+ + RN G
Sbjct: 281 YGGGVFDAECGTALDHAVLVVGYGTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGK 340
Query: 318 SL--GICGINMLASYPTKTGQNPP 339
G CG+ M AS+P K G NPP
Sbjct: 341 DAPEGQCGLAMYASFPIKKGANPP 364
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 154/353 (43%), Positives = 211/353 (59%), Gaps = 17/353 (4%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINE-----LFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
+L L + + ++P N +E L+E W + H EK +R +F++N
Sbjct: 9 ALVVALAFVGVARTIPFNEKDLASEESLWGLYERW-RSHHTVSRDLSEKNKRFNVFKENA 67
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG----- 112
F+ + N ++ + L LN FAD+T+QEF++++ G + I H R + + ++ G
Sbjct: 68 KFIHEFNKK-DAPYKLGLNKFADMTNQEFRSTYAG---SKIHHHRTQRGTPRATGSFMYE 123
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
N+ +PAS+DWR +GAV VKDQ CG+CWAFS ++EGINKI T LV LS Q+L+DC
Sbjct: 124 NVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDC 183
Query: 173 DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVP 232
D N GC GGLMDYA++F+ N GI +E YPY + G C + + +VTIDGY+DVP
Sbjct: 184 DTDQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSCASES-SAPVVTIDGYEDVP 242
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVD 291
NNE L++AV Q VSV I S AFQ YS G+FTG C LDH V +VGY + +G
Sbjct: 243 ANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELDHGVAVVGYGATRDGTK 302
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPP 344
YWI++NSWG WG GY+ MQR G+CGI M SYP KT NP + P
Sbjct: 303 YWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGIAMEPSYPLKTSPNPKNNISP 355
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 304 bits (778), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 151/295 (51%), Positives = 196/295 (66%), Gaps = 11/295 (3%)
Query: 46 KQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDH----- 100
+++R +F++N +V + N + F L+LN FAD+T EF+ ++ G + + H
Sbjct: 60 EERRFNVFKENARYVHEGNKR-DRPFRLALNKFADMTTDEFRRTYAG---SRVRHHLSLS 115
Query: 101 DRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGS 160
RR + ++P ++DWR+KGAVT +KDQ CG+CWAFS A+EGINKI TG
Sbjct: 116 GGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGK 175
Query: 161 LVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
LVSLSEQEL+DCD N GC GGLMDYA+QF+ KN GI TE +YPY+G+ G C++ K N
Sbjct: 176 LVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN-GITTESNYPYQGEQGSCDQAKENA 234
Query: 221 HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVL 280
VTIDGY+DVP N+E L +AV QPVSV I S + FQ YS G+FTG CST LDH V
Sbjct: 235 QAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQDFQFYSEGVFTGECSTDLDHGVA 294
Query: 281 IVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
VGY + +G YWI+KNSWG WG GY+ MQR + G+CGI M ASYPTK+
Sbjct: 295 AVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQTEGLCGIAMQASYPTKS 349
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 303 bits (777), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 162/361 (44%), Positives = 212/361 (58%), Gaps = 23/361 (6%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINEL---------FETWCKQHGKAYSSEQEKQQRLKIFE 54
L F +LS L L + D EL +E W H +S E +R +F
Sbjct: 3 LFFIVLSFLCLLQASKGFDFDEKELETEENVWKLYERWRDHHSVTRAS-HEALKRFNVFR 61
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG-- 112
N V N N + L +N FAD+TH EF++S+ G +++ H R + G
Sbjct: 62 HNVLHV-HRTNKKNKPYKLKVNRFADITHHEFRSSYAG---SNVKHHRMLRGPKRGSGGF 117
Query: 113 ---NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
N+ VP+S+DWR+KGAVTEVK+Q CG+CWAFS A+EGINKI T LVSLSEQEL
Sbjct: 118 MYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQEL 177
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQ-CNKQKLNRHIVTIDGY 228
+DCD N GC GGLM+ A++F+ N GI TE+ YPY Q C + ++ VTIDG+
Sbjct: 178 VDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSIDGETVTIDGH 237
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSE 287
+ VPEN+E+ LL+AV QPVSV I FQLYS G+F G C T L+H V+IVGY +++
Sbjct: 238 EHVPENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETK 297
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPT 347
NG YWI++NSWG WG GY+ ++R + G CGI M ASYPTK + PS P
Sbjct: 298 NGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKV--SSTPSTPESVV 355
Query: 348 R 348
R
Sbjct: 356 R 356
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 158/331 (47%), Positives = 202/331 (61%), Gaps = 14/331 (4%)
Query: 7 FLLSILLLSSLPLN-YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
LLSI + N + + ++E E W K++GK Y EKQ+RL IF+DN F+ N
Sbjct: 15 LLLSICTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNA 74
Query: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP---GNLRDVPASID 122
GN + LS+N AD T++EF AS G+ + + + Q+P N+ VP ++D
Sbjct: 75 AGNRPYKLSINHLADQTNEEFVASHNGY--------KHKGSHSQTPFKYENVTGVPNAVD 126
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182
WR+ GAVT VKDQ CG+CWAFS A EGI +I T L+SLSEQEL+DCD S + GC G
Sbjct: 127 WRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCD-SVDHGCDG 185
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242
G M+ ++F+IKN GI +E +YPY G C+ K I GY+ VP N+E L +A
Sbjct: 186 GYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDALQKA 245
Query: 243 VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGR 301
V QPVSV I AFQ YSSG+FTG C T LDH V VGY S ++G YWI+KNSWG
Sbjct: 246 VANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGT 305
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR T G+CGI M ASYPT
Sbjct: 306 QWGEEGYIRMQRGTDAQEGLCGIAMDASYPT 336
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 152/295 (51%), Positives = 196/295 (66%), Gaps = 11/295 (3%)
Query: 46 KQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDH----- 100
+++R +F+ N +V + N + F L+LN FAD+T EF+ ++ G + + H
Sbjct: 60 EERRFNVFKQNARYVHEGNKR-DMPFRLALNKFADMTTDEFRRTYAG---SRVRHHLSLS 115
Query: 101 DRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGS 160
RR G+ ++P ++DWR+KGAVT +KDQ CG+CWAFS A+EGINKI TG
Sbjct: 116 GGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGK 175
Query: 161 LVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
LVSLSEQEL+DCD N GC GGLMDYA+QF+ KN GI TE +YPY+G+ G C++ K N
Sbjct: 176 LVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN-GITTESNYPYQGEQGSCDQAKENA 234
Query: 221 HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVL 280
VTIDGY+DVP N+E L +AV QPVSV I S + FQ YS G+FTG CST LDH V
Sbjct: 235 QAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQDFQFYSEGVFTGECSTDLDHGVA 294
Query: 281 IVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
VGY + +G YWI+KNSWG WG GY+ MQR + G+CGI M ASYPTK+
Sbjct: 295 AVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQTEGLCGIAMQASYPTKS 349
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 158/349 (45%), Positives = 213/349 (61%), Gaps = 20/349 (5%)
Query: 4 LAFFLLSI----LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
+A FL S+ + ++ L + L+E W + H EKQ+R +F++N +
Sbjct: 9 VASFLASVAATAIDIADKDLETEDSLWNLYERW-RSHHTVSRDLDEKQKRFNVFKENPRY 67
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDR-----RRNASVQS---- 110
+ N + + L LN FADLT+ EF++++ G + I+H R RR + S
Sbjct: 68 IHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAG---SRINHHRSLRGSRRGGATNSFMYQ 124
Query: 111 PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
+ R +PASIDWR+KGAVT VKDQ CG+CWAFS A+EGIN+I T L+SLSEQELI
Sbjct: 125 SLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLSLSEQELI 184
Query: 171 DCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
DCD N+GC GGLMDYA+ F+ KN GI +E +YPY + C +K H+V+IDG++D
Sbjct: 185 DCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYCATEK-KSHVVSIDGHED 243
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENG 289
VP N+E LL+AV QPVS+ I S FQ YS G+FTG T LDH V IVGY ++ G
Sbjct: 244 VPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVAIVGYGKTQQG 303
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
YWI++NSWG WG GY+ + +S +CG+ M ASYP KT NP
Sbjct: 304 TKYWIVRNSWGAEWGEKGYIRIS-AASDSKRLCGLAMEASYPIKTSPNP 351
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 196/308 (63%), Gaps = 9/308 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W QHG+ Y + EK R +IF N + + N N F L +N FADLT++EF
Sbjct: 39 ERHEQWMAQHGRVYKNAAEKAHRFEIFRANVERI-ESFNAENHKFKLGVNQFADLTNEEF 97
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K + ++ + + N+ VPA++DWR KGAVT +KDQ CG+CWAFSA
Sbjct: 98 K------TRNTLKPSKMASTKSFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSA 151
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A EGI K+ TG L+SLSEQE++DCD S + GC GG MD A++++IKN GI TE +YP
Sbjct: 152 VAATEGITKLSTGKLISLSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYP 211
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y+ G CN +K H +I GY+DV N+E LL+A QP++V I + AFQ+YSSG
Sbjct: 212 YKAADGTCNTKKAASHAASITGYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSG 271
Query: 266 IFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+FTG C T LDH V +VGY + +G YW++KNSWG SWG +GY+ M+R+ G+CGI
Sbjct: 272 VFTGDCGTDLDHGVTLVGYGATSDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGI 331
Query: 325 NMLASYPT 332
M ASYPT
Sbjct: 332 AMDASYPT 339
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 154/336 (45%), Positives = 217/336 (64%), Gaps = 19/336 (5%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
FF+L+ L +SL ++ S + E E W ++HGK Y EK+QR +IF++N F+ N
Sbjct: 16 FFILT--LWTSLVIS--SRLLEKHEQWMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNA 71
Query: 66 MGNSSFTLSLNAFADLTHQEFKASFL--------GFSAASIDHDRRRNASVQSPGNLRDV 117
G++ F LS+N F D T+ EFKA++L G A+I+ + SV N+ +V
Sbjct: 72 AGDNGFNLSINQFGDQTNDEFKANYLNGKKKPLIGVGIAAIEEE-----SVFRYENVTEV 126
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
PA++DWR++GAVT +K Q CG+CWAF+ AIEGI++I TG LVSLSEQEL+DC ++
Sbjct: 127 PATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNT 186
Query: 178 S-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GG ++ A F++K GI +E +YPY G+CN +K ++ I GY+ VP NNE
Sbjct: 187 TDGCNGGYVEDACDFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYEHVPANNE 246
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWII 295
K LL+AV QP++V I ++RAFQ YSSGI G C LDH V IVGY S++GV YW++
Sbjct: 247 KALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDDGVKYWLV 306
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSWG WG GY+ ++R+ G CGI M+ +YP
Sbjct: 307 KNSWGTKWGEKGYIKIKRDVHAKEGSCGIAMVPTYP 342
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 155/339 (45%), Positives = 216/339 (63%), Gaps = 15/339 (4%)
Query: 3 SLAFFLLSILLLSSL-PLNYCSD------INELFETWCKQHGKAYSSEQEKQQRLKIFED 55
+L+ +L++ +++S P + + + + +ETW K++G+ Y +E + R I++
Sbjct: 6 TLSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIYQS 65
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
N ++ +N+ N S+ L N FAD+T++EFK+++LG+ R R +
Sbjct: 66 NVQYIEFYNSQ-NYSYKLIDNRFADITNEEFKSTYLGYLP------RFRVQTEFRYHKHG 118
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-R 174
++P SIDWRKKGAVT VKDQ CG+CWAFSA A+EGINKI T +LVSLSEQ+LIDCD +
Sbjct: 119 ELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIK 178
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
S N GC GG M A+ ++ K+ GI T K+YPY+G+ G CNK K + VTI GY+ VP
Sbjct: 179 SGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESVPAR 238
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NEK L AV QPVS+ AFQ YS GIF+G C +L+H + IVGY ENG YWI
Sbjct: 239 NEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYGEENGDKYWI 298
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
+KNSW WG +GY+ M+R+T + G CGI M A+YP K
Sbjct: 299 VKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPVK 337
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 149/310 (48%), Positives = 203/310 (65%), Gaps = 9/310 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
SDI + ++ W ++G+ Y S +E ++R I++ N ++ N+M N S TL+ N FADLT
Sbjct: 13 SDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNFADLT 71
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++EFKA++LG+ SI R GN+ ++P ++DWR++GAVT +K+Q CG+CW
Sbjct: 72 NEEFKATYLGYKTVSIPDTCFR------YGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA A+EGINKI G L+SLSEQEL+DCD S N GC GG M A++F IK G+ TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRTGLTTE 184
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+G CN+QK V+I GY+ VP N+EK L AV QPVSV I FQ
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
YS GIF+G C L+H V IVGY + YW++KNSWG WG +GY+ M+R++ + G
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDKQGT 304
Query: 322 CGINMLASYP 331
CGI M+ASYP
Sbjct: 305 CGIAMMASYP 314
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 201/311 (64%), Gaps = 10/311 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W ++ K Y +E+++R KIF++N ++ NN N + L +N FADLT++EF
Sbjct: 37 ERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLGINQFADLTNEEF 96
Query: 87 KA---SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
A F G +SI R + + N+ +P+++DWR+KGAVT +KDQ CG CWA
Sbjct: 97 IAPRNRFKGHMCSSI----TRTTTFKYE-NVTALPSTVDWRQKGAVTPIKDQGQCGCCWA 151
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
FSA A EGI+ + +G L+SLSEQE++DCD + + GC GG MD A++F+I+NHG++TE
Sbjct: 152 FSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEA 211
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
+YPY+ G+CN + H TI GY+DVP NNEK L +AV QPVSV I S FQ Y
Sbjct: 212 NYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFY 271
Query: 263 SSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
+G+FTG C T LDH V VGY S +G YW++KNSWG WG GY+ MQR G+
Sbjct: 272 KTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRGVKAQEGL 331
Query: 322 CGINMLASYPT 332
CGI M+ASYPT
Sbjct: 332 CGIAMMASYPT 342
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 152/294 (51%), Positives = 195/294 (66%), Gaps = 11/294 (3%)
Query: 47 QQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDH-----D 101
++R +F+ N +V + N + F L+LN FAD+T EF+ ++ G + + H
Sbjct: 61 ERRFNVFKQNARYVHEGNKR-DMPFRLALNKFADMTTDEFRRTYAG---SRVRHHLSLSG 116
Query: 102 RRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSL 161
RR G+ ++P ++DWR+KGAVT +KDQ CG+CWAFS A+EGINKI TG L
Sbjct: 117 GRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKL 176
Query: 162 VSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRH 221
VSLSEQEL+DCD N GC GGLMDYA+QF+ KN GI TE +YPY+G+ G C++ K N
Sbjct: 177 VSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN-GITTESNYPYQGEQGSCDQAKENAQ 235
Query: 222 IVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLI 281
VTIDGY+DVP N+E L +AV QPVSV I S + FQ YS G+FTG CST LDH V
Sbjct: 236 AVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQDFQFYSEGVFTGECSTDLDHGVAA 295
Query: 282 VGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
VGY + +G YWI+KNSWG WG GY+ MQR + G+CGI M ASYPTK+
Sbjct: 296 VGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQTEGLCGIAMQASYPTKS 349
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 208/334 (62%), Gaps = 8/334 (2%)
Query: 3 SLAFFL---LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
SLA + L + ++S L S + E + W Q+ K Y+ QE ++R +IF++N +
Sbjct: 11 SLALLMCLGLWAVQVTSRTLQDAS-MYERHQQWMGQYAKIYNDHQEWEKRFQIFKENVNY 69
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
+ N G + L +N F DLT++EF A F R N N+ VP+
Sbjct: 70 IETSNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRTNTYKYE--NVTTVPS 127
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNS 178
++DWR+KGAVT VKDQ CG CWAFSA A EGI+++ TG L+SLSEQEL+DCD + +
Sbjct: 128 NVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTKGVDQ 187
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A++F+I+NHG+DTE YPY+G G CN + + + TI Y+DVP NNE+
Sbjct: 188 GCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYEDVPTNNEQA 247
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKN 297
L +AV QP+SV I S FQ Y+SG+FTG C T LDH V VGY S++G YW++KN
Sbjct: 248 LQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDDGTKYWLVKN 307
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
SWG SWG GY+ MQR G+CGI M ASYP
Sbjct: 308 SWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYP 341
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 201/311 (64%), Gaps = 10/311 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E W ++ K Y QE+++R +IF++N ++ N+ N S+ L +N FADLT++EF
Sbjct: 37 ERHAQWMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLDINQFADLTNEEF 96
Query: 87 KA---SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
A F G +SI R + + N+ +P+++DWR+KGAVT +KDQ CG CWA
Sbjct: 97 IAPRNRFKGHMCSSI----TRTTTFKYE-NVTVIPSTVDWRQKGAVTPIKDQGQCGCCWA 151
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
FSA A EGI+ + G L+SLSEQE++DCD + + GC GG MD A++F+I+NHG++TE
Sbjct: 152 FSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFIIQNHGLNTEP 211
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
+YPY+ G+CN + H TI GY+DVP NNEK L +AV QPVSV I S FQ Y
Sbjct: 212 NYPYKAADGKCNAKAAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFY 271
Query: 263 SSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
SG+FTG C T LDH V VGY S +G +YW++KNSWG WG GY+ MQR G+
Sbjct: 272 KSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEEGL 331
Query: 322 CGINMLASYPT 332
CGI M+ASYPT
Sbjct: 332 CGIAMMASYPT 342
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 152/323 (47%), Positives = 205/323 (63%), Gaps = 11/323 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W + H S EK QR +F++N + + N + + L LN FAD+T+ EF
Sbjct: 39 LYERW-RSHHTVSRSLTEKNQRFNVFKENLKHIHKVNQK-DRPYKLRLNKFADMTNHEFL 96
Query: 88 ASFLGFSAASIDHDRRRNASVQSPG----NLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
+ G + + H R + S + G N ++P+SIDWRK+GAVT VKDQ CG+CWA
Sbjct: 97 QHYGG---SKVSHYRMFHGSRRQTGFAHENTSNLPSSIDWRKQGAVTGVKDQGKCGSCWA 153
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS+ A+EGINKI TG L+SLSEQEL+DC+ S N GC GGLM+ A+ F+ K G+ TE +
Sbjct: 154 FSSVAAVEGINKIKTGELISLSEQELVDCN-SVNHGCDGGLMEQAFSFIEKTGGLTTENN 212
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPYR + G C+ K+N +VTIDGY+ VPEN+E L+QAV QPVS+ I + FQ YS
Sbjct: 213 YPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQDFQFYS 272
Query: 264 SGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
G++TG C T L+H V +VGY +++G YWI+KNSWG WG NG++ MQR G+C
Sbjct: 273 EGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQRENDVEEGLC 332
Query: 323 GINMLASYPTKTGQNPPPSPPPG 345
GI + ASYP K + P G
Sbjct: 333 GITLEASYPIKQRSDIKQPPSSG 355
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 180/420 (42%), Positives = 235/420 (55%), Gaps = 40/420 (9%)
Query: 13 LLSSLPLNYCSDIN--ELFETWCKQHGKAYSS-EQEKQQRLKIFEDNYAFVTQHNNMGNS 69
LLSS + + + F W QH + YS E +RL +F DN + + N N+
Sbjct: 22 LLSSADMLALAQVEPERAFGLWATQHARTYSEGSPEYTRRLGVFADNVRAIAEQNRR-NT 80
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR------------RNASVQSPGNLRDV 117
TL+LN +AD T +EF A LG + R R A VQ+P
Sbjct: 81 GITLALNEYADETWEEFAAKRLGLKISQEQLKAREARSSSSSSSSWRYAQVQTP------ 134
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
A++DWR K AVT+VK+Q CG+CWAFSA G+IEG N + TG LV+LSEQ+L+DCD + N
Sbjct: 135 -AAVDWRAKNAVTQVKNQGQCGSCWAFSAVGSIEGANALATGQLVALSEQQLVDCDTASN 193
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAG---QCNKQK-LNRHIVTIDGYKDVPE 233
GC GGLMD A+++V+ N GIDTE+DY Y G CNK+K +R V+IDGY+DVP
Sbjct: 194 MGCSGGLMDDAFKYVLDNGGIDTEEDYSYWSGYGFGFWCNKRKQTDRPAVSIDGYEDVP- 252
Query: 234 NNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDY 292
+E LL+AV QPV+V IC S Q YSSG+ C L+H VL VGYD S+ Y
Sbjct: 253 TSEPALLKAVAGQPVAVAICASAN-MQFYSSGVINS-CCEGLNHGVLAVGYDTSDKAQPY 310
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLL 352
WI+KNSWG SWG GY ++ G G+CGI ASY KT P PT C +
Sbjct: 311 WIVKNSWGGSWGEQGYFRLKMGEGPK-GLCGIASAASYAVKTSAVNKPV----PTMCDMF 365
Query: 353 --TYCAAGETCCCGSSILG-ICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTV 409
T C G TC C S+ G +CL CC + AV C D ++CCP+ C++ + C+
Sbjct: 366 GWTECGVGNTCSCSFSLFGWLCLWHDCCPLADAVSCPDLKHCCPAG-TTCNAAQGACIAA 424
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 158/335 (47%), Positives = 202/335 (60%), Gaps = 15/335 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
LA LL + S + Y + ++E E W K++GK Y EKQ+RL IF+DN F+
Sbjct: 11 LALVLLLSICTSQVMSRYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIE 70
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP---GNLRDVP 118
N GN + L +N AD T++EF AS G+ + + + Q+P N+ VP
Sbjct: 71 SFNAAGNKPYKLGINHLADQTNEEFVASHNGY--------KHKASHSQTPFKYENVTGVP 122
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
++DWR+ GAVT VKDQ CG+CWAFS A EGI +I T L+SLSEQEL+DCD S +
Sbjct: 123 NAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCD-SVDH 181
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG M+ ++F+IKN GI +E +YPY G C+ K I GY+ VP N+E
Sbjct: 182 GCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDA 241
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKN 297
L +AV QPVSV I AFQ YSSG+FTG C T LDH V VGY S ++G YWI+KN
Sbjct: 242 LQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVKN 301
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG WG GY+ MQR T G+CGI M ASYPT
Sbjct: 302 SWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPT 336
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 211/340 (62%), Gaps = 18/340 (5%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H + + EK +R F++N F+ HN G+ + L LN F D+ +EF
Sbjct: 40 DLYERW-QTHHRVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDMGREEF 98
Query: 87 KASFLGFSAASIDHDRRR-NASVQSPG----NLRDVPASIDWRKKGAVTEVKDQASCGAC 141
++ GF+ + I+ RR A+ PG + D+P S+DWR+KGAVT VK+Q CG+C
Sbjct: 99 RS---GFADSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRCGSC 155
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN I TGSLVSLSEQELIDCD N GC GGLM+ A++F+ + GI TE
Sbjct: 156 WAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTDEN-GCQGGLMENAFEFIKSHGGITTE 214
Query: 202 KDYPYRGQAGQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
YPY G C+ + R +V IDG++ VP +E L +AV QPVSV I +A Q
Sbjct: 215 SAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQALQ 274
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G+FTG C T LDH V VGY S++G YWI+KNSWG SWG GY+ MQR TGN
Sbjct: 275 FYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYIRMQRGTGNG- 333
Query: 320 GICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGE 359
G+CGI M AS+P KT NP P R +L+T A+ +
Sbjct: 334 GLCGIAMEASFPIKTSPNPSRKP-----RRALITRDASSQ 368
>gi|110739710|dbj|BAF01762.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
Length = 300
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/282 (54%), Positives = 183/282 (64%), Gaps = 7/282 (2%)
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFS GA+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+IKN GIDTE
Sbjct: 1 AFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEA 60
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY+ G+C++ + N +VTID Y+DVPEN+E L +A+ QP+SV I RAFQLY
Sbjct: 61 DYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLY 120
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SSG+F G C T LDH V+ VGY +ENG YWI++NSWG WG +GY+ M RN G C
Sbjct: 121 SSGVFDGLCGTELDHGVVAVGYGTENGKGYWIVRNSWGNRWGESGYIKMARNIEAPTGKC 180
Query: 323 GINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKC 376
GI M ASYP K GQ PPSP PT C C TCCC C W C
Sbjct: 181 GIAMEASYPIKKGQNPPNPGPSPPSPIKPPTTCDKYFSCPESNTCCCLYKYGKYCFGWGC 240
Query: 377 CGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
C +A CC D+ CCP YP+CD R CL +S F+VK
Sbjct: 241 CPLEAATCCDDNSSCCPHEYPVCDVNRGTCL-MSKNSPFSVK 281
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 203/322 (63%), Gaps = 18/322 (5%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+ FE W +HG+AY+ EKQ+R +++ N V N+M N + L+ N FADLT++EF
Sbjct: 29 DRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNG-YKLADNKFADLTNEEF 87
Query: 87 KASFLGFSA-ASIDHDRRR-NASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACW 142
+A LGF +I +A + PG D +P S+DWRKKGAV EVK+Q CG+CW
Sbjct: 88 RAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGSCW 147
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFSA AIEGIN+I G LVSLSEQEL+DCD GCGGG M +A++FV+ NHG+ TE
Sbjct: 148 AFSAVAAIEGINQIKNGELVSLSEQELVDCDDE-AVGCGGGYMSWAFEFVVGNHGLTTEA 206
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
YPY G C KLN+ V I GY++V ++E L +A AQPVSV + G FQLY
Sbjct: 207 SYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQLY 266
Query: 263 SSGIFTGPCSTSLDHAVLIVGY-DSENGVD----------YWIIKNSWGRSWGMNGYMHM 311
SG++TGPC+ ++H V +VGY +SE D YWI+KNSWG WG GY+ M
Sbjct: 267 GSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYILM 326
Query: 312 QRNT-GNSLGICGINMLASYPT 332
QR+ G + G+CGI +L SYP
Sbjct: 327 QRDVAGLASGLCGIALLPSYPV 348
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 203/322 (63%), Gaps = 18/322 (5%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+ FE W +HG+AY+ EKQ+R +++ N V N+M N + L+ N FADLT++EF
Sbjct: 30 DRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNG-YKLADNKFADLTNEEF 88
Query: 87 KASFLGFSA-ASIDHDRRR-NASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACW 142
+A LGF +I +A + PG D +P S+DWRKKGAV EVK+Q CG+CW
Sbjct: 89 RAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGSCW 148
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFSA AIEGIN+I G LVSLSEQEL+DCD GCGGG M +A++FV+ NHG+ TE
Sbjct: 149 AFSAVAAIEGINQIKNGELVSLSEQELVDCDDE-AVGCGGGYMSWAFEFVVGNHGLTTEA 207
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
YPY G C KLN+ V I GY++V ++E L +A AQPVSV + G FQLY
Sbjct: 208 SYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQLY 267
Query: 263 SSGIFTGPCSTSLDHAVLIVGY-DSENGVD----------YWIIKNSWGRSWGMNGYMHM 311
SG++TGPC+ ++H V +VGY +SE D YWI+KNSWG WG GY+ M
Sbjct: 268 GSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYILM 327
Query: 312 QRNT-GNSLGICGINMLASYPT 332
QR+ G + G+CGI +L SYP
Sbjct: 328 QRDVAGLASGLCGIALLPSYPV 349
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 139/229 (60%), Positives = 171/229 (74%), Gaps = 3/229 (1%)
Query: 111 PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
PG + +P S+DWR+ GAV VKDQ SCG+CWAFS A+EGIN+IVTG L+SLSEQEL+
Sbjct: 2 PGEV--LPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELV 59
Query: 171 DCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
DCD Y+ GC GGLMDYA+ F+IKN G+DTEKDYPY G G+CN + +V+IDGY+D
Sbjct: 60 DCDTEYDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYED 119
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGV 290
VP +EK L +AV QPVSV + RA QLY SGIFTG C T+LDH ++ VGY +ENG
Sbjct: 120 VPPFDEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGTENGT 179
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSL-GICGINMLASYPTKTGQNP 338
DYWI++NSWG SWG NGY+ M+RN ++ G CGI M ASYP K G+NP
Sbjct: 180 DYWIVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPIKNGENP 228
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 196/322 (60%), Gaps = 11/322 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H + EKQ+R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRNLNEKQKRFNVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGAC 141
K ++ G + ++H R + + G N PAS+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 KTTYAG---SKVNHHRMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LV LSEQELIDCD N GC GGLM+YA++++ + G+ TE
Sbjct: 153 WAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY G C+ K N V+IDG++ VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SYYPYTANDGSCDATKENVPTVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C L+H V IVGY + +G +YWI++NSWG WG G + M+RN N G
Sbjct: 273 YSEGVFTGDCGKELNHGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSP 342
+CGI M ASYP K P P
Sbjct: 333 LCGIAMEASYPVKNSSKNPAGP 354
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 201/311 (64%), Gaps = 10/311 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W ++ K Y +E+++R KIF++N ++ NN + + L +N FADLT++EF
Sbjct: 37 ERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLGINQFADLTNEEF 96
Query: 87 KA---SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
A F G +SI R + + N+ +P+++DWR+KGAVT +KDQ CG CWA
Sbjct: 97 IAPRNKFKGHMCSSI----TRTTTFKYE-NVTALPSTVDWRQKGAVTPIKDQGQCGCCWA 151
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
FSA A EGI+ + +G L+SLSEQE++DCD + + GC GG MD A++F+I+NHG++TE
Sbjct: 152 FSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEA 211
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
+YPY+ G+CN + H TI GY+DVP NNEK L +AV QPVSV I S FQ Y
Sbjct: 212 NYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFY 271
Query: 263 SSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
+G+FTG C T LDH V VGY S +G YW++KNSWG WG GY+ MQR G+
Sbjct: 272 KTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRGVKAQEGL 331
Query: 322 CGINMLASYPT 332
CGI M+ASYPT
Sbjct: 332 CGIAMMASYPT 342
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 301 bits (771), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 195/322 (60%), Gaps = 11/322 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H + EKQ+R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRNLNEKQKRFNVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGAC 141
K ++ G ++H R + + G N PAS+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 KTTYAG---TKVNHHRMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LV LSEQELIDCD N GC GGLM+YA++++ + G+ TE
Sbjct: 153 WAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY G C+ K N V+IDG++ VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SYYPYTANDGSCDATKENVPTVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C L+H V IVGY + +G +YWI++NSWG WG G + M+RN N G
Sbjct: 273 YSEGVFTGDCGKELNHGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSP 342
+CGI M ASYP K P P
Sbjct: 333 LCGIAMEASYPVKNSSKNPAGP 354
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 301 bits (770), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 146/335 (43%), Positives = 208/335 (62%), Gaps = 11/335 (3%)
Query: 7 FLLSILL----LSSLPLNYCSDINELF---ETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
FL++IL +S+L +D + E W ++G+ Y+ EK QRL++F+ N AF
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
+ + N GN F+L N FAD+T EF+A+ G+ + R + +L +PA
Sbjct: 142 I-ELVNAGNDKFSLEANQFADMTVDEFRAAHTGYKPVPANKGRTTQFKYANV-SLDALPA 199
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNS 178
S+DWR KGAVT +KDQ CG CWAFS ++EGI K+ TG L+SLSEQEL+DCD +
Sbjct: 200 SMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMDQ 259
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A++F+I N G+ TE +YPY G CN K + + +I GY+DVP N+E
Sbjct: 260 GCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSNDETS 319
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKN 297
LL+AV AQPVS+ + G + F+ Y G+ +G C T LDH + VGY + +G +W++KN
Sbjct: 320 LLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKFWLMKN 379
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG SWG G++ M+R+ + G+CG+ M SYPT
Sbjct: 380 SWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPT 414
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 301 bits (770), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 155/298 (52%), Positives = 196/298 (65%), Gaps = 17/298 (5%)
Query: 44 QEKQQRLKIFEDNYAFVTQHNN-MGNSSFTLSLNAFADLTHQEFKAS---FLGFSAASID 99
QE+++RL+IF N ++ N+ + N + LS+N FADLT++EF AS F G +SI
Sbjct: 2 QEREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSII 61
Query: 100 HD---RRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKI 156
+ NAS +P+++DWRKKGAVT VK+Q CG+CWAFSA A EGI+++
Sbjct: 62 RTTTFKYENASA--------IPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQL 113
Query: 157 VTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNK 215
TG LVSLSEQELIDCD + + GC GGLMD A++F+I+NHG+ TE YPY G G CN
Sbjct: 114 STGKLVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNA 173
Query: 216 QKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSL 275
K + H VTI GY+DVP NNE L +AV QP+SV I S FQ Y+SG+FTG C T L
Sbjct: 174 NKASIHAVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTEL 233
Query: 276 DHAVLIVGYDSEN-GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
DH V VGY N G YW++KNSWG WG GY+ MQR + G+CGI M ASYPT
Sbjct: 234 DHGVTAVGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPT 291
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 301 bits (770), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 153/333 (45%), Positives = 212/333 (63%), Gaps = 12/333 (3%)
Query: 7 FLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM 66
F SI L S PL+ + + W +HG+ Y+ +E+ R +F++N + N++
Sbjct: 18 FCFSITL--SRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSI 75
Query: 67 -GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV-----PAS 120
+F L++N FADLT+ EF++ + GF S + + SP ++V P S
Sbjct: 76 PAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTK--MSPFRYQNVSSGALPVS 133
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
+DWRKKGAVT +K+Q SCG CWAFSA AIEG +I G L+SLSEQ+L+DCD + + GC
Sbjct: 134 VDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGC 192
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGLMD A++ + G+ TE +YPY+G+ CN +K N +I GY+DVP N+E+ L+
Sbjct: 193 EGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALM 252
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSW 299
+AV QPVSVGI G FQ YSSG+FTG C+T LDHAV +GY +S NG YWIIKNSW
Sbjct: 253 KAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSW 312
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G WG +GYM +Q++ + G+CG+ M ASYPT
Sbjct: 313 GTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+ + E E W ++GK Y EK +R +IF+DN F+ N GN + L +N ADLT
Sbjct: 32 TSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNHLADLT 91
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
+EFKAS GF + + N+ +PA+IDWR KGAVT +KDQ CG+CW
Sbjct: 92 VEEFKASRNGFK-----RPHEFSTTTFKYENVTAIPAAIDWRTKGAVTPIKDQGQCGSCW 146
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFS A EGI++I TG LVSLSEQEL+DCD + + GC GG M+ ++F+IKN GI +E
Sbjct: 147 AFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSE 206
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+ G+CNK + I GY+ VP N+E L +AV QPVSV I F
Sbjct: 207 TNYPYKAVDGKCNK--ATSPVAQIKGYEKVPPNSETALQKAVANQPVSVSIDADGAGFMF 264
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
YSSGI+ G C T LDH V VGY + NG DYWI+KNSWG WG GY+ MQR G+
Sbjct: 265 YSSGIYNGECGTELDHGVTAVGYGTANGTDYWIVKNSWGTQWGEKGYVRMQRGIAAKHGL 324
Query: 322 CGINMLASYPT 332
CGI + +SYPT
Sbjct: 325 CGIALDSSYPT 335
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 300 bits (769), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 144/308 (46%), Positives = 198/308 (64%), Gaps = 7/308 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ + F+ W K+HG+ Y E++ R I++ N ++ Q N +S+ L+ N FADLT++
Sbjct: 42 MKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYI-QCKNAQKNSYNLTDNKFADLTNE 100
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EF+++++G S H+ D+P S DWRK+GAVTE+ DQ CG CWAF
Sbjct: 101 EFQSTYMGLSTRLRSHNTGFRYDEHG-----DLPESKDWRKEGAVTEIMDQGQCGGCWAF 155
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
+A A+EGINKI +G L+SLSEQELIDCD +S N GC GGLM+ AY F+I+N G+ TE+D
Sbjct: 156 AAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQD 215
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G G C +K + +I GY++VP +NE +L A QPVSV I +FQ YS
Sbjct: 216 YPYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYS 275
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
G+F+G C L+H V +VGY E YWI+KNSWG WG +GY+ M+R+T + G+CG
Sbjct: 276 EGVFSGICGKQLNHGVTVVGYGKETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCG 335
Query: 324 INMLASYP 331
I M ASYP
Sbjct: 336 IAMQASYP 343
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 300 bits (769), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 195/310 (62%), Gaps = 6/310 (1%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ + FE W K H K Y E R I++ N + N++ + F L+ N FAD+T+
Sbjct: 39 LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSL-HLPFKLTDNRFADMTNS 97
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EFKA FLG + +S+ +++ GN VP ++DWR +GAVT +++Q CG CWAF
Sbjct: 98 EFKAHFLGLNTSSLRLHKKQRPVCDPAGN---VPDAVDWRTQGAVTPIRNQGKCGGCWAF 154
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
SA AIEGINKI TG+LVSLSEQ+LIDCD +YN GC GGLM+ A++F+ N G+ TE D
Sbjct: 155 SAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGGLTTETD 214
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G G C+++K +VTI GY+ V +N E L A QPVSVGI FQLYS
Sbjct: 215 YPYTGIEGTCDQEKAKNKVVTIQGYQKVAQN-EASLQIAAAQQPVSVGIDAGGFIFQLYS 273
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
SG+FT C T+L+H V +VGY E YWI+KNSWG WG GY+ M+R G CG
Sbjct: 274 SGVFTSYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGISEDTGKCG 333
Query: 324 INMLASYPTK 333
I MLASYP +
Sbjct: 334 IAMLASYPLQ 343
>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
Length = 480
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 169/417 (40%), Positives = 224/417 (53%), Gaps = 50/417 (11%)
Query: 29 FETWCKQHGKAYSSE--QEKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQ 84
++ W ++G + E ++R +F DN FV HN + F L +N +HQ
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRLR-RSHQ 110
Query: 85 EFKASFL--------------------GFSAASIDHDRRRNASV--QSPGNLRDVPASID 122
L G AA + Q PG +R +
Sbjct: 111 RGVPRDLPRRQGRREEPRRRGEVPPRRGGGAAGVRRLEGEGRRRPRQEPGPMRSFSVHLS 170
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCG 181
+ G G+CWAFSA +E IN++VTG +++LSEQEL++C NSGC
Sbjct: 171 VKYFGQ----------GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCN 220
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMD A+ F+IKN GIDTE DYPY+ G+C+ + N +V+IDG++DVP+N+EK L +
Sbjct: 221 GGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQK 280
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
AV QPVSV I R FQLY SG+F+G C TSLDH V+ VGY ++NG DYWI++NSWG
Sbjct: 281 AVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGP 340
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------------C 349
WG +GY+ M+RN + G CGI M+ASYPTK+G NPP P PT C
Sbjct: 341 KWGESGYVRMERNINVTTGKCGIAMMASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVC 400
Query: 350 SLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
C AG TCCC +CL W CC A CC DH CCP +YP+C++ C
Sbjct: 401 DDNFSCPAGSTCCCAFGFRNLCLVWGCCPVEGATCCKDHASCCPPDYPVCNTRAGTC 457
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 146/305 (47%), Positives = 197/305 (64%), Gaps = 5/305 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W +HGK Y ++EK +R +IF+ N F+ N GN S+ L +N FADLT++EF+A
Sbjct: 40 EKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAF 99
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
+ G+ R + N+ +P+SIDWR KGAVT +KDQ CG+CWAFSA A
Sbjct: 100 WNGYKRP---LGASRKITPFKYENVTALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAA 156
Query: 150 IEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
EGI+K+ TG LVSLSEQEL+DCD + + GC GGLM A++F+ ++ G+ +E +YPY+G
Sbjct: 157 TEGIHKLRTGKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQG 216
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
+ G+C+ +K V I GY+ VP+N+E LL+AV QPVSV I +FQ Y SGIFT
Sbjct: 217 RDGKCDTKKEASRAVKITGYQAVPKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFT 276
Query: 269 GPCSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
G C ++H V VGY N G YWI+KNSWG WG GY+ M+R+ + G+CGI M
Sbjct: 277 GICGKDINHGVAAVGYGRSNSGSKYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAME 336
Query: 328 ASYPT 332
SYPT
Sbjct: 337 CSYPT 341
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 154/334 (46%), Positives = 209/334 (62%), Gaps = 14/334 (4%)
Query: 7 FLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM 66
F SI L S PL+ + + W +HG+ Y+ +EK R +F+ N + NN+
Sbjct: 18 FYFSISL--SRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNI 75
Query: 67 -GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ------SPGNLRDVPA 119
+F L++N FADLT+ EF++ + GF S + + + S G L P
Sbjct: 76 PAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSSGAL---PI 132
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179
S+DWR KGAVT +K+Q SCG CWAFSA AIEG +I G L+SLSEQ+L+DCD + + G
Sbjct: 133 SVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFG 191
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A++ ++ G+ TE +YPY+G+ CN +K N +I GY+DVP N+E+ L
Sbjct: 192 CEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQAL 251
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNS 298
++AV QPVSVGI G FQ YSSG+FTG C+T LDHAV +GY S NG YWIIKNS
Sbjct: 252 MKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGQSTNGSKYWIIKNS 311
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG WG +GYM +Q++ + G+CG+ M ASYPT
Sbjct: 312 WGTKWGESGYMRIQKDIKDKQGLCGLAMKASYPT 345
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 136/223 (60%), Positives = 170/223 (76%), Gaps = 1/223 (0%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P ++DWR+KGAV +K+Q +CG+CWAFS +EGINKIVTG L+SLSEQEL+DCD+SY
Sbjct: 4 LPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCDKSY 63
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMDYA+QF++KN G++TE+DYPYRG G+CN N +VTIDGY+DVP N+E
Sbjct: 64 NQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTNDE 123
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
L +AV QPVSV I R FQ Y SGIFTG C T +DHAV+ VGY SENGVDYWI++
Sbjct: 124 TALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGSENGVDYWIVR 183
Query: 297 NSWGRSWGMNGYMHMQRNTGNSL-GICGINMLASYPTKTGQNP 338
NSWG+ WG +GY+ ++RN +S G CGI + ASYP K NP
Sbjct: 184 NSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPVKYSPNP 226
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 154/333 (46%), Positives = 211/333 (63%), Gaps = 12/333 (3%)
Query: 7 FLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM 66
F SI L S PL+ + + W +HG+ Y+ +E+ R +F++N + N++
Sbjct: 18 FCFSITL--SRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSI 75
Query: 67 -GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV-----PAS 120
+F L++N FADLT+ EF + + GF S + + SP ++V P S
Sbjct: 76 PAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTK--MSPFRYQNVSSGALPVS 133
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
+DWRKKGAVT +K+Q SCG CWAFSA AIEG +I G L+SLSEQ+L+DCD + + GC
Sbjct: 134 VDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGC 192
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGLMD A++ + G+ TE DYPY+G+ CN +K N +I GY+DVP N+E+ L+
Sbjct: 193 EGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALM 252
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSW 299
+AV QPVSVGI G FQ YSSG+FTG C+T LDHAV +GY +S NG YWIIKNSW
Sbjct: 253 KAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSW 312
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G WG +GYM +Q++ + G+CG+ M ASYPT
Sbjct: 313 GTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 151/319 (47%), Positives = 201/319 (63%), Gaps = 12/319 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S EK R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRSLDEKHNRFNVFKGNVMHVHSSNKM-DKPYKLKLNRFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGAC 141
++ + G + ++H R + + G N+ VP+S+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 RSIYAG---SKVNHHRMFRGTPRGNGTFMYQNVDRVPSSVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LV LSEQEL+DCD + N GC GGLM+ A++F IK +GI T
Sbjct: 153 WAFSTIVAVEGINQIKTHKLVPLSEQELVDCDTTQNQGCNGGLMESAFEF-IKQYGITTA 211
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY + G C+ K+N V+IDG+++VP NNE LL+AV QPVSV I FQ
Sbjct: 212 SNYPYEAKDGTCDASKVNEPAVSIDGHENVPVNNEAALLKAVAHQPVSVAIEAGGIDFQF 271
Query: 262 YSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C T+LDH V IVGY +++G YW +KNSWG WG GY+ M+R+ G
Sbjct: 272 YSEGVFTGNCGTALDHGVAIVGYGTTQDGTKYWTVKNSWGSEWGEKGYIRMKRSISVKKG 331
Query: 321 ICGINMLASYPTKTGQNPP 339
+CGI M ASYP K + P
Sbjct: 332 LCGIAMEASYPIKKSSSKP 350
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 149/316 (47%), Positives = 199/316 (62%), Gaps = 14/316 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
LF +W +HGK Y+S EK +R +IF+ N + + N N S+ L LN FAD+ H+EFK
Sbjct: 43 LFRSWSVKHGKLYASPTEKLERYEIFKQNLMHIAE-TNRKNGSYWLGLNQFADVAHEEFK 101
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLR-------DVPASIDWRKKGAVTEVKDQASCGA 140
AS+LG A R ++P R +P S+DWR KGAVT VK+Q CG+
Sbjct: 102 ASYLGLKRAL---PRAGAPQTRTPTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGS 158
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS+ A+EGIN+IVTG LVSLSEQEL+DCD + + GC GG MD A+ +++ + GI
Sbjct: 159 CWAFSSVAAVEGINQIVTGKLVSLSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHA 218
Query: 201 EKDYPYRGQAGQCNKQK---LNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
E DYPY + G C +++ L + G++DVPEN+E LL+A+ QPVSVGI R
Sbjct: 219 EDDYPYLMEEGYCKEKQPCVLGITEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSR 278
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQ Y G+F G CS LDHA+ VGY S G +Y +KNSWG++WG GY+ ++ TG
Sbjct: 279 DFQFYRGGVFDGACSVELDHALTAVGYGSSYGQNYITMKNSWGKNWGEQGYVRIKMGTGK 338
Query: 318 SLGICGINMLASYPTK 333
G+CGI +ASYP K
Sbjct: 339 PEGVCGIYTMASYPVK 354
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 157/342 (45%), Positives = 214/342 (62%), Gaps = 21/342 (6%)
Query: 3 SLAFFLLSILLLSS--LPLNYCSDINELF---ETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
+L F +LS L L S L SD + E W +Q+G+ Y EK +R +IF+ N
Sbjct: 6 ALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANV 65
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHD---RRRNASVQSPG 112
AF+ + N GN F LS+N FADLT+ EF+A+ GF +++ R N S+ +
Sbjct: 66 AFI-ESFNAGNHKFWLSVNQFADLTNYEFRATKTNKGFIPSTVRVPTTFRYENVSIDT-- 122
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
+PA++DWR KGAVT +KDQ CG CWAFSA A+EGI K+ TG L+SLSEQEL+DC
Sbjct: 123 ----LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDC 178
Query: 173 D-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231
D + GC GGLMD A++F+IKN G+ TE YPY G+CN + TI GY+DV
Sbjct: 179 DVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGG--SNSAATIKGYEDV 236
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGV 290
P NNE L++AV QPVSV + G + FQ YS G+ TG C T LDH ++ +GY + +G
Sbjct: 237 PANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGT 296
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG +WG NG++ M+++ + G+CG+ M SYPT
Sbjct: 297 QYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPT 338
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 202/318 (63%), Gaps = 14/318 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W ++H EK +R F+DN ++ +HN + LN F D+ +EF
Sbjct: 44 DLYERW-QEHHHVPRHHGEKHRRFGAFKDNVRYIHEHNK--RAPGYAPLNRFGDMGREEF 100
Query: 87 KASFLGFSAASIDHDRRRNASVQSP------GNLRDVPASIDWRKKGAVTEVKDQASCGA 140
+A+F G A +D RR+ P +RD+P ++DWR+KGAVT VKDQ CG+
Sbjct: 101 RATFAGSHA----NDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGS 156
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS ++EGIN I TG LVSLSEQELIDCD + NSGC GGLM+ A++++ + GI T
Sbjct: 157 CWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGITT 216
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E YPYR G C+ + +V IDG+++VP N+E L +AV QPVSV I +++FQ
Sbjct: 217 ESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQ 276
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G+F G C T LDH V +VGY ++ +G +YWI+KNSWG +WG GY+ MQR++G
Sbjct: 277 FYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDSGYDG 336
Query: 320 GICGINMLASYPTKTGQN 337
G+CGI M ASYP K N
Sbjct: 337 GLCGIAMEASYPVKFSPN 354
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 155/336 (46%), Positives = 208/336 (61%), Gaps = 12/336 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
SLA LL S D ++E E W QHGK Y EK+ R KIF+ N +
Sbjct: 11 SLALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGI 70
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKA--SFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
NN GN S L +N FADLT +EFKA G+ + I R ++ + ++ VP
Sbjct: 71 EGFNNAGNKSHKLGVNQFADLTEEEFKAINKLKGYMWSKIS----RTSTFKYE-HVTKVP 125
Query: 119 ASIDWRKKGAVTEVKDQA-SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-Y 176
A++DWR+KGAVT +K Q CG+CWAF+A A EGI K+ TG L+SLSEQELIDCD +
Sbjct: 126 ATLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGD 185
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC G++ A++F+++N G+ TE YPY+ G CN + ++H+ +I GY+DVP NNE
Sbjct: 186 NGGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNE 245
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWII 295
LL AV QPVSV + S+ F+ YSSG+ +G C T+ DHAV +VGY S++G YW+I
Sbjct: 246 TALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWLI 305
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSWG WG GY+ ++R+ G+CGI M ASYP
Sbjct: 306 KNSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYP 341
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 161/376 (42%), Positives = 220/376 (58%), Gaps = 33/376 (8%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDIN-------------ELFETWCKQHGKAYSSEQEKQQR 49
S L++++ +SS + C I+ +L+E W + H + + EK +R
Sbjct: 5 SKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERW-QTHHRVHRHHGEKGRR 63
Query: 50 LKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ 109
F++N F+ HN G+ + L LN F D+ +EF+++F + + I+ RR+++
Sbjct: 64 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTF---ADSRINDLRRQDSPAA 120
Query: 110 SPGNL--------RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSL 161
G + D P S+DWR++GAVT VKDQ CG+CWAFS A+EGIN I TGSL
Sbjct: 121 RAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSL 180
Query: 162 VSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR- 220
SLSEQELIDCD N GC GGLM+ A++F+ GI TE YPYR G C+ + R
Sbjct: 181 ASLSEQELIDCDTDEN-GCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRG 239
Query: 221 --HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHA 278
+V IDG++ VP +E L +AV QPVSV + +AFQ YS G+FTG C T LDH
Sbjct: 240 GGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHG 299
Query: 279 VLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
V VGY ++G YWI+KNSWG SWG GY+ MQR GN G+CGI M AS+P KT +
Sbjct: 300 VAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNG-GLCGIAMEASFPIKT--S 356
Query: 338 PPPSPPPGPTRCSLLT 353
P P+ PP R +L+
Sbjct: 357 PNPADPPRKPRRALIA 372
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 161/372 (43%), Positives = 216/372 (58%), Gaps = 32/372 (8%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDIN-------------ELFETWCKQHGKAYSSEQEKQQR 49
S L++++ +SS + C I+ +L+E W + H + + EK +R
Sbjct: 49 SKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERW-QTHHRVHRHHGEKGRR 107
Query: 50 LKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ 109
F++N F+ HN G+ + L LN F D+ +EF+++F + + I+ RR+++
Sbjct: 108 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTF---ADSRINDLRRQDSPAA 164
Query: 110 SPGNL--------RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSL 161
G + D P S+DWR++GAVT VKDQ CG+CWAFS A+EGIN I TGSL
Sbjct: 165 RAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSL 224
Query: 162 VSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR- 220
SLSEQELIDCD N GC GGLM+ A++F+ GI TE YPYR G C+ + R
Sbjct: 225 ASLSEQELIDCDTDEN-GCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRG 283
Query: 221 --HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHA 278
+V IDG++ VP +E L +AV QPVSV + +AFQ YS G+FTG C T LDH
Sbjct: 284 GGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHG 343
Query: 279 VLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
V VGY ++G YWI+KNSWG SWG GY+ MQR GN G+CGI M AS+P KT N
Sbjct: 344 VAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNG-GLCGIAMEASFPIKTSPN 402
Query: 338 PPPSPPPGPTRC 349
P PP P R
Sbjct: 403 -PADPPRKPRRA 413
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 149/310 (48%), Positives = 195/310 (62%), Gaps = 6/310 (1%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ + FE W K H K Y E R I++ N + N++ + F L+ N FAD+T+
Sbjct: 39 LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSL-HLPFKLTDNRFADMTNS 97
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EFKA FLG + +S+ +++ GN VP ++DWR +GAVT +++Q CG CWAF
Sbjct: 98 EFKAHFLGLNTSSLRLHKKQRPVCDPAGN---VPDAVDWRTQGAVTPIRNQGKCGGCWAF 154
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
SA AIEGINKI TG+LVSLSEQ+LIDCD +YN GC GGLM+ A++F+ N G+ TE D
Sbjct: 155 SAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETD 214
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G G C+++K +VTI GY+ V +N E L A QPVSVGI FQLYS
Sbjct: 215 YPYTGIEGTCDQEKSKNKVVTIQGYQKVAQN-EASLQIAAAQQPVSVGIDAGGFIFQLYS 273
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
SG+FT C T+L+H V +VGY E YWI+KNSWG WG GY+ M+R G CG
Sbjct: 274 SGVFTNYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCG 333
Query: 324 INMLASYPTK 333
I M+ASYP +
Sbjct: 334 IAMMASYPLQ 343
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 205/318 (64%), Gaps = 14/318 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W ++H EK +R F+DN ++ +HN + LN F D+ +EF
Sbjct: 44 DLYERW-QEHHHVPRHHGEKHRRFGAFKDNVRYIHEHNK--RAPGYPPLNRFGDMGREEF 100
Query: 87 KASFLGFSAASIDHDRRRN--ASVQSPG----NLRDVPASIDWRKKGAVTEVKDQASCGA 140
+A+F G A +D RR+ A+ PG +RD+P ++DWR+KGAVT VKDQ CG+
Sbjct: 101 RATFAGSHA----NDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGS 156
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS ++EGIN I TG LVSLSEQELIDCD + NSGC GGLM+ A++++ + GI T
Sbjct: 157 CWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGITT 216
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E YPYR G C+ + +V IDG+++VP N+E L +AV QPVSV I +++FQ
Sbjct: 217 ESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQ 276
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G+F G C T LDH V +VGY ++ +G +YWI+KNSWG +WG GY+ MQR++G
Sbjct: 277 FYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDSGYDG 336
Query: 320 GICGINMLASYPTKTGQN 337
G+CGI M ASYP K N
Sbjct: 337 GLCGIAMEASYPVKFSPN 354
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 298 bits (764), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 153/332 (46%), Positives = 200/332 (60%), Gaps = 4/332 (1%)
Query: 4 LAFFL-LSILLLSSLPLN-YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
LA FL L++ + +P + + + E E W ++GK Y EK++R +IF+DN F+
Sbjct: 11 LALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIE 70
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
N GN + L +N ADLT +EFK S G + N+ D+P +I
Sbjct: 71 SFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIPEAI 130
Query: 122 DWRKKGAVTEVKDQAS-CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
DWR KGAVT +KDQ CG+CWAFS A EGI++I TG+LVSLSEQEL+DCD S + GC
Sbjct: 131 DWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQELVDCD-SVDDGC 189
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GG M+ ++F+IKN GI +E +YPY+G G CN + I GY+ VP +E+ L
Sbjct: 190 EGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIKGYEIVPSYSEEALQ 249
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWG 300
+AV QPVSV I + F YSSGI+ G C T LDH V VGY +ENG DYWI+KNSWG
Sbjct: 250 KAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGTENGTDYWIVKNSWG 309
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ M R GICGI + +SYPT
Sbjct: 310 TQWGEKGYIRMHRGIAAKHGICGIALDSSYPT 341
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 298 bits (764), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 200/317 (63%), Gaps = 15/317 (4%)
Query: 28 LFETWCKQHG---KAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
L+ETW H + +E E + R +F++N ++ + N + F L+LN FAD+T
Sbjct: 39 LYETWRSHHTVSRRGLGAEAEAR-RFNVFKENVRYIHEANKK-DRPFRLALNKFADMTTD 96
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSP------GNLRDVPASIDWRKKGAVTEVKDQASC 138
EF+ ++ G + + H R + + + ++PA++DWR+KGAVT +KDQ C
Sbjct: 97 EFRRTYAG---SRVRHHRSLSGGRRQGGGSFMYADAENLPAAVDWRQKGAVTPIKDQGQC 153
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFS A+EGINKI TG LVSLSEQEL+DC+ N GC GGLMD A+QF+ +N GI
Sbjct: 154 GSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQFIQQNGGI 213
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
TE YPY+G+ C++ K N H V+IDGY+DVP N+E L +AV QPVSV I S
Sbjct: 214 TTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAIDASGND 273
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQ YS G+FT T LDH V VGY + +G YWI+KNSWG WG GY+ MQR
Sbjct: 274 FQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVKQ 333
Query: 318 SLGICGINMLASYPTKT 334
+ G+CGI M ASYPTK+
Sbjct: 334 AEGLCGIAMEASYPTKS 350
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 195/308 (63%), Gaps = 5/308 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLTHQEFKA 88
E W +HG+AY+ + EK +RL++F DN AF+ N + F L N FADLT+ EF+A
Sbjct: 41 ERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFRA 100
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+ G +S +R + + + D+PAS+DWR KGAV VKDQ CG CWAFSA
Sbjct: 101 TRTGLRPSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFSAVA 160
Query: 149 AIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EG K+ TG LVSLSEQ+L+ CD + + GC GGLMD A+ F+IKN G+ E DYPY
Sbjct: 161 AMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDYPYT 220
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
+C TI GY+DVP N+E LL+AV QPVSV I G +R FQ Y G+
Sbjct: 221 ASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKGGVL 280
Query: 268 TGP--CSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+G C+T LDHA+ VGY + +G YW++KNSWG SWG +GY+ M+R + G+CG+
Sbjct: 281 SGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKEGVCGL 340
Query: 325 NMLASYPT 332
M+ASYPT
Sbjct: 341 AMMASYPT 348
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 298 bits (763), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 195/310 (62%), Gaps = 8/310 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W +++GK Y E ++R IFE+N F+ N GN + LS+N AD T++EF
Sbjct: 36 ERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNEEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSP---GNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
AS G+ + H + + Q+P N+ D+P ++DWR+KG T +KDQ CG CWA
Sbjct: 96 MASHKGYKGS---HWQGLRITTQTPFKYENVTDIPWAVDWRQKGDATSIKDQGQCGICWA 152
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FSA A EGI +I TG+LVSLSEQEL+DCD S + GC GGLM++ ++F+IKN GI +E +
Sbjct: 153 FSAVAATEGIYQITTGNLVSLSEQELVDCD-SVDHGCDGGLMEHGFEFIIKNGGISSEAN 211
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G C+ K I GY+ VP N E++L +AV QPVSV I AFQ YS
Sbjct: 212 YPYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQPVSVSIDAGGSAFQFYS 271
Query: 264 SGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SG+FTG C T LDH V VGY S ++G+ YWI+KNSWG WG GY+ M R G+C
Sbjct: 272 SGVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGEEGYIRMLRGIDAQEGLC 331
Query: 323 GINMLASYPT 332
GI M ASYPT
Sbjct: 332 GIAMDASYPT 341
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 298 bits (763), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 194/318 (61%), Gaps = 11/318 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W +H A +K +R +F+ N + + N + + L LN F D+T EF+
Sbjct: 155 LYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFR 212
Query: 88 ASFLGFSAAS---IDHDRRRNASVQSP---GNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+ G A DR+ +++ S + RDVPAS+DWR+KGAVT+VKDQ CG+C
Sbjct: 213 RHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQGQCGSC 272
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN I T +L SLSEQ+L+DCD N+GC GGLMDYA+Q++ K+ G+ E
Sbjct: 273 WAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAE 332
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPYR + C K +VTIDGY+DVP N+E L +AV QPVSV I S FQ
Sbjct: 333 DAYPYRARQASCKKSPAP--VVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQF 390
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+F+G C T LDH V VGY + +G YW++KNSWG WG GY+ M R+ G
Sbjct: 391 YSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEG 450
Query: 321 ICGINMLASYPTKTGQNP 338
CGI M ASYP KT NP
Sbjct: 451 HCGIAMEASYPVKTSPNP 468
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 205/317 (64%), Gaps = 9/317 (2%)
Query: 28 LFETWCKQHGKAYS-SEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
L++ W QH S E +R +IF++N + N + + L LN FADL+++EF
Sbjct: 44 LYDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNKK-DGPYKLGLNKFADLSNEEF 102
Query: 87 KASFLGFSAA---SIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
KA + S+ DR + N + +PASIDWRKKGAVT VK+Q CG+CWA
Sbjct: 103 KAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCWA 162
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS ++EGIN I TG LVSLSEQ+L+DC + N+GC GGLMD A+Q++I N GI TE +
Sbjct: 163 FSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAFQYIIDNGGIVTEDE 221
Query: 204 YPYRGQAGQCNKQKL-NRHIVT-IDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY +AG+C+ K+ ++ I T IDG++DVP NNE L +AV QPVS+ I S FQ
Sbjct: 222 YPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIEASGHDFQF 281
Query: 262 YSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS+G+FTG C T LDH V++VGY S G++YWI++NSWG WG GY+ MQR + G
Sbjct: 282 YSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIRMQRGIEATEG 341
Query: 321 ICGINMLASYPTKTGQN 337
CGI+M ASYPTK Q+
Sbjct: 342 KCGISMQASYPTKKTQD 358
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 298 bits (762), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 195/308 (63%), Gaps = 5/308 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLTHQEFKA 88
E W +HG+AY+ + EK +RL++F DN AF+ N + F L N FADLT+ EF+A
Sbjct: 6 ERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFRA 65
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+ G +S +R + + + D+PAS+DWR KGAV VKDQ CG CWAFSA
Sbjct: 66 TRTGLRPSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFSAVA 125
Query: 149 AIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EG K+ TG LVSLSEQ+L+ CD + + GC GGLMD A+ F+IKN G+ E DYPY
Sbjct: 126 AMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDYPYT 185
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
+C TI GY+DVP N+E LL+AV QPVSV I G +R FQ Y G+
Sbjct: 186 ASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKGGVL 245
Query: 268 TGP--CSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+G C+T LDHA+ VGY + +G YW++KNSWG SWG +GY+ M+R + G+CG+
Sbjct: 246 SGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKEGVCGL 305
Query: 325 NMLASYPT 332
M+ASYPT
Sbjct: 306 AMMASYPT 313
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 298 bits (762), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 156/342 (45%), Positives = 213/342 (62%), Gaps = 21/342 (6%)
Query: 3 SLAFFLLSILLLSS--LPLNYCSDINELF---ETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
+L F +LS L L S L SD + E W +Q+G+ Y EK +R +IF+ N
Sbjct: 6 ALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANV 65
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHD---RRRNASVQSPG 112
AF+ + N GN F L +N FADLT+ EF+A+ GF +++ R N S+ +
Sbjct: 66 AFI-ESFNAGNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPTTFRYENVSIDT-- 122
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
+PA++DWR KGAVT +KDQ CG CWAFSA A+EGI K+ TG L+SLSEQEL+DC
Sbjct: 123 ----LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDC 178
Query: 173 D-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231
D + GC GGLMD A++F+IKN G+ TE YPY G+CN + TI GY+DV
Sbjct: 179 DVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGG--SNSAATIKGYEDV 236
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGV 290
P NNE L++AV QPVSV + G + FQ YS G+ TG C T LDH ++ +GY + +G
Sbjct: 237 PANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGT 296
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG +WG NG++ M+++ + G+CG+ M SYPT
Sbjct: 297 QYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPT 338
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 297 bits (761), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 154/345 (44%), Positives = 215/345 (62%), Gaps = 20/345 (5%)
Query: 4 LAFFLLSILLLS---SLPLNYCSDINELF-----ETWCKQHGKAYSSEQEKQQRLKIFED 55
+ FL+ L+ S S+ L+ D NEL + W +HG+ Y+ +EK R +F+
Sbjct: 6 IQIFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKR 65
Query: 56 NYAFVTQHNNM-GNSSFTLSLNAFADLTHQEFKASFLGFSAASI--DHDRRRNASVQ--- 109
N + + NN+ +F L++N FADLT+ EF++ + G+ S+ + +S +
Sbjct: 66 NVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQN 125
Query: 110 -SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQE 168
S G L P S+DWRKKGAVT +K+Q +CG CWAFSA AIEG KI G L+SLSEQ+
Sbjct: 126 VSSGAL---PVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQ 182
Query: 169 LIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
L+DCD + + GC GGLMD A++ ++ G+ TE +YPY+G+ C + +I GY
Sbjct: 183 LVDCDTN-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNTKPTATSITGY 241
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSE 287
+DVP N+EK L++AV QPVS+GI G FQ Y SG+FTG C+T LDHAV VGY S
Sbjct: 242 EDVPVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYGQSS 301
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
NG YWIIKNSWG WG +GYM ++++ + G+CG+ M ASYPT
Sbjct: 302 NGSKYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYPT 346
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 297 bits (760), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 150/300 (50%), Positives = 191/300 (63%), Gaps = 8/300 (2%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFS 94
++G+ Y EK++R KIF+DN A + N + ++ LS+N FADLT++EF++ F
Sbjct: 3 RYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNRFK 62
Query: 95 AASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGIN 154
A A+ N+ VP++IDWRKKGAVT +KDQ CG CWAFSA A EGI
Sbjct: 63 AHICSE-----ATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGIT 117
Query: 155 KIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQC 213
+I TG L+SLSEQEL+DCD N GC GGLMD A++F IK HG+ +E YPY G G C
Sbjct: 118 QITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDDGTC 176
Query: 214 NKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCST 273
N +K I GY+DVP NNEK L +AV QPV+V I FQ Y+SG+FTG C T
Sbjct: 177 NSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGT 236
Query: 274 SLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
LDH V VGY ++G+ YW++KNSWG WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 237 ELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPT 296
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 297 bits (760), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 195/308 (63%), Gaps = 5/308 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLTHQEFKA 88
E W +HG+AY+ + EK +RL++F DN AF+ N + F L N FADLT+ EF+A
Sbjct: 6 ERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFRA 65
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+ G +S +R + + + D+PAS+DWR KGAV VKDQ CG CWAFSA
Sbjct: 66 TRTGLRPSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFSAVA 125
Query: 149 AIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EG K+ TG LVSLSEQ+L+ CD + + GC GGLMD A+ F+IKN G+ E DYPY
Sbjct: 126 AMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDYPYT 185
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
+C TI GY+DVP N+E LL+AV QPVSV I G +R FQ Y G+
Sbjct: 186 ASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKGGVL 245
Query: 268 TGP--CSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+G C+T LDHA+ VGY + +G YW++KNSWG SWG +GY+ M+R + G+CG+
Sbjct: 246 SGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKEGVCGL 305
Query: 325 NMLASYPT 332
M+ASYPT
Sbjct: 306 AMMASYPT 313
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 296 bits (759), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 151/327 (46%), Positives = 199/327 (60%), Gaps = 23/327 (7%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E FE W +HG+ Y+ EKQ+RL+++ N V N+MGN + L+ N FADLT++EF
Sbjct: 52 ERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNG-YRLADNKFADLTNEEF 110
Query: 87 KASFLGF----SAASIDHDRRRN------ASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
+A LGF S H + + + D+P S+DWR+KGAV VK Q
Sbjct: 111 RAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKSQG 170
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
CG+CWAFSA AIEGIN+I G LVSLSEQEL+DCD + GC GG M +A++FV+KN
Sbjct: 171 DCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWAFEFVMKNR 229
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
G+ TE++YPY+G G C KL V+I GY +V ++E LL+A AQPVSV +
Sbjct: 230 GLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVDAGS 289
Query: 257 RAFQLYSSGIFTGPCSTSLDHAVLIVGY-----DSEN------GVDYWIIKNSWGRSWGM 305
+QLY G+FTGPC+ L+H V +VGY D++ G YWI+KNSWG WG
Sbjct: 290 FVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWGD 349
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPT 332
GY+ MQR + G+CGI ML SYP
Sbjct: 350 AGYILMQREASVASGLCGIAMLPSYPV 376
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 296 bits (759), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 151/327 (46%), Positives = 199/327 (60%), Gaps = 23/327 (7%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E FE W +HG+ Y+ EKQ+RL+++ N V N+MGN + L+ N FADLT++EF
Sbjct: 31 ERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNG-YRLADNKFADLTNEEF 89
Query: 87 KASFLGF----SAASIDHDRRRN------ASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
+A LGF S H + + + D+P S+DWR+KGAV VK Q
Sbjct: 90 RAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKSQG 149
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
CG+CWAFSA AIEGIN+I G LVSLSEQEL+DCD + GC GG M +A++FV+KN
Sbjct: 150 DCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWAFEFVMKNR 208
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
G+ TE++YPY+G G C KL V+I GY +V ++E LL+A AQPVSV +
Sbjct: 209 GLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVDAGS 268
Query: 257 RAFQLYSSGIFTGPCSTSLDHAVLIVGY-----DSEN------GVDYWIIKNSWGRSWGM 305
+QLY G+FTGPC+ L+H V +VGY D++ G YWI+KNSWG WG
Sbjct: 269 FVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWGD 328
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPT 332
GY+ MQR + G+CGI ML SYP
Sbjct: 329 AGYILMQREASVASGLCGIAMLPSYPV 355
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 296 bits (759), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 199/304 (65%), Gaps = 4/304 (1%)
Query: 32 WCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLTHQEFKASF 90
W +HG+ Y+ EK R +F+ N + + N++ + +F L++N FADLT++EF++ +
Sbjct: 41 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 100
Query: 91 LGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
GF S+ R + S + D +P S+DWRKKGAVT +KDQ CG+CWAFSA A
Sbjct: 101 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 160
Query: 150 IEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQ 209
IEG+ +I G L+SLSEQEL+DCD + + GC GGLMD A+ + I G+ +E +YPY+
Sbjct: 161 IEGVAQIKKGKLISLSEQELVDCDTN-DGGCMGGLMDTAFNYTITIGGLTSESNYPYKST 219
Query: 210 AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTG 269
G CN K + +I G++DVP N+EK L++AV PVS+GI G + FQ YSSG+F+G
Sbjct: 220 NGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSG 279
Query: 270 PCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLA 328
C+T LDH V VGY S+NG+ YWI+KNSWG WG GYM ++++ G CG+ M A
Sbjct: 280 ECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGLAMNA 339
Query: 329 SYPT 332
SYPT
Sbjct: 340 SYPT 343
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 157/350 (44%), Positives = 213/350 (60%), Gaps = 31/350 (8%)
Query: 6 FFLLSILL----------LSSLPLNYC--------SDINELFETWCKQHGKAYSSEQEKQ 47
F++SILL +S++ Y ++ E++E W +H K YS E +
Sbjct: 4 LFIISILLFLASFSYAMDISTIEYKYDKSSAWRTDEEVKEIYELWLAKHDKVYSGLVEYE 63
Query: 48 QRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRR-NA 106
+R +IF+DN F+ +HN+ N ++ + L + DLT++EF+A +LG + +I +R N
Sbjct: 64 KRFEIFKDNLKFIDEHNSE-NHTYKMGLTPYTDLTNEEFQAIYLGTRSDTIHRLKRTINI 122
Query: 107 SVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLS 165
S + D +P IDWRKKGAVT VK+Q CG+CWAFS +E IN+I TG+L+SLS
Sbjct: 123 SERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLISLS 182
Query: 166 EQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTI 225
EQ+L+DC++ N GC GG YAYQ++I N GIDTE +YPY+ G C K +V I
Sbjct: 183 EQQLVDCNKK-NHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPCRAAK---KVVRI 238
Query: 226 DGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD 285
DGYK VP NE L +AV +QP V I S + FQ Y SGIF+GPC T L+H V+IVGY
Sbjct: 239 DGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVGYW 298
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTG 335
DYWI++NSWGR WG GY+ M+R G G+CGI L YPTK
Sbjct: 299 K----DYWIVRNSWGRYWGEQGYIRMKRVGG--CGLCGIARLPYYPTKAA 342
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 155/343 (45%), Positives = 213/343 (62%), Gaps = 21/343 (6%)
Query: 2 NSLAFFLLSILLLSS--LPLNYCSDINELF---ETWCKQHGKAYSSEQEKQQRLKIFEDN 56
+L F +LS L L S L SD + E W +Q+G+ Y EK +R +IF+ N
Sbjct: 5 KALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKAN 64
Query: 57 YAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHD---RRRNASVQSP 111
AF+ + N GN F L +N FADLT+ EF+A+ GF +++ R N S+ +
Sbjct: 65 VAFI-ESFNAGNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPTTFRYENVSIDT- 122
Query: 112 GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
+PA++DWR KGAVT +KDQ CG CWAFSA A+EGI K+ TG L+SLSEQEL+D
Sbjct: 123 -----LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVD 177
Query: 172 CD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
CD + GC GGLMD A++F+IKN G+ TE YPY G+CN + TI GY++
Sbjct: 178 CDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGG--SNSAATIKGYEE 235
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NG 289
VP NNE L++AV QPVSV + G + FQ YS G+ TG C T LDH ++ +GY + +G
Sbjct: 236 VPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDG 295
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG +WG NG++ M+++ + G+CG+ M SYPT
Sbjct: 296 TQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPT 338
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/344 (42%), Positives = 213/344 (61%), Gaps = 18/344 (5%)
Query: 7 FLLSILL---------LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
FLL+++L LS+ L + + E E W QHG+ Y EK +R + F +N
Sbjct: 7 FLLAVVLGCICLCSTVLSARELGDAAMV-ERHEQWMAQHGRVYKDGAEKARRFEAFRNNV 65
Query: 58 AFVTQHNNMGNS-SFTLSLNAFADLTHQEFKAS-----FLGFSAASIDHDRRRNASVQSP 111
F+ N GN F L +N F DLT+ EF+A+ F+ +AA+++ S
Sbjct: 66 VFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFRYSN 125
Query: 112 GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
+ +PA++DWR KGAVT +K+Q CG CWAFSA A EGI ++ TG LV LSEQEL+D
Sbjct: 126 VSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQELVD 185
Query: 172 CD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
CD + GC GG MD A++F+IKN G+ +E +YPY Q GQC + + TI GY+D
Sbjct: 186 CDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQDGQCKAKNTINSVATIKGYED 245
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENG 289
VP N+E L++AV AQPVSV + G + FQ Y+ G+ +G C TSLDH ++ VGY +++G
Sbjct: 246 VPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAADDG 305
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
+W++KNSWG +WG +GY+ M+++ ++ G+CG+ M SYPT+
Sbjct: 306 TKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYPTE 349
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 206/351 (58%), Gaps = 22/351 (6%)
Query: 6 FFLLSILLLSSLPLNYCSDINE-----------LFETWCKQHGKAYSSEQEKQQRLKIFE 54
F +L++ +L L D +E L+E W H A S E EK +R +F+
Sbjct: 4 FIVLALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLE-EKAKRFNVFK 62
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP--- 111
N + + N NS + L LN F D+T +EF+ ++ G ++I H R Q+
Sbjct: 63 HNVKHIHETNKKENS-YKLKLNKFGDMTSEEFRRTYAG---SNIKHHRMFQGERQTTKSF 118
Query: 112 --GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
N+ +P S+DWRK GAVT VK+Q CG+CWAFS A+EGIN+I T L SLSEQEL
Sbjct: 119 MYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQEL 178
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
+DCD + N GC GGLMD A++F+ + G+ +E YPY+ C+ K N +V+IDG++
Sbjct: 179 VDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHE 238
Query: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-N 288
DVP+N+E L++AV QPVSV I FQ YS G+FTG C T L+H V +VGY + +
Sbjct: 239 DVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTID 298
Query: 289 GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPP 339
G YWI+KNSWG WG GY+ MQR + G+CGI M ASYP K P
Sbjct: 299 GTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPLKNSNTNP 349
>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
C-169]
Length = 387
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 169/382 (44%), Positives = 221/382 (57%), Gaps = 44/382 (11%)
Query: 31 TWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT-----QHNNMGNSSFT------------- 72
T+ + K YS+E+E RL IF+ N ++T Q + + F+
Sbjct: 2 TFTRLFNKKYSNEEEAALRLNIFKTNVDYITSVNSAQQSYQASKHFSENTQQTALSSLFL 61
Query: 73 -----------LSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV-PA- 119
L LN FAD T +EF ++ LG +A D +S + DV PA
Sbjct: 62 SQLAHTDLLPQLGLNEFADQTWEEFSSTHLGLNAG---EDGSFRSSANTGFRHADVTPAN 118
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179
SI+W + GAVT VK+QA CG+CWAFS TG++EG N + TG LVSLSEQ+L+DCD + G
Sbjct: 119 SINWVEAGAVTPVKNQAFCGSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTKKDQG 178
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
CGGGLMDYA+ ++IKN G+DTE+DY Y G CNK + R +V+IDGY+DVP N+E L
Sbjct: 179 CGGGLMDYAFDYIIKNGGLDTEEDYSYWSVGGFCNKLREERTVVSIDGYEDVPVNDEVAL 238
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCS-TSLDHAVLIVGYD-SENGVDYWIIKN 297
+AV QPVSV IC SE A Q YSSG+ S L+H VL GYD E+G YW++KN
Sbjct: 239 AKAVSKQPVSVAICASE-AMQFYSSGVIAAKGSCIGLNHGVLAAGYDVDESGKPYWLVKN 297
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTY--C 355
SWG +WGM GYM +++++ G CGI M ASYP K+ P+P P C + C
Sbjct: 298 SWGGTWGMQGYMKLEKDSSVKEGACGIAMAASYPVKS----SPNPKHVPEVCGYFGWSEC 353
Query: 356 AAGETCCCGSSILGI-CLSWKC 376
G C C +LGI CL W C
Sbjct: 354 EYGSKCSCNFDLLGIFCLQWGC 375
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 149/320 (46%), Positives = 195/320 (60%), Gaps = 16/320 (5%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W +H A +K +R +F+ N + + N + + L LN F D+T EF+
Sbjct: 48 LYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFR 105
Query: 88 ASFLGFSAASIDHDR-----RRNASVQSP---GNLRDVPASIDWRKKGAVTEVKDQASCG 139
+ G + + H R R+ +S + + RDVPAS+DWR+KGAVT+VKDQ CG
Sbjct: 106 RHYAG---SRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQGQCG 162
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS A+EGIN I T +L SLSEQ+L+DCD N+GC GGLMDYA+Q++ K+ G+
Sbjct: 163 SCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVA 222
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
E YPYR + C K +VTIDGY+DVP N+E L +AV QPVSV I S F
Sbjct: 223 AEDAYPYRARQASCKKSPAP--VVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHF 280
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
Q YS G+F+G C T LDH V VGY + +G YW++KNSWG WG GY+ M R+
Sbjct: 281 QFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAK 340
Query: 319 LGICGINMLASYPTKTGQNP 338
G CGI M ASYP KT NP
Sbjct: 341 EGHCGIAMEASYPVKTSPNP 360
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 149/326 (45%), Positives = 199/326 (61%), Gaps = 11/326 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + + S +K +R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSYRTVSRSLGDKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H R + + G VP S DWRK GAVT VKDQ CG+C
Sbjct: 96 RSTYAG---SKVNHHRMFQGTPRGNGTFMYEKVGSVPPSADWRKNGAVTGVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LVSLSEQEL+DCD N+GC GGLM+ A++F+ + GI TE
Sbjct: 153 WAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY Q G C+ K N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAGGFDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y G+FTG CST L+H V IVGY + +G +YW ++NSWG WG GY+ MQR+ G
Sbjct: 273 YFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRSIFKKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGP 346
+CGI M+ASYP K N P P P
Sbjct: 333 LCGIAMMASYPIKNSSNNPTGPSSFP 358
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 160/376 (42%), Positives = 219/376 (58%), Gaps = 33/376 (8%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDIN-------------ELFETWCKQHGKAYSSEQEKQQR 49
S L++++ +SS + C I+ +L+E W + H + + EK +R
Sbjct: 5 SKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERW-QTHHRVHRHHGEKGRR 63
Query: 50 LKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ 109
F++N F+ HN G+ + L LN F D+ +EF+++F + + I+ RR+++
Sbjct: 64 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTF---ADSRINDLRRQDSPAA 120
Query: 110 SPGNL--------RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSL 161
G + D P S+DWR++GAVT VK Q CG+CWAFS A+EGIN I TGSL
Sbjct: 121 RAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSL 180
Query: 162 VSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR- 220
SLSEQELIDCD N GC GGLM+ A++F+ GI TE YPYR G C+ + R
Sbjct: 181 ASLSEQELIDCDTDEN-GCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRG 239
Query: 221 --HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHA 278
+V IDG++ VP +E L +AV QPVSV + +AFQ YS G+FTG C T LDH
Sbjct: 240 GGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHG 299
Query: 279 VLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
V VGY ++G YWI+KNSWG SWG GY+ MQR GN G+CGI M AS+P KT +
Sbjct: 300 VAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNG-GLCGIAMEASFPIKT--S 356
Query: 338 PPPSPPPGPTRCSLLT 353
P P+ PP R +L+
Sbjct: 357 PNPADPPRKPRRALIA 372
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 153/352 (43%), Positives = 211/352 (59%), Gaps = 20/352 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSD---------INELFETWCKQHGKAYSSEQEKQQRLK 51
M + LS++L+ L ++ D + +L+E W H + E EK +R
Sbjct: 3 MEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYHTVSRDLE-EKNKRFN 61
Query: 52 IFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP 111
+F++N V + N M + + L LN FAD+T+ EF++S+ G + + H R +
Sbjct: 62 VFKENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGG---SKVKHYRMLRGDRRGT 117
Query: 112 GNLRD-----VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSE 166
G +P S+DWRKKGAVT +KDQ CG+CWAFS +EGIN+I T L+SLSE
Sbjct: 118 GGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSE 177
Query: 167 QELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTID 226
Q+LIDCDRS + GC GGLM+ A++F+ KN GI TE +YPY+ + +C+ K+N +VTID
Sbjct: 178 QQLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTID 237
Query: 227 GYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDS 286
G++ VP N+E+ L++AV QPVSV I Q YS G+F G C T LDH V IVGY +
Sbjct: 238 GHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGT 297
Query: 287 E-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
+G YWI+KNSWG WG GY+ M R + G CGI M ASYP K+ N
Sbjct: 298 TLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPVKSSNN 349
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 153/352 (43%), Positives = 211/352 (59%), Gaps = 20/352 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSD---------INELFETWCKQHGKAYSSEQEKQQRLK 51
M + LS++L+ L ++ D + +L+E W H + E EK +R
Sbjct: 1 MEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYHTVSRDLE-EKNKRFN 59
Query: 52 IFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP 111
+F++N V + N M + + L LN FAD+T+ EF++S+ G + + H R +
Sbjct: 60 VFKENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGG---SKVKHYRMLRGDRRGT 115
Query: 112 GNLRD-----VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSE 166
G +P S+DWRKKGAVT +KDQ CG+CWAFS +EGIN+I T L+SLSE
Sbjct: 116 GGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSE 175
Query: 167 QELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTID 226
Q+LIDCDRS + GC GGLM+ A++F+ KN GI TE +YPY+ + +C+ K+N +VTID
Sbjct: 176 QQLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTID 235
Query: 227 GYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDS 286
G++ VP N+E+ L++AV QPVSV I Q YS G+F G C T LDH V IVGY +
Sbjct: 236 GHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGT 295
Query: 287 E-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
+G YWI+KNSWG WG GY+ M R + G CGI M ASYP K+ N
Sbjct: 296 TLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPVKSSNN 347
>gi|302831223|ref|XP_002947177.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
gi|300267584|gb|EFJ51767.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
Length = 514
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 182/421 (43%), Positives = 236/421 (56%), Gaps = 51/421 (12%)
Query: 29 FETWCKQHGKAYSSEQ-EKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
F W +Q+G+ Y + E +RL IF DN + Q ++ + TL+LN +ADLT +EF
Sbjct: 38 FTLWSRQYGRTYVEQSPEYTRRLSIFSDNVRAI-QESHEKDPGVTLALNEYADLTWEEFS 96
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVKDQASCGACW 142
++ LG DRR S R D P +IDWR+KGAV EVK+Q CG+CW
Sbjct: 97 STRLGLRIDQDQLDRRSRRSASRRNAWRYAAAVDNPKAIDWREKGAVAEVKNQGQCGSCW 156
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDR-----------------SY--------- 176
AFS TGAIEGIN IVTG L SLSEQ+L+DCD SY
Sbjct: 157 AFSTTGAIEGINAIVTGQLQSLSEQQLVDCDTGKRTVTRSKRSCTVILPSYSSNSCRNES 216
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAG---QCNKQK-LNRHIVTIDGYKDVP 232
N GC GGLMD A+++VI+N G+DTE+DY Y G CNK+K +R V+IDGY+DVP
Sbjct: 217 NMGCSGGLMDDAFKYVIQNGGLDTEQDYAYWSGYGLGFWCNKRKQTDRPAVSIDGYEDVP 276
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVD 291
+ E LL+AV QPV+V IC + Q YS G+ + C L+H VL VGY+ S++G
Sbjct: 277 Q-GEDNLLKAVAHQPVAVAICAGA-SMQFYSRGVIS-TCCEGLNHGVLTVGYNVSQDGEK 333
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSL 351
YWI+KNSWG WG GY ++ G + G+CGI ASYPTKT N P P C +
Sbjct: 334 YWIVKNSWGAGWGEQGYFRLKMGVGET-GLCGIASAASYPTKTSPNKPV-----PEICDI 387
Query: 352 L--TYCAAGETCCCGSSILG-ICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408
T C G +C C S G +CL CC + V C D ++CCPS CD + C++
Sbjct: 388 FGWTECPVGNSCSCSFSFFGFLCLWHDCCPLAGGVTCPDLKHCCPSGTN-CDQRQGVCVS 446
Query: 409 V 409
Sbjct: 447 A 447
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 156/346 (45%), Positives = 213/346 (61%), Gaps = 21/346 (6%)
Query: 1 MNSLAFFLLSILLLSSL-----PLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIF 53
M S FLL+IL +SL SD + E E W ++G+ Y EK +R ++F
Sbjct: 1 MVSSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVF 60
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS--FLGFSAASIDHD--RRRNASVQ 109
+DN AFV N N+ F L +N FADLT +EFKA+ F SA + + N SV
Sbjct: 61 KDNVAFVESFNTNKNNKFWLGINQFADLTIEEFKANKGFKPISAEKVPTTGFKYENLSVS 120
Query: 110 SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
+ +P ++DWR KGAVT +K+Q CG CWAFSA A+EGI K+ TG+L+SLSEQEL
Sbjct: 121 A------LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQEL 174
Query: 170 IDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
+DCD S + GC GG MD A++FVIKN G+ T YPY+ G+C ++ TI G+
Sbjct: 175 VDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKCKGG--SKSAATIKGH 232
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE- 287
+DVP N+E L++AV QPVSV + S+R F LYS G+ TG C T LDH + +GY E
Sbjct: 233 EDVPVNDEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVES 292
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
+G YWI+KNSWG +WG G++ M+++ + G+CG+ M SYPT+
Sbjct: 293 DGTKYWILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYPTE 338
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 154/348 (44%), Positives = 207/348 (59%), Gaps = 23/348 (6%)
Query: 1 MNSLAFFLLSIL--------LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKI 52
M +L +L+IL L++ LN S + E W Q+ + Y EK QR ++
Sbjct: 1 MATLKGSILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEV 60
Query: 53 FEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHD---RRRNAS 107
F+ N F+ N GN F L +N FADLT+ EF+A+ GF + + R N S
Sbjct: 61 FKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVPTGFRYENVS 120
Query: 108 VQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
V + +PASIDWR KGAVT +KDQ CG CWAFSA A EGI KI T L+SLSEQ
Sbjct: 121 VDA------LPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQ 174
Query: 168 ELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTID 226
EL+DCD + GC GGLMD A++F+IKN G+ TE YPY G+C + I
Sbjct: 175 ELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKC--KSGTNSAANIK 232
Query: 227 GYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-D 285
G++DVP N+E L++AV QPVSV + G + FQLYS G+ TG C T LDH + +GY
Sbjct: 233 GFEDVPANDEAALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQ 292
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
+ +G YW++KNSWG +WG NGY+ M+++ + G+CG+ M SYPT+
Sbjct: 293 TSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTE 340
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 155/338 (45%), Positives = 201/338 (59%), Gaps = 19/338 (5%)
Query: 3 SLAFFLL---SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
++A FLL I + S L+ S + E E W ++GK Y EK++R IF+ N F
Sbjct: 10 TIALFLLLALGIPQMMSRKLHETS-MRERHEQWMAEYGKVYKDAAEKEKRFLIFKHNVEF 68
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP---GNLRD 116
+ N N + L +N ADLT +EFKAS G +R +P N+
Sbjct: 69 IESFNAAANKPYKLGVNHLADLTVEEFKASRNGL--------KRPYELSTTPFKYENVTA 120
Query: 117 VPASIDWRKKGAVTEVKDQASC-GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-R 174
+PA+IDWR KGAVT +KDQ C G+CWAFS A EGI++I TG LVSLSEQEL+DCD +
Sbjct: 121 IPAAIDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDTK 180
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
+ GC GG M+ ++F+IKN GI +E +YPY+ G+CNK + I GY+ VP N
Sbjct: 181 GVDQGCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKCNKA--TSPVAQIKGYEKVPPN 238
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
+EK L +AV QPVSV I + F YSSGI+ G C T LDH V VGY NG DYW+
Sbjct: 239 SEKTLQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGYGIANGTDYWL 298
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+KNSWG WG GY+ MQR G+CGI + +SYPT
Sbjct: 299 VKNSWGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPT 336
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 143/285 (50%), Positives = 186/285 (65%), Gaps = 1/285 (0%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SI+ S L + ELFE W KAY + +EK R ++F+DN + + N G S
Sbjct: 32 SIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS 91
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAV 129
+ L LN FADL+H+EFK +LG + D R+ + + ++ VP S+DWRKKGAV
Sbjct: 92 -YWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAV 150
Query: 130 TEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 189
EVK+Q SCG+CWAFS A+EGINKIVTG+L +LSEQELIDCD +YN+GC GGLMDYA+
Sbjct: 151 AEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAF 210
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVS 249
++++KN G+ E+DYPY + G C QK VTI+G++DVP N+EK LL+A+ QP+S
Sbjct: 211 EYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLS 270
Query: 250 VGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
V I S R FQ YS G+F G C LDH V VGY S G DY I
Sbjct: 271 VAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYII 315
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 294 bits (753), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 152/332 (45%), Positives = 198/332 (59%), Gaps = 4/332 (1%)
Query: 4 LAFFL-LSILLLSSLPLN-YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
LA FL L++ + +P + + + E E W ++GK Y EK++R +IF+DN F+
Sbjct: 11 LALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIE 70
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
N GN + L +N ADLT +EFK S G + N+ D+P +I
Sbjct: 71 SFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIPEAI 130
Query: 122 DWRKKGAVTEVKDQAS-CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
DWR KGAVT +KDQ CG WAFS A EGI++I TG+LVSLSEQEL+DCD S + GC
Sbjct: 131 DWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQELVDCD-SVDDGC 189
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GG M+ ++F+IKN GI +E +YPY+G G CN + I GY+ VP +E+ L
Sbjct: 190 EGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIKGYEIVPSYSEEALK 249
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWG 300
+AV QPVSV I + F YSSGI+ G C T LDH V VGY +ENG DYWI+KNSWG
Sbjct: 250 KAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGTENGTDYWIVKNSWG 309
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ M R GICGI + +SYPT
Sbjct: 310 TQWGEKGYIRMHRGIAAKHGICGIALDSSYPT 341
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 294 bits (752), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 157/346 (45%), Positives = 203/346 (58%), Gaps = 15/346 (4%)
Query: 1 MNSLAFFLLSILL--LSSLPLNYCSDIN----ELFETWCKQHGKAYSSEQEKQQRLKIFE 54
M S F+L+I L +SL + S E E W + + YS E EK+ R IF+
Sbjct: 1 MASTIIFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFK 60
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASI-----DHDRRRNASVQ 109
N FV N ++ + +N F+DLT +EF+A+ G +N
Sbjct: 61 KNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPF 120
Query: 110 SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
GN+ D S+DWR++GAVT VK Q CG CWAFSA A+EGI KI G LVSLSEQ+L
Sbjct: 121 RYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQL 180
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR---HIVTID 226
+DCDR YN GC GG+M A++++IKN GI TE +YPY+ C+ TI
Sbjct: 181 LDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATIS 240
Query: 227 GYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD- 285
GY+ VP NNE+ LLQAV QPVSVGI G+ AF+ YS G+F G C T L HAV IVGY
Sbjct: 241 GYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGM 300
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
SE G YW++KNSWG +WG NGYM ++R+ G+CG+ +LA YP
Sbjct: 301 SEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYP 346
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 294 bits (752), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 203/338 (60%), Gaps = 17/338 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L+F L++ LN S + E W Q+ + Y EK +R ++F+ N F+
Sbjct: 12 LSFAFFCGAALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKFIESF 71
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHD----RRRNASVQSPGNLRDV 117
N GN F L +N FADLT+ EF+ + GF S+D R N SV + +
Sbjct: 72 NTGGNRKFWLGINQFADLTNDEFRTTKTNKGFKP-SLDKVSTGFRYENVSVDA------I 124
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
PA+IDWR GAVT +KDQ CG CWAFSA A EGI KI TG L+SLSEQEL+DCD
Sbjct: 125 PATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVHGE 184
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GGLMD A++F+IKN G+ TE +YPY G+C + + I GY+DVP N+E
Sbjct: 185 DQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKC--KSGSNSAANIKGYEDVPTNDE 242
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWII 295
L++AV QPVSV + G + FQ YS G+ TG C T LDH + +GY + +G YW++
Sbjct: 243 AALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLM 302
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
KNSWG +WG NGY+ M+++ + G+CG+ M SYPT+
Sbjct: 303 KNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYPTE 340
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 293 bits (751), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 153/350 (43%), Positives = 213/350 (60%), Gaps = 25/350 (7%)
Query: 1 MNSLAFFLLSILL-----------LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQR 49
++S AF LL +L L++ L+ + + E E W +G+ Y EK +R
Sbjct: 2 VSSRAFLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARR 61
Query: 50 LKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS--FLGFSAASIDHD--RRRN 105
++F+DN AFV N + F L +N FADLT +EFKA+ F SA + + N
Sbjct: 62 FEVFKDNLAFVESFNADKKNKFWLGVNQFADLTTEEFKANKGFKPISAEEVPTTGFKYEN 121
Query: 106 ASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLS 165
SV + +P ++DWR KGAVT +K+Q CG CWAFSA A+EGI K+ T +LVSLS
Sbjct: 122 LSVSA------LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLS 175
Query: 166 EQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVT 224
EQEL+DCD S + GC GG MD A++FVIKN G+ TE YPY+ G+C ++ T
Sbjct: 176 EQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGG--SKSAAT 233
Query: 225 IDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY 284
I G++DVP NNE L++AV +QPVSV + S+R F LYS G+ TG C T LDH + +GY
Sbjct: 234 IKGHEDVPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGY 293
Query: 285 DSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
E +G YWI+KNSWG +WG ++ M+++ + G+CG+ M SYPT+
Sbjct: 294 GVESDGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYPTE 343
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 293 bits (751), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 195/319 (61%), Gaps = 11/319 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
EL+E W H A S E EK +R +F+ N + N + S+ L LN F D+T +EF
Sbjct: 36 ELYERWRSHHTVARSLE-EKAKRFNVFKHNVKHI-HETNKKDKSYKLKLNKFGDMTSEEF 93
Query: 87 KASFLGFSAASIDHDRRRNASVQSP-----GNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+ ++ G ++I H R ++ N+ +P S+DWRK GAVT VK+Q CG+C
Sbjct: 94 RRTYAG---SNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSC 150
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T L SLSEQEL+DCD + N GC GGLMD A++F+ + G+ +E
Sbjct: 151 WAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSE 210
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY+ C+ K N +V+IDG++DVP+N+E L++AV QPVSV I FQ
Sbjct: 211 LVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C T L+H V +VGY + +G YWI+KNSWG WG GY+ MQR + G
Sbjct: 271 YSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEG 330
Query: 321 ICGINMLASYPTKTGQNPP 339
+CGI M ASYP K P
Sbjct: 331 LCGIAMEASYPLKNSNTNP 349
>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
Length = 294
Score = 293 bits (751), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 147/264 (55%), Positives = 183/264 (69%), Gaps = 7/264 (2%)
Query: 4 LAFFLLSILLLSSLPLNYC-SDINE-----LFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
L +L ++ S + Y D++E LF+ WC HGK Y+++Q + R ++F++N
Sbjct: 8 LKLVMLLLVFSSVTAITYNPRDLSENGLLSLFDRWCNHHGKTYTAKQ-RPLRFQVFKENL 66
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
++++HN+ GN +F L LNAF+DLT EF+ +G RR L ++
Sbjct: 67 FYISEHNSRGNHTFWLGLNAFSDLTSDEFRTQQMGLRGHPPSLKSRRREPKSGLLELYNI 126
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P+S+DWR K AVT VKDQ +CG CWAFSATGAIEGINKIVTGSLVSLSEQEL DCD SYN
Sbjct: 127 PSSLDWRDKDAVTGVKDQGACGDCWAFSATGAIEGINKIVTGSLVSLSEQELCDCDTSYN 186
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
SGC GGLMDYA+Q+VI N GIDTE DYPY+G CN +K+NR +VTID Y DVP NNE+
Sbjct: 187 SGCDGGLMDYAFQWVIVNGGIDTEVDYPYKGVQKACNSKKVNRRVVTIDDYIDVPANNER 246
Query: 238 QLLQAVVAQPVSVGICGSERAFQL 261
LLQAVV QPVSVGI G ERAFQL
Sbjct: 247 ALLQAVVGQPVSVGISGGERAFQL 270
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 202/318 (63%), Gaps = 5/318 (1%)
Query: 18 PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLN 76
PL+ + + + W +HG+ Y+ EK R +F+ N + + N + +F L++N
Sbjct: 27 PLDEVT-MQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVN 85
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQ 135
FADLT++EF++ + G+ S+ R + S + D +P S+DWRKKGAVT +KDQ
Sbjct: 86 QFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQ 145
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
SCG+CWAFSA AIEG+ +I G L+SLSEQEL+DCD + + GC GG M+ A+ + +
Sbjct: 146 GSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-DDGCMGGYMNSAFNYTMTT 204
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
G+ +E +YPY+ G CN K + +I G++DVP N+EK L++AV PVS+GI G
Sbjct: 205 GGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGG 264
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRN 314
FQ YSSG+F+G CST LDH V +VGY S NG YWI+KNSWG WG GYM ++++
Sbjct: 265 GTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKD 324
Query: 315 TGNSLGICGINMLASYPT 332
T G CG+ M ASYPT
Sbjct: 325 TKAKHGQCGLAMNASYPT 342
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 156/345 (45%), Positives = 203/345 (58%), Gaps = 14/345 (4%)
Query: 1 MNSLAFFLLSILLLSSLPLN------YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFE 54
M+S F+L+I L L + + E E W + + YS E EK+ R IF+
Sbjct: 1 MSSTIIFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFK 60
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSA-ASIDHDRRRNASVQSP-- 111
N FV N N ++ L +N F+DLT +EF+A+ G I ++ P
Sbjct: 61 KNLEFVQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVPFR 120
Query: 112 -GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
GN+ D S+DWR++GAVT VK Q CG CWAFSA A+EGI KI G LVSLSEQ+L+
Sbjct: 121 YGNVSDTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLL 180
Query: 171 DCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR---HIVTIDG 227
DCD YN GC GG+M A++++IKN GI TE +YPY+ C+ TI G
Sbjct: 181 DCDTDYNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISG 240
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-S 286
Y+ VP NNE+ LLQAV QPVSVGI G+ F+ YS GIF G C T L HAV IVGY S
Sbjct: 241 YETVPMNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGYGMS 300
Query: 287 ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
E G YW++KNSWG +WG +G+M ++R+ G+CG+ MLA YP
Sbjct: 301 EEGTKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCGLAMLAFYP 345
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 158/330 (47%), Positives = 203/330 (61%), Gaps = 21/330 (6%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLTHQE 85
+L+E W + H + + EK +R F++N F+ HN G+ S+ L LN F D+ +E
Sbjct: 44 DLYERW-QTHHRVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGDMGPEE 102
Query: 86 FKASFLGFSAASIDHDRRR-----NASVQSPG----NLRDVPASIDWRKKGAVTEVKDQA 136
F+++F A S +D RR A+ PG + DVP S+DWR+ GAVT VK+Q
Sbjct: 103 FRSTF----ADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKNQG 158
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
CG+CWAFS A+EGIN I TGSLVSLSEQEL+DCD + N GC GGLM+ A+ F+
Sbjct: 159 RCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTAEN-GCQGGLMENAFDFIKSYG 217
Query: 197 GIDTEKDYPYRGQAGQCN--KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
GI TE YPYR G C+ + + R V+IDG++ VP +E L +AV QPVSV I
Sbjct: 218 GITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVAIDA 277
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
+AFQ YS G+FTG C T LDH V +VGY +G YWI+KNSWG SWG GY+ MQ
Sbjct: 278 GGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSWGEGGYIRMQ 337
Query: 313 RNTGNSLGICGINMLASYPTKTGQNPPPSP 342
R GN G+CGI M AS+P KT NP P
Sbjct: 338 RGAGNG-GLCGIAMEASFPIKTSHNPARKP 366
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 293 bits (749), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 150/300 (50%), Positives = 194/300 (64%), Gaps = 16/300 (5%)
Query: 36 HGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADLTHQEFKASFLG 92
+ K+Y SE + +RL FE N F+ +HN G S+T+ +N FADLT EF A ++
Sbjct: 5 YSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALYV- 63
Query: 93 FSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEG 152
+ + N +V P D S+DWR KGAVT +K+Q CG+CW+FS TG+ EG
Sbjct: 64 --PSKFNRTMPYN-TVYLPATSED---SVDWRTKGAVTPIKNQGQCGSCWSFSTTGSTEG 117
Query: 153 INKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAG 211
+ I TG+LVSLSEQ+L+DC S+ N GC GGLMD A++++I N G+DTE+DYPY Q G
Sbjct: 118 AHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTAQDG 177
Query: 212 QCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPC 271
CNK+K +H TI Y DVP+NNE QL AV PVSV I + FQLY SG+F G C
Sbjct: 178 TCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVFDGNC 237
Query: 272 STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
T+LDH VL+VGY DYWI+KNSWG +WG+ GY++M+R S GICGI M SYP
Sbjct: 238 GTNLDHGVLVVGYTD----DYWIVKNSWGTTWGVEGYINMKRGVSAS-GICGIAMQPSYP 292
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 147/351 (41%), Positives = 211/351 (60%), Gaps = 18/351 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-------INELFETWCKQHGKAYSSEQEKQQRLKIFEDN 56
LA F + ++ + +Y + + +L+E W + H S EKQ+R +F++N
Sbjct: 8 LAVFSVVLVFRLADSFDYTEEDLASEERLRDLYERW-RSHHTVSRSLAEKQERFNVFKEN 66
Query: 57 YAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD 116
+ + N+ + + L LN+FAD+T+ EF + G + + H R Q G++ +
Sbjct: 67 LKHIHKVNHK-DRPYKLKLNSFADMTNHEFLQHYGG---SKVSHYRVLRGQRQGTGSMHE 122
Query: 117 ----VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
+P+S+DWRK GAVT +KDQ CG+CWAFS A+EGINKI TG L+SLSEQEL+DC
Sbjct: 123 DTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELISLSEQELVDC 182
Query: 173 DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVP 232
D S N GC GGLM+ A+ F+ + G+ +E YPYR + C+ K+N +V IDGY+ VP
Sbjct: 183 D-SDNHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSNKMNSPVVNIDGYEMVP 241
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVD 291
EN+E L++AV QPV++ + + Q YS IFTG C T L+H V +VGY +++G
Sbjct: 242 ENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVALVGYGTTQDGTK 301
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
YWI+KNSWG WG GY+ MQR G+CGI M ASYP K + +P
Sbjct: 302 YWIVKNSWGTDWGEKGYIRMQRGIDAEEGLCGITMEASYPVKLRSDNKKAP 352
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 148/337 (43%), Positives = 205/337 (60%), Gaps = 15/337 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L F L++ L+ S + E W Q+ + Y EK +R ++F+ N F+
Sbjct: 12 LGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKFIESF 71
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKA--SFLGFSAASIDHD---RRRNASVQSPGNLRDVP 118
N GN+ F L +N FADLT+ EF++ + GF ++++ R N SV + +P
Sbjct: 72 NAGGNNKFWLGVNQFADLTNDEFRSIKTNKGFKSSNMKIPTGFRYENVSVDA------LP 125
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYN 177
+IDWR KGAVT +KDQ CG CWAFSA A EGI KI TG LVSL+EQEL+DCD +
Sbjct: 126 TTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGED 185
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A++F+I N G+ TE YPY G+C + + TI GY+DVP N+E
Sbjct: 186 QGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKC--KSGSNSAATIKGYEDVPANDEA 243
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIK 296
L++AV QPVSV + G + FQ YSSG+ TG C T LDH + +GY + +G YW++K
Sbjct: 244 ALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMK 303
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG +WG NGY+ M+++ + G+CG+ M SYPT+
Sbjct: 304 NSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTE 340
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 189/315 (60%), Gaps = 8/315 (2%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W +H A +K +R +F++N + N + + L LN F D+T EF+
Sbjct: 46 LYERWRGRHAVA-RDLGDKARRFNVFKENVRLIHDFNQR-DEPYKLRLNRFGDMTADEFR 103
Query: 88 ASFLGFSAAS---IDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
+ G A DR+ +AS RD+P S+DWR+KGAVT+VKDQ CG+CWAF
Sbjct: 104 RHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQCGSCWAF 163
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
S A+EGIN I T +L SLSEQ+L+DCD N+GC GGLMDYA+Q++ K+ G+ E Y
Sbjct: 164 STIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHGGVAAEDAY 223
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PY+ + C K VTIDGY+DVP N+E L +AV QPVSV I S FQ YS
Sbjct: 224 PYKARQASCKKSPAP--AVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSE 281
Query: 265 GIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
G+F G C T LDH V VGY + +G YW++KNSWG WG GY+ M R+ G CG
Sbjct: 282 GVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVAAKEGHCG 341
Query: 324 INMLASYPTKTGQNP 338
I M ASYP KT NP
Sbjct: 342 IAMEASYPVKTSPNP 356
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 155/334 (46%), Positives = 201/334 (60%), Gaps = 6/334 (1%)
Query: 4 LAFFL-LSILLLSSLPLN-YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
LA FL L++ + +P + + + E E W ++GK Y EK++R +IF+DN F+
Sbjct: 11 LALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIE 70
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
N GN + L +N ADLT +EFK S G + N+ D+P +I
Sbjct: 71 SFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIPEAI 130
Query: 122 DWRKKGAVTEVKDQAS-CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
DWR KGAVT +KDQ CG+CWAFS A EGI +I TG L+SLSEQEL+DCD S + GC
Sbjct: 131 DWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCD-SVDHGC 189
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGLM+ ++F+IKN GI +E +YPY G C+ K I GY+ VP N+E+ L
Sbjct: 190 DGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEALQ 249
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGV-DYWIIKNS 298
QAV QPVSV I FQ YSSG+FTG C T LDH V +VGY +++G +YWI+KNS
Sbjct: 250 QAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVKNS 309
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG WG GY+ MQR G+CGI M ASYPT
Sbjct: 310 WGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPT 343
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 145/333 (43%), Positives = 204/333 (61%), Gaps = 10/333 (3%)
Query: 7 FLLSILLLSSLPLNYCSD------INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
+L+ L+L+ + S +E E W Q+GK Y+ EK++R +IF++N F+
Sbjct: 9 YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N G+ F LS+N FADL ++EFKAS + + S + ++ +P +
Sbjct: 69 ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE-SITKIPVT 127
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
+DWRK+GAVT +KDQ +CG+CWAFS AIEGI++I TG LVSLSEQEL+DC + + GC
Sbjct: 128 MDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEGC 187
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
G + A++FV KN G+ +E YPY+ C +K + + I GY++VP N+EK LL
Sbjct: 188 NFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKALL 247
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSW 299
+AV QPVSV I A Q YSSGIFTG C T+ +HAV ++GY + G YW++KNSW
Sbjct: 248 KAVANQPVSVYIDAG--ALQFYSSGIFTGKCGTAPNHAVTVIGYGKARGGAKYWLVKNSW 305
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G WG GY+ M+R+ G+CGI ASYPT
Sbjct: 306 GTKWGEKGYIKMKRDIRAKEGLCGIATNASYPT 338
>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
Length = 388
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 158/357 (44%), Positives = 213/357 (59%), Gaps = 24/357 (6%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+ F W HG++Y S E ++R +F +N V + N NS L+LN FADLT +EF
Sbjct: 44 QAFSQWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNAR-NSGLVLALNQFADLTLEEF 102
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
A+ LG++ + + S Q + D+P+++DWRKK AVT VK+QA CG+CWAFSA
Sbjct: 103 AATHLGYNPSLREGKEHTTTSFQY-ADANDLPSTVDWRKKNAVTPVKNQAMCGSCWAFSA 161
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
TGA+EGIN I TG LVSLSEQ+L+DCD + GCGGGLMD+A+ ++ KN GID+E DY Y
Sbjct: 162 TGAVEGINAIRTGKLVSLSEQQLVDCDSEKDLGCGGGLMDFAFDYITKNGGIDSEDDYSY 221
Query: 207 RGQAGQCNKQK-LNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
G C ++K +RH+VTIDG++DVP+N+ + L +A+ QPVS LY SG
Sbjct: 222 WGYGLICQRRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVS-----------LYHSG 270
Query: 266 IF-TGPCSTSLDHAVLIVGYD--SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
+ C L+H VL VGYD S+ G +++IKNSWG WG G+ + + + G C
Sbjct: 271 VVGDDACCQDLNHGVLAVGYDDGSKGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEASGAC 330
Query: 323 GINMLASYPTKTGQNPPPSPPPGPTRCSLL--TYCAAGETCCCGSSILG-ICLSWKC 376
G+ ASYP K + P PT C T C A +C C S L IC SW C
Sbjct: 331 GVYKAASYPLKK----DATNPEVPTFCGYFGWTECPANSSCECRWSFLDLICFSWGC 383
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 204/314 (64%), Gaps = 7/314 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM-GNSSFTLSLNAFADL 81
S + + + W Q+G++Y+++ E ++R KIF +N ++ + NN GN S+ L LN F+DL
Sbjct: 32 SVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMENLEYIEKFNNAPGNKSYKLDLNQFSDL 91
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T++EF AS G + + +L D P S+DWR++GAVT+VK+Q +CG+C
Sbjct: 92 TNEEFIASHTGLMIDPSKPSSSSKRASPASLDLSDTPTSLDWREQGAVTDVKNQGNCGSC 151
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC-DRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSA A+EGI KI G+L+SLSEQ+L+DC N GCGGG MD A+ ++ +N GI +
Sbjct: 152 WAFSAVAAVEGIVKIKNGNLISLSEQQLVDCASNEQNQGCGGGFMDNAFSYITEN-GIAS 210
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E DY YRG AG C ++ I GY+DVP E QLL AV QPVSV I + +F
Sbjct: 211 ENDYQYRGGAGTCQNNEMITPAARISGYEDVPA-GEDQLLLAVSQQPVSVAIAVGQ-SFH 268
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDS--ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
LY GI++GPC +SL+H V +VGY + E+G YW+IKNSWG SWG NGYM + R +G S
Sbjct: 269 LYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTKYWLIKNSWGESWGENGYMRLLRESGQS 328
Query: 319 LGICGINMLASYPT 332
G CGI + AS+PT
Sbjct: 329 EGHCGIAVKASHPT 342
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 153/340 (45%), Positives = 210/340 (61%), Gaps = 22/340 (6%)
Query: 7 FLLSILLLSSL-----PLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
FLL+IL +SL SD + E E W ++G+ Y EK +R + F+ N AF
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAAS----IDHDRRRNASVQSPGNLR 115
V N + F L +N FADLT +EFKA+ GF + + N SV +
Sbjct: 67 VESFNTNKKNKFWLGVNQFADLTTEEFKAN-KGFKPTAEKVPTTGFKYENLSVSA----- 120
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-R 174
+P ++DWR KGAVT +K+Q CG CWAFSA A+EGI K+ TG+L+SLSEQEL+DCD
Sbjct: 121 -LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTH 179
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
S + GC GG MD A++FVIKN G+ TE +YPY+ G+C + ++ TI G++DVP N
Sbjct: 180 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKC--KGGSKSAATIKGHEDVPVN 237
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYW 293
NE L++AV QPVSV + S+R F LYS G+ TG C T LDH + +GY E +G YW
Sbjct: 238 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 297
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
I+KNSWG +WG G++ M+++ + G+CG+ M SYPT+
Sbjct: 298 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPTE 337
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 291 bits (745), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 153/340 (45%), Positives = 209/340 (61%), Gaps = 21/340 (6%)
Query: 7 FLLSILLLSSL-----PLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
FLL+IL +SL SD + E E W ++G+ Y EK +R + F+ N AF
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKAS--FLGFSAASIDHD--RRRNASVQSPGNLR 115
V N + F L +N FADLT +EFKA+ F SA + + N SV +
Sbjct: 67 VESFNTNKKNKFWLGVNQFADLTTEEFKANKGFKPISAEMVPTTGFKYENLSVSA----- 121
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-R 174
+P ++DWR KGAVT +K+Q CG CWAFSA A+EGI K+ TG+L+SLSEQEL+DCD
Sbjct: 122 -LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTH 180
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
S + GC GG MD A++FVIKN G+ TE YPY+ G+C ++ TI G++DVP N
Sbjct: 181 SMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGG--SKSAATIKGHEDVPVN 238
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYW 293
+E L++AV QPVSV + S+R F LYS G+ TG C T LDH + +GY E +G YW
Sbjct: 239 DEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYW 298
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
I+KNSWG +WG G++ M+++ + G+CG+ M SYPT+
Sbjct: 299 ILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYPTE 338
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 146/323 (45%), Positives = 196/323 (60%), Gaps = 5/323 (1%)
Query: 13 LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFT 72
++S C+ +E E W Q+GK Y EK++R +IF++N F+ N G+ F
Sbjct: 24 IMSRRLFEACT--SERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFN 81
Query: 73 LSLNAFADLTHQEFKASFLGFSAA--SIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVT 130
LS+N FADL +EFKA + S+ + + + A++DWRK+GAVT
Sbjct: 82 LSINQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVT 141
Query: 131 EVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQ 190
+KDQ CG+CWAFSA AIEGI++I T LVSLSEQEL+DC + + GC GG M+ A++
Sbjct: 142 PIKDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFE 201
Query: 191 FVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSV 250
FV K GI +E YPY+G+ C +K + I GY+ VP N+EK L +AV QPVSV
Sbjct: 202 FVAKKGGIASESYYPYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSV 261
Query: 251 GICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYM 309
+ AFQ YSSGIFTG C T+ DHA+ +VGY S G YW++KNSWG WG GY+
Sbjct: 262 YVEAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYI 321
Query: 310 HMQRNTGNSLGICGINMLASYPT 332
M+R+ G+CGI M A YPT
Sbjct: 322 RMKRDIRAKEGLCGIAMNAFYPT 344
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 291 bits (744), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 144/338 (42%), Positives = 212/338 (62%), Gaps = 13/338 (3%)
Query: 3 SLAFFLLSIL-----LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
+L F +L L +L++ L+ + + E W Q+G+ Y + EK +R ++F+ N
Sbjct: 6 ALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANV 65
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG-NLRD 116
AF+ + N GN +F L +N FADLT+ EF+ ++ + I R + N+
Sbjct: 66 AFI-ESFNAGNHNFWLGVNQFADLTNDEFR--WMKTNKGFIPSTTRVPTGFRYENVNIDA 122
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RS 175
+PA++DWR KGAVT +KDQ CG CWAFSA A+EGI K+ TG L+SLSEQEL+DCD
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
+ GC GGLMD A++F+IKN G+ TE +YPY +C + ++ + +I GY+DVP NN
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPANN 240
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWI 294
E L++AV QPVSV + G + FQ Y G+ TG C T LDH ++ +GY + +G YW+
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWL 300
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+KNSWG +WG NG++ M+++ + G+CG+ M SYPT
Sbjct: 301 LKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPT 338
>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 290 bits (743), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 129/216 (59%), Positives = 164/216 (75%)
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P S+DWR KG + VKDQ SCG+CWAFSA A+E IN IVTG+L+SLSEQEL+DCD+SYN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMDYA++FVI N GIDTE+DYPY+ + G C++ + N +VTID Y+DVP NNEK
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNEK 121
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L +AV QPVS+ + R FQ Y SGIFTG C T++DH V++ GY +ENG+DYWI++N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGTENGMDYWIVRN 181
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
SWG WG GY+ +QRN +S G+CG+ + SYP K
Sbjct: 182 SWGAKWGEKGYLRVQRNVASSSGLCGLAIEPSYPVK 217
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 290 bits (743), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 194/311 (62%), Gaps = 9/311 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W +++GK Y E Q+R IFE+N F+ N GN + LS+N AD T++EF
Sbjct: 36 ERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNEEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSP---GNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
AS G+ + H + + Q+P N+ D+P ++DWR+KG VT +KDQA CG CWA
Sbjct: 96 MASHKGYKGS---HWQGLRITTQTPFKYENVTDIPWAVDWRQKGDVTSIKDQAQCGNCWA 152
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FSA A EGI +I TG+LVSLSE+EL+DCD S + GC GGLM++ ++F+IKN GI +E +
Sbjct: 153 FSAVAATEGIYQITTGNLVSLSEKELVDCD-SVDHGCDGGLMEHGFEFIIKNGGISSEAN 211
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAFQLY 262
YPY G C+ K + I GY+ VP N E++L +AV Q +SV I AFQ Y
Sbjct: 212 YPYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQLTMSVSIDAGGSAFQFY 271
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
SG+FTG C T LDH V VGY S + G YWI+KNSWG WG GY+ M R G+
Sbjct: 272 PSGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQWGEEGYIRMLRGIDAQEGL 331
Query: 322 CGINMLASYPT 332
CGI M ASYPT
Sbjct: 332 CGIAMDASYPT 342
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 290 bits (742), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 212/339 (62%), Gaps = 15/339 (4%)
Query: 3 SLAFFLLSIL-----LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
+L F +L L +L++ L+ + + E W Q+G+ Y + EK +R ++F+ N
Sbjct: 6 ALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANV 65
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFK--ASFLGFSAASIDHDRRRNASVQSPGNLR 115
AF+ + N GN +F L +N FADLT+ EF+ + GF ++ R N+
Sbjct: 66 AFI-ESFNAGNHNFWLGVNQFADLTNDEFRWTKTNKGFIPSTT---RVPTGFRYENVNID 121
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-R 174
+PA++DWR KGAVT +KDQ CG CWAFSA A+EGI K+ TG L+SLSEQEL+DCD
Sbjct: 122 ALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVH 181
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
+ GC GGLMD A++F+IKN G+ TE +YPY +C + ++ + +I GY+DVP N
Sbjct: 182 GEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPAN 239
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYW 293
NE L++AV QPVSV + G + FQ Y G+ TG C T LDH ++ +GY + +G YW
Sbjct: 240 NEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYW 299
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
++KNSWG +WG NG++ M+++ + G+CG+ M SYPT
Sbjct: 300 LLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPT 338
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 290 bits (742), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 154/350 (44%), Positives = 214/350 (61%), Gaps = 17/350 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
LA L++ + + + S+ + +L+E W + H EK++R +F+ N +
Sbjct: 13 LAVILVAAMSMEITERDLASEESLWDLYERW-RSHHTVSRDLSEKRKRFNVFKANVHHIH 71
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL----RDV 117
+ N + + L LN+FAD+T+ EF+ F ++ + H R + S + G + +
Sbjct: 72 KVNQK-DKPYKLKLNSFADMTNHEFRE----FYSSKVKHYRMLHGSRANTGFMHGKTESL 126
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
PAS+DWRK+GAVT VK+Q CG+CWAFS +EGINKI TG LVSLSEQEL+DC+ N
Sbjct: 127 PASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETD-N 185
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLM+ AY+F+ K+ GI TE+ YPY+ + G C+ K+N VTIDG++ VP N+E
Sbjct: 186 EGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPANDEN 245
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTG-PCSTSLDHAVLIVGYDSE-NGVDYWII 295
L++AV QPVSV I S Q YS G++ G C LDH V +VGY + +G YWI+
Sbjct: 246 ALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTALDGTKYWIV 305
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLG-ICGINMLASYPTK-TGQNPPPSPP 343
KNSWG WG GY+ MQR + G +CGI M ASYP K + NP PSPP
Sbjct: 306 KNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPLKLSSHNPKPSPP 355
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 290 bits (742), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 147/307 (47%), Positives = 189/307 (61%), Gaps = 13/307 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W Q+G+ Y + EK+ R IF++N A + N+ S+ L +N FADL+++EF
Sbjct: 3 ERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNEEF 62
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
KAS F H A N+ VPA++DWRKKGAVT VKDQ C A
Sbjct: 63 KASRNRFKG----HMCSPQAGPFRYENVSAVPATMDWRKKGAVTPVKDQGQCVA------ 112
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A+EGIN++ TG L+SLSEQE++DCD + + GC GGLMD A++F+ +N G+ TE +YP
Sbjct: 113 --AMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 170
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y G G CN QK H I G++DVP N+E L++AV QPVSV I FQ YSSG
Sbjct: 171 YTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSG 230
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
IFTG C T LDH V VGY +G YW++KNSWG WG GY+ MQ++ G+CGI
Sbjct: 231 IFTGSCGTELDHGVTAVGYGGSDGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIA 290
Query: 326 MLASYPT 332
M ASYPT
Sbjct: 291 MQASYPT 297
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 144/333 (43%), Positives = 203/333 (60%), Gaps = 10/333 (3%)
Query: 7 FLLSILLLSSLPLNYCSD------INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
+L+ L+L+ + S +E E W Q+GK Y+ EK++R +IF++N F+
Sbjct: 9 YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N G+ F LS+N FADL ++EFKAS + + S + ++ +P +
Sbjct: 69 ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE-SITKIPVT 127
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
+DWRK+GAVT +KDQ +CG+CWAFS AIEGI++I TG LVSLSEQEL+DC + + GC
Sbjct: 128 MDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEGC 187
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
G + A++FV KN G+ +E YPY+ C +K + + I GY++VP N+EK LL
Sbjct: 188 NFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKALL 247
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSW 299
+AV QPVSV I A Q YSSGIFTG C T+ +HA ++GY + G YW++KNSW
Sbjct: 248 KAVANQPVSVYIDAG--ALQFYSSGIFTGKCGTAPNHAATVIGYGKARGGAKYWLVKNSW 305
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G WG GY+ M+R+ G+CGI ASYPT
Sbjct: 306 GTKWGEKGYIRMKRDIRAKEGLCGIATNASYPT 338
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 201/329 (61%), Gaps = 26/329 (7%)
Query: 13 LLSSLPLNYC-SDINELFETWCKQHGKAYSS-EQEKQQRLKIFEDNYAFVTQHN---NMG 67
L S+ PL ++ +L++TW +HG+ RLK+F DN ++ HN + G
Sbjct: 34 LRSAAPLERADEEVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAG 93
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
+F L L F DLT +EF+A LGF +++ R + P D+P ++DWR++G
Sbjct: 94 LHTFRLGLTPFTDLTLEEFRAHALGFLNSTLP---RVASDRYLPRAGDDLPDAVDWRQQG 150
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 187
AVT VK+Q CG CWAFSA A+EGINKIVT +L+SLSEQELIDCD + + GC GG M
Sbjct: 151 AVTGVKNQLDCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCD-TEDYGCQGGEMQK 209
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 247
A+QFVI N GIDTE DYP+ G G C+ + R +V+ID Y++VP N+E+ L +AV QP
Sbjct: 210 AFQFVIDNGGIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP 269
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 307
GIF GPC LDH V VGY S+NG D+WI+KNSWG WG +G
Sbjct: 270 -----------------GIFNGPCGFILDHGVTAVGYGSDNGEDFWIVKNSWGAEWGESG 312
Query: 308 YMHMQRNTGNSLGICGINMLASYPTKTGQ 336
Y+ M+RN +G CGI M ASYP K G+
Sbjct: 313 YIRMKRNVLLPMGKCGIAMYASYPVKNGR 341
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/339 (42%), Positives = 211/339 (62%), Gaps = 15/339 (4%)
Query: 3 SLAFFLLSIL-----LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
+L F +L L +L++ L+ + + E W Q+G+ Y + EK +R ++F+ N
Sbjct: 6 ALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANV 65
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHDRRRNASVQSPGNLR 115
AF+ + N GN F L +N FADLT+ EF+++ GF ++ R N+
Sbjct: 66 AFI-ESFNAGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTT---RVPTGFRYENVNID 121
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-R 174
+PA++DWR KG VT +KDQ CG CWAFSA A+EGI K+ TG L+SLSEQEL+DCD
Sbjct: 122 ALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVH 181
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
+ GC GGLMD A++F+IKN G+ TE +YPY +C + ++ + +I GY+DVP N
Sbjct: 182 GEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPAN 239
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYW 293
NE L++AV QPVSV + G + FQ Y G+ TG C T LDH ++ +GY + +G YW
Sbjct: 240 NEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYW 299
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
++KNSWG +WG NG++ M+++ + G+CG+ M SYPT
Sbjct: 300 LLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPT 338
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 154/333 (46%), Positives = 202/333 (60%), Gaps = 24/333 (7%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
+++LF W ++HGK Y SE+EK+ RLKIF DN+ FV +HN G + + LN ADL
Sbjct: 64 LSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLADL 123
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDV--PASIDWRKKGAVTEVKDQASC 138
T EFK LG++AA R A V S DV P IDW GAVT VK+Q C
Sbjct: 124 TKDEFK-KMLGYNAAL----RASRAPVDASTWEYADVTPPEEIDWVASGAVTPVKNQKQC 178
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFS TGA+EG+N I TG L+SLSE+ELI C + N GC GGLMD +++++ N GI
Sbjct: 179 GSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGI 238
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
DTE + Y + +C + + V IDG+KDVP N+E L++AV QPVSV I ++
Sbjct: 239 DTEDGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQS 298
Query: 259 FQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVD--------YWIIKNSWGRSWGMNGYM 309
FQLY+ G+++ C T LDH VL+VGY GVD +W IKNSWG +WG +GY+
Sbjct: 299 FQLYAGGVYSAKDCGTELDHGVLLVGY----GVDPKSTKHKHFWKIKNSWGPAWGEDGYI 354
Query: 310 HMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+ + G CG+ M SYPTK G P P
Sbjct: 355 RIAKGGSGVEGQCGVAMQPSYPTKLGTTPLGEP 387
>gi|356557743|ref|XP_003547170.1| PREDICTED: LOW QUALITY PROTEIN: xylem cysteine proteinase 1-like
[Glycine max]
Length = 400
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 154/330 (46%), Positives = 208/330 (63%), Gaps = 11/330 (3%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SIL L + ELF+ W +++ K Y + +E++ R + F+ N ++ + N+ S
Sbjct: 31 SILALEIDKFPSEEGVVELFQRWKEENKKIYRNPEEEKLRFENFKRNLKYIVEKNSKRIS 90
Query: 70 SF--TLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
+ +L LN FAD++++EFK+ F+ +RN + D P S+DWRKKG
Sbjct: 91 PYGQSLGLNQFADMSNEEFKSKFMSKVKKPF---SKRNGVSSKDHSCEDEPYSLDWRKKG 147
Query: 128 AVT-EVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
VT VKDQ CG+ WAFS+T AIEGIN IVT L+SLSEQEL+DCD S N GC GG MD
Sbjct: 148 VVTLAVKDQGYCGSYWAFSSTDAIEGINAIVTADLISLSEQELVDCD-STNDGCDGGXMD 206
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
YA+++V+ N GIDTE +YPY G G CN K ++ IDGY DV +++ LL A V Q
Sbjct: 207 YAFEWVMYNGGIDTETNYPYIGADGTCNVTKEKTKVIGIDGYYDVGQSD-SSLLCATVKQ 265
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTS---LDHAVLIVGYDSENGVDYWIIKNSWGRSW 303
P+S GI G+ FQLY GI+ G CS+ +DHA+L+VGY SE DYWI+KNSW SW
Sbjct: 266 PISAGIDGTSWDFQLYIGGIYDGDCSSDPDDIDHAILVVGYGSEGDDDYWIVKNSWRTSW 325
Query: 304 GMNGYMHMQRNTGNSLGICGINMLASYPTK 333
GM G +++++NT G C IN +ASYPTK
Sbjct: 326 GMEGCIYLRKNTNLKYGXCAINYMASYPTK 355
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 194/313 (61%), Gaps = 18/313 (5%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQEF 86
E W QHG+ Y E +K R +F+ N F+ N GN F L +N FADLT+ EF
Sbjct: 42 EQWMVQHGRVYKDETDKAHRFLVFKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEF 101
Query: 87 KASFL--GFSAASIDHD---RRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+A+ GF+ + R +N S+ + +P ++DWR KGAVT +KDQ CG C
Sbjct: 102 RATKTNKGFNPNVVKVPTGFRYQNLSIDA------LPQTVDWRTKGAVTPIKDQGQCGCC 155
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSA A EGI KI TG L SLSEQEL+DCD + GC GG MD A++F+IKN G+ T
Sbjct: 156 WAFSAVAATEGIVKISTGKLTSLSEQELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTT 215
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E +YPY Q GQC + + TI GY+DVP N+E L++AV +QPVSV + G + FQ
Sbjct: 216 ESNYPYTAQDGQC--KSGSNGAATIKGYEDVPANDEAALMKAVASQPVSVAVDGGDMTFQ 273
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G+ TG C T LDH + +GY + +G YW++KNSWG +WG NG++ M+++ +
Sbjct: 274 FYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGFLRMEKDIADKK 333
Query: 320 GICGINMLASYPT 332
G+CG+ M SYPT
Sbjct: 334 GMCGLAMQPSYPT 346
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 139/307 (45%), Positives = 192/307 (62%), Gaps = 6/307 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W QHG+ Y EK +RL++F+ N AF+ N G + + L +N FADLT +EFKA+
Sbjct: 45 ERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKAT 104
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
S ++ R ++ N+ +PAS+DWR KGAVT +KDQ CG CWAFSA
Sbjct: 105 MTNSKGFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAV 164
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EGI K+ TG L+SLSEQEL+DCD N GC GG +D A+QF++ N G+ E +YPY
Sbjct: 165 AAMEGIVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPY 224
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ G+C +I GY+DVP N+E L++AV QPVSV + S+ FQ Y G+
Sbjct: 225 TAEDGRCKTTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDASK--FQFYGGGV 282
Query: 267 FTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
G C TSLDH V ++GY + +G YW++KNSWG +WG GY+ M+++ + G+CG+
Sbjct: 283 MAGECGTSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLA 342
Query: 326 MLASYPT 332
M SYPT
Sbjct: 343 MQPSYPT 349
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 288 bits (738), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 191/311 (61%), Gaps = 15/311 (4%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W Q+ + Y EK +R ++F+ N F+ N GN+ F L +N FADLT+ EF+++
Sbjct: 131 EQWMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQFADLTNDEFRST 190
Query: 90 FLGFSAASIDHD-----RRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
S + R N S + +P +IDWR KGAVT +KDQ CG CWAF
Sbjct: 191 KTNKGLKSSNMKIPTGFRYENVSADA------LPTTIDWRTKGAVTPIKDQGQCGCCWAF 244
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
SA A EGI KI TG LVSL+EQEL+DCD + GC GGLMD A++F+IKN G+ TE
Sbjct: 245 SAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESS 304
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G+C + + TI GY+DVP N+E L++AV QPVSV + G + FQ YS
Sbjct: 305 YPYTAADGKC--KSGSNSAATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYS 362
Query: 264 SGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
G+ TG C T LDH + +GY + +G YW++KNSWG +WG NGY+ M+++ + G+C
Sbjct: 363 GGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMC 422
Query: 323 GINMLASYPTK 333
G+ M SYPT+
Sbjct: 423 GLAMEPSYPTE 433
>gi|413956349|gb|AFW88998.1| hypothetical protein ZEAMMB73_678859 [Zea mays]
Length = 1140
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 139/272 (51%), Positives = 165/272 (60%), Gaps = 27/272 (9%)
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
A G+CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N
Sbjct: 777 AVAGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN 836
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
GIDTEKDYPY+G G+C+ + N +VTID Y+DVP N+EK L +AV QPVSV I +
Sbjct: 837 GGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAA 896
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
FQLYSSGIFTG C T+LDH V VGY +ENG DYWI+KNSWG SWG +G +R
Sbjct: 897 GTTFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIMKNSWGSSWGESGRAPTRRTL 956
Query: 316 GNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWK 375
P P C C TCCC C +W
Sbjct: 957 A---------------------------PAPAVCDNYYSCPDSTTCCCIYEYGKYCFAWG 989
Query: 376 CCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP +YPIC+ + CL
Sbjct: 990 CCPLEGATCCDDHYSCCPHDYPICNVRQGTCL 1021
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 200/317 (63%), Gaps = 29/317 (9%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
++ ++E +HGK Y++ E ++R +I ++N FV QHN GN ++ + LN FAD +
Sbjct: 47 EVMSIYEEXLAKHGKVYNAIDEMEERFQISKENLKFVEQHN-AGNRTYKVGLNRFADRSR 105
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
R +S +P ++ S+DWRK+GAV VK Q+ C +C
Sbjct: 106 M-----------------MTRPSSRYAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRT 148
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
F+ A+EGINKIVTG+L +LS DCDR+ N+GC GGL DYA +F+I N GIDTE+D
Sbjct: 149 FTVIAAVEGINKIVTGNLTALS-----DCDRTVNAGCSGGLADYALEFIINNGGIDTEED 203
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG-ICGSERAFQLY 262
YP++G G C++ K+N +DGY+ VP +E L +AV QPVSV I + FQLY
Sbjct: 204 YPFQGAVGICDQYKIN----AVDGYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLY 259
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG-NSLGI 321
SGIFTG C TS+DH V VGY +ENG+DYWI+KNSWG +WG GY+ M+RNT ++ G
Sbjct: 260 ESGIFTGKCGTSIDHGVTAVGYGTENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGK 319
Query: 322 CGINMLASYPTKTGQNP 338
CGI +L YP K+GQNP
Sbjct: 320 CGIAILTLYPIKSGQNP 336
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 149/337 (44%), Positives = 205/337 (60%), Gaps = 18/337 (5%)
Query: 8 LLSIL--------LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
LL+IL +L++ LN + E W Q+G+ Y EK Q+ ++F+ N F
Sbjct: 8 LLAILGCLCLCGSVLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEF 67
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHDRRRNASVQSPGNLRDV 117
+ N GN F L +N FAD+T++EFKA+ GF + + R + + +
Sbjct: 68 INSFN-AGNHKFWLGINQFADITNEEFKATKTNKGFISNKV---RVPTGFMYENMSFDAL 123
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
PA+IDWR KGAVT +KDQ CG CWAFSA A+EGI K+ TG LVSLSEQEL+DCD
Sbjct: 124 PATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVHGE 183
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GGLMD A++F+IKN G+ E +YPY G+C + + TI Y+DVP NNE
Sbjct: 184 DQGCEGGLMDDAFKFIIKNGGLTQESNYPYDAADGKC--KSGSSSAATIKSYEDVPANNE 241
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWII 295
L++AV QPVSV + G + FQ YS G+ TG C T LDH + +GY + +G +WI+
Sbjct: 242 GALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTSDGTKFWIM 301
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG SWG NG++ M+++ + G+CG+ M SYPT
Sbjct: 302 KNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPT 338
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 151/325 (46%), Positives = 197/325 (60%), Gaps = 7/325 (2%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
+I+ S L + LFE+W ++ K Y + EK R +IF+DN ++ + N N
Sbjct: 2 FAIVGYSQDDLTSIERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK-N 60
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
SS+ L LN FADLTH EFKA ++G + + ++ D P SIDWR+KGA
Sbjct: 61 SSYWLGLNEFADLTHDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHVVDYPESIDWRQKGA 120
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT VK+Q CG+CWAFS +EGINKIVTG L+SLSEQEL+DCDR + GC GG +
Sbjct: 121 VTPVKNQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTS 179
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
Q+V N G+ TEK+YPY + G+C + V I GYK VP NNE L+QA+ QPV
Sbjct: 180 LQYVADN-GVHTEKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPV 238
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SV + RAFQ Y GIF GPC T +DHAV VGY G +Y +IKNSWG WG GY
Sbjct: 239 SVVVESKGRAFQFYKGGIFEGPCGTKVDHAVTAVGY----GKNYILIKNSWGPKWGEKGY 294
Query: 309 MHMQRNTGNSLGICGINMLASYPTK 333
+ ++R +G S G CG+ + +PTK
Sbjct: 295 IRIKRASGKSKGTCGVYSSSYFPTK 319
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 141/285 (49%), Positives = 187/285 (65%), Gaps = 5/285 (1%)
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
F+ +HN N S+ + LN FADLT +EF++++LGF+ S ++ + ++ P + +P
Sbjct: 3 FIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFTGGS---NKTKVSNRYEPRVSQVLP 59
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELI C + N+
Sbjct: 60 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQNT 119
Query: 179 -GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GG + +QF+I N GI+T ++YPY Q G+CN N VTID Y +VP NNE
Sbjct: 120 RGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYNNEW 179
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI++N
Sbjct: 180 ALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIVEN 239
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
SW +WG GYM + RN G + G CGI + SYP K P P
Sbjct: 240 SWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNYPKP 283
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 287 bits (735), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 148/283 (52%), Positives = 180/283 (63%), Gaps = 8/283 (2%)
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG- 112
++N ++ NN N + L +N FADLT +EF F+ H R N +
Sbjct: 5 KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNG----HMRFSNTRTTTFKY 60
Query: 113 -NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
N+ +P SIDWR+KGAVT +K+Q SCG CWAFSA A EGI+KI TG LVSLSEQE++D
Sbjct: 61 ENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVD 120
Query: 172 CD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
CD + + GC GG MD A++F+I+NHGI+TE YPY+G G+CN ++ H TI GY+D
Sbjct: 121 CDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGYED 180
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN-G 289
VP NNEK L +AV QPVSV I FQ Y SGIFTG C T LDH V VGY N G
Sbjct: 181 VPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEG 240
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG WG GY MQR GICGI MLASYPT
Sbjct: 241 TKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPT 283
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 138/300 (46%), Positives = 195/300 (65%), Gaps = 4/300 (1%)
Query: 32 WCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLTHQEFKASF 90
W +HG+ Y+ EK R +F+ N + + N++ + +F L++N FADLT++EF++ +
Sbjct: 35 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 94
Query: 91 LGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
GF S+ R + S + D +P S+DWRKKGAVT +KDQ CG+CWAFSA A
Sbjct: 95 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 154
Query: 150 IEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQ 209
IEG+ +I G L+SLSEQEL+DCD + + GC GGLMD A+ + I G+ +E +YPY+
Sbjct: 155 IEGVAQIKKGKLISLSEQELVDCDTN-DGGCMGGLMDTAFNYTITIGGLTSESNYPYKST 213
Query: 210 AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTG 269
G CN K + +I G++DVP N+EK L++AV PVS+GI G + FQ YSSG+F+G
Sbjct: 214 NGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSG 273
Query: 270 PCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLA 328
C+T LDH V VGY S+NG+ YWI+KNSWG WG GYM ++++ G CG+ M A
Sbjct: 274 ECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGLAMNA 333
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 184/275 (66%), Gaps = 10/275 (3%)
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKAS---FLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
++N+ N + L +N FADLT++EFKAS F G +SI R + + N +P+
Sbjct: 2 NSNVNNKLYKLGINKFADLTNEEFKASRNKFKGHMCSSI----IRTTTFKYE-NASAIPS 56
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNS 178
++DWRKKGAVT VK+Q CG+CWAFSA A EGI+++ TG LVSLSEQELIDCD + +
Sbjct: 57 TVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQ 116
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A++F+I+NHG+ TE YPY G G CN + + H VTI GY+DVP NNE
Sbjct: 117 GCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELA 176
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN-GVDYWIIKN 297
L +AV QP+SV I S FQ Y+SG+FTG C T LDH V VGY N G YW++KN
Sbjct: 177 LQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKN 236
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG WG GY+ MQR + G+CGI M ASYPT
Sbjct: 237 SWGADWGEEGYIRMQRGIDAAEGLCGIAMQASYPT 271
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 155/343 (45%), Positives = 204/343 (59%), Gaps = 16/343 (4%)
Query: 1 MNSLAFFLLSILL------LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFE 54
M S+ FFLL+ILL ++S + + E E W + + YS + EK R +IF
Sbjct: 1 MTSIVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFT 60
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAAS-----IDHDRRRNASVQ 109
+N FV N N ++TL +N F+DLT +EFKA + G D S +
Sbjct: 61 NNLKFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFR 120
Query: 110 SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
N+ + S+DW ++GAVT VK Q CG CWAFSA A+EG+ KI G LVSLSEQ+L
Sbjct: 121 YE-NVGETGESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQL 179
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
+DC + N+GCGGG+M A+ ++ +N GI TE +YPY+G C L TI GY+
Sbjct: 180 LDCS-TENNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQTCESNHL--AAATISGYE 236
Query: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SEN 288
VP+N+E+ LL+AV QPVSV I GS F YS GIF G C T L HAV IVGY SE
Sbjct: 237 TVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEE 296
Query: 289 GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
G+ YW++KNSWG SWG NGYM + R+ + G+CG+ LA YP
Sbjct: 297 GIKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYP 339
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 192/308 (62%), Gaps = 6/308 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W QHG+ Y EK +RL++F+ N AF+ N G + + L +N FADLT +EFKA+
Sbjct: 45 ERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKAT 104
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
S ++ R ++ N+ +PAS+DWR KGAVT +KDQ CG CWAFSA
Sbjct: 105 MTNSKGFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAV 164
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EG K+ TG L+SLSEQEL+DCD N GC GG +D A+QF++ N G+ E +YPY
Sbjct: 165 AAMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPY 224
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ G+C +I GY+DVP N+E L++AV QPVSV + S+ FQ Y G+
Sbjct: 225 TAEDGRCKTTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDASK--FQFYGGGV 282
Query: 267 FTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
G C TSLDH V ++GY + +G YW++KNSWG +WG GY+ M+++ + G+CG+
Sbjct: 283 MAGECGTSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLA 342
Query: 326 MLASYPTK 333
M SYPT+
Sbjct: 343 MQPSYPTE 350
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 152/322 (47%), Positives = 202/322 (62%), Gaps = 13/322 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S EK R +F+ N V N + + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVTRSLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFADMTNYEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+ + + + + H R G N+++VP+SIDWRKKGAVT+VKDQ CG+C
Sbjct: 96 RRIY---ADSKVSHHRMFRGMSNENGTFMYENVKNVPSSIDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LVSLSEQEL+DCD N GC GGLM+YA++F IK +GI TE
Sbjct: 153 WAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEF-IKQNGITTE 211
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY + G C+ +K ++ V+IDGY++VP NNE LL+A QPVSV I FQ
Sbjct: 212 SNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAIDAGGYNFQF 271
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+F+G C T L+H V +VGY +++ YWI+KNSWG WG GY+ MQR + G
Sbjct: 272 YSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGYIRMQRGISHKEG 331
Query: 321 ICGINMLASYP-TKTGQNPPPS 341
+CGI M ASYP K+ NP S
Sbjct: 332 LCGIAMEASYPIKKSSTNPTES 353
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 142/298 (47%), Positives = 184/298 (61%), Gaps = 15/298 (5%)
Query: 50 LKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDR-----RR 104
+F+ N + + N + + L LN F D+T EF+ + G + + H R R+
Sbjct: 70 FNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRRHYAG---SRVAHHRMFRGDRQ 125
Query: 105 NASVQSP---GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSL 161
+S + + RDVPAS+DWR+KGAVT+VKDQ CG+CWAFS A+EGIN I T +L
Sbjct: 126 GSSASASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNL 185
Query: 162 VSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRH 221
SLSEQ+L+DCD N+GC GGLMDYA+Q++ K+ G+ E YPYR + C K
Sbjct: 186 TSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCKKSPAP-- 243
Query: 222 IVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLI 281
+VTIDGY+DVP N+E L +AV QPVSV I S FQ YS G+F+G C T LDH V
Sbjct: 244 VVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAA 303
Query: 282 VGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
VGY + +G YW++KNSWG WG GY+ M R+ G CGI M ASYP KT NP
Sbjct: 304 VGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPVKTSPNP 361
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 150/336 (44%), Positives = 206/336 (61%), Gaps = 19/336 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
FL I SS L+ S I E W H + Y+ EK +R +IF++N F+ +H
Sbjct: 14 FMLFLTCICRASSRTLSESS-IATQHEEWMAMHDRVYADSAEKDRRQQIFKENLEFIEKH 72
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFLG--------FSAASIDHDRRRNASVQSPGNLR 115
NN G + LSLN+FADLT++EF AS G + I+H + ++
Sbjct: 73 NNEGKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGFHKM-----SVG 127
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
D+ AS+DWRK+GAV ++K+Q CG+CWAFSA A+EGIN+I G LVSLSEQ L+DC +
Sbjct: 128 DIEASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLVDC--A 185
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N GC G ++ A+ + I+++G+ E++YPY G C+ + + I GY+ V N
Sbjct: 186 SNDGCHGQYVEKAFDY-IRDYGLANEEEYPYVETVGTCSGN--SNPAIQIRGYQSVTPQN 242
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E+QLL AV +QPVSV + + FQ YS G+F+G C T L+HAV IVGY E YW+I
Sbjct: 243 EEQLLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEAEGKYWLI 302
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+NSWG+SWG GYM + R+TGN G+CGINM ASYP
Sbjct: 303 RNSWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYP 338
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 195/321 (60%), Gaps = 12/321 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
EL+E W QH + EK +R +F+DN + + N + + L LN F D+T EF
Sbjct: 46 ELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDMTADEF 103
Query: 87 KASFLGFSAASIDHDRR-RNASVQSPGNL----RDVPASIDWRKKGAVTEVKDQASCGAC 141
+ ++ +++ + H R R + G + RD+PA++DWR+KGAV VKDQ CG+C
Sbjct: 104 RRAY---ASSRVSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQGQCGSC 160
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS A+EGIN I T +L +LSEQ+L+DCD ++ N+GC GGLMD A+Q++ K+ G+
Sbjct: 161 WAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVAA 220
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
YPYR + C + VTIDGY+DVP N+E L +AV QPVSV I FQ
Sbjct: 221 SSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHFQ 280
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G+F G C T LDH V VGY + +G YWI++NSWG WG GY+ M+R+
Sbjct: 281 FYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSAKE 340
Query: 320 GICGINMLASYPTKTGQNPPP 340
G+CGI M ASYP KT NP P
Sbjct: 341 GLCGIAMEASYPIKTSPNPAP 361
>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
Length = 1039
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 133/217 (61%), Positives = 161/217 (74%), Gaps = 1/217 (0%)
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
A G+CWAFS A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N
Sbjct: 710 AVAGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN 769
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
GIDTEKDYPY+G G+C+ + N +VTID Y+DVP N+EK L +AV QPVSV I +
Sbjct: 770 GGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAA 829
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
FQLYSSGIFTG C T+LDH V +VGY +ENG DYWI+KNSWG SWG +GY+ M+RN
Sbjct: 830 GTTFQLYSSGIFTGSCGTALDHGVTVVGYGTENGKDYWIMKNSWGSSWGESGYVRMERNI 889
Query: 316 GNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLL 352
S G CGI + SYP K G N PP+P PG R ++
Sbjct: 890 KASSGKCGIAVEPSYPLKEGAN-PPNPGPGARRACIV 925
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 144/327 (44%), Positives = 199/327 (60%), Gaps = 16/327 (4%)
Query: 13 LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFT 72
+L++ LN + ETW Q+G+ Y EK Q+ ++F+ N F+ N N F
Sbjct: 21 VLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFKANARFIDSFNAE-NHKFW 79
Query: 73 LSLNAFADLTHQEFKAS-----FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
L +N FADLT++EFKA+ F+ A + N +++ +P SIDWR KG
Sbjct: 80 LGINQFADLTNEEFKATKTNKGFISNKARVSTGFKYENLKIEA------LPTSIDWRTKG 133
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMD 186
AVT VKDQ CG CWAFSA A EGI K+ TG LVSLSEQEL+DCD + GC GGLMD
Sbjct: 134 AVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMD 193
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
A++F+I N G+ E YPY + G+C + ++ TI Y+DVP NNE L++AV Q
Sbjct: 194 DAFKFIITNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKSYEDVPANNEGALMKAVANQ 251
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGM 305
PVSV + G + FQ YS G+ TG C T LDH + +GY + +G +W++KNSWG +WG
Sbjct: 252 PVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKFWLMKNSWGTTWGE 311
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPT 332
NG++ M+++ + G+CG+ M SYPT
Sbjct: 312 NGFLRMEKDIADKKGMCGLAMEPSYPT 338
>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 127/216 (58%), Positives = 162/216 (75%)
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P S+DWR KG + VKDQ SCG+CWAFSA A+E IN IVTG+L+SLSEQEL+DCD+SYN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMDYA++FVI N GID+E+DYPY+ + G C++ + N +V ID Y+DVP NNEK
Sbjct: 62 QGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNGVCDQYRKNAKVVVIDSYEDVPVNNEK 121
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L +AV QPVS+ + R FQ Y SGIFTG C T++DH V+ GY +ENG+DYWI++N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGLDYWIVRN 181
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
SWG WG GY+ +QRN +S G+CG+ + SYP K
Sbjct: 182 SWGADWGEKGYLRVQRNVASSSGLCGLAIEPSYPVK 217
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 134/256 (52%), Positives = 176/256 (68%), Gaps = 5/256 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
+ ++ W HG+ Y++ E+++R ++F DN +V HN + G SF L LN FA
Sbjct: 40 EEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFA 99
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
DLT+ E++A++LG S RR G+ D+P S+DWR KGAV EVKDQ SCG
Sbjct: 100 DLTNDEYRATYLGVR--SRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQGSCG 157
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS A+EGIN+IVTG ++SLSEQEL+DCD SYN GC GGLMDYA++F+I N GID
Sbjct: 158 SCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGID 217
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TE+DYPY+G G+C+ + N +VTID Y+DVP N+EK L +AV QP+SV I RAF
Sbjct: 218 TEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAF 277
Query: 260 QLYSSGIFTGPCSTSL 275
QLY+SGIFTG C S+
Sbjct: 278 QLYNSGIFTGTCGNSV 293
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/337 (44%), Positives = 203/337 (60%), Gaps = 18/337 (5%)
Query: 8 LLSIL--------LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
LL+IL +L++ LN + E+W Q+G+ Y EK + ++F+ N F
Sbjct: 8 LLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGF 67
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHDRRRNASVQSPGNLRDV 117
+ N GN F L +N FAD+T++EFKA+ GF + + R + +
Sbjct: 68 IDSFN-AGNHKFWLGINQFADITNKEFKATKTNKGFISNKV---RAPTGFSYENVSFDAL 123
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
PASIDWR KGAVT VKDQ CG CWAFSA A EGI K+ TG LVSLSEQEL+DCD
Sbjct: 124 PASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGE 183
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GGLMD A++F+I N G+ E YPY + G+C + ++ TI Y+DVP NNE
Sbjct: 184 DQGCEGGLMDDAFKFIISNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKSYEDVPANNE 241
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWII 295
L++AV QPVSV + G + FQ YS G+ TG C T LDH + +GY + +G YW++
Sbjct: 242 GALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLM 301
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG SWG NG++ M+++ + G+CG+ M SYPT
Sbjct: 302 KNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPT 338
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 150/330 (45%), Positives = 195/330 (59%), Gaps = 26/330 (7%)
Query: 6 FFLLSILLLSSLPLN-YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
F+L+ + N + + + E E W Q+G+ Y EK +R KIF+DN A + N
Sbjct: 15 LFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFN 74
Query: 65 NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWR 124
+ S+ LS+N FADLT++EF S F A H A+ N+ VP++IDWR
Sbjct: 75 KAMDKSYKLSINEFADLTNEEFGTSRNRFKA----HICSTEATSFKYENVTAVPSTIDWR 130
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGG 183
KKGAVT +KDQ CG+CWAFSA A+EGI ++ TG L+SLSEQEL+DCD S + GC G
Sbjct: 131 KKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGA 190
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 243
+YPY G G CN++K I+GY+DVP NNEK L +AV
Sbjct: 191 -------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAV 231
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRS 302
V QP++V I FQ YSSG+FTG C T LDH V VGY S++G+ YW++KNSWG
Sbjct: 232 VHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWGTG 291
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 292 WGEEGYIRMQRDVTAKEGLCGIAMQASYPT 321
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 285 bits (728), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 147/346 (42%), Positives = 206/346 (59%), Gaps = 18/346 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDI-------NELF-----ETWCKQHGKAYSSEQEKQQRLK 51
+A + I L+ SL ++C +EL + W +HG+ Y+ EK R
Sbjct: 1 MALEHIKIFLIVSLVSSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYV 60
Query: 52 IFEDNYAFVTQHNNM-GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS 110
+F+ N + + NN+ +F L++N FADLT+ EF+ + G+ + + + S
Sbjct: 61 VFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSF 120
Query: 111 PGN---LRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
+P ++DWRKKGAVT +K+Q SCG CWAFSA AIEG +I G L+SLSEQ
Sbjct: 121 RYQNVFFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQ 180
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
+L+DCD + + GC GGLMD A++ ++ G+ TE +YPY+G+ C + +I G
Sbjct: 181 QLVDCDTN-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKIKSTKPSAASITG 239
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DS 286
Y+DVP N+E L++AV QPVSVGI G FQ YSSG+FTG C+T LDHAV VGY S
Sbjct: 240 YEDVPVNDENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQS 299
Query: 287 ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G YWIIKNSWG WG GYM ++++ + G+CG+ M ASYPT
Sbjct: 300 SAGSKYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYPT 345
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 284 bits (727), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 149/325 (45%), Positives = 197/325 (60%), Gaps = 8/325 (2%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L + LFE+W +H + Y++ +EK R +IF+DN ++ + N N
Sbjct: 28 FSIVGYSQDDLTSTERLIRLFESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDE-TNKKN 86
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
+S+ L LN F DLTH EFK ++G + N ++ D P SIDWR KGA
Sbjct: 87 NSYWLGLNEFVDLTHDEFKEKYVGSIGEDFVTIEQSNDEEFPYKHVVDYPESIDWRDKGA 146
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT VK CG+CWAFS +EGINKIVTG L+SLSEQEL+DCDR + GC GG +
Sbjct: 147 VTPVKPNP-CGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTS 204
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
Q+V+ N G+ TEK+YPY + G+C ++ V I GYK VP N+E L+QA+ QPV
Sbjct: 205 LQYVVDN-GVHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQAIANQPV 263
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SV + RAFQLY GIF GPC T LDHAV +GY G Y +IKNSWG +WG GY
Sbjct: 264 SVLLESKGRAFQLYKGGIFNGPCGTKLDHAVTAIGY----GKTYILIKNSWGPNWGEKGY 319
Query: 309 MHMQRNTGNSLGICGINMLASYPTK 333
+ ++R +G S G CG+ + +PTK
Sbjct: 320 LKIKRASGKSEGTCGVYKSSYFPTK 344
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 284 bits (726), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 198/314 (63%), Gaps = 5/314 (1%)
Query: 18 PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLN 76
PL+ + + + W +HG+ Y+ EK R +F+ N + + N + +F L++N
Sbjct: 21 PLDEVT-MQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVN 79
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQ 135
FADLT++EF++ + G+ S+ R + S + D +P S+DWRKKGAVT +KDQ
Sbjct: 80 QFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQ 139
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
SCG+CWAFSA AIEG+ +I G L+SLSEQEL+DCD + + GC GG M+ A+ + +
Sbjct: 140 GSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-DDGCMGGYMNSAFNYTMTT 198
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
G+ +E +YPY+ G CN K + +I G++DVP N+EK L++AV PVS+GI G
Sbjct: 199 GGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGG 258
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRN 314
FQ YSSG+F+G CST LDH V +VGY S NG YWI+KNSWG WG GYM ++++
Sbjct: 259 GTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKD 318
Query: 315 TGNSLGICGINMLA 328
T G CG+ M A
Sbjct: 319 TKAKHGQCGLAMNA 332
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 284 bits (726), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 151/320 (47%), Positives = 202/320 (63%), Gaps = 16/320 (5%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W +QH A EK +R +F +N + + N G++ + L LN F D+T EF+
Sbjct: 46 LYERWREQHTVA-RDLGEKARRFNVFRENVRLIHEFNR-GDAPYKLRLNRFGDMTADEFR 103
Query: 88 ASFLGFSAASIDHDRRRNASVQSPG-------NLRDVPASIDWRKKGAVTEVKDQASCGA 140
++ +++ + H R + G ++RDVP S+DWR+KGAVT VKDQ CG+
Sbjct: 104 RAY---ASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQCGS 160
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS A+EGIN I + +L SLSEQ+L+DCD N+GC GGLMDYA+Q++ K+ G+
Sbjct: 161 CWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQYIAKHGGVAA 220
Query: 201 EKDYPYRG-QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
E YPY+ QA CNK+ +VTIDGY+DVP N+E L +AV AQPV+V I S F
Sbjct: 221 EDAYPYKARQASSCNKKP--SAVVTIDGYEDVPANDETALKKAVAAQPVAVAIEASGSHF 278
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
Q YS G+F G C T LDH V VGY + +G YWI+KNSWG WG GY+ M+R+ +
Sbjct: 279 QFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMKRDVKDK 338
Query: 319 LGICGINMLASYPTKTGQNP 338
G+CGI M ASYP KT NP
Sbjct: 339 EGLCGIAMEASYPVKTSANP 358
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 283 bits (725), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 152/345 (44%), Positives = 202/345 (58%), Gaps = 22/345 (6%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN--- 64
L S + + L + EL+ W H EK +R F+ N F+ HN
Sbjct: 21 LCSAIPFDAKDLESEEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRL 80
Query: 65 -----NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG----NLR 115
N S+ L LN F D+ EF+++F G + R S+ PG ++
Sbjct: 81 NDTSTNNNGPSYRLRLNRFGDMDQAEFRSTFAG----PLHRHTRPAQSI--PGFIYDTVK 134
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR- 174
D+P ++DWR+KGAVT VKDQ CG+CWAFSA ++EG+N I TGSLVSLSEQELIDCD
Sbjct: 135 DIPQAVDWRQKGAVTGVKDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTG 194
Query: 175 SYNSGCGGGLMDYAYQFVIKNH-GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
++GC GGLM+ A++F+ + G+ TE YPY G CN + + V IDG++ VP
Sbjct: 195 GDDNGCQGGLMESAFEFIAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPA 254
Query: 234 NNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD--SENGVD 291
NE+ L +AV QPVSV I +AFQ YS G+FTG C + LDH V +VGY E+G +
Sbjct: 255 GNEEALAKAVAHQPVSVAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKE 314
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQ 336
YWI+KNSWG WG +GY+ MQR++G G+CGI M ASYP K Q
Sbjct: 315 YWIVKNSWGPGWGEHGYVRMQRDSGVDGGLCGIAMEASYPVKNEQ 359
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 283 bits (725), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 143/323 (44%), Positives = 199/323 (61%), Gaps = 10/323 (3%)
Query: 14 LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTL 73
L++ LN + E+W Q+G++Y EK ++ ++F+ N AF+ N N F L
Sbjct: 22 LAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAFIDSFNAK-NHKFWL 80
Query: 74 SLNAFADLTHQEFKASFL--GFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTE 131
+N FAD+T++EFK + GF + + R ++ +PA+IDWR KGAVT
Sbjct: 81 GINQFADITNEEFKVTKTNKGFISNKV---RASTGFSYENVSIDALPATIDWRTKGAVTP 137
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQ 190
VKDQ CG CWAFSA A EGI K+ TG LVSLSEQEL+DCD + GC GGLMD A++
Sbjct: 138 VKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFK 197
Query: 191 FVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSV 250
F+I N G+ E YPY + G+C + ++ TI Y+DVP NNE L++AV QPVSV
Sbjct: 198 FIITNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKSYEDVPANNEGALMKAVANQPVSV 255
Query: 251 GICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYM 309
+ G + FQ YS G+ TG C T LDH + +GY + +G YW++KNSWG SWG NG++
Sbjct: 256 AVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFL 315
Query: 310 HMQRNTGNSLGICGINMLASYPT 332
M+++ + G+CG+ M SYPT
Sbjct: 316 RMEKDIADKKGMCGLAMEPSYPT 338
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 154/344 (44%), Positives = 202/344 (58%), Gaps = 18/344 (5%)
Query: 1 MNSLAFFLLSILLLSSLP-------LNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIF 53
M S+ FFLL+I+L S L S I E E W + + YS + EK R +IF
Sbjct: 1 MTSIIFFLLAIILSSRTSGATSRGGLFEASAI-EKHEQWMSRFHRVYSDDSEKTSRFEIF 59
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAAS-----IDHDRRRNASV 108
+ N FV N N ++TL +N F+DLT +EFKA + G D S
Sbjct: 60 KKNLKFVESFNMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRMSTTDSHETVSF 119
Query: 109 QSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQE 168
+ N+ + S+DWR++GAVT VK Q CG CWAFSA A+EG+ KI G LVSLSEQ+
Sbjct: 120 RYE-NVGETGESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQ 178
Query: 169 LIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
L+DC + N GC GG+M A+ ++++N GI E +YPY+G C + TI GY
Sbjct: 179 LLDCS-TENDGCDGGIMWKAFDYIVENQGITAEDNYPYQGAQQTCESNHVA--AATISGY 235
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SE 287
+ VP+N+E+ LL+AV QPVSV I GS F YS GIF G C T L+HAV IVGY SE
Sbjct: 236 ETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSE 295
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
G+ YW++KNSWG SWG +GYM + R+ G+CG+ LA YP
Sbjct: 296 EGIKYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYP 339
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/332 (44%), Positives = 196/332 (59%), Gaps = 27/332 (8%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L + ++ + + + E E W Q+G+ Y EK +R KIF+DN A +
Sbjct: 13 ALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIES 72
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
N + S+ LS+N FADLT++EF+AS F A H A+ N+ VP+++D
Sbjct: 73 FNKAMDKSYKLSINEFADLTNEEFRASRNRFKA----HICSTEATSFKYENVTAVPSTVD 128
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCG 181
WRKKGAVT +KDQ CG+CWAFSA A+EGI ++ TG L+SLSEQEL+DCD S + GC
Sbjct: 129 WRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGC- 187
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
+YPY G G CN++K I+GY+DVP NNEK L +
Sbjct: 188 --------------------TNYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQK 227
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWG 300
AV QP++V I S FQ YSSG+FTG C T LDH V VGY S++G+ YW++KNSW
Sbjct: 228 AVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWS 287
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 288 TGWGEEGYIRMQRDVTAKEGLCGIAMQASYPT 319
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 140/309 (45%), Positives = 192/309 (62%), Gaps = 9/309 (2%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W Q G+ Y EK RL++F+ N AF+ + N N F L N FADLT+ EF+AS
Sbjct: 42 EQWMAQFGRVYKDPAEKAHRLEVFKANVAFI-ESFNAENHEFWLGANQFADLTNDEFRAS 100
Query: 90 FLGFSAASIDHDRRRNASVQ---SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
+ I R+A S ++ +PAS+DWR KGAVT +K+Q CG+CWAFSA
Sbjct: 101 K---TNKGIKQGGVRDAPTGFKYSDVSIDALPASVDWRTKGAVTPIKNQGQCGSCWAFSA 157
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A EG+ K+ TG LVSLSEQEL+DCD + GC GG MD A++F+IKN G+ TE +YP
Sbjct: 158 VAATEGVVKLSTGKLVSLSEQELVDCDVHGVDQGCMGGWMDDAFKFIIKNGGLTTEANYP 217
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
Y G+ +C + TI GY+DVP N+E L++AV QPVSV + G + FQLY+ G
Sbjct: 218 YTGEDDKCKSNETVNVAATIKGYEDVPANDESALMKAVAHQPVSVVVDGGDMTFQLYAGG 277
Query: 266 IFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+ TG C +DH + +GY + NG YW++KNSWG +WG G++ M ++ + G+CG+
Sbjct: 278 VMTGSCGVEMDHGIAAIGYGATSNGTKYWLMKNSWGTTWGEKGFLRMAKDIPDKRGMCGL 337
Query: 325 NMLASYPTK 333
M SYPT+
Sbjct: 338 AMKPSYPTE 346
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 152/339 (44%), Positives = 212/339 (62%), Gaps = 14/339 (4%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L+ L++ ++SSL +++ +D +E + W +HGK Y S++E+ R I+E N V
Sbjct: 1 MKYLSVLLVAACVVSSLSMSF-TDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
+HN ++G+ ++ L +N FADL ++EF A GF + + + S N+ ++
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFRVNGTSKAAKGSTFLPS-NNIGEL 118
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P ++DWR KG VT VKDQ CG+CWAFS TG++EG + TG LVSLSEQ L+DC +
Sbjct: 119 PKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKEG 178
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMD A+Q++IK GIDTE+ YPY+ G+C+ +K N T+ GY DV ++E
Sbjct: 179 NEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVDGECHFKKANIG-ATVTGYTDVTSDSE 237
Query: 237 KQLLQAVV-AQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGY-DSENGVDY 292
L +AV P+SV I S +FQLY SG++ P ST LDH VL VGY + +G DY
Sbjct: 238 TALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDY 297
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WI+KNSW +WGMNGY+ M RN N CGI ASYP
Sbjct: 298 WIVKNSWAETWGMNGYLWMSRNKDNQ---CGIATQASYP 333
>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 127/216 (58%), Positives = 159/216 (73%)
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P S+DWR KG + VKDQ SCG+CWAFSA A+E IN IVTG L+SLSEQEL+DCD+SYN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCDKSYN 61
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMDYA++FVI N GIDTE+DYPY+ + C++ + N +V ID Y+DVP NNEK
Sbjct: 62 QGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L +AV QPVS+ + R FQ Y SGIFTG C T++DH V+ GY +ENG+DYWI++N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
SWG WG GY+ +QRN +S G+CG+ SYP K
Sbjct: 182 SWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPVK 217
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 149/356 (41%), Positives = 199/356 (55%), Gaps = 54/356 (15%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E FE W +HG+ Y+ EKQ+RL+++ N A V N+M N + L+ N FADLT++EF
Sbjct: 30 ERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNEEF 89
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLR------------DVPASIDWRKKGAVTEVKD 134
+A LGF R +PG + ++P S+DWR+KGAV VK+
Sbjct: 90 RAKMLGFGRPPPHG--RATGHTTTPGTVACIGSGLGRRYSDELPKSVDWREKGAVAPVKN 147
Query: 135 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIK 194
Q CG+CWAFSA AIEGIN+I G LVSLSEQEL+DCD + GC GG M +A++FV+
Sbjct: 148 QGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWAFEFVMN 206
Query: 195 NHGIDTEKDYPYRG----------------------------QAGQCNKQKLNRHIVTID 226
N G+ TE++YPY+G G C KL V+I
Sbjct: 207 NSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAVSIS 266
Query: 227 GYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-- 284
GY +V ++E LL+A AQPVSV + +QLY G+FTGPC+ L+H V +VGY
Sbjct: 267 GYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVGYGE 326
Query: 285 ---DSEN------GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
D++ G YWI+KNSWG WG GY+ MQR + G+CGI +L SYP
Sbjct: 327 TQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYP 382
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 148/330 (44%), Positives = 195/330 (59%), Gaps = 28/330 (8%)
Query: 6 FFLLSILLLSSLPLN-YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
F+L+ + N + + + E E W Q+G+ Y EK +R KIF+DN A + N
Sbjct: 15 LFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFN 74
Query: 65 NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWR 124
+ S+ LS+N FADLT++EF+AS F A H A+ N+ VP+++DWR
Sbjct: 75 KAMDKSYKLSINEFADLTNEEFRASRNRFKA----HICSTEATSFKYENVTAVPSTVDWR 130
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGG 183
KKGAVT +KDQ CG+CWAFSA A+EGI ++ TG L+SLSEQEL+DCD S + GC
Sbjct: 131 KKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGC--- 187
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 243
+YPY G G CN++K I+GY+DVP NNEK L +AV
Sbjct: 188 ------------------TNYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAV 229
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRS 302
QP++V I FQ YSSG+FTG C T LDH V VGY S++G+ YW++KNSWG
Sbjct: 230 AHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKNSWGTG 289
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 290 WGEEGYIRMQRDVTAKEGLCGIAMQASYPT 319
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 187/312 (59%), Gaps = 13/312 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W K++GK Y EKQ+RL IF+DN F+ N GN + LS+N D T++
Sbjct: 36 MSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNE 95
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSP---GNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
EF AS G+ + + + Q+P N+ VP ++DWR+ GAV +KDQ CG C
Sbjct: 96 EFVASHNGY--------KHKGSHSQTPFKYENITGVPNAVDWRENGAVXAMKDQGQCGNC 147
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS EGI +I T L+SLSEQEL+DCD S + GC GG M+ ++F+ KN GI +E
Sbjct: 148 WAFSTVATTEGIYQITTSMLMSLSEQELVDCD-SVDHGCDGGYMEGGFEFIXKNGGISSE 206
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY G + K I GY+ VP N+E L +AV QPVSV I AFQ
Sbjct: 207 ANYPYTAVDGTYDANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDVGGSAFQF 266
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
SSG+FTG C T LDH V VGY S ++G YWI+KNSWG WG GY+ MQR T G
Sbjct: 267 NSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEG 326
Query: 321 ICGINMLASYPT 332
+CGI M ASYPT
Sbjct: 327 LCGIAMDASYPT 338
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 142/328 (43%), Positives = 198/328 (60%), Gaps = 5/328 (1%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
L S +LS+ L + + E E W + + Y EK QR ++F+ N AF+ + N
Sbjct: 17 LCSSAVLSARELGDTAMV-ERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFI-ESFNAE 74
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
N F L +N F DLT+ EF+A+ + R S ++ +P ++DWR KG
Sbjct: 75 NRKFWLGVNQFTDLTNDEFRATKTN-KGLKMSGGRAPTGFKYSNVSIDALPTAVDWRTKG 133
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMD 186
VT +KDQ CG CWAFSA A EGI K+ TG L+SLSEQEL+DCD + GC GG MD
Sbjct: 134 VVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVDQGCEGGEMD 193
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
A++F+IKN G+ TE +YPY Q GQC + + TI GY+DVP N+E L++AV Q
Sbjct: 194 DAFKFIIKNGGLTTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPANDESSLMKAVANQ 253
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGM 305
PVSV + G + FQ YS G+ TG C T LDH + +GY + +G YW++KNSWG +WG
Sbjct: 254 PVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLKNSWGTTWGE 313
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPTK 333
+GY+ M+++ + G+CG+ M SYPT+
Sbjct: 314 SGYLRMEKDISDKSGMCGLAMQPSYPTE 341
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 158/356 (44%), Positives = 207/356 (58%), Gaps = 26/356 (7%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINE-----------LFETWCKQHGKAYSSEQEKQQR 49
M L F LS+ L+ ++ + D NE L+E W + H + EK R
Sbjct: 3 MKKLLFISLSLALIFTVANTF--DFNEHDLESEKSLWNLYERW-RSHHTVTRNLDEKHNR 59
Query: 50 LKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ 109
+F+ N V N + + + L LN F D+T+ EF+ + + + I H R
Sbjct: 60 FNVFKANVMHVHNTNKL-DKPYKLKLNKFGDMTNYEFRRIY---ADSKISHHRMFRGMSH 115
Query: 110 SPG-----NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSL 164
G N DVP+SIDWR KGAVT VKDQ CG+CWAFS A+EGIN+I T LVSL
Sbjct: 116 ENGTFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSL 175
Query: 165 SEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVT 224
SEQ+L+DCD N GC GGLM+YA++F IK +GI TE +YPY + G C+ +K ++ V+
Sbjct: 176 SEQQLVDCDTEENEGCNGGLMEYAFEF-IKQNGITTESNYPYAAKDGTCDVEKEDK-AVS 233
Query: 225 IDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY 284
IDG+++VP NNE LL+A QPVSV I FQ YS G+FTG C T L+H V IVGY
Sbjct: 234 IDGHENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGY 293
Query: 285 D-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPP 339
+++ YWI+KNSWG WG GY+ MQR + G+CGI M ASYP K P
Sbjct: 294 GVTQDRTKYWIMKNSWGSEWGEQGYIRMQRGISSREGLCGIAMEASYPIKKSSTKP 349
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 141/315 (44%), Positives = 198/315 (62%), Gaps = 14/315 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ E E W ++ + Y EK +R ++F+DN+AFV N + F L +N FADLT +
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 85 EFKAS--FLGFSAASIDHD--RRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
EFKA+ F SA + + N SV + +P ++DWR KGAVT +K+Q CG
Sbjct: 61 EFKANKGFKPISAEEVPTTGFKYENLSVSA------LPTAVDWRTKGAVTPIKNQGQCGC 114
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGID 199
CWAFSA A+EGI K+ TG+LVSLSEQE +DCD + + GC GG MD A++FVIKN G+
Sbjct: 115 CWAFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLA 174
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TE YPY+ G+C + ++ TI G++DVP NNE L++ V +QPVSV + S+R F
Sbjct: 175 TESSYPYKVVDGKC--KGGSKSAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTF 232
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
LYS G+ TG C T LDH + +GY E + YWI+KNSWG +WG G++ M+++ +
Sbjct: 233 MLYSGGVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDK 292
Query: 319 LGICGINMLASYPTK 333
G+C + M SYPT+
Sbjct: 293 RGMCDLAMKPSYPTE 307
>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 282 bits (721), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 126/216 (58%), Positives = 161/216 (74%)
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P S+DWR KG + VKDQ SCG+CWAFSA A+E IN IVTG+L+SLSEQEL+DCD+SYN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMDYA++FVI N GID+E+DYPY+ + C++ + N +V ID Y+DVP NNEK
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L +AV QPVS+ + R FQ Y SGIFTG C T++DH V+ GY +ENG+DYWI++N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
SWG +WG GY+ +QRN +S G+CG+ SYP K
Sbjct: 182 SWGANWGEKGYLRVQRNIASSSGLCGLATEPSYPVK 217
>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
Length = 217
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 126/216 (58%), Positives = 160/216 (74%)
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P S+DWR KG + VKDQ SCG+CWAFSA A+E IN IVTG+L+SLSEQEL+DCD+SYN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMDYA++FVI N GID+E+DYPY+ + C++ + N +V ID Y+DVP NNEK
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L +AV QPVS+ + R FQ Y SGIFTG C T++DH V+ GY +ENG+DYWI++N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
SWG WG GY+ +QRN +S G+CG+ SYP K
Sbjct: 182 SWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPVK 217
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 150/323 (46%), Positives = 197/323 (60%), Gaps = 19/323 (5%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+ FE W +HG+AY+ EKQ+R +++ N V N+M N + L+ N FADLT++EF
Sbjct: 29 DRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNG-YKLADNKFADLTNEEF 87
Query: 87 KASFLGFSA-ASIDHDRRR-NASVQSPGNLRD--VPASIDWRKKGAV-TEVKDQASCGAC 141
+A LGF +I +A + PG D +P S+DWR KGAV K G+C
Sbjct: 88 RAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAVINRWKICVDAGSC 147
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA AIEGIN+I G LVSLSEQEL+DCD GCGGG M +A++FV+ NHG+ TE
Sbjct: 148 WAFSAVAAIEGINQIKNGELVSLSEQELVDCDDE-AVGCGGGYMSWAFEFVVGNHGLTTE 206
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY G C KLN+ V I GY++V ++E L +A AQPVSV + G FQL
Sbjct: 207 ASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQL 266
Query: 262 YSSGIFTGPCSTSLDHAVLIVGY-DSENGVD----------YWIIKNSWGRSWGMNGYMH 310
Y SG++TGPC+ ++H V +VGY +SE D YWI+KNSWG WG GY+
Sbjct: 267 YGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYIL 326
Query: 311 MQRNT-GNSLGICGINMLASYPT 332
MQR+ G + G+CGI +L SYP
Sbjct: 327 MQRDVAGLASGLCGIALLPSYPV 349
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 143/333 (42%), Positives = 198/333 (59%), Gaps = 5/333 (1%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
L FL+ + S + S+ +E E W Q+G+ Y EK++R ++F++N F+
Sbjct: 10 LILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIE 69
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
N G+ F LS+N FADL +EFKA + + + S + ++ +PA+I
Sbjct: 70 SFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE-SVTKIPATI 128
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWRK+GAVT +KDQ CG+CWAFSA A EGI++I TG LV LSEQEL+DC + + GC
Sbjct: 129 DWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEGCI 188
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GG +D A++F+ K GI +E YPY+G C +K + I GY+ VP NNEK LL+
Sbjct: 189 GGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKALLK 248
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGY-DSENGVDYWIIKNSW 299
AV QPVSV I AF+ YSSGIF C T +HAV +VGY + +G YW++KNSW
Sbjct: 249 AVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDGSKYWLVKNSW 308
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G WG GY+ ++R+ G+CGI YPT
Sbjct: 309 GTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPT 341
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 281 bits (720), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 138/286 (48%), Positives = 190/286 (66%), Gaps = 13/286 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADLTHQE 85
F+ + K Y S +E+ +R IF DN AF+ +HN G + T+ +N FADLT++E
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEE 79
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
++ +L + R+ + P S+DWR+KGAVT +K+Q CG+CW+FS
Sbjct: 80 YRQLYLRPYPTELLGRERQEVWLDGPN-----AGSVDWRQKGAVTPIKNQGQCGSCWSFS 134
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
TG++EG + I TG+LVSLSEQ+L+DC S+ N GC GGLMD A++++I N G+DTE+DY
Sbjct: 135 TTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDY 194
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PY + G C+K K ++H V+I GYKDVP+NNE QL AV PVSV I +++FQ+YSS
Sbjct: 195 PYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSS 254
Query: 265 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMH 310
G+F+GPC T+LDH VL+VGY S DYWI+KNSWG SW G H
Sbjct: 255 GVFSGPCGTNLDHGVLVVGYTS----DYWIVKNSWGASWVTRGGCH 296
>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 281 bits (720), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 126/216 (58%), Positives = 159/216 (73%)
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P S+DWR KG + VKDQ SCG+CWAFSA A+E IN IVTG+L+SLSEQEL+DCD+SYN
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMDYA++FVI N GID+E+DYPY+ + C++ + N +V ID Y+DVP NNEK
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L +AV QPVS+ + R FQ Y SGIFTG C T++DH V+ GY +ENG+DYWI++N
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
SWG WG GY+ +QRN S G+CG+ SYP K
Sbjct: 182 SWGAKWGEKGYLRVQRNIARSSGLCGLATEPSYPVK 217
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 199/349 (57%), Gaps = 28/349 (8%)
Query: 3 SLAFFLLSILLLSSLPL----------NYCSDINELFETWCKQHGKAYSSEQEKQQRLKI 52
S + FLL++L++ S L + + E W +HG+AY E EK +RL++
Sbjct: 2 SASRFLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEV 61
Query: 53 FEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG 112
F N + N G S L+ N FADLT +EF+A+ G R R A G
Sbjct: 62 FRANAELIDSFNAAGTHSHRLATNRFADLTVEEFRAARTGL--------RPRPAPSAGAG 113
Query: 113 NLR-------DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLS 165
R D S+DWR GAVT VKDQ +CG CWAFSA A+EG+NKI TG LVSLS
Sbjct: 114 RFRYENFSLADAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLS 173
Query: 166 EQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVT 224
EQEL+DCD S + GC GGLMD A+QFV + G+ +E YPY+G+ G C +
Sbjct: 174 EQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGRDGPCRSSAAAARAAS 233
Query: 225 IDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY 284
I G++DVP NNE L AV QPVSV I G + AF+ Y SG+ G C T L+HA+ VGY
Sbjct: 234 IRGHEDVPRNNEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGY 293
Query: 285 DSEN-GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+ N G YW++KNSWG SWG GY+ ++R G+CG+ L SYP
Sbjct: 294 GTANDGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKLPSYPV 341
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 281 bits (718), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 147/344 (42%), Positives = 205/344 (59%), Gaps = 15/344 (4%)
Query: 1 MNSLA--FFLLSILLLSSLPLNYCSD------INELFETWCKQHGKAYSSEQEKQQRLKI 52
MNS + +L+ L+LS + S +E E W Q+G+ Y EK++R ++
Sbjct: 1 MNSFSQNHYLILFLVLSVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQV 60
Query: 53 FEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGF--SAASIDHDRRRNASVQS 110
F++N F+ N G+ F LS+N FADL +EFKA + A+ ++ + + +S
Sbjct: 61 FKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTQTSFRYES 120
Query: 111 PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
+ +PA+IDWRK+GAVT +KDQ CG+CWAFSA A EGI++I TG LV LSEQEL+
Sbjct: 121 ---VTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELV 177
Query: 171 DCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
DC + + GC GG +D A++F+ K GI +E YPY+G C +K + I GY+
Sbjct: 178 DCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEK 237
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF-TGPCSTSLDHAVLIVGY-DSEN 288
VP NNEK LL+AV QPVSV I AF+ YSSGIF C T +HAV +VGY + +
Sbjct: 238 VPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKALD 297
Query: 289 GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G YW++KNSWG WG GY+ ++R+ G+CGI YPT
Sbjct: 298 GSKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPT 341
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 281 bits (718), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 142/337 (42%), Positives = 203/337 (60%), Gaps = 10/337 (2%)
Query: 2 NSLAFFLLSILLLSSLPLNYCSDI--NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
N L FL+ + S + S+ + E W Q+GK Y EK++R +IF++N F
Sbjct: 9 NILVVFLVLTVWTSQVMSRRLSEAYSSVKHEKWMAQYGKVYKDAAEKEKRFQIFKNNVHF 68
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP---GNLRD 116
+ + G+ F LS+N FADL +FKA L + +H+ R + ++ ++
Sbjct: 69 IESFHAAGDKPFNLSINQFADL--HKFKA--LLINGQKKEHNVRTATATEASFKYDSVTR 124
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P+S+DWRK+GAVT +KDQ +C +CWAFS IEG+++I G LVSLSEQEL+DC +
Sbjct: 125 IPSSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQELVDCVKGD 184
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GG ++ A++F+ K G+ +E YPY+G C +K +V I GY+ VP N+E
Sbjct: 185 SEGCYGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKETHGVVQIKGYEQVPSNSE 244
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWII 295
K LL+AV QPVS + AFQ YSSGIFTG C T +DH+V +VGY + G YW++
Sbjct: 245 KALLKAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKARGGNKYWLV 304
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG WG GY+ M+R+ G+CGI A YPT
Sbjct: 305 KNSWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPT 341
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 192/307 (62%), Gaps = 3/307 (0%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+LF++W +H K Y S EK R +IF DN ++ + N N+S+ L LN FADL++ EF
Sbjct: 46 QLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEF 104
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K ++GF A + + ++ + P SIDWR KGAVT VK+Q +CG+CWAFS
Sbjct: 105 KKKYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFST 164
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
+EGINKIVTG+L+ LSEQEL+DCD+ ++ GC GG + Q+V N+G+ T K YPY
Sbjct: 165 IATVEGINKIVTGNLLELSEQELVDCDK-HSYGCKGGYQTTSLQYV-ANNGVHTSKVYPY 222
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ + +C V I GYK VP N E L A+ QP+SV + + FQLY SG+
Sbjct: 223 QAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGV 282
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
F GPC T LDHAV VGY + +G +Y IIKNSWG +WG GYM ++R +GNS G CG+
Sbjct: 283 FDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYK 342
Query: 327 LASYPTK 333
+ YP K
Sbjct: 343 SSYYPFK 349
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 280 bits (716), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 191/307 (62%), Gaps = 3/307 (0%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+LF++W +H K Y S EK R +IF DN ++ + N N+S+ L LN FADL++ EF
Sbjct: 46 QLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEF 104
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K ++G A + + ++ + P SIDWR KGAVT VK+Q SCG+CWAFS
Sbjct: 105 KKKYVGSVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFST 164
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
+EG+NKIVTG+L+ LSEQEL+DCD++ + GC GG + Q+V N G+ T K YPY
Sbjct: 165 IATVEGVNKIVTGNLLELSEQELVDCDKN-SHGCKGGYQTTSLQYVADN-GVHTSKVYPY 222
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ +A QC V I GYK VP N E L A+ QP+SV + + FQLY SG+
Sbjct: 223 QAKAMQCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGV 282
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
F GPC T LDHAV VGY + +G +Y IIKNSWG +WG GYM ++R +GNS G CG+
Sbjct: 283 FDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYK 342
Query: 327 LASYPTK 333
+ YP K
Sbjct: 343 SSYYPFK 349
>gi|125592011|gb|EAZ32361.1| hypothetical protein OsJ_16571 [Oryza sativa Japonica Group]
Length = 416
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 159/371 (42%), Positives = 206/371 (55%), Gaps = 40/371 (10%)
Query: 45 EKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQEFKASFLGFSAASIDHDR 102
E ++R ++F DN FV HN + F L +N FADLT+ EF+A++LG + A R
Sbjct: 48 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAG--RGR 105
Query: 103 RRNASVQSPGNLRDVPASIDWRKKGAVTE-VKDQASCGACWAFSATGAIEGINKIVTGSL 161
R + + G + +P S+DWR KGAV VK+Q CGA G
Sbjct: 106 RVGEAYRHDG-VEALPDSVDWRDKGAVVAPVKNQGQCGA-----------------GGVR 147
Query: 162 VSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRH 221
+EQ L +MD A+ F+ +N G+DTE+DYPY G+CN K +R
Sbjct: 148 EERAEQRLQRW-----------IMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRK 196
Query: 222 IVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLI 281
+V+IDG++DVPEN+E L +AV QPVSV I R FQLY SG+FTG C T+LDH V+
Sbjct: 197 VVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVA 256
Query: 282 VGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPP 339
VGY D+ G YW ++NSWG WG NGY+ M+RN G CGI M+ASYP K G NP
Sbjct: 257 VGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKKGPNPK 316
Query: 340 PSPPPGPT----RCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSN 395
PSPP +C + C AG TCCC I C+ W CC A CC DH CCP
Sbjct: 317 PSPPSPAPSPPQQCDRYSKCPAGTTCCCNYGIRNHCIVWGCCPVEGATCCKDHSTCCPKE 376
Query: 396 YPICDSVRHQC 406
YP+C++ C
Sbjct: 377 YPVCNAKARTC 387
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 139/275 (50%), Positives = 183/275 (66%), Gaps = 13/275 (4%)
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
V + +N GNSSFT+ + FADLT EF A F ++ R RN + L++V
Sbjct: 57 VIEAHNAGNSSFTMGITQFADLTAAEFSAYVKRFP---MNVTRPRNEVWITEAPLQEV-- 111
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
DWR+K AVTE+K+Q CG+CW+FS TG++EG + I TG LVSLSEQ+L+DC Y N
Sbjct: 112 --DWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNH 169
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMDYA+++VI N G+DTE+DYPY + G+CN +K +H I G+++VP+ +E Q
Sbjct: 170 GCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNVPKEHEDQ 229
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNS 298
L AV PVSV I + FQ Y+SG+F G C TSLDH VL+VGY DYWI+KNS
Sbjct: 230 LAAAVSIGPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD----DYWIVKNS 285
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
WG+SWG GY+ ++R + G+CGI M ASYP K
Sbjct: 286 WGKSWGEEGYIRLKRGV-DKKGMCGITMQASYPEK 319
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 134/249 (53%), Positives = 171/249 (68%), Gaps = 4/249 (1%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
ELFE+W +HGK Y S +EK R +IF+DN + + N + S++ L LN FADL+H EF
Sbjct: 6 ELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKV-VSNYWLGLNEFADLSHHEF 64
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K +LG +D RR +S + D+P S+DWRKKGAVT +K+Q SCG+CWAFS
Sbjct: 65 KKQYLGLK---VDFSTRRESSEEFTYRDVDLPKSVDWRKKGAVTNIKNQGSCGSCWAFST 121
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EGIN+IVTG+L SLSEQELIDCDR+YNSGC GGLMDYA+ F+++N G+ E DYPY
Sbjct: 122 VAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKEDDYPY 181
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ G C K +VTI GY DVP+NNE+ LL+A+ QP+SV I S R FQ YS G+
Sbjct: 182 IMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGV 241
Query: 267 FTGPCSTSL 275
F G C T L
Sbjct: 242 FDGHCGTQL 250
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 152/339 (44%), Positives = 212/339 (62%), Gaps = 16/339 (4%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L+ L+++ ++SSL +++ +D +E ++ W +HGK Y S++E+ R I++ N V
Sbjct: 1 MKYLSVLLVAVCVVSSLSMSF-TDFDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
+HN ++G+ ++ L +N FADL ++EF A GF + ++ P N+ +
Sbjct: 60 IRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFRVNGTSK-AAKGSTFLPPNNVGKL 118
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC-DRSY 176
P ++DWR KG VT VKDQ CG+CWAFSATG++EG + TG LVSLSEQ L+DC D++Y
Sbjct: 119 PKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSDKNY 178
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
GC GGLMD A+Q++I GIDTE+ YPY G C+ + N T+ GY DV +E
Sbjct: 179 --GCNGGLMDRAFQYIIDAGGIDTEESYPYIAMDGNCHFKTANVG-ATVTGYTDVTSGSE 235
Query: 237 KQLLQAVV-AQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE-NGVDY 292
K L +AV P+SV I S +FQLY SG++ P ST LDH VL VGY + +G DY
Sbjct: 236 KALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDY 295
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WI+KNSW +WGMNGY+ M RN N CGI ASYP
Sbjct: 296 WIVKNSWAETWGMNGYIWMSRNKDNQ---CGIATQASYP 331
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 194/312 (62%), Gaps = 28/312 (8%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +G+ Y EK++R KIF++N ++ N
Sbjct: 32 MSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIESVN-------------------- 71
Query: 85 EFKASFLGFSAASIDHDRRRNASVQS--PGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
+FKAS G++ +S R R++ + S N+ VP+S+DWRKKGAVT +KDQ CG CW
Sbjct: 72 KFKASRNGYNMSS----RPRSSEITSFRYENVAAVPSSMDWRKKGAVTPIKDQGQCGCCW 127
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA A+EG+ ++ TG L+SLSEQEL+DCD S + GCGGGLMD A++F+I N G+ TE
Sbjct: 128 AFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTE 187
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+G CNK+K I Y+DVP N+E LL+AV PVSV I FQ
Sbjct: 188 ANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQF 247
Query: 262 YSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YSSG+FTG C T LDH V VGY +++G YW++KNSWG WG +GY+ M+R+ G G
Sbjct: 248 YSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERDIGADEG 307
Query: 321 ICGINMLASYPT 332
+CGI M ASYPT
Sbjct: 308 LCGIAMEASYPT 319
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 150/333 (45%), Positives = 202/333 (60%), Gaps = 24/333 (7%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
+A F+ S +S PL +F W ++H K+Y++E E R ++ +NY ++ H
Sbjct: 11 VALFVASTFAVSHDPLT------GVFADWMQEHQKSYANE-EFVYRWNVWRENYLYIEAH 63
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDW 123
N+ N SF L++N F DLT+ EF F G S + D ++ + +PG +PA DW
Sbjct: 64 NHQ-NKSFHLAMNKFGDLTNAEFNKLFKGLSITA-DQAKQESDIAPAPG----LPADFDW 117
Query: 124 RKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGG 182
R+KGAVT VK+Q CG+CW+FS TG+ EG N + G L SLSEQ L+DC SY N GC G
Sbjct: 118 RQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHGCNG 177
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQC--NKQKLNRHIVTIDGYKDVPENNEKQLL 240
GLMDYA++++I+N GIDTE+ YPY G C NKQ +V+ Y +VP NE LL
Sbjct: 178 GLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELVS---YTNVPSGNEGALL 234
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNS 298
AV QP SV I S +FQ Y G++ P CS+S LDH VL VG+ +G DYW++KNS
Sbjct: 235 NAVATQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWGVRDGKDYWLVKNS 294
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WG WG++GY+ M RN N CGI AS+P
Sbjct: 295 WGADWGLSGYIEMSRNKHNQ---CGIATAASHP 324
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 149/327 (45%), Positives = 196/327 (59%), Gaps = 23/327 (7%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W +H EK +R +F +N V + N ++ + L LN FADLT EF+
Sbjct: 48 LYERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFNLRRDAPYKLRLNRFADLTSDEFR 106
Query: 88 ASFLGFSAASIDHDR--------------RRNASVQSPGNLRDVPASIDWRKKGAVTEVK 133
S+ +++ + H R + +S G L P S+DWR+KGAVT VK
Sbjct: 107 RSY---ASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGAL---PTSVDWREKGAVTGVK 160
Query: 134 DQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVI 193
DQ CG+CWAFS A+EGIN I T +L SLSEQ+L+DCD N+GC GGLMD A+ ++
Sbjct: 161 DQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFSYIA 220
Query: 194 KNHGIDTEKDYPYRG-QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGI 252
K+ G+ EK YPYR Q+ CN +K +V+IDGY+DVP N+E L +AV AQPV+V I
Sbjct: 221 KHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVAVAI 280
Query: 253 CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHM 311
FQ YS G+F G C T LDH V VGY + +G YWI+KNSWG WG GY+ M
Sbjct: 281 EAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKGYIRM 340
Query: 312 QRNTGNSLGICGINMLASYPTKTGQNP 338
+R+ + G+CGI M ASYP KT NP
Sbjct: 341 KRDVADKEGLCGIAMEASYPVKTSPNP 367
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 141/307 (45%), Positives = 191/307 (62%), Gaps = 3/307 (0%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+LF++W +H K Y S EK R +IF DN ++ + N N+S+ L LN FADL++ EF
Sbjct: 46 QLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEF 104
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K ++GF A + + ++ + P SIDWR KGAVT VK+Q +CG+CWAFS
Sbjct: 105 KKKYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFST 164
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
+EGINKIVTG+L+ LSEQEL+DCD+ ++ GC GG + Q+V N+G+ T K YPY
Sbjct: 165 IATVEGINKIVTGNLLELSEQELVDCDK-HSYGCKGGYQTTSLQYV-ANNGVHTSKVYPY 222
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ + +C V I GYK VP N E L A+ QP+S + + FQLY SG+
Sbjct: 223 QAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGV 282
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
F GPC T LDHAV VGY + +G +Y IIKNSWG +WG GYM ++R +GNS G CG+
Sbjct: 283 FDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYK 342
Query: 327 LASYPTK 333
+ YP K
Sbjct: 343 SSYYPFK 349
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 278 bits (711), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 202/340 (59%), Gaps = 46/340 (13%)
Query: 9 LSILLLSSLPLNYCSDI---------NE----LFETWCKQHGKAYSSEQ-EKQQRLKIFE 54
LS+L++ LP + D+ NE +F+TW +HGK Y++ +K+QR + F+
Sbjct: 12 LSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFIFQTWMSKHGKTYTNALGDKEQRFQNFK 71
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL 114
DN F+ QHN N S+ L L FADLT QE++ F G R + V P
Sbjct: 72 DNLRFIDQHN-AKNLSYRLGLTQFADLTVQEYQDLFSGRPIQKQKALRVTHRYV--PLAE 128
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
+P S+DWR+KGAV+E+KDQ C +E INKIVTG L+SLSEQEL+DC
Sbjct: 129 DQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSEQELVDCSI 178
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVPE 233
N GC GGLMD A+QF+I N+G++ + DYPY+ G CN Q ++ ++ IDGY+DVP
Sbjct: 179 D-NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKIDGYEDVPA 237
Query: 234 NNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYW 293
NNE L +AV QP GI+TGPC T LDHAV+IVGY +ENG DYW
Sbjct: 238 NNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYGTENGQDYW 280
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
I++NSWG WG GY + RN N G+CGI M+ASYP K
Sbjct: 281 IVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPIK 320
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 278 bits (710), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 139/333 (41%), Positives = 205/333 (61%), Gaps = 15/333 (4%)
Query: 3 SLAFFLLSIL-----LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
+L F +L L +L++ L+ + + E W Q+G+ Y + EK +R ++F+ N
Sbjct: 6 ALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANA 65
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHDRRRNASVQSPGNLR 115
AF+ + N GN F L +N FADLT+ EF+ + GF ++ R N+
Sbjct: 66 AFI-ESFNAGNHKFWLGVNQFADLTNDEFRLTKTNKGFIPSTT---RVPTGFRYENVNID 121
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-R 174
+PA++DWR KG VT +KDQ CG CWAFSA A+EGI K+ TG L+SLSEQEL+DCD
Sbjct: 122 ALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVH 181
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
+ GC GGLMD A++F+IKN G+ TE +YPY +C + ++ + +I GY+DVP N
Sbjct: 182 GEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPAN 239
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYW 293
NE L++AV QPVSV + G + FQ Y G+ G C T LDH ++ +GY + +G YW
Sbjct: 240 NEAALMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASDGTKYW 299
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
++KNSWG +WG NG++ M+++ + G+CG+ M
Sbjct: 300 LLKNSWGMTWGENGFLRMEKDISDKRGMCGLAM 332
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 133/269 (49%), Positives = 173/269 (64%), Gaps = 9/269 (3%)
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGN-----LRDVPASIDWRKKGAVTEVKDQ 135
+T+ EF++++ G + ++H R S + G+ ++ VP S+DWRKKGAVT +KDQ
Sbjct: 1 MTNHEFRSTYAG---SKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQ 57
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
CG+CWAFS A+EGIN I T LVSLSEQEL+DCD S N GC GGLM YA++F+ +
Sbjct: 58 GQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEK 117
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
GI TE+ YPY + G C+ K+N +V+IDG++ VP NNE LL+A QP+SV I
Sbjct: 118 GGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAG 177
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRN 314
AFQ YS G+F G C T LDH V IVGY + +G YWI+KNSWG WG NGY+ M+R
Sbjct: 178 GSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRG 237
Query: 315 TGNSLGICGINMLASYPTKTGQNPPPSPP 343
G+CGI + ASYP K P P
Sbjct: 238 ISAKEGLCGIAVEASYPIKNSSTNPVGAP 266
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 277 bits (708), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 190/312 (60%), Gaps = 16/312 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ ++F + KQ+ KAYS E R F+ N + HN + N+S+T+ LN FADL+ +
Sbjct: 38 LQDMFTAFMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFE 96
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EFK + G+ + R N + + P SIDWR AVT +KDQ CG+CWAF
Sbjct: 97 EFKGKYFGYKHVEREFARSNNLHQE----VEAAPTSIDWRTSNAVTPIKDQGQCGSCWAF 152
Query: 145 SATGAIEGINKIVTG--SLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
SATG+IEG ++ G +L SLSEQ+L+DC SY N+GC GGLMDYA++++I N GI E
Sbjct: 153 SATGSIEGA-WVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAE 211
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQ 260
YPY+G G C QK +VTI GYKDV +E LL AV PVSV I + FQ
Sbjct: 212 SAYPYKGVGGLC--QKSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQ 269
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YSSG+F+G C +LDH VL VGY + DYWI+KNSWG SWG +GY+ M RN
Sbjct: 270 FYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIRMIRNKNQ--- 326
Query: 321 ICGINMLASYPT 332
CGI + SYPT
Sbjct: 327 -CGIAIQPSYPT 337
>gi|42563538|gb|AAS20467.1| cysteine protease-like protein [Pelargonium x hortorum]
Length = 234
Score = 277 bits (708), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 128/199 (64%), Positives = 151/199 (75%), Gaps = 1/199 (0%)
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG CWAFS A+EGIN IVTG L+SLSEQEL+DCDRSYN GC GGLMDYA++F+IKN G
Sbjct: 1 CGRCWAFSTIAAVEGINHIVTGELISLSEQELVDCDRSYNQGCNGGLMDYAFEFIIKNGG 60
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
ID+E+DYPY+ G C+ + N +VTIDGY+DVPEN+E L +AV QPVSV I R
Sbjct: 61 IDSEEDYPYKAVDGTCDPIRKNAKVVTIDGYEDVPENDENSLKKAVAYQPVSVAIEAGGR 120
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQLY SGIFTG C T+LDH V VGY +ENG+DYWI++NSWG SWG NGY+ M+RN
Sbjct: 121 EFQLYQSGIFTGRCGTALDHGVAAVGYGTENGIDYWIVRNSWGSSWGENGYIRMERNVKT 180
Query: 318 S-LGICGINMLASYPTKTG 335
+ G CGI M ASYPTK G
Sbjct: 181 TKTGKCGIAMEASYPTKEG 199
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 152/338 (44%), Positives = 213/338 (63%), Gaps = 17/338 (5%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
F +L+ +++S +++ + E + ++ QH K Y SE E++ R+KIF +N V +HN
Sbjct: 4 FLILAAVVISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNK 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAA--SIDHDRRRNASVQ--SPGNLRDVP 118
+ G F L LN +AD+ H EF ++ GF+ +I N +V+ SP N++ +P
Sbjct: 64 LFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVK-LP 122
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
++DWR KGAVTEVKDQ CG+CW+FSATG++EG + TG LVSLSEQ L+DC Y N
Sbjct: 123 DTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGN 182
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
+GC GGLMD A++++ N GIDTEK YPY + +C+ + N T G+ D+ E NE
Sbjct: 183 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKCHYKAQNSG-ATDKGFVDIEEANED 241
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGY-DSENGVDYW 293
L AV PVS+ I S FQLYS G+++ P S LDH VL+VGY S++G DYW
Sbjct: 242 DLKAAVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQDYW 301
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
++KNSWG SWG+NGY+ M RN N +CG+ ASYP
Sbjct: 302 LVKNSWGPSWGLNGYIKMARNQDN---MCGVASQASYP 336
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/338 (43%), Positives = 208/338 (61%), Gaps = 14/338 (4%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L+ L++ ++SSL +++ +D +E + W +HGK Y S++E+ R I++ N V
Sbjct: 1 MKYLSVLLVAACVVSSLSMSF-TDFDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
+HN ++G+ ++ L +N F DL ++EF A GF + + ++ P N+ ++
Sbjct: 60 IKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFRVSGTSK-AAKGSTFLPPNNVGEL 118
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P ++DWR KG VT VKDQ CG+CWAFS TG++EG + TG LVSLSEQ L+DC +
Sbjct: 119 PKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDC-SGRD 177
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
+GC GG MD A+Q++I GIDTE YPY+ G+C+ +K N T+ GY DV +EK
Sbjct: 178 AGCDGGFMDRAFQYIIDAGGIDTEASYPYKAVDGKCHFKKANVG-ATVTGYTDVTSGSEK 236
Query: 238 QLLQAVV-AQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGY-DSENGVDYW 293
L +AV P+SV I S +FQ Y SG++ P ST LDH VL VGY S +G DYW
Sbjct: 237 ALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSDGTDYW 296
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
I+KNSW +WGMNGY+ M RN N CGI ASYP
Sbjct: 297 IVKNSWAETWGMNGYVWMSRNKDNQ---CGIATNASYP 331
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 198/336 (58%), Gaps = 15/336 (4%)
Query: 3 SLAFFLLSILLLSSLPLN--YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
+LA FLL + +S + + + + E E W ++G+ Y EK+ +IF++N F+
Sbjct: 10 NLALFLLLSIEISQVMSRKLHETSLREEHENWIARYGQVYKVAAEKE-TFQIFKENVEFI 68
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP---GNLRDV 117
N N + L +N FADLT +EFK G ++ + +P N+ D+
Sbjct: 69 ESFNAAANKPYKLGVNLFADLTLEEFKDFRFGL--------KKTHEFSITPFKYENVTDI 120
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P ++DWR+KGAVT +KDQ CG+CWAFS A EGI++I TG+LVSL EQEL+ CD +
Sbjct: 121 PEALDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGV 180
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GG M+ ++F+IKN GI T+ +YPY+G G CN + I GY+ VP +E
Sbjct: 181 DQGCEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYSE 240
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
+ L +AV QPVSV I + F Y+ GI+TG C T LDH V VGY + N DYWI+K
Sbjct: 241 EALQKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNETDYWIVK 300
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
NSWG W G++ MQR G+CG+ + +SYPT
Sbjct: 301 NSWGTGWDEKGFIRMQRGITVKHGLCGVALDSSYPT 336
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 201/342 (58%), Gaps = 21/342 (6%)
Query: 7 FLLSILLLSSLPLNYCSDINE-----------LFETWCKQHGKAYSSEQEKQQRLKIFED 55
F+LSI L + + C D E L+E W QH + + + EK++R +F+
Sbjct: 7 FVLSISLALFIGVVNCIDFTEKDLATDKSLWDLYERWGSQHMVSRAPD-EKKKRFNVFKY 65
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP---G 112
N + + N +G + L LN FAD+T+ EFKA GF + + + Q+P
Sbjct: 66 NVNHINRVNQLG-KPYKLKLNEFADMTNHEFKA---GFDSKILHFRMLKGKRRQTPFTHA 121
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
D P SIDWR GAV +K+Q CG+CWAFS +EGINKI T LVSLSEQEL+DC
Sbjct: 122 KTTDPPPSIDWRTNGAVNPIKNQGRCGSCWAFSTIVGVEGINKIKTNQLVSLSEQELVDC 181
Query: 173 DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVP 232
+ GC GGLM+ Y+F+ + G+ TE+ YPY + G+C+ K N +V IDG+++VP
Sbjct: 182 ETDC-EGCNGGLMENGYEFIKETGGVTTEQIYPYFARNGRCDISKRNSPVVKIDGFENVP 240
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVD 291
N+E +L+AV QPVS+ I FQ YS G+F G C T L+H V IVGY +++G +
Sbjct: 241 ANDESAMLRAVANQPVSIAIDAGGLNFQFYSQGVFNGACGTELNHGVAIVGYGTTQDGTN 300
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
YWI++NSWG WG GY+ MQR G+CG+ M ASYP K
Sbjct: 301 YWIVRNSWGTGWGEQGYVRMQRGVNVPEGLCGLAMDASYPIK 342
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 145/336 (43%), Positives = 206/336 (61%), Gaps = 13/336 (3%)
Query: 4 LAFFLLSI--LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
L F +LS+ +++S L S + E E W HG+ Y + EK+ R K F++N F+
Sbjct: 15 LLFSILSLYPFIVTSRNLKELSML-ERHENWMVHHGRVYKDDIEKEHRFKTFKENVEFIE 73
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP-GNLRDVPAS 120
N G + L++N +ADLT +EF SF+G + + + ++ +VP S
Sbjct: 74 SFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTSFKYDSVTEVPNS 133
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
+DWRK+G+VT VKDQ CG CWAFSA AIEG +I L+SLSEQ+L+DC + N GC
Sbjct: 134 MDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQLLDCS-TQNKGC 192
Query: 181 GGGLMDYAYQFVIKNH--GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GGLM AY F+++N+ GI TE +YPY C ++ VTI+GY+ VP ++E
Sbjct: 193 EGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTEQ--PAAVTINGYEVVP-SDESS 249
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDS--ENGVDYWIIK 296
LL+AVV QP+SVGI ++ F +Y SGI+ G C++ L+HAV ++GY + E+G YWI+K
Sbjct: 250 LLKAVVNQPISVGIAANDE-FHMYGSGIYDGSCNSRLNHAVTVIGYGTSEEDGTKYWIVK 308
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
NSWG WG GYM + R+ G G CGI +AS+PT
Sbjct: 309 NSWGSDWGEEGYMRIARDVGVDGGHCGIAKVASFPT 344
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 183/307 (59%), Gaps = 5/307 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W +HG+ Y+ E EK +RL+IF N F+ N+ G S L+ N FADLT +EF+A+
Sbjct: 48 EKWMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDEEFRAA 107
Query: 90 FLGFSAASIDHDRRRNASVQSPGN--LRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
GF + N L D S+DWR GAVT VKDQ CG CWAFSA
Sbjct: 108 RTGFRPRPAPAAAAGSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCWAFSAV 167
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EG+NKI TG LVSLSEQEL+DCD + GC GGLMD A+QF+ + G+ +E YPY
Sbjct: 168 AAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASESGYPY 227
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+G G C +I G++DVP NNE L AV QPVSV I G + AF+ Y SG+
Sbjct: 228 QGDDGSCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRFYDSGV 287
Query: 267 FTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
G C T L+HA+ VGY + +G YW++KNSWG SWG GY+ ++R G+CG+
Sbjct: 288 LGGECGTDLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGV-RGEGVCGLA 346
Query: 326 MLASYPT 332
L SYP
Sbjct: 347 KLPSYPV 353
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 199/332 (59%), Gaps = 18/332 (5%)
Query: 3 SLAFFLLSIL-----LLSSLPLNYCSDINELF----ETWCKQHGKAYSSEQEKQQRLKIF 53
S AF LLS++ L SL +D ++ E W ++ + YS EK +R ++F
Sbjct: 6 SSAFVLLSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAAEKARRFEVF 65
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGF---SAASIDHDRRRNASV-- 108
+ N A + + N GN F L N FADLT EF+A++ G+ +AA+ R R A+
Sbjct: 66 KANMALI-ESVNAGNHKFWLEANRFADLTDDEFRATWTGYRPKTAAASSKGRSRTATTGF 124
Query: 109 -QSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
+ +L DVPAS+DWR KGAVT +K+Q CG CWAFSA ++EG+ K+ TG LVSLSEQ
Sbjct: 125 KYANVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLSTGKLVSLSEQ 184
Query: 168 ELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTID 226
EL+DCD + GC GG MD A+ F++ N G+ TE YPY G CN + + +I
Sbjct: 185 ELVDCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTASDGTCNSNEASGDAASIK 244
Query: 227 GYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD- 285
GY+DVP N+E L +AV QPVSV + G + F+ Y G+ +G C T LDH + VGY
Sbjct: 245 GYEDVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELDHGIAAVGYGV 304
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
+ +G YW++KNSWG SWG GY+ M+R+ +
Sbjct: 305 ASDGTKYWVMKNSWGTSWGEAGYIRMERDIAD 336
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 190/312 (60%), Gaps = 16/312 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ ++F + KQ+ KAYS E R F+ N + HN + N+S+T+ LN FADL+ +
Sbjct: 38 LQDMFTAFMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFE 96
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EFK + G+ + R N + + P SIDWR AVT +KDQ CG+CWAF
Sbjct: 97 EFKGKYFGYKHVEREFARSNNLHQE----VEAAPTSIDWRTSNAVTPIKDQGQCGSCWAF 152
Query: 145 SATGAIEGINKIVTG--SLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
SATG+IEG ++ G +L SLSEQ+L+DC SY ++GC GGLMDYA++++I N GI E
Sbjct: 153 SATGSIEGA-WVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAE 211
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQ 260
YPY+G G C QK +VTI GYKDV +E LL AV PVSV I + FQ
Sbjct: 212 SAYPYKGVGGLC--QKSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQ 269
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YSSG+F+G C +LDH VL VGY + DYWI+KNSWG SWG +GY+ M RN
Sbjct: 270 FYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIRMIRNKNQ--- 326
Query: 321 ICGINMLASYPT 332
CGI + SYPT
Sbjct: 327 -CGIAIQPSYPT 337
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 148/340 (43%), Positives = 206/340 (60%), Gaps = 31/340 (9%)
Query: 7 FLLSILLLSSL-----PLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
FLL+IL +SL SD + E E W ++G+ Y EK +R ++F+DN AF
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAF 66
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAAS----IDHDRRRNASVQSPGNLR 115
V N N+ F L +N FADLT +EFKA+ GF + + N SV +
Sbjct: 67 VESFNTNKNNKFWLGVNQFADLTTEEFKAN-KGFKPTAEKVPTTGFKYENLSVSA----- 120
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-R 174
+P ++DWR KGAVT +K+Q C A +EGI K+ TG+L+SLSEQEL+DCD
Sbjct: 121 -LPTAVDWRTKGAVTPIKNQGQCAA---------MEGIVKLSTGNLISLSEQELVDCDTH 170
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
S + GC GG MD A++FVIKN G+ TE +YPY+ G+C + ++ TI G++DVP N
Sbjct: 171 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKC--KGGSKSAATIKGHEDVPVN 228
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYW 293
NE L++AV QPVSV + S+R F LYS G+ TG C T LDH + +GY E +G YW
Sbjct: 229 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 288
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
I+KNSWG +WG G++ M+++ + G+CG+ M SYPT+
Sbjct: 289 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPTE 328
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 153/330 (46%), Positives = 202/330 (61%), Gaps = 16/330 (4%)
Query: 8 LLSILLLSSLPLNYCSDINE--LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
L+ LL++ L S++++ + W HGK Y+ E+E +R I+ DN V +HN
Sbjct: 4 FLACLLVAVLIAQCFSELSQDRQWHAWKDFHGKTYTGEEEDLRR-AIWNDNLEIVKKHN- 61
Query: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRK 125
N S+ L +N FADLT EFK F+G+ AAS S P + +PA +DWR
Sbjct: 62 AENHSYKLDMNHFADLTVTEFKQRFMGYRAAS----NSTGGSTFLPLSNVQLPAEVDWRD 117
Query: 126 KGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGL 184
KG VT VK+Q CG+CWAFS+TG++EG + TG LVSLSEQ L+DC + Y N+GC GGL
Sbjct: 118 KGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNNGCEGGL 177
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV- 243
MDYA++++ N GIDTE+ YPY + GQC+ K T+ GY DV +E L AV
Sbjct: 178 MDYAFKYIKNNDGIDTEQSYPYTARDGQCHF-KPGSVGATVTGYTDVQRGSEGDLQSAVA 236
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
P+SV I +FQLY +G+++ P ST LDH VL VGY +E+G DYW++KNSWG
Sbjct: 237 TVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYGAEDGKDYWLVKNSWGE 296
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WGMNGY+ M RN N CGI ASYP
Sbjct: 297 GWGMNGYIKMSRNKDNQ---CGIATQASYP 323
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 143/338 (42%), Positives = 201/338 (59%), Gaps = 23/338 (6%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
L S +LS+ L + + E E W + + Y EK QR K F+ N AF+ + N G
Sbjct: 17 LCSSTVLSARELGDAAMV-EKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAFI-ESFNTG 74
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR-------DVPAS 120
N F L +N F DLT+ EF+A+ + +RN + ++P + +PA+
Sbjct: 75 NHKFWLGVNQFTDLTNDEFRAT-------KTNKGLKRNGA-RAPTRFKYNNVSTDALPAA 126
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSG 179
+DWR KG VT +KDQ CG CWAFSA A EGI K+ TG LVSLSEQEL+DCD + G
Sbjct: 127 VDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVDQG 186
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GG MD A++F+IKN G+ TE +YPY Q GQC + + TI GY+DVP N+E L
Sbjct: 187 CEGGEMDNAFKFIIKNGGLTTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVPANDESSL 246
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNS 298
++AV QPVSV + G + FQ YS G+ TG C T LDH ++ +GY + +G +W++KNS
Sbjct: 247 MKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAIGYGMTSDGTKFWLLKNS 306
Query: 299 WGRSWGMNGYMHMQRN----TGNSLGICGINMLASYPT 332
WG +WG +GY+ M+++ +G +G N+ A + T
Sbjct: 307 WGTTWGESGYLRMEKDISDKSGTIIGNNSYNLWAKWVT 344
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 153/339 (45%), Positives = 211/339 (62%), Gaps = 16/339 (4%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L+ L+++ ++SSL +++ +D +E + W +HGK Y S++E+ R I+E N V
Sbjct: 1 MKYLSVLLVAVCVVSSLSMSF-TDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
+HN ++G+ ++ L +N FADL ++EF A GF + + + S N+ +
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFRVNGTSKAAKGSTFLPS-NNVDKL 118
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P ++DWR KG VT VKDQ CG+CWAFSATG++EG TG LVSLSEQ L+DC SY
Sbjct: 119 PKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDC--SYR 176
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GG MD A+Q++I GIDTE Y YR G C+ +K N T+ GY DV +E
Sbjct: 177 NYGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVDGNCHFKKANVG-ATVTGYTDVTSGSE 235
Query: 237 KQLLQAVV-AQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGY-DSENGVDY 292
K L +AV P+SV I S + F+ Y SG++ P CST+ L HAVL+VGY + +G DY
Sbjct: 236 KALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTSDGTDY 295
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WI+KNSW ++WGMNGY+ M RN N CGI ASYP
Sbjct: 296 WIVKNSWAKTWGMNGYLWMSRNKDNQ---CGIASEASYP 331
>gi|308082013|ref|NP_001183396.1| uncharacterized protein LOC100501813 [Zea mays]
gi|238011208|gb|ACR36639.1| unknown [Zea mays]
Length = 291
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 136/253 (53%), Positives = 166/253 (65%), Gaps = 6/253 (2%)
Query: 161 LVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
++SLSEQEL+DCD SYN GC GGLMDYA++F+I N GIDTE+DYPY+G G+C+ + N
Sbjct: 1 MISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNA 60
Query: 221 HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVL 280
+VTID Y+DVP N+EK L +AV QP+SV I RAFQLY+SGIFTG C T+LDH V
Sbjct: 61 KVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVT 120
Query: 281 IVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPP 340
VGY +ENG DYWI+KNSWG SWG +GY+ M+RN S G CGI + SYP K G NPP
Sbjct: 121 AVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCGIAVEPSYPLKKGANPPN 180
Query: 341 SPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPS 394
P P+ C C TCCC C +W CC A CC DH CCP
Sbjct: 181 PGPTPPSPTPPPTVCDNYYSCPDSTTCCCIYEYGKYCFAWGCCPLEGATCCDDHYSCCPH 240
Query: 395 NYPICDSVRHQCL 407
+YP+C+ + CL
Sbjct: 241 DYPVCNVKQGTCL 253
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 274 bits (700), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 151/346 (43%), Positives = 215/346 (62%), Gaps = 23/346 (6%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M + F LL+++ ++ +++ I E ++T+ +H K Y E E++ RLKIF +N +
Sbjct: 1 MRTYIFALLALVAVAQ-AVSFADVIKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKI 59
Query: 61 TQHNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-----SPG 112
+HN + G SF + LN +AD+ H EF + GF+ R +A+ SP
Sbjct: 60 AKHNQLYAAGEVSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPE 119
Query: 113 NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
+++ +P S+DWR KGAVT VKDQ CG+CWAFS+TGA+EG + TG+L+SLSEQ L+DC
Sbjct: 120 HVK-LPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDC 178
Query: 173 DRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYK 229
Y N+GC GGLMD A++++ N GIDTEK YPY G C+ N+ + T G+
Sbjct: 179 STKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCH---FNKGTIGATDRGFT 235
Query: 230 DVPENNEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGYDS 286
D+P+ +EK+L QAV PVSV I S +FQ YS+G++ P C +LDH VL+VGY +
Sbjct: 236 DIPQGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGT 295
Query: 287 -ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
ENG DYW++KNSWG +WG G++ M RN N CGI +SYP
Sbjct: 296 DENGKDYWLVKNSWGTTWGDKGFIKMARNDDNQ---CGIATASSYP 338
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 144/334 (43%), Positives = 201/334 (60%), Gaps = 27/334 (8%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFA 79
S I + F+ W ++ K ++ +E+ +RLKIF +NY FV +HN G S + +N FA
Sbjct: 66 SKIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVEMNKFA 125
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR-------DVPASIDWRKKGAVTEV 132
T +E++ LGF + RR+ S ++ ++ + P SIDW +G +T
Sbjct: 126 AHTREEYR-KMLGFKKSL----RRKKDSGEAAKDVSLWEYEGVEAPESIDWVDEGVITTP 180
Query: 133 KDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQF 191
K+Q SCG+CWAFSA GA+EGIN I TG LVSLSEQEL+ C R N GC GGLMD A+++
Sbjct: 181 KNQGSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFEW 240
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251
+++N G+D+EK Y Y+ C +K HI +IDG+ DVP N+E L +AV QPVSV
Sbjct: 241 IVENGGVDSEKQYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVSVA 300
Query: 252 ICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGY----DSENGV------DYWIIKNSWG 300
I +R+FQLY G++ C T LDH VL+VGY +S N + YW IKNSW
Sbjct: 301 IEADQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNSWS 360
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
WG GY+ + R+ + G+CG+ +ASYP KT
Sbjct: 361 EQWGEGGYIRIARDVESPSGMCGVAEMASYPEKT 394
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 143/308 (46%), Positives = 185/308 (60%), Gaps = 10/308 (3%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
FE W KQ+ + Y ++E + R I++ N ++ N+ S+ L+ N FADLT++EF +
Sbjct: 5 FERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ-EXSYNLTDNKFADLTNEEFVS 63
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+LGF + H + D+P S DWRK+GAV+++KDQ +CG+CWAFSA
Sbjct: 64 PYLGFGTRFLPHTGFMYHEHE------DLPESKDWRKEGAVSDIKDQGNCGSCWAFSAVA 117
Query: 149 AIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EGINKI +G LVSLSEQE DCD N GC GGLMD A+ F+ KN G+ T KDYPY
Sbjct: 118 AVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPYE 177
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA--QPVSVGICGSERAFQLYSSG 265
G G CNK+K H I G+ VP N+E L A Q SV I AFQLY G
Sbjct: 178 GVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLYLKG 237
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
+F+G C L+H V IVGY YWI+KNSWG WG +GY+ M+R+ + G CGI
Sbjct: 238 VFSGICGKQLNHGVTIVGYGKGTSDKYWIVKNSWGADWGESGYIRMKRDAFDKAGTCGIA 297
Query: 326 MLASYPTK 333
M ASYP K
Sbjct: 298 MQASYPLK 305
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 190/307 (61%), Gaps = 3/307 (0%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+LF++W +H K Y S EK R +IF DN ++ + N N+S+ L LN FADL++ EF
Sbjct: 46 QLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDE-TNKKNNSYWLGLNGFADLSNDEF 104
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K ++GF A + + ++ + P SIDWR KGAVT VK+Q +CG+CWAFS
Sbjct: 105 KKKYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFST 164
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
+EGINKIVTG+L+ LSEQEL+DCD+ ++ GC GG + Q+V N+G+ T K YP
Sbjct: 165 IATVEGINKIVTGNLLELSEQELVDCDK-HSYGCKGGYQTTSLQYV-ANNGVHTSKVYPC 222
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ + +C V I GYK VP N E L A+ QP+S + + FQLY SG+
Sbjct: 223 QAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGV 282
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
F GPC T LDHAV VGY + +G +Y IIKNSWG +WG GYM ++R +GNS G CG+
Sbjct: 283 FDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYK 342
Query: 327 LASYPTK 333
+ YP K
Sbjct: 343 SSYYPFK 349
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 189/309 (61%), Gaps = 4/309 (1%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQE 85
+E E W Q+GK Y EK++R ++F++N F+ N G+ F LS+N FADL +E
Sbjct: 32 SERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEE 91
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQA-SCGACWAF 144
FKA + + S + N+ +P+++DWRK+GAVT +KDQ +CG+CWAF
Sbjct: 92 FKALLNNVQKKASRVETATETSFRYE-NVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAF 150
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
+ +E +++I TG LVSLSEQEL+DC R + GC GG ++ A++F+ GI +E Y
Sbjct: 151 ATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYY 210
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PY+G+ C +K + I GY+ VP N+EK LL+AV QPVSV I AF+ YSS
Sbjct: 211 PYKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSS 270
Query: 265 GIFTGP-CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
GIF C T LDHAV +VGY +G YW++KNSW +WG GYM ++R+ G+C
Sbjct: 271 GIFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRDIRAKKGLC 330
Query: 323 GINMLASYP 331
GI ASYP
Sbjct: 331 GIASNASYP 339
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 154/341 (45%), Positives = 210/341 (61%), Gaps = 22/341 (6%)
Query: 6 FFLLSILLLSSL-PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
F +L I + +++ +++ +N+ + T+ +H KAY S+ E++ R+KIF DN + +HN
Sbjct: 4 FLILFITIFATVHAVSFFELVNQEWMTFKMEHKKAYKSDVEERFRMKIFMDNKHKIAKHN 63
Query: 65 N---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRN-----ASVQSPGNLRD 116
+ M S+ L +N + D+ H EF GF+ SI+ R AS P N+
Sbjct: 64 SNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNK-SINTQLRSERMPIGASFIEPANVA- 121
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P +DWRK+GAVT VKDQ CG+CW+FSATGA+EG + TG LVSLSEQ LIDC Y
Sbjct: 122 LPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKY 181
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N+GC GGLMD A+Q++ N G+DTE YPY + +C N + + GY D+P N
Sbjct: 182 GNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGN 240
Query: 236 EKQLLQAVVAQ--PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGV 290
EK LL+A VA PVSV I S ++FQ YS G++ P S LDH VL++GY + ENG
Sbjct: 241 EK-LLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGE 299
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG +WG NGY+ M R N L CGI ASYP
Sbjct: 300 DYWLVKNSWGETWGNNGYIKMAR---NKLNHCGIASSASYP 337
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 202/341 (59%), Gaps = 12/341 (3%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
L F L + ++ + P D + + FE W ++G+ Y EK +R +IF++N
Sbjct: 7 LVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVN 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+ NN +S+TL +N F D+T+ EF A + G + ++ ++ S N+ V
Sbjct: 67 HIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVSFDDV-NISAVG 125
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
SIDWR GAVTEVKDQ CG+CWAFSA +EGI KIVTG LVSLSEQE++DC S +
Sbjct: 126 QSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS--N 183
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG +D AY F+I N+G+ +E DYPY+ G C + I GY V N+E
Sbjct: 184 GCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSW-PNSAYITGYSYVRSNDESS 242
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKN 297
+ AV QP++ I S FQ Y+ G+F+GPC TSL+HA+ I+GY + +G YWI+KN
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKN 302
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT-KTGQN 337
SWG SWG GY+ M R +S G+CGI M YPT ++G N
Sbjct: 303 SWGSSWGERGYIRMARGVSSS-GLCGIAMDPLYPTLQSGAN 342
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 191/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + ENG YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDENGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 143/341 (41%), Positives = 204/341 (59%), Gaps = 12/341 (3%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
L F L + ++ + P D + + FE W ++G+ Y EK +R +IF++N
Sbjct: 7 LVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVN 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+ N+ +S+TL +N F D+T EF A + G + ++ +R S N+ VP
Sbjct: 67 HIETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPLNIEREPVVSFDDV-NISAVP 125
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
SIDWR GAV EVK+Q CG+CWAF+A +EGI KI TG LVSLSEQE++DC SY
Sbjct: 126 QSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY-- 183
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG ++ AY F+I N+G+ TE++YPY+ G CN + I GY V N+E+
Sbjct: 184 GCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCNANSF-PNSAYITGYSYVRRNDERS 242
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKN 297
++ AV QP++ I SE FQ Y+ G+F+GPC TSL+HA+ I+GY + +G YWI++N
Sbjct: 243 MMYAVSNQPIAALIDASEN-FQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRN 301
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT-KTGQN 337
SWG SWG GY+ M R +S G CGI M +PT ++G N
Sbjct: 302 SWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFPTLQSGAN 342
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 189/314 (60%), Gaps = 14/314 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+N FE W + GK+YS E+ R ++E N V HN G S+TL +N FADLTH+
Sbjct: 26 LNMEFEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHE 85
Query: 85 EFKASFLGFSAASIDHDRRRN---ASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
EFK +LG +D +R R+ ++ N+ +P S+DWR G VT VKDQ CG+C
Sbjct: 86 EFKRFYLG---TKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSC 142
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
W+FS TG++EG + TG LVSLSEQ L+DC ++ N GC GGLMD A+Q++I N GIDT
Sbjct: 143 WSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDT 202
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
E YPY + G C N T+ ++D+ +E L AV PVSV I S+ +F
Sbjct: 203 EASYPYTAKDGTCKFNAANVG-ATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSF 261
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLY+SG++ STSLDH VL GY + NG YW++KNSWG SWG GY+ M RN N
Sbjct: 262 QLYTSGVYNEKKCSSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAGYIWMSRNANN 321
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 322 Q---CGIATSASYP 332
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 139/306 (45%), Positives = 189/306 (61%), Gaps = 9/306 (2%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
+E+W K++G+ Y ++ E + R +I+ N F+ +N+ N S+ L N F DLT++EF+
Sbjct: 44 YESWLKKYGQKYRNKDEWEFRFEIYRANVQFIEVYNSQ-NYSYKLMDNKFVDLTNEEFRR 102
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+L + S R Q G D+P IDWR +GAVT +KDQ CG+CW+FSA
Sbjct: 103 MYLVYQPRSHLQTR---FMYQKHG---DLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVA 156
Query: 149 AIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
+E INKI TG LVSLSEQ+LIDCD R+ N GC GG M+ + F+ K G+ T+K+YPY+
Sbjct: 157 TVEDINKIKTGKLVSLSEQQLIDCDNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQ 215
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G G NK K+ H V I GY+++P +NE L AV QP SV AFQLYS G F
Sbjct: 216 GSDGDXNKAKVRNHAVAICGYENLPAHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTF 275
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
+G C L+H + IVGY ENG YW++KNSW G++GY+ M+R+ + G CG M
Sbjct: 276 SGSCGKDLNHRMTIVGYGEENGEKYWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAME 335
Query: 328 ASYPTK 333
ASYP K
Sbjct: 336 ASYPDK 341
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPLSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 143/341 (41%), Positives = 203/341 (59%), Gaps = 13/341 (3%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
L F L + + + P D + + FE W ++G+ Y + EK +R +IF++N
Sbjct: 7 LVFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVK 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+ N+ +S+TL +N F D+T EF A + G S ++ +R S N+ VP
Sbjct: 67 HIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLP-LNIEREPVVSFDDV-NISAVP 124
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
SIDWR GAV EVK+Q CG+CW+F+A +EGI KI TG LVSLSEQE++DC SY
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY-- 182
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG ++ AY F+I N+G+ TE++YPY G CN I GY V N+E+
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNS-AYITGYSYVRRNDERS 241
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKN 297
++ AV QP++ I SE FQ Y+ G+F+GPC TSL+HA+ I+GY + +G YWI++N
Sbjct: 242 MMYAVSNQPIAALIDASEN-FQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRN 300
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT-KTGQN 337
SWG SWG GY+ M R +S G+CGI M +PT ++G N
Sbjct: 301 SWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFPTLQSGAN 341
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 186/309 (60%), Gaps = 8/309 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W + + Y E EKQ R +F+ N F+ N GN S+ L +N FAD T++EF
Sbjct: 37 EKHEQWMARFSRVYRDELEKQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEF 96
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG-NLRD-VPASIDWRKKGAVTEVKDQASCGACWAF 144
A G S + + ++ S N+ D V S DWR +GAVT VK Q CG CWAF
Sbjct: 97 LAIHTGLKGLS---SKVVDETISSRSWNISDMVGVSKDWRAEGAVTPVKYQGQCGCCWAF 153
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
SA A+EG+ KI G+LVSLSEQ+L+DCDR Y+ GC GG+M A+ ++I+N GI +E DY
Sbjct: 154 SAVAAVEGVTKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYIIQNRGIASENDY 213
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
Y+G G+C R I G++ VP NNE+ LL+AV QPVSV + + F YS
Sbjct: 214 SYQGSDGRCRSSA--RPAARISGFQTVPSNNEQALLEAVSRQPVSVSMDANGDGFMHYSG 271
Query: 265 GIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
G++ GPC TS +HAV VGY S++G YW+ KNSWG +WG GY+ ++R+ G+CG
Sbjct: 272 GVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCG 331
Query: 324 INMLASYPT 332
+ A YP
Sbjct: 332 VAQYAFYPV 340
>gi|357437721|ref|XP_003589136.1| Cysteine proteinase [Medicago truncatula]
gi|355478184|gb|AES59387.1| Cysteine proteinase [Medicago truncatula]
Length = 295
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 136/259 (52%), Positives = 167/259 (64%), Gaps = 7/259 (2%)
Query: 156 IVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNK 215
IVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N GID+E DYPY+ G+C++
Sbjct: 5 IVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQ 64
Query: 216 QKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSL 275
+ N +VTID Y+DVP +E L +AV QP++V + G R FQLY G+FTG C T+L
Sbjct: 65 NRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTAL 124
Query: 276 DHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS-LGICGINMLASYPTKT 334
DH V VGY +ENG DYWI++NSWG SWG GY+ ++RN +S G CGI + SYP K
Sbjct: 125 DHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKN 184
Query: 335 GQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDH 388
GQNPP P P+ C CA G TCCC C W CC SA CC DH
Sbjct: 185 GQNPPNPGPSPPSPIKPPSVCDSYYSCAEGSTCCCIYEYGRSCFEWGCCPLESATCCDDH 244
Query: 389 RYCCPSNYPICDSVRHQCL 407
CCP YP+CD+ CL
Sbjct: 245 YSCCPHEYPVCDTRAGLCL 263
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 195/312 (62%), Gaps = 5/312 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
S + E + W ++ + Y++ E ++R KIF++N ++ NN+GN S+ L LN ++DLT
Sbjct: 27 SSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLGLNRYSDLT 86
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGAC 141
+EF AS GF + D + SV P NL D VP + DWR+KG VT+VK+Q CG C
Sbjct: 87 SEEFIASHTGFKVSDQLSDSKMR-SVAIPFNLNDDVPTNFDWREKGVVTDVKNQRQCGCC 145
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAF+A A+EGI KI G+L+SLSEQ+L+DCDR +SGCGGG A+ +IK+ GI E
Sbjct: 146 WAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQ-SSGCGGGDFVLAFDSIIKSRGIVKE 204
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DYPY+ Q + I+GY VP N+E+QLL+AV+ QPVSV I S F
Sbjct: 205 DDYPYKANDVQTCQLGQIPGAAQINGYFKVPANDEQQLLRAVLQQPVSVAISTS-YDFHH 263
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y G++ G C L+HAV I+GY SE G YW+IKNSWG +WG GYM + R + + G
Sbjct: 264 YMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLIKNSWGETWGEKGYMKVLRESSATGG 323
Query: 321 ICGINMLASYPT 332
C I + A+YPT
Sbjct: 324 QCSIAVHAAYPT 335
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 151/337 (44%), Positives = 209/337 (62%), Gaps = 18/337 (5%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN-- 65
LL L+ + ++Y + E + T+ +H K Y+ E+ R+KIF +N + +HN
Sbjct: 8 LLIALVAMTQAVSYSELVREEWNTFKLEHRKNYADSTEETFRMKIFNENKHHIAKHNQRY 67
Query: 66 -MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDVPA 119
G S+ L+LN +AD+ H EF+ + GF+ R + S SP +++ +P
Sbjct: 68 ATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISPEHVK-LPT 126
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
++DWR KGAVTEVKDQ CG+CWAFS+TGAIEG + +G+LVSLSEQ L+DC Y N+
Sbjct: 127 AVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCSTKYGNN 186
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A+++V N GIDTEK Y Y G C+ K N T G+ D+P+ NEK+
Sbjct: 187 GCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDK-NSIGATDRGFADIPQGNEKK 245
Query: 239 LLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CST-SLDHAVLIVGYDSE-NGVDYWI 294
L QAV PVSV I S+++FQ YS G++ P CS +LDH VL+VGY +E +G DYW+
Sbjct: 246 LAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYGTEKDGSDYWL 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG +WG G++ M RN N CGI +SYP
Sbjct: 306 VKNSWGTTWGDKGFIKMSRNKENQ---CGIASASSYP 339
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 134/265 (50%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 81 LTHQEFKASFLGFSAAS---IDHDRRRNASVQSP---GNLRDVPASIDWRKKGAVTEVKD 134
+T EF+ + G A DR+ +++ S + RDVPAS+DWR+KGAVT+VKD
Sbjct: 1 MTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 60
Query: 135 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIK 194
Q CG+CWAFS A+EGIN I T +L SLSEQ+L+DCD N+GC GGLMDYA+Q++ K
Sbjct: 61 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 120
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 254
+ G+ E YPYR + C K +VTIDGY+DVP N+E L +AV QPVSV I
Sbjct: 121 HGGVAAEDAYPYRARQASCKKSPAP--VVTIDGYEDVPANDESALKKAVAHQPVSVAIEA 178
Query: 255 SERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQR 313
S FQ YS G+F+G C T LDH V VGY + +G YW++KNSWG WG GY+ M R
Sbjct: 179 SGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMAR 238
Query: 314 NTGNSLGICGINMLASYPTKTGQNP 338
+ G CGI M ASYP KT NP
Sbjct: 239 DVAAKEGHCGIAMEASYPVKTSPNP 263
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + ENG YW++KNSWG SWG NG+M + R+ GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDENGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 140/332 (42%), Positives = 195/332 (58%), Gaps = 5/332 (1%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
L FL+ + S + S+ +E E W Q+G+ Y EK++R ++F++N F+
Sbjct: 10 LILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIE 69
Query: 62 QHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
N G+ F LS+N FADL +EFKA + + + S + ++ +PA+I
Sbjct: 70 SFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE-SVTKIPATI 128
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
D RK+GAVT +KDQ CG+CWAFSA A EGI++I TG LV LSEQEL+DC + + GC
Sbjct: 129 DRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEGCI 188
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GG +D A++F+ K GI +E YPY+G C +K + I GY+ VP NNEK LL+
Sbjct: 189 GGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKALLK 248
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGY-DSENGVDYWIIKNSW 299
AV QPVSV I AF+ YSSGIF C T +HAV +VGY + + YW++KNSW
Sbjct: 249 AVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDDSKYWLVKNSW 308
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
G WG GY+ ++R+ G+CGI YP
Sbjct: 309 GTEWGERGYIRIKRDIRAKEGLCGIAKYPYYP 340
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 128/233 (54%), Positives = 165/233 (70%), Gaps = 4/233 (1%)
Query: 114 LRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD 173
+ D+P S+DWR+KGAVT VKDQ CG+CWAFS ++EGIN I TGSLVSLSEQELIDCD
Sbjct: 1 VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 60
Query: 174 RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKD 230
+ N GC GGLMD A++++ N G+ TE YPYR G CN + ++ +V IDG++D
Sbjct: 61 TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQD 120
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENG 289
VP N+E+ L +AV QPVSV + S +AF YS G+FTG C T LDH V +VGY +E+G
Sbjct: 121 VPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDG 180
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
YW +KNSWG SWG GY+ +++++G S G+CGI M ASYP KT P P+P
Sbjct: 181 KAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYSKPKPTP 233
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 141/315 (44%), Positives = 191/315 (60%), Gaps = 16/315 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLR-------DVPASIDWRKKGAVTEVKDQAS 137
EF A F G + + + S S L+ D+P+++DWR+ GAVT+VK Q
Sbjct: 95 EFLAKFTGLNIP----NSYLSPSPMSSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGR 150
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG CWAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N G
Sbjct: 151 CGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGG 209
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
I E DY Y G+ C Q+ V I YK VPE E LLQAV QPVS+GI S+
Sbjct: 210 ISRESDYEYLGEQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD 267
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
Q Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G
Sbjct: 268 -LQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326
Query: 317 NSLGICGINMLASYP 331
N G+C I ++SYP
Sbjct: 327 NPSGLCDIAKMSSYP 341
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
Length = 430
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 144/337 (42%), Positives = 199/337 (59%), Gaps = 34/337 (10%)
Query: 29 FETWCKQHG--KAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTH 83
FE WC +HG + +E +RL F +N A+V +HN + G S + LN+ A T
Sbjct: 98 FERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSLAATTR 157
Query: 84 QEFKASFLGF-------------SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVT 130
+E++A LG+ A S D + AS + D P +IDW + GAVT
Sbjct: 158 EEYRA-LLGYKPELRSSGDAEMLEATSTDKVEQYKASWEYASV--DPPEAIDWVELGAVT 214
Query: 131 EVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQ 190
K+Q CG+CWAFS TGA+EGI KI TG LVSLSEQE++ C + N GC GGLMDYA++
Sbjct: 215 PPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQ-NMGCNGGLMDYAFR 273
Query: 191 FVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSV 250
+++KN GID+E YPY +A CN+ KL H+ TIDG+KDVP +EK+L +AV QPVS+
Sbjct: 274 WIVKNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELEKAVSQQPVSI 333
Query: 251 GICGSERAFQLYSSGIF-TGPCSTSLDHAVLIVGY---DSENGV--------DYWIIKNS 298
I ++FQLY G++ + C + +DH VL+VGY D+ + +W +KNS
Sbjct: 334 AIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRHFWKVKNS 393
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTG 335
WG +WG G++ M R + G CGI SYPTK+
Sbjct: 394 WGGTWGEGGFIRMARRISDETGQCGITTAPSYPTKSA 430
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 271 bits (693), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 198/349 (56%), Gaps = 29/349 (8%)
Query: 3 SLAFFLLSILLLSSLPL----------NYCSDINELFETWCKQHGKAYSSEQEKQQRLKI 52
S + FLL++L++ S L + + E W +HG+AY E EK +RL++
Sbjct: 2 SASRFLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEV 61
Query: 53 FEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG 112
F N + N G S L+ N FADLT QEF+A+ G R R A G
Sbjct: 62 FRANAELIDSFNAAGTHSHRLATNRFADLTVQEFRAARTGL--------RPRPAPSAGAG 113
Query: 113 NLR-------DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLS 165
R D S+DWR GAVT VKDQ + G CWAFSA A+EG+NKI TG LVSLS
Sbjct: 114 RFRYENFSLADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLS 173
Query: 166 EQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVT 224
EQEL+DCD S + GC GGLMD A+QFV + G+ +E YPY+ + G C + +
Sbjct: 174 EQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQCRDGPC-RSSAAAAAAS 232
Query: 225 IDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY 284
I G++DVP NNE L AV QPVSV I G + AF+ Y SG+ G C T L+HA+ VGY
Sbjct: 233 IRGHEDVPRNNEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGY 292
Query: 285 DS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+ +G YW++KNSWG SWG GY+ ++R G+CG+ L SYP
Sbjct: 293 GTAADGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKLPSYPV 340
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDITKMSSYP 341
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 148/341 (43%), Positives = 209/341 (61%), Gaps = 22/341 (6%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
+ L L+ + +++ I E + T+ +H K Y E E++ RLKIF +N + +HN
Sbjct: 4 LYALLALVAVAQAVSFADVIKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQ 63
Query: 66 ---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDV 117
G +F +++N +AD+ H EF+ + GF+ R + S SP +++ +
Sbjct: 64 RYATGEVTFKMAVNKYADMLHHEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVK-L 122
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P S+DWR+KGAVT VKDQ CG+CWAFS+TGA+EG + TG+LVSLSEQ L+DC Y
Sbjct: 123 PKSVDWREKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYG 182
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPEN 234
N+GC GGLMD A++++ N GIDTEK YPY G C+ N+ V T G+ D+P+
Sbjct: 183 NNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCH---FNKDSVGATDRGFADIPQG 239
Query: 235 NEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGV 290
NEK++ +AV PVSV I S +FQ YS GI+ P S +LDH VL+VGY + E+G
Sbjct: 240 NEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGK 299
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG +WG G++ M RN N CGI +SYP
Sbjct: 300 DYWLVKNSWGTTWGDKGFIKMARNEDNQ---CGIASASSYP 337
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 271 bits (692), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 145/335 (43%), Positives = 196/335 (58%), Gaps = 12/335 (3%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
L F L + ++ + P D + + FE W ++G+ Y EK +R +IF++N
Sbjct: 7 LVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVN 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+ NN +S+TL +N F D+T+ EF + G S ++ R S N+ V
Sbjct: 67 HIETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLP-LNFKREPVVSFDDV-NISAVG 124
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
SIDWR GAVTEVKDQ CG+CWAFSA +EGI KIVTG LVSLSEQE++DC S +
Sbjct: 125 QSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS--N 182
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG +D AY F+I N+G+ +E DYPY+ G C + I GY V N+E
Sbjct: 183 GCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSW-PNSAYITGYSYVRSNDESS 241
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKN 297
+ AV QP++ I S FQ Y+ G+F+GPC TSL+HA+ I+GY + +G YWI+KN
Sbjct: 242 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKN 301
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG SWG GY+ M R +S G+CGI M YPT
Sbjct: 302 SWGSSWGERGYVRMARGVSSS-GLCGIAMDPLYPT 335
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 271 bits (692), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YSGGTYDGSCADRINHAVTAIGYGTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 271 bits (692), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 136/328 (41%), Positives = 204/328 (62%), Gaps = 7/328 (2%)
Query: 11 ILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSS 70
+ +L ++ +DI+ +E + + G++Y+ E+E+ +R +F N + + N+ G++
Sbjct: 1 MRVLCAVVFAAVADIDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHT- 59
Query: 71 FTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVT 130
+TL +N FADLT +EF +++GF + + + N +P S+DW +GAVT
Sbjct: 60 YTLGVNQFADLTVEEFSKTYMGFKKPAQKYGDAAYLG-RHVYNGEALPTSVDWSSQGAVT 118
Query: 131 EVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAY 189
VK+Q CG+CW+FS TG++EG N+I TG LVSLSEQ+ +DC +Y N GC GGLMD A+
Sbjct: 119 PVKNQGQCGSCWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAF 178
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVAQP 247
++ N + TE+ YPY+G G C + + ++ GYKDV ++E+ ++ AV QP
Sbjct: 179 KYAEAN-ALCTEQSYPYKGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQP 237
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 307
VS+ I + FQLYS G+ TG C SLDH VL VGY + +G DYW +KNSWG +WGM+G
Sbjct: 238 VSIAIEADKSVFQLYSGGVLTGACGASLDHGVLAVGYGTLSGTDYWKVKNSWGSTWGMSG 297
Query: 308 YMHMQRNTGNSLGICGINMLASYPTKTG 335
Y+ +QR G S G CG+ SYP TG
Sbjct: 298 YVLLQRGKGGS-GECGLLSEPSYPQVTG 324
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 139/337 (41%), Positives = 198/337 (58%), Gaps = 10/337 (2%)
Query: 4 LAFFLLSILL---LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
L F + + + S + L S I + + W Q + Y E EKQ RL++ +N F+
Sbjct: 11 LTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFI 70
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGN--LRDVP 118
NNMGN S+ L +N F D T +EF A++ G ++ + N + DV
Sbjct: 71 ESFNNMGNQSYKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVL 130
Query: 119 AS-IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
+ DWR +GAVT VK Q CG CWAFSA A+EG+ KI G+L+SLSEQ+L+DC R N
Sbjct: 131 GTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQN 190
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
+GC GG A+ ++IK+ GI +E +YPY+ + G C R + I G+++VP NNE+
Sbjct: 191 NGCKGGTFVNAFNYIIKHRGISSENEYPYQVKEGPCRSNA--RPAILIRGFENVPSNNER 248
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGY-DSENGVDYWII 295
LL+AV QPV+V I SE F YS G++ C TS++HAV +VGY S G+ YW+
Sbjct: 249 ALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLA 308
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG++WG NGY+ ++R+ G+CG+ ASYP
Sbjct: 309 KNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPV 345
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 270 bits (691), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 191/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y+G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYQGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 270 bits (691), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 143/309 (46%), Positives = 201/309 (65%), Gaps = 11/309 (3%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E ++ W ++ Y + E+++ ++IF+ N A++ N GN S+ L++N FADL +
Sbjct: 35 LSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAYIDSFNAAGNKSYKLTINRFADLPTE 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
S GF ++ +S+ N+ D+PA++DWRK+GAVT VK+Q CG+CWAF
Sbjct: 95 ---PSDDGFKKRKLEPT---TSSLFKYKNITDIPAAVDWRKRGAVTPVKNQRECGSCWAF 148
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
SA GA+EGI +I +G+LVSLSEQEL+D RS + +GC GG + A++FV++N GI TE
Sbjct: 149 SAVGALEGIQQITSGNLVSLSEQELVDRVRSNWTNGCNGGYLIDAFEFVLENGGIATEAS 208
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPYRG G N +K++R V I Y+ VP N+E LL+ V QPVSVGI S + YS
Sbjct: 209 YPYRGVKGN-NSKKVSRQ-VQIKSYEQVPRNSEDSLLKVVANQPVSVGIDISG-MIRFYS 265
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SGIFTG C T +HAV+IVGY + N G YW++KNSWG WG Y+ M+R+ G+C
Sbjct: 266 SGIFTGECGTKPNHAVIIVGYGTSNDGTKYWLVKNSWGIRWGEKRYIRMKRDIDAKEGLC 325
Query: 323 GINMLASYP 331
GI M ASYP
Sbjct: 326 GIPMDASYP 334
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 270 bits (691), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 270 bits (691), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 198/337 (58%), Gaps = 29/337 (8%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
LAFF + L ++ LN S + E W Q+ + Y EK +R ++F+ N F+
Sbjct: 14 LAFFCGAAL--AARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESF 71
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHD---RRRNASVQSPGNLRDVP 118
N GN F L +N FADLT+ EF+A+ GF + + R N SV + +P
Sbjct: 72 NAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVSTGFRYENVSVDA------LP 125
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYN 177
A+IDWR KGAVT +KDQ C EGI KI TG L+SLSEQEL+DCD +
Sbjct: 126 ATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVHGED 173
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A++F+IKN G+ TE YPY G+C + + T+ G++DVP N+E
Sbjct: 174 QGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATVKGFEDVPANDEA 231
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIK 296
L++AV QPVSV + G + FQ YS G+ TG C T LDH + +GY + +G YW++K
Sbjct: 232 ALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLK 291
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG +WG NGY+ M+++ + G+CG+ M SYPT+
Sbjct: 292 NSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTE 328
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 270 bits (691), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 270 bits (691), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 198/313 (63%), Gaps = 13/313 (4%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
++ +FE W +H K Y++ EK++R +IF++N F+ + N++ N ++ L LN FADLT
Sbjct: 39 DEVMSMFEEWLVKHDKVYNALGEKEKRFQIFKNNLRFIDERNSL-NRTYKLGLNVFADLT 97
Query: 83 HQEFKASFLGF--SAASIDHDRR-RNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ-ASC 138
+ E++A +L +D D RN V G+ +P S+DWRK+GAVT VK+Q A+C
Sbjct: 98 NAEYRAMYLRTWDDGPRLDLDTPPRNRYVPRVGDT--IPKSVDWRKEGAVTPVKNQGATC 155
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
+CWAF+A GA+E + KI TG L+SLSEQE++DC S + GCGGG + + Y ++ KN GI
Sbjct: 156 NSCWAFTAVGAVESLVKIKTGDLISLSEQEVVDCTTSSSRGCGGGDIQHGYIYIRKN-GI 214
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
EKDYPYRG G+C+ K N IVTIDG+ VP E+ L Q + QPV+V I +
Sbjct: 215 SLEKDYPYRGDEGKCDSNKKN-AIVTIDGHGWVPTQLEEALKQGIANQPVAVPIPADDYE 273
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
FQ Y+SG+F G C T L+HA+L+VGY +E DYWI KNS+ WG NGY+ +QR
Sbjct: 274 FQYYTSGVFKGKCGTELNHALLLVGYGAEKDGDYWIAKNSYSDKWGENGYIRIQR----K 329
Query: 319 LGICGINMLASYP 331
L C YP
Sbjct: 330 LSTCKFGNGGYYP 342
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 270 bits (691), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 141/295 (47%), Positives = 186/295 (63%), Gaps = 15/295 (5%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
LFE+W +H K Y + EK R + F+DN ++ + N N+S+ L LN FADLTH EFK
Sbjct: 47 LFESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDE-TNKKNNSYWLGLNEFADLTHDEFK 105
Query: 88 ASFLGFSAASIDHDR---RRNASVQSPG-NLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
++G SI D ++ V+ P ++ D P SIDWR+KGAVT VK+Q CG+CWA
Sbjct: 106 EKYVG----SIPEDSMIIEQSDDVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWA 161
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS +EGINKIVTG+L+SLSEQEL+DCDR + GC GG + ++V+ N G+ TEK+
Sbjct: 162 FSTVATVEGINKIVTGNLISLSEQELLDCDRR-SHGCKGGYQTTSLKYVVDN-GVHTEKE 219
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY + G C + V I+GYK VP N+E L++ + QPVSV + R FQ Y
Sbjct: 220 YPYEKKQGNCRAKNKKGLKVYINGYKRVPSNDEISLIKTISIQPVSVLVESKGRPFQFYK 279
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
G+F GPC T LDHAV VGY G DY +IKNSWG WG GY+ ++R +G S
Sbjct: 280 GGVFGGPCGTKLDHAVTAVGY----GKDYILIKNSWGPKWGDKGYIKIKRASGQS 330
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 147/333 (44%), Positives = 205/333 (61%), Gaps = 14/333 (4%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L + +S+ + F+ W +H K+Y+++ E R IF+DN FVT+
Sbjct: 6 ALVFCFLIVNCISAARVFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYTIFQDNMDFVTK 64
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
N G+ + L LN+ ADLT+QE++ +LG ++ + ++ PAS+D
Sbjct: 65 WNQKGSDTI-LGLNSMADLTNQEYQRIYLGTKTTV-----KKPNLIIGVTDVSKAPASVD 118
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCG 181
WR GAVT VK+Q CG C++FS TG++EGI++I + LVSLSEQ+++DC S N+GC
Sbjct: 119 WRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSGSEGNNGCD 178
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLM +++++I G+DTE YPY G G+C K N TI GYK+V +E L
Sbjct: 179 GGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANIG-ATITGYKNVKSGSESDLQT 237
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSW 299
AV AQPVSV I S+ +FQLYSSG++ P ST LDH VL VGY S++G DYWI+KNSW
Sbjct: 238 AVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYGSQSGQDYWIVKNSW 297
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G WG G++ M RN N+ CGI +ASYPT
Sbjct: 298 GADWGEKGFILMARNKHNN---CGIATMASYPT 327
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 198/337 (58%), Gaps = 13/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L ++L+ S L+ + + + + + H K Y S+ E++ R+KI+ +N V +
Sbjct: 5 TLIFLLAAVLVQLSAALSLTNLLADEWHLFKATHKKEYPSQLEEKLRMKIYLENKHKVAK 64
Query: 63 HNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGNLRDVP 118
HN + G S+ +++N F DL H EF++ G+ + R + + P N+ +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANV-EVP 123
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
S+DWR+KGA+T VKDQ CG+CWAFS+TGA+EG TG LVSLSEQ LIDC Y N
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 183
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A+Q++ N GIDTE YPY + G C NR V G+ D+P E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDGVCRYNPRNRGAVD-RGFVDIPSGEED 242
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWI 294
+L AV PVSV I S +FQ YS G + P S LDH VL+VGY S+NG DYW+
Sbjct: 243 KLKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYGSDNGEDYWL 302
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSW WG GY+ + RN N CG+ ASYP
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNH---CGVATAASYP 336
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 191/312 (61%), Gaps = 9/312 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAAS--IDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGA 140
EF A F G + + + + + +L D +P+++DWR+ GAVT+VK Q CG
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGC 154
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI
Sbjct: 155 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISR 213
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 ESDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQ 270
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NGYM + R++G+
Sbjct: 271 FYAGGTYDGNCADRINHAVTAIGYGTDEEGQKYWLLKNSWGTSWGENGYMKIIRDSGDPS 330
Query: 320 GICGINMLASYP 331
G+C I ++SYP
Sbjct: 331 GLCDIAKMSSYP 342
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 198/315 (62%), Gaps = 12/315 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E FE W ++G+ Y+ EK +R +IF++N + NN +S+TL +N F D+T+ EF
Sbjct: 8 ERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMTNNEF 67
Query: 87 KASFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
A + G AS+ + R+ V ++ VP SIDWR GAVT VK+Q SCG+CWAFS
Sbjct: 68 LARYTG---ASLPLNIERDPVVSFDDVDISAVPQSIDWRDYGAVTSVKNQGSCGSCWAFS 124
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A +EGI KI G+L+SLSEQE++DC SY GC GG ++ AY F+I N+G+ + + P
Sbjct: 125 AIATVEGIYKIKAGNLISLSEQEVLDCALSY--GCDGGWVNKAYDFIISNNGVTSFANLP 182
Query: 206 YRGQAGQCNKQKL-NRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
Y+G G CN L N+ +T GY V NNE+ ++ AV QP++ + + FQ Y S
Sbjct: 183 YKGYKGPCNHNDLPNKAYIT--GYTYVQSNNERSMMIAVANQPIAA-LIDAGGDFQYYKS 239
Query: 265 GIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
G+FTG C TSL+HA+ ++GY + +G YWI+KNSWG SWG GY+ M R+ + G+CG
Sbjct: 240 GVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSSPYGLCG 299
Query: 324 INMLASYPT-KTGQN 337
I M +PT ++G N
Sbjct: 300 IAMAPLFPTLQSGAN 314
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 193/311 (62%), Gaps = 8/311 (2%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
FE W ++G+ Y EK +R +IF++N + N+ +S+TL +N F D+T EF A
Sbjct: 10 FEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMTKSEFVA 69
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+ G S ++ +R S N+ VP SIDWR GAV EVK+Q CG+CWAF+A
Sbjct: 70 QYTGVSLP-LNIEREPVVSFDDV-NISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIA 127
Query: 149 AIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
+EGI KI TG LVSLSEQE++DC SY GC GG ++ AY F+I N+G+ TE++YPY+
Sbjct: 128 TVEGIYKIKTGYLVSLSEQEVLDCAVSY--GCKGGWVNKAYDFIISNNGVTTEENYPYQA 185
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
G CN + I GY V N+E+ ++ AV QP++ I SE FQ Y+ G+F+
Sbjct: 186 YQGTCNANSF-PNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASEN-FQYYNGGVFS 243
Query: 269 GPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
GPC TSL+HA+ I+GY + +G YWI++NSWG SWG GY+ M R +S G CGI M
Sbjct: 244 GPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSGACGIAMS 303
Query: 328 ASYPT-KTGQN 337
+PT ++G N
Sbjct: 304 PLFPTLQSGAN 314
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 193/312 (61%), Gaps = 15/312 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTH 83
E + + HGK Y ++ E+ R+KIF DN + HN G S+ + +N F DL
Sbjct: 25 EEWHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMV 84
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
EFKA GF + D +RN + P N ++P ++DWR+KGAVT VKDQ CG+CW+
Sbjct: 85 HEFKALMNGFKMSP---DTKRNGELYFPSN-SNLPKTVDWRQKGAVTPVKDQGQCGSCWS 140
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEK 202
FSATG++EG + TG LVSLSEQ L+DC SY N+GC GGLMD A+Q+V N GIDTE
Sbjct: 141 FSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGIDTEA 200
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQL 261
YPY + C +K N+ T G+ D+P +EK L A+ P+SV I + +FQ
Sbjct: 201 SYPYEARENTCRFKK-NKVGGTDKGHVDIPAGDEKALQNALATVGPISVAIDANHGSFQF 259
Query: 262 YSSGIFTGP-CST-SLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G++ P CS+ LDH VL VGY +ENG DYW++KNSWG SWG NGY+ + RN N
Sbjct: 260 YSKGVYNEPNCSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGENGYIKIARNHSNH- 318
Query: 320 GICGINMLASYP 331
CGI +ASYP
Sbjct: 319 --CGIASMASYP 328
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 150/339 (44%), Positives = 206/339 (60%), Gaps = 17/339 (5%)
Query: 5 AFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
A FLL +L ++ +++ + + E + T+ H KAY S+ E+ R+KIF +N+ + HN
Sbjct: 4 AIFLLLGILAAAQAISFFNLVTEEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIALHN 63
Query: 65 ---NMGNSSFTLSLNAFADLTHQEFKASFLGFS---AASIDHDRRRNAS-VQSPGNLRDV 117
+ S+ L +N + D+ H EF + GF+ +A + RR S P N+ ++
Sbjct: 64 QKYELNEVSYKLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANV-EI 122
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P+S+DWR GAVT +KDQ CG+CW+FSATGA+EG + +TG LVSLSEQ LIDC Y
Sbjct: 123 PSSVDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGRYG 182
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N+GC GGLMD A+Q++ NHG+DTE YPY + +C N T GY D+PE NE
Sbjct: 183 NNGCNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKCRYNPRNNG-ATDSGYVDIPEGNE 241
Query: 237 KQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDY 292
K+L AV PVSV I S +FQ Y G++ P S +LDH VL+VGY + +N DY
Sbjct: 242 KKLKAAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGTDDNDQDY 301
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
W++KNSWG +WG GY+ M RN N CGI ASYP
Sbjct: 302 WLVKNSWGVTWGDEGYIKMARNKDNH---CGIASSASYP 337
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 190/312 (60%), Gaps = 9/312 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAAS--IDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGA 140
EF A F G + + + + + +L D +P+++DWR+ GAVT+VK Q CG
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGC 154
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA G++EG KI TG L+ SEQEL+DC + N GC GG M A+ F+I+N GI
Sbjct: 155 CWAFSAVGSLEGAYKIATGKLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISR 213
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 ESDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQ 270
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN
Sbjct: 271 FYAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS 330
Query: 320 GICGINMLASYP 331
G+C I ++SYP
Sbjct: 331 GLCDIAKMSSYP 342
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 128/224 (57%), Positives = 153/224 (68%), Gaps = 1/224 (0%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
VPAS+DWRKKGAVT VKDQ CG+CWAFS A+EGIN+I T LVSLSEQEL+DCD
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMDYA++F+ + GI TE +YPY G C+ K N V+IDG+++VPEN+E
Sbjct: 62 NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWII 295
LL+AV QPVSV I FQ YS G+FTG C T LDH V IVGY + +G YW +
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTV 181
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPP 339
KNSWG WG GY+ M+R + G+CGI M ASYP K N P
Sbjct: 182 KNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPIKKSSNNP 225
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 192/310 (61%), Gaps = 10/310 (3%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQE 85
N F++W HG +Y++ E+ R I+ N F+ +HN+ G+S + L++N FADLT+ E
Sbjct: 19 NPCFDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHS-YKLAVNKFADLTYPE 77
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
F A +LG + + + AS P + +P S+DWR G VT +KDQ CG+CW+FS
Sbjct: 78 FAAKYLGLRFDATNATKSFAASTYLP-RMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFS 136
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
TG++EG + TG LVSLSEQ L+DC + N+GC GGLMD A+Q++I N+GIDTE Y
Sbjct: 137 TTGSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSY 196
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYS 263
PY Q G C N T+ Y+D+ +E L AV P+SV I S+ +FQ YS
Sbjct: 197 PYTAQDGTCQFNSANVG-ATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYS 255
Query: 264 SGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
SG++ P CS+S LDH VL VGY + DYW++KNSWG SWG +GY+ M RN+ N
Sbjct: 256 SGVYNEPACSSSQLDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQ--- 312
Query: 322 CGINMLASYP 331
CGI ASYP
Sbjct: 313 CGIATAASYP 322
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 149/341 (43%), Positives = 206/341 (60%), Gaps = 19/341 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L FL+ +L ++ +++ +N+ + T+ +H K Y ++ E++ R+KIF DN + +H
Sbjct: 3 LFLFLIVAVLATAQAISFFELVNQEWTTFKMEHNKVYKNDVEERFRMKIFMDNKHKIAKH 62
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRN-----ASVQSPGNLR 115
N M S+ L +N + D+ H EF + GF+ SI+ R AS P N+
Sbjct: 63 NGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNK-SINTQLRSERLPIAASFIEPANVV 121
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P ++DWR+ GAVT VKDQ CG+CW+FSATGA+EG + TG L+ LSEQ LIDC
Sbjct: 122 -LPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGK 180
Query: 176 Y-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
Y N+GC GGLMD A+Q++ N G+DTE YPY + +C N + GY D+P+
Sbjct: 181 YGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDV-GYVDIPQG 239
Query: 235 NEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGV 290
NEK+L AV PVSV I S ++FQ YS G++ P S +LDH VL VGY + ENG
Sbjct: 240 NEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYGTDENGQ 299
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG +WG NGY+ M R N L CGI ASYP
Sbjct: 300 DYWLVKNSWGETWGDNGYIKMAR---NKLNHCGIASTASYP 337
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPVSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 197/337 (58%), Gaps = 29/337 (8%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
LAFF + L ++ LN S + E W Q+ + Y EK +R ++F+ N F+
Sbjct: 14 LAFFCGAAL--AARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESF 71
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHD---RRRNASVQSPGNLRDVP 118
N GN F L +N FADLT+ EF+A+ GF + + R N SV + +P
Sbjct: 72 NAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVPTGFRYENVSVDA------LP 125
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYN 177
A+IDWR KGAVT +KDQ C EGI KI TG L+SLSEQEL+DCD +
Sbjct: 126 ATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVHGED 173
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A+QF+IKN G+ TE YPY G+C + + T+ G++DVP N+E
Sbjct: 174 QGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATVKGFEDVPANDEA 231
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIK 296
L++AV QPVSV + G + FQ YS G+ TG C T LDH + +GY + +G YW++K
Sbjct: 232 ALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLK 291
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG +WG NGY+ M+++ + G+CG+ M SYP +
Sbjct: 292 NSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPIE 328
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 199/307 (64%), Gaps = 16/307 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F+ W +H K+Y+++ E R +F+DN V + N G+++ L LN ADLT++EFK
Sbjct: 32 FQNWMVKHQKSYTND-EFGSRYSVFQDNMDIVAKWNQKGSNTI-LGLNVMADLTNEEFKK 89
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+LG + A++ + ++ V +PAS+DWR GAVT VK+Q CG C+AFS TG
Sbjct: 90 LYLG-TKANVTYKKKTLVGVSG------LPASVDWRANGAVTAVKNQGQCGGCYAFSTTG 142
Query: 149 AIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
++EGI++I + LV LSEQ+++DC S N+GC GGLM +++++I G+DTE YPY
Sbjct: 143 SVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYT 202
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G+ G+C K N TI GYK+V +E L AV AQPVSV I S+ +FQLY+SG++
Sbjct: 203 GEVGKCKFNKKNIG-ATITGYKNVESGSESDLQTAVAAQPVSVAIDASQSSFQLYASGVY 261
Query: 268 TGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
P ST LDH VL VGY S++G DYWI+KNSWG WG NG++ M RN N+ CGI
Sbjct: 262 YEPECSSTQLDHGVLAVGYGSQSGQDYWIVKNSWGADWGENGFILMARNKDNN---CGIA 318
Query: 326 MLASYPT 332
+AS+PT
Sbjct: 319 TMASFPT 325
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 190/313 (60%), Gaps = 19/313 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDV-----PASIDWRKKGAVTEVKDQASCG 139
EF A F G + + S SP + D+ P+++DWR+ GAVT+VK+Q CG
Sbjct: 95 EFLAKFTGLNIPN---------SYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCG 145
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
CWAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI
Sbjct: 146 CCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGIS 204
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
E DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+
Sbjct: 205 RESDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-L 261
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
Q Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG +G+M + R++GN
Sbjct: 262 QFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNP 321
Query: 319 LGICGINMLASYP 331
G+C I ++SYP
Sbjct: 322 AGLCDIAKVSSYP 334
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 190/313 (60%), Gaps = 19/313 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDV-----PASIDWRKKGAVTEVKDQASCG 139
EF A F G + + S SP + D+ P+++DWR+ GAVT+VK+Q CG
Sbjct: 95 EFLAKFTGLNIPN---------SYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCG 145
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
CWAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI
Sbjct: 146 CCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGIS 204
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
E DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+
Sbjct: 205 RESDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-L 261
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
Q Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG +G+M + R++GN
Sbjct: 262 QFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNP 321
Query: 319 LGICGINMLASYP 331
G+C I ++SYP
Sbjct: 322 AGLCDIAKVSSYP 334
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/311 (46%), Positives = 195/311 (62%), Gaps = 12/311 (3%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
F W + G++Y+S E+ +R++I+ N V HN M G+S++ L + +ADL H+E
Sbjct: 26 FHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEE 85
Query: 86 FKASFLGFSAASIDHDR-RRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
FK + G S + + R +S ++P +IDWR+ G VT VK+Q SCG+CW+F
Sbjct: 86 FKQTVFGVCLGSFNASKPRGGSSFLKMHRFYNLPQTIDWRQWGFVTPVKNQGSCGSCWSF 145
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKD 203
S+TGA+EG N TG LVSLSEQEL+DC +Y N GC GG MD A+++++ GI TE
Sbjct: 146 SSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGIHTEDS 205
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLY 262
YPY GQ GQC + T GY D+P NE L +AV PVSV I S+++FQLY
Sbjct: 206 YPYEGQVGQC-RANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQSFQLY 264
Query: 263 SSGIFTGP-CS-TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
SG++ P CS T+LDHAVLIVGY +E G DYW++KNSWG +WG GY+ M RN N
Sbjct: 265 HSGVYNNPYCSGTALDHAVLIVGYGTEYGQDYWLVKNSWGPAWGDQGYIKMSRNRYNQ-- 322
Query: 321 ICGINMLASYP 331
CGI AS+P
Sbjct: 323 -CGIASAASFP 332
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 188/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF+ N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKKNMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGKLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAEGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 124/245 (50%), Positives = 173/245 (70%), Gaps = 5/245 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
++ ++ W +HG Y++ E+++R + F DN ++ QHN + G SF L LN FA
Sbjct: 37 EEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFA 96
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
DLT++E+++++LG + D +R+ +A Q+ N ++P S+DWRKKGAV VKDQ CG
Sbjct: 97 DLTNEEYRSTYLG-ARTKPDRERKLSARYQAADN-DELPESVDWRKKGAVGAVKDQGGCG 154
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFSA A+EGIN+IVTG ++ LSEQEL+DCD SYN GC GGLMDYA++F+I N GID
Sbjct: 155 SCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGID 214
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
+E+DYPY+ + +C+ K N +VTIDGY+DVP N+EK L +AV QP+SV I RAF
Sbjct: 215 SEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAF 274
Query: 260 QLYSS 264
QLY S
Sbjct: 275 QLYKS 279
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 188/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T +
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSE 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + S + N D+P+++DWR+ GAVT+VK+Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMPSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q V I Y+ VPE E LLQAV QPVS+GI S Q
Sbjct: 214 SDYEYLGQQYTCRSQG-KTAAVQISNYQVVPE-GETSLLQAVTKQPVSIGIAAS-HDLQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCANRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
Length = 357
Score = 268 bits (685), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 149/313 (47%), Positives = 196/313 (62%), Gaps = 15/313 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQ 84
+LF+ W K+HG Y +E +R +IF N ++ + N +S + L LN FAD +
Sbjct: 50 QLFQLWRKEHGLVYKDLKEMAKRFEIFLSNLNYIIEFNAKRSSPSGYLLGLNNFADWSPS 109
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EF+ +L S+D + P PAS+DWR K AVT +K+Q SCG+CWAF
Sbjct: 110 EFQEIYL----HSLDMPTDSAPKLNGPLLSCIAPASLDWRNKVAVTAIKNQGSCGSCWAF 165
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
SA GAIEGI+ I TG L+SLSEQEL++CDR + GC GG ++ A+ +VI N GI E +Y
Sbjct: 166 SAAGAIEGIHAITTGELISLSEQELVNCDR-VSKGCNGGWVNKAFDWVISNGGITLEAEY 224
Query: 205 PYRGQ-AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
PY G+ G CN K TIDGY+ V E ++ LL ++V QP+S IC + FQLY
Sbjct: 225 PYTGKDGGNCNSDKQVPIKATIDGYEQV-EQSDNGLLCSIVKQPIS--ICLNATDFQLYE 281
Query: 264 SGIFTG-PCSTS---LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
SGIF G CS+S +H VLIVGYDS NG DYWI+KNSWG WG+NGY+ ++RNTG
Sbjct: 282 SGIFDGQQCSSSSKYTNHCVLIVGYDSSNGEDYWIVKNSWGTKWGINGYIWIKRNTGLPY 341
Query: 320 GICGINMLASYPT 332
G+CG+N A PT
Sbjct: 342 GVCGMNAWAYNPT 354
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 268 bits (685), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 191/310 (61%), Gaps = 7/310 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T +
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSE 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACW 142
EF F G + S +++ +L D +P+++DWR+ GAVT+VK+Q CG CW
Sbjct: 95 EFLTKFTGINIPSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCW 154
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI +E
Sbjct: 155 AFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISSES 213
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DY Y+GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q Y
Sbjct: 214 DYEYQGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQFY 270
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 AGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPGGH 330
Query: 322 CGINMLASYP 331
C I ++SYP
Sbjct: 331 CDIAKMSSYP 340
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 268 bits (685), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENIKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI +E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 149/330 (45%), Positives = 197/330 (59%), Gaps = 22/330 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
I F+ W HGKAY+ +E+ +RL IF DN FV HN G S L LN ADL
Sbjct: 66 IEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLADL 125
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLR-----DV--PASIDWRKKGAVTEVKD 134
T +EFK LG+ A+ ++R S P + DV P ++DW +GAVT VK+
Sbjct: 126 TREEFK-HMLGYDAS-----KKRVESSSPPVDAANWEYADVTPPETMDWVSRGAVTPVKN 179
Query: 135 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVI 193
Q CG+CWAFS GA+EG+ + TG L+SLSEQEL+ C + N+GC GGLMD +++++
Sbjct: 180 QGQCGSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIV 239
Query: 194 KNHGIDTEKDYPYRGQAGQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGI 252
+N G+D E+D+ Y + +CN K R +IDG+KDVP N+E L +AV QPV+V I
Sbjct: 240 ENRGVDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAI 299
Query: 253 CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY----DSENGVDYWIIKNSWGRSWGMNGY 308
R FQLYS G+F G C T+LDH VL+VGY +S YW +KNSWG WG GY
Sbjct: 300 EADHREFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGY 359
Query: 309 MHMQRNTGNSLGICGINMLASYPTKTGQNP 338
+ + R G CG+ M ASYPTK+ P
Sbjct: 360 IRIARGGMGPAGQCGVAMQASYPTKSSSAP 389
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 146/321 (45%), Positives = 199/321 (61%), Gaps = 19/321 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
I E + T+ QH K Y++E E++ R+KIF +N + +HN + G S+ L LN +AD+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQS---PGNLRDVPASIDWRKKGAVTEVKDQASC 138
H EFK + G++ R R V + P VP S+DWR+ GAVT VKDQ C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 197
G+CWAFS+TGA+EG + G LVSLSEQ L+DC Y N+GC GGLMD A++++ N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVAQ-PVSVGICG 254
IDTEK YPY G C+ N+ + T G+ D+PE +E+++ +AV PVSV I
Sbjct: 204 IDTEKSYPYEGIDDSCH---FNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDA 260
Query: 255 SERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHM 311
S +FQLYS G++ P +LDH VL+VGY + E+G+DYW++KNSWG +WG GY+ M
Sbjct: 261 SHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKM 320
Query: 312 QRNTGNSLGICGINMLASYPT 332
RN N CGI +SYPT
Sbjct: 321 ARNQNNQ---CGIATASSYPT 338
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK+Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYVSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG +G+M + R++GN G
Sbjct: 271 YAGGTYDGSCANRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKVSSYP 341
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T +
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSE 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK+Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIRENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + ENG YW++KNSWG SWG G+M + R+ GN G
Sbjct: 271 YAGGTYDGSCANRINHAVTAIGYGTDENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNPSG 330
Query: 321 ICGINMLASYP 331
+C I L+SYP
Sbjct: 331 LCDIAKLSSYP 341
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 191/320 (59%), Gaps = 4/320 (1%)
Query: 15 SSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLS 74
+S PL+ S + E E W ++ + Y + E+++R +F+DN F+ + GN L
Sbjct: 22 TSRPLHEAS-MYERHEQWMARYSRNYKDDAEEERRFXMFKDNVDFIQTFDTAGNMPNKLG 80
Query: 75 LNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKD 134
+NA AD+TH+EF+AS F R S + N+ +P+++DWRKK VT +K+
Sbjct: 81 VNALADMTHEEFRASGNTFKIPPNLGLRSETTSFRHQ-NVTRIPSTMDWRKKRTVTHIKN 139
Query: 135 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVI 193
Q CG CWAFSA A+EGI K+ T +SLSEQEL+DCD N GC GG MD A++F+I
Sbjct: 140 QLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDAFKFII 199
Query: 194 KNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGIC 253
+N G+++E Y Y+G G CNK+K + I+ Y+++PE +EK LL+ V QP+SV I
Sbjct: 200 QNRGLNSEARYLYKGVEGHCNKKKESSRAARINDYENMPEFSEKALLKVVAHQPISVAID 259
Query: 254 GSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
AFQ Y GI T LD+ V GY S +G +W++KNSWG WG NGY M+
Sbjct: 260 AGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSADGKKHWLVKNSWGTDWGENGYTRME 319
Query: 313 RNTGNSLGICGINMLASYPT 332
R + G+CG M ASYPT
Sbjct: 320 RGVKATTGLCGFTMQASYPT 339
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 192/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGN-LRD--VPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N L D +P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GGLM A+ F+I+N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGLMTNAFDFIIENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C + + V I YK VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTC-RSREKTAAVQISSYKVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YAGGTYDGNCADQINHAVTAIGYGTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 267 bits (682), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 126/242 (52%), Positives = 168/242 (69%), Gaps = 10/242 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++E W +HGK+Y+ EK +R +IF+DN F+ +HN + NS++ L L FADLT++E++
Sbjct: 54 MYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYR 112
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNL------RDVPASIDWRKKGAVTEVKDQASCGAC 141
+ FLG ID +RR S N +P S+DWRK+GAV VKDQASCG+C
Sbjct: 113 SKFLG---TKIDPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSC 169
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA A+EGINKIVTG L+SLSEQEL+DCD SYN GC GGLMDYA++F+I N GID+E
Sbjct: 170 WAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSE 229
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DYPY+ G+C++ + N +VTID Y+DVP +E L +AV QP++V + G R FQL
Sbjct: 230 DDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQL 289
Query: 262 YS 263
Y
Sbjct: 290 YE 291
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 267 bits (682), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 197/337 (58%), Gaps = 13/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L ++L+ S L+ + + + + + H K Y S+ E++ R+KI+ +N V +
Sbjct: 1 TLIFLLGAVLVQLSAALSLTNLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60
Query: 63 HNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGNLRDVP 118
HN + G S+ +++N F DL H EF++ G+ + R + + P N+ VP
Sbjct: 61 HNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVT-VP 119
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
S+DWR+KGA+T VKDQ CG+CWAFS+TGA+EG TG LVSLSEQ LIDC Y N
Sbjct: 120 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 179
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A+Q++ N GIDTE YPY + C NR V G+ D+P E
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEED 238
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWI 294
+L AV PVSV I S +FQ YS G++ P S LDH VL+VGY S+NG DYW+
Sbjct: 239 KLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWL 298
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSW WG GY+ M RN N CG+ ASYP
Sbjct: 299 VKNSWSEHWGDEGYIKMARNRKNH---CGVASAASYP 332
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 267 bits (682), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 144/337 (42%), Positives = 198/337 (58%), Gaps = 13/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L ++L+ S L+ + + + + + H K Y S+ E++ R+KI+ +N V +
Sbjct: 5 TLIFLLGAVLVQLSAALSLTNLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64
Query: 63 HNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGNLRDVP 118
HN + G S+ +++N F DL H EF++ G+ + R + + P N+ +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANV-EVP 123
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
S+DWR+KGA+T VKDQ CG+CWAFS+TGA+EG TG L+SLSEQ LIDC Y N
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A+Q++ N GIDTE YPY + C NR V G+ D+P E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEED 242
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWI 294
+L AV PVSV I S +FQ YS G++ P S LDH VL+VGY S+NG DYW+
Sbjct: 243 KLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWL 302
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSW WG GY+ + RN N CG+ ASYP
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNH---CGVATAASYP 336
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 267 bits (682), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 148/341 (43%), Positives = 205/341 (60%), Gaps = 19/341 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L L+ +L ++ +++ +N+ + T+ +H K Y ++ E++ R+KIF DN + +H
Sbjct: 3 LFLLLIVAILATAQAISFFELVNQEWTTFKMEHNKVYKNDIEERFRMKIFMDNKHKIAKH 62
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRN-----ASVQSPGNLR 115
N M S+ L +N + D+ H EF + GF+ SI+ R AS P N+
Sbjct: 63 NGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNK-SINTQLRSERLPIGASFIEPANVV 121
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P ++DWR+ GAVT VKDQ CG+CW+FSATGA+EG + TG L+ LSEQ LIDC
Sbjct: 122 -LPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGK 180
Query: 176 Y-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
Y N+GC GGLMD A+Q++ N G+DTE YPY + +C N + GY D+P+
Sbjct: 181 YGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDV-GYVDIPQG 239
Query: 235 NEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGV 290
NEK+L AV PVSV I S ++FQ YS G++ P S +LDH VL VGY + ENG
Sbjct: 240 NEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYGTDENGQ 299
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG +WG NGY+ M R N L CGI ASYP
Sbjct: 300 DYWLVKNSWGETWGDNGYIKMAR---NKLNHCGIASTASYP 337
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 267 bits (682), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 157/351 (44%), Positives = 208/351 (59%), Gaps = 32/351 (9%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCK---QHGKAYSSEQEKQQRLKIFEDNYAFV 60
+ FLL + L++ N S N + E W QH K Y SE E++ R+KI+ N +
Sbjct: 1 MKLFLLLVSFLAAA--NAVSIFNLVKEEWNAFKLQHRKKYDSESEERIRMKIYVQNKHKI 58
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGF----SAASIDHDRRRNASVQSP-- 111
+HN ++G F L +N +ADL H+EF + GF +A S R + +++ P
Sbjct: 59 AKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSKLLGREQLMTIEEPIT 118
Query: 112 ----GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
N+ DVP +IDWR+KGAVT VKDQ CG+CW+FSATGA+EG + TG LVSLSEQ
Sbjct: 119 WIEPANV-DVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQ 177
Query: 168 ELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQC--NKQKLNRHIVT 224
L+DC Y N+GC GGLMD A+Q+V N GIDTEK YPY +C N + + T
Sbjct: 178 NLVDCSTKYGNNGCNGGLMDNAFQYVKDNKGIDTEKAYPYEAIDDECHYNPKAIG---AT 234
Query: 225 IDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLI 281
G+ D+P+ +EK L +A+ PVSV I S +FQ YS G++ P S LDH VL
Sbjct: 235 DKGFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLA 294
Query: 282 VGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
VGY +E+G DYW++KNSWG +WG GY+ M RN N CGI ASYP
Sbjct: 295 VGYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARNRENH---CGIATTASYP 342
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 267 bits (682), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 191/342 (55%), Gaps = 15/342 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELF---------ETWCKQHGKAYSSEQEKQQRLKIFE 54
+ + I+L + ++ + +F E W + + Y E EK R +F+
Sbjct: 5 MVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFK 64
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA---SFLGFSAASIDHDRRRNASVQSP 111
N F+ N GN S+ L +N FAD T++EF A G + S + S Q+
Sbjct: 65 KNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTW 124
Query: 112 GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
V S DWR +GAVT VK Q CG CWAFSA A+EG+ KI G+LVSLSEQ+L+D
Sbjct: 125 NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLD 184
Query: 172 CDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231
CDR Y+ GC GG+M A+ +V++N GI +E DY Y+G G C R I G++ V
Sbjct: 185 CDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNA--RPAARISGFQTV 242
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGV 290
P NNE+ LL+AV QPVSV + + F YS G++ GPC TS +HAV VGY S++G
Sbjct: 243 PSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGT 302
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW+ KNSWG +WG GY+ ++R+ G+CG+ A YP
Sbjct: 303 KYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPV 344
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 267 bits (682), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 135/311 (43%), Positives = 186/311 (59%), Gaps = 27/311 (8%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W Q+ + Y EK QR ++F+ N F+ N GN F L +N FADLT+ EF+A+
Sbjct: 6 EQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTNDEFRAT 65
Query: 90 FL--GFSAASIDHD---RRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
GF + + R N SV + +PA+IDWR KGAVT +KDQ C
Sbjct: 66 KTNKGFKPSPVKVPTGFRYENISVDA------LPATIDWRTKGAVTPIKDQGQC------ 113
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
EGI KI TG L+SLSEQEL+DCD + GC GGLMD A++F+IK G+ TE
Sbjct: 114 ------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTTESS 167
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G+C + + + T+ G++DVP N+E L++AV QPVSV + G + FQ YS
Sbjct: 168 YPYTAADGKC--KSGSNSVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQFYS 225
Query: 264 SGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
G+ TG C T LDH + +GY + +G YW++KNSWG +WG NGY+ M+++ + G+C
Sbjct: 226 GGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMC 285
Query: 323 GINMLASYPTK 333
G+ M SYPT+
Sbjct: 286 GLAMEPSYPTE 296
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 267 bits (682), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 144/342 (42%), Positives = 193/342 (56%), Gaps = 18/342 (5%)
Query: 1 MNSLAFFLLSILLLSSLP----LNYCSD---INELFETWCKQHGKAYSSEQEKQQRLKIF 53
M S+ + +++ L ++ N SD ++FE W + GK Y EK+ R IF
Sbjct: 1 MTSIVLLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIF 60
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGN 113
DN F+ + + +N FADLT+ EF A++ G A H + P +
Sbjct: 61 RDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTG---AKPPHPKE----APRPVD 113
Query: 114 LRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD 173
P IDWR +GAVT VKDQ +CG+CWAF+A AIEG+ KI TG L LSEQEL+DCD
Sbjct: 114 PIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCD 173
Query: 174 RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVP 232
+ N GCGGG D A++ V GI E DY Y G G+C L H +I GY+ VP
Sbjct: 174 TNSN-GCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVP 232
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGV 290
N+E+QL AV QPV+V I S AFQ Y SG+F GPC S +HAV +VGY D +G
Sbjct: 233 PNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGK 292
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW+ KNSWG++WG GY+ ++++ G CG+ + YPT
Sbjct: 293 KYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYPT 334
>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
Length = 320
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 134/325 (41%), Positives = 195/325 (60%), Gaps = 34/325 (10%)
Query: 2 NSLAFFLLSILLLSSLPL----------NYCSDINELFETWCKQHGKAYSSEQEKQQRLK 51
N +A L+ ++++ + P + +I +FE W +HGK+YSS+ EK +R+
Sbjct: 4 NMIALILILLVVVGAAPFAIARPAALEDDRALEIKNMFEDWAAKHGKSYSSDWEKARRMT 63
Query: 52 IFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP 111
IF D A++ +HN + N++FTL LN F+DLT+ EF+A+++G DRR V
Sbjct: 64 IFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRANYVGKFKPPRYQDRRPAKDVDV- 122
Query: 112 GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
++ +P S+DWR++GAVT +KDQ CG+CWAFSA +IE + + T LVSLSEQ+LID
Sbjct: 123 -DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASIESAHFLATNQLVSLSEQQLID 181
Query: 172 CDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231
CD + + GC E+ YPY G AG CN K + I G+ V
Sbjct: 182 CD-TVDEGC-------------------QEEAYPYTGLAGSCNANK--NKVAEITGFNVV 219
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVD 291
++ L++AV PV+VGICGS++ FQ Y SGI +G C S DH VL++GY +E G+
Sbjct: 220 TKDKADALMKAVSKTPVTVGICGSDQNFQNYRSGILSGQCCNSRDHVVLVIGYGTEGGMP 279
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTG 316
YWIIKNSWG SWG +G+M +++ G
Sbjct: 280 YWIIKNSWGTSWGEDGFMKIEKKDG 304
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGHVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI +E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
Length = 214
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 122/215 (56%), Positives = 160/215 (74%), Gaps = 2/215 (0%)
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179
S+DWRKKG VTE+KDQ CG CWAFSA A+EG+ + TG+LVSLSEQEL+DCD + N G
Sbjct: 1 SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQG 60
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GG+MDYA+Q++I+N GI ++ +YPYR Q G C+K K+ H TI+G++ +P +E+ L
Sbjct: 61 CDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEELL 120
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNS 298
L+AV QPVSV I + FQLYSSG+FTG C ++LDH V IVGY ++ G YW++KNS
Sbjct: 121 LRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNS 180
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
WG WG +GY+ M+R G G+CGIN+ ASYPTK
Sbjct: 181 WGSGWGESGYVRMERQ-GPGAGVCGINLDASYPTK 214
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 189/315 (60%), Gaps = 16/315 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLR-------DVPASIDWRKKGAVTEVKDQAS 137
EF A F G + + + S S L+ D+P+++DWR+ GAVT+VK Q
Sbjct: 95 EFLAKFTGLNIP----NSYLSPSPMSSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGR 150
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG CWAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N G
Sbjct: 151 CGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGG 209
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
I E DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+
Sbjct: 210 ISRESDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD 267
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
Q Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ G
Sbjct: 268 -LQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYG 326
Query: 317 NSLGICGINMLASYP 331
N G+C I ++SYP
Sbjct: 327 NPAGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI +E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/333 (44%), Positives = 209/333 (62%), Gaps = 17/333 (5%)
Query: 5 AFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
F +L +L+ SS + + + W HGK+YS E++ R+ I++ N + +HN
Sbjct: 3 VFLVLCVLVASSRGWSVRFGQDSEWVAWKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHN 62
Query: 65 NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWR 124
+ S+ +++N DLT EF+ +LG A + +R A+ P N++ +P+S+DW
Sbjct: 63 -AEDHSYKMAMNHLGDLTEDEFRYFYLGVRAHH-NSTKRGWATYMPPSNVK-IPSSVDWS 119
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGG 183
+KG VT VK+Q CG+CWAFS TG++EG + TGSLVSLSEQ LIDC SY N+GC GG
Sbjct: 120 QKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYGNNGCQGG 179
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHI-VTIDGYKDVPENNEKQLLQA 242
LMD A++++ N GIDTE YPY GQ G C+ + H+ + GY+D+P+ +E Q LQ+
Sbjct: 180 LMDNAFRYIESNGGIDTESSYPYLGQQGSCHFS--SSHVGARVTGYQDIPQGSE-QALQS 236
Query: 243 VVAQ--PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNS 298
VA PVSV + S+ +Q YSSG++ P ST LDH VL++GY + NG DYW++KNS
Sbjct: 237 AVATVGPVSVAVDASQ--WQFYSSGVYDNPYCSSTQLDHGVLVIGYGNYNGQDYWLVKNS 294
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WG SWG+ GY+ M RN N CGI ASYP
Sbjct: 295 WGYSWGVEGYIMMSRNKNNQ---CGIASSASYP 324
>gi|3097321|dbj|BAA25899.1| Bd 30K [Glycine max]
gi|84371705|gb|ABC56139.1| 34 kDa maturing seed protein [Glycine max]
gi|195957142|gb|ACG59282.1| major allergen Gly m Bd 30K [Glycine max]
gi|223452512|gb|ACM89583.1| maturing seed protein [Glycine max]
gi|226432468|gb|ACO55749.1| Gly m Bd 30K allergen [Glycine max]
gi|320090153|gb|ADW08728.1| P34 allergen [Glycine max]
gi|320090155|gb|ADW08729.1| P34 allergen [Glycine max]
gi|320090157|gb|ADW08730.1| P34 allergen [Glycine max]
gi|320090159|gb|ADW08731.1| P34 allergen [Glycine max]
Length = 379
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 202/337 (59%), Gaps = 17/337 (5%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SIL L ++ LF+ W +HG+ Y + +E+ +RL+IF++N ++ N S
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNLNYIRDMNANRKS 84
Query: 70 --SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKK 126
S L LN FAD+T QEF +L + N ++ D PAS DWRKK
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKK 144
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
G +T+VK Q CG+ WAFSATGAIE + I TG LVSLSEQEL+DC + GC G
Sbjct: 145 GVITQVKYQGGCGSGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGCYNGWHY 203
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV-------PENNEKQL 239
++++V+++ GI T+ DYPYR + G+C K+ + VTIDGY+ + E+
Sbjct: 204 QSFEWVLEHGGIATDDDYPYRAKEGRCKANKI-QDKVTIDGYETLIMSDESTESETEQAF 262
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTS---LDHAVLIVGYDSENGVDYWIIK 296
L A++ QP+SV I + F LY+ GI+ G TS ++H VL+VGY S +GVDYWI K
Sbjct: 263 LSAILEQPISVSI--DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAK 320
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG WG +GY+ +QRNTGN LG+CG+N ASYPTK
Sbjct: 321 NSWGEDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTK 357
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 189/315 (60%), Gaps = 16/315 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLR-------DVPASIDWRKKGAVTEVKDQAS 137
EF A F G + + + S S L+ D+P+++DWR+ GAVT+VK Q
Sbjct: 95 EFLAKFTGLNIP----NSYLSPSPMSSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGR 150
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG CWAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N G
Sbjct: 151 CGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGG 209
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
I E DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+
Sbjct: 210 ISRESDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD 267
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
Q Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ G
Sbjct: 268 -LQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYG 326
Query: 317 NSLGICGINMLASYP 331
N G+C I ++SYP
Sbjct: 327 NPAGLCDIAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 190/311 (61%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI +E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y GQ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGQQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 196/332 (59%), Gaps = 7/332 (2%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
S++F SI+ S L + +LF +W H K Y + EK R +IF+DN ++ +
Sbjct: 22 SVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE 81
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLG-FSAASIDHDRRRNASVQSPGNLRDVPASI 121
N N+S+ L LN FADL++ EF ++G A+I+ + NL P ++
Sbjct: 82 TNKK-NNSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDTVNL---PENV 137
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWRKKGAVT V+ Q SCG+CWAFSA +EGINKI TG LV LSEQEL+DC+R + GC
Sbjct: 138 DWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCK 196
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GG YA ++V KN GI YPY+ + G C +++ IV G V NNE LL
Sbjct: 197 GGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLN 255
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
A+ QPVSV + R FQLY GIF GPC T +DHAV VGY G Y +IKNSWG
Sbjct: 256 AIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGT 315
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
+WG GY+ ++R GNS G+CG+ + YPTK
Sbjct: 316 AWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTK 347
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 149/336 (44%), Positives = 196/336 (58%), Gaps = 7/336 (2%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
S++F SI+ S L + +LF +W H K Y + EK R +IF+DN ++ +
Sbjct: 22 SVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE 81
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLG-FSAASIDHDRRRNASVQSPGNLRDVPASI 121
N N+S+ L LN FADL++ EF ++G A+I+ + NL P ++
Sbjct: 82 -TNKKNNSYRLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDIVNL---PENV 137
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWRKKGAVT V+ Q SCG+CWAFSA +EGINKI TG LV LSEQEL+DC+R + GC
Sbjct: 138 DWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCK 196
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GG YA ++V KN GI YPY+ + G C +++ IV G V NNE LL
Sbjct: 197 GGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLN 255
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
A+ QPVSV + R FQLY GIF GPC T +DHAV VGY G Y +IKNSWG
Sbjct: 256 AIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGT 315
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
+WG GY+ ++R GNS G+CG+ + YP K N
Sbjct: 316 AWGEKGYIRIKRAPGNSPGVCGLYKSSYYPIKNRDN 351
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 188/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 152/344 (44%), Positives = 204/344 (59%), Gaps = 30/344 (8%)
Query: 11 ILLLSSLPLNYCSDINELF-ETWCK---QHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-- 64
IL+L + I EL E W QH K Y SE E++ R+KI+ N + +HN
Sbjct: 6 ILILGFVAAANAISIFELVKEEWTAFKLQHRKKYDSETEERIRMKIYVQNKHKIAKHNQR 65
Query: 65 -NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ---------SPGNL 114
++G F L +N +ADL H+EF + GF+ + + ++ P N+
Sbjct: 66 YDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEEPVTWIEPANV 125
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
DVP ++DWR KGAVT+VKDQ CG+CW+FSATGA+EG + TG LVSLSEQ L+DC +
Sbjct: 126 -DVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQ 184
Query: 175 SY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDV 231
Y N+GC GG+MD+A+Q++ N GIDTEK YPY +C+ N V T G+ D+
Sbjct: 185 KYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDECH---YNPKAVGATDKGFVDI 241
Query: 232 PENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGY-DSE 287
P+ NEK L++A+ PVSV I S +FQ YS G++ P S LDH VL VGY +E
Sbjct: 242 PQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTE 301
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+G DYW++KNSWG +WG GY+ M RN N CGI ASYP
Sbjct: 302 DGEDYWLVKNSWGTTWGDQGYVKMARNRDNH---CGIATTASYP 342
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 141/306 (46%), Positives = 193/306 (63%), Gaps = 14/306 (4%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADLTHQEFKASFL 91
+HGK+Y SE E+ RLKI+ +N + +HN G +++++N F D+ H EF ++
Sbjct: 33 KHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTRN 92
Query: 92 GFSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
GF D R + ++ P N+ D +P ++DWR KGAVT VK+Q CG+CWAFSATG+
Sbjct: 93 GFKRNYKDQPREGSTYLE-PENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151
Query: 150 IEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
+EG + +GS+VSLSEQ L+DC + N+GC GGLMD A++++ N GIDTEK YPY G
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPYNG 211
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIF 267
G C+ +K T G+ D+ E +E QL +AV P+SV I S +FQ YS G++
Sbjct: 212 TDGTCHFKKSTVG-ATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVY 270
Query: 268 TGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
P S SLDH VL+VGY + NG DYW++KNSWG +WG GY+ M RN N CGI
Sbjct: 271 DEPECDSESLDHGVLVVGYGTLNGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQ---CGIA 327
Query: 326 MLASYP 331
ASYP
Sbjct: 328 SSASYP 333
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 188/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 180/309 (58%), Gaps = 11/309 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
++FE W + GK Y EK+ R IF DN F+ + + +N FADLT+ EF
Sbjct: 35 QMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEF 94
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
A++ G A H + P + P IDWR +GAVT VKDQ +CG+CWAF+A
Sbjct: 95 VATYTG---AKPPHPKE----APRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAA 147
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
AIEG+ KI TG L LSEQEL+DCD + N GCGGG D A++ V GI E DY Y
Sbjct: 148 VAAIEGLTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRY 206
Query: 207 RGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
G G+C L H +I GY+ VP N+E+QL AV QPV+V I S AFQ Y SG
Sbjct: 207 EGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSG 266
Query: 266 IFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
+F GPC S +HAV +VGY D +G YW+ KNSWG++WG GY+ ++++ G CG
Sbjct: 267 VFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCG 326
Query: 324 INMLASYPT 332
+ + YPT
Sbjct: 327 LAVSPFYPT 335
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 149/347 (42%), Positives = 207/347 (59%), Gaps = 24/347 (6%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L L ++ +S++ + + E + + QH Y SE E R+KI+ ++ +
Sbjct: 1 MKCLVLLLCAVAAVSAV--QFFDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHII 58
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR--------RNASVQ 109
+HN MG S+ L +N + D+ H EF + GF+ + H++ R A
Sbjct: 59 AKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTA-KHNKNLYMKGGSVRGAKFI 117
Query: 110 SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
SP N++ +P +DWRK GAVT++KDQ CG+CW+FS TGA+EG + +G LVSLSEQ L
Sbjct: 118 SPANVK-LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNL 176
Query: 170 IDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
IDC Y N+GC GGLMD A++++ N GIDTE+ YPY G +C N + G+
Sbjct: 177 IDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDV-GF 235
Query: 229 KDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYD 285
D+PE +E++L++AV PVSV I S +FQLYSSG++ ST LDH VL+VGY
Sbjct: 236 VDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYG 295
Query: 286 S-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+ E GVDYW++KNSWGRSWG GY+ M RN N CGI ASYP
Sbjct: 296 TDEQGVDYWLVKNSWGRSWGELGYIKMIRNKNNR---CGIASSASYP 339
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 144/337 (42%), Positives = 201/337 (59%), Gaps = 18/337 (5%)
Query: 8 LLSILLLSSLPLNYCSDINELFET----WCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
++++L L+ L + N++ + + H K Y S E+ R+KI+ DN + +H
Sbjct: 4 VVALLFLAVLAMGQTVSFNKILDAEWFIFKLHHNKVYKSPVEEGYRMKIYMDNKRKIAEH 63
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N + ++ L +N + D+ H EF + GF+ + + SP N++ +P
Sbjct: 64 NRKYELNEVTYKLGMNKYGDMLHHEFVNTLNGFNKSVTAGIETEGVTFISPANVK-LPDE 122
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DW K+GAVT VKDQ CG+CWAFS+TGA+EG + TG LVSLSEQ LIDC Y N+G
Sbjct: 123 VDWTKQGAVTAVKDQGHCGSCWAFSSTGALEGQHFRSTGYLVSLSEQNLIDCSGKYGNNG 182
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMDYA+Q++ N G+DTEK YPY + +C N T GY D+P+ +E++L
Sbjct: 183 CNGGLMDYAFQYIKDNKGLDTEKTYPYEAENDRCRYNPRNSG-ATDKGYVDIPQGDEEKL 241
Query: 240 LQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CST-SLDHAVLIVGY--DSENGVDYWI 294
AV P+SV I S +FQLYS G++ P CS +LDH VLIVGY D +G DYW+
Sbjct: 242 KAAVATIGPISVAIDASHESFQLYSEGVYYDPDCSAENLDHGVLIVGYGTDETSGHDYWL 301
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG++WG GY+ M RN N CGI ASYP
Sbjct: 302 VKNSWGKTWGQKGYIKMARNKNNH---CGIASSASYP 335
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 212/346 (61%), Gaps = 33/346 (9%)
Query: 8 LLSILLLSSL--PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
+L++L L + ++ I E ++T+ +H K Y SE E++ R+KIF +N + +HN
Sbjct: 4 VLALLALVAFVQAISITDVIKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQ 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ---------SPGN 113
+ G SF L LN +AD+ H EFK + G+ +H R+ Q SP N
Sbjct: 64 LYAQGKVSFKLGLNKYADMLHHEFKETMNGY-----NHTMRKELRAQEGFNGITYISPAN 118
Query: 114 LRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD 173
++ VP ++DWR+ GAVT VKDQ CG+CW+FS+TG++EG + G LVSLSEQ L+DC
Sbjct: 119 VQ-VPKAVDWRQHGAVTSVKDQGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCS 177
Query: 174 RSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKD 230
Y N+GC GGLMD A++++ N G+DTEK YPY G C+ N+ V T G+ D
Sbjct: 178 TKYGNNGCNGGLMDNAFRYIKDNGGVDTEKSYPYEGIDDSCH---FNKATVGATDTGFVD 234
Query: 231 VPENNEKQLLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE 287
+P+ +E+ +++AV PV+V I S +FQLYS G++ P S +LDH VL+VGY ++
Sbjct: 235 IPQGDEEAMMKAVATMGPVAVAIDASNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTD 294
Query: 288 -NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+G DYW++KNSWG +WG GY+ M RN N CGI +S+PT
Sbjct: 295 KDGQDYWLVKNSWGTTWGDQGYIKMARNQDNQ---CGIATASSFPT 337
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 206/342 (60%), Gaps = 19/342 (5%)
Query: 4 LAFFLLS-ILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+ FF+L+ + ++ + +++ + E + T+ QH K Y S+ E++ R+KIF +N V +
Sbjct: 1 MKFFVLALVFIVGAQAVSFFDLVQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHKVAK 60
Query: 63 HNN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASID-----HDRRRNASVQSPGNL 114
N MG S+ L +N +AD+ H EF + GF+ + + A+ +P N+
Sbjct: 61 XNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAPANV 120
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
+ P ++DWR+ GAVT VKDQ CG+CW+FSATGA+EG + T LVSLSEQ L+DC
Sbjct: 121 K-FPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDCST 179
Query: 175 SY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
+ N GC GGLMD A+++V NHGIDTE YPY +C+ T G+ D+P
Sbjct: 180 KFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKCHYNPKTSG-ATDRGFVDIPT 238
Query: 234 NNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENG 289
+E++L+ AV PVSV I S +FQLYS G++ P S LDH VL+VGY + ENG
Sbjct: 239 GDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGTDENG 298
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYWI+KNSWG SWG GY+ M RN N+ CGI ASYP
Sbjct: 299 QDYWIVKNSWGESWGEQGYIKMARNRDNN---CGIATQASYP 337
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 188/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 198/321 (61%), Gaps = 21/321 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADL 81
+N+ + T+ +H K Y S+ E++ R+KIF DN + +HN+ M S+ L +N + D+
Sbjct: 30 VNQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDM 89
Query: 82 THQEFKASFLGFSAASIDHDRRRN-----ASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
H EF GF+ SI+ R AS P N+ +P +DWRK+GAVT VKDQ
Sbjct: 90 LHHEFVNILNGFNK-SINTQLRSERLPVGASFIEPANVV-LPKKVDWRKEGAVTPVKDQG 147
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKN 195
CG+CW+FSATGA+EG + TG LVSLSEQ LIDC Y N+GC GGLMD A+Q++ N
Sbjct: 148 HCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDN 207
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ--PVSVGIC 253
G+DTE YPY + +C N + + GY D+P +EK LL+A VA PVSV I
Sbjct: 208 KGLDTEASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGDEK-LLKAAVATIGPVSVAID 265
Query: 254 GSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMH 310
S ++FQ YS G++ P S LDH VL++GY + ENG DYW++KNSWG +WG NGY+
Sbjct: 266 ASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNNGYIK 325
Query: 311 MQRNTGNSLGICGINMLASYP 331
M R N L CGI ASYP
Sbjct: 326 MAR---NKLNHCGIASSASYP 343
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDITKMSSYP 341
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 148/341 (43%), Positives = 207/341 (60%), Gaps = 22/341 (6%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
F L L+ + ++Y I E ++T+ +H K Y E E++ RLKIF +N + +HN
Sbjct: 4 LFALLALVAVAQAVSYADVIKEEWQTFKLEHRKNYVDETEERFRLKIFNENKHKIAKHNQ 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDV 117
G SF +++N +AD+ H EF + GF+ R + S SP +++ +
Sbjct: 64 RYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLRASDPSFVGVTFISPEHVK-I 122
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P S+DWR KGAVTEVKDQ CG+CWAFS+TGA+EG + G+L+SLSEQ L+DC Y
Sbjct: 123 PKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYG 182
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPEN 234
N+GC GGLMD A++++ N GIDTEK YPY G C+ N+ + T G D+P+
Sbjct: 183 NNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCH---FNKATIGATDRGSVDIPQG 239
Query: 235 NEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGYDS-ENGV 290
+EK++ +AV PVSV I S +FQ YS GI+ P C +LDH VL+VGY + E+G
Sbjct: 240 DEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTDESGQ 299
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG +WG G++ M RN N CGI +SYP
Sbjct: 300 DYWLVKNSWGTTWGDKGFIKMARNADNQ---CGIASASSYP 337
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 150/339 (44%), Positives = 205/339 (60%), Gaps = 15/339 (4%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L+ L++ ++SSL +++ D +E + W +HGK Y S++E+ R I++ N V
Sbjct: 1 MKYLSVLLVAACVVSSLSMSFI-DFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
+HN ++G+ ++ L +N FADL ++EF + GF S R ++ P N+ D+
Sbjct: 60 IKHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFRGNS--SKATRGSTFLPPSNVFDM 117
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSY 176
P +DWR KG VT VK+Q CG+CWAFSATG++EG + TG LVSLSEQ L+DC +
Sbjct: 118 PTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKEG 177
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMD A+Q+++ GIDTE YPY GQC+ K N T GY DV +E
Sbjct: 178 NMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAMDGQCHFNKANIG-ATDTGYTDVTTGSE 236
Query: 237 KQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGY-DSENGVDY 292
L AV + P+SV I S ++FQLY SG++ P ST LDH VL VGY S +G DY
Sbjct: 237 SALQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSDGTDY 296
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+ +SWG +WGMNGY+ M RN N CGI ASYP
Sbjct: 297 FFFFHSWGAAWGMNGYLWMSRNKDNQ---CGIATKASYP 332
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 150/333 (45%), Positives = 202/333 (60%), Gaps = 14/333 (4%)
Query: 8 LLSILLLSSLP--LNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
LL++L + L L+ ++N+ +E + +H K Y S E+ R IFE+N+ F+ HN+
Sbjct: 58 LLAVLAVIGLASALSPNPNLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNS 117
Query: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRK 125
F L +N F DLT++E++ +LG+ + + + + DVP IDWR
Sbjct: 118 KKEFDFYLGMNHFGDLTNKEYRERYLGYRRPE-NTPSKASYIFSRAEKIEDVPDQIDWRD 176
Query: 126 KGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGL 184
+G VT VK+Q CG+CWAFSA G++EG + TG LVSLSEQ L+DC NSGC GG
Sbjct: 177 QGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGW 236
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHI-VTIDGYKDVPENNEKQLLQAV 243
MD A+++V NHGIDTE YPY G G C+ + N+ I T+ G+ DV E +E+ L QAV
Sbjct: 237 MDQAFEYVKDNHGIDTEDSYPYVGTDGSCHFK--NKSIGATLKGFMDVKEGDEEALRQAV 294
Query: 244 -VAQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSE-NGVDYWIIKNSW 299
VA PVSV I S FQ Y G++ P CSTS LDH VL+VGY + G D+W++KNSW
Sbjct: 295 GVAGPVSVAIDASSMLFQFYRGGVYNVPWCSTSELDHGVLVVGYGKQFQGKDFWMVKNSW 354
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G WG+ GY+ M RN GN CGI AS PT
Sbjct: 355 GVGWGIYGYIEMSRNKGNQ---CGIASKASIPT 384
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 145/336 (43%), Positives = 204/336 (60%), Gaps = 14/336 (4%)
Query: 5 AFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
F +L L +++ + + + + + HGK Y SE E+ RLKI+ +N + +HN
Sbjct: 26 GFVVLGCLFVTAAAITHQELVGAEWSAFKALHGKEYHSETEEYYRLKIYMENRLKIARHN 85
Query: 65 NM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG-NLRDVPAS 120
+S+ L++N F DL H EF ++ GF R + ++ G + +P +
Sbjct: 86 EKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKRNYRSTPREGSFYIEPEGIEDKHLPKT 145
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWRKKGAVT VK+Q CG+CWAFS TG++EG + TG +VSLSEQ L+DC + N+G
Sbjct: 146 VDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSGKFGNNG 205
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A++++ N GIDTE YPY G G C+ +K + T G+ D+PE NE QL
Sbjct: 206 CEGGLMDNAFKYIKANGGIDTELSYPYNGTDGICHFEKSDVG-ATDTGFVDIPEGNE-QL 263
Query: 240 LQAVVAQ--PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWII 295
L+ VA PVSV I S +FQ YS G++ P S SLDH VL+VGY +++G DYW++
Sbjct: 264 LKKAVATVGPVSVAIDASHESFQFYSQGVYDEPECSSESLDHGVLVVGYGTKDGQDYWLV 323
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSWG +WG +GY++M RN N CGI ASYP
Sbjct: 324 KNSWGTTWGDDGYIYMTRNKENQ---CGIASSASYP 356
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 188/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/336 (42%), Positives = 207/336 (61%), Gaps = 18/336 (5%)
Query: 1 MNSLAFFL-LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
M +L+ FL + + ++S++PL S +E W HGK Y ++ E R +F N
Sbjct: 1 MKTLSVFLAICLAVVSAIPLKDPS-----WEAWKSFHGKKYHNQGEDDFRHYVFLQNIKT 55
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
+ HN S+F +++N F+DLT +EF ++ G+ S+ + ++ +P N ++P
Sbjct: 56 IAAHN--AKSTFKMAINEFSDLTRKEFVKTYNGYRL-SMKKSTNKPSTFMAPLNT-NMPT 111
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
+DWRK+G VT +K+Q CG+CWAFS TG++EG + TG LVSLSEQ LIDC + N
Sbjct: 112 EVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLIDCSAAEGND 171
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GCGGG MD A++++ N+GIDTE YPY G+ C +K N+ + GY D+ + +E
Sbjct: 172 GCGGGFMDDAFEYIKLNNGIDTEASYPYEGRDDICRYKKTNKGAIDT-GYMDIKQYSEDD 230
Query: 239 LLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGYDSENGVDYWII 295
L AV P+SV I S ++F +Y +G++ P CS T LDH VL+VGY +ENG DYW++
Sbjct: 231 LKAAVATVGPISVAIDASHKSFHMYHTGVYHEPECSQTVLDHGVLVVGYGTENGEDYWLV 290
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSWG WGMNGY+ M RN N+ CGI ASYP
Sbjct: 291 KNSWGTDWGMNGYIKMSRNRSNN---CGIATNASYP 323
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 201/338 (59%), Gaps = 30/338 (8%)
Query: 5 AFFLLSILLLSSLPLNYCSDINELFETWCKQHGKA-----YSSEQEKQQRLKIFEDNYAF 59
F+ S L + PL +F W +++ K+ YS+E E R ++ D
Sbjct: 12 GLFVASTLAATHDPLT------GVFAKWMRENTKSNYRFVYSNE-EFIYRWNVWRD---- 60
Query: 60 VTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
+ +N N S+ L++N F DLT+ EF F G + H + A+ ++P +P+
Sbjct: 61 --EEHNRQNKSYFLAMNQFGDLTNAEFNRLFKGLAFDYSKHAKIHTAAPEAPAT--GIPS 116
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
DWR+KGAVT VK+Q CG+CW+FS TG+ EG N + TG LVSLSEQ LIDC SY N+
Sbjct: 117 EFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNN 176
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAG--QCNKQKLNRHIVTIDGYKDVPENNE 236
GC GGLMDYA++++I N GIDTE YPY+ AG C N+ ++ GY DV +E
Sbjct: 177 GCNGGLMDYAFEYIINNRGIDTEASYPYQ-TAGPLTCQYNAANKG-GSLTGYTDVTSGDE 234
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIF--TGPCSTSLDHAVLIVGYDSENGVDYWI 294
LL A V +PVSV I S +FQ YS G++ + ST LDH VL+VG+ SENG D+W
Sbjct: 235 NALLNAAVKEPVSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLVVGWGSENGQDFWW 294
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+KNSWG SWG+NGY+ M RN N+ CGI ASYPT
Sbjct: 295 VKNSWGASWGLNGYIKMSRNQNNN---CGIATAASYPT 329
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 196/348 (56%), Gaps = 18/348 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINEL-----FETWCKQHGKAYSSEQEKQQRLKIFED 55
+ S L + +L + C D+ ++ F W H ++Y S +E QR ++
Sbjct: 18 LASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRR 77
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHD----RRRNASVQSP 111
N F+ N G+ ++ L+ N FADLT +EF A++ G+ A D V +
Sbjct: 78 NAEFIDAVNLRGDLTYRLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGAGDVDAS 137
Query: 112 GNLR-DVPASIDWRKKGAVTEVKDQAS-CGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
+ R DVPAS+DWR +GAV K Q S C +CWAF IE +N I TG LVSLSEQ+L
Sbjct: 138 FSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQL 197
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
+DCD SY+ GC G AY++V++N G+ TE DYPY + G CN+ K H I G+
Sbjct: 198 VDCD-SYDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFG 256
Query: 230 DVPENNEKQLLQAVVAQPVSVGI-CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DS 286
VP NE L AV QPV+V I GS Q Y G++TGPC T L HAV +VGY D+
Sbjct: 257 KVPPRNEAALQAAVARQPVAVAIEVGS--GMQFYKGGVYTGPCGTRLAHAVTVVGYGTDA 314
Query: 287 ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
+G YW IKNSWG+SWG GY+ + R+ G G+CG+ + +YPT T
Sbjct: 315 SSGAKYWTIKNSWGQSWGERGYIRILRDVGGP-GLCGVTLDIAYPTLT 361
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 196/348 (56%), Gaps = 18/348 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINEL-----FETWCKQHGKAYSSEQEKQQRLKIFED 55
+ S L + +L + C D+ ++ F W H ++Y S +E QR ++
Sbjct: 18 LASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRR 77
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHD----RRRNASVQSP 111
N F+ N G+ ++ L+ N FADLT +EF A++ G+ A D V +
Sbjct: 78 NAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGAGDVDAS 137
Query: 112 GNLR-DVPASIDWRKKGAVTEVKDQAS-CGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
+ R DVPAS+DWR +GAV K Q S C +CWAF IE +N I TG LVSLSEQ+L
Sbjct: 138 FSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQL 197
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
+DCD SY+ GC G AY++V++N G+ TE DYPY + G CN+ K H I G+
Sbjct: 198 VDCD-SYDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFG 256
Query: 230 DVPENNEKQLLQAVVAQPVSVGI-CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DS 286
VP NE L AV QPV+V I GS Q Y G++TGPC T L HAV +VGY D+
Sbjct: 257 KVPPRNEAALQAAVARQPVAVAIEVGS--GMQFYKGGVYTGPCGTRLAHAVTVVGYGTDA 314
Query: 287 ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
+G YW IKNSWG+SWG GY+ + R+ G G+CG+ + +YPT T
Sbjct: 315 SSGAKYWTIKNSWGQSWGERGYIRILRDVGGP-GLCGVTLDIAYPTLT 361
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 180/309 (58%), Gaps = 11/309 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
++FE W + GK Y EK+ R IF DN F+ + + +N FADLT+ EF
Sbjct: 18 QMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEF 77
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
A++ G A H + P + P IDWR +GAVT VKDQ +CG+CWAF+A
Sbjct: 78 VATYTG---AKPPHPKE----APRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAA 130
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
AIEG+ KI TG L LSEQEL+DCD + N GCGGG D A++ V GI E DY Y
Sbjct: 131 VAAIEGLTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRY 189
Query: 207 RGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
G G+C L H +I GY+ VP N+E+QL AV QPV+V I S AFQ Y SG
Sbjct: 190 EGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSG 249
Query: 266 IFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
+F GPC S +HAV +VGY D +G YW+ KNSWG++WG GY+ ++++ G CG
Sbjct: 250 VFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCG 309
Query: 324 INMLASYPT 332
+ + YPT
Sbjct: 310 LAVSPFYPT 318
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDITKMSSYP 341
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 187/308 (60%), Gaps = 7/308 (2%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
+ W + Y E EKQ RL++F +N F+ NNMG+ S+ L +N F D T +EF A+
Sbjct: 39 QKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSYKLGVNKFTDWTKEEFLAT 98
Query: 90 FLGFSAASIDHDRRRNASVQSPGN--LRDVPASI-DWRKKGAVTEVKDQASCGACWAFSA 146
G S ++ N + DV + DWR +GAVT VK Q CG CWAFSA
Sbjct: 99 HTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGAVTPVKYQGECGGCWAFSA 158
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EG+ KI G+L+SLSEQ+L+DC R N+GC GG M A+ +++KN G+ +E YPY
Sbjct: 159 IAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEAFNYIVKNGGVSSENAYPY 218
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ + G C + + I G+++VP NNE+ LL+AV QPV+V I SE F YS G+
Sbjct: 219 QVKEGPCRSNDI--PAIVIRGFENVPSNNERALLEAVSRQPVAVDIDASETGFIHYSGGV 276
Query: 267 FTG-PCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+ C TS++HAV +VGY S+ G+ YW+ KNSWG++WG NGY+ ++R+ G+CG+
Sbjct: 277 YNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGV 336
Query: 325 NMLASYPT 332
ASYP
Sbjct: 337 AQYASYPV 344
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 179/309 (57%), Gaps = 11/309 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
++FE W + GK Y EK+ R IF DN F+ + + +N FADLT+ EF
Sbjct: 41 QMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEF 100
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
A++ G A H + P + P IDWR +GAVT VKDQ +CG+CWAF+A
Sbjct: 101 VATYTG---AKPPHPKE----APRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAA 153
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
AIEG+ KI TG L LSEQEL+DCD + N GCGGG D A++ V GI E DY Y
Sbjct: 154 VAAIEGLTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRY 212
Query: 207 RGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
G G+C L H I GY+ VP N+E+QL AV QPV+V I S AFQ Y SG
Sbjct: 213 EGFQGKCRVDDMLFNHAARIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSG 272
Query: 266 IFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
+F GPC S +HAV +VGY D +G YW+ KNSWG++WG GY+ ++++ G CG
Sbjct: 273 VFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCG 332
Query: 324 INMLASYPT 332
+ + YPT
Sbjct: 333 LAVSPFYPT 341
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 196/348 (56%), Gaps = 18/348 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINEL-----FETWCKQHGKAYSSEQEKQQRLKIFED 55
+ S L + +L + C D+ ++ F W H ++Y S +E QR ++
Sbjct: 14 LASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRR 73
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHD----RRRNASVQSP 111
N F+ N G+ ++ L+ N FADLT +EF A++ G+ A D V +
Sbjct: 74 NAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGAGDVDAS 133
Query: 112 GNLR-DVPASIDWRKKGAVTEVKDQAS-CGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
+ R DVPAS+DWR +GAV K Q S C +CWAF IE +N I TG LVSLSEQ+L
Sbjct: 134 FSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQL 193
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
+DCD SY+ GC G AY++V++N G+ TE DYPY + G CN+ K H I G+
Sbjct: 194 VDCD-SYDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFG 252
Query: 230 DVPENNEKQLLQAVVAQPVSVGI-CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DS 286
VP NE L AV QPV+V I GS Q Y G++TGPC T L HAV +VGY D+
Sbjct: 253 KVPPRNEAALQAAVARQPVAVAIEVGS--GMQFYKGGVYTGPCGTRLAHAVTVVGYGTDA 310
Query: 287 ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
+G YW IKNSWG+SWG GY+ + R+ G G+CG+ + +YPT T
Sbjct: 311 SSGAKYWTIKNSWGQSWGERGYIRILRDVGGP-GLCGVTLDIAYPTLT 357
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 209/342 (61%), Gaps = 18/342 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
MN L F L+I + S +++ + E + + H K Y SE E++ R+KIF +N V
Sbjct: 1 MNFLIF--LAICVAGSQAVSFFDLVQEQWGAFKMTHNKQYQSETEERFRMKIFMENSHTV 58
Query: 61 TQHNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAAS---IDHDRRRNASVQSPGNL 114
+HN + G SF L +N +AD+ H EF GF+ + + + P N+
Sbjct: 59 AKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANV 118
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
+ +P IDWR KGAVT VKDQ CG+CW+FSATG++EG + +G LVSLSEQ L+DC
Sbjct: 119 Q-LPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDCSE 177
Query: 175 SY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
+ N+GC GGLMD A++++ N GIDTE+ YPY+ + +C+ + N+ T GY D+
Sbjct: 178 KFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKG-ATDRGYVDIES 236
Query: 234 NNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSE-NG 289
NE +L AV PVSV I S ++FQLYS G++ P CS S LDH VL+VGY +E +G
Sbjct: 237 GNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDG 296
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG+SWG GY+ M RN N+ CGI ASYP
Sbjct: 297 TDYWLVKNSWGKSWGDQGYIKMARNRNNN---CGIATEASYP 335
>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
Length = 331
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 213/339 (62%), Gaps = 18/339 (5%)
Query: 1 MNSLAFFL-LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
MN+L L + +S LN D++ + + + H K YS ++E+ +RL I+EDN +
Sbjct: 1 MNTLIVVASLCVTAFASPILN--KDLDGDWVLYKQTHKKTYSQDEEQMRRL-IWEDNVNY 57
Query: 60 VTQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD 116
+ +HN + G ++ L N +AD+T EF+A G+ ++ +R + SP N+ D
Sbjct: 58 IQKHNLAADRGEHTYWLGQNEYADMTIFEFRAIMNGYKMSA---NRTKGDLYMSPSNIGD 114
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P S+DWRK+G VT++K+Q CG+CW+FSATG++EG + + LVSLSEQ L+DC +
Sbjct: 115 LPDSVDWRKEGYVTDIKNQGHCGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCSKKE 174
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N GC GGLMD A++++ N GIDTE+ YPY + G C+ + N T GY D+P
Sbjct: 175 GNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKNGFCHFKAENVG-ATDTGYVDIPHMQ 233
Query: 236 EKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDY 292
E +L +AV P+SVGI ++FQLY G+++ P CS+S LDH VL VGY +E+G DY
Sbjct: 234 EDKLQEAVATVGPISVGIDAGHKSFQLYREGVYSEPACSSSKLDHGVLAVGYGTESGDDY 293
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
W++KNSWG SWGM GY+ M RN N +CGI ASYP
Sbjct: 294 WLVKNSWGTSWGMQGYVMMARNKHN---MCGIATQASYP 329
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 146/311 (46%), Positives = 198/311 (63%), Gaps = 9/311 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W K H + + +EK +R +F++N V N M + + L LN FAD+++ EF
Sbjct: 39 QLYERWGKHHTIS-RNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADMSNYEF 96
Query: 87 KASFLGFSAAS---IDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
+F S S H+RRR A D+P+S+DWR++GAV VK+Q CG+CWA
Sbjct: 97 -VNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDWRERGAVNAVKEQGRCGSCWA 155
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS+ A+EGINKI T L+SLSEQEL+DC+ N GC GG M+ A+ F+ +N GI TE
Sbjct: 156 FSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKRNGGIATENS 214
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G G C +++ IV IDGY+ VPE NE L+QAV QPVSV I + R FQ YS
Sbjct: 215 YPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDAAGRDFQFYS 273
Query: 264 SGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
G+F G C T L+H V+ +GY +E+G DYW+++NSWG WG +GY+ M+R + G+C
Sbjct: 274 QGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQAEGLC 333
Query: 323 GINMLASYPTK 333
GI M ASYP K
Sbjct: 334 GIAMEASYPIK 344
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 180/309 (58%), Gaps = 11/309 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
++FE W + GK Y EK+ R IF DN F+ + + +N FADLT+ EF
Sbjct: 18 QMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEF 77
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
A++ G A H + P + P IDWR +GAVT VKDQ +CG+CWAF+A
Sbjct: 78 VATYTG---AKPPHPKE----APRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAA 130
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
AIEG+ KI TG L LSEQEL+DCD + N GCGGG D A++ V GI E DY Y
Sbjct: 131 VAAIEGLTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRY 189
Query: 207 RGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
G G+C L H +I GY+ VP N+E+QL AV QPV+V I S AFQ Y SG
Sbjct: 190 EGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSG 249
Query: 266 IFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
+F GPC S +HAV +VGY D +G YW+ KNSWG++WG GY+ ++++ G CG
Sbjct: 250 VFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCG 309
Query: 324 INMLASYPT 332
+ + YPT
Sbjct: 310 LAVSPFYPT 318
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 192/310 (61%), Gaps = 14/310 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E FE W ++G Y E+++ +IF+ N A++ N GN + L++N F D +
Sbjct: 38 LSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPIE 97
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
+ F + + + N+ D+PA++DWRK+GAVT +K+Q CG+CWAF
Sbjct: 98 DSDDGFERTTTTTPTTTFKYE-------NVTDIPATVDWRKRGAVTPIKNQGKCGSCWAF 150
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
SA AIEGI KI +G+LVSLSEQ+L+DCDRS GC G M A++F+++N GI TE +
Sbjct: 151 SAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIATEAN 210
Query: 204 YPY-RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
YPY R G C K H V I Y++VP N+E LL+AV QPVSVGI F+ Y
Sbjct: 211 YPYKRVVKGTCKKVS---HKVQIKSYEEVPSNSEDSLLKAVANQPVSVGI-DMRGMFKFY 266
Query: 263 SSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
SSGIFTG C T +HA+ IVGY S++G+ YW++KNSW + WG GY+ ++R+ G+
Sbjct: 267 SSGIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDIDAKEGL 326
Query: 322 CGINMLASYP 331
CGI M SYP
Sbjct: 327 CGIAMKPSYP 336
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 264 bits (675), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 189/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++G+ G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 264 bits (675), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 139/339 (41%), Positives = 206/339 (60%), Gaps = 18/339 (5%)
Query: 3 SLAFFLLSIL-----LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
+L F +L L +L++ L+ + + E W Q+G+ Y + EK +R ++F+ N
Sbjct: 6 ALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANV 65
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHDRR-RNASVQSPGNL 114
AF+ + N GN F L +N FADLT+ EF+++ GF ++ RN +V N+
Sbjct: 66 AFI-ESFNAGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTGFRNENV----NI 120
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
+PA++DWR KG VT +KDQ CG CWAFSA A+EGI K+ TG L+S S + +
Sbjct: 121 DALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSLLTVM 180
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
S GC GGLMD A++F+IKN G+ TE +YPY A + ++ + +I GY+DVP N
Sbjct: 181 SM--GCEGGLMDDAFKFIIKNGGLTTESNYPY--AAVDDKFKSVSNSVASIKGYEDVPAN 236
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYW 293
NE L++AV QPVSV + G + FQ Y G+ TG C T LDH ++ +GY + +G YW
Sbjct: 237 NEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYW 296
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
++KNSWG +WG NG++ M+++ + G+CG+ M SYPT
Sbjct: 297 LLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPT 335
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 179/310 (57%), Gaps = 9/310 (2%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM----GNSSFTLSLNAFADLTHQE 85
E W +HGK Y E+EK +RL++F N + N G L+ N FADLT E
Sbjct: 43 EKWMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDE 102
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
F+A+ G+ + +L P S+DWR GAVT VKDQ SCG CWAFS
Sbjct: 103 FRAARTGYQRPPAAVAGAGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFS 162
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
A A+EG+ KI TG LVSLSEQEL+DCD R + GC GGLMD A+Q++ + G+ E Y
Sbjct: 163 AVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSY 222
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PYRG + R +I G++DVP N+E L+ AV QPVSV I G+ F+ Y
Sbjct: 223 PYRG-VDGACRAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDR 281
Query: 265 GIFTGP-CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
G+ G C T L+HAV VGY + +G YW++KNSWG SWG GY+ ++R G G C
Sbjct: 282 GVLGGAGCGTELNHAVTAVGYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGVGRE-GAC 340
Query: 323 GINMLASYPT 332
GI +ASYP
Sbjct: 341 GIAQMASYPV 350
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 201/342 (58%), Gaps = 23/342 (6%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN- 64
FLL + ++ ++ + E + + QH K Y SE E++ RLKI+ N + +HN
Sbjct: 4 LFLLVAFVAAANAVSIFELVKEEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQ 63
Query: 65 --NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSP------GNLRD 116
G F L +N + DL H+EF + GF+ + + + P N+ +
Sbjct: 64 RFEQGQEKFRLRVNKYTDLLHEEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANV-E 122
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
VP ++DWR+KGAVT VKDQ CG+CW+FSATGA+EG + TG LVSLSEQ L+DC Y
Sbjct: 123 VPKTVDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKY 182
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPE 233
N+GC GG+MD+A+Q++ N GIDTEK YPY C+ N V T G+ D+P+
Sbjct: 183 GNNGCNGGMMDFAFQYIKDNGGIDTEKAYPYEAIDDTCH---YNPKAVGATDKGFVDIPQ 239
Query: 234 NNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGY-DSENG 289
+EK L++A+ A PVSV I S +FQ YS G++ P S +LDH VL VGY SE G
Sbjct: 240 GDEKALMKAIATAGPVSVAIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEG 299
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG +WG GY+ M RN N CGI ASYP
Sbjct: 300 EDYWLVKNSWGTTWGDQGYVKMARNRDNH---CGIATAASYP 338
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 182/332 (54%), Gaps = 49/332 (14%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L + ++ + + + E E W ++G+ Y EK++R KIF+DN A T
Sbjct: 13 ALLFILAAWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNVAQATT 72
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
FK N+ VP++ID
Sbjct: 73 -----------------------FKYE-----------------------NVTAVPSTID 86
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCG 181
WRKKGAVT +KDQ CG+CWAFSA A EGI +I TG L+SLSEQEL+DCD N GC
Sbjct: 87 WRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCS 146
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGL D A++F I HG+ +E YPY G G CN +K I GY+DVP NNEK L +
Sbjct: 147 GGLXDDAFRF-IXIHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQK 205
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWG 300
AV QPV+V I FQ Y+SG+FTG C T LDH V VGY ++G+ YW++KNSWG
Sbjct: 206 AVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMXYWLVKNSWG 265
Query: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WG GY+ MQR+ G+CGI M ASYPT
Sbjct: 266 TGWGEEGYIRMQRDVTAKEGLCGIAMQASYPT 297
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 188/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++E KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEVAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R++GN G
Sbjct: 271 YAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 264 bits (674), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 146/327 (44%), Positives = 199/327 (60%), Gaps = 18/327 (5%)
Query: 12 LLLSSLPLNYCSD---INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
LLL + L Y + +E + W H K YS + E+ R I++DN + +HN G
Sbjct: 7 LLLLGVTLAYTIERPVKDESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKG- 65
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
F L +N F D+T+ EFKA F G+ + H ++ +P N P ++DWR +G
Sbjct: 66 GDFILKMNQFGDMTNSEFKA-FNGY----LSHKHVNGSTFLTPNNFV-APDTVDWRNEGY 119
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 187
VT VKDQ CG+CWAFS TG++EG + TG LVSLSEQ L+DC +Y N+GC GGLMD
Sbjct: 120 VTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDN 179
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-Q 246
A+ ++ +N GID+E YPY + G+C +K + T G+ D+PE NE +L +AV +
Sbjct: 180 AFTYIKENKGIDSEASYPYTAEDGKCVFKK-SSVAATDTGFVDIPEGNENKLKEAVASVG 238
Query: 247 PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
P+SV I S +FQ YSSG++ P ST LDH VL+VGY +E+G DYW++KNSW SWG
Sbjct: 239 PISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWG 298
Query: 305 MNGYMHMQRNTGNSLGICGINMLASYP 331
GY+ M+RN N CGI ASYP
Sbjct: 299 DKGYIKMRRNAKNQ---CGIATKASYP 322
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 195/337 (57%), Gaps = 13/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L ++ + S L+ + + + + + H K Y S+ E++ R+KI+ +N V +
Sbjct: 1 TLIFLLGAVFVQLSAALSLTNLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60
Query: 63 HNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGNLRDVP 118
HN + G S+ +++N F DL H EF++ G+ + R + + P N+ +VP
Sbjct: 61 HNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANV-EVP 119
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
S+DWR+KGA+T VKDQ CG CWAFS+TGA+EG TG LVSL EQ LIDC Y N
Sbjct: 120 ESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGN 179
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A+Q++ N GIDTE YPY + C NR V G+ D+P E
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEED 238
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWI 294
+L AV PVSV I S +FQ YS G++ P S LDH VL+VGY S+NG DYW+
Sbjct: 239 KLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWL 298
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSW WG GY+ + RN N CG+ ASYP
Sbjct: 299 VKNSWSEHWGDQGYIKIARNRKNH---CGVATAASYP 332
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 263 bits (673), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 146/338 (43%), Positives = 203/338 (60%), Gaps = 18/338 (5%)
Query: 5 AFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
+ +L L +++ + + + + + HGK Y+S+ E+ RLKI+ +N + +HN
Sbjct: 3 GYIVLCCLFVTAAAITHQELVGAEWSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHN 62
Query: 65 NM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV--PA 119
S+ L++N F DL H EF ++ GF D R + V+ P D+ P
Sbjct: 63 EKYAKSQVSYKLAMNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVE-PEGFEDLQLPK 121
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
++DWRKKGAVT VK+Q CG+CWAFS TG++EG + T LVSLSEQ L+DC RS+ N+
Sbjct: 122 TVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNN 181
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNE 236
GC GGLMD A++++ N GIDTE YPY G C+ NR V T G+ D+PE +E
Sbjct: 182 GCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDGVCH---FNRSDVGATDTGFVDIPEGDE 238
Query: 237 KQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYW 293
+L +AV A PVSV I S +FQ YS G++ P S LDH VL+VGY +++G DYW
Sbjct: 239 NKLKKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYGTKDGQDYW 298
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
++KNSWG +WG GY++M RN N CGI ASYP
Sbjct: 299 LVKNSWGTTWGDEGYIYMTRNKDNQ---CGIASSASYP 333
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 263 bits (673), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 141/333 (42%), Positives = 204/333 (61%), Gaps = 15/333 (4%)
Query: 7 FLLSILLLSSLPLNYCSDINE---LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L++ LL ++ + +E ++ W H K Y++ E+ R I+ DN + +H
Sbjct: 3 LLVAACLLFAVASGFVVKFDEDEQQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKH 62
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDW 123
N G+S FTL++N DLT EF+ + G + ++ +++ ++ +P +++ VP ++DW
Sbjct: 63 NAEGHS-FTLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKKQGSAFLAPSHVQ-VPDTVDW 120
Query: 124 RKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGG 182
RK+G VT VK+Q CG+CWAFS TG++EG N TG LVSLSEQ L+DC +Y N+GC G
Sbjct: 121 RKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQG 180
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTID-GYKDVPENNEKQLLQ 241
GLMDYA++++ +N GIDTE+ YPY + +C QK N I +D G+ DV +E+ L
Sbjct: 181 GLMDYAFKYIKENGGIDTEESYPYEARNDRCRFQKSN--IGAVDTGFVDVTHGDEEALKT 238
Query: 242 AV-VAQPVSVGICGSERAFQLYSSGIF--TGPCSTSLDHAVLIVGYDSENGVDYWIIKNS 298
A P+SV I +FQ Y SG++ G STSLDH VL+VGY + G DYW++KNS
Sbjct: 239 AAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQGSDYWLVKNS 298
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WG WGM GY+ M RN N CG+ ASYP
Sbjct: 299 WGERWGMEGYIMMSRNKNNQ---CGVATQASYP 328
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 209/343 (60%), Gaps = 28/343 (8%)
Query: 8 LLSILLLSSL--PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
+L++L L + ++Y I E ++T+ +H K + SE E++ R+KIF +N + +HN
Sbjct: 4 VLALLALVAFVQAISYTDVIKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQ 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS--------PGNL 114
+ G SF L LN ++D+ + EFK + G+ +H R+ Q P N+
Sbjct: 64 LYAQGKVSFKLGLNKYSDMLYHEFKETMNGY-----NHTMRKVLRAQGFSGIIYIPPANV 118
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
+ +P S+DWR+ GAVT VKDQ CG+CWAFS+T A+EG + G LVSLSEQ L+DC
Sbjct: 119 Q-IPKSVDWRQHGAVTAVKDQGHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCST 177
Query: 175 SY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
Y N+GC GGLMD A++++ N GIDTEK YPY G C+ K T G+ D+P+
Sbjct: 178 KYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFTKSGVG-ATDTGFVDIPQ 236
Query: 234 NNEKQLLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSEN-G 289
+E+ L++AV PVSV I S +FQLYS G++ P + +LDH VL+VGY ++ G
Sbjct: 237 GDEEALMKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTG 296
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+DYW++KNSWG +WG GY+ M RN N CGI +SYPT
Sbjct: 297 LDYWLVKNSWGTTWGDQGYIKMARNQDNQ---CGIATASSYPT 336
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 196/337 (58%), Gaps = 13/337 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L F L ++L+ S L+ + + + + + H K Y S+ E++ R+KI+ +N V +
Sbjct: 5 TLIFLLGAVLVQLSAALSLTNLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64
Query: 63 HNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGNLRDVP 118
HN + G S+ +++N F DL H EF++ G+ + R + + P N+ +VP
Sbjct: 65 HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANV-EVP 123
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
S+DWR KGA+T VKDQ CG+CWAFS+TGA+EG TG L+SLSEQ LIDC Y N
Sbjct: 124 ESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A+Q++ N GIDTE YPY + C NR + G+ +P E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDNVCRYNPRNRGAID-RGFVHIPSGEED 242
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWI 294
+L AV PVSV I S +FQ YS G++ P S LDH VL+VGY S+NG DYW+
Sbjct: 243 KLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWL 302
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSW WG GY+ + RN N CGI ASYP
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNH---CGIATAASYP 336
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 187/322 (58%), Gaps = 36/322 (11%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
++ W Q+ + Y + EK R ++F+ N F+ + N G + L N FADLT +EF A
Sbjct: 59 YKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAA 118
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPAS---------------IDWRKKGAVTEVK 133
+ G R+ A+V P + +PA+ +DWR++GAVT VK
Sbjct: 119 MYTGL---------RKPAAV--PSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVK 167
Query: 134 DQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFV 192
+Q CG CWAFSA GA+EG+ I TG+LVSLSEQ+++DCD S N GC GG MD A+Q+V
Sbjct: 168 NQGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYV 227
Query: 193 IKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGI 252
I N G+ TE YPY G C + TI G++D+P +E L AV QPVSVG+
Sbjct: 228 INNGGVTTEDAYPYSAVQGTCQNVQ---PAATISGFQDLPSGDENALANAVANQPVSVGV 284
Query: 253 CGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGMNGYMH 310
G FQ Y GI+ G C T ++HAV +GY +++ G YWI+KNSWG WG NG+M
Sbjct: 285 DGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQ 344
Query: 311 MQRNTGNSLGICGINMLASYPT 332
+Q +G CGI+ +ASYPT
Sbjct: 345 LQM----GVGACGISTMASYPT 362
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 185/310 (59%), Gaps = 12/310 (3%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
+ W +HG+ Y EK +R ++F+ N + + N GN + L+ N F DLT EF A
Sbjct: 43 DKWMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAM 102
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
+ G++ A+ + NA+ + PA +DWR++GAVT VK+Q SCG CWAFS A
Sbjct: 103 YTGYNPANTMY-AAANATTRLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAA 161
Query: 150 IEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQ 209
+EGI++I TG LVSLSEQ+L+DC + N GC GG +D A+Q++ + G+ TE Y Y+G
Sbjct: 162 VEGIHQITTGELVSLSEQQLLDC--ADNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 219
Query: 210 AGQCN---KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
G C + TI GY+ V N+E L AV +QPVSV I GS F+ Y SG+
Sbjct: 220 QGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGV 279
Query: 267 FTG-PCSTSLDHAVLIVGY----DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
FT C T LDHAV +VGY D G YWIIKNSWG +WG GYM ++++ G S G
Sbjct: 280 FTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVG-SQGA 338
Query: 322 CGINMLASYP 331
CG+ M SYP
Sbjct: 339 CGVAMAPSYP 348
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 188/315 (59%), Gaps = 16/315 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLR-------DVPASIDWRKKGAVTEVKDQAS 137
EF A F G + + + S S L+ D+P+++DW + GAVT+VK Q
Sbjct: 95 EFLAKFTGLNIP----NSYLSPSPMSSTELKINDLSDDDMPSNLDWIESGAVTQVKHQGR 150
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG CWAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N G
Sbjct: 151 CGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGG 209
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
I E DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+
Sbjct: 210 ISRESDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD 267
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
Q Y+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ G
Sbjct: 268 -LQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYG 326
Query: 317 NSLGICGINMLASYP 331
N G+C I ++SYP
Sbjct: 327 NPAGLCDIAKMSSYP 341
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 146/327 (44%), Positives = 199/327 (60%), Gaps = 18/327 (5%)
Query: 12 LLLSSLPLNYCSD---INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
LLL + L Y + +E + W H K YS + E+ R I++DN + +HN G
Sbjct: 7 LLLLGVTLAYTIERPVKDESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKG- 65
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
F L +N F D+T+ EFKA F G+ + H ++ +P N P ++DWR +G
Sbjct: 66 GDFLLKMNQFGDMTNSEFKA-FNGY----LSHKHVNGSTFLTPNNFV-APDTVDWRNEGY 119
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 187
VT VKDQ CG+CWAFS TG++EG + TG LVSLSEQ L+DC +Y N+GC GGLMD
Sbjct: 120 VTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDN 179
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-Q 246
A+ ++ +N GID+E YPY + G+C +K + T G+ D+PE NE +L +AV +
Sbjct: 180 AFTYIKENKGIDSEASYPYTAEDGKCVFKKPSV-AATDTGFVDLPEGNENKLKEAVASVG 238
Query: 247 PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
P+SV I S +FQ YSSG++ P ST LDH VL+VGY +E+G DYW++KNSW SWG
Sbjct: 239 PISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWG 298
Query: 305 MNGYMHMQRNTGNSLGICGINMLASYP 331
GY+ M+RN N CGI ASYP
Sbjct: 299 DKGYIKMRRNAKNQ---CGIATKASYP 322
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 185/310 (59%), Gaps = 12/310 (3%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
+ W +HG+ Y EK +R ++F+ N + + N GN + L+ N F DLT EF A
Sbjct: 33 DKWMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAM 92
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
+ G++ A+ + NA+ + PA +DWR++GAVT VK+Q SCG CWAFS A
Sbjct: 93 YTGYNPANTMY-AAANATTRLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAA 151
Query: 150 IEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQ 209
+EGI++I TG LVSLSEQ+L+DC + N GC GG +D A+Q++ + G+ TE Y Y+G
Sbjct: 152 VEGIHQITTGELVSLSEQQLLDC--ADNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 209
Query: 210 AGQCN---KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
G C + TI GY+ V N+E L AV +QPVSV I GS F+ Y SG+
Sbjct: 210 QGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGV 269
Query: 267 FTG-PCSTSLDHAVLIVGY----DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
FT C T LDHAV +VGY D G YWIIKNSWG +WG GYM ++++ G S G
Sbjct: 270 FTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVG-SQGA 328
Query: 322 CGINMLASYP 331
CG+ M SYP
Sbjct: 329 CGVAMAPSYP 338
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 148/336 (44%), Positives = 202/336 (60%), Gaps = 16/336 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSDIN-ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+ L+++ +++ N +IN E +ET+ HGK Y ++ E+ R KIF +N +
Sbjct: 1 MKVLLVAVAVIAVSCANRFYNINPEEWETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEA 60
Query: 63 HN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
HN G S+ + +N F DL E KA GF + +R + P N + +P
Sbjct: 61 HNAKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFKMTP---NTKREGKIYFPSNDK-LPK 116
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
S+DWR+KGAVT VKDQ CG+CW+FSATG++EG + G LVSLSEQ L+DC + Y N+
Sbjct: 117 SVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNN 176
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A+Q+V N GIDTE YPY + C +K ++ T GY D+PE +EK
Sbjct: 177 GCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYACRFKK-DKVGGTDKGYVDIPEGDEKA 235
Query: 239 LLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CST-SLDHAVLIVGYDSENGVDYWII 295
L A+ P+SV I S +F YS G++ P CS+ LDH VL VGY +ENG DYW++
Sbjct: 236 LQNALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYGTENGQDYWLV 295
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSWG SWG +GY+ + RN N CGI +ASYP
Sbjct: 296 KNSWGPSWGESGYIKIARNHSNH---CGIASMASYP 328
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 188/305 (61%), Gaps = 7/305 (2%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFS 94
++G+ Y EK +R +IF++N + NN +S+TL +N F D+T+ EF A + G
Sbjct: 3 EYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGI 62
Query: 95 AASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGIN 154
+ ++ ++ S N+ V SIDWR GAVTEVKDQ CG+CWAFSA +EGI
Sbjct: 63 SRPLNIEKEPVVSFDDV-NISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIY 121
Query: 155 KIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCN 214
KIVTG LVSLSEQE++DC S +GC GG +D AY F+I N+G+ +E DYPY+ G C
Sbjct: 122 KIVTGYLVSLSEQEVLDCAVS--NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCA 179
Query: 215 KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTS 274
+ I GY V N+E + AV QP++ I S FQ Y+ G+F+GPC TS
Sbjct: 180 ANSW-PNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTS 238
Query: 275 LDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT- 332
L+HA+ I+GY + +G YWI+KNSWG SWG GY+ M R +S G+CGI M YPT
Sbjct: 239 LNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSSS-GLCGIAMDPLYPTL 297
Query: 333 KTGQN 337
++G N
Sbjct: 298 QSGAN 302
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 186/321 (57%), Gaps = 35/321 (10%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
++ W Q+ + Y + EK R ++F+ N F+ + N G + L N FADLT +EF A
Sbjct: 59 YKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAA 118
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPAS--------------IDWRKKGAVTEVKD 134
+ G R+ A+V P + +PA +DWR++GAVT VK+
Sbjct: 119 MYTGL---------RKPAAV--PSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKN 167
Query: 135 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVI 193
Q CG CWAFSA GA+EG+ I TG+LVSLSEQ+++DCD S N GC GG MD A+Q+V+
Sbjct: 168 QGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVV 227
Query: 194 KNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGIC 253
N G+ TE YPY G C + TI G++D+P +E L AV QPVSVG+
Sbjct: 228 NNGGVTTEDAYPYSAVQGTCQNVQ---PAATISGFQDLPSGDENALANAVANQPVSVGVD 284
Query: 254 GSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGMNGYMHM 311
G FQ Y GI+ G C T ++HAV +GY +++ G YWI+KNSWG WG NG+M +
Sbjct: 285 GGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQL 344
Query: 312 QRNTGNSLGICGINMLASYPT 332
Q +G CGI+ +ASYPT
Sbjct: 345 QM----GVGACGISTMASYPT 361
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/347 (40%), Positives = 200/347 (57%), Gaps = 17/347 (4%)
Query: 1 MNSLAFFLLSILLLS-SLPLNYCSD--------INELFETWCKQHGKAYSSEQEKQQRLK 51
M S+ F +S+ +LS SL ++ + + E + W + + YS E EKQ R
Sbjct: 1 MTSILFMFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFD 60
Query: 52 IFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAAS-IDHDRRRNASVQS 110
+F+ N F+ + N G+ ++ L +N FAD T +EF A+ G + I + + S
Sbjct: 61 VFKKNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEFVDEMIPS 120
Query: 111 PG-NLRDV--PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
N+ DV P DWR +GAVT VK Q CG CWAFS+ A+EG+ KIV G+LVSLSEQ
Sbjct: 121 WNWNVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVSLSEQ 180
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
+L+DCDR ++GC GG+M A+ ++IKN GI +E YPY+ G C + I G
Sbjct: 181 QLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTCRYNA--KPSAWIRG 238
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGY-D 285
++ VP NNE+ LL+AV QPVSV I F YS G++ P C T ++HAV VGY
Sbjct: 239 FQTVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVTFVGYGT 298
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
S G+ YW+ KNSWG +WG NGY+ ++R+ G+CG+ A YP
Sbjct: 299 SPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPV 345
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 205/338 (60%), Gaps = 17/338 (5%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
LL + ++ +++ + E + ++ QH K Y SE E++ R+KIF DN V +HN
Sbjct: 4 LVLLVTIAVACQAVSFSELVQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNK 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDR---RRNASVQSPGNLRDVPA 119
+ G + L++N + DL H EF GF+ R + + + P ++ D+P
Sbjct: 64 LFEQGLYPYKLAMNKYGDLLHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHV-DIPD 122
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
++DWR++GAVT VKDQ CG+CW+FSATGA+EG + T LVSLSEQ L+DC + N+
Sbjct: 123 TVDWRQEGAVTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNN 182
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A++++ N GIDTE YPY G+ + NR T G+ D+P +E +
Sbjct: 183 GCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKFRYSAKNRG-ATDKGFVDIPSGDEDK 241
Query: 239 LLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGY--DSENGVDYW 293
L AV P+S+ I S +FQLYS+G+++ P ST LDH VL+VGY D + G+DYW
Sbjct: 242 LKAAVATVGPISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGMDYW 301
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
++KNSWG +WG++GY+ M RN N CG+ ASYP
Sbjct: 302 LVKNSWGDTWGLDGYIKMARNQDNQ---CGVATQASYP 336
>gi|351721011|ref|NP_001238219.1| P34 probable thiol protease precursor [Glycine max]
gi|1199563|gb|AAB09252.1| 34 kDa maturing seed vacuolar thiol protease precursor [Glycine
max]
Length = 379
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 200/337 (59%), Gaps = 17/337 (5%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SIL L ++ LF+ W +HG+ Y + +E+ +RL+IF++N ++ N S
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKS 84
Query: 70 --SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKK 126
S L LN FAD+T QEF +L + N ++ D PAS DWRKK
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKK 144
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
G +T+VK Q CG WAFSATGAIE + I TG LVSLSEQEL+DC + G G
Sbjct: 145 GVITQVKYQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNGWQY 203
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV-------PENNEKQL 239
++++V+++ GI T+ DYPYR + G+C K+ + VTIDGY+ + E+
Sbjct: 204 QSFEWVLEHGGIATDDDYPYRAKEGRCKANKI-QDKVTIDGYETLIMSDESTESETEQAF 262
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTS---LDHAVLIVGYDSENGVDYWIIK 296
L A++ QP+SV I + F LY+ GI+ G TS ++H VL+VGY S +GVDYWI K
Sbjct: 263 LSAILEQPISVSI--DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAK 320
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG WG +GY+ +QRNTGN LG+CG+N ASYPTK
Sbjct: 321 NSWGEDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTK 357
>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 125/218 (57%), Positives = 153/218 (70%), Gaps = 2/218 (0%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P +DWR GAV ++KDQ CG+CWAFS A+EGINKI TG L+SLSEQEL+DC R+
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 177 NS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N+ GC GG M +QF+I N GI+TE +YPY + GQCN V+ID Y++VP NN
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E L AV QPVSV + + FQ YSSGIFTGPC T++DHAV IVGY +E G+DYWI+
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
KNSWG +WG GYM +QRN G +G CGI ASYP K
Sbjct: 181 KNSWGTTWGEEGYMRIQRNVG-GVGQCGIAKKASYPVK 217
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 185/314 (58%), Gaps = 12/314 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+ F+ W ++ + Y++ +E QQR ++ +N F+ N G SS+ L N FADLT +EF
Sbjct: 35 DRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPG-SSYELGENQFADLTEEEF 93
Query: 87 KASFL--------GFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASC 138
K ++L A ++ D A N + P S+DWR KGAVT VK Q C
Sbjct: 94 KDTYLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQQHC 153
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLM-DYAYQFVIKNHG 197
G+CWAF+A +IEG++KI TG LVSLSEQE++DCDR N+ G A ++V +N G
Sbjct: 154 GSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNGG 213
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
+ TE DYPY G+ GQC KL H I G + V NE L AV +PV+V I S R
Sbjct: 214 LTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINAS-R 272
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
AFQ Y GIF+GPC+T+ +HAV +VGY + +G YWI+KNSWG WG GY+ MQR
Sbjct: 273 AFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQRGVR 332
Query: 317 NSLGICGINMLASY 330
G+CGI + Y
Sbjct: 333 AREGVCGIAIAPFY 346
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 152/345 (44%), Positives = 203/345 (58%), Gaps = 24/345 (6%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN- 64
F +++ +LS +++ + E ++ + +H K Y+++ E++ R+KIF DN +T+HN
Sbjct: 4 LFFIALTVLSINAVSFYDLVMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNT 63
Query: 65 --NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASID-HDRRRNASVQ-------SPGNL 114
G + L LN ++D+ H EF +F GF+ + I H R N P N+
Sbjct: 64 KYQRGEVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPANV 123
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD- 173
+ +P +DW K GAVT VKDQ CG+CWAFSATGA+EG++ T LVSLSEQ LIDC
Sbjct: 124 K-LPKHVDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCST 182
Query: 174 RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
N+GC GGLMD A+Q+V N GIDTE+ YPY G C + N + GY DVP
Sbjct: 183 EEGNNGCNGGLMDQAFQYVRINGGIDTERSYPYEGNNDVCRYEPENSGAIDT-GYTDVPL 241
Query: 234 NNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CST---SLDHAVLIVGY--DS 286
+E L AV PVSV I S+ +FQLYSSG++ P C SLDH VL+VGY D
Sbjct: 242 GDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDE 301
Query: 287 ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
E DYW++KNSWG SWG NGY+ M RN N CGI S+P
Sbjct: 302 ETQQDYWLVKNSWGDSWGENGYIKMARNADNQ---CGIATQPSFP 343
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 191/307 (62%), Gaps = 18/307 (5%)
Query: 36 HGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQEFKASFLG 92
HGK Y S+ E+ RLKI+ +N + +HN S+ L++N F D+ H EF ++ G
Sbjct: 30 HGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDMLHHEFVSTRNG 89
Query: 93 FSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACWAFSATGAI 150
F D R + V+ P L D +P ++DWRKKGAVT VK+Q CG+CW+FS TG++
Sbjct: 90 FKRNYRDTPREGSFFVE-PEGLEDFHLPKTVDWRKKGAVTPVKNQGQCGSCWSFSTTGSL 148
Query: 151 EGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQ 209
EG + LVSLSEQ LIDC RS+ N+GC GGLMDYA++++ N GIDTE+ YPY
Sbjct: 149 EGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKANKGIDTEQSYPYNAT 208
Query: 210 AGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGI 266
G C+ N+ V T G+ D+PE +E +L +AV PVSV I S +FQ YS G+
Sbjct: 209 DGVCH---FNKSAVGATDTGFVDIPEGDENKLKKAVATVGPVSVAIDASHESFQFYSEGV 265
Query: 267 FTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+ P S LDH VL+VGY +++G DYW++KNSWG +WG GY++M RN N CGI
Sbjct: 266 YDEPECDSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDGGYIYMSRNKDNQ---CGI 322
Query: 325 NMLASYP 331
ASYP
Sbjct: 323 ASAASYP 329
>gi|50657029|emb|CAH04632.1| cathepsin L [Suberites domuncula]
Length = 324
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 206/329 (62%), Gaps = 18/329 (5%)
Query: 11 ILLLSSLPLNYCS-DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
+++LS + L+ + D E + W ++H K Y+ E E+ +R I++ N F+ HN++ +
Sbjct: 4 LIILSLVALSVAAFDFPEEWVAWKQEHSKEYTEELEELRRHTIWQSNKKFIDSHNSVSDK 63
Query: 70 -SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
+TL +N F DL+ EFK + G+ I +R + + + + AS+DWR+KG
Sbjct: 64 FGYTLEMNEFGDLSGVEFKQIYNGY----IMQERANDTKLFTASPYMEPAASVDWRQKGV 119
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 187
V+EVK+Q CG+CW+FSATG++EG + + G LVSLSEQ L+DC + N GC GG+MD
Sbjct: 120 VSEVKNQGQCGSCWSFSATGSLEGQHALKMGRLVSLSEQNLMDCSSRFGNHGCKGGIMDD 179
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVA 245
A+++VI NHG+DTE YPY + G C + N++ V T Y+D+ +E L QA
Sbjct: 180 AFRYVISNHGVDTESSYPYTAKDGYC---RFNQNNVGATETSYRDIARGSESSLTQASAQ 236
Query: 246 -QPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRS 302
P+SV I S R+FQ Y +G++ P CS+S LDH VL+VGY +E G DY+I+KNSWG
Sbjct: 237 IGPISVAIDASHRSFQFYKNGVYYEPSCSSSRLDHGVLVVGYGTEGGQDYFIVKNSWGTR 296
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYP 331
WGM+GY+ M RN N+ CGI ASYP
Sbjct: 297 WGMDGYIMMSRNRRNN---CGIASQASYP 322
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 189/311 (60%), Gaps = 13/311 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + A S EKQ R +F++N ++ + N M + + L LN F DLT EF
Sbjct: 42 DLYERWRSVYTSA-RSFGEKQNRFHVFKENVKYINEVNKM-DKPYKLRLNQFGDLTPSEF 99
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
++ A S + RN S +VP SIDWR KGAVT VK+Q CG CWAFSA
Sbjct: 100 ARTY----ANSKIIEGTRNESGGFMYENVEVPRSIDWRVKGAVTPVKNQGRCGGCWAFSA 155
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EGIN+I TG L+SLSEQ+LIDCD + NSGC GG M A++++ + GI +E +YPY
Sbjct: 156 AAAVEGINQITTGQLISLSEQQLIDCD-TQNSGCRGGTMGRAFEYIKQRGGITSEANYPY 214
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGI---CGSERAFQLYS 263
+ QAG C + R V+IDGY ++ +E +L+ + QPVSV + S + Y
Sbjct: 215 KAQAGMCKNNLIQRPTVSIDGYYNI-RRSEDAVLKILAHQPVSVAVDATTWSSLDWMFYF 273
Query: 264 SGIFTGPCSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
G+FTGPC T L+H V VGY + N G DYWIIKNSWG +WG GYM M R + G+C
Sbjct: 274 QGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRGV-SPYGLC 332
Query: 323 GINMLASYPTK 333
GI M AS+P K
Sbjct: 333 GIAMQASFPIK 343
>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
Length = 221
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 123/222 (55%), Positives = 162/222 (72%), Gaps = 2/222 (0%)
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
D+P SIDWR+ GAV VK+Q CG+CWAFS A+EGIN+IVTG L+SLSEQ+L+DC +
Sbjct: 2 DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-TT 60
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N GC GG M+ A+QF++ N GI++E+ YPYRGQ G CN +N +V+ID Y++VP +N
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNS-TVNAPVVSIDSYENVPSHN 119
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E+ L +AV QPVSV + + R FQLY SGIFTG C+ S +HA+ +VGY +EN D+WI+
Sbjct: 120 EQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIV 179
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
KNSWG++WG +GY+ +RN N G CGI ASYP K G N
Sbjct: 180 KNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPVKKGTN 221
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 187/311 (60%), Gaps = 8/311 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
++E E W +HG+ Y E EK +R IF++N F+ N GN S+ L +N FAD+T Q
Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQ 94
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL---RDVPASIDWRKKGAVTEVKDQASCGAC 141
EF A F G + + +S + N D+P+++DWR+ GAVT+VK Q CG C
Sbjct: 95 EFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFSA G++EG KI TG+L+ SEQEL+DC + N GC GG M A+ F+ +N GI E
Sbjct: 155 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRE 213
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
DY Y G+ C Q+ V I Y+ VPE E LLQAV QPVS+GI S+ Q
Sbjct: 214 SDYEYLGEQYTCRSQE-KTAAVQISSYQVVPE-GETSLLQAVTKQPVSIGIAASQD-LQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
+ G + G C+ ++HAV +GY + E G YW++KNSWG SWG NG+M + R+ GN G
Sbjct: 271 CAGGTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAG 330
Query: 321 ICGINMLASYP 331
+C I ++SYP
Sbjct: 331 LCDIAKMSSYP 341
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 145/348 (41%), Positives = 210/348 (60%), Gaps = 26/348 (7%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M + L L+ + ++Y I E + T+ +H K Y E E++ RLKIF +N +
Sbjct: 1 MRTALILPLLALVAVAQAVSYAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 61 TQHNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-------SVQS 110
+HN + G SF +++N +AD+ H EF ++ GF+ H + RNA + S
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTL--HKQLRNADESFKGVTFIS 118
Query: 111 PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
P ++ +P +DWR KGAVT+VKDQ CG+CWAFS+TGA+EG + +G LVSLSEQ L+
Sbjct: 119 PEHVT-LPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLV 177
Query: 171 DCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDG 227
DC Y N+GC GGLMD A++++ N GIDTEK YPY C+ N+ + T G
Sbjct: 178 DCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCH---FNKGTIGATDRG 234
Query: 228 YKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGY 284
+ D+P+ NEK++ +AV PV+V I S +FQ YS G++ P + +LDH VL+VG+
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294
Query: 285 DS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+ E+G DYW++KNSWG +WG G++ M RN N CGI +SYP
Sbjct: 295 GTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKENQ---CGIASASSYP 339
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 205/337 (60%), Gaps = 16/337 (4%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
L+I + S +++ + E + + H K Y S+ E++ R+KIF +N V +HN
Sbjct: 4 LIFLAICVAGSQAVSFFDLVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNK 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAAS---IDHDRRRNASVQSPGNLRDVPA 119
+ G SF L +N +AD+ H EF GF+ + + + P N++ +P
Sbjct: 64 LYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQ-LPG 122
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
IDWR KGAVT VKDQ CG+CW+FSATG++EG + +G LVSLSEQ L+DC + N+
Sbjct: 123 QIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNN 182
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A++++ N GIDTE+ YPY+ + +C+ + N+ T GY D+ NE +
Sbjct: 183 GCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKG-ATDRGYVDIESGNEDK 241
Query: 239 LLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSE-NGVDYWI 294
L AV PVSV I S ++FQLYS G++ P CS S LDH VL+VGY +E +G DYW+
Sbjct: 242 LQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGTDYWL 301
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG+SWG GY+ M RN N+ CGI ASYP
Sbjct: 302 VKNSWGKSWGDQGYIKMARNRDNN---CGIATEASYP 335
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 169/254 (66%), Gaps = 2/254 (0%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ + L + ELFE+W +H KAY S +EK R ++F +N + Q NN N
Sbjct: 31 FSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN 90
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
S + L LN FADLTH+EFK +LG + R+ +A+ + ++ D+P S+DWRKKGA
Sbjct: 91 S-YWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYR-DITDLPKSVDWRKKGA 148
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
V VKDQ CG+CWAFS A+EGIN+I TG+L SLSEQELIDCD ++NSGC GGLMDYA
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+Q++I G+ E DYPY + G C +QK + VTI GY+DVPEN+++ L++A+ QPV
Sbjct: 209 FQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPV 268
Query: 249 SVGICGSERAFQLY 262
SV I S R FQ Y
Sbjct: 269 SVAIEASGRDFQFY 282
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 189/342 (55%), Gaps = 15/342 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELF---------ETWCKQHGKAYSSEQEKQQRLKIFE 54
+ + I+L + ++ + +F E W + + Y E EK R +F+
Sbjct: 5 MVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFK 64
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA---SFLGFSAASIDHDRRRNASVQSP 111
N F+ N GN S+ L +N FAD T++EF A G + S + S Q+
Sbjct: 65 KNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTW 124
Query: 112 GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
V S DWR +GAVT VK Q CG CWAFSA A+EG+ KI G+LVSLSEQ+L+D
Sbjct: 125 NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLD 184
Query: 172 CDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231
CDR Y+ C GG+M A+ +V++N GI +E DY Y+G G C R I G++ V
Sbjct: 185 CDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNA--RPAARISGFQTV 242
Query: 232 PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGV 290
P NNE+ LL+AV QPVSV + + F YS G++ GPC TS +HAV VGY S++G
Sbjct: 243 PSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGT 302
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW+ KNSWG +W GY+ ++R+ G+CG+ A YP
Sbjct: 303 KYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPV 344
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 185/314 (58%), Gaps = 12/314 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+ F+ W ++ + Y++ +E QQR ++ +N F+ N G SS+ L N FADLT +EF
Sbjct: 35 DRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPG-SSYELGENRFADLTEEEF 93
Query: 87 KASFL--------GFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASC 138
K ++L A ++ D A N + P S+DWR KGAVT VK Q C
Sbjct: 94 KDTYLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQQHC 153
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLM-DYAYQFVIKNHG 197
G+CWAF+A +IEG++KI TG LVSLSEQE++DCDR N+ G A ++V +N G
Sbjct: 154 GSCWAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNGG 213
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
+ TE DYPY G+ GQC KL H I G + V NE L AV +PV+V I S R
Sbjct: 214 LTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINAS-R 272
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
AFQ Y GIF+GPC+T+ +HAV +VGY + +G YWI+KNSWG WG GY+ MQR
Sbjct: 273 AFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQRGVR 332
Query: 317 NSLGICGINMLASY 330
G+CGI + Y
Sbjct: 333 AREGVCGIAIAPFY 346
>gi|129353|sp|P22895.1|P34_SOYBN RecName: Full=P34 probable thiol protease; Flags: Precursor
Length = 379
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 200/337 (59%), Gaps = 17/337 (5%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SIL L ++ LF+ W +HG+ Y + +E+ +RL+IF++N ++ N S
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKS 84
Query: 70 --SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKK 126
S L LN FAD+T QEF +L + N ++ D PAS DWRKK
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKK 144
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
G +T+VK Q CG WAFSATGAIE + I TG LVSLSEQEL+DC + G G
Sbjct: 145 GVITQVKYQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNGWQY 203
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV-------PENNEKQL 239
++++V+++ GI T+ DYPYR + G+C K+ + VTIDGY+ + E+
Sbjct: 204 QSFEWVLEHGGIATDDDYPYRAKEGRCKANKI-QDKVTIDGYETLIMSDESTESETEQAF 262
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTS---LDHAVLIVGYDSENGVDYWIIK 296
L A++ QP+SV I + F LY+ GI+ G TS ++H VL+VGY S +GVDYWI K
Sbjct: 263 LSAILEQPISVSI--DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAK 320
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG WG +GY+ +QRNTGN LG+CG+N ASYPTK
Sbjct: 321 NSWGFDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTK 357
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 149/341 (43%), Positives = 202/341 (59%), Gaps = 22/341 (6%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN- 64
LL + ++ ++ + E + + QH K Y SE E++ RLKI+ N + +HN
Sbjct: 4 LILLMAFVAAANAVSLYELVKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQ 63
Query: 65 --NMGNSSFTLSLNAFADLTHQEFKASFLGF----SAASIDHDR-RRNASVQSPGNLRDV 117
++G + L +N +ADL H+EF + GF S S+ R + P N+ +V
Sbjct: 64 RFDLGQEKYRLRVNKYADLLHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANV-EV 122
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P ++DWRKKGAVT VKDQ CG+CW+FSATGA+EG + TG LVSLSEQ L+DC Y
Sbjct: 123 PTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYG 182
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPEN 234
N+GC GG+MDYA+Q++ N GIDTEK YPY C+ N V T GY D+P+
Sbjct: 183 NNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAIDDTCH---FNPKAVGATDKGYVDIPQG 239
Query: 235 NEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGY-DSENGV 290
+E+ L +A+ PVS+ I S +FQ YS G++ P S +LDH VL VGY SE G
Sbjct: 240 DEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGE 299
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG +WG GY+ M RN N CG+ ASYP
Sbjct: 300 DYWLVKNSWGTTWGDQGYVKMARNRDNH---CGVATCASYP 337
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 197/337 (58%), Gaps = 16/337 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L F LL ++ ++ + +E + H K Y S E+ R KIF +N F+ +H
Sbjct: 2 LRFALLCAIVAAATAATSQEILRTEWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKH 61
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD--VP 118
N G S+ L +N FADL EF G+ + R ++ P NL D +P
Sbjct: 62 NVKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQGKRL---AGRGSTYLPPANLNDSSLP 118
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
++DWRKKGAVT VKDQ CG+CWAFS+TG++EG + + TG LVSLSEQ L+DC +Y N
Sbjct: 119 KTVDWRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGN 178
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD ++ ++ N GIDTE YPY + G C +K + T G+ D+ E +EK
Sbjct: 179 QGCNGGLMDNSFNYIKANGGIDTEDSYPYEAEDGDCRYKKEDVG-ATDTGFVDIKEGSEK 237
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWI 294
L +AV PVSV I S+++FQLYS G++ P S SLDH VL VGY +NG YW+
Sbjct: 238 DLQKAVATVGPVSVAIDASQQSFQLYSEGVYDEPNCSSESLDHGVLAVGYGVKNGKKYWL 297
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSW +WG +GY+ M R+ N CGI ASYP
Sbjct: 298 VKNSWAETWGQDGYILMSRDKNNQ---CGIASSASYP 331
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 144/335 (42%), Positives = 198/335 (59%), Gaps = 16/335 (4%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
++ L +I + S+L + + +F W + + K+YS+E E R ++ +N + +
Sbjct: 5 TILVLLAAICVASTLATTH-DPLTGVFAEWMRDNSKSYSNE-EFVFRWNVWRENQQLIEE 62
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA--SVQSPGNLRDVPAS 120
HN +SF L++N F DLT+ EF F G + H + A +V +PG + A
Sbjct: 63 HNRSNKTSF-LAMNKFGDLTNAEFNKLFKGLAFDYSFHANKAAAEKAVPAPG----LSAD 117
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
DWR+KGAVT VK+Q CG+CW+FS TG+ EG N + TG L SLSEQ LIDC SY N+G
Sbjct: 118 FDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGNNG 177
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMDYA++++I N GIDTE YPY+ C N ++ Y DV +E L
Sbjct: 178 CNGGLMDYAFEYIINNKGIDTEASYPYQTAQYTCQYNPANSG-GSLTSYTDVSSGDENAL 236
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIF--TGPCSTSLDHAVLIVGYDSENGVDYWIIKN 297
L AV +P SV I S +FQ YS G++ + ST LDH VL VG+ +E+G DYW++KN
Sbjct: 237 LNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWGTEDGQDYWLVKN 296
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG WG+ GY+ M RN N+ CGI ASYPT
Sbjct: 297 SWGADWGLAGYIKMARNRSNN---CGIATSASYPT 328
>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
Length = 218
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 124/218 (56%), Positives = 154/218 (70%), Gaps = 2/218 (0%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELIDC R+
Sbjct: 1 LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQ 60
Query: 177 NS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N+ GC GG + +QF+I N GI+TE++YPY Q G+CN N VTID Y++VP NN
Sbjct: 61 NTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNN 120
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI+
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIV 180
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
KNSW +WG GYM + RN G + G CGI + SYP K
Sbjct: 181 KNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 217
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 205/337 (60%), Gaps = 16/337 (4%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
L+I + S +++ + E + + H K Y S+ E++ R+KIF +N V +HN
Sbjct: 4 LIFLAICVAGSQAVSFFDLVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNK 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAAS---IDHDRRRNASVQSPGNLRDVPA 119
+ G SF L +N +AD+ H EF GF+ + + + P N++ +P
Sbjct: 64 LYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQ-LPG 122
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
IDWR KGAVT VKDQ CG+CW+FSATG++EG + +G LVSLSEQ L+DC + N+
Sbjct: 123 QIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNN 182
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A++++ N GIDTE+ YPY+ + +C+ + N+ T GY D+ NE +
Sbjct: 183 GCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKG-ATDRGYVDIESGNEDK 241
Query: 239 LLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSE-NGVDYWI 294
L AV PVSV I S ++FQLYS G++ P CS S LDH VL+VGY +E +G DYW+
Sbjct: 242 LQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDDGTDYWL 301
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG+SWG GY+ M RN N+ CGI ASYP
Sbjct: 302 VKNSWGKSWGDQGYIKMARNRDNN---CGIATEASYP 335
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 146/348 (41%), Positives = 209/348 (60%), Gaps = 26/348 (7%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M + L L+ + ++Y I E + T+ +H K Y E E++ RLKIF +N +
Sbjct: 1 MRTALILPLLALVAVAQAVSYAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 61 TQHNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-------SVQS 110
+HN + G SF +++N +AD+ H EF ++ GF+ H + RNA + S
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTL--HKQLRNADESFKGVTFIS 118
Query: 111 PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
P ++ +P +DWR KGAVT+VKDQ CG+CWAFS+TGA+EG + +G LVSLSEQ L+
Sbjct: 119 PEHVT-LPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLV 177
Query: 171 DCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQC--NKQKLNRHIVTIDG 227
DC Y N+GC GGLMD A++++ N GIDTEK YPY C NK + T G
Sbjct: 178 DCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGSIG---ATDRG 234
Query: 228 YKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGY 284
+ D+P+ NEK++ +AV PV+V I S +FQ YS G++ P + +LDH VL+VG+
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294
Query: 285 DS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+ E+G DYW++KNSWG +WG G++ M RN N CGI +SYP
Sbjct: 295 GTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQ---CGIASASSYP 339
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 199/320 (62%), Gaps = 14/320 (4%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+++ ++E W +HGK Y+ EK++R KIF+DN + +HN+ N S+ LN F+DLT
Sbjct: 35 AEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSYDRGLNQFSDLT 94
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVT-EVKDQASCGA 140
EF+AS+LG I+ + + + D+ P +DWR++GAV VK Q CG+
Sbjct: 95 VDEFQASYLG---GKIEKKSLSDVAERYQYKEGDILPDEVDWRERGAVVPRVKRQGDCGS 151
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGID 199
CWAF+ATGA+EGIN+I TG L+SLSEQELIDCDR N GC GG +A++F+ +N GI
Sbjct: 152 CWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWAFEFIKENGGIV 211
Query: 200 TEKDYPYRGQ---AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
T++DY Y G A + + K R +VTI+G++ VP N+E L +AV QP+SV I +
Sbjct: 212 TDEDYGYTGDDTAACKAIEMKTTR-VVTINGHEVVPVNDEMSLKKAVSYQPISVMISAAN 270
Query: 257 RAFQLYSSGIFTGPCSTSL-DHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRN 314
Y SG++ GPCS DH VLIVGY S + DYW+I+NSWG WG GY+ +QRN
Sbjct: 271 --MSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPGWGEGGYLRLQRN 328
Query: 315 TGNSLGICGINMLASYPTKT 334
G C + + YP KT
Sbjct: 329 FNEPTGKCAVAVAPVYPIKT 348
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 193/307 (62%), Gaps = 13/307 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F W ++H +AYS E E R + F++N F+ + N+ S L L FADLT++E+K
Sbjct: 33 FIGWMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQ-ESDTVLGLTKFADLTNEEYKK 90
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVTEVKDQASCGACWAFSAT 147
+LG ++ + NA+ + + P SIDWR+KGAV++VKDQ CG+CW+FS T
Sbjct: 91 HYLGIK---VNVKKNLNAAQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWSFSTT 147
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
GA+EG ++I +G++VSLSEQ L+DC Y N GC GGLM A++++I N GI TE YPY
Sbjct: 148 GAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATESSYPY 207
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
G+C K + + I GYK++P+ E L A+ QPVSV I S +FQLYSSG+
Sbjct: 208 TAAQGRCKFTK-SMNGANIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQLYSSGV 266
Query: 267 FTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+ P S +LDH VL VGY + G DY+IIKNSWG +WG +GY+ M RN N CG+
Sbjct: 267 YDEPACSSEALDHGVLAVGYGTLEGKDYYIIKNSWGPTWGQDGYIFMSRNAQNQ---CGV 323
Query: 325 NMLASYP 331
+ASYP
Sbjct: 324 ATMASYP 330
>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
Length = 341
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 149/346 (43%), Positives = 203/346 (58%), Gaps = 22/346 (6%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M SL L + S++ ++ + E + + +H K Y SE E + R+KI+ +N +
Sbjct: 1 MRSLVILLCVVAAASAV--SFFDLVKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKHNI 58
Query: 61 TQHNN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDR-------RRNASVQS 110
+HN G SF L N + D+ H EF + GF+ + + R A+ +
Sbjct: 59 AKHNQKYARGEVSFRLKQNKYGDMLHHEFVHTMNGFNKTTKNSKGLFGKSAGERGATFIT 118
Query: 111 PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
P N+ +P +DWRK GAVTEVKDQ CG+CW+FS+TGA+EG + T LVSLSEQ LI
Sbjct: 119 PANVH-LPDHVDWRKHGAVTEVKDQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNLI 177
Query: 171 DCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
DC +Y N+GC GGLMD A++++ N GIDTEK YPY G +C N +G+
Sbjct: 178 DCSAAYGNNGCNGGLMDNAFKYIKDNRGIDTEKSYPYEGIDDKCRYNPKNTG-ADDNGFV 236
Query: 230 DVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS 286
D+P +E +L+ AV PVSV I S+ +FQ YS G++ S+SLDH VL+VGY +
Sbjct: 237 DIPSGDEGKLMAAVATVGPVSVAIDASQSSFQFYSDGVYFDENCSSSSLDHGVLVVGYGT 296
Query: 287 -ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
ENG DYW++KNSWGRSWG GY+ M RN N CGI ASYP
Sbjct: 297 DENGGDYWLVKNSWGRSWGDLGYIKMARNRDNH---CGIATAASYP 339
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 189/307 (61%), Gaps = 15/307 (4%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQEFKASFL 91
H K Y S E+ R+KIF DN + +HN M ++ L +N + D+ H E +
Sbjct: 69 HHKKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEMKEVNYKLGMNKYGDMLHHELINTLN 128
Query: 92 GFS-AASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAI 150
GF+ + ++ ++ A+ P N+ ++P S+DWRKKGAVT +KDQ CG+CWAFS+TGA+
Sbjct: 129 GFNKSVTVSEEQLIGATFIEPANV-ELPKSVDWRKKGAVTAIKDQGQCGSCWAFSSTGAL 187
Query: 151 EGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQ 209
EG + +G LVSLSEQ LIDC Y N+GC GGLMDYA++++ +N G+DTEK YPY +
Sbjct: 188 EGQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYEAE 247
Query: 210 AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFT 268
QC N + G+ D+PE +E +L AV P+SV I S +F YS G++
Sbjct: 248 NDQCRYNPKNSGASDV-GFVDIPEGDEDKLKAAVATIGPISVAIDASHESFHFYSEGVYY 306
Query: 269 GP-CS-TSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
P CS +LDH VLIVGY DS G DYW++KNSWG +WG GY+ M RN N CGI
Sbjct: 307 EPECSPANLDHGVLIVGYGTDSGTGEDYWLVKNSWGETWGEKGYIKMARNKENH---CGI 363
Query: 325 NMLASYP 331
ASYP
Sbjct: 364 ASSASYP 370
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 260 bits (665), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 140/347 (40%), Positives = 201/347 (57%), Gaps = 17/347 (4%)
Query: 1 MNSLAFFLLSILLLS-SLPLNYCSD--------INELFETWCKQHGKAYSSEQEKQQRLK 51
M S+ F L+S+ +LS +L ++ + + E + W + + YS E EKQ R
Sbjct: 10 MTSILFMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFD 69
Query: 52 IFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAAS-IDHDRRRNASVQS 110
+F+ N F+ + N G+ ++ L +N FAD T +EF A+ G + I + + S
Sbjct: 70 VFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPS 129
Query: 111 PG-NLRDVPA--SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
N+ DV + DWR +GAVT VK Q CG CWAFS+ A+EG+ KIV +LVSLSEQ
Sbjct: 130 WNWNVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQ 189
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
+L+DCDR ++GC GG+M A+ ++IKN GI +E YPY+ G C + I G
Sbjct: 190 QLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTCRYN--GKPSAWIRG 247
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGY-D 285
++ VP NNE+ LL+AV QPVSV I F YS G++ P C T+++HAV VGY
Sbjct: 248 FQTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGT 307
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
S G+ YW+ KNSWG +WG NGY+ ++R+ G+CG+ A YP
Sbjct: 308 SPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPV 354
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 260 bits (665), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 146/333 (43%), Positives = 204/333 (61%), Gaps = 19/333 (5%)
Query: 9 LSILLLSSLPLNYCS-DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-NM 66
+ +L+L +L + D ++ W +HGK+Y + +E+ R ++ N ++ +HN +
Sbjct: 1 MKLLILCTLIAAVAAFDFSKELRAWKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHA 60
Query: 67 GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKK 126
G +TL +N F DL + EFK+ + G+ + + R+ ++D+PAS+DW KK
Sbjct: 61 GVFGYTLKMNQFGDLENSEFKSLYNGYR---MSNAPRKGKPFVPAARVQDLPASVDWSKK 117
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLM 185
G VT VK+Q CG+CW+FSATG++EG + TG+L+SLSEQ L+DC + N GC GGLM
Sbjct: 118 GWVTPVKNQGQCGSCWSFSATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLM 177
Query: 186 DYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV 243
D A+++VIKN+GIDTE YPYR C K N V TI GY DV +++E L AV
Sbjct: 178 DDAFEYVIKNNGIDTEASYPYRAVDSTC---KFNTADVGATISGYVDVTKDSESDLQVAV 234
Query: 244 VA-QPVSVGICGSERAFQLYSSGIFTGPC---STSLDHAVLIVGYDSENGVDYWIIKNSW 299
PVSV I S +FQ YSSG++ P ST+LDH VL VGY ++ DYW++KNSW
Sbjct: 235 ATIGPVSVAIDASHISFQFYSSGVYD-PLICSSTNLDHGVLAVGYGTDGSKDYWLVKNSW 293
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G SWGM+GY+ M RN N CGI ASYP
Sbjct: 294 GASWGMSGYIEMVRNHNNK---CGIATSASYPV 323
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 260 bits (665), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 195/314 (62%), Gaps = 18/314 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQE 85
F W + G+ YSS E+ QR + + +N V HN + G S+ L + FAD+ ++E
Sbjct: 26 FHAWRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENEE 85
Query: 86 FKASF----LGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+K LG AS+ RR + + P N +D+PA++DWR KG VT+VKDQ CG+C
Sbjct: 86 YKRLISQGCLGSFNASLP--RRGSTFFRLPEN-KDLPAAVDWRDKGYVTDVKDQKQCGSC 142
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATG++EG TG LVSLSEQ+L+DC Y N GCGGGLMD A++++ GIDT
Sbjct: 143 WAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGGIDT 202
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAF 259
E+ YPY + G+C + K + T GY DV +E L +AV P+SVGI S +F
Sbjct: 203 EESYPYEAEDGEC-RYKPDAVGATCTGYVDVSSGDEDALQEAVATIGPISVGIDASHISF 261
Query: 260 QLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLY SG++ P CS+S LDH VL VGY SENG DYW++KNSWG +WG GY+ M +N N
Sbjct: 262 QLYESGLYDEPQCSSSELDHGVLAVGYGSENGQDYWLVKNSWGLTWGDQGYIKMSKNKSN 321
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 322 Q---CGIATAASYP 332
>gi|357446993|ref|XP_003593772.1| Cysteine proteinase [Medicago truncatula]
gi|355482820|gb|AES64023.1| Cysteine proteinase [Medicago truncatula]
Length = 339
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/320 (44%), Positives = 191/320 (59%), Gaps = 13/320 (4%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSS--FTLSLNAFADLTHQ 84
E+F+ W K+HG+ Y E ++ IF N ++T+ N SS F L L F D + +
Sbjct: 16 EIFQLWMKEHGRVYKDLDEMAKKFDIFISNLKYITETNAKRKSSNGFLLGLTNFTDWSSE 75
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EF+ +L D D + V P+S+DWR KG V+++KDQ +CG+CWAF
Sbjct: 76 EFQERYLHNIDMPTDIDTMKVNDVHLSS--CSAPSSLDWRSKGVVSDIKDQKNCGSCWAF 133
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
SA GAIEGIN I TG L++LSEQEL+DCD + GC G ++ A+ +VI+N G+ + DY
Sbjct: 134 SAVGAIEGINAITTGKLINLSEQELLDCD-PISGGCNSGWVNKAFDWVIRNKGVALDNDY 192
Query: 205 PYRGQAGQCNKQKL-NRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
PY + G C ++ N I +I+ Y V E +++ LL AV QPVSV + + F YS
Sbjct: 193 PYTAEKGVCKASQIPNSAISSINTYHHV-EQSDQGLLCAVAKQPVSVCLYAPQD-FHHYS 250
Query: 264 SGIFTGPC----STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
SGI+ GP S +H VLIVGYDS +G DYWI+KN WG SWGM GYMH++RNT
Sbjct: 251 SGIYDGPNCPVNSKDTNHCVLIVGYDSVDGQDYWIVKNQWGTSWGMEGYMHIKRNTNKKY 310
Query: 320 GICGINMLASYPTK-TGQNP 338
G+C IN A P K G+ P
Sbjct: 311 GVCAINSWAYNPVKYNGRKP 330
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 143/304 (47%), Positives = 190/304 (62%), Gaps = 15/304 (4%)
Query: 36 HGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQEFKASFLG 92
HGK Y ++ E+ R+K+F DN + +HN +G +S+ + +N DL EFKA G
Sbjct: 20 HGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMVHEFKALMNG 79
Query: 93 FSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEG 152
F + RN + P N ++P S+DWR++GAVT VKDQ CG+CW+FSATG++EG
Sbjct: 80 FKKTP---NAERNGKIYVPSN-ENLPKSVDWRQRGAVTPVKDQGHCGSCWSFSATGSLEG 135
Query: 153 INKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAG 211
+ TG LVSLSEQ L+DC ++Y NSGC GGLM+ A+Q+V N GIDTE YPY +
Sbjct: 136 QLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTEASYPYEAREN 195
Query: 212 QCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP 270
C + K ++ T GY D+ E +EK L AV P+SV I S +FQ YS G++
Sbjct: 196 NC-RFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQFYSEGVYKEQ 254
Query: 271 -CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLA 328
CS S LDH VL VGY +ENG DYW++KNSWG SWG +GY+ + RN N CGI +A
Sbjct: 255 YCSPSQLDHGVLTVGYGTENGQDYWLVKNSWGPSWGESGYIKIARNHKNH---CGIASMA 311
Query: 329 SYPT 332
SYP
Sbjct: 312 SYPV 315
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 146/326 (44%), Positives = 191/326 (58%), Gaps = 7/326 (2%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L + +LF +W H K Y + EK R +IF+DN ++ + N N
Sbjct: 2 FSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-N 60
Query: 69 SSFTLSLNAFADLTHQEFKASFLG-FSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
+S+ L LN FADL++ EF ++G A+I+ + NL P ++DWRKKG
Sbjct: 61 NSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDIVNL---PENVDWRKKG 117
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 187
AVT V+ Q SCG+CWAFSA +EGINKI TG LV LSEQEL+DC+R + GC GG Y
Sbjct: 118 AVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPY 176
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 247
A ++V KN GI YPY+ + G C +++ IV G V NNE LL A+ QP
Sbjct: 177 ALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQP 235
Query: 248 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 307
VSV + R FQLY GIF GPC T +D AV VGY G Y +IKNSWG +WG G
Sbjct: 236 VSVVVESKGRPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYILIKNSWGTAWGEKG 295
Query: 308 YMHMQRNTGNSLGICGINMLASYPTK 333
Y+ ++R GNS G+CG+ + YPTK
Sbjct: 296 YIRIKRAPGNSPGVCGLYKSSYYPTK 321
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/311 (46%), Positives = 197/311 (63%), Gaps = 9/311 (2%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W K H + + +EK +R +F++N V N M + + L LN FAD+++ EF
Sbjct: 39 QLYERWGKHHTIS-RNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADMSNYEF 96
Query: 87 KASFLGFSAAS---IDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
+F S S H+RRR A D+P+S+D R++GAV VK+Q CG+CWA
Sbjct: 97 -VNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDGRERGAVNAVKEQGRCGSCWA 155
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS+ A+EGINKI T L+SLSEQEL+DC+ N GC GG M+ A+ F+ +N GI TE
Sbjct: 156 FSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKRNGGIATENS 214
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYS 263
YPY G G C +++ IV IDGY+ VPE NE L+QAV QPVSV I + R FQ YS
Sbjct: 215 YPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDAAGRDFQFYS 273
Query: 264 SGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
G+F G C T L+H V+ +GY +E+G DYW+++NSWG WG +GY+ M+R + G+C
Sbjct: 274 QGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQAEGLC 333
Query: 323 GINMLASYPTK 333
GI M ASYP K
Sbjct: 334 GIAMEASYPIK 344
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 195/322 (60%), Gaps = 22/322 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
+ E + + QH K Y SE E++ RLKI+ N + +HN ++G + L +N +ADL
Sbjct: 23 VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADL 82
Query: 82 THQEFKASFLGF----SAASIDHDR-RRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
H+EF + GF S S+ R + P N+ +VP ++DWRKKGAVT VKDQ
Sbjct: 83 LHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANV-EVPTTVDWRKKGAVTPVKDQG 141
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKN 195
CG+CW+FSATGA+EG + TG LVSLSEQ L+DC Y N+GC GG+MDYA+Q++ N
Sbjct: 142 HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDN 201
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPVSVGI 252
GIDTEK YPY C+ N V T GY D+P+ +E+ L +A+ PVS+ I
Sbjct: 202 GGIDTEKSYPYEAIDDTCH---FNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAI 258
Query: 253 CGSERAFQLYSSGIFTGPC--STSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYM 309
S +FQ YS G++ P S +LDH VL VGY SE G DYW++KNSWG +WG GY+
Sbjct: 259 DASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYV 318
Query: 310 HMQRNTGNSLGICGINMLASYP 331
M RN N CG+ ASYP
Sbjct: 319 KMARNHDNH---CGVATCASYP 337
>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 124/218 (56%), Positives = 152/218 (69%), Gaps = 2/218 (0%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P +DWR GAV ++KDQ CG+ WAFS A+EGINKI TG L+SLSEQEL+DC R+
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 177 NS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N+ GC GG M +QF+I N GI+TE +YPY + GQCN V+ID Y++VP NN
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E L AV QPVSV + + FQ YSSGIFTGPC T++DHAV IVGY +E G+DYWI+
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
KNSWG +WG GYM +QRN G +G CGI ASYP K
Sbjct: 181 KNSWGTTWGEEGYMRIQRNVG-GVGQCGIAKKASYPVK 217
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/306 (45%), Positives = 190/306 (62%), Gaps = 14/306 (4%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADLTHQEFKASFL 91
+HGK+Y SE E+ RLKI+ +N + +HN G +++++N F D+ H EF ++
Sbjct: 33 KHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTRN 92
Query: 92 GFSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACWAFSATGA 149
GF D R + ++ P N+ D +P ++DWR KGAVT VK+Q CG+CWAFSATG+
Sbjct: 93 GFKRNYKDQPREGSTYLE-PENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151
Query: 150 IEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
+EG + +GS+VSLSEQ L+ C + N+GC GGLMD A++++ N GIDTEK YPY G
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPYNG 211
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIF 267
G C+ +K T G+ D+ E +E QL +AV P+SV I S +FQ YS G++
Sbjct: 212 TDGTCHFKKSTVG-ATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVY 270
Query: 268 TGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
P S SLDH VL+VGY + NG DYW +KNSWG +WG GY+ M RN N CGI
Sbjct: 271 DEPECDSESLDHGVLVVGYGTLNGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQ---CGIA 327
Query: 326 MLASYP 331
AS P
Sbjct: 328 SSASIP 333
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 204/339 (60%), Gaps = 19/339 (5%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN- 64
L+ I + +++ +N+ + + +H K Y E E++ R+KI+ N + QHN
Sbjct: 5 LLLIVITCAAVQAISFFELVNQEWINFKMEHKKCYKHEAEERLRMKIYMKNKLQIAQHNC 64
Query: 65 --NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRN-----ASVQSPGNLRDV 117
+ ++ L +N + D+ + EFK G++ +I+H R A+ P N+ ++
Sbjct: 65 DYELKKVTYRLKINKYGDMLNHEFKNMLNGYNR-TINHTLRNERLPVGAAFIEPCNV-EL 122
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P +DWRK GAVTEVKDQ CG+CWAFSATG++EG + TG LVSLSEQ LIDC SY
Sbjct: 123 PKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYG 182
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N+GC GGLMD A+ ++ N G+DTEK YPY G+ +C K + + G+ D+P +E
Sbjct: 183 NNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDKCRYDKRSSGASDV-GFVDIPVGDE 241
Query: 237 KQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDY 292
++L AV PVSV I S ++FQ YS GI+ P ST+LDH VL+VGY + E G DY
Sbjct: 242 QKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDEEGRDY 301
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WI+KNSWG SWG GY+ M RN N CGI ASYP
Sbjct: 302 WIVKNSWGESWGEKGYIKMARNIDNH---CGIASSASYP 337
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 190/308 (61%), Gaps = 7/308 (2%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F + + H K Y++E+E+ +R IF++N ++ HN M S+ L +N F DLT +EF+
Sbjct: 89 FYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHN-MQGYSYVLKMNKFGDLTLEEFRQ 147
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+LG+ + R + D+P +DWR++G VT VKDQ CG+CWAFSATG
Sbjct: 148 RYLGYKKPDLRTPPREVDTTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATG 207
Query: 149 AIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EG+ TG LV+LS+Q+L+DC R N GC GG M+ A+++V++N GI + ++YPY
Sbjct: 208 AMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYM 267
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAFQLYSSGI 266
+ G C + + TI GY+ VP +EK + A+ + PVSV I ++ AFQ Y GI
Sbjct: 268 RKDGVCKSSQCT-SVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGI 326
Query: 267 FTGPCSTSLDHAVLIVGYDSENG--VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
F PC T+LDH VL+VGY +E DYWI+KNSWG +WG GYM M + G + G CG+
Sbjct: 327 FDAPCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKGPA-GQCGV 385
Query: 325 NMLASYPT 332
+ S+P
Sbjct: 386 LLDGSFPV 393
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/334 (42%), Positives = 198/334 (59%), Gaps = 11/334 (3%)
Query: 5 AFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
+LS L+ +++ + F + K H K Y +E E+ R KIF +N + +HN
Sbjct: 3 GLLVLSCLIALGQAVSFFDLSADEFTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKHN 62
Query: 65 N---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
+ G SF L LN AD+ E+ +LGF+ +S ++ + + P + +
Sbjct: 63 SRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNKSSKANNNKLQSYTFIPPAHVTLNKEV 122
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGC 180
DWR KGAVT VK+Q CG+CWAFS TGA+EG N TG LVSLSEQ L+DC SY N+GC
Sbjct: 123 DWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGNNGC 182
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGLMD A+Q++ +NHGIDTEK YPY G+ C +K + T G+ D+ + +E+ L+
Sbjct: 183 EGGLMDNAFQYIKENHGIDTEKSYPYEGEDETCRFRKTSIG-ATDSGFVDITQGDEEALM 241
Query: 241 QAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKN 297
QAV P+SV I S ++FQ YS G++ P S +LDH VL+VGY E+ YW++KN
Sbjct: 242 QAVATIGPISVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLVVGYGVEDNQKYWLVKN 301
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
SWG WG GY+ M R+ N+ CGI ASYP
Sbjct: 302 SWGTQWGDGGYIKMARDQDNN---CGIATQASYP 332
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 189/312 (60%), Gaps = 7/312 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ F ++ + K+Y++E+EKQ+R IF++N ++ HN G S++L +N F DL+
Sbjct: 113 FQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQG-YSYSLKMNHFGDLSRD 171
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL-RDVPASIDWRKKGAVTEVKDQASCGACWA 143
EF+ +LGF + + + L ++PA +DWR +G VT VKDQ CG+CWA
Sbjct: 172 EFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWA 231
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEK 202
FS TGA+EG + TG LVSLSEQEL+DC R+ N C GG M+ A+Q+V+ + GI +E
Sbjct: 232 FSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSED 291
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
YPY + +C Q + +V I G+KDVP +E + A+ PVS+ I + FQ Y
Sbjct: 292 AYPYLARDEECRAQSCEK-VVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFY 350
Query: 263 SSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
G+F C T LDH VL+VGY D E+ D+WI+KNSWG WG +GYM+M + G G
Sbjct: 351 HEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEE-G 409
Query: 321 ICGINMLASYPT 332
CG+ + AS+P
Sbjct: 410 QCGLLLDASFPV 421
>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
Length = 341
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 200/342 (58%), Gaps = 20/342 (5%)
Query: 5 AFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
+L ++ ++ +++ + E + + +H K Y SE E + R+KI+ +N + +HN
Sbjct: 3 GLVVLMCVVAAASAVSFFDLVKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKHKIAKHN 62
Query: 65 N---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDR-------RRNASVQSPGNL 114
G F + N + D+ H EF + GF+ + + R A+ P N+
Sbjct: 63 QKFARGQVPFRVKQNKYGDMLHHEFVHTMNGFNKTTKNGKGLFGKSAGERGATFIPPANV 122
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
R VP +DWRK GAVTEVKDQ CG+CW+FSATGA+EG + T LVSLSEQ LIDC
Sbjct: 123 R-VPDHVDWRKHGAVTEVKDQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNLIDCST 181
Query: 175 SY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
+Y N+GC GGLMD A++++ N GIDTEK YPY +C N + G+ D+P
Sbjct: 182 AYGNNGCNGGLMDNAFKYIKDNKGIDTEKSYPYEAVDDKCRYNPRNSGADDV-GFIDIPS 240
Query: 234 NNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGI-FTGPC-STSLDHAVLIVGYDS-ENG 289
+E +L+ AV PVSV I S+ FQ YS G+ F C STSLDH VL+VGY + ENG
Sbjct: 241 GDEGKLMAAVATVGPVSVAIDASQETFQFYSDGVYFDENCSSTSLDHGVLVVGYGTDENG 300
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWGRSWG GY+ M RN N CGI AS+P
Sbjct: 301 GDYWLVKNSWGRSWGDLGYIKMARNRDNH---CGIATAASFP 339
>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 143/335 (42%), Positives = 187/335 (55%), Gaps = 37/335 (11%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F+ W K +G Y ++E + R I++ N ++ + NS + L+ N FADLT++EF +
Sbjct: 5 FDRWLKXNGXNYEDKEEWEIRFVIYQANVEYIGCKKSQKNS-YNLTDNKFADLTNEEFVS 63
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG--------- 139
++LGF+ I H R + GNL P S DWRK+GAVT++KDQ +CG
Sbjct: 64 TYLGFATRLIPHTRFK---YHEHGNL---PXSKDWRKEGAVTDIKDQGNCGKHSTWFSPE 117
Query: 140 --------------------ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNS 178
+ WAFS A+E INKI +G LVSLSEQEL+D D + N
Sbjct: 118 ISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVANKNQ 177
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD + F+ KN G+ T KDYPY G G CNK+K H V I GY+ P +E
Sbjct: 178 GCEGGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERAPSKDEAM 237
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNS 298
L A QP+SV I AFQLYS G+F+G C L+H V IVGYD Y +KNS
Sbjct: 238 LKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFDKYRTVKNS 297
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
G WG +GY+ M+R+ + G CGI M ASYP K
Sbjct: 298 XGADWGESGYIRMKRDAFDKAGTCGIAMKASYPLK 332
>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
Length = 344
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 149/350 (42%), Positives = 207/350 (59%), Gaps = 27/350 (7%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L L ++ +S++ + + E + + QH Y SE E R+KI+ ++ +
Sbjct: 1 MKCLVLLLCAVAAVSAV--QFFDLVKEEWSAFKLQHRLNYKSEVEDNFRMKIYAEHKHII 58
Query: 61 TQHN---NMGNSSFTLSLNAF---ADLTHQEFKASFLGFSAASIDHDRR--------RNA 106
+HN MG S+ L +N++ D+ H EF + GF+ + H++ R A
Sbjct: 59 AKHNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVKTMNGFNKTA-KHNKNLYMKGGSVRGA 117
Query: 107 SVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSE 166
SP N++ +P +DWRK GAVT++KDQ CG+CW+FS TGA+EG + +G LVSLSE
Sbjct: 118 KFISPANVK-LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSE 176
Query: 167 QELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTI 225
Q LIDC Y N+GC GGLMD A++++ N GIDTE+ YPY G +C N +
Sbjct: 177 QNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKCRYNPKNTGAEDV 236
Query: 226 DGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIV 282
G+ D+PE +E++L++AV PVSV I S FQLYSSG++ ST LDH VL+V
Sbjct: 237 -GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSSTDLDHGVLVV 295
Query: 283 GYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
GY + E GVDYW++KNSWGRSWG GY+ M RN N CGI ASYP
Sbjct: 296 GYGTDEQGVDYWLVKNSWGRSWGELGYIKMIRNKNNR---CGIASSASYP 342
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 189/312 (60%), Gaps = 7/312 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ F ++ + K+Y++E+EKQ+R IF++N ++ HN G S++L +N F DL+
Sbjct: 112 FQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQG-YSYSLKMNHFGDLSRD 170
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNL-RDVPASIDWRKKGAVTEVKDQASCGACWA 143
EF+ +LGF + + + L ++PA +DWR +G VT VKDQ CG+CWA
Sbjct: 171 EFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWA 230
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEK 202
FS TGA+EG + TG LVSLSEQEL+DC R+ N C GG M+ A+Q+V+ + GI +E
Sbjct: 231 FSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSED 290
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
YPY + +C Q + +V I G+KDVP +E + A+ PVS+ I + FQ Y
Sbjct: 291 AYPYLARDEECRAQSCEK-VVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFY 349
Query: 263 SSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
G+F C T LDH VL+VGY D E+ D+WI+KNSWG WG +GYM+M + G G
Sbjct: 350 HEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEE-G 408
Query: 321 ICGINMLASYPT 332
CG+ + AS+P
Sbjct: 409 QCGLLLDASFPV 420
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma variegatum]
Length = 337
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 135/304 (44%), Positives = 191/304 (62%), Gaps = 12/304 (3%)
Query: 36 HGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQEFKASFLG 92
HGK Y SE E+ RLKI+ +N + +HN S+ L++N + D+ H EF ++ G
Sbjct: 36 HGKEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNG 95
Query: 93 FSAASIDHDRRRNASVQSPG-NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIE 151
F R+ + ++ G + +P ++DWRKKGAVT VK+Q CG+CWAFS TG++E
Sbjct: 96 FRRDYRSKPRQGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 155
Query: 152 GINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQA 210
G + +G +VSLSEQ L+DC ++ N+GC GGLMD A++++ N GIDTEK YPY G
Sbjct: 156 GQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTD 215
Query: 211 GQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTG 269
G C+ +K + T G+ D+PE NE L +AV P+SV I S ++FQ YS G++
Sbjct: 216 GTCHFKKSDVG-ATDTGFVDIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDE 274
Query: 270 P--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
P S +LDH VL+VGY +++ DYW++KNSWG +WG GY++M RN N CGI
Sbjct: 275 PECSSENLDHGVLVVGYGTKDDQDYWLVKNSWGTTWGDGGYIYMTRNKDNQ---CGIASS 331
Query: 328 ASYP 331
ASYP
Sbjct: 332 ASYP 335
>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 150/314 (47%), Positives = 191/314 (60%), Gaps = 18/314 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
F W Q G++Y+S E+ QR +I+ N V HN M G S+ L + FAD+ ++E
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 86 FKASF----LGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+K LG AS+ RR +A ++ P D+P S+DWR+KG VTEVKDQ CG+C
Sbjct: 86 YKRQISQGCLGSFNASLP--RRGSAYLRLPEGA-DLPNSVDWREKGYVTEVKDQKQCGSC 142
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS TG++EG TG LVSLSEQ+L+DC Y N GC GGLMD A++++ N GIDT
Sbjct: 143 WAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDT 202
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAF 259
E YPY + GQC N T GY DV + +E L +AV PVSV I S +F
Sbjct: 203 EDSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEAVATIGPVSVAIDASHSSF 261
Query: 260 QLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLY SG++ P CS+S LDH VL VGY S+NG DYW++KNSWG WG GY+ M RN N
Sbjct: 262 QLYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNKHN 321
Query: 318 SLGICGINMLASYP 331
CGI +SYP
Sbjct: 322 Q---CGIATASSYP 332
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI; AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 142/330 (43%), Positives = 193/330 (58%), Gaps = 8/330 (2%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L+F SI+ S L + +LFE+W +H K Y + EK R +IF+DN ++ +
Sbjct: 23 LSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDET 82
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDW 123
N N+S+ L LN FAD+++ EFK + G A + V + G++ ++P +DW
Sbjct: 83 NKK-NNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDV-NIPEYVDW 140
Query: 124 RKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGG 183
R+KGAVT VK+Q SCG+CWAFSA IEGI KI TG+L SEQEL+DCDR + GC GG
Sbjct: 141 RQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR-SYGCNGG 199
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 243
A Q V + +GI YPY G C ++ + DG + V NE LL ++
Sbjct: 200 YPWSALQLVAQ-YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSI 258
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 303
QPVSV + + + FQLY GIF GPC +DHAV VGY G +Y +IKNSWG W
Sbjct: 259 ANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY----GPNYILIKNSWGTGW 314
Query: 304 GMNGYMHMQRNTGNSLGICGINMLASYPTK 333
G NGY+ ++R TGNS G+CG+ + YP K
Sbjct: 315 GENGYIRIKRGTGNSYGVCGLYTSSFYPVK 344
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 141/321 (43%), Positives = 195/321 (60%), Gaps = 18/321 (5%)
Query: 18 PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLS 74
PL + ++E++ + H K Y++E E +R I+E + + QHN ++G +F+L
Sbjct: 13 PLVFDEALDEMWTLFKTTHSKTYATEAEDMRRF-IWERHLNMINQHNIEADLGKHTFSLG 71
Query: 75 LNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKD 134
+N + DLT E+ A+ G+ A +S P NL+ VP ++DWR+KG VT VK+
Sbjct: 72 MNEYGDLTQHEY-AAMSGYKMAK----SSVGSSFLEPENLQ-VPKTVDWREKGYVTPVKN 125
Query: 135 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVI 193
Q CG+CWAFS+TG++EG TG L S+SEQ L+DC R N GC GGLMD A+ ++
Sbjct: 126 QGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIK 185
Query: 194 KNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGI 252
KN GID+EK YPY G+C +K + + T G+ D+P +E L AV + PVSV I
Sbjct: 186 KNMGIDSEKSYPYEAVDGECRYKKSDS-VTTDSGFVDIPHGDETALRTAVASVGPVSVAI 244
Query: 253 CGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMH 310
S +FQ Y +G++T ST LDH VL+VGY ENG DYW++KNSWG SWG GY+
Sbjct: 245 DASHTSFQFYKTGVYTEANCSSTQLDHGVLVVGYGVENGQDYWLVKNSWGASWGEAGYIK 304
Query: 311 MQRNTGNSLGICGINMLASYP 331
+ RN GN CGI ASYP
Sbjct: 305 LARNHGNQ---CGIASQASYP 322
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 179/316 (56%), Gaps = 18/316 (5%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
EL+E W QH + EK +R +F+DN + + N + + L LN F D+T E
Sbjct: 46 ELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDMTADE- 102
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
S ++++ + H R + L GAV VKDQ CG+CWAFS
Sbjct: 103 --SAGAYASSRVSHHRMFRGRGEKAQRLH-----------GAVGAVKDQGQCGSCWAFST 149
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A+EGIN I T +L +LSEQ+L+DCD ++ N+GC GGLMD A+Q++ K+ G+ YP
Sbjct: 150 IAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVAASSAYP 209
Query: 206 YRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
YR + C + VTIDGY+DVP N+E L +AV QPVSV I FQ YS G
Sbjct: 210 YRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHFQFYSEG 269
Query: 266 IFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+F G C T LDH V VGY + +G YWI++NSWG WG GY+ M+R+ G+CGI
Sbjct: 270 VFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSAKEGLCGI 329
Query: 325 NMLASYPTKTGQNPPP 340
M ASYP KT NP P
Sbjct: 330 AMEASYPIKTSPNPAP 345
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 141/308 (45%), Positives = 188/308 (61%), Gaps = 10/308 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+F W + H K+YS+E E R ++ +NY F+ Q N N+S+ L++N F DLT+ EF
Sbjct: 29 VFADWMRTHTKSYSNE-EFVFRWNVWRENYNFI-QEENRKNNSYYLTMNKFGDLTNAEFN 86
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
+ G + H + A+ + +PA+ DWR+KGAVT VK+Q CG+CW+FS T
Sbjct: 87 KVYKGLAFDYSAHILKAKAATPAA-PAPGLPANFDWRQKGAVTHVKNQGQCGSCWSFSTT 145
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
G+ EG N + G+LVSLSEQ LIDC SY N+GC GGLMDYA++++I N GIDTE YPY
Sbjct: 146 GSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPY 205
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
C N ++ Y DV +E LL AV +P SV I S +FQ YS G+
Sbjct: 206 ETAQYNCRYNPANSG-GSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHNSFQFYSGGV 264
Query: 267 F--TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+ + ST LDH VL VG+ +ENG DYW++KNSWG WG+ GY+ M RN N+ CGI
Sbjct: 265 YYESSCSSTQLDHGVLAVGWGTENGQDYWLVKNSWGADWGLQGYIKMARNRHNN---CGI 321
Query: 325 NMLASYPT 332
ASYPT
Sbjct: 322 ATAASYPT 329
>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
Length = 260
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 135/276 (48%), Positives = 170/276 (61%), Gaps = 27/276 (9%)
Query: 75 LNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAV 129
LN FAD+T+ EF++ + + + ++H R G N+ VP+SIDWRK GAV
Sbjct: 2 LNKFADMTNYEFRSIY---ADSKVNHHRMFRGMSHDNGPFMYENVEGVPSSIDWRKIGAV 58
Query: 130 TEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 189
T VKDQ CG+CWAFS A+EGIN+I T LVSLSEQEL+DCD N GC GGLM+YA+
Sbjct: 59 TGVKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTEVNQGCNGGLMEYAF 118
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVS 249
+F IK +GI TE +YPY + G CN QK N+ V+IDG+++VP NNEK LL+A QP+S
Sbjct: 119 EF-IKQNGITTETNYPYAAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPIS 177
Query: 250 VGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYM 309
V I FQ YS G+FTG C T L+H V NSWG WG GY+
Sbjct: 178 VAIDAGGSDFQFYSEGVFTGHCGTELNHGV-----------------NSWGSEWGEQGYI 220
Query: 310 HMQRNTGNSLGICGINMLASYP-TKTGQNPPPSPPP 344
MQR + G+CGI M ASYP K+ +NP S P
Sbjct: 221 RMQRAISHKQGLCGIAMEASYPIKKSSKNPTKSSLP 256
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/301 (42%), Positives = 190/301 (63%), Gaps = 21/301 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F ++ +GK+Y++E+E Q+R IF++N A++ HN G S++L +N F DL+ +EF+
Sbjct: 119 FGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQG-YSYSLKMNHFGDLSREEFRR 177
Query: 89 SFLGFSAASIDHDRRRNASVQSPG--------NLRDVPASIDWRKKGAVTEVKDQASCGA 140
+LG+ ++ RN + G + DVP+++DWR+KG VT VKDQ CG+
Sbjct: 178 KYLGY-------NKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGS 230
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGID 199
CWAFSATGA+EG + TG L+SLSEQEL+DC + N GC GG M+ A+Q+V+ + G+
Sbjct: 231 CWAFSATGALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLC 290
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
+E+ YPY + G+C ++ + +VTI G+KDVP +E + A+ PVS+ I + F
Sbjct: 291 SEEGYPYLARDGEC--KRACKKVVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPF 348
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
Q Y G+F C T LDH VL+VGY D E D+WI+KNSWG WG +GYM+M + G
Sbjct: 349 QFYHEGVFDASCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYMAMHKGE 408
Query: 318 S 318
Sbjct: 409 E 409
>gi|288548564|gb|ADC52430.1| cathepsin L1 cysteine protease [Pinctada fucata]
Length = 331
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 196/319 (61%), Gaps = 17/319 (5%)
Query: 21 YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNA 77
+ +++++ + + K Y +++E+ +RL ++EDN ++ +HN + G F L N
Sbjct: 20 FRAELDQEWAIYKDMFAKNYVADEERMRRL-VWEDNIDYIEKHNRRADRGEHKFWLGTNE 78
Query: 78 FADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQAS 137
+AD+T EFKA GF I + + + SP N+ D+P +DWR KG VT VK+Q
Sbjct: 79 YADMTIDEFKAIMNGF----IMQNGTKGDTYMSPSNIGDLPDKVDWRDKGYVTPVKNQGH 134
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNH 196
CG+CW+FSATG++EG + TG LVSLSEQ LIDC + N GC GGLMD+A++++ KN
Sbjct: 135 CGSCWSFSATGSLEGQHFKSTGKLVSLSEQNLIDCSKKEGNHGCKGGLMDFAFEYIQKND 194
Query: 197 GIDTEKDYPYRGQAG-QCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICG 254
GIDTE+ YPY + G +C +K + T G D+P +EK L +AV P+SV +
Sbjct: 195 GIDTEQSYPYTAKDGIECRFKKADVG-ATDKGKVDLPRQSEKALQEAVATVGPISVAMDA 253
Query: 255 SERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
R+FQLY GI+T P ST LDH VL VGY SE DYW++KNSWG +WGM G+ +
Sbjct: 254 GHRSFQLYKRGIYTEPMCSSTKLDHGVLAVGYGSEGEGDYWLVKNSWGATWGMEGFFMLA 313
Query: 313 RNTGNSLGICGINMLASYP 331
RN N CGI ASYP
Sbjct: 314 RNHRNE---CGIATQASYP 329
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 133/335 (39%), Positives = 192/335 (57%), Gaps = 11/335 (3%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
+ F L + ++ + P +D + + FE W ++G+ Y EK +R +IF++N
Sbjct: 7 VVFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVN 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+ N+ +S+TL +N F D+T+ EF A + G + ++ +R S ++ VP
Sbjct: 67 HIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPLNIEREPVVSFDDV-DISAVP 125
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
SIDWR GAVT VK+Q CGACWAF+A +E I KI G L LSEQ+++DC + Y
Sbjct: 126 QSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKGY-- 183
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG A++F+I N G+ + YPY+ G C + I GY VP NNE
Sbjct: 184 GCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCKTNGVPNS-AYITGYARVPRNNESS 242
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKN 297
++ AV QP++V + + FQ Y SG+F GPC TSL+HAV +GY + NG YWI+KN
Sbjct: 243 MMYAVSKQPITVAVDANAN-FQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKN 301
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG WG GY+ M R+ +S GICGI + + YPT
Sbjct: 302 SWGARWGEAGYIRMARDVSSSSGICGIAIDSLYPT 336
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/338 (42%), Positives = 211/338 (62%), Gaps = 19/338 (5%)
Query: 7 FLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ-- 62
FLL + ++ + N+ SD + L+E W +H K YSS EK +R +IF+DN ++ Q
Sbjct: 10 FLLFVSAITCISTNWRSDDEVIALYEEWLVKHQKLYSSLGEKIKRFEIFKDNLRYIDQQN 69
Query: 63 -HNNMGNSSFTLSLNAFADLTHQEFKASFLGFS-------AASIDHDRRRNASVQSPGNL 114
+N + + +FTL LN FADLT EF + +LG S +++ +HD ++ ++
Sbjct: 70 HYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISSNPNHDDVEEDILKE--DV 127
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
++P S+DWR+KG V +++Q CG+CW FSA +IE +N I G +++LSEQEL+DC+
Sbjct: 128 VELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKGHMIALSEQELLDCE- 186
Query: 175 SYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
+ + GC GG + A+ +V KN GI +E+ YPY + GQC +++ +V I GYK VP N
Sbjct: 187 TISQGCKGGHYNNAFAYVAKN-GITSEEKYPYIFRQGQCYQKE---KVVKISGYKRVPRN 242
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
N QL AV Q VSV + + FQ Y GIF+G C LDHAV IVGY S+ G +YWI
Sbjct: 243 NGGQLQSAVAQQVVSVAVKCESKDFQFYDRGIFSGACGPILDHAVNIVGYGSKGGANYWI 302
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
++NSWG +WG NGYM +Q+N+ + G CGI M SYP
Sbjct: 303 MRNSWGTNWGENGYMRIQKNSKHYEGHCGIAMQPSYPV 340
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 138/317 (43%), Positives = 197/317 (62%), Gaps = 15/317 (4%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
S+++ ++ + K HGK Y +E+E ++R+ I+E N ++ +HN + G+ SF L +N +
Sbjct: 21 SELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYG 79
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
D+T++EF+++ G+ + + R + P N+ D+P ++DWR KG VT +K+Q CG
Sbjct: 80 DMTNEEFRSTMNGYK---MRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCG 136
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDC-DRSYNSGCGGGLMDYAYQFVIKNHGI 198
+CW+FSATG++EG TG L SLSEQ L+DC + N GC GGLMD A+Q++ N GI
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGI 196
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSER 257
DTE YPY + G+C N T G+ D+ +E L AV P+SV I S
Sbjct: 197 DTESSYPYEAKNGKCRFNAANVG-ATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHM 255
Query: 258 AFQLYSSGIFTG-PCS-TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
+FQLY SG++ CS T LDH VL VGY +E+G DYW++KNSWG SWG GY+ M RN
Sbjct: 256 SFQLYRSGVYHEFFCSETRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIMMSRNK 315
Query: 316 GNSLGICGINMLASYPT 332
N+ CGI ASYPT
Sbjct: 316 RNN---CGIATSASYPT 329
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 143/315 (45%), Positives = 188/315 (59%), Gaps = 17/315 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
+ + ++ + +HG+ Y+S QE++ RL +FE N F+ HN G +FTL +N F D+
Sbjct: 18 LRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDM 77
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E A+ GF A RR A+V + +P +DWR KGAVT VKDQ CG+C
Sbjct: 78 TSEEIVATMNGFLGAPT----RRPAAVLKADD-ETLPEKVDWRTKGAVTPVKDQKQCGSC 132
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC-DRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS TG++EG + + G LVSLSEQ L+DC D+ N GC GGLMD A++++ N GIDT
Sbjct: 133 WAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKGIDT 192
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAF 259
E YPY Q G+C N T GY DV +E L +AV P+SVGI S+ F
Sbjct: 193 EDSYPYEAQDGKCRFDASNVG-ATDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTF 251
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
Y +G++ ST LDH VL VGY S ENG D+W++KNSW SWG GY+ M RN
Sbjct: 252 HFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRN 311
Query: 317 NSLGICGINMLASYP 331
N+ CGI ASYP
Sbjct: 312 NN---CGIASQASYP 323
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 198/317 (62%), Gaps = 15/317 (4%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
S+++ ++ + K HGK Y +E+E ++R+ I+E N ++ +HN + G+ SF L +N +
Sbjct: 21 SELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYG 79
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
D+T++EF+++ G+ + + R + P N+ D+P ++DWR KG VT +K+Q CG
Sbjct: 80 DMTNEEFRSTMNGYK---MRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCG 136
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGI 198
+CW+FSATG++EG TG L SLSEQ L+DC + N GC GGLMD A+Q++ N+GI
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGI 196
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSER 257
DTE YPY + G+C N T G+ D+ +E L AV P++V I S
Sbjct: 197 DTESSYPYEAKNGKCRFNAANVG-ATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHM 255
Query: 258 AFQLYSSGIFTG-PCS-TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
+FQLY SG++ CS T LDH VL VGY +E+G DYW++KNSWG SWG GY+ M RN
Sbjct: 256 SFQLYKSGVYHEFFCSETRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIMMSRNK 315
Query: 316 GNSLGICGINMLASYPT 332
N+ CGI ASYPT
Sbjct: 316 RNN---CGIATSASYPT 329
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 187/312 (59%), Gaps = 17/312 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K+Y S+ E+ R KIF +N + +HN G S+ L +N F DL E
Sbjct: 7 WEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHE 66
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACWA 143
F F G+ + R ++ P N+ D +P ++DWRKKGAVT VKDQ CG+CWA
Sbjct: 67 FAKMFNGYHGER----KGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWA 122
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEK 202
FSATG++EG + + +G LVSLSEQ LIDC S+ N GCGGGLMD A++++ N GIDTE+
Sbjct: 123 FSATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEE 182
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQL 261
YPY G C +K + T G+ D+ + +E L +AV P+SV I S +FQL
Sbjct: 183 SYPYEAMDGDCRFKKEDVG-ATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQL 241
Query: 262 YSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G++ P S LDH VL VGY +NG YW++KNSW +WG NGY+ M R+ N
Sbjct: 242 YSEGVYDEPNCSSEELDHGVLAVGYGVKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQ- 300
Query: 320 GICGINMLASYP 331
CGI ASYP
Sbjct: 301 --CGIASSASYP 310
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 186/312 (59%), Gaps = 16/312 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADLTHQE 85
+E + H K+Y S E+ R KIF +N V +HN G S+ L +N F DL E
Sbjct: 27 WEAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPHE 86
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLR--DVPASIDWRKKGAVTEVKDQASCGACWA 143
F F G+ A R ++ P N+ +P S+DWR+KGAVT VK+Q CG+CWA
Sbjct: 87 FARMFNGYRGART---AGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWA 143
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEK 202
FS TG++EG + + TG LVSLSEQ L+DC ++ N GC GGLMD A+Q++ N GIDTEK
Sbjct: 144 FSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYIKANGGIDTEK 203
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQL 261
YPY + G+C +K N T G+ D+ + +E L +AV PVSV I S +FQL
Sbjct: 204 SYPYEAEDGECRFKKQNVG-ATDTGFVDIEQGSEDDLKKAVATVGPVSVAIDASHSSFQL 262
Query: 262 YSSGIF--TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G++ T S LDH VL+VGY E+G YW++KNSW SWG NGY+ M R+ N
Sbjct: 263 YSEGVYDETECSSEQLDHGVLVVGYGVEDGKKYWLVKNSWAESWGDNGYIKMSRDKDNQ- 321
Query: 320 GICGINMLASYP 331
CGI ASYP
Sbjct: 322 --CGIASAASYP 331
>gi|326431661|gb|EGD77231.1| cysteine protease [Salpingoeca sp. ATCC 50818]
Length = 347
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 190/317 (59%), Gaps = 20/317 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADLTHQE 85
FE + ++ K Y S +E+ +R IF+++ F+ +HN G ++ + +N FADLT +E
Sbjct: 31 FEEFKDKYNKVYESAEEEARRAAIFQESLDFIEKHNAEAAAGMHTYLVGVNEFADLTREE 90
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS--------IDWRKKGAVTEVKDQAS 137
F+ + + D D+R + + V A+ IDWRK+GAVT V++Q
Sbjct: 91 FRQHHV--TRLPFDDDKRDPVTATLHLDEHAVHAADSNGDSSGIDWRKRGAVTPVRNQGQ 148
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
CG F+A A+EG++ I +G+LV LS Q++IDC S GC GG + ++++ +N G
Sbjct: 149 CGNPAIFAAVEAVEGMHAISSGNLVELSTQQVIDC--SGTPGCSGGSLVSFFKYIARNGG 206
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 257
+D+ DYP G GQCNK K RH+ + GY VP NE +L AV PV+V I
Sbjct: 207 LDSAADYPTSGAGGQCNKAKEARHVAKVGGYSVVPPRNETKLAAAVFKMPVAVAIEADTP 266
Query: 258 AFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
+FQ+Y+SG+++GPC T LDHAVL+VGY E YWI+KNSWG SWG GY+ M+R G
Sbjct: 267 SFQMYTSGVYSGPCGTQLDHAVLVVGYTDE----YWIVKNSWGASWGDQGYIMMKRGVG- 321
Query: 318 SLGICGINMLASYPTKT 334
+ GICGI + A YPT T
Sbjct: 322 AAGICGITLDAMYPTAT 338
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 205/341 (60%), Gaps = 17/341 (4%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M S + LLS+++ ++ +++ + +E+W H K Y S E++ RLKIF +N +
Sbjct: 1 MKSQSILLLSVIISTASAVSFFDVVLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRI 60
Query: 61 TQHNN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
++HN G ++ + +N + DL H EF A G+ I +++ P ++
Sbjct: 61 SRHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGY----IYNNKTTLGGTFIPSKNINL 116
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P +DWR++GAVT VK+Q CG+CW+FSATG++EG + TG L+SLSEQ L+DC R Y
Sbjct: 117 PEHVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYG 176
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N+GC GGLMDYA++++ N+GIDTE YPY G G C+ N+ I G+ D+ + +E
Sbjct: 177 NNGCEGGLMDYAFKYIQDNNGIDTEASYPYEGIDGHCHYDPKNKGGSDI-GFVDIKKGSE 235
Query: 237 KQLLQAV-VAQPVSVGICGSERAFQLYSSGIFT-GPCS-TSLDHAVLIVGY--DSENGVD 291
K L +A+ P+SV I S +FQ YS G+++ CS +LDH VL VGY D G D
Sbjct: 236 KDLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGTDEVTGED 295
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSW WG +GY+ M RN N +CGI ASYP
Sbjct: 296 YWLVKNSWSEKWGEDGYIKMARNKDN---MCGIASSASYPV 333
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 192/320 (60%), Gaps = 17/320 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
+ E +E++ +H K Y S+ E+ R+KIF +N + HN + G+ ++ L +N + D+
Sbjct: 25 VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDVPASIDWRKKGAVTEVKDQA 136
H EF GF A + + N Q P +P S+DWR+KGAVTEVKDQ
Sbjct: 85 LHHEFVNMMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQG 144
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKN 195
SCG+CWAFSATGA+EG + TG LVSLSEQ L+DC + N+GC GGLMD A+Q++ N
Sbjct: 145 SCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVN 204
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICG 254
GIDTEK YPY + C N G+ DV E NE L +A+ PVSV I
Sbjct: 205 GGIDTEKSYPYEAEDEPCRYNPANAG-ADDRGFVDVREGNENALKKAIATIGPVSVAIDA 263
Query: 255 SERAFQLYSSGIFTGP-CST-SLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHM 311
S+ +FQ Y G+++ P CS +LDH VL VGY +E+G DYW++KNSW +SWG GY+ +
Sbjct: 264 SQDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYIKI 323
Query: 312 QRNTGNSLGICGINMLASYP 331
RN N +CGI ASYP
Sbjct: 324 ARNQNN---MCGIASAASYP 340
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 133/335 (39%), Positives = 192/335 (57%), Gaps = 11/335 (3%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
L F L + ++ + P +D + + FE W ++G+ Y EK +R +IF++N
Sbjct: 7 LVFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVN 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+ N+ S+TL +N F D+T+ EF A + G + ++ +R S ++ VP
Sbjct: 67 HIETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPLNIEREPVVSFDDV-DISAVP 125
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
SIDWR GAVT VK+Q CGACWAF+A +E I KI G L LSEQ+++DC + Y
Sbjct: 126 QSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKGY-- 183
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG A++F+I N G+ + YPY+ G C + I GY VP NNE
Sbjct: 184 GCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCKTNGVPNS-AYITGYARVPRNNESS 242
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKN 297
++ AV QP++V + + + Q Y+SG+F GPC TSL+HAV +GY + NG YWI+KN
Sbjct: 243 MMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKN 301
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SWG WG GY+ M R+ +S GICGI + + YPT
Sbjct: 302 SWGARWGEAGYIRMARDVSSSSGICGIAIDSLYPT 336
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 142/326 (43%), Positives = 198/326 (60%), Gaps = 22/326 (6%)
Query: 21 YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNA 77
+ + E + T+ +H K Y E E++ RLKIF +N + +HN G SF L++N
Sbjct: 51 FADVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNK 110
Query: 78 FADLTHQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDVPASIDWRKKGAVTEV 132
+ADL H EF+ GF+ R + S + SP ++ +P S+DWR KGAVT V
Sbjct: 111 YADLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVT-LPKSVDWRTKGAVTAV 169
Query: 133 KDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQF 191
KDQ CG+CWAFS+TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A+++
Sbjct: 170 KDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRY 229
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPV 248
+ N GIDTEK YPY C+ N+ V T G+ D+P+ +EK++ +AV PV
Sbjct: 230 IKDNGGIDTEKSYPYEAIDDSCH---FNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPV 286
Query: 249 SVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGM 305
SV I S +FQ YS G++ P + +LDH VL+VG+ + E+G DYW++KNSWG +WG
Sbjct: 287 SVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGD 346
Query: 306 NGYMHMQRNTGNSLGICGINMLASYP 331
G++ M RN N CGI +SYP
Sbjct: 347 KGFIKMLRNKENQ---CGIASASSYP 369
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/315 (45%), Positives = 189/315 (60%), Gaps = 22/315 (6%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W +HG+ Y++E+EK +RL++F N + N+ +S+ L+ N FADLT +EF+A+
Sbjct: 45 EKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAA 104
Query: 90 FLGF---------SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
G + + R N S L D S+DWR GAVT VKDQ SCG
Sbjct: 105 RTGLRRPPAAAAGAGSGAGGFRYENFS------LADAAGSMDWRAMGAVTGVKDQGSCGC 158
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGID 199
CWAFSA A+EG+ KI TG LVSLSEQ+L+DCD + GC GGLMD A++++I G+
Sbjct: 159 CWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGLT 218
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TE YPYRG G C + +I GY+DVP NNE L+ AV QPVSV I G + F
Sbjct: 219 TESSYPYRGTDGSCRRSA---SAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDSVF 275
Query: 260 QLYSSGIFTGP-CSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
+ Y SG+ G C T L+HA+ VGY + +G YWI+KNSWG SWG GY+ ++R
Sbjct: 276 RFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGV-R 334
Query: 318 SLGICGINMLASYPT 332
G+CG+ LASYP
Sbjct: 335 GEGVCGLAQLASYPV 349
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 142/326 (43%), Positives = 198/326 (60%), Gaps = 22/326 (6%)
Query: 21 YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNA 77
+ + E + T+ +H K Y E E++ RLKIF +N + +HN G SF L++N
Sbjct: 55 FADVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNK 114
Query: 78 FADLTHQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDVPASIDWRKKGAVTEV 132
+ADL H EF+ GF+ R + S + SP ++ +P S+DWR KGAVT V
Sbjct: 115 YADLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVT-LPKSVDWRTKGAVTAV 173
Query: 133 KDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQF 191
KDQ CG+CWAFS+TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A+++
Sbjct: 174 KDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRY 233
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPV 248
+ N GIDTEK YPY C+ N+ V T G+ D+P+ +EK++ +AV PV
Sbjct: 234 IKDNGGIDTEKSYPYEAIDDSCH---FNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPV 290
Query: 249 SVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGM 305
SV I S +FQ YS G++ P + +LDH VL+VG+ + E+G DYW++KNSWG +WG
Sbjct: 291 SVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGD 350
Query: 306 NGYMHMQRNTGNSLGICGINMLASYP 331
G++ M RN N CGI +SYP
Sbjct: 351 KGFIKMLRNKENQ---CGIASASSYP 373
>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 191/314 (60%), Gaps = 18/314 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
F W Q G++Y+S E+ QR +I+ N V HN M G S+ L + FAD+ ++E
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 86 FKASF----LGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+K LG AS+ RR +A ++ P D+P S+DWR+KG VT+VKDQ CG+C
Sbjct: 86 YKRQISQGCLGSFNASLP--RRGSAYLRLPEGA-DLPNSVDWREKGYVTDVKDQKQCGSC 142
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS TG++EG TG LVSLSEQ+L+DC Y N GC GGLMD A++++ N GIDT
Sbjct: 143 WAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDT 202
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAF 259
E YPY + GQC N T GY DV + +E L +A+ PVSV I S +F
Sbjct: 203 EDSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEALATIGPVSVAIDASHSSF 261
Query: 260 QLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLY SG++ P CS+S LDH VL VGY S+NG DYW++KNSWG WG GY+ M RN N
Sbjct: 262 QLYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNKHN 321
Query: 318 SLGICGINMLASYP 331
CGI +SYP
Sbjct: 322 Q---CGIATASSYP 332
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/348 (41%), Positives = 201/348 (57%), Gaps = 26/348 (7%)
Query: 1 MNSLAFFLLSILLLSS--LPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
M + AF ++ S+ +++ I E +E + Q KAY++E E++ R+K+F DN
Sbjct: 1 MKAFAFLCCVLIYHSNSVTAVSFNDLIAEEWELFKTQFSKAYNTEIEEKFRMKVFMDNKH 60
Query: 59 FVTQHNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRR------NASVQ 109
+ +HN + G S+ L +N F DL H EF + G+ H RR ++
Sbjct: 61 KIARHNKLFQNGEVSYELEMNHFGDLLHHEFVKTVNGYR-----HSLRRVTGDEIDSVTF 115
Query: 110 SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
P VP S+DWR +GAVTEVK+Q CG+CWAFS TG++EG + T L SLSEQ L
Sbjct: 116 IPAYNVTVPDSVDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNL 175
Query: 170 IDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
IDC Y N+GC GGLMD A+ ++ N GIDTE+ YPY G +C + K T G+
Sbjct: 176 IDCSGKYGNNGCSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKC-RYKPQESGATDKGF 234
Query: 229 KDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFT----GPCSTSLDHAVLIVG 283
D+P+ +E++L AV P+SV I S ++FQ Y G++ G LDH VL VG
Sbjct: 235 VDIPQGDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVG 294
Query: 284 YDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
Y +ENG DYW++KNSWG+ WG++GY+ M RN N CGI ASYP
Sbjct: 295 YGTENGKDYWLVKNSWGKRWGLDGYIKMARNKHNH---CGIATSASYP 339
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 188/307 (61%), Gaps = 18/307 (5%)
Query: 31 TWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASF 90
W H KAYS E E+ R I++DN +T++N+ + + L +N F D+T+ EF+A
Sbjct: 29 VWKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSK-SKNVILRMNHFGDMTNTEFRAKM 87
Query: 91 LGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAI 150
G + H + ++ P + P ++DWR +G VT VK+Q CG+CWAFS+TGA+
Sbjct: 88 NGL----LLHKHQNGSTFLVPSHTA-APDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGAL 142
Query: 151 EGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQ 209
EG + TG LVSLSEQ L+DC Y N+GC GGLMD A+ ++ N GIDTE YPY GQ
Sbjct: 143 EGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEGQ 202
Query: 210 AGQCNKQKLNRHIVTID--GYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGI 266
G C + ++ + D G+ D+PE +E L QAV PVSV I S +FQ Y SG+
Sbjct: 203 DGTC---RYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGV 259
Query: 267 FTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
+ P CS S LDH VL+VGY ++NG DYW++KNSWG WG GY++M RN N CGI
Sbjct: 260 YDEPQCSPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRNNQNQ---CGI 316
Query: 325 NMLASYP 331
ASYP
Sbjct: 317 ASKASYP 323
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/315 (45%), Positives = 188/315 (59%), Gaps = 17/315 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
+ + ++ + +HG+ Y+S QE++ RL +FE N F+ HN G +FTL +N F D+
Sbjct: 19 LRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDM 78
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E A+ GF A RR A+V + +P +DWR KGAVT VKDQ CG+C
Sbjct: 79 TSEEIVATMNGFLGAPT----RRPAAVLKADD-ETLPEKVDWRTKGAVTPVKDQKQCGSC 133
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC-DRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS TG++EG + + G LVSLSEQ L+DC D+ N GC GGLMD A++++ N GIDT
Sbjct: 134 WAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDT 193
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAF 259
E YPY Q G+C N T GY DV +E L +AV P+SVGI S+ F
Sbjct: 194 EDSYPYEAQDGKCRFDASNVG-ATDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTF 252
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
Y +G++ ST LDH VL VGY S ENG D+W++KNSW SWG GY+ M RN
Sbjct: 253 HFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRN 312
Query: 317 NSLGICGINMLASYP 331
N+ CGI ASYP
Sbjct: 313 NN---CGIASQASYP 324
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/322 (44%), Positives = 197/322 (61%), Gaps = 22/322 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
+ E + T+ +H K Y E E++ RLKIF +N + +HN G SF L++N +ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDVPASIDWRKKGAVTEVKDQA 136
H EF+ GF+ R + S + SP ++ +P S+DWR KGAVT VKDQ
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVT-LPKSVDWRTKGAVTAVKDQG 143
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKN 195
CG+CWAFS+TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A++++ N
Sbjct: 144 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPVSVGI 252
GIDTEK YPY C+ N+ V T G+ D+P+ +EK++ +AV PVSV I
Sbjct: 204 GGIDTEKSYPYEAIDDSCH---FNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 260
Query: 253 CGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYM 309
S +FQ YS G++ P + +LDH VL+VG+ + E+G DYW++KNSWG +WG G++
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 320
Query: 310 HMQRNTGNSLGICGINMLASYP 331
M RN N CGI +SYP
Sbjct: 321 KMLRNKENQ---CGIASASSYP 339
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 200/340 (58%), Gaps = 33/340 (9%)
Query: 3 SLAFFLLSIL-----LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNY 57
+L F +L L +L++ L+ + + E W Q+G+ Y + EK +R ++F+ N
Sbjct: 6 ALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANV 65
Query: 58 AFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFL--GFSAASIDHDRR-RNASVQSPGNL 114
AF+ + N GN F L +N FADLT+ EF+++ GF ++ RN +V N+
Sbjct: 66 AFI-ESFNAGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTGFRNENV----NI 120
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD- 173
+PA++DWR KG VT +KDQ CG CWAFSA A+E EL+DCD
Sbjct: 121 DALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAME----------------ELVDCDV 164
Query: 174 RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
+ GC GGLMD A++F+IKN G+ TE +YPY A + ++ + +I GY+DVP
Sbjct: 165 HGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPY--AAVDDKFKSVSNSVASIKGYEDVPA 222
Query: 234 NNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDY 292
NNE L++AV QPVSV + G + FQ Y G+ TG C T LDH ++ +GY + +G Y
Sbjct: 223 NNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKY 282
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
W++KNSWG +WG NG++ M+++ + G+CG+ M SYPT
Sbjct: 283 WLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPT 322
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/311 (45%), Positives = 186/311 (59%), Gaps = 12/311 (3%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQE 85
F W + G++Y + E+ QR++I+ +N V HN + G S+ L + FAD+ ++E
Sbjct: 27 FHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNEE 86
Query: 86 FKASF-LGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
+K+ LG A RR ++ +P ++DWR KG VT VKDQ CG+CWAF
Sbjct: 87 YKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCWAF 146
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKD 203
SATG++EG N TG LVSLSEQ+L+DC Y N GC GGLMDYA++++ +N GIDTEK
Sbjct: 147 SATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEKS 206
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLY 262
YPY + GQC + N GY DV +E L +AV PVSVGI S +FQLY
Sbjct: 207 YPYEAEDGQCRFKPENVG-AKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQLY 265
Query: 263 SSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
SG++ S LDH VL VGY ++NG DYW++KNSWG WG GY+ M RN N
Sbjct: 266 DSGVYDEQDCSSQDLDHGVLAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMMSRNKDNQ-- 323
Query: 321 ICGINMLASYP 331
CGI ASYP
Sbjct: 324 -CGIATAASYP 333
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 185/314 (58%), Gaps = 8/314 (2%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ E + W + + YS E EKQ R +F+ N F+ + N G+ ++ L +N FAD T +
Sbjct: 19 VAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTRE 78
Query: 85 EFKASFLGFSAAS-IDHDRRRNASVQSPG-NLRDVPA--SIDWRKKGAVTEVKDQASCGA 140
EF A+ G + I + + S N+ DV + DWR +GAVT VK Q CG
Sbjct: 79 EFIATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRETKDWRYEGAVTPVKYQGQCGC 138
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFS+ A+EG+ KIV +LVSLSEQ+L+DCDR ++GC GG+M A+ ++IKN GI +
Sbjct: 139 CWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIAS 198
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E YPY+ G C + I G++ VP NNE+ LL+AV QPVSV I F
Sbjct: 199 EASYPYQAAEGTCRYN--GKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFM 256
Query: 261 LYSSGIFTGP-CSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
YS G++ P C T+++HAV VGY S G+ YW+ KNSWG +WG NGY+ ++R+
Sbjct: 257 HYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWP 316
Query: 319 LGICGINMLASYPT 332
G+CG+ A YP
Sbjct: 317 QGMCGVAQYAFYPV 330
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 149/347 (42%), Positives = 204/347 (58%), Gaps = 24/347 (6%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M S+A L + ++ L + E + + +H K Y SE E + R+KI+ +N +
Sbjct: 1 MKSIAVLLCVVGAACAVSL--LDLVREEWSAFKLEHSKRYDSEVEDKFRMKIYLENKHRI 58
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDR--------RRNASVQ 109
+HN G S+ L N +AD+ EF GF+ ++ H + R A+
Sbjct: 59 AKHNQRFEQGAVSYKLRPNKYADMLSHEFVHVMNGFNK-TLKHPKAVHGKGRESRPATFI 117
Query: 110 SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
+P ++ P +DWRKKGAVTEVKDQ CG+CWAFS TGA+EG + TG LVSLSEQ L
Sbjct: 118 APAHVT-YPDHVDWRKKGAVTEVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNL 176
Query: 170 IDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
IDC +Y N+GC GGLMD A++++ N GIDTEK YPY G +C N + G+
Sbjct: 177 IDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTEKAYPYEGVDDKCRYNAKNSGADDV-GF 235
Query: 229 KDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYD 285
D+P+ +E++L+QAV PVSV I S+ +FQ YS G++ ST LDH V++VGY
Sbjct: 236 VDIPQGDEEKLMQAVATVGPVSVAIDASQESFQFYSDGVYYDENCSSTDLDHGVMVVGYG 295
Query: 286 S-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+ E G DYW++KNSWGR+WG GY+ M RN N CGI ASYP
Sbjct: 296 TDEQGGDYWLVKNSWGRTWGDLGYIKMARNKNNH---CGIASSASYP 339
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 144/334 (43%), Positives = 203/334 (60%), Gaps = 20/334 (5%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
F L+IL L+ ++ N + + +H K YS +++ +R I++ N + HN
Sbjct: 1 MFKLTILALAISVAAASTEAN--WAIFKAKHNKTYSGDEDIIRRY-IWQTNLQKIEAHNE 57
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASI 121
+ G S++ L N +AD+T++EF+ + G D+ G +D +P ++
Sbjct: 58 LYAKGLSTYFLGENKYADMTNEEFRRTLSGLRV-----DKELTPGDFVSGMFKDSLPTAV 112
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGC 180
DWRK+G VTEVKDQ CG+CWAFS TG++EG + T LVSLSE L+DC + + N GC
Sbjct: 113 DWRKEGYVTEVKDQGQCGSCWAFSTTGSLEGQHFKATKQLVSLSESNLVDCSKKWGNQGC 172
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGLMD A++++ N GIDTEK YPY+ + +CN +K N T YKD+ +E L
Sbjct: 173 NGGLMDNAFKYIADNKGIDTEKSYPYKPEDRKCNFKKANVG-ATDKLYKDITSGSEDALQ 231
Query: 241 QAVVA-QPVSVGICGSERAFQLYSSGIFT-GPCST-SLDHAVLIVGYDSENGVDYWIIKN 297
+AV P+SV I S +FQLYS G++ CST +LDH VL VGYDS+NG DYWI+KN
Sbjct: 232 EAVATIGPISVAIDASHDSFQLYSGGVYNEKACSTKTLDHGVLAVGYDSKNGDDYWIVKN 291
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
SWG+SWG++GY+ M RN N CGI +ASYP
Sbjct: 292 SWGKSWGIDGYIWMSRNKKNQ---CGIATMASYP 322
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 192/308 (62%), Gaps = 18/308 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F W K+H ++Y E + + F+DN F+ N NS L L FADLT++E++
Sbjct: 33 FLGWMKKHDRSYH-HHEFNNKYQAFKDNMDFIHNWNTNKNSKTVLGLTQFADLTNEEYRK 91
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+LG + ++ ++ + G P SIDWR KGAV+ VKDQ CG+CW+FS TG
Sbjct: 92 IYLG-TKVNVAPEKHNFNMIHFTG-----PDSIDWRTKGAVSHVKDQGQCGSCWSFSTTG 145
Query: 149 AIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
++EG ++I TG++V+LSEQ L+DC + N+GC GGLM A++F++ G+ TE YPY
Sbjct: 146 SVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVATEDSYPYN 205
Query: 208 GQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
G+C K + +V I GYK++ + +E +L A+ QPVS+ I S+++FQLY SG
Sbjct: 206 AVQGKC---KFTKSMVGANISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQLYKSG 262
Query: 266 IFTGP-CST-SLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
++ P CS+ LDH VL VGY +ENG DY+I+KNSW SWG +GY+ M RN N CG
Sbjct: 263 VYDEPECSSYQLDHGVLAVGYGTENGKDYYIVKNSWADSWGQDGYIFMSRNAKNQ---CG 319
Query: 324 INMLASYP 331
+ +ASYP
Sbjct: 320 VATMASYP 327
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 152/338 (44%), Positives = 209/338 (61%), Gaps = 29/338 (8%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M S+ F L ++ L SL L+ + +LF+T+ ++GK Y S E++ R K+ N ++
Sbjct: 1 MKSIFFVLFAVAL--SLNLHSDAYYEKLFQTFEAKYGKNYLS-SEREYRKKVLAYNMDWI 57
Query: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFL-GFSAASIDHDRRR---NASVQSPGNLRD 116
+ N+ SFTL + FAD+T+ EF S L G ++H + R N +V+S
Sbjct: 58 EKFNS-DEHSFTLGMTPFADMTNTEFATSKLCGCMKKPLNHKQARVLNNMAVES------ 110
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
IDWR+KGAVT VK+Q SCG+CWAFSATGA+EG N + TG LVSLSEQ+L+DCD
Sbjct: 111 ----IDWREKGAVTPVKNQGSCGSCWAFSATGALEGGNFVATGKLVSLSEQQLVDCDTE- 165
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
++GCGGG MD A+++V+K G+ TE+DYPY + C + +++I GY+DVP N+
Sbjct: 166 DAGCGGGFMDTAFEYVMKK-GLCTEEDYPYHAKDEDCKDDQCTS-VISITGYEDVPANDG 223
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIF-TGPCSTSLDHAVLIVGYDSENGVDYWII 295
L QA+ PVSV I FQ+Y+ G+ + C TSL+H VL VGY E Y I+
Sbjct: 224 VALKQALTKAPVSVAIQADSFVFQMYTGGVLDSDMCGTSLNHGVLAVGYAKE----YIIV 279
Query: 296 KNSWGRSWGMNGYMHM-QRNTGNSLGICGINMLASYPT 332
KNSWG SWG GY+ + R+ G GICGINM ASYPT
Sbjct: 280 KNSWGASWGDKGYVKIAHRDQGE--GICGINMAASYPT 315
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 255 bits (651), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 129/307 (42%), Positives = 178/307 (57%), Gaps = 3/307 (0%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F +W K+ + E R ++F N + HN +SSFT+ N ++ LT EFK
Sbjct: 28 FLSWMKKFAVKLNP-LEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKK 86
Query: 89 SFLGFSAASIDHDRRRNASVQSPG-NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
G + R ++ +P N+ DVP +DW ++G VT VK+Q CG+CWAFS T
Sbjct: 87 LRTGLRVSPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFSTT 146
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
GAIEG + + LVS+SEQEL+DCD + + GC GGLMD A+++V + G+ E+DYPY
Sbjct: 147 GAIEGAAFVSSKQLVSVSEQELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPYH 206
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
+ G C +K + + + + DVP N+E+ L AV QPVSV I + FQ Y SG+F
Sbjct: 207 AKEGTCALKKC-KPVTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSGVF 265
Query: 268 TGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
C T LDH VL+VGY E G YW +KNSWG WG GY+ + R G G CG+ M+
Sbjct: 266 DKSCGTKLDHGVLVVGYGEEGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQCGVAMV 325
Query: 328 ASYPTKT 334
SYPT +
Sbjct: 326 PSYPTAS 332
>gi|66378018|gb|AAY45870.1| cathepsin L-like cysteine proteinase [Rotylenchulus reniformis]
Length = 369
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 190/308 (61%), Gaps = 15/308 (4%)
Query: 34 KQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQEFKASF 90
+QH K+Y ++Q + +R+ + N F+ +HN G SF++ N ADL E+K
Sbjct: 67 QQHEKSYKNQQLETERMLAYLSNKQFIDKHNQAFREGKKSFSIGENHIADLPFSEYK-KL 125
Query: 91 LGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAI 150
G+ A D+ RR ++ +P N+ D+P S+DWR K VTEVK+Q CG+CWAFSATGA+
Sbjct: 126 NGYRRALGDNLRRNASTFLAPMNIGDIPESVDWRDKQWVTEVKNQGQCGSCWAFSATGAL 185
Query: 151 EGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQ 209
EG + TG LVSLSEQ L+DC + Y N GC GGLMD A+Q++ N GID E YPY+ +
Sbjct: 186 EGQHARKTGQLVSLSEQNLVDCTKKYGNMGCNGGLMDNAFQYIKDNEGIDKEMTYPYKAK 245
Query: 210 AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAFQLYSSGI-F 267
AG+C+ K N T G+ DV E +E +L AV Q PVSV I R+FQLY G+ F
Sbjct: 246 AGRCHF-KRNDVGATDTGFFDVAEGDEDKLKLAVATQGPVSVAIDAGHRSFQLYKHGVYF 304
Query: 268 TGPCS-TSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
C+ LDH VL+VGY D E+G DYWI+KNSW WG GY+ M N N+ CGI
Sbjct: 305 EEECNPEELDHGVLVVGYGTDPEHG-DYWIVKNSWSTHWGEQGYIRMAPNRNNN---CGI 360
Query: 325 NMLASYPT 332
ASYPT
Sbjct: 361 PSHASYPT 368
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 187/312 (59%), Gaps = 16/312 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + QH KAYSS E+ R KIF +N V +HN G S+ L++N F DL E
Sbjct: 27 WEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLAMNKFGDLLPHE 86
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACWA 143
F G+ ++ + + P NL D +P ++DWRKKGAVT VK+Q CG+CWA
Sbjct: 87 FAKMVNGYRGK---QNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQGQCGSCWA 143
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEK 202
FS TG++EG + TG LVSLSEQ L+DC + N GC GGLMD +Q++ N GIDTE+
Sbjct: 144 FSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGGIDTEE 203
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQL 261
+PY Q G C +K + T G+ D+ + +E L +AV PVSV I S +FQL
Sbjct: 204 SHPYTAQDGDCKFKKADVG-ATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDASHGSFQL 262
Query: 262 YSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G++ P CS+S LDH VL VGY +NG YW++KNSWG WG NGY+ M R+ N
Sbjct: 263 YSQGVYDEPDCSSSQLDHGVLTVGYGVKNGKKYWLVKNSWGGDWGDNGYILMSRDKDNQ- 321
Query: 320 GICGINMLASYP 331
CGI ASYP
Sbjct: 322 --CGIASSASYP 331
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 128/227 (56%), Positives = 159/227 (70%), Gaps = 4/227 (1%)
Query: 114 LRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD 173
+RDVP+S+DWR+KGAVT VKDQ CG+CWAFS A+EGIN I T +L SLSEQ+L+DCD
Sbjct: 58 VRDVPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCD 117
Query: 174 RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG-QAGQCNKQKLNRHIVTIDGYKDVP 232
N+GC GGLMDYA+Q++ K+ G+ E YPY+ QA CNK+ +VTIDGY+DVP
Sbjct: 118 TKSNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQASSCNKKP--SAVVTIDGYEDVP 175
Query: 233 ENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVD 291
N+E L +AV AQPV+V I S FQ YS G+F G C T LDH V VGY + +G
Sbjct: 176 ANDETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTK 235
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
YWI+KNSWG WG GY+ M+R+ + G+CGI M ASYP KT NP
Sbjct: 236 YWIVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPVKTSTNP 282
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 188/315 (59%), Gaps = 22/315 (6%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W +HG+ Y++E+EK +RL++F N + N+ +S+ L+ N FADLT +EF+A+
Sbjct: 45 EKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAA 104
Query: 90 FLGF---------SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
G + + R N S L D S+DWR GAVT VKDQ SCG
Sbjct: 105 RTGLRRPPAAAAGAGSGAGGFRYENFS------LADAAGSMDWRAMGAVTGVKDQGSCGC 158
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGID 199
CWAFSA A+EG+ KI TG LVSLSEQ+L+DCD + GC GGLMD A++++I G+
Sbjct: 159 CWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGLT 218
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TE YPYRG G C + +I GY+DVP NNE L+ AV QPVSV I G + F
Sbjct: 219 TESSYPYRGTDGSCRRSA---SAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDSVF 275
Query: 260 QLYSSGIFTGP-CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
+ Y SG+ G C T L+HA+ GY + +G YWI+KNSWG SWG GY+ ++R
Sbjct: 276 RFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGV-R 334
Query: 318 SLGICGINMLASYPT 332
G+CG+ LASYP
Sbjct: 335 GEGVCGLAQLASYPV 349
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 179/308 (58%), Gaps = 10/308 (3%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F W H + Y+S QE+ R +I+ N + +HN G S+TL +N F DL H EF A
Sbjct: 21 FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
+LG ++ + +S P + +P S+DWR G VT VK+Q CG+CW+FS TG
Sbjct: 81 KYLGVRFNGVNATKSFASSTYLP-RMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTG 139
Query: 149 AIEGINKIVTGSLVSLSEQELIDC-DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
++EG + TG+LVSLSEQ L+DC + N GC GGLMD A++++IKN GIDTE YPY
Sbjct: 140 SVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYT 199
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGI 266
G C N T+ Y+D+ +E L AV PVSV I S FQ Y +G+
Sbjct: 200 ATTGTCKFNAANIG-ATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGV 258
Query: 267 FT-GPCSTS-LDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
+ CST+ LDH VL VGY S G DYW++KNSWG +WG GY+ M RN N CG
Sbjct: 259 YNEKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADNQ---CG 315
Query: 324 INMLASYP 331
I ASYP
Sbjct: 316 IATSASYP 323
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 193/312 (61%), Gaps = 22/312 (7%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQEFKASFL 91
+H K Y E E++ RLKIF +N + +HN + G S+ L++N +AD+ H EF+
Sbjct: 111 EHRKNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMN 170
Query: 92 GFSAASIDHDRRRNASVQ-----SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
GF+ R + S + SP ++ +P S+DWR KGAVT VKDQ CG+CWAFS+
Sbjct: 171 GFNYTLHKELRAADESFKGVTFISPEHVT-LPKSVDWRDKGAVTGVKDQGHCGSCWAFSS 229
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A++++ N GIDTEK YP
Sbjct: 230 TGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 289
Query: 206 YRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLY 262
Y C+ N+ + T G+ D+P+ NEK+L +AV PVSV I S +FQ Y
Sbjct: 290 YEALDDSCH---FNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFY 346
Query: 263 SSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
S G++ P + +LDH VL+VG+ + E+G DYW++KNSWG +WG G++ M RN N
Sbjct: 347 SEGVYVEPACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQ- 405
Query: 320 GICGINMLASYP 331
CGI +SYP
Sbjct: 406 --CGIASASSYP 415
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 150/345 (43%), Positives = 200/345 (57%), Gaps = 30/345 (8%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINEL--------FETWCKQHGKAYSSEQEKQQRLKIFED 55
LA FL+ L++ L +N C+ N F W K+H KAY E + + F+D
Sbjct: 3 LAVFLIVSLVI--LSINVCAATNLFSAQTYQTSFLGWMKKHNKAYH-HHEFNDKYQTFKD 59
Query: 56 NYAFVTQHN-NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL 114
N F+ HN N S L LN FADLT++E+K ++LG S I+ + R N + N
Sbjct: 60 NMDFI--HNWNSKESDTVLGLNRFADLTNEEYKKTYLGMS---INVNLRANQVPMNGLNF 114
Query: 115 RDV--PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC 172
P+SIDWR+ GAV VKDQ CG+CWAF+ TGA+EG ++I TG++V+ SEQ L+DC
Sbjct: 115 ERFTGPSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDC 174
Query: 173 DRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQC--NKQKLNRHIVTIDGYK 229
Y N+GC GGLM A++++I N GI TE+ YPY +C N L I GYK
Sbjct: 175 SGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRCVYNTTMLG---TAISGYK 231
Query: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF-TGPCST-SLDHAVLIVGYDSE 287
DVP +E L A+ QPV+V I S FQLY SG++ CS+ L+H VL VGY +
Sbjct: 232 DVPRGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGTL 291
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
G DY+I+KNSW +WG GY+ M RN N CGI +ASY +
Sbjct: 292 EGKDYYIVKNSWAETWGNQGYILMARNANNH---CGIATMASYAS 333
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 134/289 (46%), Positives = 177/289 (61%), Gaps = 12/289 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQ 84
+ ++F + KQ+ KAYS E R F+ + + HN + N+S+T+ LN FADL+ +
Sbjct: 38 LQDMFTAFMKQYSKAYS-HAEFSSRFNQFKASVETIRLHNTLANASYTMGLNEFADLSFE 96
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
EFK + G + R N + + P SIDWR AVT +KDQ CG+CWAF
Sbjct: 97 EFKGKYFGCKHVEREFARSNNLHQE----VEAAPTSIDWRTSNAVTPIKDQGQCGSCWAF 152
Query: 145 SATGAIEGINKIVTG--SLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
SATG+IEG ++ G +L SLSEQ+L+DC SY N+GC GGLMDYA++++I N GI E
Sbjct: 153 SATGSIEGA-WVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAE 211
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQ 260
YPY+G G C QK +VTI G+KDV +E L AV PVSV I + FQ
Sbjct: 212 SAYPYKGVGGLC--QKSCTKVVTISGHKDVASGDEASSLNAVGTVGPVSVAIEADQAGFQ 269
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYM 309
YSSG+F+G C +LDH VL VGY + DYWI+KNSWG SWG +GY+
Sbjct: 270 FYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYI 318
>gi|255586666|ref|XP_002533962.1| cysteine protease, putative [Ricinus communis]
gi|223526059|gb|EEF28418.1| cysteine protease, putative [Ricinus communis]
Length = 417
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 151/389 (38%), Positives = 200/389 (51%), Gaps = 71/389 (18%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN----NMGNSSFTLSLNAFAD 80
+ ELF+ W ++H K Y +E ++RL+ F N +V + N N+G S+ T+ LN FAD
Sbjct: 45 VKELFQQWKEKHRKVYKHVEEAEKRLENFRRNLKYVVEKNQKKKNLG-SAHTVGLNKFAD 103
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASC 138
+++ EF+ +L I R N NL+ P+S+DWRKKG VT VKDQ C
Sbjct: 104 MSNVEFRQKYLSKVKKPIKK-RNNNLMTSRQRNLQSCVAPSSLDWRKKGVVTPVKDQGDC 162
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFS+TGAIEGIN IVTG LVSLSEQEL+DCD + N GC GG MDYA+++VI N GI
Sbjct: 163 GSCWAFSSTGAIEGINAIVTGDLVSLSEQELMDCDTT-NYGCDGGYMDYAFEWVINNGGI 221
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
DTE DYPY G G CN K +V++DGY+D VA+ S +C + +
Sbjct: 222 DTEIDYPYTGVDGTCNIAKEETKVVSVDGYED-------------VAESDSALLCATVQQ 268
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
P S +D + +D+ + +G
Sbjct: 269 -----------PISVGIDGS----------AIDFQLY------------------TSGIY 289
Query: 319 LGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCG 378
G C N N P P P+ C +YC ETCCC CL + CC
Sbjct: 290 NGSCSDN----------PNDIXXPSPSPSECGDFSYCPTDETCCCLYEFFDFCLVYGCCP 339
Query: 379 FSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
+ +AVCC+ YCCPS+YPICD CL
Sbjct: 340 YENAVCCTGTEYCCPSDYPICDIKEGLCL 368
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 201/329 (61%), Gaps = 24/329 (7%)
Query: 19 LNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSL 75
+++ + E + T+ +H K Y + E++ RLKIF +N + +HN G SF L++
Sbjct: 19 ISFADVVMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAV 78
Query: 76 NAFADLTHQEFKASFLGFSAA------SIDHDRRRNASVQSPGNLRDVPASIDWRKKGAV 129
N +ADL H EF+ GF+ S D D + + SP ++ +P S+DWR KGAV
Sbjct: 79 NKYADLLHHEFRQLMNGFNYTLHKQLRSTD-DSFKGVTFISPAHVT-LPKSVDWRTKGAV 136
Query: 130 TEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYA 188
T VKDQ CG+CWAFS+TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A
Sbjct: 137 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 196
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VA 245
++++ N GIDTEK YPY C+ N+ + T G+ D+P+ +EK++ +AV
Sbjct: 197 FRYIKDNGGIDTEKSYPYEAIDDSCH---FNKGAIGATDRGFTDIPQGDEKKMAEAVATV 253
Query: 246 QPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRS 302
PV+V I S +FQ YS G++ P + +LDH VL+VGY + E+G DYW++KNSWG +
Sbjct: 254 GPVAVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTT 313
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYP 331
WG G++ M RN N CGI +SYP
Sbjct: 314 WGDKGFIKMLRNKDNQ---CGIASASSYP 339
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/337 (40%), Positives = 188/337 (55%), Gaps = 41/337 (12%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L F L++ L+ S + E W Q+ + Y EK +R K
Sbjct: 12 LGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFK------------ 59
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKA--SFLGFSAASID---HDRRRNASVQSPGNLRDVP 118
FADLT+ EF++ + GF ++++ R N S + +P
Sbjct: 60 --------------FADLTNHEFRSVKTNKGFKSSNMKILTGFRYENVSADA------LP 99
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYN 177
+IDWR KG VT +KDQ CG C AFSA A EGI KI TG LVSL++QEL+DCD +
Sbjct: 100 TTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHGED 159
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A++F+IKN G+ TE YPY G+CN + TI GY+DVP N+E
Sbjct: 160 QGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSG--SNSAATIKGYEDVPANDEA 217
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIK 296
L++A+ QPVSV + G + F+ YS G+ TG C T LDH + +GY + +G YW++K
Sbjct: 218 ALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMK 277
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG +WG NGY+ M+++ + G+CG+ M SYPTK
Sbjct: 278 NSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTK 314
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/328 (43%), Positives = 191/328 (58%), Gaps = 19/328 (5%)
Query: 20 NYCSDINELFET---WCK---QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSS 70
N S+ E+ + W K +H K Y +E+ R IF NY F+ HN + G S
Sbjct: 26 NLYSNFQEVLDAEVAWHKFKLEHNKVYVGIEEESLRKTIFATNYKFIKDHNALHATGEKS 85
Query: 71 FTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVT 130
FT+ +N FAD+T EF G D R ++ SP +P +DWR KG V+
Sbjct: 86 FTVGVNEFADMTVHEFAQMMNGLKP---DSTRVSGSTYLSPNIDAPLPVEVDWRTKGLVS 142
Query: 131 EVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAY 189
EVK+Q SCG+CWAFS TG++EG + TG++V LSEQ L+DC SY N GC GGLM A+
Sbjct: 143 EVKNQGSCGSCWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDCSTSYGNDGCNGGLMTNAF 202
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPV 248
+++ N GIDTE+ YPY G+ G C K K N+ T+ G+ ++P NEK+L +A+ PV
Sbjct: 203 KYIKDNKGIDTEEAYPYAGRDGDC-KFKKNKVGATVTGFVEIPAGNEKKLQEALATVGPV 261
Query: 249 SVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMN 306
SV I + ++F LY SG++ P S LDH VL VGY S +G DY+I+KNSWG +WG
Sbjct: 262 SVAIDANHQSFMLYKSGVYDEPECDSAQLDHGVLAVGYGSIHGKDYYIVKNSWGTTWGEQ 321
Query: 307 GYMHMQRNTGNSL--GICGINMLASYPT 332
GY+ GICGI + ASYP
Sbjct: 322 GYIRFSTTAVPDAIGGICGILLDASYPV 349
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 134/336 (39%), Positives = 207/336 (61%), Gaps = 18/336 (5%)
Query: 7 FLLSILLLSSLPLNYCS----DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
FL++I L++ + D++ + W H K+Y+++ + +R ++E+N +
Sbjct: 6 FLVAIGLVACATAAFVKPTNPDLDSRWLEWKIAHTKSYTNDMHELERRLVWEENVKMINM 65
Query: 63 HN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
HN ++ F L +N + D+ E +++ G+ ++++ + + ++ +P N++ VP
Sbjct: 66 HNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYKSSNVT--KVQGSTFLTPSNIQ-VPD 122
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
++DWR KG VT VK+Q CG+CWAFS TG++EG T LVSLSEQ L+DC R+ N
Sbjct: 123 TVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTEGNM 182
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD +Q+VI NHGID+E YPY + C+ K + + G+ DV +E+
Sbjct: 183 GCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETCH-YKASCDSAEVTGFTDVTSGDEQA 241
Query: 239 LLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWII 295
L++AV + PVSV I S ++FQLY SG++ P CS+S LDH VL+VGY ++ G DYW++
Sbjct: 242 LMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDHGVLVVGYGTDGGKDYWLV 301
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSWG +WG++GY+ M RN N CGI ASYP
Sbjct: 302 KNSWGETWGLSGYIKMSRNKSNQ---CGIATSASYP 334
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 204/342 (59%), Gaps = 22/342 (6%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
+L ++ + +++ + E + T+ +H K Y SE E++ R+KI+ +N V +HN
Sbjct: 4 LLVLCAVVAAGTAVSFFDLVREEWNTFKLEHKKQYDSETEEKFRMKIYAENKHKVAKHNQ 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR--------RNASVQSPGNL 114
G S+ L N ++D+ H EF + GF+ ++ H++ R A+ SP N+
Sbjct: 64 RYQKGLVSYRLKTNKYSDMLHHEFVNTMNGFNK-TVKHNKGLYAKGNDIRGATFVSPANV 122
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
P ++DWR+ GAVT VKDQ CG+CW+FS TGA+EG + +G LVSLSEQ LIDC
Sbjct: 123 A-APPTVDWRQHGAVTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDCSS 181
Query: 175 SY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
+Y N+GC GGLMD A++++ N GIDTEK YPY +C N + G+ D+P
Sbjct: 182 AYGNNGCNGGLMDNAFKYIKDNDGIDTEKTYPYEAVDDKCRYNPKNSGAEDV-GFVDIPA 240
Query: 234 NNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENG 289
+E +L+ A+ PVSV I S+ +FQLYS G++ S +LDH VL+VGY + E+G
Sbjct: 241 GDEHKLMLALATVGPVSVAIDASQESFQLYSDGVYYDENCSSENLDHGVLVVGYGTDEDG 300
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG SWG GY+ M RN N CGI ASYP
Sbjct: 301 GDYWLVKNSWGPSWGDEGYIKMARNRDNH---CGIASSASYP 339
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 139/305 (45%), Positives = 183/305 (60%), Gaps = 16/305 (5%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQEFKASFL 91
+HG+ Y+S QE++ RL +FE N F+ HN G +FTL +N F D+T +EF A+
Sbjct: 30 EHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEEFTATMN 89
Query: 92 GFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIE 151
GF ++ RR ++ +P +DWR KGAVT VKDQ CG+CWAFS TG++E
Sbjct: 90 GF----LNVPSRRPTAILRADPDETLPKEVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLE 145
Query: 152 GINKIVTGSLVSLSEQELIDC-DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQA 210
G + + G LVSLSEQ L+DC D+ N GC GGLMD A++++ N GIDTE YPY Q
Sbjct: 146 GQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQD 205
Query: 211 GQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIF-- 267
G+C N T GY DV +E L +AV P+SV I S+ +FQ Y G++
Sbjct: 206 GKCRFDASNVG-ATDTGYVDVEHGSESALKKAVATIGPISVAIDASQPSFQFYHDGVYYE 264
Query: 268 TGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
G ST LDH VL VGY ++E G YW++KNSW SWG GY+ M R+ N+ CGI
Sbjct: 265 EGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMSRDKKNN---CGIAS 321
Query: 327 LASYP 331
ASYP
Sbjct: 322 QASYP 326
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 137/330 (41%), Positives = 185/330 (56%), Gaps = 17/330 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINEL-----FETWCKQHGKAYSSEQEKQQRLKIFED 55
+ S L + +L + C D+ ++ F W H ++Y S +E QR ++
Sbjct: 18 LASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRR 77
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHD----RRRNASVQSP 111
N F+ N G+ ++ L+ N FADLT +EF A++ G+ A D V +
Sbjct: 78 NAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGAGDVDAS 137
Query: 112 GNLR-DVPASIDWRKKGAVTEVKDQAS-CGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
+ R DVPAS+DWR +GAV K Q S C +CWAF IE +N I TG LVSLSEQ+L
Sbjct: 138 FSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQL 197
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
+DCD SY+ GC G AY++V++N G+ TE DYPY + G CN+ K H I G+
Sbjct: 198 VDCD-SYDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFG 256
Query: 230 DVPENNEKQLLQAVVAQPVSVGI-CGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DS 286
VP NE L AV QPV+V I GS Q Y G++TGPC T L HAV +VGY D+
Sbjct: 257 KVPPRNEAALQAAVARQPVAVAIEVGS--GMQFYKGGVYTGPCGTRLAHAVTVVGYGTDA 314
Query: 287 ENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
+G YW IKNSWG+SWG GY+ + R+ G
Sbjct: 315 SSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 181/304 (59%), Gaps = 16/304 (5%)
Query: 36 HGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQEFKASFLGF 93
H K+Y QE+ R IFEDN + + N + S FTL +N FAD+T+ EF LG
Sbjct: 35 HLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNMLLGL 94
Query: 94 SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGI 153
++ SV +++D+PA +DW +KG VTEVK+Q CG+CWAFS TG++EG
Sbjct: 95 GG----RNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEGQ 150
Query: 154 NKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQ 212
TG LVSLSEQ L+DC S N GC GGLMD A+ ++ KN GIDTE YPY G G
Sbjct: 151 VFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDGT 210
Query: 213 CNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP- 270
C + N+ T+ G+ DV +E L +AV P+SV I S FQ Y G++ P
Sbjct: 211 CRFLE-NKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYN-PW 268
Query: 271 --CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLA 328
ST LDH VL+VGY +E G DYW++KNSWG SWG+ GY+ M RN N CGI A
Sbjct: 269 FCSSTELDHGVLVVGYGTEGGKDYWLVKNSWGSSWGLKGYIKMVRNKKNR---CGIATQA 325
Query: 329 SYPT 332
SYPT
Sbjct: 326 SYPT 329
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 197/322 (61%), Gaps = 22/322 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
+ E + T+ +H K Y + E++ RLKIF +N + +HN G SF L++N +ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDVPASIDWRKKGAVTEVKDQA 136
H EF+ GF+ R + S + SP ++ +P S+DWR KGAVT VKDQ
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVT-LPKSVDWRSKGAVTAVKDQG 143
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKN 195
CG+CWAFS+TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A++++ N
Sbjct: 144 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPVSVGI 252
GIDTEK YPY C+ N+ + T G+ D+P+ +EK++ +AV PVSV I
Sbjct: 204 GGIDTEKSYPYEAIDDSCH---FNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 260
Query: 253 CGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYM 309
S +FQ YS G++ P + +LDH VL+VG+ + E+G DYW++KNSWG +WG G++
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFI 320
Query: 310 HMQRNTGNSLGICGINMLASYP 331
M RN N CGI +SYP
Sbjct: 321 KMLRNKDNQ---CGIASASSYP 339
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 189/321 (58%), Gaps = 18/321 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM----GNSSFTLSLNAFAD 80
+ E +E W + G+ Y EK +R ++F+ N F+ HN G S L+ N FAD
Sbjct: 16 MRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTNKFAD 75
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPG--NLRDVPASIDWRKKGAVTEVKDQASC 138
LT EF+ ++ + +V G +L DVP SIDWR +GAVT VKDQ C
Sbjct: 76 LTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVTSVKDQHLC 135
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
CWAFS+ A+EGI++I TG+ VSLS Q+L+DC + N C G +D AY+++ ++ G+
Sbjct: 136 ACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYEYIARSGGL 195
Query: 199 DTEKDYPYRGQAGQCN---KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
++DYPY G +G C KQ + R I G++ VP NE LL AV QPVSV + G
Sbjct: 196 VADQDYPYEGHSGTCRVYGKQAVAR----ISGFQYVPARNETALLLAVAHQPVSVALDGL 251
Query: 256 ERAFQLYSSGIFTG---PCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHM 311
RA Q +GIF PC+T+L+HA+ IVGY + E+G YW++KNSWG WG GY+
Sbjct: 252 SRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYVKF 311
Query: 312 QRNTGNSL-GICGINMLASYP 331
R+ + + G+CG+ + ASYP
Sbjct: 312 ARDVASEINGVCGLALEASYP 332
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 148/341 (43%), Positives = 203/341 (59%), Gaps = 22/341 (6%)
Query: 7 FLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM 66
FLL + S+ +++ + E + + QH K Y SE E + R+KI+ +N + +HN +
Sbjct: 6 FLLCAVAASASAVSFFDLVKEEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQL 65
Query: 67 ---GNSSFTLSLNAFADLTHQEFKASFLGFSAASI--------DHDRRRNASVQSPGNLR 115
G S+ L N + D+ H EF + G++ + HD R A+ P +++
Sbjct: 66 YEQGLVSYKLGPNKYTDMLHHEFIQAMNGYNRTAKHNKGLYGKKHDVR-GATFIPPAHVK 124
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
P +DW KKGAVTEVKDQ CG+CWAFS TGA+EG + +G LVSLSEQ LIDC +
Sbjct: 125 -YPDHVDWTKKGAVTEVKDQGKCGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSST 183
Query: 176 Y-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
Y N+GC GGLMD A++++ N GIDTEK YPY G +C N + G+ D+P
Sbjct: 184 YGNNGCNGGLMDNAFKYIKDNGGIDTEKTYPYEGVDDKCRYNPKNSGAEDV-GFVDIPSG 242
Query: 235 NEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIF--TGPCSTSLDHAVLIVGYDS-ENGV 290
+E++L+QAV PVSV I S+ +FQ YS G++ T ST LDH VL+VGY + E G
Sbjct: 243 DEEKLMQAVATVGPVSVAIDASQNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGTDEAGG 302
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSW R+WG GY+ M RN N CGI ASYP
Sbjct: 303 DYWLVKNSWSRTWGELGYIKMARNRDNH---CGIATDASYP 340
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 138/324 (42%), Positives = 186/324 (57%), Gaps = 23/324 (7%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E WC + A EK +R +F++N + +HN+ GN+++TL LN F+D+T +EF
Sbjct: 47 LYERWCAHYNMA-RDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSDMTDEEFN 105
Query: 88 ASFLG--FSAASIDHDRRR---------------NASVQSPGNLRDVPASIDWRKKGAVT 130
S G +A + D N + S G P ++DWR + AVT
Sbjct: 106 RSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWRGR-AVT 164
Query: 131 EVKDQA-SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 189
VKDQ +CG+CWAFSA A+EGIN I T +LV LSEQ+L+DCD+ N GC GGLM A+
Sbjct: 165 RVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDK-LNHGCNGGLMTTAF 223
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVS 249
FV++N G+ E YPY G+ G+C + + VTI GY+ VP + L+ AV AQPVS
Sbjct: 224 SFVVRNRGVVPEGAYPYMGREGRC--KHVMAPPVTIYGYQRVPRFDANALMNAVAAQPVS 281
Query: 250 VGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYM 309
V I S F+ Y G+F G C L HA VGY ++ G +WI+KNSWG WG GY+
Sbjct: 282 VAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYGADAGGPFWIVKNSWGPGWGEGGYV 341
Query: 310 HMQRNTGNSLGICGINMLASYPTK 333
+ RNT G+CGI SYP K
Sbjct: 342 RISRNTPVRQGVCGILTENSYPVK 365
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 205/340 (60%), Gaps = 24/340 (7%)
Query: 7 FLLSILLLSSLP--LNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
LLS+L+++S +++ + +E+W HGK YSS E++ RLKI+ +N +++HN
Sbjct: 6 LLLSVLVIASTANAVSFFDVVLSDWESWKLMHGKTYSSSIEEKLRLKIYMENSLKISRHN 65
Query: 65 NM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS---PGNLRDVP 118
+ G + + +N + DL H EF A G+ A+ + AS+ P +P
Sbjct: 66 SEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQYAN------KTASLGGTYIPNKNIQLP 119
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
+DWR++GAVT VK+Q CG+CW+FSATGA+EG + TG L+SLSEQ L+DC R + N
Sbjct: 120 THVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKFGN 179
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
+GC GGLMD+A+ ++ N GIDTE YPY G G C+ N+ I G+ D+ + +EK
Sbjct: 180 NGCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPKNKGGSDI-GFVDIKKGSEK 238
Query: 238 QLLQAVVA-QPVSVGICGSERAFQLYSSGIFT-GPCST-SLDHAVLIVGY--DSENGVDY 292
L +AV P+SV I S +FQ YS G++ CS+ LDH VL+VG+ DS +G DY
Sbjct: 239 DLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGTDSVSGEDY 298
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
W++KNSW WG GY+ M RN N +CGI ASYP
Sbjct: 299 WLVKNSWSEKWGDQGYIKMARNKEN---MCGIASSASYPV 335
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 139/337 (41%), Positives = 203/337 (60%), Gaps = 17/337 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
F +++ L+ S ++ + + KQ+ K Y +E+E ++RL ++E N F+T H
Sbjct: 2 FRFAIVAALVAVSFARVPRVGLDNEWNIFKKQYNKLYQNEEEARRRL-VWESNLDFITLH 60
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASV-QSPGNLRDVPA 119
N + G +F + +N + D+T++EF + G+ ++ NA V P N+ D+P
Sbjct: 61 NLAADRGEHTFWVGMNEYGDMTNEEFTKTMNGYRM----RNKTSNAPVFMPPNNMGDLPD 116
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
++DWR KG VT +K+Q CG+CW+FSATG++EG TG LVSLSEQ L+DC + N
Sbjct: 117 TVDWRPKGYVTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKKQGNH 176
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A+ ++ N+GIDTE YPY+ + G+C + + T G+ D+ +E+
Sbjct: 177 GCEGGLMDDAFTYIKANNGIDTEASYPYKARDGKCEFKSADVG-ATDTGFVDIKTKDEEA 235
Query: 239 LLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGYDSENGVDYWII 295
L QAV P+SV I S +FQLY +G++ CS T LDH VL VGY +E+ DYW++
Sbjct: 236 LKQAVATVGPISVAIDASHMSFQLYRTGVYHDWFCSQTKLDHGVLAVGYGTEDSKDYWLV 295
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG SWG GY+ M RN N+ CGI ASYPT
Sbjct: 296 KNSWGESWGQKGYIQMSRNRRNN---CGIATSASYPT 329
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 197/322 (61%), Gaps = 22/322 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
+ E + T+ +H K Y + E++ RLKIF +N + +HN G SF L++N +ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDVPASIDWRKKGAVTEVKDQA 136
H EF+ GF+ R + S + SP ++ +P S+DWR KGAVT VKDQ
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVT-LPKSVDWRTKGAVTAVKDQG 143
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKN 195
CG+CWAFS+TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A++++ N
Sbjct: 144 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPVSVGI 252
GIDTEK YPY C+ N+ + T G+ D+P+ +EK++ +AV PVSV I
Sbjct: 204 GGIDTEKSYPYEAIDDSCH---FNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 260
Query: 253 CGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYM 309
S +FQ YS G++ P + +LDH VL+VG+ + E+G DYW++KNSWG +WG G++
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFI 320
Query: 310 HMQRNTGNSLGICGINMLASYP 331
M RN N CGI +SYP
Sbjct: 321 KMLRNKENQ---CGIASASSYP 339
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/312 (46%), Positives = 188/312 (60%), Gaps = 19/312 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAFADLTHQEFK 87
+E W +H K YS + E+ R KI++ N + HN N FTL +N F DL EF
Sbjct: 22 WEDWKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFA 81
Query: 88 ASFLGFSAASIDHDRRRNAS---VQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
F G+ + R N++ V P N + P ++DWR KGAVT VK+Q CG+CWAF
Sbjct: 82 EMFNGYMMQA-----RSNSTKVFVADP-NYKADP-TVDWRTKGAVTGVKNQGQCGSCWAF 134
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
S TG++EG + + TG LVSLSEQ L+DC + N GC GGLMD A++++ KN GIDTE
Sbjct: 135 STTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTEAS 194
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLY 262
YPY+ +C + K + T GY D+ +E L+QAV PVSV I S +FQLY
Sbjct: 195 YPYQAHDERC-RFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQLY 253
Query: 263 SSGI-FTGPCS-TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
SG+ + CS T+LDH VL +GY +E G DYW++KNSWG WGM GY+ M RN N+
Sbjct: 254 RSGVYYERECSQTALDHGVLAIGYGTEGGSDYWLVKNSWGTDWGMEGYIMMSRNRNNN-- 311
Query: 321 ICGINMLASYPT 332
CGI ASYPT
Sbjct: 312 -CGIATEASYPT 322
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/339 (42%), Positives = 194/339 (57%), Gaps = 17/339 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L + +I +SS+ LN I E + + Q K Y +E+ R K++ DN + +H
Sbjct: 7 LGLVVFAISSVSSINLNEV--IEEEWSLFKAQFKKIYEDVKEEAFRKKVYLDNKLKIARH 64
Query: 64 NNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR---RNASVQSPGNLRDV 117
N + G ++ L +N F DL E+K GF + D+ +A V
Sbjct: 65 NKLYETGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKSENVVV 124
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P +IDWRKKG VT VK+Q CG+CW+FSATG++EG + TG LVSLSEQ LIDC R Y
Sbjct: 125 PKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYG 184
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N+GC GGLMD A++++ N G+DTEK YPY + +C N T G+ D+PE +E
Sbjct: 185 NNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSG-ATDKGFVDIPEGDE 243
Query: 237 KQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSEN-GVDY 292
L+ A+ PVS+ I S FQ Y G+F P ST LDH VL VGY +++ G DY
Sbjct: 244 DALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDY 303
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WI+KNSWG++WG GY+ M RN N+ CG+ ASYP
Sbjct: 304 WIVKNSWGKTWGDQGYIMMARNKKNN---CGVASSASYP 339
>gi|46576360|sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray
Crystal Structure Of A Plant Cysteine Protease Ervatamin
B: Insight Into The Structural Basis Of Its Stability
And Substrate Specificity
Length = 215
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 116/217 (53%), Positives = 158/217 (72%), Gaps = 3/217 (1%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P+ +DWR KGAV +K+Q CG+CWAFSA A+E INKI TG L+SLSEQEL+DCD +
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GG M+ A+Q++I N GIDT+++YPY G C +L +V+I+G++ V NNE
Sbjct: 60 SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYRL--RVVSINGFQRVTRNNE 117
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
L AV +QPVSV + + FQ YSSGIFTGPC T+ +H V+IVGY +++G +YWI++
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVR 177
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG++WG GY+ M+RN +S G+CGI L SYPTK
Sbjct: 178 NSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPTK 214
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 151/312 (48%), Positives = 197/312 (63%), Gaps = 14/312 (4%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
+ N FE Q+ K SE EK++R IF++N ++ NN GN S+ L LN ++DLT
Sbjct: 61 ETNSAFEFKATQNDKI--SELEKRKR--IFKNNLEYIENFNNAGNKSYKLGLNQYSDLTS 116
Query: 84 QEFKASFLGFSAAS-IDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCGAC 141
EF AS G + + + R+A+V P NL D VP + DWR++GAVT+VKDQ SCG C
Sbjct: 117 DEFLASHTGLKVSKQLSSSKMRSAAV--PFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCC 174
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EG KI TG L+SLSEQ+L+DCD NSGC GG MD A++++I+ GI +E
Sbjct: 175 WAFSVVAAVEGAVKINTGELISLSEQQLVDCDER-NSGCHGGNMDSAFKYIIQK-GIVSE 232
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGI-CGSERAFQ 260
DYPY+ + C + I + DVP N+E+QLLQAV QPVSVGI G E FQ
Sbjct: 233 ADYPYQEGSQTCQLNDQMKFEAQITNFIDVPANDEQQLLQAVAQQPVSVGIEVGDE--FQ 290
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Y +++G C S++HAV VGY SE+G YW+IKNSWG+ WG GYM + R +G
Sbjct: 291 HYMGDVYSGTCGQSMNHAVTAVGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPG 350
Query: 320 GICGINMLASYP 331
G CGI ASYP
Sbjct: 351 GQCGIAAHASYP 362
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 139/322 (43%), Positives = 197/322 (61%), Gaps = 22/322 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
+ E + T+ +H K Y + E++ RLKIF +N + +HN G SF L++N +ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDVPASIDWRKKGAVTEVKDQA 136
H EF+ GF+ R + S + SP ++ +P S+DWR KGAVT VKDQ
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVT-LPKSVDWRTKGAVTAVKDQG 143
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKN 195
CG+CWAFS+TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A++++ N
Sbjct: 144 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPVSVGI 252
GIDTEK YPY C+ N+ + T G+ D+P+ +EK++ +AV PV+V I
Sbjct: 204 GGIDTEKSYPYEAIDDSCH---FNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAI 260
Query: 253 CGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYM 309
S +FQ YS G++ P + +LDH VL+VG+ + E+G DYW++KNSWG +WG G++
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 320
Query: 310 HMQRNTGNSLGICGINMLASYP 331
M RN N CGI +SYP
Sbjct: 321 KMLRNKENQ---CGIASASSYP 339
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 183/318 (57%), Gaps = 8/318 (2%)
Query: 18 PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLN 76
PL Y + F W K H ++S E +RL+ + N ++ +HN + L N
Sbjct: 22 PLEYEHE----FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHN 77
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
F+ ++ +EFK G+ ++R + V + + VP S+DW+ KG VT VK+Q
Sbjct: 78 EFSSMSFEEFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQG 137
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
CG+CWAFS TGA+EG + +G LVSLSEQEL+DCD + + GC GGLMD+A+ ++ N
Sbjct: 138 MCGSCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNG 197
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
GI +E DY Y+ +A C + +V I G++DV +E L AV QPVSV I +
Sbjct: 198 GICSEDDYEYKAKAQVC---RDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQ 254
Query: 257 RAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
+AFQ Y SG+F C T LDH VL VGY SENG +W +KNSWG SWG GY+ + R
Sbjct: 255 KAFQFYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLAREEN 314
Query: 317 NSLGICGINMLASYPTKT 334
G CGI + SYP T
Sbjct: 315 GPAGQCGIASVPSYPFAT 332
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 142/327 (43%), Positives = 194/327 (59%), Gaps = 18/327 (5%)
Query: 12 LLLSSLPLNYCSDINELFETWCK---QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
LLL + L Y + ++W + H KAYS + E+ R I++DN + +HN G
Sbjct: 7 LLLLGVTLAYIIERPTEDDSWIRWKMAHNKAYSHDGEETVRYTIWKDNERRIREHNLQG- 65
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
F L +N F D+T+ EFK F G+ + H ++ +P + P S+DWR +G
Sbjct: 66 GDFLLEMNQFGDMTNNEFK-DFNGY----LSHKHVSGSTFLTPNSFV-APDSVDWRNEGY 119
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 187
VT VKDQ CG+CWAFS TG++EG N TG LVSLSEQ L+DC +Y N+GC GGLMD
Sbjct: 120 VTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDN 179
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-Q 246
A+ ++ +N+GID+E YPY + G+C K N T G+ D+P +E +L +AV +
Sbjct: 180 AFTYIKENNGIDSEASYPYTAKDGKCAFTKPNV-AATDTGFVDIPSGDENKLKEAVASVG 238
Query: 247 PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
P+SV I S +FQ Y G++ ST LDH VL+VGY +E+G DYW++KNSW SWG
Sbjct: 239 PISVAIDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWG 298
Query: 305 MNGYMHMQRNTGNSLGICGINMLASYP 331
GY+ M RN N CGI ASYP
Sbjct: 299 DKGYIKMSRNAKNQ---CGIATNASYP 322
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 130/305 (42%), Positives = 178/305 (58%), Gaps = 5/305 (1%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
E W QHGK Y EK++ L+IFE+N F+ + G+ SF LS N FADL +EFKA
Sbjct: 33 EKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKA- 91
Query: 90 FLGFSAASIDHDR-RRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA-T 147
L + +H ++ N+ +PAS+DWRK+G VT +KDQ C +CWAFS
Sbjct: 92 -LLTNGHKKEHSLWTTTETLFRYDNVTKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLCV 150
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
IEG+++I+T LV LSEQEL+D + + GC G ++ A++F+ K I++E YPY+
Sbjct: 151 ATIEGLHQIITSELVPLSEQELVDFVKGESEGCYGDYVEDAFKFITKKGRIESETHYPYK 210
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF 267
G C +K + I GYK VP +E LL+AV Q VSV + + AFQ YSSGIF
Sbjct: 211 GVNNTCKVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYSSGIF 270
Query: 268 TGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
TG C T DH V + Y +S +G YW+ KNSWG WG GY+ ++ + G+CGI
Sbjct: 271 TGKCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLCGIAK 330
Query: 327 LASYP 331
YP
Sbjct: 331 YPYYP 335
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 183/318 (57%), Gaps = 8/318 (2%)
Query: 18 PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLN 76
PL Y + F W K H ++S E +RL+ + N ++ +HN + L N
Sbjct: 22 PLEYEHE----FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHN 77
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
F+ ++ +EFK G+ ++R + V + + VP S+DW+ KG VT VK+Q
Sbjct: 78 EFSSMSFEEFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQG 137
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
CG+CWAFS TGA+EG + +G LVSLSEQEL+DCD + + GC GGLMD+A+ ++ N
Sbjct: 138 MCGSCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNG 197
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
GI +E DY Y+ +A C + +V I G++DV +E L AV QPVSV I +
Sbjct: 198 GICSEDDYEYKAKAQVC---RDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQ 254
Query: 257 RAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
+AFQ Y SG+F C T LDH VL VGY SENG +W +KNSWG SWG GY+ + R
Sbjct: 255 KAFQFYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLAREEN 314
Query: 317 NSLGICGINMLASYPTKT 334
G CGI + SYP T
Sbjct: 315 GPAGQCGIASVPSYPFAT 332
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 137/327 (41%), Positives = 190/327 (58%), Gaps = 16/327 (4%)
Query: 12 LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GN 68
+ L L L S F + Q+G+ Y++ QE++ R +++ N F+ HN G
Sbjct: 5 VFLCGLALAAASPTFTSFHQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGE 64
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
++ L++N F D+T++E A G AS R +V G +PA +DWR KGA
Sbjct: 65 VTYMLAINQFGDMTNEEINAVMNGLLPAS----ESRGVAVLG-GRDDTLPAEVDWRTKGA 119
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDY 187
VT VKDQ +CG+CWAFSATG++EG + + G LVSLSEQ L+DC + + GCGGGLMD+
Sbjct: 120 VTPVKDQKACGSCWAFSATGSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDF 179
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-Q 246
A+ ++ N GIDTE YPY G+C N T+ GY DV ++E L +AV
Sbjct: 180 AFTYIKDNGGIDTEASYPYEATDGKCQYNPANSG-ATVTGYVDVEHDSEDALQKAVATIG 238
Query: 247 PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
P+SV I S F Y G++ STSLDH VL VGY +++G DYW++KNSW +WG
Sbjct: 239 PISVAIDASRSTFHFYHKGVYYDKECSSTSLDHGVLAVGYGTQDGTDYWLVKNSWNITWG 298
Query: 305 MNGYMHMQRNTGNSLGICGINMLASYP 331
+G++ M RN N+ CGI ASYP
Sbjct: 299 NHGFIEMSRNRNNN---CGIATQASYP 322
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 131/336 (38%), Positives = 192/336 (57%), Gaps = 14/336 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
L F L + ++ + P D + + FE W ++G+ Y EK R +IF++N
Sbjct: 7 LVFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVN 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDV 117
+ NN +S+TL +N F D+T+ EF A + G S + + +R V ++ V
Sbjct: 67 HIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLS---LPLNIKREPVVSFDDVDISSV 123
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P SIDWR GAVT VK+Q CG+CWAF++ +E I KI G+LVSLSEQ+++DC SY
Sbjct: 124 PQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAVSY- 182
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GG ++ AY F+I N G+ + YPY+ G C + I Y V NNE+
Sbjct: 183 -GCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNS-AYITRYTYVQRNNER 240
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIK 296
++ AV QP++ + S FQ Y G+FTGPC T L+HA++I+GY + +G +WI++
Sbjct: 241 NMMYAVSNQPIAAALDASGN-FQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVR 299
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
NSWG WG GY+ + R+ +S G+CGI M YPT
Sbjct: 300 NSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYPT 335
>gi|253796148|gb|ACT35690.1| cathepsin L-like cysteine proteinase [Ditylenchus destructor]
Length = 376
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 196/314 (62%), Gaps = 20/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + +GK++ E + +R+ F + + +HN G SF L N+ ADL E
Sbjct: 70 WEAYKGLNGKSFYDEDTENERMLAFLSSQQHIKKHNEQYEQGKVSFKLDANSIADLPFSE 129
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
++ G+ D RR ++ +P N+ +VP S+DWR G VTEVK+Q CG+CWAFS
Sbjct: 130 YQ-KLNGYRRIYGDPLRRNSSRFLAPHNV-EVPESMDWRDHGYVTEVKNQGMCGSCWAFS 187
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
ATG++EG +K G+LVSLSEQ L+DC +Y N+GC GGLMD+A+Q++ +NHGIDTE Y
Sbjct: 188 ATGSLEGQHKRSKGTLVSLSEQNLVDCSAAYGNNGCNGGLMDFAFQYIKENHGIDTETSY 247
Query: 205 PYRGQAGQCNKQKLNRHIVTID--GYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAFQL 261
PY+ + +C+ Q R V D G+ D+PE +E QL AV Q P+SV I R+FQL
Sbjct: 248 PYKARQKKCHFQ---RSSVGADDTGFMDLPEGDEDQLKIAVATQGPISVAIDAGHRSFQL 304
Query: 262 YSSGI-FTGPCSTS-LDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
Y +G+ + CS+ LDH VL+VGY D ++G DYWI+KNSWG +WG GY+ M RN N
Sbjct: 305 YKTGVYYEKECSSEQLDHGVLVVGYGTDPDHG-DYWIVKNSWGTTWGEQGYVRMARNKNN 363
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 364 H---CGIATKASYP 374
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 198/345 (57%), Gaps = 29/345 (8%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
+L L+ + ++ + E + + +H K Y SE E + R+KI+ +N + +HN
Sbjct: 6 VLLCLVAGACAVSLLDLVREEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIAKHNQRF 65
Query: 68 NS---SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-------- 116
S+ L N +AD+ H EF + GF+ + RN +V S G RD
Sbjct: 66 EQRLVSYKLKPNKYADMLHHEFVHTMNGFNKTA--KHGGRNKAVHSKG--RDGRAATFIA 121
Query: 117 -----VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171
P +DWRKKGAVT+VKDQ CG+CWAFS TGA+EG + TG LVSLSEQ L+D
Sbjct: 122 PAHVSYPDHVDWRKKGAVTDVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVD 181
Query: 172 CDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
C +Y N+GC GGLMD A++++ N GIDTEK YPY +C N + G+ D
Sbjct: 182 CSAAYGNNGCNGGLMDNAFKYIKDNGGIDTEKSYPYEAVDDKCRYNPKNSGADDV-GFVD 240
Query: 231 VPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS- 286
+P+ +E++L+QAV P+SV I S+ FQ YS G++ ST LDH V++VGY +
Sbjct: 241 IPQGDEEKLMQAVATVGPISVAIDASQETFQFYSKGVYYDENCSSTDLDHGVMVVGYGTE 300
Query: 287 ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
E G DYW++KNSWGRSWG GY+ M N N CGI ASYP
Sbjct: 301 EEGGDYWLVKNSWGRSWGELGYIKMAHNKNNH---CGIASSASYP 342
>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
Length = 344
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 189/315 (60%), Gaps = 23/315 (7%)
Query: 35 QHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS---SFTLSLNAFADLTHQEFKASFL 91
+H K Y SE E + R+KI+ +N +T+HN S+ L N +AD+ H EF +
Sbjct: 33 EHSKQYDSEVEDKFRMKIYVENKHRITKHNQRFEQRLVSYKLKPNKYADMLHHEFVHTMN 92
Query: 92 GFSAASIDHDRRRN----------ASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
GF+ + R +N A+ +P ++ P +DWRKKGAVT+VKDQ CG+C
Sbjct: 93 GFNKTAKHGGRNKNVHGKGHDGRAATFIAPAHVS-YPDHVDWRKKGAVTDVKDQGKCGSC 151
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS TGA+EG + TG LVSLSEQ LIDC +Y N+GC GGLMD A++++ N GIDT
Sbjct: 152 WAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDT 211
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
EK YPY +C + G+ D+P+ +E++L+QAV P+SV I S+ F
Sbjct: 212 EKSYPYEAVDDKCRYNPKESGADDV-GFVDIPQGDEEKLMQAVATVGPISVAIDASQETF 270
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
Q YS G++ ST LDH V++VGY + E+G D W++KNSWGRSWG GY+ M RN
Sbjct: 271 QFYSKGVYYDENCSSTDLDHGVMVVGYGTEEDGSDDWLVKNSWGRSWGELGYIKMARNKN 330
Query: 317 NSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 331 NH---CGIASSASYP 342
>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
Length = 334
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 143/315 (45%), Positives = 188/315 (59%), Gaps = 20/315 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
F W + GK+Y S +E+ R + N V HN M G S+ L + FAD++++E
Sbjct: 26 FHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEE 85
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRD---VPASIDWRKKGAVTEVKDQASCGACW 142
++ S+++ + R S + LR VP ++DWR KG VT++KDQ CG+CW
Sbjct: 86 YRQLVFRGCLGSMNNTKARGGS--TFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCW 143
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSATG++EG TG LVSLSEQ+L+DC SY N GC GGLMD A+Q++ N G+DTE
Sbjct: 144 AFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTE 203
Query: 202 KDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERA 258
YPY Q G+C + N V + GY D+ +E L +AV P+SV I +
Sbjct: 204 DSYPYEAQDGEC---RFNPSTVGASCTGYVDIASGDESALQEAVATIGPISVAIDAGHSS 260
Query: 259 FQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
FQLYSSG++ P CS+S LDH VL VGY S NG DYWI+KNSWG WG+ GY+ M RN
Sbjct: 261 FQLYSSGVYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSRNKS 320
Query: 317 NSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 321 NQ---CGIATAASYP 332
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 134/348 (38%), Positives = 195/348 (56%), Gaps = 24/348 (6%)
Query: 7 FLLSILLLSSLPLNYCSDIN-------------ELFETWCKQHGKAYSSEQEKQQRLKIF 53
FL+ L+L + N C +L++ W H + + E R K+F
Sbjct: 6 FLIVPLVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMHNRFKVF 64
Query: 54 EDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASID-HDRRRNASVQSPG 112
++N V + N MG S L LN FAD++ EF+ + D H ++ A+ G
Sbjct: 65 KNNAKHVFKVNLMG-KSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEATGGRIG 123
Query: 113 NL-----RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
++P+SIDWRKKGAV +K+Q CG+CWAF+A A+E I++I T LVSLSE+
Sbjct: 124 GFMYEHANNIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNELVSLSEE 183
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
E++DCD + GC GG + A++F++ N G+ E +YPY G C ++ V IDG
Sbjct: 184 EVLDCDYR-DGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCRRRGGRNKRVRIDG 242
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYD 285
Y++VP NNE L++AV QPV+V I F+ Y G+FT C ++DH V++VGY
Sbjct: 243 YENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVVVVGYG 302
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
++ DYWII+N +G WGMNGYM MQR + G+CG+ M +YP K
Sbjct: 303 TDEDGDYWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPVK 350
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 140/337 (41%), Positives = 193/337 (57%), Gaps = 17/337 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L LL ++ ++ N + +E + H K+Y S E+ R KIF +N + +H
Sbjct: 2 LRLSLLCAIVAVTVAANSHEILRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAKH 61
Query: 64 NNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD--VP 118
N G S+ L +N F DL EF F G+ R ++ P N+ D +P
Sbjct: 62 NAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYRGQRT----SRGSTFMPPANVNDSSLP 117
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
+++DWRKKGAVT VKDQ CG+CWAFSATG++EG + + G LVSLSEQ L+DC +S+ N
Sbjct: 118 STVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQSFGN 177
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
+GC GGLMD A++++ N GID E+ YPY +C +K + T G+ D+ +E
Sbjct: 178 NGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKCRFKKEDVG-ATDTGFVDIEGGSED 236
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWI 294
L +AV P+SV I +FQLYS G++ P S LDH VL VGY ++G YW+
Sbjct: 237 DLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGYGVKDGKKYWL 296
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG SWG NGY+ M R+ N CGI ASYP
Sbjct: 297 VKNSWGGSWGDNGYILMSRDKNNQ---CGIASAASYP 330
>gi|339252572|ref|XP_003371509.1| cathepsin L1 [Trichinella spiralis]
gi|316968239|gb|EFV52542.1| cathepsin L1 [Trichinella spiralis]
Length = 448
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 188/339 (55%), Gaps = 47/339 (13%)
Query: 37 GKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQEFKASFLGF 93
GK Y++E E+ R ++F N V +HN G S+++ LN ++DLTH EF GF
Sbjct: 111 GKTYANESEENYRREVFYANRLKVIRHNEQFDGGAKSYSMKLNKYSDLTHGEFVQLMNGF 170
Query: 94 SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA------- 146
AS D R ++ + D+P ++DWR +G VT VKDQ CG+CWAFSA
Sbjct: 171 KIASKSGDYRPSSVFKPLLFTGDLPLNVDWRSEGMVTPVKDQGHCGSCWAFSAVNSNALH 230
Query: 147 --------TGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 197
TGA+EG NK TG LVSLSEQ LIDC R Y N GC GGLMD A+++V +NHG
Sbjct: 231 VHSRAFQQTGALEGQNKRKTGKLVSLSEQNLIDCSRKYGNKGCSGGLMDNAFEYVKENHG 290
Query: 198 IDTEKDYPYRGQAGQCNKQ---KLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGIC 253
IDTE+ YPY +K+ K + T G+ D+ NE L+ AV P+SV I
Sbjct: 291 IDTEESYPYEAAVRMLDKKCRFKNSTIGATDKGFVDIEPGNETYLMHAVATIGPLSVAID 350
Query: 254 GSERAFQLYSSGI--------------------FTGPCSTS-LDHAVLIVGYDSENGVDY 292
S +FQ YSSG+ F CS+ LDH VL+VGY S G DY
Sbjct: 351 ASHESFQFYSSGMLLMVDIFNTVEVMWTNLGVYFEPMCSSQFLDHGVLVVGYGSLKGKDY 410
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WI+KNSWG SWG +GY+ M RN NS CGI ASYP
Sbjct: 411 WIVKNSWGTSWGNDGYIFMARNKNNS---CGIASFASYP 446
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 134/330 (40%), Positives = 190/330 (57%), Gaps = 22/330 (6%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS---SFTLSLNAFA 79
S + E F+ W + K+Y++ E+++R +++ N A++ N + ++ L A+
Sbjct: 44 SSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAAGLTYELGETAYT 103
Query: 80 DLTHQEFKASFLGFSAASIDHDRR----RNASVQS----PGNL-------RDVPASIDWR 124
DLT+QEF A + + A + D R V + PG L PAS+DWR
Sbjct: 104 DLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSASAPASVDWR 163
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 184
GAVT VK+Q CG+CWAFS +EGI +I TG LVSLSEQEL+DCD + + GC GG+
Sbjct: 164 ASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD-TLDDGCDGGI 222
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 244
A +++ N GI TE DYPY G CN+ KL+ + V+I G + V +E L AV
Sbjct: 223 SYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLANAVA 282
Query: 245 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE--NGVDYWIIKNSWGRS 302
QPV+V I FQ Y G++ GPC T+L+H V +VGY E G YWI+KNSWG+
Sbjct: 283 GQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAGDRYWIVKNSWGQG 342
Query: 303 WGMNGYMHMQRNT-GNSLGICGINMLASYP 331
WG +GY+ M+++ G G+CGI + SYP
Sbjct: 343 WGDDGYIRMKKDVAGKPEGLCGIAIRPSYP 372
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 140/320 (43%), Positives = 192/320 (60%), Gaps = 24/320 (7%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
++ FE + G+ Y S + + R IF N F+ +HN G+S+F++S+N F D
Sbjct: 28 ELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTD 87
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNA-----SVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
L+++EF+A+F G+ RR A SV + ++ +PA++DW KG VT +K+Q
Sbjct: 88 LSNEEFRATFNGY--------RRLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQ 139
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIK 194
CG+CWAFSA ++EG + + TG LVSLSEQ L+DC + + GC GG MDYA+++VI+
Sbjct: 140 QQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQ 199
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGIC 253
N GIDTE YPY+ C + K N TI + DV +E L AV + P+SV I
Sbjct: 200 NRGIDTEASYPYKAIDESC-EFKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAID 258
Query: 254 GSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311
S+ +FQ YSSG++ P CST LDH V VGY + NGV YW +KNSWG SWG GY+ M
Sbjct: 259 ASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGVPYWKVKNSWGTSWGQKGYIFM 318
Query: 312 QRNTGNSLGICGINMLASYP 331
RN N CGI ASYP
Sbjct: 319 SRNKQNQ---CGIATKASYP 335
>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
Length = 388
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 187/320 (58%), Gaps = 22/320 (6%)
Query: 25 INELFETW---CKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAF 78
I + +E W +QHGK Y E+ + + F N + +HN G SSF + N
Sbjct: 76 IKQGYEQWRLFKEQHGKNYEDEETENDHMLAFLSNLEEIRKHNARYQRGESSFEMGTNHI 135
Query: 79 ADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASC 138
DL +E++ L D R P N+ +VP DWR G VTEVK+Q C
Sbjct: 136 TDLPFEEYRK--LNGYKPRYDDSHRNGTKFLVPFNI-NVPGHWDWRDHGYVTEVKNQGMC 192
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 197
G+CWAFSATGA+EG +K GSLVSLSEQ L+DC R Y N+GC GGLMDYA++++ NHG
Sbjct: 193 GSCWAFSATGALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYIKDNHG 252
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTI--DGYKDVPENNEKQLLQAVVAQ-PVSVGICG 254
+DTE YPY+G+ +C+ N+ V +GY D+PE +E++L AV Q P+SV I
Sbjct: 253 VDTEASYPYKGKEMKCH---FNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGPISVAIDA 309
Query: 255 SERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHM 311
+FQ+Y G++ P S SLDH VL+VGY + E DYWI+KNSWG WG GY+ +
Sbjct: 310 GHPSFQMYRKGVYYEPQCSSESLDHGVLVVGYGTDEIDGDYWIVKNSWGPGWGEKGYVRI 369
Query: 312 QRNTGNSLGICGINMLASYP 331
RN N CGI ASYP
Sbjct: 370 ARNRDNH---CGIASKASYP 386
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 144/336 (42%), Positives = 200/336 (59%), Gaps = 30/336 (8%)
Query: 11 ILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---G 67
+ L+ + + + +N +E+W + +GK Y+ ++E+ R I+ N + HN G
Sbjct: 4 FISLALVAMAAATSVNTEWESWKRTYGKEYT-QKEEALRHMIWNVNLKMIQMHNEKYMSG 62
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS-------PGNLRDVPAS 120
S++T ++N F DLT++E++ G+ ++ N +V S P N R PAS
Sbjct: 63 KSTYTQNMNQFGDLTNEEYRELMCGY--------KKSNKTVISKPSTFLLPSNYR-APAS 113
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
IDWR +G VT+VKDQ +CG+CWAFS+TG++EG TG LV LSEQ+L+DC Y N G
Sbjct: 114 IDWRTQGYVTDVKDQGACGSCWAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCSGDYGNMG 173
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
CGGG MD A+ + IK+ G ++E YPY G C ++ + T GY D+PE +E L
Sbjct: 174 CGGGWMDQAFSY-IKDKGEESEDGYPYTGTDDTC-VYDASKVVATDTGYTDIPEMDENAL 231
Query: 240 LQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGY-DSENGVDYWII 295
QAV P+SV I + +FQ Y SG++ P CS T+LDHAVL VGY SE G+DYWI+
Sbjct: 232 QQAVATVGPISVAIDATHSSFQFYESGVYDEPECSQTNLDHAVLAVGYGTSEEGLDYWIV 291
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSW WGM GY+ M RN N CGI ASYP
Sbjct: 292 KNSWSTGWGMQGYIEMSRNKDNQ---CGIASKASYP 324
>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 251 bits (640), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 203/329 (61%), Gaps = 14/329 (4%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
+L+ +++S +++ + E + ++ QH K Y SE E++ R+KIF +N V +H+
Sbjct: 4 LLILAAVVISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENDHKVAKHSK 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAA--SIDHDRRRNASVQ--SPGNLRDVP 118
+ G F L LN +AD+ H EF ++ GF+ +I N +V+ SP N++ +P
Sbjct: 64 LFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVK-LP 122
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
++DWR KGAVT+VKDQ CG+CW+FS +G++EG + TG LVSLSEQ L+DC Y N
Sbjct: 123 DTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGN 182
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
+GC GGLMD A++++ N GIDTE+ YPY + +C+ + N T G+ D+ E NE
Sbjct: 183 TGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAEDEKCHYKTQNSG-ATDKGFVDIEEGNED 241
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGY-DSENGVDYW 293
L AV PVS+ I S FQLYS G+++ P S LDH VL+VGY S++G DYW
Sbjct: 242 DLKAAVATVGPVSIAIDASYETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQDYW 301
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
++KNSW S G+NGY+ M RN N G+
Sbjct: 302 LVKNSWRPSCGLNGYIKMARNQDNMCGVA 330
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 154/364 (42%), Positives = 205/364 (56%), Gaps = 43/364 (11%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINE-----------LFETWCK---QHGKAYSSEQEK 46
M + L SI LL + S I E + W +H K+Y ++ E+
Sbjct: 1 MIRITLLLHSIFLLGFVNSEQISQIQEHPRNNLLINHPYYPVWTNFKLKHAKSYKTKDEE 60
Query: 47 QQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR 103
R ++F N+ + QHN G SF LSLN FAD+T+ EF+ GF + +R
Sbjct: 61 LLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPA----KR 116
Query: 104 RNASVQS----------PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGI 153
+ A Q P N+ +P S+DWRK+G VT+VKDQ SCG+CWAFSATG++EG
Sbjct: 117 KLAKSQPLKEDGMIFEMPDNVT-IPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQ 175
Query: 154 NKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQ 212
+ TG LVSLSEQ L+DCD + GC GG MD A+Q+V N GIDTE YPY+G+ G+
Sbjct: 176 HYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEASYPYKGRDGR 235
Query: 213 CNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ--PVSVGICGSERAFQLYSSGIFTG- 269
C + K T G+ D+PE NE LL+A +A PVSV I + FQ YS G++
Sbjct: 236 C-RFKSEDVGATDTGFVDIPEGNET-LLEAAIATVGPVSVAIDAASFKFQFYSHGVYYDR 293
Query: 270 PCSTS-LDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINML 327
CS LDH VL VGY+S ++G Y+I+KNSW WG +GY+ M R N+ CGI +
Sbjct: 294 SCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMSRRKNNN---CGIATM 350
Query: 328 ASYP 331
ASYP
Sbjct: 351 ASYP 354
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 142/338 (42%), Positives = 195/338 (57%), Gaps = 17/338 (5%)
Query: 3 SLAFFLLSILL-LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
S+ F +L++L+ +S L + ++ + H K Y + R KIF N +
Sbjct: 5 SMKFLILAVLVGAASAALTLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLIA 64
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+HN G +++ L +N F D+ H EF ++ G + +R S +P
Sbjct: 65 RHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGL----LRSNRTYFGSTWIEPESVSLP 120
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
S+DWR+KGAVT VK+Q CG+CW+FS TGA+EG TG LVSLSEQ LIDC SY N
Sbjct: 121 KSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGN 180
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
+GCGGGLMD A+ ++ +NHGIDTE+ YPY G+ G+C K + G+ D+P NE+
Sbjct: 181 NGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDS-AGRDTGFVDIPSGNER 239
Query: 238 QLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGY-DSENGVDYW 293
L +A+ PVSV I S +FQ Y G++ P S SLDH VL VGY +++G DY+
Sbjct: 240 ALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQDYY 299
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
IIKNSWG WG GY+ M RN+ N CG+ ASYP
Sbjct: 300 IIKNSWGERWGQEGYVLMARNSKNE---CGVATQASYP 334
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 138/313 (44%), Positives = 194/313 (61%), Gaps = 18/313 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQE 85
++ + K HGK+Y ++E +R ++F + A + HN ++G +++ + LN F D+T +E
Sbjct: 19 WDLYKKVHGKSYGHDEEHFRR-QLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEE 77
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
F+ +F G + +R Q +P +DWR+KG VT VK+Q CG+CWAFS
Sbjct: 78 FR-NFKGLKFDAT-KTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFS 135
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
TG++EG + TG LVSLSEQ L+DC R N+GC GGLMD + ++ +N GIDTE+ Y
Sbjct: 136 TTGSLEGQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESY 195
Query: 205 PYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQL 261
PY G+ G C N + V + G+ DVP+ +E L AV + PVSV I S +FQ
Sbjct: 196 PYTGKDGDC---AFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQY 252
Query: 262 YSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Y G++ P CS S LDH VL+VGY +ENGVDYW++KNSWG +WG +GY+ M RN N
Sbjct: 253 YKEGVYDEPSCSFSQLDHGVLVVGYGTENGVDYWLVKNSWGPTWGQDGYIKMMRNKENQ- 311
Query: 320 GICGINMLASYPT 332
CGI +ASYPT
Sbjct: 312 --CGIASMASYPT 322
>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
Length = 363
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 137/325 (42%), Positives = 188/325 (57%), Gaps = 8/325 (2%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ S L + +LFE+W +H K Y + EK R +IF+DN ++ + N N
Sbjct: 46 FSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK-N 104
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
+S+ L LN FAD+++ EFK + G A + V + G++ ++P +DWR+KGA
Sbjct: 105 NSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDV-NIPEYVDWRQKGA 163
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT VK+Q SCG+ WAFSA IE I KI TG+L SEQEL+DCDR + GC GG A
Sbjct: 164 VTPVKNQGSCGSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGYPWSA 222
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
Q V + +GI YPY G C ++ + DG + V NE LL ++ QPV
Sbjct: 223 LQLVAQ-YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPV 281
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SV + + + FQLY GIF GPC +DHAV VGY G +Y +I+NSWG WG NGY
Sbjct: 282 SVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY----GPNYILIRNSWGTGWGENGY 337
Query: 309 MHMQRNTGNSLGICGINMLASYPTK 333
+ ++R TGNS G+CG+ + YP K
Sbjct: 338 IRIKRGTGNSYGVCGLYTSSFYPVK 362
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 133/309 (43%), Positives = 193/309 (62%), Gaps = 13/309 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAFADLTHQEFK 87
F+ W ++ K Y +++ + +R I+E N FV HN N FT+++N FADL EF
Sbjct: 24 FQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGEFG 83
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSAT 147
F G ++ + ++ P ++ VP ++DW++KGAVT +K+Q CG+CW+FS+T
Sbjct: 84 RIFNGLLPRPSSYN---STNIYKPSGVK-VPDTVDWKEKGAVTPIKNQGQCGSCWSFSST 139
Query: 148 GAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
G++EG + I TG+LVSLSEQ+L+DC Y N GC GGLMD +++++ G +TE +YPY
Sbjct: 140 GSLEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNYPY 199
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQPVSVGICGSERAFQLYSSG 265
+ G C + + +VT Y D+P+ +E L AV P+SV I S +FQLY+SG
Sbjct: 200 TAENGVC-RYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQLYNSG 258
Query: 266 IFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
++ ST LDH VL +GY +E+G DYW++KNSWG SWGM GY+ M RN N+ CG
Sbjct: 259 VYYASTCSSTQLDHGVLAIGYGTEDGKDYWLVKNSWGTSWGMEGYIKMSRNRNNN---CG 315
Query: 324 INMLASYPT 332
I ASYPT
Sbjct: 316 IATQASYPT 324
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 198/319 (62%), Gaps = 20/319 (6%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFA 79
S +N +++ + +++ + Y S+ E+++RL IF +N+ +++HN + G S+++ +NAF+
Sbjct: 61 SILNSMWQAFLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFS 120
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
D T+ E GF +S R+ S P + PA +DWR KGAVT VK+Q CG
Sbjct: 121 DKTNSELDV-LRGFRHSS---KASRSGSQYIPFDAAP-PAEVDWRTKGAVTPVKNQGDCG 175
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFSATG IEG + + TG LVSLSEQ+L+DC S N GC GGLMD A+++V ++ GID
Sbjct: 176 SCWAFSATGGIEGQHYLATGKLVSLSEQQLVDCSSS-NDGCDGGLMDLAFEYVKEHKGID 234
Query: 200 TEKDYPY----RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICG 254
TE YPY G A QC+ V + GY D+PE E L QAV P+SVGI
Sbjct: 235 TEVHYPYVSGNTGYARQCSFDP-KYAAVNVTGYVDIPEGQELLLQQAVGFHGPISVGINA 293
Query: 255 SERAFQLYSSGIFTG-PCST-SLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
+F Y SGI++ C+ LDH VL+VGY +NGV YW+IKNSWG WG NGY+ +
Sbjct: 294 GLPSFMAYESGIYSDHRCNPHDLDHGVLVVGYGVDNGVPYWLIKNSWGEDWGENGYVRIL 353
Query: 313 RNTGNSLGICGINMLASYP 331
RN N +CG+ +ASYP
Sbjct: 354 RNHNN---LCGVATMASYP 369
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 186/311 (59%), Gaps = 12/311 (3%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQE 85
F W + ++Y S E+ R +I+ +N FV HN + G S+ L + FAD+ ++E
Sbjct: 26 FHAWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEE 85
Query: 86 FKASFLGFSAASIDHDR-RRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
+K S + RR ++ D+P ++DWR KG VT+VKDQ CG+CWAF
Sbjct: 86 YKRVISQGCLHSFNASLPRRGSTFFRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSCWAF 145
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKD 203
SATG++EG + TG+LVSLSEQ+L+DC Y N GC GGLMDYA+Q++ N GIDTE+
Sbjct: 146 SATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGIDTEES 205
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLY 262
YPY + G+C N T GY +V + +E L +AV P+SVGI S+ +FQ Y
Sbjct: 206 YPYEAENGKCRYNPDNIG-ATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSFQFY 264
Query: 263 SSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
SG++ P S LDH VL VGY +E+G DYW++KNSWG WG GY+ M RN N
Sbjct: 265 ESGVYNEPDCSSLELDHGVLAVGYGTEDGNDYWLVKNSWGLEWGDKGYIKMSRNKSNQ-- 322
Query: 321 ICGINMLASYP 331
CGI ASYP
Sbjct: 323 -CGIATAASYP 332
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 142/339 (41%), Positives = 194/339 (57%), Gaps = 17/339 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L + +I +SS+ LN I E ++ + Q K Y +E+ R K++ DN + +H
Sbjct: 7 LGLVVFAISSVSSINLNEI--IEEEWDLFKVQFKKIYEDVKEEAFRKKVYLDNKLKIARH 64
Query: 64 NNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR---RNASVQSPGNLRDV 117
N + G ++ L +N F DL E+ GF + D+ +A +
Sbjct: 65 NKLYETGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDKNFTDDDAVTFLKSENVVI 124
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P SIDWRKKG VT VK+Q CG+CW+FSATG++EG + TG LVSLSEQ LIDC R Y
Sbjct: 125 PKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYG 184
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N+GC GGLMD A++++ N G+DTEK YPY + +C N T G+ D+PE +E
Sbjct: 185 NNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSG-ATDKGFVDIPEGDE 243
Query: 237 KQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSEN-GVDY 292
L+ A+ PVS+ I S FQ Y G+F P ST LDH VL VGY +++ G DY
Sbjct: 244 DALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDY 303
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WI+KNSWG++WG GY+ M RN N+ CG+ ASYP
Sbjct: 304 WIVKNSWGKTWGDQGYIMMARNKKNN---CGVASSASYP 339
>gi|313507179|pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin
At 1.7 Angstroms Resolution By Fast Fourier
Least-Squares Methods
Length = 220
Score = 250 bits (639), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 118/218 (54%), Positives = 154/218 (70%), Gaps = 2/218 (0%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P+ +DWR GAV ++K Q CG WAFSA +EGINKI +GSL+SLSEQELIDC R+
Sbjct: 1 LPSYVDWRSAGAVVDIKSQGECGGXWAFSAIATVEGINKITSGSLISLSEQELIDCGRTQ 60
Query: 177 NS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N+ GC GG + +QF+I + GI+TE++YPY Q G C+ ++ VTID Y++VP NN
Sbjct: 61 NTRGCDGGYITDGFQFIINDGGINTEENYPYTAQDGDCDVALQDQKYVTIDTYENVPYNN 120
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E L AV QPVSV + + AF+ Y+SGIFTGPC T++DHA++IVGY +E GVDYWI+
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYASGIFTGPCGTAVDHAIVIVGYGTEGGVDYWIV 180
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
KNSW +WG GYM + RN G + G CGI + SYP K
Sbjct: 181 KNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 217
>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 250 bits (639), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 203/329 (61%), Gaps = 14/329 (4%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN 65
+L+ +++S +++ + E + ++ QH K Y SE E++ R+KIF +N V +H+
Sbjct: 4 LLILAAVVISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHSK 63
Query: 66 M---GNSSFTLSLNAFADLTHQEFKASFLGFSAA--SIDHDRRRNASVQ--SPGNLRDVP 118
+ G F L LN +AD+ H EF ++ GF+ +I N +V+ SP N++ +P
Sbjct: 64 LFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVK-LP 122
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
++DWR KGAVT+VKDQ CG+CW+FS +G++EG + TG LVSLSEQ L+DC Y N
Sbjct: 123 DTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGN 182
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
+GC GGLMD A++++ N GIDTE+ YPY + +C+ + N T G+ D+ E NE
Sbjct: 183 NGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAEDEKCHYKTQNSG-ATDKGFVDIEEGNED 241
Query: 238 QLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGY-DSENGVDYW 293
L AV P+S+ I S FQLYS G+++ P S LDH VL+VGY S++G DYW
Sbjct: 242 DLKAAVATVGPISIAIDASYETFQLYSDGVYSDPECISQELDHGVLVVGYGTSDDGQDYW 301
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
++KNSW S G+NGY+ M RN N G+
Sbjct: 302 LVKNSWRPSCGLNGYIKMARNQDNMCGVA 330
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 250 bits (639), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 135/343 (39%), Positives = 198/343 (57%), Gaps = 15/343 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
L F L + ++ + P D + + FE W ++G+ Y EK +R +IF++N
Sbjct: 7 LVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVN 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+ N+ +S+TL +N F D+T+ EF A + G S ++ +R S ++ VP
Sbjct: 67 HIETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLP-LNIEREPVVSFDDV-DISAVP 124
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
SIDWR GAVT VK+ CG+CWAF+A +E I KI G L+SLSEQ+++DC SY
Sbjct: 125 QSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAVSY-- 182
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQ--CNKQKLNRHIVTIDGYKDVPENNE 236
GC GG ++ AY F+I N G+ + YPY+ GQ C + I GY V NNE
Sbjct: 183 GCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTCRINGVPNS-AYITGYTRVQSNNE 241
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWII 295
+ ++ AV QP++ I S FQ Y G+F+GPC TSL+HA+ I+GY + +G +WI+
Sbjct: 242 RSMMYAVSNQPIAASIEASGD-FQHYKRGVFSGPCGTSLNHAITIIGYGQDSSGKKFWIV 300
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT-KTGQN 337
+NSWG SWG GY+ M R+ +S G+CGI + YPT ++G N
Sbjct: 301 RNSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYPTLQSGAN 343
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 192/329 (58%), Gaps = 13/329 (3%)
Query: 11 ILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MG 67
++L+ S+ + D+ +E + HGK Y S E+ R IF DN + +HN MG
Sbjct: 4 LILVLSVTMATAMDVE--WEAFKLTHGKQYKSPDEENVRRAIFRDNNQMIKEHNQEAAMG 61
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
S+ + +N F DL H E+ +G ++ +S L+ V ++DWR+KG
Sbjct: 62 RRSYFMGMNQFGDLAHSEYLELVVGPGLLPLNLSTPSENVFESTPGLQ-VDDTVDWRQKG 120
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMD 186
AVT +KDQ CG+CWAFS TG++EG + + TG LVSLSEQ L+DC R + N GC GGLMD
Sbjct: 121 AVTPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMD 180
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VA 245
A++++ N GIDTE+ YPY + + K + T+ Y D+ +E L+QAV
Sbjct: 181 QAFRYIKSNGGIDTEECYPYMAKDEKVCDYKTSCSGATLSSYTDIKAMDEMALMQAVGTV 240
Query: 246 QPVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 303
PVSV I S ++ + Y SGI+ P CS T LDH VL VGY S +G+DYW++KNSWG +W
Sbjct: 241 GPVSVAIDASHKSLRFYKSGIYDEPECSRTKLDHGVLAVGYGSMDGMDYWLVKNSWGSAW 300
Query: 304 GMNGYMHMQRNTGNSLGICGINMLASYPT 332
G GY+ M RN N CGI ASYP
Sbjct: 301 GDMGYVKMTRNKNNQ---CGIATKASYPV 326
>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 184/316 (58%), Gaps = 14/316 (4%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLT 82
N+ +E W QHGK Y +E E+ R IFE N + +HN ++G S+TL++N F D+
Sbjct: 21 NKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMH 80
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
H+EF +G I + V + +P S+DWR V+EVKDQ CG+CW
Sbjct: 81 HEEFHQRIMG-GCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCW 139
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFS TG++EG + TG LV LSEQ+L+DC + + N GCGGGLMD A+Q++ N G+DTE
Sbjct: 140 AFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYITANGGLDTE 199
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQ 260
+ YPY + K + T+ GYKDV NE L +AV PVSV I +FQ
Sbjct: 200 ESYPYTATDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQ 259
Query: 261 LYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVD---YWIIKNSWGRSWGMNGYMHMQRNT 315
YSSG++ P CST LDH VL VGY + N +WI+KNSWG SWG GY+ M RN
Sbjct: 260 FYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNK 319
Query: 316 GNSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 320 NNQ---CGIATSASYP 332
>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 144/336 (42%), Positives = 191/336 (56%), Gaps = 14/336 (4%)
Query: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN- 64
LL ++ + S+ N+ +E W QHGK Y +E E+ R IFE N + +HN
Sbjct: 1 MMLLILVAVISMATAGVLPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNI 60
Query: 65 --NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
++G S+TL++N F D+ H+EF +G I + V + +P S+D
Sbjct: 61 RASLGMHSYTLAMNKFGDMHHEEFHQRIMG-GCLKIVKKPLLGSEVGDNDDNGTLPKSVD 119
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCG 181
WR V+EVKDQ CG+CWAFS TG++EG + TG LV LSEQ+L+DC + + N GCG
Sbjct: 120 WRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCG 179
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLMD A+Q++ N G+DTE+ YPY + K + T+ GYKDV NE L +
Sbjct: 180 GGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKR 239
Query: 242 AV-VAQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVD---YWII 295
AV PVSV I +FQ YSSG++ P CST LDH VL VGY + N +WI+
Sbjct: 240 AVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIV 299
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSWG SWG GY+ M RN N CGI ASYP
Sbjct: 300 KNSWGPSWGDQGYIMMSRNKNNQ---CGIATSASYP 332
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 20/316 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
+NE ++ + ++GK Y S +E R ++E N F+ HN G SFTL++N F D+
Sbjct: 19 LNE-WQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDM 77
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E A+ GF +A R ++ P + ++P ++DWR KGAVT VKDQ +CG+C
Sbjct: 78 TTEEINAAMNGFLSAGKKVPR---GTMYQPL-VDELPDTVDWRDKGAVTPVKDQKACGSC 133
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATG++EG + + TG LVSLSEQ L+DC Y N GCGGGLMD A++++ N+GIDT
Sbjct: 134 WAFSATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDT 193
Query: 201 EKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSER 257
E+ YPY + G C + N V T+ Y D+ +E L +AV + PVSV I S
Sbjct: 194 EESYPYEAKNGPC---RFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTS 250
Query: 258 AFQLYSSGI-FTGPCSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
F YS GI + CS+S LDH VL VGY +++ DYW++KNSW +WG +GY+ M RN
Sbjct: 251 TFHFYSRGIYYDEKCSSSFLDHGVLAVGYGTDDSSDYWLVKNSWNETWGDSGYIKMSRNR 310
Query: 316 GNSLGICGINMLASYP 331
N+ CGI ASYP
Sbjct: 311 NNN---CGIASQASYP 323
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 178/310 (57%), Gaps = 10/310 (3%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN--NMGNSSFTLSLNAFADLTHQEF 86
F W + H K+Y + R +I++ N ++T N + SSFT+++N F DLT EF
Sbjct: 95 FTEWMRTHRKSYHHDH-FLPRFEIWKTNNRWITHWNKKHANASSFTVAINQFGDLTSDEF 153
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
+ G S + + N +P S DWR+KG V+ VKDQ CG+CWAFS
Sbjct: 154 NRLYNGLHVFSAPKASEKVERPRQWANTAGIPESGDWRQKGVVSRVKDQGMCGSCWAFST 213
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSY--NSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
TG+ EGIN I T LV LSEQ L+DC + N GC GG MD A++++I N GID+E Y
Sbjct: 214 TGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRYIIDNKGIDSEASY 273
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
PY GQC + K +P+ +EK LL A QP+SVGI +FQ YS
Sbjct: 274 PYVAADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVGIDAGRPSFQFYSK 333
Query: 265 GIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
G++ P ST L+H VLIVG+ E G YW++KNSWG++WGM+GY+ M R+ N C
Sbjct: 334 GVYNEPECSSTELNHGVLIVGWGVERGQAYWLVKNSWGQTWGMDGYIKMSRDKNNQ---C 390
Query: 323 GINMLASYPT 332
GI LASYP+
Sbjct: 391 GIATLASYPS 400
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 186/316 (58%), Gaps = 14/316 (4%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLT 82
N+ +E W QHGK Y +E E+ R IFE N + +HN ++G S+TL++N F D+
Sbjct: 21 NKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMH 80
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
H+EF +G I + V + +P S+DWR V+EVKDQ CG+CW
Sbjct: 81 HEEFHQRIMG-GCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCW 139
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFS TG++EG + TG LV LSEQ+L+DC + + N GCGGGLMD A+Q++ N G+DTE
Sbjct: 140 AFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTE 199
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQ 260
+ YPY + K + T+ GYKDV +NE L +AV PVSV I +FQ
Sbjct: 200 ESYPYTATDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVSVAIDAGHESFQ 259
Query: 261 LYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVD---YWIIKNSWGRSWGMNGYMHMQRNT 315
YSSG++ P CST LDH VL+VGY + N +WI+KNSWG +WG GY+ M RN
Sbjct: 260 FYSSGVYDEPQCSTEQLDHGVLVVGYGAMNDNSHQAFWIVKNSWGPNWGDQGYIMMSRNK 319
Query: 316 GNSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 320 NNQ---CGIATSASYP 332
>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
Length = 337
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 195/341 (57%), Gaps = 21/341 (6%)
Query: 2 NSLAFFLLSILLLS-----SLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDN 56
N L S+LL+S + L+ D++ +E W K HGK Y +E E +R +++E N
Sbjct: 5 NERGLMLASLLLVSLCVEAAAMLDVRLDVH--WELWKKSHGKTYPNEVEDVRRRELWERN 62
Query: 57 YAFVTQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGN 113
+T+HN +MG ++ LS+N DLT +E S+ + + D +R A G+
Sbjct: 63 LMLITKHNLEASMGLQTYDLSMNHMGDLTTEEIMQSYATLTPPA---DIQR-APAPFVGS 118
Query: 114 LRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD 173
DVP S+DWR +G VT VK Q SCG+CWAFSA GA+EG TG LV LS Q L+DC
Sbjct: 119 GADVPVSVDWRLQGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCS 178
Query: 174 RSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVP 232
Y N GC GG MD A+Q+VI N GID+E YPYRGQ QC+ R Y +P
Sbjct: 179 LKYGNKGCNGGFMDRAFQYVIDNKGIDSEASYPYRGQLQQCSYNPSYR-AANCSRYSFLP 237
Query: 233 ENNEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGV 290
E +E L A+ P+SV I + F Y SG++ P C+ ++H VL VGY +E+G
Sbjct: 238 EGDEGALKNALATIGPISVAIDATRPTFAFYRSGVYNDPTCTQRVNHGVLAVGYGTESGQ 297
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG S+G GY+ M RN + CGI + SYP
Sbjct: 298 DYWLVKNSWGTSFGDKGYIRMSRNKNDQ---CGIALYCSYP 335
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 192/340 (56%), Gaps = 19/340 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L +I +SS+ LN I E + + Q K Y +E+ R K++ DN + +H
Sbjct: 7 LGLVAFAISTVSSINLNEV--IEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLKIARH 64
Query: 64 NNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR----RNASVQSPGNLRD 116
N + G ++ L +N F DL E+ GF + DR + N+
Sbjct: 65 NKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSENVV- 123
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P S+DWRKKG VT VK+Q CG+CW+FSATG++EG + TG LVSLSEQ LIDC R Y
Sbjct: 124 IPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKY 183
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N+GC GGLMD A++++ N G+DTEK YPY + +C N T G+ D+PE +
Sbjct: 184 GNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSG-ATDKGFVDIPEGD 242
Query: 236 EKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE-NGVD 291
E L+ A+ PVS+ I S FQ Y G+F P ST LDH VL VG+ S+ G D
Sbjct: 243 EDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKKGGD 302
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSWG++WG GY+ M RN N+ CG+ ASYP
Sbjct: 303 YWIVKNSWGKTWGDEGYIMMARNKKNN---CGVASSASYP 339
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/326 (43%), Positives = 196/326 (60%), Gaps = 18/326 (5%)
Query: 19 LNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSL 75
+++ S + E +E + +H K Y SE E+ R+KIF +N + HN G+ ++ LS+
Sbjct: 19 VSFFSVVLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSM 78
Query: 76 NAFADLTHQEFKASFLGF----SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTE 131
N + D+ H EF ++ GF + ++ A+ P + +P ++DWR KGAVT
Sbjct: 79 NKYGDMLHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTP 138
Query: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQ 190
+KDQ CG+CWAFSATGA+EG TG LVSLSEQ L+DC R + N+GC GGLMD A++
Sbjct: 139 IKDQGQCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFE 198
Query: 191 FVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTID-GYKDVPENNEKQLLQAV-VAQPV 248
+V +N GIDTE+ YPY + +C+ R D G+ DV E +E L +AV PV
Sbjct: 199 YVKENGGIDTEESYPYDAEDEKCHYNP--RAAGAEDKGFVDVREGSEHALKKAVATVGPV 256
Query: 249 SVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYD-SENGVDYWIIKNSWGRSWGM 305
SV I S +FQ YS G++ P CS LDH VL+VGY ++G DYW++KNSWG +WG
Sbjct: 257 SVAIDASHESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGD 316
Query: 306 NGYMHMQRNTGNSLGICGINMLASYP 331
GY+ M RN N CGI AS+P
Sbjct: 317 QGYVKMARNRDNQ---CGIASSASFP 339
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 134/330 (40%), Positives = 189/330 (57%), Gaps = 22/330 (6%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS---SFTLSLNAFA 79
S + E F+ W + K+Y++ E+++R ++ N A++ N + ++ L A+
Sbjct: 44 SSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAAGLTYELGETAYT 103
Query: 80 DLTHQEFKASFLGFSAASIDHDRR----RNASVQS----PGNL-------RDVPASIDWR 124
DLT+QEF A + + A + D R V + PG L PAS+DWR
Sbjct: 104 DLTNQEFMAMYTAPAPAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSTSAPASVDWR 163
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 184
GAVT VK+Q CG+CWAFS +EGI +I TG LVSLSEQEL+DCD + + GC GG+
Sbjct: 164 ASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD-TLDDGCDGGI 222
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 244
A +++ N GI TE DYPY G CN+ KL+ + V+I G + V +E L AV
Sbjct: 223 SYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLANAVA 282
Query: 245 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE--NGVDYWIIKNSWGRS 302
QPV+V I FQ Y G++ GPC T+L+H V +VGY E G YWI+KNSWG+
Sbjct: 283 GQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAGGDRYWIVKNSWGQG 342
Query: 303 WGMNGYMHMQRNT-GNSLGICGINMLASYP 331
WG +GY+ M+++ G G+CGI + SYP
Sbjct: 343 WGDDGYIRMKKDVAGKPEGLCGIAIRPSYP 372
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 184/316 (58%), Gaps = 16/316 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ +E W + H K Y+ E+E +R KI+EDN V++HN ++G S+TL +N +ADL
Sbjct: 24 FDDTWEAWKQTHSKQYTKEEEDNRR-KIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYADL 82
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+EF G D R R P S+DWR +G VT VKDQ CG+C
Sbjct: 83 RGEEFVQMMNGLK---FDASRERQGIKFLSYAKFQAPDSVDWRDEGYVTPVKDQGQCGSC 139
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS TG++EG + TG L SLSEQ L+DC SY N+GC GGLMDYA+Q++ N GIDT
Sbjct: 140 WAFSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDT 199
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAF 259
E YPY + C N T GY DV +E L +A A P+SV I S +F
Sbjct: 200 EDKYPYEAEDDTCRFSPDNVG-ATDSGYVDVDSGDEDALKEACAANGPISVAIDASHESF 258
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
QLY SG++ S LDH VL+VGY +++ G DYWI+KNSWG SWG GY+ M RN
Sbjct: 259 QLYESGVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRNKD 318
Query: 317 NSLGICGINMLASYPT 332
N CGI ASYPT
Sbjct: 319 NQ---CGIATSASYPT 331
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 136/300 (45%), Positives = 181/300 (60%), Gaps = 20/300 (6%)
Query: 44 QEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADLTHQEFKASFLG-----FSA 95
+E+ +R++IFE+N + HNN +G ++ L N FA +T+ EF A+ +G +A
Sbjct: 14 KEESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGGCLLDRNA 73
Query: 96 ASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINK 155
+ DR Q NL ++P ++DWR KG VT VK+Q CG+CWAFS TG++EG
Sbjct: 74 SKSTADRVH----QYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTF 129
Query: 156 IVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCN 214
TG LVSLSEQ L+DC + N GC GGLMD A++++ N GIDTE YPY + G+C
Sbjct: 130 KKTGKLVSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCR 189
Query: 215 KQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--C 271
+ + T+ GY D+ E +E L QAV P+SV I S FQ+YS G++ P
Sbjct: 190 FKPADVG-ATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCS 248
Query: 272 STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
ST LDH VL VGY +E G DYW++KNSWG WG NGY+ M RN N CGI ASYP
Sbjct: 249 STELDHGVLAVGYGTEGGKDYWLVKNSWGEVWGQNGYIMMSRNKNNQ---CGIATSASYP 305
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 190/316 (60%), Gaps = 17/316 (5%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-SFTLSLNAFADLT 82
D E +E+W K+HGK Y+S++E+ R I++ N +V +HN FT+ +N FADL
Sbjct: 17 DFPEEWESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLE 76
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
EF + G++ ++ S + D+P S+DWR KG VT +K+Q CG+CW
Sbjct: 77 SSEFGRLYNGYNNKP---SMKKAQSKVFSTKVGDLPTSVDWRTKGFVTAIKNQGQCGSCW 133
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA +EG + TG+LVSLSEQ L+DC + N GC GGLMD A+Q+VIKN GIDTE
Sbjct: 134 AFSAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTE 193
Query: 202 KDYPYRGQAGQCNKQKLNRHIV--TIDGYKDV-PENNEKQLLQAVVAQ-PVSVGICGSER 257
YPY+ +C K N V T G+ D+ P +E L AV P+SV I S
Sbjct: 194 ASYPYKAVDQKC---KFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHT 250
Query: 258 AFQLYSSGIFT-GPCS-TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
+FQLY SG+++ CS TSLDH V VGYDS +GV YWI+KNSWG +WG GY+ M RN
Sbjct: 251 SFQLYKSGVYSESACSQTSLDHGVTAVGYDSSSGVAYWIVKNSWGTTWGQAGYIWMSRNK 310
Query: 316 GNSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 311 NNQ---CGIATAASYP 323
>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 184/316 (58%), Gaps = 14/316 (4%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLT 82
N+ +E W QHGK Y +E E+ R IFE N + +HN ++G S+TL++N F D+
Sbjct: 21 NKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMH 80
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
H+EF +G I + V + +P S+DWR V+EVKDQ CG+CW
Sbjct: 81 HEEFHQRIMG-GCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCW 139
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFS TG++EG + TG LV LSEQ+L+DC + + N GCGGGLMD A+Q++ N G+DTE
Sbjct: 140 AFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTE 199
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQ 260
+ YPY + K + T+ GYKDV NE L +AV PVSV I +FQ
Sbjct: 200 ESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQ 259
Query: 261 LYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVD---YWIIKNSWGRSWGMNGYMHMQRNT 315
YSSG++ P CST LDH VL VGY + N +WI+KNSWG SWG GY+ M RN
Sbjct: 260 FYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNK 319
Query: 316 GNSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 320 NNQ---CGIATSASYP 332
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 196/315 (62%), Gaps = 14/315 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++E W ++GK Y+ EK++R KIF+DN + +HN+ N S+ LN F+DLT EF+
Sbjct: 40 MYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQ 99
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVT-EVKDQASCGACWAFS 145
AS+LG ++ + + + DV P +DWR++GAV VK Q CG+CWAF+
Sbjct: 100 ASYLG---GKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFA 156
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
ATGA+EGIN+I TG LVSLSEQELIDCDR + N GC GG +A++F+ +N GI +++ Y
Sbjct: 157 ATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVY 216
Query: 205 PYRGQ---AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
Y G+ A + + K R +VTI+G++ VP N+E L +AV QP+SV I +
Sbjct: 217 GYTGEDTAACKAIEMKTTR-VVTINGHEVVPVNDEMSLKKAVAYQPISVMISAAN--MSD 273
Query: 262 YSSGIFTGPCSTSL-DHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Y SG++ G CS DH VLIVGY S + DYW+I+NSWG WG GY+ +QRN
Sbjct: 274 YKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPT 333
Query: 320 GICGINMLASYPTKT 334
G C + + YP K+
Sbjct: 334 GKCAVAVAPVYPIKS 348
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 196/315 (62%), Gaps = 14/315 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++E W ++GK Y+ EK++R KIF+DN + +HN+ N S+ LN F+DLT EF+
Sbjct: 40 MYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQ 99
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVT-EVKDQASCGACWAFS 145
AS+LG ++ + + + DV P +DWR++GAV VK Q CG+CWAF+
Sbjct: 100 ASYLG---GKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFA 156
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
ATGA+EGIN+I TG LVSLSEQELIDCDR + N GC GG +A++F+ +N GI +++ Y
Sbjct: 157 ATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVY 216
Query: 205 PYRGQ---AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
Y G+ A + + K R +VTI+G++ VP N+E L +AV QP+SV I +
Sbjct: 217 GYTGEDTAACKAIEMKTTR-VVTINGHEVVPVNDEMSLKKAVAYQPISVMISAAN--MSD 273
Query: 262 YSSGIFTGPCSTSL-DHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Y SG++ G CS DH VLIVGY S + DYW+I+NSWG WG GY+ +QRN
Sbjct: 274 YKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPT 333
Query: 320 GICGINMLASYPTKT 334
G C + + YP K+
Sbjct: 334 GKCAVAVAPVYPIKS 348
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 141/337 (41%), Positives = 194/337 (57%), Gaps = 17/337 (5%)
Query: 4 LAFFLLSILL-LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+ F +L++L+ +S L + ++ + H K Y + R KIF N + +
Sbjct: 1 MKFLILAVLVGAASAALTLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLIAR 60
Query: 63 HN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
HN G +++ L +N F D+ H EF ++ G + +R S +P
Sbjct: 61 HNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGL----LRSNRTYFGSTWIEPESVSLPK 116
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
S+DWR+KGAVT VK+Q CG+CW+FS TGA+EG TG LVSLSEQ LIDC SY N+
Sbjct: 117 SVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNN 176
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GCGGGLMD A+ ++ +NHGIDTE+ YPY G+ G+C K + G+ D+P NE+
Sbjct: 177 GCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDS-AGRDTGFVDIPSGNERA 235
Query: 239 LLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGY-DSENGVDYWI 294
L +A+ PVSV I S +FQ Y G++ P S SLDH VL VGY +++G DY+I
Sbjct: 236 LAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQDYYI 295
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
IKNSWG WG GY+ M RN+ N CG+ ASYP
Sbjct: 296 IKNSWGERWGQEGYVLMARNSKNE---CGVATQASYP 329
>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
Length = 342
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 197/319 (61%), Gaps = 18/319 (5%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
S++NEL+ + + +GK+Y +++ +R ++E N ++ HN ++G SF++ +N +
Sbjct: 34 SELNELWTEYKETYGKSYDMKEDVVRR-SLWEGNLRHISMHNVKHDLGKHSFSMGINELS 92
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
DLT E++ LG A + ++ N VP +DWR KG VT VK+Q +CG
Sbjct: 93 DLTPSEYRQR-LGLRPALGERTGKKFVY-----NGEKVPEHVDWRDKGYVTPVKNQGACG 146
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGI 198
+CWAFS+TG++EG + +TG LVSLSEQ L+DC + Y N+GC GG MD A+ +V N+GI
Sbjct: 147 SCWAFSSTGSLEGQHFRLTGQLVSLSEQNLVDCTKKYGNAGCNGGWMDNAFNYVKANNGI 206
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHI-VTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSE 256
DTE YPY G C H G+ DV + +E L QAV PVSVGI +
Sbjct: 207 DTEAFYPYEGHDDWCGYDGSPGHKGANCTGHVDVQQGDELALKQAVATVGPVSVGIDATH 266
Query: 257 RAFQLYSSGIFTG-PCS-TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 314
R+FQLY SGI+ CS +S DHAVL+VGY S+ G DYW++KNSWG SWGM+GY+ M RN
Sbjct: 267 RSFQLYKSGIYDEVACSNSSTDHAVLVVGYGSQGGHDYWLVKNSWGTSWGMDGYIMMSRN 326
Query: 315 TGNSLGICGINMLASYPTK 333
GN C I ASYPT+
Sbjct: 327 KGNQ---CAIASYASYPTE 342
>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 184/316 (58%), Gaps = 14/316 (4%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLT 82
N+ +E W QHGK Y +E E+ R IFE N + +HN ++G S+TL++N F D+
Sbjct: 21 NKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMH 80
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
H+EF +G I + V + +P S+DWR V+EVKDQ CG+CW
Sbjct: 81 HEEFHQRIMG-GCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCW 139
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFS TG++EG + TG LV LSEQ+L+DC + + N GCGGGLMD A+Q++ N G+DTE
Sbjct: 140 AFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTE 199
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQ 260
+ YPY + K + T+ GYKDV NE L +AV PVSV I +FQ
Sbjct: 200 ESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQ 259
Query: 261 LYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVD---YWIIKNSWGRSWGMNGYMHMQRNT 315
YSSG++ P CST LDH VL VGY + N +WI+KNSWG SWG GY+ M RN
Sbjct: 260 FYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNK 319
Query: 316 GNSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 320 NNQ---CGIATSASYP 332
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 137/316 (43%), Positives = 192/316 (60%), Gaps = 15/316 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
+L++ + H + Y E E+ QR ++F +N + HN + G SS+ + +N FAD+
Sbjct: 40 FEKLWQDFKTVHERNYG-ETEEMQRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADM 98
Query: 82 THQEFKASFLGFSAASIDHDRRR-NASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
+EF + GF + R ++ SP +PA +DWRK+G VT +KDQ CG+
Sbjct: 99 EVKEFASVVNGFRMNNRTKVRDHLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGS 158
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGID 199
CW+FS TGA+EG + TG LVSLSEQ LIDC SY N+GC GG+MDYA+Q++ N G D
Sbjct: 159 CWSFSTTGALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDD 218
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTID-GYKDVPENNEKQLLQAV-VAQPVSVGICGSER 257
TE YPY G C +K ++ D GY D+P+ +E+++ +AV + PVSV I S
Sbjct: 219 TEDSYPYEAADGPCRFKK--EYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHT 276
Query: 258 AFQLYSSGIFTG-PCSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
+FQ+Y SG++ C LDH VL+VGY +E G DYW++KNSWG WG GY+ M RN
Sbjct: 277 SFQMYQSGVYDEVECDPEGLDHGVLVVGYGTELGQDYWLVKNSWGTKWGDEGYIKMSRNK 336
Query: 316 GNSLGICGINMLASYP 331
N CGI+ +ASYP
Sbjct: 337 NNQ---CGISSMASYP 349
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 146/347 (42%), Positives = 197/347 (56%), Gaps = 19/347 (5%)
Query: 3 SLAFFLLSILLLSSLPLN---YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
SLA LLL+ + + E F+ W ++ + Y++ +E QQR I+ +N F
Sbjct: 35 SLALMFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRF 94
Query: 60 VTQHNNMGN-SSFTLSLNAFADLTHQEFKASFL--------GFSAASIDHDRRRNASVQS 110
+ N + SS+ L N F DLT +EFK ++L A A + +
Sbjct: 95 IKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMPPTVGTMSTAGMSN 154
Query: 111 PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
N + P S+DWR KGAVT VKDQ CG+CWAF+ +IEG+++I TG LVSLSEQE++
Sbjct: 155 GNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIV 214
Query: 171 DCDRSYN-SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
DCDR N +GC GG A ++V +N G+ TE DYPY G QC KL H I GY+
Sbjct: 215 DCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQ 274
Query: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCS-TSLDHAVLIVGYDS-- 286
V NNE +L +AV QPV+V + S RAFQ Y SG+F+GPC T+++H V +VGY S
Sbjct: 275 AVQRNNEAELERAVAGQPVAVFVDAS-RAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTG 333
Query: 287 --ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
G YWI+KNSWG+ WG NGY+ M R G+C I + YP
Sbjct: 334 SDSGGRKYWIVKNSWGQGWGENGYVRMARRVRAREGMCAIAIEPYYP 380
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 201/332 (60%), Gaps = 9/332 (2%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
+LSI +S+ + + F W + + KAY+ +E R + F+ N +V
Sbjct: 9 FTLIVLSISFISAGNVFSHKQYQDSFIDWMRSNNKAYT-HKEFMPRYEEFKKNMDYVHNW 67
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFLGFSA-ASIDHDRRRNASVQSPGNLRDVPASID 122
N+ G S L LN ADL+++E++ ++LG A ++ +RN ++ P ++D
Sbjct: 68 NSKG-SKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRPQFKQPLNVD 126
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCG 181
WR+K AVT VKDQ CG+C++FS TG++EG+ I TG LVSLSEQ ++DC S+ N GC
Sbjct: 127 WREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCN 186
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLM A++++IKN+G+++E+ YPY + K + I YK++ +E L
Sbjct: 187 GGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDLQN 246
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSW 299
A++ PVSV I S +FQLY++G++ P S LDH VL VG ++NG DY+I+KNSW
Sbjct: 247 ALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYYIVKNSW 306
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
G SWG+NGY+HM RN N+ CGI+ +ASYP
Sbjct: 307 GPSWGLNGYIHMARNKDNN---CGISTMASYP 335
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 124/264 (46%), Positives = 169/264 (64%), Gaps = 17/264 (6%)
Query: 78 FADLTHQEFKASFLGFSAASI---------DHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
FA++T+ EF++ + G+ S+ R +N S + +P ++DWRKKGA
Sbjct: 2 FAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGA------LPIAVDWRKKGA 55
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
VT +K+Q SCG CWAFSA AIEG +I G L+SLSEQ+L+DCD + + GC GGL+D A
Sbjct: 56 VTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLIDTA 114
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
++ ++ G+ TE +YPY+G+ C + +I GY+DVP N+E L++AV QPV
Sbjct: 115 FEHIMATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPV 174
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNG 307
SVGI G FQ YSSG+FTG C+T LDHAV VGY S G YWIIKNSWG WG G
Sbjct: 175 SVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGG 234
Query: 308 YMHMQRNTGNSLGICGINMLASYP 331
YM ++++ + G+CG+ M ASYP
Sbjct: 235 YMRIKKDIKDKEGLCGLAMKASYP 258
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 141/342 (41%), Positives = 198/342 (57%), Gaps = 21/342 (6%)
Query: 5 AFFLLSILL-LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
+ FL ++ L ++S + +++ + W +GK Y+ ++E +R ++E N + QH
Sbjct: 4 SLFLAALCLGIASAAPRFNENLDARWTRWKAANGKLYNKDEEVWRR-AVWEKNMKMIDQH 62
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N + G SF L++NAF DLT++EFK G I + R N P + P+S
Sbjct: 63 NEEYSQGKHSFILAMNAFGDLTNEEFKQVMNGLK---IQNPREGNMFQLLP--FAETPSS 117
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+KG VT VKDQ CG+CWAFSATGA+EG TG LVSLSEQ L+DC R+ N+G
Sbjct: 118 VDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAG 177
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A+++V N G+D+E+ YPY Q G+C K K + G+ D+ ++ E +
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRC-KYKPEQSAANDTGFADIHQDEESLM 236
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYW 293
L P+SV I S F+ Y GI+ P S LDH VL+VGY S+ +YW
Sbjct: 237 LSVATVGPISVAIDASLDTFRFYYKGIYYDPNCSSEDLDHGVLVVGYGSDEREAENKNYW 296
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTG 335
I+KNSWG WGM GY+ M ++ GN CGI AS+P G
Sbjct: 297 IVKNSWGTQWGMQGYILMAKDRGNH---CGIATSASFPIVEG 335
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 53/99 (53%), Gaps = 6/99 (6%)
Query: 225 IDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIV 282
+ G +VP+ E +L PVS I S +FQ GI+ P S LDH VL+V
Sbjct: 394 VTGPVNVPQQEEAVMLAVAAGGPVSAAIRASLGSFQFCKEGIYYDPNCSSEDLDHGVLVV 453
Query: 283 GYDSEN----GVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
GY S+ +YWI+KNSWG WG+ GYM + R+ N
Sbjct: 454 GYGSDEREAENKNYWIVKNSWGTDWGLQGYMLLVRDWDN 492
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 133/306 (43%), Positives = 186/306 (60%), Gaps = 12/306 (3%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKAS 89
+ W +HG+ Y E EK +R ++F+ N FV + N G S+ L++N FAD+T+ EF A
Sbjct: 50 QQWMAEHGRTYKDEAEKARRFQVFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAM 109
Query: 90 FLGFSAASIDHDRRRNASVQSPGNLRDVPA-SIDWRKKGAVTEVKDQASCGACWAFSATG 148
+ G + ++ L DV ++DWR+KGAVT +K+Q CG CWAF+A
Sbjct: 110 YTGLKPVPAGPKKMAGFKYENL-TLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVA 168
Query: 149 AIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRG 208
A+E I++I TG+LVSLSEQ+++DCD N+GC GG +D A+Q++I N G+ TE YPY
Sbjct: 169 AVESIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAA 228
Query: 209 QAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFT 268
G C Q + VTI Y+DVP +E L AV QPV+V I + FQ YSSG+ T
Sbjct: 229 AQGTC--QSSVQPAVTISSYQDVPSGDEAALAAAVANQPVAVAI-DAHNNFQFYSSGVLT 285
Query: 269 G-PCST-SLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
C T SL+HAV VGY + E+G YW++KN WG++WG GY+ ++R T CG+
Sbjct: 286 ADTCGTPSLNHAVTAVGYSTAEDGTPYWLLKNQWGQNWGEGGYLRVERGT----NACGVA 341
Query: 326 MLASYP 331
ASYP
Sbjct: 342 QQASYP 347
>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 183/316 (57%), Gaps = 14/316 (4%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLT 82
N+ +E W QHGK Y +E E+ R IFE N + +HN ++G S+TL++N F D+
Sbjct: 21 NKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMH 80
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
H+EF +G I + V + +P S+DWR V+EVKDQ CG CW
Sbjct: 81 HEEFHQRIMG-GCLKIVKKPLLGSEVGDSDDNGTLPKSVDWRNSHMVSEVKDQGECGPCW 139
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFS TG++EG + TG LV LSEQ+L+DC + + N GCGGGLMD A+Q++ N G+DTE
Sbjct: 140 AFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIPANGGLDTE 199
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQ 260
+ YPY + K + T+ GYKDV NE L +AV PVSV I +FQ
Sbjct: 200 ESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQ 259
Query: 261 LYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVD---YWIIKNSWGRSWGMNGYMHMQRNT 315
YSSG++ P CST LDH VL VGY + N +WI+KNSWG SWG GY+ M RN
Sbjct: 260 FYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNK 319
Query: 316 GNSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 320 NNQ---CGIATSASYP 332
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 141/337 (41%), Positives = 202/337 (59%), Gaps = 23/337 (6%)
Query: 7 FLLSIL---LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
FLL+ L ++S+ P + S + ++E W +HGK Y++ +E Q+R ++E+N + H
Sbjct: 5 FLLATLCLGMISAAPTHDPS-FDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLH 62
Query: 64 NN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N G F+L +NAF DLT+ EF+ GF + + ++ L D+P S
Sbjct: 63 NEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMG-----PKETTIFREPFLGDIPKS 117
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+ G VT VK+Q CG+CWAFSA G++EG TG LVSLSEQ L+DC SY N G
Sbjct: 118 LDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNLG 177
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLM++A+Q+V +N G+DT + Y Y Q G C + + G+ VP +E L
Sbjct: 178 CNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLC-RYNPKYSAANVTGFVKVPL-SEDDL 235
Query: 240 LQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE-NGVDYWII 295
+ AV + PVSVGI ++F+ YS G++ P ST +DHAVL+VGY E +G YW++
Sbjct: 236 MSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESDGGKYWLV 295
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWG WGM+GY+ M ++ N+ CGI A YPT
Sbjct: 296 KNSWGEDWGMDGYIKMAKDQNNN---CGIATYAIYPT 329
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 202/340 (59%), Gaps = 19/340 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDI--NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ F LL L +S++ DI ++ + +W QHGK+Y + E +R+ I+E+N +
Sbjct: 1 MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
QHN ++GN +F + +N F D+T++EF+ + G+ D +R + P
Sbjct: 60 QHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGPLFMEPKFFAAP 116
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
+DWR++G VT VKDQ CG+CW+FS+TGA+EG TG L+S+SEQ L+DC R + N
Sbjct: 117 QQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGN 176
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLMD A+Q+V +N G+D+E+ YPY + + ++ I G+ D+P+ NE
Sbjct: 177 QGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNEL 236
Query: 238 QLLQAVVA-QPVSVGICGSERAFQLYSSGI-FTGPCSTSLDHAVLIVGYDSEN----GVD 291
L+ AV A PVSV I S ++ Q Y SGI + C++ LDHAVL+VGY + G
Sbjct: 237 ALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGNR 296
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSW WG GY++M ++ N CGI +ASYP
Sbjct: 297 YWIVKNSWSDKWGDKGYIYMAKDKNNH---CGIATMASYP 333
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 192/314 (61%), Gaps = 15/314 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ ++ W K + K Y + E+ R I+E N FV HN +MG S+ LS+N D+
Sbjct: 24 LDNHWDLWKKTYSKQYKEKNEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLSMNHLGDM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E + S+ + +RN + +S N + +P S+DWR+KG VT+VK Q SCGAC
Sbjct: 84 TSEEVMSLM---SSLRVPSQWQRNVTFKSNPNQK-LPDSLDWREKGCVTDVKYQGSCGAC 139
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNHGID 199
WAFSA GA+E K+ TG LVSLS Q L+DC ++ N GC GG M A+Q++I N+GID
Sbjct: 140 WAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYIIDNNGID 199
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERA 258
+E YPY+ G+C NR T Y ++P +E L +AV + PVSVGI S +
Sbjct: 200 SEASYPYKATDGKCQYDPKNR-AATCSKYTELPYGSEDALKEAVANKGPVSVGIDASRPS 258
Query: 259 FQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
F LY SG++ P C+ +++H VL+VGY + NG DYW++KNSWG ++G GY+ M RN+GN
Sbjct: 259 FFLYKSGVYYDPSCTDNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGEQGYIRMARNSGN 318
Query: 318 SLGICGINMLASYP 331
CGI SYP
Sbjct: 319 H---CGIASFPSYP 329
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 196/338 (57%), Gaps = 16/338 (4%)
Query: 1 MNSLAFFLLSIL--LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
M ++ F +L + L+ P+ D N ++ W HGK Y ++ E+ R I+++N
Sbjct: 1 MEAVIFAVLLCISSALAMPPMEPLQDPN--WKAWKSFHGKEYPNKNEETMRNFIWQNNLK 58
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+ HN G SF L++N D+T E + LG + + A+ P N++ V
Sbjct: 59 KIVTHNE-GKHSFKLAMNHLGDMTSLEISQTLLGLKLKKHAESQPKGATFLPPANVK-VV 116
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
SIDWR KG VT VK+Q CG+CWAFS TGA+EG + TG LVSLSEQ L+DC Y N
Sbjct: 117 DSIDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSGKYGN 176
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTID-GYKDVPENNE 236
+GC GGLMD A+Q++ +N GIDTEK YPY + G C+ K I D G+ D+P +E
Sbjct: 177 NGCEGGLMDNAFQYIKENGGIDTEKSYPYLAKDGVCHYNK--SAIGAKDTGFVDIPTGDE 234
Query: 237 KQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYW 293
L QA+ + P+S+ I S+ F Y G++ P ST LDH VL VGY +++G DYW
Sbjct: 235 NALQQALASVGPISIAIDASQSTFHFYHQGVYDDPDCSSTRLDHGVLAVGYGTDDGKDYW 294
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
++KNSWG SWG GY+ + RN + CG+ ASYP
Sbjct: 295 LVKNSWGPSWGEEGYIKIARNDHDK---CGVASKASYP 329
>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
Length = 308
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 135/303 (44%), Positives = 179/303 (59%), Gaps = 10/303 (3%)
Query: 37 GKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADLTHQEFKASFLGF 93
GK Y+S E+ R IFE+N V QHN MG +F + +N F DLT +EF+ +G
Sbjct: 8 GKQYNSLSEENARHSIFEENSKIVKQHNEEAAMGKHTFFMKMNKFGDLTTEEFRMIVIGS 67
Query: 94 SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGI 153
+ ++ V V ++DWR+KGAVT+VK+Q CG+CWAFSATG++EG
Sbjct: 68 GFMQSNKTQQAEGGVFESLPGLKVDDTVDWRQKGAVTKVKNQEQCGSCWAFSATGSLEGQ 127
Query: 154 NKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQ 212
+ + T +LVSLSEQ L+DC R N GC GG MD A++++ N GIDTE+ Y YRG+
Sbjct: 128 HFLKTNNLVSLSEQNLVDCSRREGNKGCKGGSMDQAFKYIKMNGGIDTEECYSYRGRDES 187
Query: 213 CNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP- 270
+ K + T+ Y D+ +E L+QAV P+SV I ++FQLY G++ P
Sbjct: 188 MCRYKSSCSGATLSSYTDIKTGDEMALMQAVSTVGPISVAIDAGHKSFQLYHHGVYDEPK 247
Query: 271 -CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLAS 329
ST LDH VL VGY S NG DYW++KNSWG WGM GY+ M RN N CGI A
Sbjct: 248 CSSTHLDHGVLAVGYGSSNGSDYWLVKNSWGTEWGMEGYIMMSRNKHNQ---CGIATRAI 304
Query: 330 YPT 332
YP
Sbjct: 305 YPV 307
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 196/340 (57%), Gaps = 19/340 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L +I +SS+ LN I E + + Q K Y +E+ R K++ DN + +H
Sbjct: 7 LGLVAFAISSVSSINLNEV--IEEEWSLFKMQFKKLYEDIKEETFRKKVYLDNKLKIARH 64
Query: 64 NNM---GNSSFTLSLNAFADLTHQEFKASFLGF--SAASIDHDRRRNASVQ--SPGNLRD 116
N + G ++ L +N F DL E+ GF S A D + + V N+
Sbjct: 65 NKLYESGEETYALEMNHFGDLMQHEYSKMMNGFKPSLAGGDSNFTNDEGVTFLKSENVV- 123
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P SIDWRKKG VT VK+Q CG+CW+FSATG++EG + TG LVSLSEQ LIDC R Y
Sbjct: 124 IPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKY 183
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N+GC GGLMD A++++ N G+DTEK YPY + +C N T +G+ D+PE +
Sbjct: 184 GNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPDNSG-ATDNGFVDIPEGD 242
Query: 236 EKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE-NGVD 291
E+ L+ A+ PVS+ I S FQ Y G+F P ST LDH VL VG+ ++ G D
Sbjct: 243 EEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFRTDKKGGD 302
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSWG++WG GY+ M RN N+ CG+ ASYP
Sbjct: 303 YWIVKNSWGKTWGDEGYIMMARNKKNN---CGVASSASYP 339
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 182/318 (57%), Gaps = 8/318 (2%)
Query: 18 PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN-SSFTLSLN 76
PL Y + F W HG +S E +RL+ + N ++ +HN + L N
Sbjct: 21 PLEYEHE----FSAWMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHN 76
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
AF+ ++ EFK G ++R + V + +VP+++DW KG VT VK+Q
Sbjct: 77 AFSHMSFDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQG 136
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
CG+CWAFS TGA+EG + +G L+SLSEQEL+DCD + + GC GGLMD+A+Q++ +
Sbjct: 137 MCGSCWAFSTTGAVEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHG 196
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
GI +E DY Y+ +A C K +V + G++DV +E L AV QPVSV I +
Sbjct: 197 GICSEDDYEYKAKAQVCRKCD---SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQ 253
Query: 257 RAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
+AFQ Y SG+F C T LDH VL VGY ++NG +W +KNSWG SWG GY+ + R
Sbjct: 254 KAFQFYKSGVFNLTCGTRLDHGVLAVGYGNDNGQKFWKVKNSWGASWGEQGYIRLAREEN 313
Query: 317 NSLGICGINMLASYPTKT 334
G CGI + SYP T
Sbjct: 314 GPAGQCGIASVPSYPFAT 331
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 146/347 (42%), Positives = 197/347 (56%), Gaps = 19/347 (5%)
Query: 3 SLAFFLLSILLLSSLPLN---YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
SLA LLL+ + + E F+ W ++ + Y++ +E QQR I+ +N F
Sbjct: 9 SLALMFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRF 68
Query: 60 VTQHNNMGN-SSFTLSLNAFADLTHQEFKASFL--------GFSAASIDHDRRRNASVQS 110
+ N + SS+ L N F DLT +EFK ++L A A + +
Sbjct: 69 IKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMGPTVGTMSTAGMSN 128
Query: 111 PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
N + P S+DWR KGAVT VKDQ CG+CWAF+ +IEG+++I TG LVSLSEQE++
Sbjct: 129 GNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIV 188
Query: 171 DCDRSYN-SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229
DCDR N +GC GG A ++V +N G+ TE DYPY G QC KL H I GY+
Sbjct: 189 DCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQ 248
Query: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCS-TSLDHAVLIVGYDS-- 286
V NNE +L +AV +PV+V I S RAFQ Y SG+F+GPC T+++H V +VGY S
Sbjct: 249 AVQRNNEAELERAVAERPVAVFIDAS-RAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTG 307
Query: 287 --ENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
G YWI+KNSWG+ WG NGY+ M R G+C I + YP
Sbjct: 308 SDSGGRKYWIVKNSWGQGWGENGYVRMARRVRAREGMCAIAIEPYYP 354
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 129/267 (48%), Positives = 163/267 (61%), Gaps = 25/267 (9%)
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
+ S+ LS+N FADLT++EF S F A H A+ N+ VP++ DWRKKG
Sbjct: 2 DKSYKLSINEFADLTNEEFGTSRNRFKA----HICSTEATSFKYENVTAVPSTXDWRKKG 57
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMD 186
AVT +KDQ CG+CWAFSA A+EGI ++ TG L+SLSEQEL+DCD S + GC G
Sbjct: 58 AVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGA--- 114
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 246
+YPY G G CN++K I+GY+DVP NNEK L +AV Q
Sbjct: 115 ----------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQ 158
Query: 247 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGM 305
P++V I FQ YSSG+FTG C T LDH V VGY S++G+ YW++KNSWG WG
Sbjct: 159 PIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTGWGE 218
Query: 306 NGYMHMQRNTGNSLGICGINMLASYPT 332
GY+ MQR+ G+CGI M ASYPT
Sbjct: 219 EGYIRMQRDVTAKEGLCGIAMQASYPT 245
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 138/320 (43%), Positives = 191/320 (59%), Gaps = 24/320 (7%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
++ FE + G+ Y S + + R IF N F+ +HN G+S+F++S+N F D
Sbjct: 28 ELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTD 87
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNA-----SVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
L+++EF+A+F G+ RR A SV + ++ +PA++DW KG VT +K+Q
Sbjct: 88 LSNEEFRATFNGY--------RRLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQ 139
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIK 194
CG+CWAFSA ++EG + + TG LVSLSEQ L+DC + + GC GG MDYA+++VI+
Sbjct: 140 QQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQ 199
Query: 195 NHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGIC 253
N GIDTE YPY+ C + K N TI + DV +E L AV + P+SV I
Sbjct: 200 NRGIDTEASYPYKAIDESC-EFKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAID 258
Query: 254 GSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311
++ +FQ YSSG++ P CST LDH V VGY + NG YW +KNSWG SWG GY+ M
Sbjct: 259 AAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGAPYWKVKNSWGTSWGRKGYIFM 318
Query: 312 QRNTGNSLGICGINMLASYP 331
RN N CGI ASYP
Sbjct: 319 SRNKQNQ---CGIATKASYP 335
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 191/340 (56%), Gaps = 19/340 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L +I +SS+ LN I E + + Q K Y +E+ R K++ DN + H
Sbjct: 7 LGLVAFAISTVSSINLNEV--IEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLKIAGH 64
Query: 64 NNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR----RNASVQSPGNLRD 116
N + G ++ L +N F DL E+ GF + DR + N+
Sbjct: 65 NKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSENVV- 123
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P S+DWRKKG VT VK+Q CG+CW+FSATG++EG + TG LVSLSEQ LIDC R Y
Sbjct: 124 IPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKY 183
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N+GC GGLMD A++++ N G+DTEK YPY + +C N T G+ D+PE +
Sbjct: 184 GNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSG-ATDKGFVDIPEGD 242
Query: 236 EKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE-NGVD 291
E L+ A+ PVS+ I S FQ Y G+F P ST LDH VL VG+ S+ G D
Sbjct: 243 EDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKKGGD 302
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSWG++WG GY+ M RN N+ CG+ ASYP
Sbjct: 303 YWIVKNSWGKTWGDEGYIMMARNKKNN---CGVASSASYP 339
>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
Length = 334
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 144/335 (42%), Positives = 192/335 (57%), Gaps = 17/335 (5%)
Query: 8 LLSILLLSSLPLNYCSDINEL-FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-- 64
L+ I L +L + +L F +W + GK Y S +E+ QR + +N V HN
Sbjct: 4 LIVITALVALASATSISLEDLEFHSWKLKFGKIYKSVEEESQRKNTWLENRKLVLVHNML 63
Query: 65 -NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNAS---VQSPGNLRDVPAS 120
+ G S+ L + FAD+ +QE++ S S + + AS +Q+ G + +P +
Sbjct: 64 ADQGIKSYRLGMTYFADMDNQEYRQSVFKGCLGSFNRTKGHRASTFLLQAGGAV--LPDT 121
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR KG V EVKDQ +CG+CWAFSATG++EG TG LVSLSEQ+L+DC Y N G
Sbjct: 122 VDWRDKGYVAEVKDQKNCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGKYGNMG 181
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
CGGGLMD A++++ N GIDTE+ YPY G C + K T GY D+ +E L
Sbjct: 182 CGGGLMDLAFEYIEDNKGIDTEESYPYEATDGDC-RFKPATVGATCTGYVDINSEDENAL 240
Query: 240 LQAVV-AQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIK 296
+AV P+SV I +FQLY SGI+ P S LDH VL VGY ++N DYW++K
Sbjct: 241 QKAVANIGPISVAIDAGHISFQLYGSGIYNEPNCSSEDLDHGVLAVGYGTDNQQDYWLVK 300
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
NSWG WG GY+ M RN N CGI ASYP
Sbjct: 301 NSWGLDWGDQGYIKMTRNKNNQ---CGIATAASYP 332
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 201/343 (58%), Gaps = 27/343 (7%)
Query: 9 LSILLLSS--LPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM 66
+ + L+++ L L Y ++ F + Q+ K Y S+ ++ R K+++ N FV +HN
Sbjct: 1 MKVFLVAAACLTLVYIAEAASEFTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNER 60
Query: 67 ---GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNL-----RD-- 116
G ++ ++LN AD+ +EF A+FLGF +R A+ + P + +D
Sbjct: 61 YERGEVTYKMALNHLADMHPREFMATFLGF-------NRSLRATNKVPEGIPFRHNKDAV 113
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+ +DWR+KGA++ VKDQ CG+CWAFS+TGA+E + G VSLSEQ LIDC +Y
Sbjct: 114 IQKEVDWRQKGAISPVKDQGHCGSCWAFSSTGALEAHTFLKKGRRVSLSEQNLIDCSLNY 173
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N+GC GGLM+ A+Q+V N GIDTE+ YPY G+ +C +K N T G+ +P +
Sbjct: 174 GNNGCEGGLMEQAFQYVRDNDGIDTEEAYPYEGEDSECRFKK-NNVGATDAGFVTIPSGD 232
Query: 236 EKQLLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDY 292
E+ L++AV Q P+S+ I S +FQ YS G++ P S LDH VL+VGY E Y
Sbjct: 233 EQALMEAVATQGPLSIAIDASNPSFQFYSEGVYYEPECSSAQLDHGVLLVGYGVEKDQKY 292
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTG 335
W++KNSW WG NGY+ M RN N+ CGI AS+P G
Sbjct: 293 WLVKNSWSEQWGENGYIKMARNKDNN---CGIATQASFPIVEG 332
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 184/314 (58%), Gaps = 21/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K Y S E+ R KIF +N + +HN G S+ L +N F DL E
Sbjct: 27 WEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHE 86
Query: 86 FKASFLGFSAASIDHDRRRN--ASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGAC 141
F F G H R+ ++ P N+ D +P ++DWRKKGAVT VKDQ CG+C
Sbjct: 87 FARIFNGH------HGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATG++EG + + G LVSLSEQ L+DC +S+ N+GC GGLM+ A++++ N GIDT
Sbjct: 141 WAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDT 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
EK YPY G+C +K + T GY ++ +E L +AV P+SV I S +F
Sbjct: 201 EKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSF 259
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLYS G++ P S LDH VL+VGY + G YW++KNSW SWG GY+ M R+ N
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNN 319
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 320 Q---CGIASQASYP 330
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 146/347 (42%), Positives = 201/347 (57%), Gaps = 30/347 (8%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M ++A L + + S P + +++ + + GK YS+ +E +RL +E N A +
Sbjct: 1 MKAIAAICLFFVCVYSAP-TFNVELDSHWALFKTTFGKQYSTAEEITRRLA-WEANVAII 58
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR-- 115
QHN ++G ++TL LN +ADLT+ EF G R NAS N R
Sbjct: 59 RQHNLEHDLGLHTYTLGLNNYADLTNAEFNQVMNGL---------RVNASQTKSANRRTY 109
Query: 116 ------DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
++P S+DWR KG VT +KDQ CG+CWAFS+TG++EG + TG LVSLSEQ L
Sbjct: 110 VAPVGVELPTSVDWRTKGYVTPIKDQGQCGSCWAFSSTGSLEGQHFAKTGQLVSLSEQNL 169
Query: 170 IDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
DC + N GC GGLMD A+ ++ +N+GIDTE YPY+ +C+ + + T GY
Sbjct: 170 TDCSQKQGNMGCNGGLMDQAFTYIKENNGIDTESSYPYKAVDEKCHFKAADVG-ATDTGY 228
Query: 229 KDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTG-PCS-TSLDHAVLIVGYD 285
D+ + +E L A+ P+SV I S +FQLY SG + CS T LDH VL VGYD
Sbjct: 229 TDIAQQDENALQSAIATVGPISVAIDASHSSFQLYRSGAYNERACSATQLDHGVLAVGYD 288
Query: 286 SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
SE+G DY+I+KNSWG SWG GY+ M RN N CGI +++YPT
Sbjct: 289 SEDGKDYYIVKNSWGTSWGQKGYIWMTRNKNNQ---CGIATMSTYPT 332
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 184/314 (58%), Gaps = 21/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K Y S E+ R KIF +N + +HN G S+ L +N F DL E
Sbjct: 27 WEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHE 86
Query: 86 FKASFLGFSAASIDHDRRRN--ASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGAC 141
F F G H R+ +S P N+ D +P +DWRKKGAVT VKDQ CG+C
Sbjct: 87 FARIFNGH------HGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATG++EG + + G LVSLSEQ L+DC +S+ N+GC GGLM+ A++++ N GIDT
Sbjct: 141 WAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDT 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
EK YPY+ G+C +K + T GY ++ +E L +AV P+SV I S +F
Sbjct: 201 EKSYPYKAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSF 259
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLYS G++ P S LDH VL+VGY + G YW++KNSW SWG GY+ M R+ N
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNN 319
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 320 Q---CGIASQASYP 330
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 140/338 (41%), Positives = 197/338 (58%), Gaps = 21/338 (6%)
Query: 5 AFFLLSILL-LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
+ FL ++ L ++S + +++ + W +GK Y+ ++E +R ++E N + QH
Sbjct: 4 SLFLAALCLGIASAAPRFNENLDARWTRWKAANGKLYNKDEEVWRR-AVWEKNMKMIDQH 62
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N + G SF L++NAF DLT++EFK G I + R N P + P+S
Sbjct: 63 NEEYSQGKHSFILAMNAFGDLTNEEFKQVMNGLK---IQNPREGNMFQLLP--FAETPSS 117
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+KG VT VKDQ CG+CWAFSATGA+EG TG LVSLSEQ L+DC R+ N+G
Sbjct: 118 VDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAG 177
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A+++V N G+D+E+ YPY Q G+C K K + G+ D+ ++ E +
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRC-KYKPEQSAANDTGFADIHQDEESLM 236
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYW 293
L P+SV I S F+ Y GI+ P S LDH VL+VGY S+ +YW
Sbjct: 237 LSVATVGPISVAIDASLDTFRFYYKGIYYDPNCSSEDLDHGVLVVGYGSDEREAENKNYW 296
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
I+KNSWG WGM GY+ M ++ GN CGI AS+P
Sbjct: 297 IVKNSWGTQWGMQGYILMAKDRGNH---CGIATSASFP 331
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 248 bits (632), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 140/340 (41%), Positives = 200/340 (58%), Gaps = 23/340 (6%)
Query: 5 AFFLLSILL-LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
+ FL ++ L ++S ++EL+ W HGK Y ++E +R ++++ N + QH
Sbjct: 4 SLFLAALCLGIASAAPQLNQSLDELWSQWKATHGKLYGMDEEGWRR-EVWKKNMKMIRQH 62
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N + G SFT+++N F D+T++EFK G ++ Q+P +P+S
Sbjct: 63 NWEHSQGKHSFTVAMNGFGDMTNEEFKQVMNGLQM----QKHKKGKMFQAP-LFAKIPSS 117
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+KG VT VKDQ CG+CWAFSATGA+EG TG LVSLSEQ L+DC ++ N G
Sbjct: 118 VDWREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQAEGNEG 177
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLM+ A+Q+V N G+D+E+ YPY Q C K K G+ D+P+ EK L
Sbjct: 178 CNGGLMNNAFQYVKDNGGLDSEESYPYHAQDESC-KYKPQDSAANDTGFFDIPQ-QEKAL 235
Query: 240 LQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVD----Y 292
+ AV + P+SVGI S FQ Y GI+ P S LDH VL++GY +E G Y
Sbjct: 236 MVAVATKGPISVGIDASHFTFQFYHEGIYYDPDCSSEDLDHGVLVIGYGTEIGQSINKTY 295
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WI+KNSWG +WG++GY+ M ++ N CGI +AS+P
Sbjct: 296 WIVKNSWGANWGIDGYIKMAKDRKNH---CGIATMASFPV 332
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 115/219 (52%), Positives = 149/219 (68%), Gaps = 4/219 (1%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RS 175
+P +IDWR KGAVT +KDQ CG CWAFSA A EGI KI TG LVSL+EQEL+DCD
Sbjct: 17 LPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHD 76
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
+ GC GGLMD A++F+IKN G+ TE YPY G+C + + TI GY+DVP N+
Sbjct: 77 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATIKGYEDVPAND 134
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWI 294
E L++AV QPVSV + G + FQ YS G+ TG C T LDH + +GY + +G YW+
Sbjct: 135 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWL 194
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
+KNSWG +WG NGY+ M+++ + G+CG+ M SYPTK
Sbjct: 195 MKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPTK 233
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 247 bits (631), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 184/314 (58%), Gaps = 21/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K Y S E+ R KIF +N + +HN G S+ L +N F DL E
Sbjct: 27 WEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHE 86
Query: 86 FKASFLGFSAASIDHDRRRN--ASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGAC 141
F F G H R+ ++ P N+ D +P +DWRKKGAVT VKDQ CG+C
Sbjct: 87 FARIFNGH------HGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATG++EG + + G LVSLSEQ L+DC +S+ N+GC GGLM+ A++++ +N GIDT
Sbjct: 141 WAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKENDGIDT 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
EK YPY G+C +K + T GY ++ +E L +AV P+SV I S +F
Sbjct: 201 EKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSF 259
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLYS G++ P S LDH VL+VGY + G YW++KNSW SWG GY+ M R+ N
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNN 319
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 320 Q---CGIASQASYP 330
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 187/319 (58%), Gaps = 18/319 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
+N+ + +W H K Y ++E +R+ I+E N + HN ++G S+ L +N F D+
Sbjct: 24 LNDHWLSWKSWHSKKYHEKEEGWRRM-IWEKNLKMIELHNLDHSLGKHSYRLGMNHFGDM 82
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T++EF+ GF + R+ S N P S+DWR+KG VT VKDQ CG+C
Sbjct: 83 TNEEFRQVMNGFKQSR--SQRKYKGSQFLEPNFLQAPKSVDWREKGYVTPVKDQGQCGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATGA+EG + TG LVSLSEQ LIDC N GC GGLMD A+Q++ N+GID+
Sbjct: 141 WAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGPEGNQGCNGGLMDQAFQYIKDNNGIDS 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAF 259
E+ YPY G+ + K + G+ D+PE E+ L++AV A P+SV I S +F
Sbjct: 201 EESYPYIGKDDEDCLYKPEYNSANDTGFVDIPEGRERALMKAVAAVGPISVAIDASHTSF 260
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGY-----DSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
Q Y SG++ P S LDH VL+VGY D +N YWI+KNSW WG GY+HM
Sbjct: 261 QFYESGVYYEPQCNSEELDHGVLVVGYGYEGTDDDNKKRYWIVKNSWSEKWGDQGYIHMA 320
Query: 313 RNTGNSLGICGINMLASYP 331
++ N+ CGI ASYP
Sbjct: 321 KDRSNN---CGIASAASYP 336
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 132/324 (40%), Positives = 184/324 (56%), Gaps = 22/324 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSL--NAFADLTHQEF 86
F+ W +HG+AY++ E+ +RL+++ N ++ N + T L A+ DLT EF
Sbjct: 53 FQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTDLTADEF 112
Query: 87 KASFLGFSAASIDHDRR---------RNASVQSPG-------NLRDVPASIDWRKKGAVT 130
A + S HD R +V + G + PAS+DWR KGAVT
Sbjct: 113 TAMYTSPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASVDWRAKGAVT 172
Query: 131 EVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQ 190
EVK+Q CG+CWAFS +EGI++I TG+L+SLSEQEL+DCD + + GC GG+ +A +
Sbjct: 173 EVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCD-TLDYGCDGGVSYHALE 231
Query: 191 FVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSV 250
++ N GI TE DYPY G+ G C KL H I G+ V +E L AV AQPV+V
Sbjct: 232 WIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEPSLANAVAAQPVAV 291
Query: 251 GICGSERAFQLYSSGIFTGPCSTSLDHAVLIV--GYDSENGVDYWIIKNSWGRSWGMNGY 308
I FQ Y G++ GPC T L+H V +V G + +G YWI+KNSWG+ WG GY
Sbjct: 292 SIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSWGKKWGDGGY 351
Query: 309 MHMQRNT-GNSLGICGINMLASYP 331
M+++ G G+CGI + S+P
Sbjct: 352 FRMKKDVAGKPEGLCGIAIRPSFP 375
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 176/319 (55%), Gaps = 20/319 (6%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQE 85
++FE W + GK Y EK+ R +F DN F+ + + L +N FADLT+ E
Sbjct: 38 TQMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDE 97
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
F ++ G R + +P IDWR KGAVT+VKDQ +CG+CWAF+
Sbjct: 98 FVSTHTGAKPPCPKDAPRGVDPIW-------LPCCIDWRYKGAVTDVKDQGACGSCWAFA 150
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A AIEG+ +I TG L LSEQEL+DCD +SGC GG D A++ V GI E Y
Sbjct: 151 AVAAIEGLTQIRTGKLTPLSEQELVDCDTG-SSGCAGGHTDRAFELVAAKGGITAESGYR 209
Query: 206 YRGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
Y G G+C L H I G++ VP +E+QL AV QPV+ I S AFQ Y S
Sbjct: 210 YEGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGS 269
Query: 265 GIFTGPCST---------SLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQR 313
G+F GPC + + +HAV +VGY D +G YW+ KNSWG++WG GY+ +++
Sbjct: 270 GVFPGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILLEK 329
Query: 314 NTGNSLGICGINMLASYPT 332
+ + G CG+ + YPT
Sbjct: 330 DVASPHGTCGVAVSPFYPT 348
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 189/320 (59%), Gaps = 18/320 (5%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG-NSSFTLSLNAFADL 81
S + E +E W HG+ Y EK +R ++F N F+ N G S L+ N FADL
Sbjct: 43 SAMRERYEKWAADHGRTYKDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADL 102
Query: 82 THQEFKASFLG--FSAASIDHDRRRNASVQSPGNLR--DVPASIDWRKKGAVTEVKDQAS 137
T++EF A + G FS I S GN+R DVPA+I+WR +GAVT+VK+Q
Sbjct: 103 TNEEF-AEYYGRPFSTPVI------GGSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKD 155
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYQFVIKNH 196
C +CWAFSA A+EGI++I + +LV+LS Q+L+DC N+ GC G MD A++++ N
Sbjct: 156 CASCWAFSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNG 215
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
GI E DYPY +A + +I G++ VP NNE LL AV QPVSV + G
Sbjct: 216 GIAAESDYPYEDRALGTCRASGKPVAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVG 275
Query: 257 RAFQLYSSGIFTG----PCSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHM 311
+ Q +SSG+F C+T L+HA+ VGY + E+G YW++KNSWG WG GYM +
Sbjct: 276 KVSQFFSSGVFGAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKI 335
Query: 312 QRNTGNSLGICGINMLASYP 331
R+ ++ G+CG+ M SYP
Sbjct: 336 ARDVASNTGLCGLAMQPSYP 355
>gi|47213723|emb|CAF95154.1| unnamed protein product [Tetraodon nigroviridis]
Length = 334
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 137/334 (41%), Positives = 189/334 (56%), Gaps = 14/334 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L LLS L S+ + + SD+N +E W K H K Y SE E++ R +++E N + H
Sbjct: 9 LGALLLSWLCASAAAM-FDSDLNVHWELWKKTHDKMYQSEVEERSRRELWESNLRLINMH 67
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N +MG ++ L +N D + +E + + S D +R + D+PA+
Sbjct: 68 NLEASMGLHTYQLGMNHMGDWSQEEIVQAGTKLTPPS---DHQRGLAYFDASGRADLPAT 124
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR KG VT VK Q SCG+CWAFSA GA+EG+ TG LV LS Q L+DC R Y N G
Sbjct: 125 VDWRNKGLVTSVKMQGSCGSCWAFSAAGALEGLLAKTTGKLVDLSPQNLVDCTRKYGNHG 184
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GG M + +Q+VI NHGID+E YPY GQ G C R Y + + +E L
Sbjct: 185 CNGGYMHHTFQYVIDNHGIDSEASYPYTGQEGVCRYNPAFR-AANCSHYWFLRQGDEGAL 243
Query: 240 LQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKN 297
+AV P+SVGI + F Y SG++ P CS +++HAVL VGY ++NG DYW++KN
Sbjct: 244 QEAVATIGPISVGIDATRHQFVYYRSGVYNDPGCSQTVNHAVLAVGYGTDNGQDYWLVKN 303
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
SWG +G +GY+ M RN + CGI +P
Sbjct: 304 SWGVGFGEDGYIRMARNKNDQ---CGIAQFPCFP 334
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 183/314 (58%), Gaps = 21/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K Y S E+ R KIF +N + +HN G S+ L +N F DL E
Sbjct: 27 WEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHE 86
Query: 86 FKASFLGFSAASIDHDRRRN--ASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGAC 141
F F G H R+ +S P N+ D +P +DWRKKGAVT VKDQ CG+C
Sbjct: 87 FARIFNGH------HGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATG++EG + + G LVSLSEQ L+DC +S+ N+GC GGLM+ A++++ N GIDT
Sbjct: 141 WAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDT 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
EK YPY G+C +K + T GY ++ +E L +AV P+SV I S +F
Sbjct: 201 EKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSF 259
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLYS G++ P S LDH VL+VGY + G YW++KNSW SWG GY+ M R+ N
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNN 319
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 320 Q---CGIASQASYP 330
>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 290
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 169/248 (68%), Gaps = 5/248 (2%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+++ ++E W ++ K Y+ EK++R KIF+DN FV +HN++ + +F + L FADLT
Sbjct: 38 TEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLT 97
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++EF+A +L + + G++ +P +DWR GAV VKDQ +CG+CW
Sbjct: 98 NEEFRAIYLRKKMERTKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCGSCW 155
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA GA+EGIN+I TG L+SLSEQEL+DCDR + N+GC GG+M+YA++F++KN GI+T+
Sbjct: 156 AFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETD 215
Query: 202 KDYPYRG-QAGQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
+DYPY G CN K N +VTIDGY+DVP ++EK L +AV QPVSV I S +AF
Sbjct: 216 QDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAF 275
Query: 260 QLYSSGIF 267
QLY S F
Sbjct: 276 QLYKSVNF 283
>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 183/316 (57%), Gaps = 14/316 (4%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLT 82
N+ +E W QHGK Y +E E+ R I E N + +HN ++G S+TL++N F D+
Sbjct: 21 NKEWEMWKLQHGKQYETEAEEYSRRFILEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMH 80
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
H+EF +G I + V + +P S+DWR V+EVKDQ CG+CW
Sbjct: 81 HEEFHQRIMG-GCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCW 139
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFS TG++EG + TG LV LSEQ+L+DC + + N GCGGGLMD A+Q++ N G+DTE
Sbjct: 140 AFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTE 199
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQ 260
+ YPY + K + T+ GYKDV NE L +AV PVSV I +FQ
Sbjct: 200 ESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHESFQ 259
Query: 261 LYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVD---YWIIKNSWGRSWGMNGYMHMQRNT 315
YSSG++ P CST LDH VL VGY + N +WI+KNSWG SWG GY+ M RN
Sbjct: 260 FYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNK 319
Query: 316 GNSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 320 NNQ---CGIATSASYP 332
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 183/314 (58%), Gaps = 21/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K Y S E+ R KIF +N + +HN G S+ L +N F DL E
Sbjct: 27 WEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHE 86
Query: 86 FKASFLGFSAASIDHDRRRN--ASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGAC 141
F F G H R+ +S P N+ D +P +DWRKKGAVT VKDQ CG+C
Sbjct: 87 FARIFNGH------HGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATG++EG + + G LVSLSEQ L+DC +S+ N+GC GGLM+ A++++ N GIDT
Sbjct: 141 WAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDT 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
EK YPY G+C +K + T GY ++ +E L +AV P+SV I S +F
Sbjct: 201 EKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSF 259
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLYS G++ P S LDH VL+VGY + G YW++KNSW SWG GY+ M R+ N
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNN 319
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 320 Q---CGIASQASYP 330
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 143/330 (43%), Positives = 187/330 (56%), Gaps = 19/330 (5%)
Query: 11 ILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MG 67
L+LS ++ + + W HGK Y+S E+ R KIF++N +TQHN G
Sbjct: 5 FLILSLGAFVSGAEFSSEWLKWKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQG 64
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKG 127
++ L +N F DL H EF GF D V + VP+ +W KG
Sbjct: 65 FHTYILGMNHFGDLLHSEFLERSNGFQGGVSGGD------VFTFDTNAPVPSYANWTAKG 118
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMD 186
AVT VKDQ CG+CWAFSATG++EG + L+SLSEQ+L+DC N GCGGGLMD
Sbjct: 119 AVTPVKDQGKCGSCWAFSATGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMD 178
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-A 245
A+++ I N GI EK YPY + C K K + + TI +KDV +E QL AV
Sbjct: 179 NAFKYFIANKGIANEKSYPYTAKDNDC-KYKKSMSVATISSFKDVKHKDEDQLKMAVANV 237
Query: 246 QPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGY--DSENGVDYWIIKNSWGR 301
PVSV I S FQ Y SG++ CS+ LDH VL VGY D ++G+D+W++KNSW
Sbjct: 238 GPVSVAIDASSSKFQFYESGVYYDENCSSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAA 297
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYP 331
SWG+NGY+ M RN N+ CGI +ASYP
Sbjct: 298 SWGLNGYIKMARNKDNN---CGIATMASYP 324
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 188/323 (58%), Gaps = 26/323 (8%)
Query: 30 ETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS-------SFTLSLNAFADLT 82
E+W +HG+ Y+ +EK +RL+IF N + N+ ++ S L+ N FADLT
Sbjct: 44 ESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADLT 103
Query: 83 HQEFKASFLGFSAASIDHD------RRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
+EF+A+ G + R N S+Q+ D S+DWR GAVT VKDQ
Sbjct: 104 DEEFRAARTGLRRPAAVAGAVGGGFRYENFSLQA-----DAAGSMDWRAMGAVTGVKDQG 158
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKN 195
SCG CWAFSA A+EG+ KI TG LVSLSEQ+L+DCD + GC GGLMD A+Q++ +
Sbjct: 159 SCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQ 218
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
G+ +E YPY G+ G + + +I G++DVP NNE L+ AV QPVSV I G
Sbjct: 219 GGLASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVAINGG 278
Query: 256 ERAFQLYSSGIFTGPC-----STSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYM 309
+ F+ Y G+ ST LDHA+ VGY + +G YW++KNSWG WG +GY+
Sbjct: 279 DYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESGYV 338
Query: 310 HMQRNTGNSLGICGINMLASYPT 332
++R + G+CG+ LASYP
Sbjct: 339 RIRRGS-RGEGVCGLAKLASYPV 360
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 182/318 (57%), Gaps = 8/318 (2%)
Query: 18 PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN-SSFTLSLN 76
PL Y + F W HG +S E +RL+ + N ++ +HN + TL N
Sbjct: 21 PLEYEHE----FSAWMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHN 76
Query: 77 AFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQA 136
AF+ ++ EFK G ++R + V + +VP+++DW KG VT VK+Q
Sbjct: 77 AFSHMSFDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQG 136
Query: 137 SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNH 196
CG+CWAFS TGA+EG + +G L SLSEQEL+DCD + + GC GGLMD+A+Q++ +
Sbjct: 137 MCGSCWAFSTTGAVEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHG 196
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
GI +E DY Y+ +A C + +V + G++DV +E L AV QPVSV I +
Sbjct: 197 GICSEDDYEYKAKAQVCRECD---SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQ 253
Query: 257 RAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
+AFQ Y SG+F C T LDH VL VGY ++NG +W +KNSWG SWG GY+ + R
Sbjct: 254 KAFQFYKSGVFNLTCGTRLDHGVLAVGYGNDNGHKFWKVKNSWGASWGEQGYIRLAREEN 313
Query: 317 NSLGICGINMLASYPTKT 334
G CGI + SYP T
Sbjct: 314 GPAGQCGIASVPSYPFAT 331
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/312 (43%), Positives = 182/312 (58%), Gaps = 17/312 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K Y S E+ R KIF +N + +HN G S+ L +N F DL E
Sbjct: 27 WEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHE 86
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGACWA 143
F F G + ++ P N+ D +P ++DWRKKGAVT VKDQ CG+CWA
Sbjct: 87 FARIFNGHRGTR----KTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWA 142
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEK 202
FSATG++EG + + G LVSLSEQ L+DC +S+ N+GC GGLM+ A++++ N GIDTEK
Sbjct: 143 FSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEK 202
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQL 261
YPY G+C +K + T GY ++ +E L +AV P+SV I S +FQL
Sbjct: 203 SYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQL 261
Query: 262 YSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
YS G++ P S LDH VL+VGY + G YW++KNSW SWG GY+ M R+ N
Sbjct: 262 YSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQ- 320
Query: 320 GICGINMLASYP 331
CGI ASYP
Sbjct: 321 --CGIASQASYP 330
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 198/337 (58%), Gaps = 17/337 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ + LL +L SS D ++ ++ W K +GK Y+ E E+ R I+E N +V
Sbjct: 10 MKWLLLVLLGCSSAMAQLHKDPTLDHHWDLWKKTYGKQYTEENEEVTRRFIWEKNLKYVM 69
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
HN +MG S+ L +N AD+T +E L S+ + +RN + +S N + +P
Sbjct: 70 LHNLEHSMGMHSYDLGMNHLADMTSEEV---MLLMSSLRVPSQWQRNVTFKSNPN-QKLP 125
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD--RSY 176
S+DWR KG VTEVK Q SCG+CWAFSA GA+E K+ TG LVSLS Q L+DC +
Sbjct: 126 DSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCSTGKYS 185
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GG M A+Q++I N+GID+E YPY+ G+C NR T Y ++P NE
Sbjct: 186 NKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNR-AATCSKYVELPFGNE 244
Query: 237 KQLLQAVVAQ-PVSVGICGSERAFQLYSSGI-FTGPCSTSLDHAVLIVGYDSENGVDYWI 294
+ L +AV + PVSV I S +F LY SG+ + C+ +++H VL VGY + NG DYW+
Sbjct: 245 EALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVGYGNYNGKDYWL 304
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG +G GY+ M RN+GN CGI SYP
Sbjct: 305 VKNSWGLHFGEQGYIRMARNSGNH---CGIASYPSYP 338
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/313 (43%), Positives = 187/313 (59%), Gaps = 17/313 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E+W HGK+Y S E++ RLKI +N +++HN G S+ + +N + DL H E
Sbjct: 27 WESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDLLHHE 86
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
F A G+ ++ P +P +DWR+ GAVT VK+Q CG+CWAFS
Sbjct: 87 FVAMVNGYEYV----NKTSLGGSFIPSKNVKLPTHVDWREDGAVTPVKNQGQCGSCWAFS 142
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
+TG++EG TG L+ LSEQ L+DC R Y N+GC GGLMD+A+ ++ N GIDTE Y
Sbjct: 143 STGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGIDTEGSY 202
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLYS 263
PY G G+C+ + I G+ DV + +E++LL+AV + PVSV I S +FQ YS
Sbjct: 203 PYEGVGGRCHYDPSKKGSSDI-GFVDVKKGSEEELLKAVASVGPVSVAIDASHMSFQFYS 261
Query: 264 SGI-FTGPCS-TSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
G+ F CS +LDH VL+VGY D +G DYW++KNSW +WG GY+ M RN N
Sbjct: 262 HGVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKMARNKKN-- 319
Query: 320 GICGINMLASYPT 332
+CGI ASYP
Sbjct: 320 -MCGIASSASYPV 331
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 184/314 (58%), Gaps = 21/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K+Y S E+ R KIF +N + +HN G S+ L +N F DL E
Sbjct: 27 WEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHE 86
Query: 86 FKASFLGFSAASIDHDRRRN--ASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGAC 141
F F G H R+ ++ P N+ D +P +DWRKKGAVT VKDQ CG+C
Sbjct: 87 FARIFNGH------HGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATG++EG + + G LVSLSEQ L+DC +S+ N+GC GGLM+ A++++ N GIDT
Sbjct: 141 WAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDT 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
EK YPY G+C +K + T GY ++ +E L +AV P+SV I S +F
Sbjct: 201 EKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSF 259
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLYS G++ P S LDH VL+VGY + G YW++KNSW SWG GY+ M R+ N
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNN 319
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 320 Q---CGIASQASYP 330
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 186/321 (57%), Gaps = 17/321 (5%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN-SSFTLSLNAFADLTHQE 85
E F+ W ++ + Y++ +E QQR ++ +N F+ N + SS+ L N F DLT +E
Sbjct: 38 ERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEE 97
Query: 86 FKASFL--------GFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQAS 137
FK ++L A A + + N + P S+DWR KGAVT VK+Q
Sbjct: 98 FKDTYLMKLDEQPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQ 157
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYQFVIKNH 196
CG+CWAF+ +IEG+++I TG LVSLSEQE++DCDR N GC GG A ++V +N
Sbjct: 158 CGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNG 217
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
G+ TE DYPY G QC KL H I GY+ V NE +L +AV +PV+V I S
Sbjct: 218 GLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDAS- 276
Query: 257 RAFQLYSSGIFTGPC-STSLDHAVLIVGYDSENGV-----DYWIIKNSWGRSWGMNGYMH 310
RAFQ Y G+F+GPC +T+++HAV +VGY S YWI+KNSWG+ WG NGY+
Sbjct: 277 RAFQFYKRGVFSGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVR 336
Query: 311 MQRNTGNSLGICGINMLASYP 331
M R G+C I + YP
Sbjct: 337 MARRVRAREGMCAIAIEPYYP 357
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 176/319 (55%), Gaps = 20/319 (6%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQE 85
++FE W + GK Y EK+ R +F DN F+ + + L +N FADLT+ E
Sbjct: 16 TQMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDE 75
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
F ++ G R + +P IDWR KGAVT+VKDQ +CG+CWAF+
Sbjct: 76 FVSTHTGAKPPCPKDAPRGVDPIW-------LPCCIDWRYKGAVTDVKDQGACGSCWAFA 128
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYP 205
A AIEG+ +I TG L LSEQEL+DCD +SGC GG D A++ V GI E Y
Sbjct: 129 AVAAIEGLTQIRTGKLTPLSEQELVDCDTG-SSGCAGGHTDRAFELVAAKGGITAESGYR 187
Query: 206 YRGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
Y G G+C L H I G++ VP +E+QL AV QPV+ I S AFQ Y S
Sbjct: 188 YEGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGS 247
Query: 265 GIFTGPCST---------SLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQR 313
G+F GPC + + +HAV +VGY D +G YW+ KNSWG++WG GY+ +++
Sbjct: 248 GVFPGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILLEK 307
Query: 314 NTGNSLGICGINMLASYPT 332
+ + G CG+ + YPT
Sbjct: 308 DVASPHGTCGVAVSPFYPT 326
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 195/341 (57%), Gaps = 22/341 (6%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+LA FL + SL S +++ ++ W H K Y ++E +R+ I+E N +
Sbjct: 7 ALALFLEACFAAPSLD----SALDDHWQAWKTWHSKKYHQQEEGWRRM-IWEKNLKMIQL 61
Query: 63 HN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
HN ++G S+ L +N F D+T++EF+ G+ + + + R + P N VP
Sbjct: 62 HNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGYKHSKTEK-KYRGSEFLEP-NFLVVPK 119
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
S+DWR+KG VT VKDQ CG+CWAFS TG++EG + TG LVSLSEQ L+DC R N
Sbjct: 120 SVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGNQ 179
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLMD A++++ N GID+E+ YPY + + K + G+ DVPE +E+
Sbjct: 180 GCNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHERA 239
Query: 239 LLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGY-----DSENGV 290
L++AV A PVSV I S FQ Y SGI+ P S LDH VL+VGY D +N
Sbjct: 240 LMKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDCSSEELDHGVLVVGYGFEGTDDDNKK 299
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSW WG GY+ M ++ N CGI ASYP
Sbjct: 300 KYWIVKNSWSDKWGDKGYILMAKDRNNH---CGIATAASYP 337
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 142/332 (42%), Positives = 193/332 (58%), Gaps = 19/332 (5%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN--- 64
+L + L++ + +D E + K + K+Y S E+Q R +IF++N + HN
Sbjct: 3 VLIFIFLATAAVQALNDKEEWVQFKVKNN-KSYKSYVEEQTRFRIFQENLRKIENHNEKY 61
Query: 65 NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWR 124
N G S+F + F DLT +EF L S + R + LRD+P++ DWR
Sbjct: 62 NNGESTFKFGVTKFTDLTEKEF----LDLLVLSKNARPNRTHATHLLAPLRDLPSAFDWR 117
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 184
KGAVTEVKDQ CG+CW FS TG++E + + TG+LVSLSEQ L+DC + GCGGG
Sbjct: 118 DKGAVTEVKDQGMCGSCWTFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKDTCYGCGGGW 177
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 244
MD A +++ K GI +EKDYPY G C + +++ I + + +N+E+ L AV
Sbjct: 178 MDKALEYIEKG-GIMSEKDYPYEGVDDNC-RFDISKVAAKISNFTYIKKNDEEDLKNAVA 235
Query: 245 AQ-PVSVGICGSERAFQLYSSGIFTG-PCST---SLDHAVLIVGYDSENGVDYWIIKNSW 299
A+ P+SV I S FQLY SGI CS SL+H VL+VGY +ENG DYWIIKNSW
Sbjct: 236 AKGPISVAIDASA-TFQLYVSGILDDTECSNEFDSLNHGVLVVGYGTENGKDYWIIKNSW 294
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
G +WGM+GY+ M RN N CGI YP
Sbjct: 295 GVNWGMDGYIRMSRNKNNQ---CGITTDGVYP 323
>gi|340368358|ref|XP_003382719.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 329
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 192/311 (61%), Gaps = 14/311 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAFADLTHQEFK 87
F+ W ++ KAY +++ + R I+E N FV HN N FT+++N FADL EF
Sbjct: 23 FQDWKVKYNKAYETKETELARQVIWESNKKFVENHNANSDKFGFTVAMNEFADLGAGEFA 82
Query: 88 ASFLGF--SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
+ G S ++ +V+S L D S+DWRK GAVT VK+Q CGACWAFS
Sbjct: 83 NIYNGIIPHPPSYNNTNTFKRTVRSTFALAD---SVDWRKSGAVTGVKNQGKCGACWAFS 139
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
ATGA+EG + I TG+L+SLSEQ+L+DC S+ N+GC GGLMD A++++ G TE+ Y
Sbjct: 140 ATGALEGQHFINTGTLISLSEQQLMDCSSSFGNNGCKGGLMDNAFRYLETVAGDMTEEAY 199
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLYS 263
PY + G C + + V YKD+PE +E L +AV P+SV I +FQLY
Sbjct: 200 PYLAEVGTC-RYNSSEAKVKNTVYKDIPEGDEDALQEAVATIGPISVSINSEHSSFQLYD 258
Query: 264 SGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
G++ P CS+S LDH VL++GY + + DYW++KNSWG +WGM+GY+ M RN N+
Sbjct: 259 QGVYYEPTCSSSKLDHGVLVIGYGTSDNNDYWLVKNSWGTNWGMDGYIMMSRNKENN--- 315
Query: 322 CGINMLASYPT 332
CGI ASYPT
Sbjct: 316 CGIATRASYPT 326
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 200/335 (59%), Gaps = 16/335 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
LA+ LL+ ++ P++ ++ + W K +GK Y + E+ R I+E N FVT H
Sbjct: 1 LAWALLACSYAAA-PVDRDPALDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLH 59
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N +MG S+ L +N D+T +E + S+ + RN + +S N + +P S
Sbjct: 60 NLEHSMGMHSYDLGMNHLGDMTSEEVISLM---SSLRVPSQWPRNVTYKSNSNQK-LPDS 115
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNS 178
+DWR+KG VT+VK Q +CGACWAFSA GA+E K+ TG LVSLS Q L+DC ++ N
Sbjct: 116 VDWREKGCVTKVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNK 175
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG M A+Q++I N+GID+E YPY+ G+C NR T Y ++P +E
Sbjct: 176 GCNGGFMTEAFQYIIDNNGIDSEASYPYKATDGKCRYDSKNR-AATCSKYTELPSGSEDD 234
Query: 239 LLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIK 296
L +AV + PVSV I +F LY SG++ P C+ +++H VL+VGY + NG DYW++K
Sbjct: 235 LKEAVANKGPVSVAIDARHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVK 294
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
NSWG ++G GY+ M RN+GN CGI SYP
Sbjct: 295 NSWGLNFGDQGYIRMARNSGNH---CGIASYPSYP 326
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 202/340 (59%), Gaps = 19/340 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDI--NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ F LL L +S++ DI ++ + +W QHGK+Y + E +R+ I+E+N +
Sbjct: 1 MMFALLITLCISAVFTAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
QHN ++GN +F + +N F D+T++EF+ + G+ D +R ++ + P
Sbjct: 60 QHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQ---DPNRTSKGALFMEPSFFAAP 116
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
+DWR++G VT VKDQ CG+CW+FS+TGA+EG TG L+S+SEQ L+DC R N
Sbjct: 117 QQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGN 176
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GG+MD A+Q+V +N G+D+E+ YPY + + ++ I G+ D+P NE
Sbjct: 177 QGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNEL 236
Query: 238 QLLQAVVA-QPVSVGICGSERAFQLYSSGI-FTGPCSTSLDHAVLIVGYDSEN----GVD 291
L+ AV A PVSV I S ++ Q Y SGI + C++ LDHAVL+VGY + G
Sbjct: 237 ALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNR 296
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSW WG GY++M ++ N CGI +ASYP
Sbjct: 297 YWIVKNSWSDKWGDKGYIYMAKDKNNH---CGIATMASYP 333
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 200/335 (59%), Gaps = 16/335 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
LA+ LL+ ++ P++ ++ + W K +GK Y + E+ R I+E N FVT H
Sbjct: 13 LAWALLACSYAAA-PVDRDPALDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLH 71
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N +MG S+ L +N D+T +E + S+ + RN + +S N + +P S
Sbjct: 72 NLEHSMGMHSYDLGMNHLGDMTSEEVISLM---SSLRVPSQWPRNVTYKSNSNQK-LPDS 127
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNS 178
+DWR+KG VT+VK Q +CGACWAFSA GA+E K+ TG LVSLS Q L+DC ++ N
Sbjct: 128 VDWREKGCVTKVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNK 187
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG M A+Q++I N+GID+E YPY+ G+C NR T Y ++P +E
Sbjct: 188 GCNGGFMTEAFQYIIDNNGIDSEASYPYKATDGKCRYDSKNR-AATCSKYTELPSGSEDD 246
Query: 239 LLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIK 296
L +AV + PVSV I +F LY SG++ P C+ +++H VL+VGY + NG DYW++K
Sbjct: 247 LKEAVANKGPVSVAIDARHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVK 306
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
NSWG ++G GY+ M RN+GN CGI SYP
Sbjct: 307 NSWGLNFGDQGYIRMARNSGNH---CGIASYPSYP 338
>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
Length = 333
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 135/319 (42%), Positives = 186/319 (58%), Gaps = 22/319 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
N + W H + Y + +E+ +R ++E N + HN + G FT+ +NAF D+
Sbjct: 25 FNAQWHKWKSTHRRLYDTNEEEWRRA-VWEKNMKMIELHNGEYSEGKHGFTMEMNAFGDM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T++EF+ G+ H + R + + +P S+DWR+KG VT VK+Q CG+C
Sbjct: 84 TNEEFRQLVNGYK-----HQKHRKGKLFQEPLMLQLPKSVDWREKGCVTPVKNQGQCGSC 138
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSA GA+EG + TG LVSLSEQ L+DC R N GC GGLMD+A+Q+V+ N G+D+
Sbjct: 139 WAFSACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGGLMDFAFQYVLNNKGLDS 198
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
E+ YPY + G C K K GY D+P+ EK L++AV P++V I S +F
Sbjct: 199 EESYPYEAKDGTC-KYKPEFAAANDTGYVDIPQ-LEKALMKAVATVGPIAVAIDASHPSF 256
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYWIIKNSWGRSWGMNGYMHMQR 313
Q YSSGI+ P S LDH VL++GY E N YWI+KNSWG WGM G+ H+ +
Sbjct: 257 QFYSSGIYFEPNCSSKDLDHGVLVIGYGFEGTDSNKKKYWIVKNSWGTGWGMGGFFHIAK 316
Query: 314 NTGNSLGICGINMLASYPT 332
+ N CGI ASYPT
Sbjct: 317 DKNNH---CGIATAASYPT 332
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 189/315 (60%), Gaps = 13/315 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
+L++ + H + Y E E+ QR ++F +N + HN++ G S + + +N FAD+
Sbjct: 39 FEKLWQDFKTVHERTYG-ETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADM 97
Query: 82 THQEFKASFLGFSAASIDHDRRR-NASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
EF + GF + R +A+ SP VPA +DWRK+G VT VK+Q CG+
Sbjct: 98 EANEFASIMNGFRMNNRTEVRDHLHANYISPAIPVSVPAEVDWRKEGYVTPVKNQGQCGS 157
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGID 199
CWAFS TG++EG + TG LVSLSEQ L+DC SY N GC GG++DYA+Q++ N G D
Sbjct: 158 CWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDD 217
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERA 258
TE YPY G C + + T GY D+P+ +E ++ +AV + PVSV I S +
Sbjct: 218 TEACYPYEAVDGTCRFKSVCVG-ATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSS 276
Query: 259 FQLYSSGIFT-GPCS-TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
FQ+Y SGI+ CS LDHAVL+VGY +E G DYW++KNSWG +WG GY+ M RN
Sbjct: 277 FQMYQSGIYVEQECSPKQLDHAVLVVGYGTEQGQDYWLVKNSWGTTWGDEGYIKMARNMD 336
Query: 317 NSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 337 NQ---CGIASQASYP 348
>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
Length = 327
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 136/328 (41%), Positives = 193/328 (58%), Gaps = 11/328 (3%)
Query: 12 LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGN 68
LL+ + + + I+ +E + HGK YS E E R IF++N V QHN MG
Sbjct: 3 LLIFVVCVAVATAIDPQWEAFKLLHGKQYS-EYEDGARYAIFQENSRIVKQHNEEAAMGK 61
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
+F + +N F D+T++EF+ +G + ++ V V ++DWR+KGA
Sbjct: 62 HTFFMRMNKFGDMTNEEFQMLVIGSGLLYSNKTQQTEGGVFESLPGLKVNDTVDWRQKGA 121
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 187
VT+VK+Q CG+CWAFS TG++EG + + +G+LVSLSEQ L+DC R N GC GGLMD
Sbjct: 122 VTKVKNQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCQGGLMDQ 181
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA-VVAQ 246
A++++ N GIDTE+ YPY+G+ + + K + T+ Y D+ +E L+QA
Sbjct: 182 AFKYIKTNGGIDTEECYPYKGKNERKCEYKSSCSGATLSSYVDIKTGDEDALMQASATIG 241
Query: 247 PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
P+SVGI S +FQLY G++ S LDH VL+VGY ++ DYW++KNSWG WG
Sbjct: 242 PISVGIDASHPSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGTDGEKDYWLVKNSWGEEWG 301
Query: 305 MNGYMHMQRNTGNSLGICGINMLASYPT 332
M GY+ M RN N CGI ASYP
Sbjct: 302 MEGYIKMSRNKDNQ---CGIATQASYPV 326
>gi|224062065|ref|XP_002300737.1| predicted protein [Populus trichocarpa]
gi|222842463|gb|EEE80010.1| predicted protein [Populus trichocarpa]
Length = 211
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/250 (55%), Positives = 157/250 (62%), Gaps = 63/250 (25%)
Query: 44 QEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR 103
+EK RLK FEDNY F FK S LG SAA ++ D+R
Sbjct: 13 EEKSYRLKAFEDNYDF--------------------------FKTSRLGLSAAPLNLDQR 46
Query: 104 RNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVS 163
+ ++ G + DVPASIDWRKKGAVT VKDQ SCG +V G ++
Sbjct: 47 K---LEGTGLVGDVPASIDWRKKGAVTNVKDQGSCGT---------------LVIG--LT 86
Query: 164 LSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV 223
LSEQEL+DCDRS+NSGC GGLMDYA+QFV + CNK+KL RH+V
Sbjct: 87 LSEQELVDCDRSFNSGCEGGLMDYAFQFVDET-----------------CNKEKLKRHVV 129
Query: 224 TIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVG 283
TID Y DV +NNEKQLLQAV AQPVSVGICGSERAFQ+YS GIFTG C TSLDHAVLIVG
Sbjct: 130 TIDKYVDVQQNNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGACLTSLDHAVLIVG 189
Query: 284 YDSENGVDYW 293
Y SENGVD W
Sbjct: 190 YGSENGVDPW 199
>gi|2239107|emb|CAA70693.1| cathepsin L-like cysteine proteinase [Heterodera glycines]
Length = 374
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 187/319 (58%), Gaps = 17/319 (5%)
Query: 25 INELFETWC---KQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAF 78
I F W ++HGKAY+ ++ + +R+ + F+ +HN G SF +
Sbjct: 59 IERGFSDWNAYKQKHGKAYADQEVENERMLTYLSAKQFIDKHNEAYKEGKVSFRVGETHI 118
Query: 79 ADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASC 138
ADL E++ GF D RR ++ +P N+ D+P S+DWR KG VTEVK+Q C
Sbjct: 119 ADLPFSEYQ-KLNGFRRLMGDSLRRNASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMC 177
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 197
G+CWAFSATGA+EG + G LVSLSEQ LIDC + Y N GC GG+MD A+Q++ N G
Sbjct: 178 GSCWAFSATGALEGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNKG 237
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSE 256
ID E YPY+ + G+ K N T GY D+ E +E+ L AV Q PVSV I
Sbjct: 238 IDKETAYPYKAKTGKKCLFKRNDVGATDSGYNDIAEGDEEDLKMAVATQGPVSVAIDAGH 297
Query: 257 RAFQLYSSGI-FTGPCS-TSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
R+FQLY++G+ F C +LDH VL+VGY D G DYWI+KNSWG WG GY+ M
Sbjct: 298 RSFQLYTNGVYFEKECDPENLDHGVLVVGYGTDPTQG-DYWIVKNSWGTRWGEQGYIRMA 356
Query: 313 RNTGNSLGICGINMLASYP 331
RN N+ CGI AS+P
Sbjct: 357 RNRNNN---CGIASHASFP 372
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 184/314 (58%), Gaps = 21/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K Y S E+ R KIF +N + +HN G S+ L +N F DL E
Sbjct: 27 WEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHE 86
Query: 86 FKASFLGFSAASIDHDRRRN--ASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGAC 141
F F G+ H R++ ++ P N+ D +P ++DWRKKGAVT VKDQ CG+C
Sbjct: 87 FARIFNGY------HGSRKSGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS TG++EG + + G LVSLSEQ L+DC +S+ N+GC GGLM+ A++++ N GIDT
Sbjct: 141 WAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDT 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
EK YPY G+C +K + T GY ++ E L +AV P+SV I S +F
Sbjct: 201 EKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGCEDDLKKAVATVGPISVAIDASHSSF 259
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLYS G++ P S LDH VL+VGY + G YW++KNSW SWG GY+ M R+ N
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNN 319
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 320 Q---CGIASQASYP 330
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/341 (38%), Positives = 186/341 (54%), Gaps = 36/341 (10%)
Query: 22 CSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADL 81
+ + E+F+ W ++ ++Y++ +E+++RL+++ N ++ N ++ L A+ DL
Sbjct: 45 ATTMMEMFQRWKAEYNRSYATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDL 104
Query: 82 THQEFKASFLGFSAASI---------------------DHDRRRNASVQSPGNLRDVPAS 120
T+ EF A + S +H + +S G PAS
Sbjct: 105 TNDEFMAMYTAPPLRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAG----APAS 160
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
+DWR GAVTEVKDQ CG+CWAFS +EGI KI G LVSLSEQEL+DCD + +SGC
Sbjct: 161 VDWRASGAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCD-TLDSGC 219
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRG-QAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
GG+ A +++ N GI T DYPY G A C++ KL H TI G + V +E L
Sbjct: 220 DGGVSYRALEWITANGGITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASL 279
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN--------GVD 291
A AQPV+V I FQ Y G++ GPC T L+H V +VGY E G
Sbjct: 280 QNAAAAQPVAVSIEAGGDNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDK 339
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNT-GNSLGICGINMLASYP 331
YWIIKNSWG++WG GY+ M+++ G G+CGI + S+P
Sbjct: 340 YWIIKNSWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFP 380
>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
Length = 336
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 184/317 (58%), Gaps = 14/317 (4%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLT 82
N+ +E W QHGK Y +E E+ R FE N + +HN ++G S+TL++N F D+
Sbjct: 21 NKEWEMWKLQHGKQYETEAEEYSRRFTFEKNTIKIAEHNIRASLGMHSYTLAMNKFGDMH 80
Query: 83 HQEFKASFLGFSAASIDHDR-RRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
H+EF +G + ++ + V + +P S+DWR V+EVKDQ CG+C
Sbjct: 81 HEEFHQRIMGGCLKIVKVNKPLLGSEVGDNDDNGTLPKSVDWRNSAMVSEVKDQGECGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS TG++EG + TG LV LSEQ+L+DC + + N GCGGGLMD A+Q++ N G+DT
Sbjct: 141 WAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDT 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
E+ YPY + K + T+ GYKDV NE L +AV P+SV I +F
Sbjct: 201 EESYPYTATDDKPCKFDNSSVGATLIGYKDVKSGNEHALKRAVATVGPISVAIDAGHESF 260
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVD---YWIIKNSWGRSWGMNGYMHMQRN 314
Q YSSG++ P S LDH VL+VGY + N +WI+KNSWG +WG GY+ M RN
Sbjct: 261 QFYSSGVYDEPQCSSEQLDHGVLVVGYGAMNDNSHQAFWIVKNSWGPNWGDQGYIMMSRN 320
Query: 315 TGNSLGICGINMLASYP 331
N CGI ASYP
Sbjct: 321 KDNQ---CGIATSASYP 334
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 184/314 (58%), Gaps = 21/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQE 85
+E + H K Y S E+ R KIF ++ + +HN G S+ L +N F DL E
Sbjct: 27 WEAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMNQFGDLLAHE 86
Query: 86 FKASFLGFSAASIDHDRRRN--ASVQSPGNLRD--VPASIDWRKKGAVTEVKDQASCGAC 141
F F G H R+ ++ P N+ D +P ++DWRKKGAVT VKDQ CG+C
Sbjct: 87 FARIFNGH------HGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATG++EG + + G LVSLSEQ L+DC +S+ N+GC GGLM+ A++++ N GIDT
Sbjct: 141 WAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDT 200
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
EK YPY G+C +K + T GY ++ +E L +AV P+SV I S +F
Sbjct: 201 EKSYPYEAVDGECRFKKEDVG-ATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSF 259
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
QLYS G++ P S LDH VL+VGY + G YW++KNSW SWG GY+ M R+ N
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNN 319
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 320 Q---CGIASQASYP 330
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 202/340 (59%), Gaps = 19/340 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDI--NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ F LL L +S++ DI ++ + +W QHGK+Y + E +R+ I+E+N +
Sbjct: 1 MMFALLVTLCISAVFTAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
QHN + GN +F + +N F D+T++EF+ + G+ D +R ++ + P
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKQ---DPNRTSKGALFMEPSFFAAP 116
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
+DWR++G VT VKDQ CG+CW+FS+TGA+EG TG L+S+SEQ L+DC R N
Sbjct: 117 QQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGN 176
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GG+MD A+Q+V +N G+D+E+ YPY + + ++ I G+ D+P+ NE
Sbjct: 177 QGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNEL 236
Query: 238 QLLQAVVA-QPVSVGICGSERAFQLYSSGI-FTGPCSTSLDHAVLIVGYDSEN----GVD 291
L+ AV A PVSV I S ++ Q Y SGI + C++ LDHAVL+VGY + G
Sbjct: 237 ALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNR 296
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSW WG GY++M ++ N CGI +ASYP
Sbjct: 297 YWIVKNSWSDKWGDKGYIYMAKDKNNH---CGIATMASYP 333
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 141/317 (44%), Positives = 192/317 (60%), Gaps = 20/317 (6%)
Query: 26 NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLT 82
+E +E + +QH K Y +Q+ +R IFE N + HN ++G SS+ L LN FAD+T
Sbjct: 23 DEHWELFKRQHNKTYLQKQDVGRR-AIFEANIKKINAHNLLYDLGRSSYRLGLNGFADMT 81
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLR-DVPASIDWRKKGAVTEVKDQASCGAC 141
EF+ + + + R + +Q N VP ++DWR +G VT VK+Q CG+C
Sbjct: 82 PDEFEK----YRGTRFEANEARVSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGSC 137
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFS TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A++F+ G++T
Sbjct: 138 WAFSTTGALEGQHFRRSGDLVSLSEQMLVDCSAVYGNAGCNGGLMDNAFRFIKDAGGLET 197
Query: 201 EKDYPYRGQAGQCNKQKLNRHI-VTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERA 258
EK YPY G+ G C+ R I + G+ DVP +E+ L +A V PVSV I S +
Sbjct: 198 EKSYPYTGKDGTCHFDA--RGIGAKLTGFVDVPSRDEEALKEAAGVVGPVSVAIDASGQN 255
Query: 259 FQLYSSGIFTGPC--STSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
FQ Y G++ STSLDH VL+VGY + +G DYW++KNSWG SWG +GY+ M RN
Sbjct: 256 FQFYKDGVYDEITCSSTSLDHGVLVVGYGTTRDGKDYWLVKNSWGSSWGQSGYIQMSRNK 315
Query: 316 GNSLGICGINMLASYPT 332
N CGI +ASYPT
Sbjct: 316 ENQ---CGIATMASYPT 329
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 136/328 (41%), Positives = 191/328 (58%), Gaps = 15/328 (4%)
Query: 12 LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGN 68
++++ L L CS ++ + + +H K Y QE+ R +F ++ QHN + G
Sbjct: 6 VVVALLALASCS-LDREWGMFKVRHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGV 64
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
SF + +N +AD+ ++EF G+ + R + + P N+ D+PA++DWR KG
Sbjct: 65 HSFRVGINEYADMPNEEFVRVMNGYK---MQEQRPKAPTYMPPSNVGDLPATVDWRTKGY 121
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 187
VTEVK+Q CG+CWAFS+TG++EG L+SLSEQ L+DC N GCGGGLMD
Sbjct: 122 VTEVKNQGQCGSCWAFSSTGSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQ 181
Query: 188 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQ 246
A+ ++ N GIDTE YPY +G+C K N GY D+ +E L AV
Sbjct: 182 AFTYIKVNDGIDTETSYPYEAASGKCRFNKANVG-ANDTGYTDIKSKSESDLQSAVATVG 240
Query: 247 PVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 304
P++V I S +FQLY SG++ CS T LDH VL VGY +++G DYW++KNSWG +WG
Sbjct: 241 PIAVAIDASHMSFQLYKSGVYHYIFCSQTRLDHGVLAVGYGTDSGKDYWLVKNSWGATWG 300
Query: 305 MNGYMHMQRNTGNSLGICGINMLASYPT 332
GY+ M RN N+ CGI ASYPT
Sbjct: 301 QQGYIMMSRNRDNN---CGIATQASYPT 325
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 140/337 (41%), Positives = 198/337 (58%), Gaps = 17/337 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ + +L +L SS D ++ ++ W K +GK Y + E+ R I+E N FV
Sbjct: 12 MKWLVLVLLGCSSAMAQLHKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEKNLKFVM 71
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
HN +MG S+ L +N D+T +E A S+ + +RN + +S N + +P
Sbjct: 72 LHNLEHSMGMHSYDLGMNHLGDMTSEEVTALM---SSLRVPSQWQRNVTYKSNPN-QKLP 127
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD--RSY 176
S+DWR KG VT+VK Q SCG+CWAFSA GA+E K+ TG LVSLS Q L+DC +
Sbjct: 128 DSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSVGKYS 187
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GG M A+Q++I N+GI++E YPY+ G+C R T Y ++PE++E
Sbjct: 188 NRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMDGKCQYDSKYR-AATCSRYTELPEDSE 246
Query: 237 KQLLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWI 294
L +AV + PVSV I S +F LY SG++ P C+ ++H VL+VGY + NG DYW+
Sbjct: 247 DALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYGNLNGKDYWL 306
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG +G GY+ M RN+GN CGI ASYP
Sbjct: 307 VKNSWGLHFGDQGYIRMARNSGNH---CGIASYASYP 340
>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 137/334 (41%), Positives = 190/334 (56%), Gaps = 17/334 (5%)
Query: 8 LLSILLLSSLPLNYCS----DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
+L L+L SL + + ++ ++ W HGK Y +E E R +++E N +T H
Sbjct: 9 MLGSLMLVSLCVGAAAMFEPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLMLITMH 68
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N +MG ++ LS+N DLT +E SF S + D +R AS + DVP +
Sbjct: 69 NLEASMGLHTYELSMNHMGDLTQEEIMQSFATLSPPT---DIQRAASPFAGTTGADVPDT 125
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+KG VT VK Q SCG+CWAFSA GA+EG TG LV LS Q L+DC Y N G
Sbjct: 126 MDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHG 185
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLM +A+Q+VI N GID++ YPY G+ G+C R Y +PE NE L
Sbjct: 186 CNGGLMHHAFQYVIDNQGIDSDASYPYTGRNGECRYNSKFR-AANCSQYSFLPEGNEGAL 244
Query: 240 LQAVV-AQPVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKN 297
+A+ P+SV I + F Y SG++ P CS ++H VL VGY + +G DYW++KN
Sbjct: 245 KEALANIGPISVAIDATRPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTLDGQDYWLVKN 304
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
SWG+++G GY+ M RN + CGI + YP
Sbjct: 305 SWGKTFGDQGYIRMSRNKNDQ---CGIALYGCYP 335
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 245 bits (625), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 139/321 (43%), Positives = 185/321 (57%), Gaps = 20/321 (6%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFA 79
+ +NE ++ W H K Y ++E +R+ ++E N + HN +MG SF L +N F
Sbjct: 22 AQLNEHWDLWKSWHSKKYHEKEEGWRRM-VWEKNLQKIELHNLEHSMGTHSFRLGMNHFG 80
Query: 80 DLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
D+TH+EF+ G+ + R+ S+ N P+++DWR+KG VT VKDQ CG
Sbjct: 81 DMTHEEFRQIMNGYKLKT---QRKFTGSLFMEPNFMTAPSAVDWREKGYVTPVKDQGQCG 137
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGI 198
+CWAFS TGA+EG TG LVSLSEQ L+DC R N GCGGGLMD A+Q+V N G+
Sbjct: 138 SCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCGGGLMDQAFQYVTDNQGL 197
Query: 199 DTEKDYPYRGQAGQ-CNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSE 256
D+E YPY G Q C+ L + G+ DVP E L++AV + PVSV I
Sbjct: 198 DSEDSYPYTGTDDQPCHYDPL-YNSANDTGFVDVPSGKEHALMKAVASVGPVSVAIDAGH 256
Query: 257 RAFQLYSSGI-FTGPCST-SLDHAVLIVGYDSEN----GVDYWIIKNSWGRSWGMNGYMH 310
+FQ Y SGI + CS+ LDH VL VGY E G +WI+KNSWG WG GY++
Sbjct: 257 ESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKMGKKFWIVKNSWGEKWGDKGYIY 316
Query: 311 MQRNTGNSLGICGINMLASYP 331
M ++ N CGI ASYP
Sbjct: 317 MAKDRKNH---CGIATAASYP 334
>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 492
Score = 245 bits (625), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 135/313 (43%), Positives = 181/313 (57%), Gaps = 31/313 (9%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F +W K H +S E +RL+ + N ++ HN + SSF L NAF+ LT++EF+
Sbjct: 33 FVSWLKTHHLTFSDAFEYAKRLETYIANDIYILTHN-LQESSFKLGHNAFSHLTNEEFRQ 91
Query: 89 SFLGFSAASIDHDRRRNA--SVQSPGNLR--DVPASIDWRKKGAVTEVKDQASCGACWAF 144
F GF A S D+ +R A +V S N + D+P S+DW +KGAVT VK+Q CG+CWAF
Sbjct: 92 RFNGFKA-SDDYLTKRLAQSNVASSTNFQYIDLPESVDWVEKGAVTGVKNQGMCGSCWAF 150
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
S TGAIEG I +G LVSLSEQEL+DCD + + GC GGLMD+A+ ++ ++ GI +E+DY
Sbjct: 151 STTGAIEGATFISSGKLVSLSEQELVDCDHNGDHGCNGGLMDHAFSWISEHDGICSEEDY 210
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSS 264
Y C K V PV+V I +R+FQ Y S
Sbjct: 211 AYIHSQSLCRSCK-------------------------PVVSPVAVAIDAGDRSFQFYQS 245
Query: 265 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGI 324
G++ C T LDH VL VGY E+G YW +KNSWG SWG GY+ + R+ G CGI
Sbjct: 246 GVYNKTCGTQLDHGVLTVGYGVEDGQKYWKVKNSWGNSWGEKGYIRLSRDQNGRSGQCGI 305
Query: 325 NMLASYPTKTGQN 337
M+ SYPT + +N
Sbjct: 306 AMVPSYPTASLRN 318
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 197/340 (57%), Gaps = 27/340 (7%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+L+ +++LLL L +D E + W ++GK Y S E R KI+ N +V +
Sbjct: 4 TLSLRFVAVLLLIGLVSAAVNDAEE-WRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNE 62
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
HN+M +SSF L +N FADLT +EF + + G+ + + G +P S+D
Sbjct: 63 HNSM-DSSFQLEVNEFADLTAEEFSSIYNGYGKGRNRENHENTTIYRYTGGA--IPDSVD 119
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182
WR KG VT VK+Q CG+CWAFS TG++EG + TG LVSLSEQ L+DCD+ + GC G
Sbjct: 120 WRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDKK-DHGCQG 178
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQK------LNRHIVTIDGYKDVPENNE 236
GLM A++++ +N GIDTE+ YPY+ + G+C +K + RH+ + +
Sbjct: 179 GLMTTAFKYIEENKGIDTEESYPYKAKNGRCEFKKDDIGATVERHVSIL--------TTD 230
Query: 237 KQLLQAVVAQ--PVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDY 292
+ L+ VA+ P+SV + S +FQLY SGI+ S LDH VL+VGY E+G +Y
Sbjct: 231 CEALKKAVAEIGPISVAMDASHSSFQLYKSGIYDPKICSSRKLDHGVLVVGYGKEDGEEY 290
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
W++KNSWG++WGM GY + + +CGI A YP
Sbjct: 291 WLVKNSWGKNWGMEGYFKI----ASKKNLCGICTSACYPV 326
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 140/337 (41%), Positives = 198/337 (58%), Gaps = 23/337 (6%)
Query: 6 FFLLSIL---LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
FLL+ L ++S+ P + S + ++E W +HGK Y++ +E Q+R ++E+N +
Sbjct: 4 IFLLATLCLGMISAAPTHDPS-FDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINL 61
Query: 63 HNN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPA 119
HN G F+L +NAF DLT+ EF+ GF + + V L DVP
Sbjct: 62 HNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQG-----QKTKMMKVFPEPFLGDVPK 116
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
++DWRK G VT VK+Q CG+CWAFSA G++EG TG LV LSEQ L+DC S+ N
Sbjct: 117 TVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNK 176
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGL D+A+Q+V N G+DT YPY G C + + G+ +P +E
Sbjct: 177 GCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTC-RYNPKYSAAKVVGFMSIPP-SENA 234
Query: 239 LLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE-NGVDYWI 294
L++AV P+SVGI ++FQ Y G++ P ST+L+HAVL+VGY E +G YW+
Sbjct: 235 LMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDGRKYWL 294
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWGR WGM+GY+ M ++ N+ CGI ASYP
Sbjct: 295 VKNSWGRDWGMDGYIKMAKDWNNN---CGIASDASYP 328
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 133/309 (43%), Positives = 180/309 (58%), Gaps = 29/309 (9%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
E E W +HG+ Y +EK++R +IF+ N ++ N N ++ L LN FADL+H+E+
Sbjct: 37 EKHEQWMARHGRTYQDSEEKERRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEEY 96
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
A++ R V +VP SIDWR GAVT +K+Q CG CWAFSA
Sbjct: 97 VATYTA-----------RKMPV-------EVPESIDWRDHGAVTPIKNQYQCGCCWAFSA 138
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
A+EGI + G VSLS Q+L+DC S N GC GG M+ A+ ++I+N GI E DYPY
Sbjct: 139 AAAVEGI--VANG--VSLSAQQLLDC-VSDNQGCKGGWMNNAFNYIIQNQGIALETDYPY 193
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGI-CGSERAFQLYSSG 265
+ C+ + I G++DV +E+ L++AV QPVSV I S F+LY G
Sbjct: 194 QQMQQMCSSRMA---AAQISGFEDVTPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEG 250
Query: 266 IFTGP-CSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICG 323
+FT C HAV +VGY SE+G YW+ KNSWG +WG +GYM +QR+ G G CG
Sbjct: 251 VFTAAGCGNGHSHAVTLVGYGTSEDGTKYWLAKNSWGETWGESGYMRLQRDIGLEGGPCG 310
Query: 324 INMLASYPT 332
I + ASYPT
Sbjct: 311 IALYASYPT 319
>gi|15128493|dbj|BAB62718.1| plerocercoid growth factor/cysteine protease [Spirometra
erinaceieuropaei]
gi|15130639|dbj|BAB62799.1| plerocercoid growth factor-2/cysteine protease [Spirometra
erinaceieuropaei]
Length = 336
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 142/338 (42%), Positives = 200/338 (59%), Gaps = 18/338 (5%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+ FFLL++ S+ Y EL++ W K Y S +E+ R + F +N F+ +
Sbjct: 8 AFLFFLLTVCRGSTGSETYVR--RELWKAWKLAFKKEYFSSEEELHRKRAFFNNLDFIIR 65
Query: 63 HNN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGNLRDVP 118
HN S+ + LN F+DLT EF +L + RR+ A SV NL P
Sbjct: 66 HNQRYYQQLESYAVRLNDFSDLTPGEFAERYLCLRGIVLTKLRRKEAVSVPLKENL---P 122
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
S++WR++GAVT VK+Q CG+CW+FSA GAIEG +I TG+L SLSEQ+L+DC Y N
Sbjct: 123 DSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGN 182
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLM A+Q+ + +G++ E DY Y + G C + + + + + GY ++PE +E
Sbjct: 183 QGCNGGLMPQAFQYA-QRYGVEAEVDYRYTERDGVC-RYRQDLVVANVTGYAELPEGDEG 240
Query: 238 QLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CST-SLDHAVLIVGYDSENGVDYWI 294
L +AV P+SVGI ++ F YS G+F CS ++DH VL+VGY +ENG YW+
Sbjct: 241 GLQRAVATIGPISVGIDAADPGFMSYSHGVFVSKTCSPYAIDHGVLVVGYGAENGEAYWL 300
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+KNSWG SWG GY+ M RN N +CGI +ASYPT
Sbjct: 301 VKNSWGSSWGEGGYVKMARNRNN---MCGIASMASYPT 335
>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
Length = 332
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 192/314 (61%), Gaps = 15/314 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ ++ W K +GK Y + E+ R I+E N FV HN +MG S+ L +N D+
Sbjct: 25 LDNHWDLWKKTYGKQYKEKNEEVARRLIWERNLKFVMLHNLEHSMGMHSYDLGMNHLGDM 84
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E + S+ + +RN + +S N + +P S+DWR+KG VTEVK Q SCGAC
Sbjct: 85 TSEEVTSLM---SSLRVPSQWQRNVTYKSNPNEK-LPDSLDWREKGCVTEVKYQGSCGAC 140
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNHGID 199
WAFSA GA+E K+ TG+LVSLS Q L+DC ++ N GC GG M A+Q++I N+GID
Sbjct: 141 WAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDNNGID 200
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERA 258
++ YPY+ G+C NR T Y ++P +E L +AV + PVSV I S +
Sbjct: 201 SDASYPYKAMDGKCRYDSKNR-AATCSKYTELPFGSEDDLKEAVANKGPVSVAIDASHPS 259
Query: 259 FQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
F LY SG++ P C+ +++H VL+VGY + NG DYW++KNSWG ++G GY+ M RN+GN
Sbjct: 260 FFLYKSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGINFGDKGYIRMARNSGN 319
Query: 318 SLGICGINMLASYP 331
CGI SYP
Sbjct: 320 H---CGIANYCSYP 330
>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
Length = 330
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 138/338 (40%), Positives = 192/338 (56%), Gaps = 19/338 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ F L+ L L +P D +++ ++ W +HGK YS ++E Q+R ++E+N +
Sbjct: 2 IPIFFLATLCLGVVPAAPTHDPSLDDEWQEWKTRHGKTYSMDEEGQKR-AVWENNRKMIE 60
Query: 62 QHNN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
HN G F L +NAF DLT+ EF+ GF + + +V L DVP
Sbjct: 61 LHNEDYTKGKHGFHLEMNAFGDLTNIEFRQLMTGFQSMGT-----KEMNVFQEPLLGDVP 115
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
S+DWR VT VKDQ C +CWAFSA G++EG TG L+SLSEQ L+DC SY N
Sbjct: 116 KSVDWRNLSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDCSWSYGN 175
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLM+YA+++V +N G+DT YPY + G C N D K +P + +
Sbjct: 176 IGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNGPCRYDPKNSAANVTDFVK-IPISEDA 234
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSE-NGVDYWI 294
+ P+SVG+ +F+ Y G++ P CS+S LDHAVL+VGY E +G YW+
Sbjct: 235 LMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPHCSSSNLDHAVLVVGYGEESDGNKYWM 294
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+KNSWG+ WGMNGY+ M R+ N+ CGI A YPT
Sbjct: 295 VKNSWGQGWGMNGYIKMARDRNNN---CGIATYAIYPT 329
>gi|432108215|gb|ELK33129.1| Cathepsin L1 [Myotis davidii]
Length = 334
Score = 244 bits (623), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 141/339 (41%), Positives = 192/339 (56%), Gaps = 28/339 (8%)
Query: 12 LLLSSLPLNYCSDINEL-------FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
LLL++L L S +L + W H + Y +E +R ++E N + HN
Sbjct: 5 LLLTALCLGIASATPKLDPRLDAQWYEWKAAHRRLYGVNEEGWRR-AVWEKNMKMIELHN 63
Query: 65 ---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
++ FT+++NAF D+T++EF+ GF + ++RN V +P+S+
Sbjct: 64 REYSLRKQGFTMAMNAFGDMTNEEFRQVMNGFQ-----NQKQRNGKVFREPLFAQIPSSV 118
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGC 180
DWR KG VT VK+Q CG+CWAFSATG++EG TG LVSLSEQ L+DC R+ N GC
Sbjct: 119 DWRDKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGC 178
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGLMD A+Q+V N G+DTE+ YPY + + G+ D+P+ EK LL
Sbjct: 179 NGGLMDNAFQYVKDNKGLDTEESYPYLARESNTCNYRPEYSAANDTGFVDIPQ-REKALL 237
Query: 241 QAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGV----DYW 293
+AV P+SV I +FQ Y++GI+ P S LDH VL+VGY SE G +W
Sbjct: 238 KAVATVGPISVAIDAGHSSFQFYNAGIYYEPNCSSKDLDHGVLVVGYGSEGGESKNNKFW 297
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
I+KNSWG WGMNGY+ M R+ N CGI ASYPT
Sbjct: 298 IVKNSWGSGWGMNGYVKMARDQSNH---CGIATAASYPT 333
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 244 bits (623), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/317 (43%), Positives = 184/317 (58%), Gaps = 13/317 (4%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
D +F + ++GK Y+ E R IF+ N + N N +F L +N F DLT
Sbjct: 22 DYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDII-YATNARNLTFALGVNEFTDLTQ 80
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
+E AS+ G AS+ R ++ + G + +S+DW +G VT VK+Q CG+CW+
Sbjct: 81 EELAASYTGLKPASLWSGLPRLSTHEYNG--APLASSVDWTTQGVVTPVKNQGQCGSCWS 138
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS TGA+EG + TG+LVSLSEQ+ +DCD + +SGC GG MD A+ F KN I TE
Sbjct: 139 FSTTGALEGAWALSTGNLVSLSEQQFVDCDTT-DSGCNGGWMDNAFSFAKKNS-ICTEGS 196
Query: 204 YPYRGQAGQCNKQKLNRHIVT--IDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY G CN I + GY DV ++E+ ++ AV QPVS+ I + +FQL
Sbjct: 197 YPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQL 256
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
YSSG+ T C T LDH VL VGY SE G DYW +KNSWG SWG GY+ +QR G + G
Sbjct: 257 YSSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGGA-GE 315
Query: 322 CGINMLA---SYPTKTG 335
CG +LA SYP +G
Sbjct: 316 CG--LLAGPPSYPVVSG 330
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 244 bits (623), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 199/335 (59%), Gaps = 14/335 (4%)
Query: 5 AFFLLSILLLS-SLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
A +L++L L+ S L + + +N+ ++ W + + K YS +E +R +E N V +H
Sbjct: 3 AISVLAVLALAFSCTLAFDAKLNQHWKLWKEANNKRYSDAEEHVRR-ATWEGNLQKVQEH 61
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N ++G ++ L +N +AD+T EF G++A ++ R ++ S + +P +
Sbjct: 62 NLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNA-TMRGQRTQDRHTFSFNSKIALPDT 120
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSG 179
+DWR KG VT+VKDQ CG+CWAFS TGA+EG + TG LVSLSEQ L+DC + N G
Sbjct: 121 VDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDCSGKQGNMG 180
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A++++ +N+GIDTE YPY QC + N T G+ D+ +E L
Sbjct: 181 CNGGLMDQAFEYIKENNGIDTEDSYPYEAVDNQCRFKAANVG-ATDTGFTDITSKDESAL 239
Query: 240 LQAV-VAQPVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGYDSENGVDYWIIK 296
QAV P+SV I +FQLY G++ P CS T LDH VL VGY +++G DYW++K
Sbjct: 240 QQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYGTDSGKDYWLVK 299
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
NSWG WG GY+ M RN N CGI ASYP
Sbjct: 300 NSWGEGWGDKGYIKMTRNKRNQ---CGIATAASYP 331
>gi|326493706|dbj|BAJ85314.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 244 bits (623), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 185/329 (56%), Gaps = 25/329 (7%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM-------------GNS 69
S++ E F W ++ K YS +QE++ R ++F++N + Q + G+
Sbjct: 42 SEVRERFSKWMIKYSKHYSCKQEEEMRFQVFKNNTNSIGQLDRQNPNPGVGGALGPSGSQ 101
Query: 70 SFT---LSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKK 126
T +S+N F DL+ +E + G + S R AS P +DWR
Sbjct: 102 VHTFQKVSMNRFGDLSPREVIQQYTGLNTTSF-----RTASPTYLPYHSFKPCCVDWRSS 156
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
GAVT VK Q +CG+CWAF+A AIEG+NKI TG LVSLSEQ L+DCD + ++GCGGG D
Sbjct: 157 GAVTGVKHQGTCGSCWAFAAVAAIEGMNKIRTGELVSLSEQVLVDCD-TVSTGCGGGHSD 215
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLN-RHIVTIDGYKDVPENNEKQLLQAVVA 245
A V GI +E+ YPY G G+C+ KL H +I G+K VP NNE QL AV
Sbjct: 216 SAMALVAARGGITSEERYPYAGFQGKCDVDKLMFDHQASIKGFKAVPSNNEAQLAIAVAM 275
Query: 246 QPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWGRSW 303
QPV+V I S AFQ YS GI+ GPCS +++HAV IVGY G YWI KNSW W
Sbjct: 276 QPVTVYIDASGSAFQFYSGGIYRGPCSANVNHAVTIVGYCEGPGEGNKYWIAKNSWSNDW 335
Query: 304 GMNGYMHMQRNTGNSLGICGINMLASYPT 332
G GY+++ ++ S G CG+ YPT
Sbjct: 336 GEQGYVYLAKDVAWSTGTCGLATSPFYPT 364
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 244 bits (623), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 141/343 (41%), Positives = 191/343 (55%), Gaps = 22/343 (6%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
MN L L+S L + + + W H + Y +E+ +R ++E N +
Sbjct: 1 MNPTLILAAFCLGLASAALTFNHSLEAQWIKWKAMHNRLYGKNEEEWRRA-VWEKNMKTI 59
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
HN N G SFT+++N F D+T++EF+ GF + + RN V L +
Sbjct: 60 ELHNHEYNQGKHSFTMAMNTFGDMTNEEFRQVMNGFQ-----NRKPRNGKVFQEPLLHEA 114
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P S+DWR+KG VT VK+Q CG+CWAFSATGA+EG TG LVSLSEQ L+DC
Sbjct: 115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQG 174
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMDYA+Q+V +N G+D+E+ YPY C K + G+ D+P+ E
Sbjct: 175 NQGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK-LE 232
Query: 237 KQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSEN-GVD- 291
K L++AV P+SV I +FQ Y GI+ P S +DH VL+VGY E G D
Sbjct: 233 KALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERTGSDN 292
Query: 292 --YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG WGM+GY+ M ++ N CGI ASYPT
Sbjct: 293 SKYWLVKNSWGEEWGMDGYIKMAKDRKNH---CGIASAASYPT 332
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 244 bits (623), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 134/335 (40%), Positives = 200/335 (59%), Gaps = 19/335 (5%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN-- 65
L S+ + + +++ ++ +++ + + + Y E ++R KIF +N+ +++HN
Sbjct: 45 LDSMHMQDVIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRISKHNVRF 104
Query: 66 -MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWR 124
G S+T+ +N F+D T +E K + + D + ++ +P P+ IDWR
Sbjct: 105 IQGQVSYTMGINEFSDKTDEELKRLRCFRGSLNASRDGSKYITIAAP-----PPSEIDWR 159
Query: 125 KKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGG 183
KGAVT VK+Q +CG+CWAFSATGAIEG N + TG+LVSLSEQ+L+DC Y N+ C GG
Sbjct: 160 NKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGG 219
Query: 184 LMDYAYQFVIKNHGIDTEKDYPY-RGQAGQCN---KQKLNRHIVTIDGYKDVPENNEKQL 239
LMD A+++V ++GIDTE YPY G+ G N + L +V + GY D+P +L
Sbjct: 220 LMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQVSEL 279
Query: 240 LQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIK 296
QAV P+SV I +F Y SG+++ S LDH VL+VGY ENG+ YW+IK
Sbjct: 280 KQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPYWLIK 339
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
NSWG WG NGY+ + R+ N +CG+ +ASYP
Sbjct: 340 NSWGPHWGENGYVKILRDHNN---LCGVASMASYP 371
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 244 bits (623), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 200/340 (58%), Gaps = 19/340 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDI--NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ F LL L +S++ DI ++ + +W QHGK+Y + E +R+ I+E+N +
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
QHN + GN +F + +N F D+T++EF+ + G+ D +R + + P
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGPLFMEPSFFAAP 116
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
+DWR++G VT VKDQ CG+CW+FS+TGA+EG TG L+S+SEQ L+DC R N
Sbjct: 117 QQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGN 176
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GG+MD A+Q+V +N G+D+E+ YPY + + ++ I G+ D+P NE
Sbjct: 177 QGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNEL 236
Query: 238 QLLQAVVA-QPVSVGICGSERAFQLYSSGI-FTGPCSTSLDHAVLIVGYDSEN----GVD 291
L+ AV A PVSV I S ++ Q Y SGI + C++ LDHAVL+VGY + G
Sbjct: 237 ALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNR 296
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSW WG GY++M ++ N CGI +ASYP
Sbjct: 297 YWIVKNSWSDKWGDKGYIYMAKDKNNH---CGIATMASYP 333
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 244 bits (622), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 139/342 (40%), Positives = 204/342 (59%), Gaps = 18/342 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L F +++ ++ S +++ + E + + H K Y SE E++ R+KIF +N V
Sbjct: 1 MKFLVF--VALCVVGSQAVSFFDLVQEQWGAFKVTHKKQYESETEERFRMKIFMENAHKV 58
Query: 61 TQHNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASI---DHDRRRNASVQSPGNL 114
+HN + G SF L +N ++D+ + EF + G++ + + + + P N+
Sbjct: 59 AKHNKLYAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPPANV 118
Query: 115 RDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
++P IDWRK GAVT VKDQ CG+CW+FS TG++EG + + LVSLSEQ LIDC
Sbjct: 119 -ELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCSE 177
Query: 175 SY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
Y N+GC GGLMD A++++ N GIDTE+ YPY+ + +C+ + N+ T G+ D+
Sbjct: 178 KYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDEKCHYKPRNKG-ATDRGFVDIES 236
Query: 234 NNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENG 289
+E++L AV P+SV I S FQ YS G++ P S LDH VL+VGY + E+G
Sbjct: 237 GDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDEDG 296
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWG SWG GY+ M RN N+ CGI ASYP
Sbjct: 297 NDYWLVKNSWGDSWGDQGYIKMARNRDNN---CGIATQASYP 335
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 244 bits (622), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 140/334 (41%), Positives = 189/334 (56%), Gaps = 23/334 (6%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---N 65
L + ++S+ P Y S ++ + W HGK Y E E+ R ++E N + QHN +
Sbjct: 10 LCLGIVSAAPKLYQS-LDARWSQWKAAHGKLYD-ENEEGWRRAVWEKNLKVIKQHNQEYS 67
Query: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRK 125
G SFT+++NAF DLT++EFK G + +R+ +V + P+S+DWRK
Sbjct: 68 QGKHSFTMAMNAFGDLTNEEFKQVMNGLKS-----QKRKEGNVFQAPPFAETPSSVDWRK 122
Query: 126 KGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGL 184
KG VT VK+Q CG+CWAFSATGA+EG T LVSLSEQ L+DC ++ N GC GGL
Sbjct: 123 KGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQNLVDCSQAEGNEGCSGGL 182
Query: 185 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 244
MDYA+Q+V N G+D+E+ YPYR Q C K K + G+ D+ E L
Sbjct: 183 MDYAFQYVKDNGGLDSEESYPYRAQDESC-KYKPEQSAANDTGFMDIHPEEESLKLAVAT 241
Query: 245 AQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVD-----YWIIKN 297
P+S I S FQ Y GI+ P S +LDH +L+VGY S+ G D YWI+KN
Sbjct: 242 VGPISAAIDASLSTFQFYHKGIYYDPDCSSENLDHGILVVGYGSQ-GEDSEKQKYWIVKN 300
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
SWG WG GY+ M ++ N CGI AS+P
Sbjct: 301 SWGTDWGTQGYILMAKDRDNH---CGIATAASFP 331
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 244 bits (622), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 141/342 (41%), Positives = 193/342 (56%), Gaps = 19/342 (5%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M LA L + + S P + + +++ +E W H K Y ++E +R+ I+E N +
Sbjct: 1 MLPLALLALGVSAVLSAP-SLDARLSDHWELWKNWHSKKYHEKEEGWRRM-IWEKNLNKI 58
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
HN +MG S+ L +N F D+TH+EF+ G+ + +R+ S+ N
Sbjct: 59 ELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQRKT---ERKAIGSLFMEPNFMVA 115
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P+++DWR+KG VT VKDQ CG+CWAFS TGA+ZG N G LVSLSEQ L+DC R
Sbjct: 116 PSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRPEG 175
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GCGGGLMD A+Q+V N G+D+E YPY G Q + V G+ D+P E
Sbjct: 176 NEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDIPSGKE 235
Query: 237 KQLLQAVVA-QPVSVGICGSERAFQLYSSGI-FTGPCST-SLDHAVLIVGYDSE----NG 289
L++AV + PVSV I +FQ Y SGI + CS+ LDH VL VGY E +G
Sbjct: 236 HALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDG 295
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSW WG GY++M ++ N CGI ASYP
Sbjct: 296 KKYWIVKNSWSEKWGDKGYIYMAKDRKNH---CGIATAASYP 334
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 244 bits (622), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 139/317 (43%), Positives = 184/317 (58%), Gaps = 13/317 (4%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTH 83
D +F + ++GK Y+ E R IF+ N + N N +F L +N F DLT
Sbjct: 22 DYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDII-YATNARNLTFALGVNEFTDLTQ 80
Query: 84 QEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
+EF AS+ G AS+ R ++ + G + +S+DW +G VT VK+Q CG+CW+
Sbjct: 81 EEFAASYTGLKPASLWSGLPRLSTHEYNG--APLASSVDWTTQGVVTPVKNQGQCGSCWS 138
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
FS TGA+EG + TG+LVSLSEQ+ DCD + +SGC GG MD A+ F KN I TE
Sbjct: 139 FSTTGALEGAWALSTGNLVSLSEQQFEDCDTT-DSGCNGGWMDNAFSFAKKNS-ICTEGS 196
Query: 204 YPYRGQAGQCNKQKLNRHIVT--IDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY G CN I + GY DV ++E+ ++ AV QPVS+ I + +FQL
Sbjct: 197 YPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQL 256
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
YSSG+ T C T LDH VL VGY SE G DYW +KNSWG SWG GY+ +QR G + G
Sbjct: 257 YSSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGGA-GE 315
Query: 322 CGINMLA---SYPTKTG 335
CG +LA SYP +G
Sbjct: 316 CG--LLAGPPSYPVVSG 330
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 244 bits (622), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 142/339 (41%), Positives = 191/339 (56%), Gaps = 19/339 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
LA F L + + + P +N+ ++ W K H K Y + +E +R+ I+E N + H
Sbjct: 5 LAAFTLCLSAVFAAP-TLDQQLNDHWDQWKKWHSKKYHATEEGWRRV-IWEKNLKKIEMH 62
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N +MG ++ L +N F D+TH+EF+ GF DRR S+ N +VP
Sbjct: 63 NLEHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHKK---DRRFRGSLFMEPNFIEVPNK 119
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+KG VT VKDQ CG+CWAFS TGA+EG TG LVSLSEQ L+DC R N G
Sbjct: 120 LDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEG 179
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A+Q+V +G+D+E+ YPY G Q G+ D+P E+ L
Sbjct: 180 CNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERAL 239
Query: 240 LQAVVA-QPVSVGICGSERAFQLYSSGI-FTGPCST-SLDHAVLIVGYDSE----NGVDY 292
++A+ A PVSV I +FQ Y SGI + CS+ LDH VL VGY E +G Y
Sbjct: 240 MKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKY 299
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
WI+KNSW +WG GY++M ++ N CGI ASYP
Sbjct: 300 WIVKNSWSENWGDKGYIYMAKDRHNH---CGIATAASYP 335
>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
Length = 335
Score = 244 bits (622), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 200/340 (58%), Gaps = 19/340 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSDI--NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ F LL L +S++ DI ++ + +W QHGK+Y + E +R+ I+E+N +
Sbjct: 1 MMFALLVTLCISAVFTAPSIDIQLDDHWNSWKSQHGKSYHEDLEVGRRM-IWEENLRKIE 59
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
QHN + GN +F + +N F D+T++EF+ + G+ D +R + + P
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGPLFMEPSFFAAP 116
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
+DWR++G VT VKDQ CG+CW+FS+TGA+EG TG L+S+SEQ L+DC R N
Sbjct: 117 QQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGN 176
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GG+MD A+Q+V +N G+D+E+ YPY + + ++ I G+ D+P NE
Sbjct: 177 QGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNEL 236
Query: 238 QLLQAVVA-QPVSVGICGSERAFQLYSSGI-FTGPCSTSLDHAVLIVGYDSEN----GVD 291
L+ AV A PVSV I S ++ Q Y SGI + C++ LDHAVL+VGY + G
Sbjct: 237 ALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNR 296
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YWI+KNSW WG GY++M ++ N CGI +ASYP
Sbjct: 297 YWIVKNSWSDKWGDKGYIYMAKDKNNH---CGIATMASYP 333
>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
Length = 333
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 141/343 (41%), Positives = 191/343 (55%), Gaps = 22/343 (6%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
MN L L+S L + + + W H + Y +E+ +R ++E N +
Sbjct: 1 MNPTLILTAFCLGLASSALTFDRSLEAQWIKWKAMHNRLYGMNEEEWRRA-VWEKNMKMI 59
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
HN N G SFT+++NAF D+T++EF+ GF + + RN V +
Sbjct: 60 ELHNHEYNQGKHSFTMAMNAFGDMTNEEFRQVMNGFQ-----NRKPRNGKVFQEPLFHEA 114
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P S+DWR+KG VT VK+Q CG+CWAFSATGA+EG TG LVSLSEQ L+DC
Sbjct: 115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQG 174
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMDYA+Q+V +N G+D+E+ YPY C K + G+ D+P+ E
Sbjct: 175 NQGCDGGLMDYAFQYVQENGGLDSEESYPYEATEESC-KYNPEYSVANDTGFVDIPK-LE 232
Query: 237 KQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSEN-GVD- 291
K L++AV P+SV I +FQ Y GI+ P S +DH VL+VGY E G D
Sbjct: 233 KALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERTGSDN 292
Query: 292 --YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG WGM+GY+ M ++ N CGI ASYPT
Sbjct: 293 SKYWLVKNSWGEKWGMDGYIKMAKDRKNH---CGIASAASYPT 332
>gi|1834307|dbj|BAA09820.1| cysteine proteinase [Spirometra erinaceieuropaei]
gi|1834309|dbj|BAA09821.1| cysteine proteinase [Spirometra erinaceieuropaei]
Length = 336
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 200/338 (59%), Gaps = 18/338 (5%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
+ F LL++ S+ Y EL++ W K Y S +E+ R + F +N F+ +
Sbjct: 8 AFLFLLLTVCRGSTESETYVR--RELWKAWKLAFKKEYFSSEEELHRKRAFFNNLDFIIR 65
Query: 63 HNN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGNLRDVP 118
HN S+ + LN F+DLT EF +L + RR+ A SV NL P
Sbjct: 66 HNQRYYQQLESYAVRLNDFSDLTPGEFAERYLCLRGIVLTKLRRKEAVSVPLKENL---P 122
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-N 177
S++WR++GAVT VK+Q CG+CW+FSA GAIEG +I TG+L SLSEQ+L+DC Y N
Sbjct: 123 DSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGN 182
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GGLM A+Q+ + +G++ E DY Y + G C + + + + + GY ++PE +E
Sbjct: 183 QGCNGGLMPQAFQYA-QRYGVEAEVDYRYTERDGVC-RYRQDLVVANVTGYAELPEGDEG 240
Query: 238 QLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CST-SLDHAVLIVGYDSENGVDYWI 294
L +AV P+SVGI ++ F YS G+F CS ++DH VL+VGY +ENG YW+
Sbjct: 241 GLQRAVATIGPISVGIDAADPGFMSYSHGVFVSKTCSPYAIDHGVLVVGYGAENGDAYWL 300
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+KNSWG SWG +GY+ M RN N +CGI +ASYPT
Sbjct: 301 VKNSWGSSWGEDGYLKMARNRNN---MCGIASMASYPT 335
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 188/319 (58%), Gaps = 22/319 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
N + W + + Y + +E+ +R ++E N + HN + G +T+ +NAF D+
Sbjct: 25 FNAQWHKWKSTYRRLYGTNEEEWRRA-VWEKNMKMIELHNGEYSEGKHGYTMEMNAFGDM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T++EF+ G+ H + R V + +P S+DWR+KG VT VK+Q CG+C
Sbjct: 84 TNEEFRQLVNGYK-----HQKHRKGKVFQEPLMLQLPKSVDWREKGCVTPVKNQGQCGSC 138
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSA GA+EG + TG LVSLSEQ L+DC ++ N GC GGLMD+A+Q+V+ N G+D+
Sbjct: 139 WAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYVLNNKGLDS 198
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
E+ YPY + G C K K GY D+P+ EK L++AV P+++ I S +F
Sbjct: 199 EESYPYEAKDGTC-KYKPEFAAANDTGYVDIPQ-LEKALMKAVATVGPIAIAIDASHPSF 256
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYWIIKNSWGRSWGMNGYMHMQR 313
Q YSSGI+ P S LDH VL+VGY E N YWI+KNSWG SWGM G+ H+ +
Sbjct: 257 QFYSSGIYYEPNCSSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSWGMGGFFHIAK 316
Query: 314 NTGNSLGICGINMLASYPT 332
+ N CG+ ASYPT
Sbjct: 317 DKNNH---CGVATAASYPT 332
>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
Length = 331
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 145/337 (43%), Positives = 195/337 (57%), Gaps = 22/337 (6%)
Query: 7 FLLSIL---LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
FLL+ L ++S+ P + S ++ ++E W +H K Y+ E Q+R ++E+N + H
Sbjct: 5 FLLATLCLGVVSAAPAHNPS-LDAVWEEWKTKHKKTYNMNDEGQKR-AVWENNKKMIDLH 62
Query: 64 NN---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N G F+L +NAF DLT+ EF+ GF + Q P L DVP S
Sbjct: 63 NEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQGQKT---KMMMKVFQEP-LLGDVPKS 118
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR G VT VKDQ SCG+CWAFSA G++EG TG LV LS Q L+DC S N G
Sbjct: 119 VDWRDHGYVTPVKDQGSCGSCWAFSAVGSLEGQMFRKTGKLVPLSVQNLVDCSWSQGNQG 178
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGL D A+Q+V N G+DT YPY G C N T+ G+ +V +++E L
Sbjct: 179 CDGGLPDLAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKNS-AATVTGFVNV-QSSEDAL 236
Query: 240 LQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE-NGVDYWII 295
++AV P+SVGI ++FQ Y G++ P ST LDHAVL+VGY E +G YW++
Sbjct: 237 MKAVATVGPISVGIDTKHKSFQFYKEGMYYEPDCSSTVLDHAVLVVGYGEESDGRKYWLV 296
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
KNSWGR WGMNGY+ M ++ N+ CGI ASYP
Sbjct: 297 KNSWGRDWGMNGYIKMAKDRNNN---CGIASDASYPV 330
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 186/319 (58%), Gaps = 17/319 (5%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
D++ ++ W H K Y +E +R+ ++E N + HN ++G S+ L +N F D
Sbjct: 39 DLDSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMNQFGD 97
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
+T +EF+ G+ + + R + P L + P S+DWR+KG VT VKDQ CG+
Sbjct: 98 MTAEEFRQLMNGYKHKKSER-KYRGSQFLEPSFL-EAPRSVDWREKGYVTPVKDQGQCGS 155
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGID 199
CWAFS TGA+EG + TG LVSLSEQ L+DC R N GC GGLMD A+Q+V N GID
Sbjct: 156 CWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGID 215
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERA 258
+E+ YPY + + + K + G+ D+P+ +E+ L++AV + PVSV I +
Sbjct: 216 SEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSS 275
Query: 259 FQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYWIIKNSWGRSWGMNGYMHMQ 312
FQ Y SGI+ P S LDH VL+VGY E +G YWI+KNSWG WG GY++M
Sbjct: 276 FQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMA 335
Query: 313 RNTGNSLGICGINMLASYP 331
++ N CGI ASYP
Sbjct: 336 KDRKNH---CGIATAASYP 351
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 116/234 (49%), Positives = 154/234 (65%), Gaps = 10/234 (4%)
Query: 102 RRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSL 161
R N SV + +PA+IDWR GAVT +KDQ CG CWAFSA A EGI KI TG L
Sbjct: 7 RYENVSVDA------IPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKL 60
Query: 162 VSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
+SLSEQEL+DCD + GC GGLMD A++F+IKN G+ TE +YPY G+C + +
Sbjct: 61 ISLSEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKC--KSGSN 118
Query: 221 HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVL 280
I GY+DVP N+E L++AV QPVSV + G + FQ YS G+ TG C T LDH +
Sbjct: 119 SAANIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIA 178
Query: 281 IVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
+GY + +G YW++KNSWG +WG NGY+ M+++ + G+CG+ + SYPT+
Sbjct: 179 AIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYPTE 232
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 194/315 (61%), Gaps = 16/315 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ +E W K +GK Y + ++ R I+E N FVT HN +MG S+ LS+N +D+
Sbjct: 39 LDNHWELWKKTYGKQYEEQNQEVTRRLIWEKNLKFVTLHNLEHSMGLHSYDLSMNHLSDM 98
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E AS + S+ I + RN + + N + +P S+DWR KG VTEVK Q +CG+C
Sbjct: 99 TSEEV-ASLM--SSLRIPNQWSRNTTYRLNSNQK-LPDSVDWRDKGCVTEVKYQGTCGSC 154
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC---DRSYNSGCGGGLMDYAYQFVIKNHGI 198
WAFSA GA+E K+ TG LVSLS Q L+DC ++ N GC GG M A+Q++I N+GI
Sbjct: 155 WAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTNEKYENHGCNGGCMTEAFQYIIDNNGI 214
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSER 257
D++ YPY+ + G+C NR T Y ++P +E L +AV + PVSVGI S
Sbjct: 215 DSDASYPYKAKDGKCQYNPANR-AATCSRYTELPYGSEDALKEAVANKGPVSVGIDASLP 273
Query: 258 AFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTG 316
+F LY SG++ P C+ +++H VL+ GY + +G DYW++KNSWG S+G GY+ + RN G
Sbjct: 274 SFFLYKSGVYYDPSCTQNVNHGVLVTGYGNLDGKDYWLVKNSWGLSFGDKGYIRIARNRG 333
Query: 317 NSLGICGINMLASYP 331
N CGI SYP
Sbjct: 334 NH---CGIANFPSYP 345
>gi|46251290|gb|AAS84611.1| cathepsin L-like cysteine proteinase I variant form precursor
[Heterodera glycines]
Length = 374
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 186/319 (58%), Gaps = 17/319 (5%)
Query: 25 INELFETWC---KQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAF 78
I F W ++HGKAY+ ++ + +R+ + F+ +HN G SF +
Sbjct: 59 IERGFSDWNAYKQKHGKAYADQEVENERMLTYLSAKQFIDKHNEAYKEGKVSFRVGETHI 118
Query: 79 ADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASC 138
ADL E++ GF D RR ++ +P N+ D+P S+DWR KG VTEVK+Q C
Sbjct: 119 ADLPFSEYQ-KLNGFRRLMGDSLRRNASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMC 177
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 197
G+CWAFSATGA+EG + G LVSLSEQ LIDC + Y N GC GG+MD A+Q++ N G
Sbjct: 178 GSCWAFSATGALEGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNKG 237
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSE 256
ID E YPY+ + G+ K N T GY D+ E +E+ L AV Q PVSV I
Sbjct: 238 IDKETAYPYKAKTGKKCLFKRNDVGATDSGYNDIAEGDEEDLRMAVATQGPVSVAIDAGH 297
Query: 257 RAFQLYSSGI-FTGPCS-TSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQ 312
R+FQLY++G+ F C +LDH VL+ GY D G DYWI+KNSWG WG GY+ M
Sbjct: 298 RSFQLYTNGVYFEKECDPQNLDHGVLVEGYGTDPTQG-DYWIVKNSWGTRWGEQGYIRMA 356
Query: 313 RNTGNSLGICGINMLASYP 331
RN N+ CGI AS+P
Sbjct: 357 RNRNNN---CGIASHASFP 372
>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
Length = 229
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 114/196 (58%), Positives = 138/196 (70%), Gaps = 1/196 (0%)
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 198
G+CWAFSA A+EG+NKI+TG LVSLSEQEL+DCD N GC GGLMDYA+Q++ +N G+
Sbjct: 13 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQYIQRNGGV 72
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 258
TE +YPY + CNK K H VTIDGY+DVP NNE L +AV +QPV+V I S +
Sbjct: 73 TTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVAIEASGQD 132
Query: 259 FQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
FQ YS G+FTG C T LDH V VGY + +G YW +KNSWG WG GY+ MQR +
Sbjct: 133 FQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIRMQRGVPD 192
Query: 318 SLGICGINMLASYPTK 333
S G+CGI M SYPTK
Sbjct: 193 SRGLCGIAMEPSYPTK 208
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 204/329 (62%), Gaps = 22/329 (6%)
Query: 14 LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSS 70
++ L++ S NE ++ KQHG+ Y +E+++R +IF+ N ++ +HN ++G S
Sbjct: 28 VTKARLSFASYTNEWV-SFKKQHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKS 86
Query: 71 FTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD----VPASIDWRKK 126
+ L +N FAD+ ++EF+ ++ D++ R VQ +L P +DWRKK
Sbjct: 87 YYLGINQFADMKNEEFRM----YNGLRRDYNYSR--EVQCSNHLTPEYLVAPDEVDWRKK 140
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLM 185
G VT VK+Q CG+CW+FS TG++EG + +G LVSLSEQ+L+DC + N GC GGLM
Sbjct: 141 GYVTAVKNQGQCGSCWSFSTTGSLEGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLM 200
Query: 186 DYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV- 244
D A++++I N GI+TE++YPY + +C+ +K + T G DV +E L +V
Sbjct: 201 DQAFEYIITNGGIETEEEYPYDARQERCHFKK-SEVAATASGCVDVKSGDETDLKNSVAE 259
Query: 245 AQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRS 302
PVS+ I S ++FQLYS G++ P ST LDH VL+VGY +++G DYW++KNSWG +
Sbjct: 260 VGPVSIAIDASHQSFQLYSGGVYDEPKCSSTELDHGVLVVGYGTDDGQDYWLVKNSWGTT 319
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYP 331
WG+ GY+ M RN N CG+ ASYP
Sbjct: 320 WGLEGYVKMSRNQDNQ---CGVATQASYP 345
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 200/340 (58%), Gaps = 28/340 (8%)
Query: 7 FLLSIL---LLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
F+L+ L ++S+LP ++ ++ W HG+ Y +E +R ++E N + H
Sbjct: 5 FVLAALCLGIVSALP-KLDQTLDAQWDQWKAAHGRLYGLNEEGWRR-AVWEKNLRMIELH 62
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N + G SFTL +N F D+T++EF+ GF H + + + L +P S
Sbjct: 63 NGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQ-----HQKHKTGKMYQEPLLLQLPKS 117
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+KG VTEVK+Q CG+CWAFSATG++EG TG+LVSLSEQ L+DC R N G
Sbjct: 118 VDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCSRPQGNQG 177
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD+A+Q+V N G++ EK YPY G+ G+C K K G+ DVP+ +++
Sbjct: 178 CNGGLMDFAFQYVKDNKGLEAEKSYPYVGKDGEC-KYKPELSAANDTGFVDVPQ--REKV 234
Query: 240 LQAVVAQ--PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYD---SENGV-D 291
+Q +A P+SV I ++FQ Y GI+ P S L+H VL+VGY SE G D
Sbjct: 235 VQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDASETGKGD 294
Query: 292 YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
YW+IKNSWG +WG +GY+ + RN N CG+ ASYP
Sbjct: 295 YWLIKNSWGTTWGADGYVKIARNRNNH---CGVATAASYP 331
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/337 (41%), Positives = 200/337 (59%), Gaps = 17/337 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ + ++LL SS D ++ ++ W K +GK Y + E+ R I+E N V
Sbjct: 1 MKWLGWALLLCSSAMAQVHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVM 60
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
HN +MG S+ L +N D+T +E +S S+ + RN + +S N + +P
Sbjct: 61 LHNLEHSMGMHSYELGMNHLGDMTSEEVISSM---SSLRVPSQWPRNVTYKSSPN-QKLP 116
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD--RSY 176
S+DWR+KG VTEVK Q +CG+CWAFSA GA+E K+ TG LVSLS Q L+DC +
Sbjct: 117 DSLDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYG 176
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GG M A+Q++I N+GID+E YPY+ G+C NR T Y ++P +E
Sbjct: 177 NKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGRCQYDVKNR-AATCSRYIELPFGSE 235
Query: 237 KQLLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWI 294
+ L +AV + PVSVGI + +F LY +G++ P C+ +++H VL+VGY S NG DYW+
Sbjct: 236 EALKEAVANKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYGSLNGKDYWL 295
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG ++G GY+ M RN+GN CGI SYP
Sbjct: 296 VKNSWGLNFGDQGYIRMARNSGNH---CGIANFPSYP 329
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 186/322 (57%), Gaps = 17/322 (5%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN-SSFTLSLNAFADLTHQE 85
E F+ W ++ + Y++ +E QQR ++ +N F+ N + SS+ L N F DLT +E
Sbjct: 38 ERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEE 97
Query: 86 FKASFL--------GFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQAS 137
FK ++L A A + + N + P S+DWR KGAVT VK+Q
Sbjct: 98 FKDTYLMKLDEQPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQ 157
Query: 138 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYQFVIKNH 196
CG+CWAF+ +IEG+++I TG LVSLSEQE++DCDR N GC GG A ++V +N
Sbjct: 158 CGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNG 217
Query: 197 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSE 256
G+ TE DYPY G QC KL H I GY+ V NE +L +AV +PV+V I S
Sbjct: 218 GLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDAS- 276
Query: 257 RAFQLYSSGIFTGPC-STSLDHAVLIVGYDSENGV-----DYWIIKNSWGRSWGMNGYMH 310
RAFQ Y G+F+GPC +T+++HAV +VGY S YWI+KNSWG+ WG NGY+
Sbjct: 277 RAFQFYKRGVFSGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVR 336
Query: 311 MQRNTGNSLGICGINMLASYPT 332
M R G+C I + P+
Sbjct: 337 MARRVRAREGMCAIAIEPLLPS 358
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 188/310 (60%), Gaps = 13/310 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQE 85
+E+W ++GK+Y E+ R +++E N V QHN + G +++ L +N +ADL ++E
Sbjct: 19 WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
F A L S+ + + + P +P+S+DWR +G VT VKDQ CG+CW+FS
Sbjct: 79 FMA--LKGSSGILQAKDQSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFS 136
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
ATG++EG + TG+LVSLSEQ+L+DC SY N GC GGLM+ AY ++ G+ E Y
Sbjct: 137 ATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAY 196
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYS 263
PY Q G+C+ + ++ + T G+ +P +E+ L+QAV PV+V I S FQLY
Sbjct: 197 PYTAQNGRCHFDQ-SKAVATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYE 255
Query: 264 SGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
SG++ S+SLDH VL GY +E G DYW++KNSWG WG GY+ M RN N
Sbjct: 256 SGVYDRSRCSSSSLDHGVLAAGYGTEGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQ--- 312
Query: 322 CGINMLASYP 331
CGI +A YP
Sbjct: 313 CGIATMACYP 322
>gi|403371627|gb|EJY85692.1| Cysteine protease [Oxytricha trifallax]
Length = 384
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 137/345 (39%), Positives = 198/345 (57%), Gaps = 17/345 (4%)
Query: 3 SLAFFLLSILLLS---SLPLNYCSDINELFET----WCKQHGKAYSSEQEKQQRLKIFED 55
+LA F +SI + S +N S +N ET + +H K++ +++E + RL F +
Sbjct: 41 ALALFGISINSQNGGLSDRMNLASKVNPEVETAFNNFLARHSKSFLTKEEFRARLSNFRN 100
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASV-----QS 110
+ V HN++ S+F + LN F+D + E D D + + ++
Sbjct: 101 TFEEVKLHNSIQGSNFKMGLNQFSDWSQSEIDEMLQFKEPLDTDEDNTNDEDLDQTLLKA 160
Query: 111 PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
G+L PASIDWR KGAVT V DQ C +C+ FSA A+EG +I TG L+ +S+Q+L+
Sbjct: 161 DGDLLQAPASIDWRAKGAVTPVLDQGRCSSCYTFSAAHAVEGAYQIKTGKLIEMSKQQLL 220
Query: 171 DCD-RSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGY 228
+C R Y NSGC GG M AY++ +K++ + ++ YPY G AG C K ++ I + Y
Sbjct: 221 ECSGRPYGNSGCRGGYMTNAYKY-LKDNKLQSDASYPYTGTAGTC-KHDASKGITNVVSY 278
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIF-TGPCSTSLDHAVLIVGYDSE 287
+P N+ LL AV QPVS+ I S A Y SGI T C T+++HAV +VGY SE
Sbjct: 279 TALPANDPTALLNAVAKQPVSIAIYASSSALLAYKSGIVDTAKCGTNVNHAVTLVGYGSE 338
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
NG+DYWIIKNSWG WG G++ ++R+ GICGI L+S PT
Sbjct: 339 NGIDYWIIKNSWGAKWGEKGFIRIKRDMTKGPGICGIYKLSSIPT 383
>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
Length = 331
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 134/310 (43%), Positives = 188/310 (60%), Gaps = 15/310 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQE 85
+ W K +GK Y + E+Q R I+E N FV HN +MG S+ L +N D+T +E
Sbjct: 28 WHLWKKTYGKHYQEKNEEQVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEE 87
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
++ S+ + RN + +S N + +P S+DWR+KG VTEVK Q +CG+CWAFS
Sbjct: 88 VRSLM---SSLRVPRQWLRNVTYKSDPNQK-LPDSVDWREKGCVTEVKYQGACGSCWAFS 143
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
A GA+EG K+ TG LVSLS Q L+DC ++ N GC GG M A+Q+VI N+GID+E
Sbjct: 144 AVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNKGCSGGFMTEAFQYVIDNNGIDSETS 203
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAFQLY 262
YPY+ +C+ NR T Y ++P +E+ L +AV + PVSV + S +F LY
Sbjct: 204 YPYKATDEKCHYDSKNR-AATCSRYTELPYGSEEALKEAVANKGPVSVAVDASRPSFFLY 262
Query: 263 SSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
+G++ P C+ ++ H VL VGY + NG DYW++KNSWG +G GY+ M RN GN
Sbjct: 263 KNGVYDDPSCTQNVTHGVLAVGYGNLNGKDYWLVKNSWGLYFGDQGYIRMARNKGNH--- 319
Query: 322 CGINMLASYP 331
CGI +SYP
Sbjct: 320 CGIASYSSYP 329
>gi|291224872|ref|XP_002732426.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 691
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 138/303 (45%), Positives = 191/303 (63%), Gaps = 17/303 (5%)
Query: 37 GKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGF 93
GK Y+S+++ +++ I+ N V HN G SS+T+ +N F D+T++EF G+
Sbjct: 396 GKVYNSDEDGVRQM-IWSQNKKNVELHNMKYRKGESSYTMEMNQFGDMTNKEFTDMMCGY 454
Query: 94 SAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGI 153
+ R+++ +P N + P S+DWR KG VTEVKDQ +CG+CWAFS TG++EG
Sbjct: 455 KGKK--QNSPRSSTFLAPSNYK-APDSVDWRTKGYVTEVKDQGACGSCWAFSTTGSMEGQ 511
Query: 154 NKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQ 212
+ TG LVS SEQ+L+DC SY N GCGGGLMD A+ + I+++GI+ E DYPY +
Sbjct: 512 SFKNTGKLVSFSEQQLVDCSGSYGNMGCGGGLMDQAFAY-IEDYGIEPEADYPYTAKDDP 570
Query: 213 CNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFTGP- 270
C+ ++ + T GY D+ +EK L QAV P+SV I S +F+LY SG++ P
Sbjct: 571 CSYDT-SKAVATNTGYTDIATMDEKALQQAVATVGPISVAIDASHSSFRLYKSGVYDEPA 629
Query: 271 CS-TSLDHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLA 328
CS T LDH VL VGY +++G DYWI+KNSWG +WG GY+HM RN N CGI A
Sbjct: 630 CSQTMLDHGVLAVGYGTTDDGNDYWIVKNSWGSTWGNQGYIHMSRNNDNQ---CGIATNA 686
Query: 329 SYP 331
SYP
Sbjct: 687 SYP 689
>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 187/314 (59%), Gaps = 18/314 (5%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQE 85
F W + GK+Y S E+ R +I+ N V HN + G S+ L + FAD+ ++E
Sbjct: 26 FHAWRLKFGKSYDSPSEESHRKQIWLTNRKHVLMHNILADQGFKSYRLGMTYFADMENEE 85
Query: 86 FKASF----LGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+K LG AS+ RR + ++ P + D+P ++DWR++G VT VKDQ CG+C
Sbjct: 86 YKKLVSRGCLGSFNASLP--RRGSTFLRLPEGI-DLPDAVDWREQGYVTGVKDQKQCGSC 142
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSATGA+EG + TG LVSLSEQ+L+DC +Y N GC GG MD A++++ N GIDT
Sbjct: 143 WAFSATGALEGQHFRKTGILVSLSEQQLVDCSGAYGNEGCNGGWMDSAFRYIEANGGIDT 202
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAF 259
E YPY + C + T GY DV + +E+ L +AV PVSV I S +F
Sbjct: 203 EASYPYEAEDWLCRYNPASVG-ATCSGYVDVNKYDEEALKEAVATIGPVSVAIDASHASF 261
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
Q Y+SG++ P S LDH VL VGY +ENG DYW++KNSWGR WG GY+ M RN N
Sbjct: 262 QFYTSGVYDEPGCSSIELDHGVLAVGYGTENGHDYWLVKNSWGRGWGEMGYIKMSRNKHN 321
Query: 318 SLGICGINMLASYP 331
CGI ASYP
Sbjct: 322 Q---CGIASAASYP 332
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 133/322 (41%), Positives = 187/322 (58%), Gaps = 25/322 (7%)
Query: 29 FETWCKQHGKAY----SSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
+ +W K++ K + S E + ++F+ N + +HN N G S+ + LN FA L
Sbjct: 27 WSSWVKEYNKEHWVDPYSSPESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHL 86
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +EF A +LG+ A ++ + R A + ++PAS+DWR+KGAV EVK+Q +CG+C
Sbjct: 87 TFEEFSAQYLGYGGAEVEQPKTRRAGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSC 146
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKN--HGI 198
WAFSA A+EG + + +G L+SLSEQ+L+DC + + N GC GG MD A+++ + N HG
Sbjct: 147 WAFSAVAALEGAHFLNSGELISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGD 206
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQPVSVGICGSER 257
D+EKDYPY+G G+C K + TI GY DV + NE LL AV PVSV I
Sbjct: 207 DSEKDYPYKGMDGKC-KFSADGVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHAGA- 264
Query: 258 AFQLYSSGIF---TGPCSTSLDHAVLIVGYDSEN-----GVDYWIIKNSWGRSWGMNGYM 309
A Q Y G+F G C L+H V VGY + + +DYWIIKNSWG WG G++
Sbjct: 265 ALQFYLRGVFNGVAGTCFGPLNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFV 324
Query: 310 HMQRNTGNSLGICGINMLASYP 331
R +CG+ ASYP
Sbjct: 325 RFARGK----NLCGVANGASYP 342
>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
Length = 332
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 141/339 (41%), Positives = 194/339 (57%), Gaps = 24/339 (7%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
LA F L I +S + ++ + W H K Y +E ++R I+E N + +H
Sbjct: 7 LAAFCLGI---ASAAPRHDHSLDADWYKWKATHRKLYGLNEEGRRRA-IWEKNMKMIERH 62
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N G SFT+++NAF D+T++EF+ + GF + + + V P S
Sbjct: 63 NWEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQ-----NQKHKKGKVFLDAGSALTPHS 117
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+KG VT VK+Q CG+CWAFSATGA+EG T L+SLSEQ L+DC N G
Sbjct: 118 VDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEG 177
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A+Q++ N G+D+E+ YPY G+ G C K K GY D+P+ EK L
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSC-KYKPQSSAANDTGYVDIPK-QEKAL 235
Query: 240 LQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGV---DYW 293
++AV P+SVGI S +FQ YS+GI+ P S LDH VL+VGY E YW
Sbjct: 236 MKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYW 295
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
++KNSWG +WGM+GY+ M ++ N CGI +ASYP
Sbjct: 296 LVKNSWGNTWGMDGYIKMTKDQNNH---CGIATMASYPV 331
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 189/314 (60%), Gaps = 15/314 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ + W K + K Y E E+ R I+E N FV HN +MG S+ L +N D+
Sbjct: 24 LDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E S +G + + +RN + +S N + +P S+DWR+KG VTEVK Q SCGAC
Sbjct: 84 TGEEV-ISLMG--SLRVPSQWQRNVTYRSNSN-QKLPDSVDWREKGCVTEVKYQGSCGAC 139
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNHGID 199
WAFSA GA+E K+ TG LVSLS Q L+DC ++ N GC GG M A+Q++I N+GID
Sbjct: 140 WAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGID 199
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERA 258
+E YPY+ G+C R T Y ++P +E L +AV + PVSV I S +
Sbjct: 200 SEASYPYKAMNGKCRYDSKKR-AATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYS 258
Query: 259 FQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
F LY SG++ P C+ +++H VL+VGY + NG DYW++KNSWG ++G GY+ M RN+GN
Sbjct: 259 FFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGN 318
Query: 318 SLGICGINMLASYP 331
CGI SYP
Sbjct: 319 H---CGIASYPSYP 329
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 141/345 (40%), Positives = 205/345 (59%), Gaps = 28/345 (8%)
Query: 4 LAFFLLSILLLSSLPLNYCSDI--NELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ F LL L +S++ DI ++ + +W QHGK+Y + E +R+ I+E+N +
Sbjct: 1 MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-- 116
QHN + GN +F + +N F D+T++EF+ + G+ HD N + Q P +
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYK-----HDP--NQTSQGPLFMEPSF 112
Query: 117 --VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR 174
P +DWR++G VT VKDQ CG+CW+FS+TGA+EG TG L+S+SEQ L+DC R
Sbjct: 113 FAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172
Query: 175 SY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPE 233
+ N GC GGLMD A+Q+V +N G+D+E+ YPY + + ++ I G+ D+P+
Sbjct: 173 PHGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPK 232
Query: 234 NNEKQLLQAVVA-QPVSVGICGSERAFQLYSSGI-FTGPCSTS-LDHAVLIVGYDSEN-- 288
NE L+ AV A PVSV I S ++ Q Y SGI + CS+S LDHAVL+VGY +
Sbjct: 233 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGAD 292
Query: 289 --GVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
G YWI+KNSW WG GY++M ++ N CGI +ASYP
Sbjct: 293 VAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNH---CGIATMASYP 334
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 189/314 (60%), Gaps = 15/314 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ + W K + K Y E E+ R I+E N FV HN +MG S+ L +N D+
Sbjct: 32 LDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDM 91
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E S +G + + +RN + +S N + +P S+DWR+KG VTEVK Q SCGAC
Sbjct: 92 TGEEV-ISLMG--SLRVPSQWQRNVTYRSNSN-QKLPDSVDWREKGCVTEVKYQGSCGAC 147
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNHGID 199
WAFSA GA+E K+ TG LVSLS Q L+DC ++ N GC GG M A+Q++I N+GID
Sbjct: 148 WAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGID 207
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERA 258
+E YPY+ G+C R T Y ++P +E L +AV + PVSV I S +
Sbjct: 208 SEASYPYKAMNGKCRYDSKKR-AATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYS 266
Query: 259 FQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
F LY SG++ P C+ +++H VL+VGY + NG DYW++KNSWG ++G GY+ M RN+GN
Sbjct: 267 FFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGN 326
Query: 318 SLGICGINMLASYP 331
CGI SYP
Sbjct: 327 H---CGIASYPSYP 337
>gi|308322281|gb|ADO28278.1| cathepsin L [Ictalurus furcatus]
Length = 359
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/336 (41%), Positives = 204/336 (60%), Gaps = 27/336 (8%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN--- 64
L+++ + SLPL DI F+ W ++ GK Y S +E+ QR K +++N+ V HN
Sbjct: 10 LMALANVDSLPL----DIE--FQEWKQKFGKIYKSVEEESQRKKTWQENHKLVMNHNILA 63
Query: 65 NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV-----PA 119
+ G S+ L +N FAD+++QE++ S + +R N S + LR V P
Sbjct: 64 DKGIKSYRLGMNYFADMSNQEYRQSVF---KGCLSFNRTLNHSAATF--LRQVGGPALPN 118
Query: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NS 178
+++W + G VTEV++Q C +CWAFSATGA+EG TG LVSLS+Q+L+DC + + N+
Sbjct: 119 TVNWTQMGYVTEVEEQKQCNSCWAFSATGALEGQTFKKTGKLVSLSKQQLVDCSKKFGNN 178
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GGLM++A+++V +N G+ TE+ YPY + G C + L VT G+ + +E
Sbjct: 179 GCKGGLMNWAFEYVKENGGLHTEESYPYEAKDGSC-RDNLGTVGVTCTGHVQINSEDENA 237
Query: 239 LLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGYDSENGVDYWII 295
L +AV P+SV I + +FQLY SG++ P CS T ++H VL VGY +++G DYW+I
Sbjct: 238 LQEAVATIGPISVAIDANHTSFQLYESGLYDEPDCSCTDMNHGVLAVGYGTDDGKDYWLI 297
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
KNSWG +WG GY+ M RN N CGI ASYP
Sbjct: 298 KNSWGINWGDKGYIKMSRNKNNQ---CGIATAASYP 330
>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 192/317 (60%), Gaps = 16/317 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGNSSFTLSLNAFADL 81
+ ++ W + K Y++ +E+ R++IF +NY FV HN +G +++ +LNAFADL
Sbjct: 26 LQSIWRGWKVANNKTYATLREEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADL 85
Query: 82 THQEFKASFLGFSAASIDHDRRRNAS--VQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
T +EF +L ++ + ++ V+ P + VP SIDWRKKG VT +KDQ CG
Sbjct: 86 TLEEFAEKYLTLKQTPMEGIWQDMSTQYVERPTRML-VPDSIDWRKKGLVTPIKDQGDCG 144
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHGI 198
+CWAFSATGA+EG K TG L+SLSEQ+L+DC + N GC GG M+ A+++ ++N G
Sbjct: 145 SCWAFSATGALEGQLKRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMNDAFRYWMRN-GA 203
Query: 199 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL-LQAVVAQPVSVGICGSER 257
++E DYPY G+C K ++ + + + VP+ E QL L PVSV I +
Sbjct: 204 ESESDYPYTAMDGKC-KFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSVAIDATSS 262
Query: 258 AFQLYSSGIFT-GPCSTS-LDHAVLIVGYDSENGVD-YWIIKNSWGRSWGMNGYMHMQRN 314
F LY GI+ CS LDHAVL+VGYD++ YWI+KNSWG WG GY+ M R+
Sbjct: 263 GFMLYKKGIYQDNTCSQQYLDHAVLVVGYDADKTRQKYWIVKNSWGEDWGQRGYIWMARD 322
Query: 315 TGNSLGICGINMLASYP 331
GN +CGI +ASYP
Sbjct: 323 KGN---MCGIATMASYP 336
>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
Length = 330
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 146/336 (43%), Positives = 193/336 (57%), Gaps = 21/336 (6%)
Query: 7 FLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
FLL+ L L + D ++ ++E W +H K YS +E Q+R ++E+N + HN
Sbjct: 5 FLLATLCLGVVSAAPAHDPSLDAVWEEWKTKHRKTYSMNEEAQKR-AVWENNMKMIGLHN 63
Query: 65 N---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
G F L +NAF DLT+ EF+ GF S+ H + Q P L DVP S+
Sbjct: 64 EDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGH--KEMTIFQEP-LLGDVPKSV 118
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGC 180
DWR G VT VKDQ CG+CWAFSA G++EG TG LV LSEQ L+DC SY N GC
Sbjct: 119 DWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGC 178
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGLM+ A+Q+V +N G+DT + Y Y G C + V I G+ VP +E L+
Sbjct: 179 NGGLMELAFQYVKENRGLDTRESYAYEAWDGPC-RYDPKYSAVNITGFVKVPL-SEDALM 236
Query: 241 QAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE-NGVDYWIIK 296
AV + PVSVGI +F+ Y G + P ST+LDHAVL+VGY E +G YW++K
Sbjct: 237 NAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDGRKYWLVK 296
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
NSWG WGM+GY+ M ++ N+ CGI A YPT
Sbjct: 297 NSWGEDWGMDGYIKMAKDRDNN---CGIATYAIYPT 329
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/338 (40%), Positives = 201/338 (59%), Gaps = 17/338 (5%)
Query: 3 SLAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
++ + + ++LL SS D ++ ++ W K +GK Y + E+ R I+E N V
Sbjct: 11 TMNWLVWALLLCSSAMAQVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV 70
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
T HN +MG S+ L +N D+T +E + S+ + RN + +S N + +
Sbjct: 71 TLHNLEHSMGMHSYELGMNHLGDMTSEEVISLM---SSLRVPSQWPRNVTYKSDPN-QKL 126
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P S+DWR+KG VTEVK Q +CG+CWAFSA GA+E K+ TG LVSLS Q L+DC +
Sbjct: 127 PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKY 186
Query: 177 -NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N GC GG M A+Q++I N+GID+E YPY+ G+C NR T Y ++P +
Sbjct: 187 GNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNR-AATCSRYIELPFGS 245
Query: 236 EKQLLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYW 293
E+ L +AV + PVSVGI S +F LY +G++ P C+ +++H VL+VGY + +G DYW
Sbjct: 246 EEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYW 305
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
++KNSWG +G GY+ M RN+GN CGI SYP
Sbjct: 306 LVKNSWGLHFGDQGYIRMARNSGNH---CGIASYPSYP 340
>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 327
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 140/314 (44%), Positives = 193/314 (61%), Gaps = 20/314 (6%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQE 85
+ W HGK+Y+S +E +++L I+E N VTQHN + G ++T+++ FADL + E
Sbjct: 23 WNEWKNTHGKSYASHEELKRQL-IWEKNLRVVTQHNYEYDEGLHTYTMAMTKFADLENDE 81
Query: 86 FKASFLGFSAASIDHDRRRN-ASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
F A +L + D R S Q G + P SIDWR +G VT VK+Q CG+CWAF
Sbjct: 82 FAAMYL----PRMRKDSRNGFCSAQPVGGFVENPTSIDWRTRGYVTPVKNQLQCGSCWAF 137
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
S TG++EG + T +LVSLSEQ+L+DC + + GCGGG+MDYA+ ++ G+++E D
Sbjct: 138 STTGSLEGQHFAKTKNLVSLSEQQLMDCSFKEGDEGCGGGIMDYAFDYIFLAGGVESEAD 197
Query: 204 YPYRGQAGQCNKQKLNRHI-VTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQL 261
YPY + C N I T+ G DV +E QL +AV + PVSV I S +FQL
Sbjct: 198 YPYEARNDHCRFD--NSSIAATLTGCVDVTSGSETQLEKAVGSIGPVSVAIDASHISFQL 255
Query: 262 YSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG-MNGYMHMQRNTGNS 318
Y SG+ P +T+LDH VL VGY ++NG +YWI+KNSWG WG +NGY+ M +N N+
Sbjct: 256 YGSGVNYEPMCSTTTLDHGVLAVGYGADNGNEYWIVKNSWGEGWGHLNGYIKMSKNRNNN 315
Query: 319 LGICGINMLASYPT 332
CGI ASYPT
Sbjct: 316 ---CGIATQASYPT 326
>gi|339765072|gb|AEK01110.1| cathepsin L [Cristaria plicata]
gi|397880684|gb|AFO67888.1| cathepsin L [Cristaria plicata]
Length = 333
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 203/341 (59%), Gaps = 20/341 (5%)
Query: 1 MNSLAFFLLSILL-LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAF 59
M+SL+ ++ + L L S S +N ++ + + H K YS+ +E R ++++N
Sbjct: 1 MHSLSIPIVIVFLHLKSADGLSVSALNIGWQEFVRTHNKTYSAHEE-LFRYAVWKENVLA 59
Query: 60 VTQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD 116
+ +HN + G ++ LS+N + DLT++E+ GF ++ + R+ S+ NL +
Sbjct: 60 INRHNSKADQGVHTYWLSMNEYGDLTNEEYFRLRTGFI---MNGNIERSGSIFKYTNLSE 116
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RS 175
P +DWR+KG VT VKDQ CG+C+AFSATGA+EG + TG LVSLSEQ ++DC +
Sbjct: 117 YPRQVDWRRKGYVTRVKDQGGCGSCYAFSATGALEGQHFRKTGKLVSLSEQNIVDCSFKE 176
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPE 233
N GC GGLMD ++ ++ N+GID E+ YPY + G C + R V T GY D+PE
Sbjct: 177 GNKGCKGGLMDKSFTYIKNNNGIDKEEAYPYEARDGPC---RFRRSEVGATDRGYVDLPE 233
Query: 234 NNEKQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP-CS-TSLDHAVLIVGYDSENGV 290
N+E L AV P+SV I G F+ Y G+F P CS T ++H VL+VGY + NG+
Sbjct: 234 NDETALRHAVATIGPISVAIDGHHFNFRFYDHGVFDNPNCSKTKINHGVLVVGYGTRNGL 293
Query: 291 DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
DYW++KNSWGR WG GY+ M RN N C I ASYP
Sbjct: 294 DYWMVKNSWGRGWGAKGYILMSRNNDNQ---CCIACAASYP 331
>gi|330842502|ref|XP_003293216.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
gi|325076482|gb|EGC30264.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
Length = 376
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 151/348 (43%), Positives = 196/348 (56%), Gaps = 50/348 (14%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F W +HGK Y + QE +R IF+DN +V N+ G S L LN FADLT+ E++
Sbjct: 34 FTEWTIKHGKQYEN-QEFGRRYGIFKDNMDYVHDWNSKG-SETVLGLNIFADLTNLEYQK 91
Query: 89 SFLGFSAASIDH---DRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
+LG S+ H D R + + R+ P S+DW KKGAVT +KDQ CG+CW+FS
Sbjct: 92 YYLGTHVNSLLHRGYDGRALEEIFGSDDGRN-PTSVDWNKKGAVTPIKDQGQCGSCWSFS 150
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
TG++EG ++I TG LVSLSEQ L+DC + N GC GGLMD A+ ++I+N GIDTE Y
Sbjct: 151 TTGSVEGAHQIKTGKLVSLSEQNLVDCSGAEGNLGCDGGLMDNAFIYIIQNKGIDTESSY 210
Query: 205 PYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAFQLYS 263
PY+ Q+G K T+ GY ++ +E QL AV PVSV I S +FQLYS
Sbjct: 211 PYKAQSGTKCLFKPTSIGATLSGYVNITAGSESQLETAVAKNGPVSVAIDASHNSFQLYS 270
Query: 264 SGIFTGP-CS-TSLDHAVLIVGY-----DSEN----------------GVD--------- 291
SG++ P CS T LDH VL+VGY D N G+D
Sbjct: 271 SGVYYEPKCSPTELDHGVLVVGYGVAKKDENNASPNKHQIRIRHNDDFGIDEIVTDSSSD 330
Query: 292 -------YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
YW++KNSWG SWGM G++ M +N N+ CGI ASYPT
Sbjct: 331 DGRKTSQYWLVKNSWGVSWGMQGFIQMSKNRKNN---CGIASCASYPT 375
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 189/314 (60%), Gaps = 15/314 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ + W K +GK Y+ + E+ +R I+E N FV HN +MG S+ L +N D+
Sbjct: 24 LDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E + + + +RN + +S N + +P S+DWR+KG VTEVK Q SCG+C
Sbjct: 84 TSEEVVSLM---TCLKVPRQSQRNVTYKSSPN-QKLPDSLDWREKGCVTEVKYQGSCGSC 139
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNHGID 199
WAFSA GA+E K+ TG LVSLS Q L+DC ++ N GC GG M A+Q++I N+GID
Sbjct: 140 WAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEKYRNEGCHGGFMTEAFQYIIDNNGID 199
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERA 258
+E YPY+ +C NR T Y ++P +E+ L +AV ++ PVSV I S +
Sbjct: 200 SEASYPYKAMDEKCQYDSKNR-AATCSKYTELPFGSEEALKEAVASKGPVSVAIDASHSS 258
Query: 259 FQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
F LY SG++ P C+ ++H VL+VGY + NG DYW++KNSWG +G GY+ M RN N
Sbjct: 259 FFLYRSGVYYEPACTQVVNHGVLVVGYGNLNGNDYWLVKNSWGLYFGDKGYIRMARNREN 318
Query: 318 SLGICGINMLASYP 331
CGI +SYP
Sbjct: 319 H---CGIASYSSYP 329
>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
Length = 331
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 134/310 (43%), Positives = 188/310 (60%), Gaps = 15/310 (4%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQE 85
++ W K HGK Y + E+ R I+E N +VT HN +MG S+ LS+N D+T +E
Sbjct: 28 WDLWKKTHGKQYKGQNEEIARRLIWEKNLKYVTLHNLEHSMGLHSYDLSMNHLGDMTSEE 87
Query: 86 FKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFS 145
+ S+ I + RN + + N + +P S+DWR+KG VTEVK Q SCG+CWAFS
Sbjct: 88 VISLM---SSLRIPNQWNRNTTYRLSSNQK-LPDSVDWREKGCVTEVKYQGSCGSCWAFS 143
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNHGIDTEKD 203
A GA+E K+ TG LVSLS Q L+DC D+ N GC GG M A+Q+VI N+GID++
Sbjct: 144 AVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYVIDNNGIDSDVS 203
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAFQLY 262
YPY+ G+C +R T Y ++P +E+ L +AV + PVSVGI +F LY
Sbjct: 204 YPYKATDGKCQYNPASR-AATCSKYTELPYGSEEALKEAVANKGPVSVGIDAKTPSFFLY 262
Query: 263 SSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
SG++ P C+ ++H VL++GY + +G DYW++KNSWG +G GY+ + RN GN
Sbjct: 263 KSGVYYDPSCTQKVNHGVLVIGYGNLDGQDYWLVKNSWGLHFGDKGYVRIARNRGNH--- 319
Query: 322 CGINMLASYP 331
CGI SYP
Sbjct: 320 CGIANFPSYP 329
>gi|66394764|gb|AAY46196.1| cathepsin L-like cysteine proteinase [Globodera pallida]
Length = 379
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 191/313 (61%), Gaps = 15/313 (4%)
Query: 29 FETWCKQHG-KAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQ 84
+ + ++HG KAY+ + + +R+ + F+ +HN G +F + N ADL
Sbjct: 70 WNAYKQKHGRKAYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFS 129
Query: 85 EFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAF 144
E+K G+ D+ RR ++ +P N+ D+P S+DWR KG VTEVK+Q CG+CWAF
Sbjct: 130 EYK-KLNGYRRLLGDNLRRNASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCWAF 188
Query: 145 SATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKD 203
S+TGA+E + TG L+SLSEQ LIDC + Y N GC GG+MD A+Q++ N+G+D E D
Sbjct: 189 SSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKELD 248
Query: 204 YPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAFQLY 262
YPY+ + G+ K N T G+ D+ E +E++L AV Q P SV I R+FQLY
Sbjct: 249 YPYKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQGPASVAIDAGHRSFQLY 308
Query: 263 SSGI-FTGPCS-TSLDHAVLIVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
+ G+ F CS +LDH VL+VGY D++ G DYWI+KNSWG WG GY+ M RN N+
Sbjct: 309 THGVYFEKECSPENLDHGVLVVGYGTDAQQG-DYWIVKNSWGAHWGEQGYIRMARNRKNN 367
Query: 319 LGICGINMLASYP 331
CGI ASYP
Sbjct: 368 ---CGIASHASYP 377
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 182/306 (59%), Gaps = 13/306 (4%)
Query: 32 WCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADLTHQEFKA 88
W K H K Y+SE E+ R +I+E N +T HN ++G ++ L +N D+T +E
Sbjct: 29 WKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDMTREEILQ 88
Query: 89 SFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATG 148
F G + + RR + V S G VP S+DWR+KG VTEVK+Q SCG+CWAFSA G
Sbjct: 89 MFAG-TRVRPNLTRRSSPFVASAG--ISVPDSVDWREKGYVTEVKNQGSCGSCWAFSAAG 145
Query: 149 AIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYR 207
A+EG K TG + SLS Q L+DC Y N GC GG M A+Q+VI + GID+++ YPY
Sbjct: 146 ALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQAFQYVIDDGGIDSDEAYPYT 205
Query: 208 GQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERAFQLYSSGI 266
GQC + R Y V E +E+ L QAV P+SV I + F LY SG+
Sbjct: 206 AMDGQCRYDQSQR-AANCSSYNYVSEGDEEALKQAVATIGPISVAIDATRPMFILYHSGV 264
Query: 267 FTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
++ P C+ +++H VL+VGY S NG DYW++KNSWG +G GY+ + RN GN +CGI
Sbjct: 265 YSDPTCTQNVNHGVLVVGYGSLNGEDYWLVKNSWGTRFGDGGYIRIARNKGN---MCGIA 321
Query: 326 MLASYP 331
A YP
Sbjct: 322 NYACYP 327
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 188/334 (56%), Gaps = 26/334 (7%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS---SFTLSLNAFA 79
S + E F+ W + K+Y++ E ++R ++ N A++ N + ++ L A+
Sbjct: 46 SPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTYELGETAYT 105
Query: 80 DLTHQEFKASFLGF-SAASIDHDRR-----------RNASVQSPGNL-------RDVPAS 120
DLT+QEF A + S A + D R V + G L PAS
Sbjct: 106 DLTNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYVNLSTAAPAS 165
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180
+DWR GAVT VK+Q CG+CWAFS +EGI +I TG LVSLSEQEL+DCD + ++GC
Sbjct: 166 VDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD-TLDAGC 224
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GG+ A +++ N G+ TE+DYPY G CN+ KL + +I G + V +E L
Sbjct: 225 DGGISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLRRVATRSEASLA 284
Query: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNS 298
AV QPV+V I FQ Y G++ GPC TSL+H V +VGY + E+G YWIIKNS
Sbjct: 285 NAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQEEEDGDKYWIIKNS 344
Query: 299 WGRSWGMNGYMHMQRNT-GNSLGICGINMLASYP 331
WG SWG GY+ M+++ G G+CGI + S+P
Sbjct: 345 WGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFP 378
>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
Length = 327
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 196/329 (59%), Gaps = 13/329 (3%)
Query: 12 LLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNN---MGN 68
LL+ + + + I+ +E + HGK Y+ E E R IF +N V QHN MG
Sbjct: 3 LLIVLVCVAVATAIDNEWEAFKLLHGKQYN-EYEDTARHAIFLENCKIVKQHNEEAAMGK 61
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASV-QSPGNLRDVPASIDWRKKG 127
+F + +N F DLT++EF+ +G + ++ V +S L+ V ++DWR+KG
Sbjct: 62 HTFFMRMNKFGDLTNEEFRMLVIGSGLMQSNRTQQAEGGVFESIPGLK-VNDTVDWRQKG 120
Query: 128 AVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMD 186
AVT+VK+Q CG+CWAFS TG++EG + + +G+LVSLSEQ L+DC R N GC GGLMD
Sbjct: 121 AVTKVKNQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCKGGLMD 180
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA-VVA 245
A++++ N GIDTE+ YPY+G+ + + K + T+ + DV +E L QA
Sbjct: 181 QAFKYIKTNGGIDTEECYPYKGRDERKCEYKASCSGATLSSFVDVKTGDEDALKQASATI 240
Query: 246 QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 303
P+SVGI S +FQLY G++ S LDH VL+VGY +++ DYW++KNSWG W
Sbjct: 241 GPISVGIDASHPSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGTQSTKDYWLVKNSWGADW 300
Query: 304 GMNGYMHMQRNTGNSLGICGINMLASYPT 332
GM GY+ M RN N CGI ASYP
Sbjct: 301 GMEGYIMMSRNKDNQ---CGIATQASYPV 326
>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 136/334 (40%), Positives = 188/334 (56%), Gaps = 17/334 (5%)
Query: 8 LLSILLLSSLPLNYCS----DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
+L L+L SL + + ++ ++ W HGK Y +E E R +++E N +T H
Sbjct: 9 MLGSLMLVSLCVGAAAMFEPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLMLITMH 68
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N +MG ++ LS+N DLT +E SF S + D +R AS + DVP +
Sbjct: 69 NLEASMGLHTYELSMNHMGDLTQEEIMQSFATLSPPT---DIQRAASPFAGTTGADVPDT 125
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+KG VT VK Q SCG+CWAFSA GA+EG TG LV LS Q L+DC Y N G
Sbjct: 126 MDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHG 185
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GG M A+Q+VI N GID++ YPY G+ G+C R Y +PE NE L
Sbjct: 186 CNGGFMHQAFQYVIDNQGIDSDASYPYTGRNGECRYNSKFR-AANCSQYSFLPEGNEGAL 244
Query: 240 LQAVV-AQPVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKN 297
+A+ P+SV I + F Y SG++ P CS ++H VL VGY + +G DYW++KN
Sbjct: 245 KEALANIGPISVAIDATRPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTLDGQDYWLVKN 304
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
SWG+++G GY+ M RN + CGI + YP
Sbjct: 305 SWGKTFGDQGYIRMSRNKNDQ---CGIALYGCYP 335
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 186/319 (58%), Gaps = 17/319 (5%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+++ ++ W H K Y +E +R+ ++E N + HN +G S+ L +N F D
Sbjct: 5 ELDGHWQLWKSWHNKDYHEREESWRRV-VWEKNLKMIELHNLDHTLGKHSYKLGMNQFGD 63
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
+T +EF+ G++ + + R + P L + P S+DWR+KG VT VKDQ CG+
Sbjct: 64 MTTEEFRQLMNGYAHKKSER-KYRGSQFLEPSFL-EAPRSVDWREKGYVTPVKDQGQCGS 121
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGID 199
CWAFS TGA+EG + TG LVSLSEQ L+DC R N GC GGLMD A+Q+V N GID
Sbjct: 122 CWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGID 181
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSERA 258
+E+ YPY + + + K + G+ D+P+ +E+ L++AV A PVSV I +
Sbjct: 182 SEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSS 241
Query: 259 FQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYWIIKNSWGRSWGMNGYMHMQ 312
FQ Y SGI+ P S LDH VL+VGY E +G YWI+KNSWG WG GY++M
Sbjct: 242 FQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMA 301
Query: 313 RNTGNSLGICGINMLASYP 331
++ N CGI ASYP
Sbjct: 302 KDRKNH---CGIATAASYP 317
>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
Length = 295
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 135/299 (45%), Positives = 179/299 (59%), Gaps = 16/299 (5%)
Query: 43 EQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADLTHQEFKASFLGFSAASID 99
E E+ QR ++F +N + HN + G S FT+ +N F+D+ +EF GF +
Sbjct: 1 ETEENQRKEVFRNNIKKIQMHNYLHEQGKSPFTMGINQFSDMDEKEFSTIMNGFRMNNRT 60
Query: 100 HDRRR-NASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVT 158
R ++ SP VPA +DWRKKG VT VK+Q CG+CWAFSA GA+EG + T
Sbjct: 61 KVRDHLHSHYISPAIPVSVPAEVDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQHFRKT 120
Query: 159 GSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQK 217
G LVSLSEQ L+DC +SY N+GC GG+MDYA++++ N G DTE YPY G C +
Sbjct: 121 GKLVSLSEQNLVDCSKSYGNNGCNGGVMDYAFKYIKDNDGDDTEACYPYEAVDGMC---R 177
Query: 218 LNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAFQLYSSGIFT-GPCST 273
R V T GY D+P NE ++ +AV + PVSV I S +F Y G++ CS
Sbjct: 178 FKRECVGATCRGYTDLPWGNEVKMKEAVALVGPVSVAIDASHSSFMSYKGGVYVEKECSP 237
Query: 274 -SLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
LDH VL+VGY +E G+DYW++KNSWG +WG GY+ M RN N CGI +A YP
Sbjct: 238 YQLDHGVLVVGYGTEQGLDYWLVKNSWGTTWGDQGYIKMARNMHNH---CGIASMACYP 293
>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
Length = 349
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 145/336 (43%), Positives = 193/336 (57%), Gaps = 21/336 (6%)
Query: 7 FLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
FLL+ L L + D ++ ++E W +H K Y+ +E Q+R ++E+N + HN
Sbjct: 24 FLLATLCLGVVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNMKMIGLHN 82
Query: 65 N---MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
G F L +NAF DLT+ EF+ GF S+ H + Q P L DVP S+
Sbjct: 83 EDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGH--KEMTIFQEP-LLGDVPKSV 137
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGC 180
DWR G VT VKDQ CG+CWAFSA G++EG TG LV LSEQ L+DC SY N GC
Sbjct: 138 DWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGC 197
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGLM+ A+Q+V +N G+DT + Y Y G C + V I G+ VP +E L+
Sbjct: 198 NGGLMELAFQYVKENRGLDTRESYAYEAWDGPC-RYDPKYSAVNITGFVKVPL-SEDALM 255
Query: 241 QAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE-NGVDYWIIK 296
AV + PVSVGI +F+ Y G + P ST+LDHAVL+VGY E +G YW++K
Sbjct: 256 NAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDGRKYWLVK 315
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
NSWG WGM+GY+ M ++ N+ CGI A YPT
Sbjct: 316 NSWGEDWGMDGYIKMAKDRDNN---CGIATYAIYPT 348
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 138/337 (40%), Positives = 201/337 (59%), Gaps = 17/337 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ + + ++LL SS + D ++ ++ W K +GK Y + E+ R I+E N VT
Sbjct: 1 MNWLVWALLLCSSAMAHVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVT 60
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
HN +MG S+ L +N D+T +E + S+ + RN + +S N + +P
Sbjct: 61 LHNLEHSMGMHSYELGMNHLGDMTSEEVISLM---SSLRVPSQWPRNVTYKSDPNQK-LP 116
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-- 176
S+DWR+KG VTEVK Q +CG+CWAFSA GA+E K+ TG LVSLS Q L+DC +
Sbjct: 117 DSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYG 176
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GG M A+Q++I N+GID+E YPY+ G+C NR T Y ++P +E
Sbjct: 177 NKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNR-AATCSRYIELPFGSE 235
Query: 237 KQLLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWI 294
+ L +AV + PVSVGI S +F LY +G++ P C+ +++H VL+VGY + +G DYW+
Sbjct: 236 EALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWL 295
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG +G GY+ M RN+GN CGI SYP
Sbjct: 296 VKNSWGLHFGDQGYIRMARNSGNH---CGIANYPSYP 329
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
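The per-volume figures above can be cross-checked against the totals that the report header gives for nr (23,463,169 sequences; 8,064,228,071 total letters). The following is a minimal sketch of that check; the counts are copied from the nine volume blocks above, and the variable names are illustrative only.

```python
# (letters, sequences) for each database volume, copied from the report.
volumes = [
    (999_999_864, 2_912_245),  # nr (first volume)
    (999_999_666, 2_912_720),  # nr.01
    (999_999_938, 3_014_250),  # nr.02
    (999_999_780, 2_805_020),  # nr.03
    (999_999_551, 2_816_253),  # nr.04
    (999_999_897, 2_981_387),  # nr.05
    (999_999_649, 2_911_476),  # nr.06
    (999_999_452, 2_920_260),  # nr.07
    (64_230_274, 189_558),     # nr.08
]

total_letters = sum(letters for letters, _ in volumes)
total_sequences = sum(sequences for _, sequences in volumes)

print(f"{total_letters:,} letters in {total_sequences:,} sequences")
```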
Lambda K H
0.320 0.134 0.430
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,879,055,612
Number of Sequences: 23463169
Number of extensions: 302213758
Number of successful extensions: 1069103
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6454
Number of HSP's successfully gapped in prelim test: 1390
Number of HSP's that attempted gapping in prelim test: 1038387
Number of HSP's gapped (non-prelim): 10887
length of query: 419
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 274
effective length of database: 8,957,035,862
effective search space: 2454227826188
effective search space used: 2454227826188
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)
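The bit values in this footer and in the hit headers can be approximately reproduced from the raw scores with the standard Karlin-Altschul conversion, S' = (Lambda * S - ln K) / ln 2, and the expectation E = (effective search space) * 2^(-S'). The sketch below assumes that the first Lambda/K pair (0.320, 0.134) is the ungapped set and the second (0.267, 0.0410) the gapped set, and that drop-off values convert as Lambda * X / ln 2. All inputs are taken as printed in the report; the function names are hypothetical, and compositional matrix adjustment may change individual alignment scores, so this is an illustration rather than the program's exact procedure.

```python
import math

# Karlin-Altschul parameters as printed in the report.
UNGAPPED = {"lam": 0.320, "K": 0.134}
GAPPED = {"lam": 0.267, "K": 0.0410}


def bit_score(raw, lam, K):
    """Convert a raw alignment score to bits: (lam * S - ln K) / ln 2."""
    return (lam * raw - math.log(K)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X drop-off value to bits: lam * X / ln 2."""
    return lam * raw / math.log(2)


def expect(raw, lam, K, search_space):
    """Expected number of chance hits scoring at least `raw`."""
    return search_space * 2.0 ** (-bit_score(raw, lam, K))


# Effective lengths as reported (not recomputed from the database size).
effective_query = 419 - 145          # length of query - effective HSP length
effective_db = 8_957_035_862         # "effective length of database"
search_space = effective_query * effective_db

print("effective search space:", search_space)
print("X1 bits:", round(dropoff_bits(16, UNGAPPED["lam"]), 1))
print("X2 bits:", round(dropoff_bits(38, GAPPED["lam"]), 1))
print("S1 bits:", round(bit_score(41, **UNGAPPED), 1))
print("S2 bits:", round(bit_score(78, **GAPPED), 1))
print("raw 620 ->", round(bit_score(620, **GAPPED), 2), "bits,",
      f"E = {expect(620, search_space=search_space, **GAPPED):.1e}")
```

Under these assumptions the product of the effective lengths equals the reported effective search space, and the converted values agree with the bits shown beside X1, X2, S1, and S2 to one decimal place.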