BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017419
(372 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 563 bits (1452), Expect = e-158, Method: Compositional matrix adjust.
Identities = 263/359 (73%), Positives = 310/359 (86%), Gaps = 2/359 (0%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
+++STL+FLFF + SSA DMSI+S+++ H H SSWR+D+EV+++Y WLAKH KT N +G
Sbjct: 5 ISLSTLLFLFF-TLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKHSKTYNKLG 63
Query: 68 HNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
EKRF+IFK+NLRFIDEHN S NRTYKVGL +FADLTNEEYRA +LGT+SD KRRLMKS
Sbjct: 64 EREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTKSDPKRRLMKS 123
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
K SQRYA KAGD LPES+DWR+ GAV+ +KDQGSCGSCWAFST+AAVEG+NKIVTGELI
Sbjct: 124 KNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELI 183
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SLSEQELVDCDR NAGCNGGLMD AFQFII NGG+D+++DYPY + KCD ++ K
Sbjct: 184 SLSEQELVDCDRSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQAVDGKCDTTKVKNKA 243
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V+IDG+EDV FDEM+L+KAVA QPVSVAIEA G A Q Y+SGVFTGECGSALDHGVV V
Sbjct: 244 VTIDGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGVFTGECGSALDHGVVIV 303
Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
GYGTE+G+DYWLVRNSWG DWGENGY+K+QRN++DT TGKCGIAME+SYP+KN+QN K
Sbjct: 304 GYGTEDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGIAMESSYPIKNTQNPVK 362
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 559 bits (1441), Expect = e-157, Method: Compositional matrix adjust.
Identities = 262/342 (76%), Positives = 298/342 (87%), Gaps = 2/342 (0%)
Query: 27 MSIISYDNNH--DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID 84
MSI ++D+NH SSWR+DDEVM+IY+ WL KHGK N +G KRF+IFK+NLRFID
Sbjct: 1 MSIFNHDDNHLSHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFID 60
Query: 85 EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES 144
EHNS NRTYKVGL KFADLTN+EYRAM+LGTRSD KRRLMKSK S+RYA KAGD+LPES
Sbjct: 61 EHNSQNRTYKVGLTKFADLTNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPES 120
Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGC 204
VDWR KGAVNP+KDQGSCGSCWAFSTVAAVEGIN+IVTGELISLSEQELVDCDR NAGC
Sbjct: 121 VDWRGKGAVNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGC 180
Query: 205 NGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLK 264
NGGLMDYAFQFII NGG+D+E+DYPYLG ++ CD + K VSIDG+EDV PFDE +L+
Sbjct: 181 NGGLMDYAFQFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQ 240
Query: 265 KAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWG 324
KAVA QPVSVAIEA G A Q Y+SGVFTGECG+ALDHGVV VGYGTE G+DYWLVRNSWG
Sbjct: 241 KAVAHQPVSVAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYGTEKGLDYWLVRNSWG 300
Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
++WGE+GY+K+QRN+ DT TG+CGIAME+SYPVKN QN+AKP
Sbjct: 301 TEWGEHGYIKMQRNVRDTYTGRCGIAMESSYPVKNGQNTAKP 342
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 251/344 (72%), Positives = 295/344 (85%), Gaps = 1/344 (0%)
Query: 24 AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
AA MSII Y+ N +H SS RTD+EVM IY WLAKHGK NG+G E+RF+IFKDNL+F+
Sbjct: 19 AAHMSIIDYNTNPNHKSSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFV 78
Query: 84 DEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPE 143
DEHNS NR+YKVGLN+FADLTNEEYR+M+LGT++D+KRR MKSK AS+RYA + D LPE
Sbjct: 79 DEHNSENRSYKVGLNRFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPE 138
Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAG 203
SVDWRE GAV P+KDQGSCGSCWAFSTVAAVEG+N+I TGE+I LSEQELVDCDR +AG
Sbjct: 139 SVDWRESGAVAPIKDQGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAG 198
Query: 204 CNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSL 263
CNGGLMDYAF+FII NGG+D+E+DYPY G + CDP R+N KVVSI+ YEDV P+DEM+L
Sbjct: 199 CNGGLMDYAFEFIINNGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMAL 258
Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSW 323
KKAVA QPVSVAIEA GRAFQ Y SGVFTGECG ALDHGVV VGYGT+NG D+W+VRNSW
Sbjct: 259 KKAVAHQPVSVAIEASGRAFQLYLSGVFTGECGRALDHGVVVVGYGTDNGADHWIVRNSW 318
Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA-KP 366
G+ WGENGY++++RN++D GKCGIAM+ASYP+KN +N A KP
Sbjct: 319 GTSWGENGYIRMERNVVDNFGGKCGIAMQASYPIKNGENPANKP 362
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 544 bits (1401), Expect = e-152, Method: Compositional matrix adjust.
Identities = 256/347 (73%), Positives = 303/347 (87%), Gaps = 4/347 (1%)
Query: 22 SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLR 81
SSA +S ++ + NH SSSWR+DDEVM +Y++W+ +HGK NG+G EKRF+IFKDNLR
Sbjct: 15 SSATYISTLTLNQNHPSSSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLR 74
Query: 82 FIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
FIDEHNS N TYK+GLNKFADLTN+EYRA +LGTR+D +RRLMKSK+ S RYA +AGD
Sbjct: 75 FIDEHNSNNNTTYKLGLNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDN 134
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LP+SVDWR+ GAV+PVKDQGSCGSCWAFST+A VEGINKIV+GEL+SLSEQELVDCDR
Sbjct: 135 LPDSVDWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSY 194
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
+AGCNGGLMDYAFQFI+ NGG+D+E+DYPYLG N+CDP+++NAKVVSIDGYEDV P +E
Sbjct: 195 DAGCNGGLMDYAFQFIMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDV-PNNE 253
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLV 319
+LKKAVA QPVS+AIEAGGRAFQ YESGVF GECG ALDHGVVAVGYGT +NG DYW+V
Sbjct: 254 NALKKAVAHQPVSIAIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIV 313
Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
RNSWGS+WGENGY++++RN ++ NTGKCGIAMEASYPVKN N +P
Sbjct: 314 RNSWGSNWGENGYIRMERN-INANTGKCGIAMEASYPVKNGANIIQP 359
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 543 bits (1398), Expect = e-152, Method: Compositional matrix adjust.
Identities = 258/361 (71%), Positives = 305/361 (84%), Gaps = 12/361 (3%)
Query: 5 SMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
+M L+ISTL+FLFF++SS+A SSSWR+++EVM +YQ W+AKHGK N
Sbjct: 11 AMALSISTLLFLFFVASSAAD------------LSSSWRSEEEVMGMYQWWMAKHGKAYN 58
Query: 65 GMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
G+G EKRF+IFKDNL+FIDEHN+ NRTYKVGLN+FADLTNEEYRA+YLGTRSD KRR
Sbjct: 59 GLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNRFADLTNEEYRAIYLGTRSDPKRRFA 118
Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
K K AS RYA G+ LPESVDWRE GAVNPVKDQ SCGSCWAFSTVAAVEGIN+IVTGE
Sbjct: 119 KLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGE 178
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
LISLSEQELVDCD + + GCNGGLMDYAF FII+NGG+D+E+DYPY G + +C+ S +++
Sbjct: 179 LISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSS 238
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
KVVSIDGYEDV PFDE +L+KAVA QPVSVA+EAGGRA Q Y SG+FTGECG+ALDHG+V
Sbjct: 239 KVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIV 298
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
AVGYGTENG DYW+VRNSWGS WGENGY++++RN+ D +GKCGIAMEASYP+KN +N +
Sbjct: 299 AVGYGTENGTDYWIVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPIKNGENPS 358
Query: 365 K 365
K
Sbjct: 359 K 359
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 538 bits (1387), Expect = e-150, Method: Compositional matrix adjust.
Identities = 254/354 (71%), Positives = 294/354 (83%), Gaps = 5/354 (1%)
Query: 11 STLVFLFFI---SSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
S VFLF + +S+SA DMSII YD H SSWRTD++VM +Y+ WLAKHGK+ N +G
Sbjct: 9 SMAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALG 68
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
E+RFQIFKDNLRFIDEHN+ NRTYKVGLN+FADLTNEEYR+MYLGTR+ AKRR S
Sbjct: 69 EKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRR--SSN 126
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
S RYA + GD LPESVDWR+KGAV VKDQGSCGSCWAFST+AAVEGINKIVTG LIS
Sbjct: 127 KISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLIS 186
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQELVDCD N GCNGGLMDYAF+FII NGG+DSE+DYPY ++ +CD R+NA VV
Sbjct: 187 LSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAXVV 246
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
+IDGYEDV DE SL+KAVA+QPVSVAIEAGGR FQ Y+SG+FTG CG+ALDHGV AVG
Sbjct: 247 TIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVG 306
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
YGTENGVDYW+V+NSWG+ WGE GY++++R+L + TGKCGIAMEASYP+K Q
Sbjct: 307 YGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKKGQ 360
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 254/352 (72%), Positives = 293/352 (83%), Gaps = 3/352 (0%)
Query: 11 STLVFLFFISS-SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
S VFLF + +SA DMSII YD H SSWRTD++VM +Y+ WLAKHGK+ N +G
Sbjct: 9 SMAVFLFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEK 68
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
E+RFQIFKDNLRFIDEHN+ NRTYKVGLN+FADLTNEEYR+MYLGTR+ AKRR S
Sbjct: 69 ERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRR--SSNKI 126
Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
S RYA + GD LPESVDWR+KGAV VKDQGSCGSCWAFST+AAVEGINKIVTG LISLS
Sbjct: 127 SDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLS 186
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
EQELVDCD N GCNGGLMDYAF+FII NGG+DSE+DYPY ++ +CD R+NAKVV+I
Sbjct: 187 EQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTI 246
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
DGYEDV DE SL+KAVA+QPVSVAIEAGGR FQ Y+SG+FTG CG+ALDHGV AVGYG
Sbjct: 247 DGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYG 306
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
TENGVDYW+V+NSWG+ WGE GY++++R+L + TGKCGIAMEASYP+K Q
Sbjct: 307 TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKKGQ 358
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 526 bits (1355), Expect = e-147, Method: Compositional matrix adjust.
Identities = 254/348 (72%), Positives = 301/348 (86%), Gaps = 5/348 (1%)
Query: 22 SSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNL 80
SSA +S ++ + NH SSSWR+DDEVM +Y++W+ +HGK NG+G EKRF+IFKDNL
Sbjct: 15 SSATYISTLTLNQNHPSSSSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNL 74
Query: 81 RFIDEHNSLNRT-YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD 139
RFIDEHNS N T YK+GLNKFADLTN+EYRA +LGTR+D +RRLMKSK+ S RYA +AGD
Sbjct: 75 RFIDEHNSNNNTTYKLGLNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGD 134
Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
LP+SV+WR+ GAV+ VKDQGSCGSCWAFS +AAVEGINKIV+GELISLSEQELVDCDR
Sbjct: 135 NLPDSVNWRDHGAVSRVKDQGSCGSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRS 194
Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
+AGCNGGLMDYAFQFII NGG+D+E+DYPYLG N+CDP+++NAKVVSIDGYEDV P +
Sbjct: 195 YDAGCNGGLMDYAFQFIIDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDV-PNN 253
Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWL 318
E +LKKAVA QPVS+AIEAGGRAFQ YESGVF GECG ALDHGVVAVGYG+ +NG DYW+
Sbjct: 254 ENALKKAVAHQPVSIAIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWI 313
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
VRNSWG +WGENGY++++RN ++ NTGKCGIAMEASYPVKN N +P
Sbjct: 314 VRNSWGGNWGENGYIRMERN-INANTGKCGIAMEASYPVKNGANIIQP 360
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 525 bits (1351), Expect = e-146, Method: Compositional matrix adjust.
Identities = 246/318 (77%), Positives = 283/318 (88%)
Query: 49 MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
M++Y+ WLAKHGK NG+G +RF+IFK+NLRFIDEHNS N TYKVGL KFADLTNEEY
Sbjct: 1 MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEY 60
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
RAM+LGTRSDAKRRLMKSK S+RYA KAGD+LPESVDWR KGAVNP+KDQGSCGSCWAF
Sbjct: 61 RAMFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAF 120
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
STVAAVEGIN+IVTGELISLSEQELVDCDR NAGCNGGLMDYAFQFII NGG+D+E+DY
Sbjct: 121 STVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKDY 180
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY+G ++KCD + K VSIDG+EDV P+DE +L+KAVA QPVSVAIEA G A Q Y+S
Sbjct: 181 PYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQS 240
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
GVFTGECG+ALDHGVV VGY +ENG+DYWLVRNSWG++WGE+GY+K+QRN+ DT TG+CG
Sbjct: 241 GVFTGECGTALDHGVVVVGYASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRCG 300
Query: 349 IAMEASYPVKNSQNSAKP 366
IAME+SYPVKN +N+AKP
Sbjct: 301 IAMESSYPVKNGENTAKP 318
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 523 bits (1346), Expect = e-146, Method: Compositional matrix adjust.
Identities = 245/337 (72%), Positives = 288/337 (85%), Gaps = 5/337 (1%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSIISY + + RTD EVM +Y+ WL KHGK+ N +G E+RF+IFKDNLRFI+E
Sbjct: 32 DMSIISYGDRLEK----RTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEE 87
Query: 86 HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
HN++NRTYKVGLN+FADLTNEEYR+ YLG R + +R L S+V S RY+ +AG++LPESV
Sbjct: 88 HNAVNRTYKVGLNRFADLTNEEYRSRYLGRRDETRRGLRASRV-SDRYSFRAGEDLPESV 146
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DWREKGAV PVKDQG+CGSCWAFST+AAVEGIN+I TG+LISLSEQELVDCD+ N GCN
Sbjct: 147 DWREKGAVVPVKDQGNCGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCN 206
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GGLMDYAF+FII NGG+DSE+DYPY A+ CDP+R+NA+VVSIDGYEDV DE SLKK
Sbjct: 207 GGLMDYAFEFIINNGGIDSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKK 266
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
AVA+QPVSVAIEAGGRAFQ Y+SGVFTG+CG+ LDHGVVAVGYGTEN VDYW+VRNSWG
Sbjct: 267 AVANQPVSVAIEAGGRAFQLYQSGVFTGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGP 326
Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
+WGE+GY+KL+RNL T TGKCGIA+E SYP+KN QN
Sbjct: 327 NWGESGYIKLERNLAGTETGKCGIAIEPSYPIKNGQN 363
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 520 bits (1340), Expect = e-145, Method: Compositional matrix adjust.
Identities = 248/360 (68%), Positives = 288/360 (80%), Gaps = 13/360 (3%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M L ISTL+FL F S + +I +Y TD+EVMT+Y+ WL KH K NG
Sbjct: 5 MTLMISTLLFLSFTLSCAIDTSTITNY-----------TDNEVMTMYEEWLVKHQKVYNG 53
Query: 66 MGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
+G +KRFQ+FKDNL FI EHN+ N TYK+GLNKFAD+TNEEYR MY GT+SDAKRRLM
Sbjct: 54 LGEKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLM 113
Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
K+K RYA AGD+LP VDWR KGAV P+KDQGSCGSCWAFSTVA VE INKIVTG+
Sbjct: 114 KTKSTGHRYAYSAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGK 173
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
+SLSEQELVDCDR N GCNGGLMDYAF+FIIQNGG+D+++DYPY G + CDP+++NA
Sbjct: 174 FVSLSEQELVDCDRAYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNA 233
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
K V+IDGYEDV P+DE +LKKAVA QPVS+AIEA GRA Q Y+SGVFTGECG++LDHGVV
Sbjct: 234 KAVNIDGYEDVPPYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVV 293
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
VGYG+ENGVDYWLVRNSWG+ WGE+GY K+QRN + T TGKCGI MEASYPVKN NSA
Sbjct: 294 VVGYGSENGVDYWLVRNSWGTGWGEDGYFKMQRN-VRTPTGKCGITMEASYPVKNGLNSA 352
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 520 bits (1340), Expect = e-145, Method: Compositional matrix adjust.
Identities = 254/365 (69%), Positives = 289/365 (79%), Gaps = 17/365 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MATA+ LA L+ FF+S S++A S R+D EV IY WLAKHG
Sbjct: 1 MATATTSLA---LLSFFFLSISASA--------------LSRRSDGEVREIYDLWLAKHG 43
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K NG+ EKRFQIFK+NL+FID+HNS NRTYKVGLN FADLTNEEYRA+YLGTRS
Sbjct: 44 KAYNGIDEREKRFQIFKENLKFIDDHNSENRTYKVGLNMFADLTNEEYRALYLGTRSPPA 103
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
RR+MK+K AS+RYA D LPES+DWR +GAV PVK+QGSCGSCWAFST+AAVEGIN+I
Sbjct: 104 RRVMKAKTASRRYAVNNLDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQI 163
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTGELISLSEQELV CD+K N+GCNGGLMDYAFQFII NGG+D+E+DYPY + +CDP+
Sbjct: 164 VTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPT 223
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
R+NAKVVSID YEDV DE SLKKAVA QPVSVAIEA G A Q Y+SGVFTG+CGSALD
Sbjct: 224 RKNAKVVSIDAYEDVPANDEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALD 283
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
HGVVAVGYG ENGVDYWLVRNSWG+ WGE+GY KL+RN+ GKCGIAM+ASYPVKN
Sbjct: 284 HGVVAVGYGKENGVDYWLVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPVKND 343
Query: 361 QNSAK 365
N K
Sbjct: 344 NNPTK 348
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 242/349 (69%), Positives = 291/349 (83%), Gaps = 3/349 (0%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
L+F F + SSA DMSIISYDN H ++WRTD+EV ++Y+ WL KHGK N +G +KR
Sbjct: 2 LLFALF-ALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKR 60
Query: 73 FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
FQIFKDNLRFID+ N+ NRTYK+GLN+FADLTNEEYRA YLGT+ D RRL + S R
Sbjct: 61 FQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRL--GRTPSNR 118
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
YA + G+ LP+SVDWR++GAV PVKDQ SCGSCWAFS + AVEGINKIVTG+LISLSEQE
Sbjct: 119 YAPRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQE 178
Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
LVDCD N GCNGGLMDYAF+FII+NGG+DSE+DYPY G + +CD R+NAKVVSIDGY
Sbjct: 179 LVDCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGY 238
Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
EDV+ +DE++LKKAVA+QPVSVA+E GGR FQ Y SGVFTG CG+ALDHGVVAVGYGT+N
Sbjct: 239 EDVNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGTDN 298
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
G D+W+VRNSWG+DWGE GY++L+RNL ++ +GKCGIA+E SYP+K Q
Sbjct: 299 GHDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPIKTGQ 347
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 249/363 (68%), Positives = 291/363 (80%), Gaps = 7/363 (1%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M S +AI+ L LF +SSA DMSII+YD H SSWRTDDEVM +Y++WL KHG
Sbjct: 1 MKLLSPSMAIALLFALFV--ASSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHG 58
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G EKRFQIFKDNLRFIDEHN+ N +YKVGLN+FADLTNEEYR+ YLG +S
Sbjct: 59 KSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKP 118
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K SKV S RYA + GD LPESVDWR KGAV P+KDQGSCGSCWAFSTV AVEGIN+
Sbjct: 119 KL----SKVKSDRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQ 174
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
IVTGELI+LSEQELVDCD+ N GC+GGLMDY F+FII NGG+D+++DYPYLG + +CD
Sbjct: 175 IVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQ 234
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
R+NAKVV+ID YEDV +E +LKKAVA QPVSV IE GGRAFQ Y+SG+FTG+CG+AL
Sbjct: 235 YRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTAL 294
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
DHGV VGYGTE G DYW+VRNSWGS WGE GY++++RNL T+ GKCGIAME SYP+KN
Sbjct: 295 DHGVNVVGYGTEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKN 354
Query: 360 SQN 362
QN
Sbjct: 355 GQN 357
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 249/367 (67%), Positives = 293/367 (79%), Gaps = 16/367 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ +M I TL+FL F S + +II+Y TD+EVM +Y+ WL +H
Sbjct: 1 MASMTM---IYTLLFLSFTLSYAIKTSTIINY-----------TDNEVMAMYEEWLVRHQ 46
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K N +G +KRFQ+FKDNL FI EHN+ LN TYK+GLNKFAD+TNEEYRAMYLGT+S+A
Sbjct: 47 KGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNA 106
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
KRRLMK+K RYA A D LP VDWR KGAV P+KDQGSCGSCWAFSTVA VE INK
Sbjct: 107 KRRLMKTKSTGHRYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINK 166
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
IVTG+ +SLSEQELVDCDR N GCNGGLMDYAF+FIIQNGG+D+++DYPY G + CDP
Sbjct: 167 IVTGKFVSLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDP 226
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
+++NAKVV+IDGYEDV P+DE +LKKAVA QPVSVAIEA GRA Q Y+SGVFTG+CG++L
Sbjct: 227 TKKNAKVVNIDGYEDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSL 286
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
DHGVV VGYG+ENGVDYWLVRNSWG+ WGE+GY K+QRN + T+TGKCGI MEASYPVKN
Sbjct: 287 DHGVVVVGYGSENGVDYWLVRNSWGTGWGEDGYFKMQRN-VRTSTGKCGITMEASYPVKN 345
Query: 360 SQNSAKP 366
NSA P
Sbjct: 346 GLNSAVP 352
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 516 bits (1330), Expect = e-144, Method: Compositional matrix adjust.
Identities = 249/345 (72%), Positives = 286/345 (82%), Gaps = 8/345 (2%)
Query: 22 SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT---SNGMGHNEKRFQIFKD 78
SSA DMSI+SYD H SSWRTDDEVM IY+ WL K+GK +N +G E+RFQ+FKD
Sbjct: 21 SSALDMSIVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKD 80
Query: 79 NLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR-RLMKSKVASQRYACKA 137
NLRFIDEHNS NR+YKVGLN+FADLTNEEYR+MYLG RS AKR RL +S S RY +
Sbjct: 81 NLRFIDEHNSENRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRS---SNRYLPRV 137
Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
GD LP+SVDWR++GAV VKDQGSCGSCWAFST+AAVEGINKIVTG+LISLSEQELVDCD
Sbjct: 138 GDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCD 197
Query: 198 RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
R N GCNGGLMDYAFQFII NGG+DSE+DYPYL + CD R+NAKVV+ID YEDV
Sbjct: 198 RSYNEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPV 257
Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
DE +L+KAVA+QPVSVAIEAGGR FQ Y+SG+FTG CG+ALDHGV AVGYGTENG DYW
Sbjct: 258 NDEKALQKAVANQPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYGTENGKDYW 317
Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
+VRNSWG WGE+GY++++RN+ T TGKCGIA+E SYP+K QN
Sbjct: 318 IVRNSWGKSWGESGYIRMERNIA-TATGKCGIAIEPSYPIKKGQN 361
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 516 bits (1328), Expect = e-144, Method: Compositional matrix adjust.
Identities = 246/352 (69%), Positives = 292/352 (82%), Gaps = 6/352 (1%)
Query: 8 LAISTLVFLFFI-SSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGM 66
+A++T++ LF + + SSA DMSIISYDN H +S R+D+E+M++Y+ WL KHGK N +
Sbjct: 36 MAMATILLLFTVFAVSSALDMSIISYDNAHAATS--RSDEELMSMYEQWLVKHGKVYNAL 93
Query: 67 GHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
G EKRFQIFKDNLRFID+HNS +RTYK+GLN+FADLTNEEYRA YLGT+ D RRL
Sbjct: 94 GEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRL-- 151
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
K S RYA + GD+LPESVDWR++GAV PVKDQG CGSCWAFS + AVEGINKIVTGEL
Sbjct: 152 GKTPSNRYAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGEL 211
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
ISLSEQELVDCD N GCNGGLMDYAF+FII NGG+DSE+DYPY G + +CD R+NAK
Sbjct: 212 ISLSEQELVDCDTGYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVDGRCDTYRKNAK 271
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VVSID YEDV +DE++LKKAVA+QPVSVAIE GGR FQ Y SGVFTG CG+ALDHGVVA
Sbjct: 272 VVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVA 331
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
VGYGT NG DYW+VRNSWG WGE+GY++L+RNL ++ +GKCGIA+E SYP+
Sbjct: 332 VGYGTANGHDYWIVRNSWGPSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 383
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 248/370 (67%), Positives = 297/370 (80%), Gaps = 10/370 (2%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRT---DDEVMTIYQTWLAKHGKTSNGM 66
I+TL+F F S S A DMSII Y NNH ++ W +D+V Y+ WLA+HG+ N +
Sbjct: 6 ITTLLFALFSSLSYAIDMSIIDYKNNH-YARKWTLQSDEDQVKNRYEMWLAEHGRAYNAL 64
Query: 67 GHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
G EKRF+IFKDNLRFI+ HN S NRTYKVGLN+FADLTNEEYR MYLGT+SDA+RR +K
Sbjct: 65 GEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRTMYLGTKSDARRRFVK 124
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
SK SQRYA + + +P SVDWR++GAV P+K+QGSCGSCWAFSTVAAVEGIN+IVTGE+
Sbjct: 125 SKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAVEGINQIVTGEM 184
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
I+LSEQELVDCDR N+GCNGGLMDYAF+FII NGGMD+E+ YPY G E +CDP R+N K
Sbjct: 185 ITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGMDTEKHYPYRGVEGRCDPVRKNYK 244
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VVSIDGYEDV P +E +L+KAVA QPV VAIEA GRAFQ Y SGVFTGECG +DHGVV
Sbjct: 245 VVSIDGYEDV-PRNERALQKAVAHQPVCVAIEASGRAFQLYSSGVFTGECGEEVDHGVVV 303
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK----NSQ 361
VGYG+E+GVDYW+VRNSWG+ WGENGYVK++RN+ ++ GKCGI EASYP K N +
Sbjct: 304 VGYGSEDGVDYWIVRNSWGTKWGENGYVKMERNVKKSHLGKCGIMTEASYPTKDSAINKR 363
Query: 362 NSAKPKPHSS 371
N++K + SS
Sbjct: 364 NTSKEEKISS 373
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 246/350 (70%), Positives = 286/350 (81%), Gaps = 5/350 (1%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
L+FL F + SSA DMSIISY H SSWRTDDEVM +Y+ WL KHGK N +G EKR
Sbjct: 4 LLFLVF-ALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKR 62
Query: 73 FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
F+IFKDNL FID+HNS NRTY VGLN+FADLTNEE+R+MYLGTR+ K+RL K+ S R
Sbjct: 63 FEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKT---SDR 119
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
YA + GD LP+SVDWR++GAV VKDQG CGSCWAFST+AAVEGINKIVTG+LI+LSEQE
Sbjct: 120 YAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQE 179
Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
LVDCD N GCNGGLMDYAF+FII NGG+D+E DYPYLG + +CD R+NAKVVSID Y
Sbjct: 180 LVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSY 239
Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
EDV DE +LKKAVA+QPVSVAIE GGR FQ Y SGVFTGECG++LDHGV AVGYGTE
Sbjct: 240 EDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEK 299
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
G DYW+VRNSWG WGE+GY++++RN+ + TGKCGIA+E SYP+K QN
Sbjct: 300 GKDYWIVRNSWGKSWGESGYIRMERNIA-SPTGKCGIAIEPSYPIKKGQN 348
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 514 bits (1323), Expect = e-143, Method: Compositional matrix adjust.
Identities = 242/358 (67%), Positives = 287/358 (80%), Gaps = 14/358 (3%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
+ I++L+F I+ S A D S+ R+++EVMT+Y+ WL KH K NG+G
Sbjct: 4 ITITSLLFFSLITLSLAMDTSM-------------RSNEEVMTMYEEWLVKHHKVYNGLG 50
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
++RF+IFKDNL FIDEHN+ N TYKVGLNKFAD TNEEYR MYLGT++DAKR +MK K
Sbjct: 51 EKDQRFEIFKDNLGFIDEHNAQNYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIK 110
Query: 128 VAS-QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+ + RYA +GD LP VDWR KGAV +KDQGSCGSCWAFST+A VE INKIVTG+L+
Sbjct: 111 ITTGHRYAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLV 170
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SLSEQELVDCDR N GCNGGLMDYAF+FI++NGG+D+EQDYPY G E +CDP+R+NAKV
Sbjct: 171 SLSEQELVDCDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKV 230
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
VSIDGYEDV ++E +LKKAV QPVSVAIEAGGRA Q Y+SGVFTG CG+ LDHGVV V
Sbjct: 231 VSIDGYEDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVV 290
Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
GYG ENGVDYWLVRNSWG++WGE+GY KL+RN+ NTGKCGIAM+ASYPVK QNSA
Sbjct: 291 GYGFENGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPVKYGQNSA 348
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 514 bits (1323), Expect = e-143, Method: Compositional matrix adjust.
Identities = 239/347 (68%), Positives = 282/347 (81%)
Query: 12 TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
+L L ++SSA DMSI+SYD H SSWRTDDEVM +Y+ WL KHGK N +G EK
Sbjct: 9 SLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKEK 68
Query: 72 RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
RF IFKDNLRFIDEHNS N TY++GLN+FADLTNEEYR+MYLG + A R K S
Sbjct: 69 RFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRKSD 128
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
R+A + GD LP+ +DWR++GAV VKDQGSCGSCWAFST+AAVEGIN+IVTG+LISLSEQ
Sbjct: 129 RFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQ 188
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
ELVDCD N GCNGGLMDYAF+FII NGG+DSE+DYPY A+ KCD R+NA VVSIDG
Sbjct: 189 ELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSIDG 248
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
YEDV DE +LKKAVA QPVSVAIEAGGRAFQ Y+SGVFTG+CG++LDHGV AVGYGTE
Sbjct: 249 YEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYGTE 308
Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NG DYW+V NSWG +WGE+GY++++RNL +++GKCGIA+ SYP+K
Sbjct: 309 NGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPIK 355
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 513 bits (1322), Expect = e-143, Method: Compositional matrix adjust.
Identities = 245/358 (68%), Positives = 294/358 (82%), Gaps = 5/358 (1%)
Query: 2 ATASMFLAISTLVFLFFISSSSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHG 60
+ A+M +A L+F F + SSA DMSIISYD+ H D +++ RT++E+M++Y+ WL KHG
Sbjct: 9 SPATMTMAAIVLLFTVF-AVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVKHG 67
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K N +G EKRFQIFKDNLRFID+HNS +RTYK+GLN+FADLTNEEYRA YLGT+ D
Sbjct: 68 KVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTKIDP 127
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
RRL K S RYA + GD+LP+SVDWR++GAV PVKDQG CGSCWAFS + AVEGINK
Sbjct: 128 NRRL--GKTPSNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINK 185
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
IVTGELISLSEQELVDCD N GCNGGLMDYAF+FII NGG+DS++DYPY G + +CD
Sbjct: 186 IVTGELISLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVDGRCDT 245
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
R+NAKVVSID YEDV +DE++LKKAVA+QPVSVAIE GGR FQ Y SGVFTG CG+AL
Sbjct: 246 YRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTAL 305
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
DHGVVAVGYGT G DYW+VRNSWGS WGE+GY++L+RNL ++ +GKCGIA+E SYP+
Sbjct: 306 DHGVVAVGYGTAKGHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 363
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 249/362 (68%), Positives = 292/362 (80%), Gaps = 10/362 (2%)
Query: 2 ATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK 61
++A+MF+ L+FL F + SSA+DMSIISYD H SSWRTDDEVM IY+ WL K GK
Sbjct: 7 SSAAMFV----LLFLSF-TLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGK 61
Query: 62 TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
N +G EKRFQ+FKDNLRFIDEHNS NRTYK+GLN FADLTNEEYR+ YLG R KR
Sbjct: 62 VYNALGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLGARGGMKR 121
Query: 122 -RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
RL K+ S RYA + G+ LP+SVDWR++GAV VKDQGSCGSCWAFST+AAVEGINKI
Sbjct: 122 NRLRKT---SDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKI 178
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG+LISLSEQELVDCD N GCNGGLMDYAF+FII NGG+D+E+DYPYL + +CD
Sbjct: 179 VTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTY 238
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
R+NAKVV+ID YEDV E +L+KAVA+QPVSVAIEAGGR FQ Y SG+F+G CG+ LD
Sbjct: 239 RKNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLD 298
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
HGV AVGYGTENG DYW+VRNSWG WGENGY+++ R+ +++ TG CGIAMEASYP+K
Sbjct: 299 HGVAAVGYGTENGKDYWIVRNSWGKSWGENGYLRMARS-INSPTGICGIAMEASYPIKKG 357
Query: 361 QN 362
QN
Sbjct: 358 QN 359
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 242/341 (70%), Positives = 280/341 (82%), Gaps = 4/341 (1%)
Query: 22 SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLR 81
SSA DMSIISY H SSWRTDDEVM +Y+ WL KHGK N +G EKRF+IFKDNL
Sbjct: 21 SSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLM 80
Query: 82 FIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
FID+HNS NRTY VGLN+FADLTNEE+R+MYLGTR+ K+RL K+ S RYA + GD L
Sbjct: 81 FIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKT---SDRYAPRVGDSL 137
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
P+SVDWR++GAV VKDQG CGSCWAFST+AAVEGINKIVTG+LI+LSEQELVDCD N
Sbjct: 138 PDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYN 197
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF+FII NGG+D+E DYPYLG + +CD R+NAKVVSID YEDV DE
Sbjct: 198 EGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDET 257
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+LKKAVA+QPVSVAIE GGR FQ Y SGVFTGECG++LDHGV AVGYGTE G DYW+VRN
Sbjct: 258 ALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRN 317
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG WGE+GY++++RN+ + TGKCGIA+E SYP+K QN
Sbjct: 318 SWGKSWGESGYIRMERNIA-SPTGKCGIAIEPSYPIKKGQN 357
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 510 bits (1313), Expect = e-142, Method: Compositional matrix adjust.
Identities = 245/351 (69%), Positives = 285/351 (81%), Gaps = 4/351 (1%)
Query: 13 LVFLFFISS-SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
+ LFF S+ SSA+D+SIISYD +H SSWRTDDEVM IY+ WL KHGK N +G E+
Sbjct: 2 FMLLFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKER 61
Query: 72 RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
RF++FKDNLRFIDEHNS NRTY+VGLN+FADLTNEEYR+MYLG S +R K + S
Sbjct: 62 RFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSMYLGALSGIRRN--KLRKISD 119
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
RY + GD LP+SVDWR++GAV VKDQGSCGSCWAFS VAAVEGINKIVTG+LISLSEQ
Sbjct: 120 RYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQ 179
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
ELVDCD N GCNGGLMDY F+FII NGG+DSE+DYPYL + +CD R+NA+VVSID
Sbjct: 180 ELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSIDS 239
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
YEDV +E +L+KAVA+QPVSVAIEAGGR FQ Y SGVF+G CG+ALDHGVVAVGYGTE
Sbjct: 240 YEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYGTE 299
Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
NG DYW+VRNSWG WGE+GY+++ RN+ TG CGIAMEASYP+K QN
Sbjct: 300 NGQDYWIVRNSWGKSWGESGYLRMARNIRKP-TGICGIAMEASYPIKKGQN 349
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 510 bits (1313), Expect = e-142, Method: Compositional matrix adjust.
Identities = 243/357 (68%), Positives = 291/357 (81%), Gaps = 10/357 (2%)
Query: 10 ISTLVFL-FFISSSSAA-------DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK 61
+++L FL FFI S DMSI+ Y+ H RTD +V +Y+ WL +HGK
Sbjct: 1 MASLKFLAFFILFSGLLSSFSSALDMSIVDYNIKHGTKYPLRTDSQVRRMYEMWLVEHGK 60
Query: 62 TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
N +G EKRF+IFKDNLRFIDEHNS++R+YKVGLN+FADLTNEEY+AM+LGT+ + K
Sbjct: 61 AYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKVGLNRFADLTNEEYKAMFLGTKMERKN 120
Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
R + ++ SQRY K GD+LPE+VDWREKGAV PVKDQG CGSCWAFSTV AVEGIN+IV
Sbjct: 121 RFLGTR--SQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTVGAVEGINQIV 178
Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
TGELISLSEQELVDCD+ N GCNGGLMDYAF+FII NGG+D+E+DYPY ++N CDP+R
Sbjct: 179 TGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDTEEDYPYKASDNICDPNR 238
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
+NAKVV+IDGYEDV DE SLKKAVA QPVSVAIEAGGRAFQ Y+SGVFTG CG+ LDH
Sbjct: 239 KNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQLYKSGVFTGRCGTELDH 298
Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GVVAVGYGTENGV+YW+VRNSWGS WGE+GY++++RN+ +T TGKCGIA++ SYP K
Sbjct: 299 GVVAVGYGTENGVNYWIVRNSWGSAWGESGYIRMERNVANTKTGKCGIAIQPSYPTK 355
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 509 bits (1312), Expect = e-142, Method: Compositional matrix adjust.
Identities = 240/356 (67%), Positives = 288/356 (80%), Gaps = 3/356 (0%)
Query: 4 ASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS 63
AS++ + + L +F+S A DMSII Y+ H RT+ E + +Y+ WL K+GK
Sbjct: 2 ASLYRSFAFLATFYFLSVCLAIDMSIIDYNLKHGQVPE-RTEAETLRLYEMWLVKYGKAY 60
Query: 64 NGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
N +G E+RF+IFKDNL+F+D+HNS+ N +YK+GLNKFADL+NEEYRA YLGTR D KRR
Sbjct: 61 NALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRR 120
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
L+ S RY K GD+LPESVDWREKGAV PVKDQG CGSCWAFSTV AVEGIN+IVT
Sbjct: 121 LLGGP-KSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVT 179
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
G L SLSEQELVDCD+ N GCNGGLMDYAF+FI++NGG+D+E+DYPY ++ CDP+R+
Sbjct: 180 GNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNRK 239
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
NA+VV+IDGYEDV DE SL+KAVA+QPVSVAIEAGGRAFQ Y+SGVFTG CG+ LDHG
Sbjct: 240 NARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGTQLDHG 299
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
VVAVGYGTENGVDYW+VRNSWG WGENGY++++RN+ T TGKCGIAMEASYP K
Sbjct: 300 VVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYPTK 355
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 506 bits (1302), Expect = e-141, Method: Compositional matrix adjust.
Identities = 238/355 (67%), Positives = 286/355 (80%), Gaps = 5/355 (1%)
Query: 13 LVFLFFISS---SSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
L+ + ISS S A DMSIISYD H D S+S RT+ EV+T+Y+ WL KHGK+ NG+G
Sbjct: 12 LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK-SK 127
+KRF+IFKDNL+FIDEHN LN TY++GL +FADLTNEEYR+ +LGT+ D RR+ K
Sbjct: 72 KDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKLGG 131
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
S RYA + GD+LPESVDWR++GAV VKDQ SCGSCWAFS +AAVEGINKIVTG+LIS
Sbjct: 132 SKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLIS 191
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQELVDCD N GCNGGLMDYAF+FII NGG+DSE DYPY + +CD +R+NAKVV
Sbjct: 192 LSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVV 251
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
+ID YEDV +DE++L+KAVA+QP++VA+E GGR FQ YE GVFTG CG+ALDHGV AVG
Sbjct: 252 TIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAAVG 311
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
YGTENG DYW+VRNSWG WGE GY++L+RNL + GKCGIA+E SYP+KN QN
Sbjct: 312 YGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKNGQN 366
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 238/355 (67%), Positives = 286/355 (80%), Gaps = 5/355 (1%)
Query: 13 LVFLFFISS---SSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
L+ + ISS S A DMSIISYD H D S+S RT+ EV+T+Y+ WL KHGK+ NG+G
Sbjct: 12 LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK-SK 127
+KRF+IFKDNL+FIDEHN LN TY++GL +FADLTNEEYR+ +LGT+ D RR+ K
Sbjct: 72 KDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKLGG 131
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
S RYA + GD+LPESVDWR++GAV VKDQ SCGSCWAFS +AAVEGINKIVTG+LIS
Sbjct: 132 SKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLIS 191
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQELVDCD N GCNGGLMDYAF+FII NGG+DSE DYPY + +CD +R+NAKVV
Sbjct: 192 LSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVV 251
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
+ID YEDV +DE++L+KAVA+QP++VA+E GGR FQ YE GVFTG CG+ALDHGV AVG
Sbjct: 252 TIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAAVG 311
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
YGTENG DYW+VRNSWG WGE GY++L+RNL + GKCGIA+E SYP+KN QN
Sbjct: 312 YGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKNGQN 366
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 505 bits (1300), Expect = e-140, Method: Compositional matrix adjust.
Identities = 241/358 (67%), Positives = 283/358 (79%), Gaps = 13/358 (3%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
L STL+FL F S + +I +Y TD+EVMT+Y+ WL KH K NG+
Sbjct: 7 LVTSTLLFLSFTLSCAIDTSTITNY-----------TDNEVMTMYEEWLVKHQKVYNGLR 55
Query: 68 HNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
+KRFQ+FKDNL FI EHN+ N TYK+GLN+FAD+TNEEYR MY GT+SDAKRRLMK+
Sbjct: 56 EKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKT 115
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
K RYA AGD LP VDWR KGAV P+KDQGSCGSCWAFSTVA VE INKIVTG+ +
Sbjct: 116 KSTGHRYAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFV 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SLSEQELVDCDR N GCNGGLMDYAF+FIIQNGG+D+++DYPY G + CDP+++NAKV
Sbjct: 176 SLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKV 235
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V+IDG+EDV P+DE +LKKAVA QPVS+AIEA GR Q Y+SGVFTG+CG++LDHGVV V
Sbjct: 236 VNIDGFEDVPPYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVV 295
Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
GYG+ENGVDYWLVRNSWG+ WGE+GY K+QRN + T TGKCGI MEASYPVKN SA
Sbjct: 296 GYGSENGVDYWLVRNSWGTGWGEDGYFKMQRN-VRTPTGKCGITMEASYPVKNGLISA 352
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 503 bits (1295), Expect = e-140, Method: Compositional matrix adjust.
Identities = 239/359 (66%), Positives = 282/359 (78%), Gaps = 12/359 (3%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
+ I TL+ L F S + A MSII+Y N EVM +Y+ WL KH K NG+
Sbjct: 4 MLIPTLLLLSFTFSHATA-MSIINYSEN-----------EVMDMYEEWLVKHRKVYNGLD 51
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
EKRFQ+FKDNL FI +HN+ N TY +GLNKFAD+TNEEYRAMYLGTR+DAKRR+MK++
Sbjct: 52 EKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQ 111
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
RYA +GD+LP VDWR KGAV P+KDQG+CGSCWAFSTVAAVEGIN IVTGE +S
Sbjct: 112 NTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVS 171
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQELVDCDR+ + GCNGGLMDYAFQFIIQNGG+D+E+DYPY G + CD +++ KVV
Sbjct: 172 LSEQELVDCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVV 231
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
IDGYEDV +E +LKKAV+ QPVSVAIEA GRA Q Y+SGVFTG+CG+ALDHGVV VG
Sbjct: 232 QIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVG 291
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
YGTENGVDYWLVRNSWG+ WGE+GY K++RN+ T+ GKCGIAM+ SYPVK NSA P
Sbjct: 292 YGTENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVP 350
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 503 bits (1295), Expect = e-140, Method: Compositional matrix adjust.
Identities = 241/356 (67%), Positives = 288/356 (80%), Gaps = 10/356 (2%)
Query: 24 AADMSIISYDNNHDHSSSWRT---DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNL 80
A DMSII Y NNH ++ W +D+V Y+ WLA+HG+ N +G EKRF+IFKDNL
Sbjct: 20 AIDMSIIDYKNNH-YARKWTLQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNL 78
Query: 81 RFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD 139
RFI+EHN S NRTYKVGLN+FADLTNEEYR MYLGT+SDA+RR +KSK SQRYA + +
Sbjct: 79 RFIEEHNNSGNRTYKVGLNQFADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNE 138
Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
+P SVDWR++GAV P+K+QGSCGSCWAFSTVAAV GIN+IVTGE+I+LSEQELVDCDR
Sbjct: 139 LMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRV 198
Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
N+GCNGGLMDYAF+FII NGGMD+E+ YPY G E +CDP R+N KVVSIDGYEDV P +
Sbjct: 199 QNSGCNGGLMDYAFEFIISNGGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDV-PRN 257
Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
E +L+KAVA QPV VAIEA GRAFQ Y SGVFTGECG +DHGVV VGYG+E+GVDYW+V
Sbjct: 258 ERALQKAVAHQPVCVAIEASGRAFQLYSSGVFTGECGEEVDHGVVVVGYGSEDGVDYWIV 317
Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK----NSQNSAKPKPHSS 371
RNSWG+ WGENGYVK++RN+ ++ GKCGI EASYP K N +N++K + SS
Sbjct: 318 RNSWGTKWGENGYVKMERNVKKSHLGKCGIMTEASYPTKDSAINKRNTSKEEKISS 373
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 236/352 (67%), Positives = 283/352 (80%), Gaps = 12/352 (3%)
Query: 15 FLFF--ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
FLFF I+ S A D+ + + R++DEVMT+Y+ WL KH K NG+ ++R
Sbjct: 10 FLFFSLITFSLALDIQL----------PTGRSNDEVMTMYEEWLVKHQKVYNGLREKDQR 59
Query: 73 FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
FQIFKDNL FIDEHN+ N TY VGLNKFAD+TNEEYR MYLGTRSD KRR+MK+K+ R
Sbjct: 60 FQIFKDNLNFIDEHNAQNYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHR 119
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
YA +GD LP VDWR KGA+ +KDQGSCGSCWAFST+A VE INKIVTG+L+SLSEQE
Sbjct: 120 YAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQE 179
Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
LVDCDR N GCNGGLMDYAF+FII NGG+D++Q YPY G E +CDP+R+ AK+VSIDGY
Sbjct: 180 LVDCDRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGY 239
Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
EDV +E +LKKAVA QPVSVAIEA GRA Q Y+SGVFTG+CG++LDH VV VGYG+EN
Sbjct: 240 EDVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGSEN 299
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
G+DYWLVRNSWG++WGE+GY K++RN+ T+TGKCGIA+EASYPVK +NSA
Sbjct: 300 GLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPVKYGKNSA 351
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 236/357 (66%), Positives = 290/357 (81%), Gaps = 4/357 (1%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
+ + +VF F +++ A DMSIISYD H SS R+D EV IY+ W KHGK +N +
Sbjct: 10 MLVILIVFTLF-TATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNID 68
Query: 68 HNEK--RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM- 124
+EK RF+IFKDNL+FIDEHN+ NRTYKVGLN+FADL+NEEYR+ YLGT+ D +M
Sbjct: 69 GSEKDKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMMA 128
Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
++K S RYA GD+LP+SVDWR +GAV VKDQGSCGSCWAFST+AAVEGINKIVTGE
Sbjct: 129 RTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTGE 188
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
L+SLSEQELVDCDR +NAGC+GGLM+YAF+FII NGG+DS++DYPY G + KCD ++NA
Sbjct: 189 LVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQYKKNA 248
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
+VVSID YE V +DE++LKKAVA+QP+SVAIEAGGR FQ Y SG+FTG+CG+ALDHGV
Sbjct: 249 RVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTALDHGVT 308
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
AVGYGTENGVDYW+VRNSWG WGE+GYV+++RNL + GKCGI M++SYP+K Q
Sbjct: 309 AVGYGTENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPIKKGQ 365
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 238/359 (66%), Positives = 282/359 (78%), Gaps = 12/359 (3%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
+ I TL+ L F S + A MSII+Y N EVM +Y+ WL KH K NG+
Sbjct: 4 MLIPTLLLLSFTFSHATA-MSIINYSEN-----------EVMDMYEEWLVKHRKVYNGLD 51
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
EKRFQ+FKDNL FI +HN+ N TY +GLNKFAD+TN+EYRAMYLGTR+DAKRR+MK++
Sbjct: 52 EKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQ 111
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
RYA +GD+LP VDWR KGAV P+KDQG+CGSCWAFSTVAAVEGIN IVTGE +S
Sbjct: 112 NTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVS 171
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQELVDCDR+ + GCNGGLMDYAFQFIIQNGG+D+E+DYPY G + CD +++ KVV
Sbjct: 172 LSEQELVDCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVV 231
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
IDGYEDV +E +LKKAV+ QPVSVAIEA GRA Q Y+SGVFTG+CG+ALDHGVV VG
Sbjct: 232 QIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVG 291
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
YGTENGVDYWLVRNSWG+ WGE+GY K++RN+ T+ GKCGIAM+ SYPVK NSA P
Sbjct: 292 YGTENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVP 350
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 239/361 (66%), Positives = 291/361 (80%), Gaps = 8/361 (2%)
Query: 7 FLAISTLVFLFFISSSSAADMSIISYDNNH----DHSSSWRTDDEVMTIYQTWLAKHGKT 62
+ ++TL F IS SA DMSII+YD H S+ RTDDEV +Y++WL KHGKT
Sbjct: 3 LIPMATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKT 62
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS-DAKR 121
N +G ++RFQIFKDNLRFIDEHNS + TYK+GLNKFADLTNEEYR Y G ++ D K+
Sbjct: 63 YNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKK 122
Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
+L SK+ S RYA ++GD LPE VDWRE+GAV VKDQGSCGSCWAFST +VEG+NKIV
Sbjct: 123 KL--SKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIV 180
Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
TG+LIS+SEQELV+CD N GCNGGLMDYAF+FII+NGG+D+E+DYPY G + KCD ++
Sbjct: 181 TGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNK 240
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
+NAKVV+ID YEDV DE SLKKAV++QPV+VAIEAGGR FQ Y SG+FTG CG+ALDH
Sbjct: 241 KNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDH 300
Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
GV+A GYGTE+G DYWLV+NSWG++WGE GY+K++RN+ D +GKCGIAMEASYP+KN
Sbjct: 301 GVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNIAD-KSGKCGIAMEASYPIKNGD 359
Query: 362 N 362
N
Sbjct: 360 N 360
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 500 bits (1288), Expect = e-139, Method: Compositional matrix adjust.
Identities = 245/366 (66%), Positives = 288/366 (78%), Gaps = 7/366 (1%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDH-SSSWRTDDEVMTIYQTWLAKH 59
MA L I+ + FLF + S S A MSII YD D S+ RT+ +M +Y+ WL KH
Sbjct: 1 MAPPPFRLCIA-ISFLFMVFSLSLASMSIIDYDLPADPLQSTERTEAHMMKMYEHWLVKH 59
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
GK N +G E+RF+IFKDNLRF+DE NS+ RTYK+GL KFADLTNEEYRAMYLG + +
Sbjct: 60 GKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKME 119
Query: 119 AKRRLMKSKVASQRYACKAG--DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
K +L + SQRY KAG D+LP VDWREKGAV VKDQG CGSCWAFSTV +VEG
Sbjct: 120 KKEKLRTER--SQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEG 177
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
IN+IVTG+LISLSEQELVDCD+ N GCNGGLMDYAF+FII+NGG+DSE DYPY ++N
Sbjct: 178 INQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPYRASDNM 237
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
CD +R+NA VV+IDGYEDV DE SLKKAVA+QPVSVAIEAGGR FQ Y+SGVFTG CG
Sbjct: 238 CDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGVFTGRCG 297
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+ LDHGVVAVGYGTENG+DYW+VRNSWG WGE+GY++++RN+ T+TGKCGIAMEASYP
Sbjct: 298 TNLDHGVVAVGYGTENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYP 357
Query: 357 VKNSQN 362
K QN
Sbjct: 358 TKKGQN 363
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 500 bits (1287), Expect = e-139, Method: Compositional matrix adjust.
Identities = 242/350 (69%), Positives = 282/350 (80%), Gaps = 7/350 (2%)
Query: 9 AISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
+++ L+FL F + SSA DMSIISYD H RTD E M IY+ WL HGK N +G
Sbjct: 8 SVACLLFLCF-AFSSALDMSIISYDQTH---PPQRTDAEAMAIYEKWLTTHGKAYNAIGE 63
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
E+RF+IFKDNLRF+DEHN++ +Y+VGLN+FADLTNEEYR+M+LG + K R +K
Sbjct: 64 KERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMKERSASTK- 122
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
S RYA +AGD+LP SVDWREKGAV+PVKDQG CGSCWAFST++AVEGIN+IVTGELISL
Sbjct: 123 -SDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISL 181
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQELVDCD+ N GCNGGLMDY FQFII NGG+D+E+DYPY + CD R+NA+VVS
Sbjct: 182 SEQELVDCDKSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVS 241
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
I+GYEDV DE SLKKAVA+QPVSVAIEAGGRAFQ YESGVFTG CG+ LDHGVVAVGY
Sbjct: 242 INGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGY 301
Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GTENGVDYW VRNSWG WGENGY+KL+RN ++ +GKCGIA ASYP K
Sbjct: 302 GTENGVDYWTVRNSWGPKWGENGYIKLERN-INATSGKCGIASMASYPTK 350
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 236/367 (64%), Positives = 288/367 (78%), Gaps = 6/367 (1%)
Query: 2 ATASMFLAISTLVFLF----FISSSSAADMSIISYDNNHD-HSSSWRTDDEVMTIYQTWL 56
++A+M LV F F+ SSA+DMSII+YD H +S RT D+++++Y++WL
Sbjct: 5 SSATMSPRPQCLVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLRTHDQLLSLYESWL 64
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGT 115
KH K N +G E RF IFKDN+ F+D HNS+ N++YK+GLNKFADLTN+EYR++YL
Sbjct: 65 VKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSG 124
Query: 116 RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
+ + R + S R+ + GD LPESVDWR++GAV PVKDQG CGSCWAFSTV AVE
Sbjct: 125 KMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVE 184
Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
GINKIVTGELISLSEQELVDCD N GCNGGLMDYAF+FI++NGG+D+E DYPY G +
Sbjct: 185 GINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDG 244
Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
CD +R+NAKVV+I+GYEDV DE SLKKAVA QPVSVAIEAGGRAFQ YESGVFTG+C
Sbjct: 245 LCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGVFTGQC 304
Query: 296 GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
G+ LDHGVVAVGYG+ENG DYW+VRNSWG DWGE+GY++L+RN+ T+TGKCGIAM+ASY
Sbjct: 305 GTELDHGVVAVGYGSENGKDYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGIAMQASY 364
Query: 356 PVKNSQN 362
P K N
Sbjct: 365 PTKTGDN 371
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 238/357 (66%), Positives = 292/357 (81%), Gaps = 4/357 (1%)
Query: 7 FLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGM 66
L +S V S+S++ADMSII+YD H R+D+EVM +Y++WL +HGK+ NG+
Sbjct: 4 LLILSLFVLAAVSSASASADMSIITYDEEHPAKGLSRSDEEVMALYESWLVEHGKSYNGL 63
Query: 67 G-HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
G +KRF+IFKDNLR+IDE NS +R+YK+GLN+FADLTNEEYR+ YLG ++DA+RR+
Sbjct: 64 GGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEYRSTYLGAKTDARRRIA 123
Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
K+K + +RYA KAG LP+S+DWREKGAV VKDQGSCGSCWAFST+AAVEGIN+IVTGE
Sbjct: 124 KTK-SDRRYAPKAGGSLPDSIDWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGE 182
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
LISLSEQELVDCD N GCNGGLMDYAF+FII+NGG+D+E DYPY G +CD +R+NA
Sbjct: 183 LISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEADYPYTGRYGRCDQTRKNA 242
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
KVVSIDGYEDV+P+DE +LK+AVA QPVSVAIEAGGR FQ Y SG+FTG CG+ LDHGV
Sbjct: 243 KVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGGRDFQLYSSGIFTGSCGTDLDHGVT 302
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
AVGYGTENGVDYW+V+NSW + WGE GY+++QRN+ D N G CGIA+E SYP K +
Sbjct: 303 AVGYGTENGVDYWIVKNSWAASWGEKGYLRMQRNVKDKN-GLCGIAIEPSYPTKTGE 358
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 234/359 (65%), Positives = 288/359 (80%), Gaps = 6/359 (1%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA S L IS L+ L F + SSA+DMSIISYD H H RTDDEV +Y++WL +HG
Sbjct: 1 MAAHSSTLTISILLMLIFSTLSSASDMSIISYDETHIHR---RTDDEVSALYESWLIEHG 57
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G +KRFQIFKDNLR+IDE NS+ N++YK+GL KFADLTNEEYR++YLGT+S
Sbjct: 58 KSYNALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSG 117
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
R+ + SK S RY K GD LPES+DWREKG + VKDQGSCGSCWAFS VAA+E IN
Sbjct: 118 DRKKL-SKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINA 176
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
IVTG LISLSEQELVDCDR N GC+GGLMDYAF+F+I+NGG+D+E+DYPY CD
Sbjct: 177 IVTGNLISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQ 236
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
R+NAKVV ID YEDV +E +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+
Sbjct: 237 YRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAV 296
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
DHGVV GYGTENG+DYW+VRNSWG++WGENGY+++QRN+ +++G CG+A+E SYPVK
Sbjct: 297 DHGVVIAGYGTENGMDYWIVRNSWGANWGENGYLRVQRNVA-SSSGLCGLAIEPSYPVK 354
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 222/366 (60%), Positives = 284/366 (77%), Gaps = 3/366 (0%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M T +A +V ++ SSA DMSIISYD +H S W++D+EVM+IY+ WL KHG
Sbjct: 1 MGTNRSLMATILIVLFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHG 60
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K N + EKRFQIFKDNL FI+EHN++NRTYKVGLN+F+DL+NEEYR+ YLGT+ D
Sbjct: 61 KVYNAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKVGLNRFSDLSNEEYRSKYLGTKIDPS 120
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R + + S+RY+ + D LPESVDWR++GAV VK+Q C CWAFS +AAVEGINKI
Sbjct: 121 RMMAR---PSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKI 177
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG L +LSEQEL+DCDR +NAGC+GGL+DYAF+FII NGG+D+E+DYP+ GA+ CD
Sbjct: 178 VTGNLTALSEQELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQY 237
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ NA+ V+IDGYE V +DE++LKKAVA+QPVSVAIEA G+ FQ YESG+FTG CG+++D
Sbjct: 238 KINARAVTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSID 297
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
HGV AVGYGTENG+DYW+V+NSWG +WGE GYV ++RN+ + GKCGIA+ YP+K
Sbjct: 298 HGVTAVGYGTENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPIKIG 357
Query: 361 QNSAKP 366
QN + P
Sbjct: 358 QNPSNP 363
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 489 bits (1259), Expect = e-136, Method: Compositional matrix adjust.
Identities = 231/335 (68%), Positives = 273/335 (81%), Gaps = 3/335 (0%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
MSII Y+ H RT+ E IY+ WL KHG+ N +G E+RF+IFKDNL+FIDEH
Sbjct: 1 MSIIDYNIKHGQVPE-RTEAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEH 59
Query: 87 NSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
NS+ N +YK+GLNKFADL+N+EYR++YLGTR D K RL+ S+RY K GD+LPE+V
Sbjct: 60 NSVGNPSYKLGLNKFADLSNDEYRSVYLGTRMDGKGRLLGGP-KSERYLFKEGDDLPETV 118
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DWREKGAV PVKDQG CGSCWAFSTV AVEGIN+IVTG L SLSEQELVDCD+ N GCN
Sbjct: 119 DWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCN 178
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GGLMDYAF FII+NGG+D+E+DYPY ++ CDP+R+NA+VV+IDGYEDV DE SLKK
Sbjct: 179 GGLMDYAFDFIIENGGIDTEEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKK 238
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
AVA+QPVSVAIEAGGR FQ Y+SGVFTG CG+ LDHGVV VGYGTE+GVDYW+VRNSWG
Sbjct: 239 AVANQPVSVAIEAGGRGFQLYQSGVFTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGP 298
Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
WGENGY++++R++ T TGKCGIAMEASYP K S
Sbjct: 299 AWGENGYIRMERDVASTETGKCGIAMEASYPTKKS 333
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 236/345 (68%), Positives = 274/345 (79%), Gaps = 21/345 (6%)
Query: 24 AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
A DMSII YD +H +Y+ WL KHGK N +G E+RF+IFKDNLRFI
Sbjct: 31 AMDMSIIDYDESHTRH-----------VYEAWLVKHGKAYNALGEKERRFKIFKDNLRFI 79
Query: 84 DEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA-----SQRYACKA 137
+EHN +++YK+GLNKFADLTNEEYRAM+LGTR+ R K+K A + RYA +A
Sbjct: 80 EEHNGAGDKSYKLGLNKFADLTNEEYRAMFLGTRT----RGPKNKAAVVAKKTDRYAYRA 135
Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
G+ELP VDWREKGAV P+KDQG CGSCWAFSTV AVEGIN+IVTG L SLSEQELVDCD
Sbjct: 136 GEELPAMVDWREKGAVTPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCD 195
Query: 198 RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
R N GCNGGLMDYAF+FI+QNGG+D+E+DYPY +N CDP+R+NA+VV+IDGYEDV
Sbjct: 196 RGYNMGCNGGLMDYAFEFIVQNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPT 255
Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
DE SL KAVA+QPVSVAIEAGG FQ Y+SGVFTG CG+ LDHGVVAVGYGTENG DYW
Sbjct: 256 NDEKSLMKAVANQPVSVAIEAGGMEFQLYQSGVFTGRCGTNLDHGVVAVGYGTENGTDYW 315
Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
LVRNSWGS WGENGY+KL+RN+ +T TGKCGIA+EASYP+KN N
Sbjct: 316 LVRNSWGSAWGENGYIKLERNVQNTETGKCGIAIEASYPIKNGAN 360
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 242/361 (67%), Positives = 289/361 (80%), Gaps = 9/361 (2%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M ++ F A++ L+ + SSA DMSII SS RTDDEVM +Y++WL KHG
Sbjct: 1 MDSSRSFTAMALLLLFSLFALSSALDMSIIG------ELSSSRTDDEVMAMYESWLVKHG 54
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K+ N +G EKRFQIFKDNLRFIDEHN+ +RTYKVGLN+FADLTN+EYR+MYLG R+ ++
Sbjct: 55 KSYNAIGEKEKRFQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYRSMYLGARTGSR 114
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
RRL K S RY AG+ LP+SVDWREKGAV VKDQGSCGSCWAFST+AAVEGIN+I
Sbjct: 115 RRLSTQK-RSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIAAVEGINQI 173
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG+LISLSEQELVDCD N GCNGGLMDYAF+FII+NGG+D+E+DYPY + +CD
Sbjct: 174 VTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYNARDGRCDQY 233
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
R+NAKVV+ID YEDV +E +L+KAVA+QPVSVAIEA G AFQ YESGVFTG CG+ALD
Sbjct: 234 RKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEASGMAFQFYESGVFTGNCGTALD 293
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
HGV AVGYGTEN VDYW+V+NSWGS WGE+GY++++RN TGKCGIA+E SYP+K S
Sbjct: 294 HGVTAVGYGTENSVDYWIVKNSWGSSWGESGYIRMERNT--GATGKCGIAVEPSYPIKTS 351
Query: 361 Q 361
Q
Sbjct: 352 Q 352
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 228/351 (64%), Positives = 283/351 (80%), Gaps = 7/351 (1%)
Query: 11 STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
+ ++FL I SSA DMSIISYD NH H+ S R+D EV +Y+ W+ KHGK N + +
Sbjct: 2 TVILFLAMIVVSSAMDMSIISYDKNH-HTVSSRSDVEVSRLYEEWVVKHGKAQNSLTEKD 60
Query: 71 KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
+RF+IFKDNLRFIDEHN N +Y++GL KFADLTN+EYR+MYLG+R KR+ K+ S
Sbjct: 61 RRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR--LKRKATKT---S 115
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
RY + GD +PESVDWR++GAV VKDQGSCGSCWAFST+ AVEGINKIVTG+LISLSE
Sbjct: 116 LRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSE 175
Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
QELVDCD N GCNGGLMDYAF+FII+NGG+D+E+DYPY G + +CD +R+NAKVV+ID
Sbjct: 176 QELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTID 235
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
YEDV E SLKKA++ QP+SVAIE GGRAFQ Y+SG+F G CG+ LDHGVVAVGYGT
Sbjct: 236 SYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT 295
Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
ENG DYW+V+NSWG+ WGE+GY++++RN+ ++ GKCGIA+E SYP+KN Q
Sbjct: 296 ENGKDYWIVKNSWGTSWGESGYIRMERNIA-SSAGKCGIAVEPSYPIKNGQ 345
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 229/351 (65%), Positives = 282/351 (80%), Gaps = 7/351 (1%)
Query: 11 STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
+ ++FL I SSA DMSIISYD NH H+ S R+D EV +Y+ WL KHGK N + +
Sbjct: 2 TVILFLTMIVVSSAMDMSIISYDKNH-HTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKD 60
Query: 71 KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
+RF+IFKDNLRFIDEHN N +Y++GL KFADLTN+EYR+MYLG+R KR+ KS S
Sbjct: 61 RRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR--LKRKATKS---S 115
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
RY + GD +PESVDWR++GAV VKDQGSCGSCWAFST+ AVEGINKIVTG+LI+LSE
Sbjct: 116 LRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSE 175
Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
QELVDCD N GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID
Sbjct: 176 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTID 235
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
YEDV E SLKKA++ QP+SVAIE GGRAFQ Y+SG+F G CG+ LDHGVVAVGYGT
Sbjct: 236 LYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT 295
Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
ENG DYW+V+NSWG+ WGE+GY++++RN+ ++ GKCGIA+E SYP+KN Q
Sbjct: 296 ENGKDYWIVKNSWGTSWGESGYIRMERNIA-SSAGKCGIAVEPSYPIKNGQ 345
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 229/351 (65%), Positives = 282/351 (80%), Gaps = 7/351 (1%)
Query: 11 STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
+ ++FL I SSA DMSIISYD NH H+ S R+D EV +Y+ WL KHGK N + +
Sbjct: 8 TVILFLTMIVVSSAMDMSIISYDKNH-HTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKD 66
Query: 71 KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
+RF+IFKDNLRFIDEHN N +Y++GL KFADLTN+EYR+MYLG+R KR+ KS S
Sbjct: 67 RRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR--LKRKATKS---S 121
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
RY + GD +PESVDWR++GAV VKDQGSCGSCWAFST+ AVEGINKIVTG+LI+LSE
Sbjct: 122 LRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSE 181
Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
QELVDCD N GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID
Sbjct: 182 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTID 241
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
YEDV E SLKKA++ QP+SVAIE GGRAFQ Y+SG+F G CG+ LDHGVVAVGYGT
Sbjct: 242 LYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT 301
Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
ENG DYW+V+NSWG+ WGE+GY++++RN+ ++ GKCGIA+E SYP+KN Q
Sbjct: 302 ENGKDYWIVKNSWGTSWGESGYIRMERNIA-SSAGKCGIAVEPSYPIKNGQ 351
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 232/363 (63%), Positives = 286/363 (78%), Gaps = 6/363 (1%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA S L IS L+ L F + SSA+DMSIISYD H H R+DDEV +Y++WL +HG
Sbjct: 1 MAAHSSTLTISLLLMLIFSTLSSASDMSIISYDETHIHH---RSDDEVSALYESWLIEHG 57
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G +KRFQIFKDNL++IDE NS+ N++YK+GL KFADLTNEEYR++YLGT+S
Sbjct: 58 KSYNALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSG 117
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
RR + SK S RY K GD LPESVDWR+KG + VKDQGSCGSCWAFS VAA+E IN
Sbjct: 118 DRRKL-SKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINA 176
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
IVTG LISLSEQELVDCD+ N GC+GGLMDYAF+F+I NGG+D+E+DYPY + CD
Sbjct: 177 IVTGNLISLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQ 236
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
R+NAKVV ID YEDV +E +L+KAVA QPVS+AIEAGGR QHY+SG+FTG+CG+A+
Sbjct: 237 YRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAV 296
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
DHGVVA GYG+ENG+DYW+VRNSWG+ WGE GY+++QRN+ +++G CG+A E SYPVK
Sbjct: 297 DHGVVAAGYGSENGMDYWIVRNSWGAKWGEKGYLRVQRNVA-SSSGLCGLATEPSYPVKT 355
Query: 360 SQN 362
N
Sbjct: 356 GAN 358
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 229/340 (67%), Positives = 278/340 (81%), Gaps = 7/340 (2%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSIISYD H R+++E+ +Y+ WLAKHG+ N +G E+RF+IFKDN+RFID
Sbjct: 24 DMSIISYDEAHGVQGLERSEEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDA 83
Query: 86 HN----SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN S +R++++GLN+FAD+TNEEYR +YLGTR + RR ++++ S RY AG+EL
Sbjct: 84 HNAAADSGHRSFRLGLNRFADMTNEEYRTVYLGTRPASHRR--RARLGSDRYRYNAGEEL 141
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR+KGAV VKDQGSCGSCWAFST+AAVEGINKIVTG+LISLSEQELVDCD N
Sbjct: 142 PESVDWRDKGAVTTVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQN 201
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF+FII NGG+D+E+DYPY + KCD R+NAKVVSIDGYEDV DE
Sbjct: 202 QGCNGGLMDYAFEFIINNGGIDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEK 261
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KAVA+QPVSVAIEAGGR FQ Y SG+FTG CG+ LDHGVVAVGYGTENG DYW+VRN
Sbjct: 262 ALQKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRN 321
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
SWG DWGE+GY++++RN ++ +TGKCGIAME+SYP K Q
Sbjct: 322 SWGGDWGESGYIRMERN-VNASTGKCGIAMESSYPTKKGQ 360
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 483 bits (1244), Expect = e-134, Method: Compositional matrix adjust.
Identities = 224/357 (62%), Positives = 274/357 (76%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M ++ L + S A DM IISYD H S+ RT+D+V+T+Y+ WL KHGK N
Sbjct: 1 MLSKLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNA 60
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
+G EKRF+IFKDNL FIDEHNS N ++++GLN+FADLTNEEYR +LGTR + RR K
Sbjct: 61 LGEKEKRFEIFKDNLGFIDEHNSKNLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRK 120
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+ RYA + GD+LPESVDWR++GAV VKDQGSCGSCWAFS +AAVEG+NK+ TG+L
Sbjct: 121 VNSQTNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDL 180
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
ISLSEQELVDCD N GCNGGLMDYAF+FII + E+DYPY + +CD +R+NAK
Sbjct: 181 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNRKNAK 240
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VVSID YEDV +DE +LKKAVA+Q ++VA+E GGR FQ Y+SGVFTG CG+ALDHGV A
Sbjct: 241 VVSIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAA 300
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
VGYGTENG DYW+VRNSWG WGE GY++L+RNL + +GKCGIA+E SYP+KN N
Sbjct: 301 VGYGTENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPIKNGLN 357
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 483 bits (1243), Expect = e-134, Method: Compositional matrix adjust.
Identities = 226/320 (70%), Positives = 264/320 (82%), Gaps = 2/320 (0%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
R D+EV +Y++WL HGK N +G E+RF+IFKDNLRFIDEHN +RTYKVGL +FAD
Sbjct: 53 RPDEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFAD 112
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LTNEEYRA +LG R K RL +K S RYA GD+LP+ VDWR+KGAV VKDQG C
Sbjct: 113 LTNEEYRARFLGGRFSRKPRLSAAK--SGRYAAALGDDLPDDVDWRKKGAVATVKDQGQC 170
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFS+VAAVEGIN+IVTGELI LSEQELVDCD+ N GCNGGLMDYAFQFII NGG+
Sbjct: 171 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGI 230
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
D+E+DYPY G + CDP+R+NAKVV+IDGYEDV DE SLKKAVA+QPVSVAIEAGGRA
Sbjct: 231 DTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRA 290
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
FQ Y+SGVFTG CG+ LDHGVVAVGYGT+NG DYW+VRNSWG DWGE+GY++L+RN+ +
Sbjct: 291 FQLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANI 350
Query: 343 NTGKCGIAMEASYPVKNSQN 362
TGKCGIA++ SYP K+ N
Sbjct: 351 TTGKCGIAVQPSYPTKSGAN 370
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 229/341 (67%), Positives = 276/341 (80%), Gaps = 7/341 (2%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSIISYD H R+++E+ +Y+ WLAKHG+ N +G E+RF+IFKDN+ FID
Sbjct: 24 DMSIISYDEAHGVRGLERSEEEMRILYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDA 83
Query: 86 HNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ +R++++GLN+FAD+TNEEYRA+YLGTR RR +++V S RY AG++L
Sbjct: 84 HNAAADAGHRSFRLGLNRFADMTNEEYRAVYLGTRPAGHRR--RARVGSDRYRYNAGEDL 141
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV VKDQGSCGSCWAFSTVAAVEGINKIVTG+LISLSEQELVDCD N
Sbjct: 142 PESVDWRAKGAVAAVKDQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYN 201
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDY F+FII NGG+D+E+DYPY + KCD R+NAKVVSIDGYEDV DE
Sbjct: 202 QGCNGGLMDYGFEFIINNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEK 261
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KAVA+QPVSVAIEAGGR FQ Y SG+FTG CG+ LDHGVVAVGYGTENG DYW+VRN
Sbjct: 262 ALQKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRN 321
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG DWGE+GY++++RN ++T+TGKCGIA+E SYP K QN
Sbjct: 322 SWGGDWGESGYIRMERN-VNTSTGKCGIAIEPSYPTKKGQN 361
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 235/361 (65%), Positives = 285/361 (78%), Gaps = 12/361 (3%)
Query: 7 FLAISTLVFLF-FISSSSAADMSIISYDNNHDHSS-SWRTDDEVMTIYQTWLAKHGK--- 61
FL +S ++ L I S A DMSIISYD NH ++ + R+D EV IY+ W+ +HGK
Sbjct: 3 FLKLSPMILLLAMIGVSYAMDMSIISYDENHHITTETSRSDSEVERIYEAWMVEHGKKKM 62
Query: 62 TSNGMG-HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
NG+G ++RF+IFKDNLRFIDEHN+ N +YK+GL +FADLTNEEYR+MYLG +
Sbjct: 63 NQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLGAK--PT 120
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+R++K+ S RY + GD LP+SVDWR++GAV VKDQGSCGSCWAFST+ AVEGINKI
Sbjct: 121 KRVLKT---SDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKI 177
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG+LISLSEQELVDCD N GCNGGLMDYAF+FII+NGG+D+E DYPY A+ +CD +
Sbjct: 178 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQN 237
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
R+NAKVV+ID YEDV E SLKKA+A QP+SVAIEAGGRAFQ Y SGVF G CG+ LD
Sbjct: 238 RKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGLCGTELD 297
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
HGVVAVGYGTENG DYW+VRNSWG+ WGE+GY+K+ RN ++ TGKCGIAMEASYP+K
Sbjct: 298 HGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARN-IEAPTGKCGIAMEASYPIKKG 356
Query: 361 Q 361
Q
Sbjct: 357 Q 357
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 479 bits (1233), Expect = e-133, Method: Compositional matrix adjust.
Identities = 226/353 (64%), Positives = 281/353 (79%), Gaps = 8/353 (2%)
Query: 12 TLVFLFFISSSSAADMSIISYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGKTSN--GMGH 68
++FL ++ +SA DMSIISYD H S++ R+D EVM+IY+ WL KHGK N +
Sbjct: 2 VILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAWLVKHGKAQNQNSLVE 61
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
++RF+IFKDNLRFID+HN N +Y++GL +FADLTN+EYR+ YLG + + K +
Sbjct: 62 KDRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKK----GERR 117
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
SQRY + GDELPES+DWR+KGAV VKDQGSCGSCWAFST+ AVEGIN+IVTG+LI+L
Sbjct: 118 TSQRYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLITL 177
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQELVDCD N GCNGGLMDYAF+FII+NGG+D+++DYPY G + CD R+NAKVV+
Sbjct: 178 SEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVT 237
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
ID YEDV + E SLKKAVA QPVSVAIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGY
Sbjct: 238 IDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFDGTCGTQLDHGVVAVGY 297
Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
GTENG DYW+VRNSWG WGE+GY+K+ RN+ +++GKCGIA+E SYP+KN +
Sbjct: 298 GTENGKDYWIVRNSWGKSWGESGYLKMARNIA-SSSGKCGIAIEPSYPIKNGE 349
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 479 bits (1233), Expect = e-133, Method: Compositional matrix adjust.
Identities = 233/359 (64%), Positives = 286/359 (79%), Gaps = 11/359 (3%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSS-SWRTDDEVMTIYQTWLAKHGKT--SN 64
+ ++ L+ I S AADMSIISYD H ++ + R+D EV IY+ W+ KHGK SN
Sbjct: 4 VKVTILLLAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQSN 63
Query: 65 GMGHNEK--RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
G+ EK RF+IFKDNLRFIDEHN+ N +YK+GL +FADLTNEEYR++YLG +S K+R
Sbjct: 64 GLVGEEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNEEYRSIYLGAKS--KKR 121
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
++K+ S RY + GD +P+SVDWR++GAV VKDQGSCGSCWAFST+ AVEGINKIVT
Sbjct: 122 VLKT---SDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVT 178
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
G+LISLSEQELVDCD N GCNGGLMDYAF+FII+NGG+D+E+DYPY A+ +CD +R+
Sbjct: 179 GDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQTRK 238
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
NAKVV+ID YEDV +E +LKK +A+QP+SVAIEAGGRAFQ Y SGVF G CG+ LDHG
Sbjct: 239 NAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELDHG 298
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
VVAVGYGTENG DYW+VRNSWG WGE+GY+K+ RN+ + TGKCGIAMEASYP+K Q
Sbjct: 299 VVAVGYGTENGKDYWIVRNSWGGSWGESGYIKMARNIAEP-TGKCGIAMEASYPIKKGQ 356
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 235/361 (65%), Positives = 285/361 (78%), Gaps = 12/361 (3%)
Query: 7 FLAISTLVFLF-FISSSSAADMSIISYDNNHDHSS-SWRTDDEVMTIYQTWLAKHGK--- 61
FL +S ++ L I S A DMSIISYD NH S+ S R+D EV IY+ W+ +HGK
Sbjct: 3 FLKLSPMILLLAMIGVSYAIDMSIISYDENHHISTVSSRSDAEVERIYEAWMVEHGKKKM 62
Query: 62 TSNGMG-HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
NG+G ++RF+IFKDNLR+IDEHN+ N +YK+GL +FADLTN+EYR+MYLG +
Sbjct: 63 NQNGLGAEKDQRFEIFKDNLRYIDEHNTKNLSYKLGLTRFADLTNDEYRSMYLGAK--PV 120
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+R++K+ S RY + GD LP+SVDWR++GAV VKDQGSCGSCWAFST+ AVEGINKI
Sbjct: 121 KRVLKT---SDRYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKI 177
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG+LISLSEQELVDCD N GCNGGLMDYAF+FII+NGG+D+E DYPY A+ +CD +
Sbjct: 178 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQN 237
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
R+NAKVV+ID YEDV E SLKKA+A QP+SVAIEAGGRAFQ Y SGVF G CG+ LD
Sbjct: 238 RKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGICGTELD 297
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
HGVVAVGYGTENG DYW+VRNSWG+ WGE+GY+K+ RN+ + TGKCGIAMEASYP+K
Sbjct: 298 HGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIAEP-TGKCGIAMEASYPIKKG 356
Query: 361 Q 361
Q
Sbjct: 357 Q 357
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 221/311 (71%), Positives = 257/311 (82%)
Query: 49 MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
M++Y+ WL KHGK N +G +KRF IFKDNLRFID+HN+ NRTYK+GLN+FADLTNEEY
Sbjct: 1 MSLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEY 60
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
RA YLGTR D RR +K+K S RYA + GD LPESVDWR + AV PVKDQG+CGSCWAF
Sbjct: 61 RARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAF 120
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
ST+ AVEGINKIVTG+LISLSEQELVDCD N GCNGGLMDYA++FII NGG+DSE+DY
Sbjct: 121 STIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEEDY 180
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY + CD R+NAKVV+ID YEDV DE++LKKAVA+QPVSVAIE GGR FQ Y S
Sbjct: 181 PYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYVS 240
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
GVFTG CG+ALDHGVVAVGYG+ G DYW+VRNSWG+ WGE GYV+L+RNL + +GKCG
Sbjct: 241 GVFTGRCGTALDHGVVAVGYGSVKGHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGKCG 300
Query: 349 IAMEASYPVKN 359
IA+E SYP+KN
Sbjct: 301 IAIEPSYPIKN 311
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 476 bits (1224), Expect = e-132, Method: Compositional matrix adjust.
Identities = 231/344 (67%), Positives = 279/344 (81%), Gaps = 6/344 (1%)
Query: 24 AADMSIISYDNNHDH-SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRF 82
A DMSIISYD+NH+ SS R+DDEVM IY++WL +H K N +G EKRF IFKDNL F
Sbjct: 24 AVDMSIISYDHNHNLLPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEF 83
Query: 83 IDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLG----TRSDAKRRLMKSKVASQRYACKA 137
ID+HNS + +T+KVGLNKFADLTNEE+R++YLG + S KSKV S RY K
Sbjct: 84 IDQHNSDDSQTFKVGLNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKE 143
Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
GDELPE+VDWR+ GAV VKDQG CGSCWAFST+AAVEGIN+IVTGEL+SLSEQELVDCD
Sbjct: 144 GDELPEAVDWRKNGAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCD 203
Query: 198 RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
N+GC+GGLMDYA++FII NGG+D++ DYPY + KCD R+NAKVV+ID +EDV
Sbjct: 204 TSYNSGCDGGLMDYAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPE 263
Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
DE +L+KAVA QPVSVAIEAGG FQ Y+SGVFTG+CG+ LDHGVVAVGYG+++G DYW
Sbjct: 264 NDEKALQKAVAHQPVSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYGSDDGKDYW 323
Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
+VRNSWG+DWGE+GY++++RNL TGKCGIA+E SYP+KNSQ
Sbjct: 324 IVRNSWGADWGESGYIRMERNLETVKTGKCGIAIEPSYPIKNSQ 367
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 221/352 (62%), Positives = 280/352 (79%), Gaps = 8/352 (2%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGK--TSNGMGHN 69
++FL ++ SSA DMSIISYD H S++ R++ EVM+IY+ WL KHGK + N +
Sbjct: 10 ILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEK 69
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
++RF+IFKDNLRF+DEHN N +Y++GL +FADLTN+EYR+ YLG + + K +
Sbjct: 70 DRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKK----GERRT 125
Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
S RY + GDELPES+DWR+KGAV VKDQG CGSCWAFST+ AVEGIN+IVTG+LI+LS
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
EQELVDCD N GCNGGLMDYAF+FII+NGG+D+++DYPY G + CD R+NAKVV+I
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
D YEDV + E SLKKAVA QP+S+AIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGYG
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG 305
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
TENG DYW+VRNSWG WGE+GY+++ RN+ +++GKCGIA+E SYP+KN +
Sbjct: 306 TENGKDYWIVRNSWGKSWGESGYLRMARNIA-SSSGKCGIAIEPSYPIKNGE 356
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 221/352 (62%), Positives = 280/352 (79%), Gaps = 8/352 (2%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGK--TSNGMGHN 69
++FL ++ SSA DMSIISYD H S++ R++ EVM+IY+ WL KHGK + N +
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEK 69
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
++RF+IFKDNLRF+DEHN N +Y++GL +FADLTN+EYR+ YLG + + K +
Sbjct: 70 DRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKK----GERRT 125
Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
S RY + GDELPES+DWR+KGAV VKDQG CGSCWAFST+ AVEGIN+IVTG+LI+LS
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
EQELVDCD N GCNGGLMDYAF+FII+NGG+D+++DYPY G + CD R+NAKVV+I
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
D YEDV + E SLKKAVA QP+S+AIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGYG
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG 305
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
TENG DYW+VRNSWG WGE+GY+++ RN+ +++GKCGIA+E SYP+KN +
Sbjct: 306 TENGKDYWIVRNSWGKSWGESGYLRMARNIA-SSSGKCGIAIEPSYPIKNGE 356
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 473 bits (1218), Expect = e-131, Method: Compositional matrix adjust.
Identities = 221/352 (62%), Positives = 280/352 (79%), Gaps = 8/352 (2%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGK--TSNGMGHN 69
++FL ++ SSA DMSIISYD H S++ R++ EVM+IY+ WL KHGK + N +
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEK 69
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
++RF+IFKDNLRF+DEHN N +Y++GL +FADLTN+EYR+ YLG + + K +
Sbjct: 70 DRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKK----GERRT 125
Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
S RY + GDELPES+DWR+KGAV VKDQG CGSCWAFST+ AVEGIN+IVTG+LI+LS
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
EQELVDCD N GCNGGLMDYAF+FII+NGG+D+++DYPY G + CD R+NAKVV+I
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
D YEDV + E SLKKAVA QP+S+AIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGYG
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG 305
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
TENG DYW+VRNSWG WGE+GY+++ RN+ +++GKCGIA+E SYP+KN +
Sbjct: 306 TENGKDYWIVRNSWGKSWGESGYLRMARNIA-SSSGKCGIAIEPSYPIKNGE 356
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 473 bits (1217), Expect = e-131, Method: Compositional matrix adjust.
Identities = 222/342 (64%), Positives = 276/342 (80%), Gaps = 6/342 (1%)
Query: 22 SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLR 81
++A DMSII+YD H + ++TDDE T++++WL HGK+ N +G EKRFQIFK+NLR
Sbjct: 17 AAATDMSIITYDETH--AVGFKTDDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLR 74
Query: 82 FIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
+IDE N + +R +K+GLNKFADLTNEEYR+ Y G +S R+ + +K S RYA +G+
Sbjct: 75 YIDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVSAK--SGRYATLSGES 132
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LPESVDWRE GAV VKDQGSCGSCWAFST++AVEGIN+I TG+LI+LSEQELVDCDR
Sbjct: 133 LPESVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSY 192
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
N GCNGGLMDYAF+FII NGG+D++ DYPY G + KCD R+NAKVV+ID YEDV +DE
Sbjct: 193 NEGCNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDE 252
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
++LKKA A+QP+SVAIEA GR FQ Y+SG+FTG+CG ALDHGVV VGYGTENG DYW+VR
Sbjct: 253 LALKKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVR 312
Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
NSWG+DWGENGY++++R + + TG CGIA+E SYPVK N
Sbjct: 313 NSWGADWGENGYLRMERG-ISSKTGICGIAIEPSYPVKTGVN 353
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 473 bits (1217), Expect = e-131, Method: Compositional matrix adjust.
Identities = 234/358 (65%), Positives = 282/358 (78%), Gaps = 10/358 (2%)
Query: 8 LAISTLVFLFFISSSSAA-----DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
+A +++F F + SAA DMSII+YD H R++DEV ++++WL KHGK+
Sbjct: 1 MARPSILFTFLFAVVSAAAAAAEDMSIITYDQQHPAKGLVRSEDEVKEMFESWLVKHGKS 60
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
N + +KRF+IF+DNL++IDE NSL NR+YK+GLN+FAD+TNEEYR YLG + DA R
Sbjct: 61 YNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEEYRTGYLGAKRDASR 120
Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
++KSK S RYA AGD LP+S+DWREKGAV VKDQGSCGSCWAFST+AAVEG+N++
Sbjct: 121 NMVKSK--SDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAFSTIAAVEGVNQLA 178
Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
TG LISLSEQELVDCDRKIN GCNGG M YAFQFII+NGG+DSE+DYPY G + KCD R
Sbjct: 179 TGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKNGGIDSEEDYPYTGKDGKCDSYR 238
Query: 242 R-NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ NAKV SIDGYE+V +E SL+KAVA+QPVSVAIEAGG FQ Y SG+FTG CG+ LD
Sbjct: 239 QNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDFQLYSSGIFTGSCGTDLD 298
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
HGV AVGYGTENGVDYW+V+NSWG WGE GYV++QRN + TG CGIAMEASYP K
Sbjct: 299 HGVAAVGYGTENGVDYWIVKNSWGDYWGEKGYVRMQRN-VKAKTGLCGIAMEASYPTK 355
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 227/356 (63%), Positives = 277/356 (77%), Gaps = 7/356 (1%)
Query: 23 SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH----NEKRFQIFKD 78
++ D SII+ WRTD+EV +IY W A+HGKT+N +KRF IFKD
Sbjct: 20 ASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKD 79
Query: 79 NLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY-AC 135
NLRFID HN N+ TYK+GL KF DLTN+EYR +YLG R++ RR+ K+K +Q+Y A
Sbjct: 80 NLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAA 139
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
G E+PE+VDWR+KGAVNP+KDQG+CGSCWAFST AAVEGINKIVTGELISLSEQELVD
Sbjct: 140 VNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVD 199
Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
CD+ N GCNGGLMDYAFQFI++NGG+++E+DYPY G KC+ +N++VVSIDGYEDV
Sbjct: 200 CDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDV 259
Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
DE +LKKA++ QPVSVAIEAGGR FQHY+SG+FTG CG+ LDH VVAVGYG+ENGVD
Sbjct: 260 PTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVD 319
Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
YW+VRNSWG WGE GY++++RNL + +GKCGIA+EASYPVK S N + SS
Sbjct: 320 YWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVRGNTISS 375
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 227/356 (63%), Positives = 277/356 (77%), Gaps = 7/356 (1%)
Query: 23 SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH----NEKRFQIFKD 78
++ D SII+ WRTD+EV +IY W A+HGKT+N +KRF IFKD
Sbjct: 20 ASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKD 79
Query: 79 NLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY-AC 135
NLRFID HN N+ TYK+GL KF DLTN+EYR +YLG R++ RR+ K+K +Q+Y A
Sbjct: 80 NLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAA 139
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
G E+PE+VDWR+KGAVNP+KDQG+CGSCWAFST AAVEGINKIVTGELISLSEQELVD
Sbjct: 140 VNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVD 199
Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
CD+ N GCNGGLMDYAFQFI++NGG+++E+DYPY G KC+ +N++VVSIDGYEDV
Sbjct: 200 CDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDV 259
Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
DE +LKKA++ QPVSVAIEAGGR FQHY+SG+FTG CG+ LDH VVAVGYG+ENGVD
Sbjct: 260 PTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVD 319
Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
YW+VRNSWG WGE GY++++RNL + +GKCGIA+EASYPVK S N + SS
Sbjct: 320 YWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVRGNTISS 375
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 219/323 (67%), Positives = 260/323 (80%), Gaps = 3/323 (0%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGL 97
+S+ RTD+EV Y+ WLA+HGKT N +G E RF+IF DNL+FIDEHN S NR+YKVGL
Sbjct: 23 TSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGL 82
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRL--MKSKVASQRYACKAGDELPESVDWREKGAVNP 155
N+FADLTNEEYR+MYLGT+ D RR+ M+ S+RYA + + P VDWRE+GAV+P
Sbjct: 83 NQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVSP 142
Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
VK+QG CGSCWAFSTVA+VEGINKIVTG+LISLSEQELVDCD K N+GCNGG MDYAFQF
Sbjct: 143 VKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQF 202
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
I+ NGG+DSE DYPY G CDP R AK+VSIDGYEDV P +E +L KAVA QPVSV
Sbjct: 203 IVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSVG 262
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
IEA GRAFQ Y SGV TG CG+ LDHGVV VGYG+ENG DYW+VRNSWG +WGE+GY+++
Sbjct: 263 IEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGSENGKDYWIVRNSWGPEWGEDGYIRM 322
Query: 336 QRNLLDTNTGKCGIAMEASYPVK 358
+RN++DT G CGI + ASYP+K
Sbjct: 323 ERNMVDTPVGMCGITLMASYPIK 345
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 226/350 (64%), Positives = 273/350 (78%), Gaps = 7/350 (2%)
Query: 23 SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH----NEKRFQIFKD 78
++ D SII+ S WRTD+EV +IY W A HGKT+N +KRF IFKD
Sbjct: 20 ASGDESIINDHLQLPSDSWWRTDEEVRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKD 79
Query: 79 NLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
NLRFID HN N+ TYK+GL KF DLTNEEYR++YLG R++ RR+ K+K +Q+Y+
Sbjct: 80 NLRFIDLHNEKNKNATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAKAKNVNQKYSAA 139
Query: 137 A-GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
G E+PE+VDWR KGAVNP+KDQG+CGSCWAFST AAVEGINKIVTGELISLSEQELVD
Sbjct: 140 VDGKEVPETVDWRLKGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVD 199
Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
CD N GCNGGLMDYAFQFI++NGG+ +E+DYPY G KC+ +NAKVVSIDGYEDV
Sbjct: 200 CDNSYNQGCNGGLMDYAFQFIMKNGGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDV 259
Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
DE +LK+A++ QPVSVAIEAGGR FQHY++G+FTG CG+ LDH VVAVGYG+ENGVD
Sbjct: 260 PTKDETALKRAISLQPVSVAIEAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYGSENGVD 319
Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
YW+VRNSWG WGE GY++++RNL + +GKCGIA+EASYPVK S N +
Sbjct: 320 YWIVRNSWGPRWGEEGYIRMERNLASSKSGKCGIAVEASYPVKYSPNPVR 369
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 225/357 (63%), Positives = 276/357 (77%), Gaps = 7/357 (1%)
Query: 23 SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH----NEKRFQIFKD 78
++ D SII+ WRTD+EV +IY W A+HGKT+N +KRF IFKD
Sbjct: 20 ASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKD 79
Query: 79 NLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
NLRFID HN N+ TYK+GL KF DLTN+EYR +YLG R++ RR+ K+K +Q+Y+
Sbjct: 80 NLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAA 139
Query: 137 A-GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
G E+PE+VDWR+KGAVNP+KDQG+CGSCWAFST AAVEGINKIVTGELISLSEQELVD
Sbjct: 140 VNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVD 199
Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
CD+ N GCNGGLMDYAFQFI++NGG+++E+DYPY G KC+ +N++VVSIDGYEDV
Sbjct: 200 CDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDV 259
Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
DE +LKKA++ QPV VAIEAGGR FQHY+SG+FTG CG+ LDH VVAVGYG+ENGVD
Sbjct: 260 PTKDETALKKAISYQPVRVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVD 319
Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSSA 372
YW+VRNSWG WGE GY++++RNL + +GKCGIA+EASYPVK S N + SS
Sbjct: 320 YWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVRGNTISSV 376
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 226/360 (62%), Positives = 279/360 (77%), Gaps = 11/360 (3%)
Query: 6 MFLAISTLVFLF---FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
M + ST+ LF FI SSSA D+SII N R DDE+ ++Y+TWL KHGK
Sbjct: 1 MSTSKSTIFLLFSIIFIVSSSALDLSIIDRAFN-------RPDDEIASLYETWLVKHGKN 53
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
NG+G + RF IFKDNLRF+DE NS N ++K+GLN+FADLTNEEYR++YLGTR +
Sbjct: 54 YNGLGEKQLRFNIFKDNLRFVDERNSENLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAV 113
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
+ S RYA +AGD LPESVDWR+KGAV +KDQGSCGSCWAFS +AAVEG+N+IVT
Sbjct: 114 ARSGRSKSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVT 173
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
G+LISLSEQELV+CD N GC+GGLMDYAF+FII+N G+DS++DYPY G + +CD +R+
Sbjct: 174 GDLISLSEQELVECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRK 233
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
NAKVV+ID YED +DE SL+KAVA+QPVSVAIE GGR FQ Y+SGVFTG+CG+ALDHG
Sbjct: 234 NAKVVTIDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHG 293
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
V VGYGTE+G+DYW+VRNSWG WGE GY+++QRN +G CGIA+E SYP+K+ N
Sbjct: 294 VAVVGYGTEDGLDYWIVRNSWGDTWGEGGYIRMQRN-TKLPSGICGIAIEPSYPIKSGLN 352
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 467 bits (1201), Expect = e-129, Method: Compositional matrix adjust.
Identities = 223/337 (66%), Positives = 272/337 (80%), Gaps = 4/337 (1%)
Query: 24 AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
A+DMSII+YD H +S RTDDEVMT+Y +WL KHGK+ N +G E RFQIFKDNLR+I
Sbjct: 22 ASDMSIINYDQTHTNSLI-RTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYI 80
Query: 84 DEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
D HN+ +R+Y++GLN+FADLTNEEYRA YLGT+S R + SK S RYA G+ELP
Sbjct: 81 DNHNADPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKL-SKGPSDRYAPVEGEELP 139
Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
+S+DWREKGAV VKDQGSCGSCWAFS + AVEGIN+I TGELI+LSEQELVDCDR N
Sbjct: 140 DSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNE 199
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GC GGLMDYAF FII+NGG+DS+ DYPY G + C+ ++ NAKVV+ID YEDV +DE +
Sbjct: 200 GCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKA 259
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
L+KA A+QP+SVAIEAGG FQ Y SG+FTG+CG+A+DHGVV VGYG+E G+DYW+VRNS
Sbjct: 260 LQKAAANQPISVAIEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVRNS 319
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
WG+ WGE GY+K+QRN + ++G CGI +E SYPVKN
Sbjct: 320 WGAAWGEAGYLKMQRN-VGKSSGLCGITIEPSYPVKN 355
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 218/354 (61%), Positives = 275/354 (77%), Gaps = 21/354 (5%)
Query: 19 ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
+S ++AADMSI+SY R+++EV +Y W+A+HG T N +G E+RF+ F+D
Sbjct: 18 VSLAAAADMSIVSYGE--------RSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRD 69
Query: 79 NLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQR 132
NLR+ID+HN+ ++++GLN+FADLTNEEYR+ YLG R+ D +R+L S R
Sbjct: 70 NLRYIDQHNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SAR 123
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
Y DELPESVDWR+KGAV VKDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQE
Sbjct: 124 YQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQE 183
Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
LVDCD N GCNGGLMDYAF+FII NGG+DSE+DYPY +N+CD +++NAKVV+IDGY
Sbjct: 184 LVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGY 243
Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
EDV E SL+KAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYGTEN
Sbjct: 244 EDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTEN 303
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
G DYWLVRNSWGS WGE+GY++++RN + ++GKCGIA+E SYP K ++ P
Sbjct: 304 GKDYWLVRNSWGSVWGEDGYIRMERN-IKASSGKCGIAVEPSYPTKTARTPLTP 356
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 217/321 (67%), Positives = 255/321 (79%), Gaps = 2/321 (0%)
Query: 49 MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
MT+Y+ WL KH K NG+G + RFQIFKDNLRFIDEHN+ N +YKVGLNKFAD+ NEEY
Sbjct: 1 MTMYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQNYSYKVGLNKFADINNEEY 60
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
R MYLGT+SDAKRR+MK+K+ R + + VDWR KGAV +KDQGSCGSCWAF
Sbjct: 61 RDMYLGTKSDAKRRVMKTKITGHRITYNS-VIVTVKVDWRLKGAVTHIKDQGSCGSCWAF 119
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
ST+A VE INKIVTG+ +SLSEQELVDCDR N GCNGGLMDYAF+FII+NGG+D++QDY
Sbjct: 120 STIATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDY 179
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY G E KCDP+++NAKVVSIDGYEDV + +LKKAVA QPVSVAI GRA Q Y+S
Sbjct: 180 PYNGFERKCDPTKKNAKVVSIDGYEDVPSYMN-ALKKAVAHQPVSVAIAGLGRALQLYQS 238
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
GVFTG+CG+ LDHGVV VGYG+ENGVDYWLVRNSWG++WGE+GY K+ + + KCG
Sbjct: 239 GVFTGKCGTDLDHGVVVVGYGSENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCG 298
Query: 349 IAMEASYPVKNSQNSAKPKPH 369
IAMEASYPVK QN+ P
Sbjct: 299 IAMEASYPVKYGQNTNSAAPQ 319
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 223/354 (62%), Positives = 266/354 (75%), Gaps = 8/354 (2%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
+ L + L S+S AD SIISYD S DD +M +Y+ WLA+H K NG
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIISYD-----SQDLIGDDAIMELYELWLAQHKKAYNG 57
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
+ +K+F +FKDN +I +HN+ N +YK+GLN+FADL++EE++A YLGT+ DAK+RL
Sbjct: 58 LDEKQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLS 117
Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
+S S RY G++LPES+DWREKGAV VK+QGSCGSCWAFSTVAAVEGIN+IVTG
Sbjct: 118 RS--PSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGN 175
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
L SLSEQELVDCD N GCNGGLMDYAFQFII NGG+DSE DYPY CD R+NA
Sbjct: 176 LTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNA 235
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
VV+ID YEDV DE SLKKA A+QP+SVAIEA GRAFQ YESGVFT CG+ LDHGV
Sbjct: 236 HVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVT 295
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
VGYG+E+G+DYWLV+NSWG+ WGE G++KLQRNL +TG CGIAMEASYPVK
Sbjct: 296 LVGYGSESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPVK 349
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/341 (63%), Positives = 267/341 (78%), Gaps = 7/341 (2%)
Query: 23 SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRF 82
+AADMSII+YD H S TDD +M Y++WL KHGK+ N +G E+RFQIFKDN +
Sbjct: 18 TAADMSIITYDQTHAVGS---TDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLY 74
Query: 83 IDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
IDE N+ +R++K+GLN+FADLTNEEYR+ Y G R+ R+ + K SQRYA AG+ L
Sbjct: 75 IDEQNAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVSGK--SQRYASLAGESL 132
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWRE GAV VKDQG CGSCWAFST++AVEGIN+I TG+LI+LSEQELVDCDR N
Sbjct: 133 PESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYN 192
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMD AFQFII NGG+DS+ DYPY G + +CD R+NAKVV+ID YEDV +DE
Sbjct: 193 EGCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEK 252
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KA A+QP+SVAIEA GR FQ Y+SG+FTG+CG+ LDHGVV VGYGTENG DYW+VRN
Sbjct: 253 ALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRN 312
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG+DWGE GY++++R + + G CGI E SYPVK+ N
Sbjct: 313 SWGADWGEKGYLRMERG-ISSKAGICGITSEPSYPVKSGVN 352
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 211/324 (65%), Positives = 263/324 (81%), Gaps = 5/324 (1%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
R+DDEV +YQ W A+H ++ N + +E+R +IF+DNLRFID+HN+ ++++GL
Sbjct: 38 RSDDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLT 97
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
+FADLTNEEYR+ YLG R+ RR S V S RY ++ D+LP+S+DWR+KGAV VKD
Sbjct: 98 RFADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKD 157
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QGSCGSCWAFST+AAVEGIN IVTG+LISLSEQELVDCD N GCNGGLMDYAF+FII
Sbjct: 158 QGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIIS 217
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG+D+++DYPY G + CD R+NA VV+ID YEDV DE SL+KAVA+QPVSVAIEA
Sbjct: 218 NGGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEA 277
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
GGRAFQ YESG+FTG CG+ LDHGV A+GYG+ENG YW+V+NSWGSDWGE+GY++++RN
Sbjct: 278 GGRAFQLYESGIFTGYCGTELDHGVTAIGYGSENGKYYWIVKNSWGSDWGESGYIRMERN 337
Query: 339 LLDTNTGKCGIAMEASYPVKNSQN 362
+++ TGKCGIAMEASYP+KN QN
Sbjct: 338 -INSATGKCGIAMEASYPIKNGQN 360
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 456 bits (1173), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/350 (62%), Positives = 274/350 (78%), Gaps = 21/350 (6%)
Query: 19 ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
+S ++AADMSI+SY R+++EV +Y W+A+HG T N +G E+RF+ F+D
Sbjct: 18 VSLAAAADMSIVSYGE--------RSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRD 69
Query: 79 NLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQR 132
NLR+ID+HN+ ++++GLN+FADLTNEEYR+ YLG R+ D +R+L S R
Sbjct: 70 NLRYIDQHNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SAR 123
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
Y DELPESVDWR+KGAV VKDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQE
Sbjct: 124 YQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQE 183
Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
LVDCD N GCNGGLMDYAF+FII NGG+DSE+DYPY +N+CD +++NAKVV+IDGY
Sbjct: 184 LVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGY 243
Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
EDV E SL+KAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYGTEN
Sbjct: 244 EDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTEN 303
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
G DYWLVRNSWGS WGE+GY++++RN + ++GKCGIA+E SYP K +N
Sbjct: 304 GKDYWLVRNSWGSVWGEDGYIRMERN-IKASSGKCGIAVEPSYPTKTGEN 352
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 219/354 (61%), Positives = 262/354 (74%), Gaps = 8/354 (2%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
+ L + L S+S AD SII YD S R DD +M +Y+ WLA+H K NG
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIIGYD-----SKDLREDDAIMELYELWLAQHKKAYNG 57
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
+G + RF +FKDN +I +HN+ N +YK+GLN+FADL++EE++A YLG + D K+RL
Sbjct: 58 LGEKQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLS 117
Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
S S RY G++LPES+DWREKGAV VKDQGSCGSCWAFSTVAAVEGIN+IVTG
Sbjct: 118 NS--PSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGN 175
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
L SLSEQELVDCD N GCNGGLMDYAFQFII NGG+DSE DYPY + CD R+NA
Sbjct: 176 LTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNA 235
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
VV+ID YEDV DE SLKKA A+QP+SVAIEA GRAFQ YESGVFT CG+ LDHGV
Sbjct: 236 HVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVT 295
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
VGYG+E+G DYW+V+NSWG WGE G+++LQRN+ +TG CGIAMEASYP+K
Sbjct: 296 LVGYGSESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPLK 349
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 216/338 (63%), Positives = 260/338 (76%), Gaps = 9/338 (2%)
Query: 21 SSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNL 80
S+S AD SIIS S R DD +M +Y+ WLA+H + NG+ +KRF +FKDN
Sbjct: 18 SASRADFSIIS-------SKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNF 70
Query: 81 RFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
+I EHN NR+YK+GLN+FADL++EE++A YLG + D K+RL S+ S+RY G++
Sbjct: 71 LYIHEHNQGNRSYKLGLNQFADLSHEEFKATYLGAKLDTKKRL--SRPPSRRYQYSDGED 128
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LPES+DWREKGAV VKDQGSCGSCWAFSTVAAVEGIN+IVTG+LISLSEQELVDCD
Sbjct: 129 LPESIDWREKGAVTSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSY 188
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
N GCNGGLMDYAF+FII NGG+DSE+DYPY + CD R+NA VV+ID YEDV DE
Sbjct: 189 NQGCNGGLMDYAFEFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDE 248
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
SLKKA A+QP+SVAIEA GR FQ Y+SGVFT CG+ LDHGV VGYG+E+G DYW V+
Sbjct: 249 KSLKKAAANQPISVAIEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYGSESGTDYWTVK 308
Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NSWG WGE G+++LQRN+ +TG CGIAMEASYPVK
Sbjct: 309 NSWGKSWGEEGFIRLQRNIEVASTGMCGIAMEASYPVK 346
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 213/308 (69%), Positives = 253/308 (82%), Gaps = 5/308 (1%)
Query: 56 LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLG 114
L KH K N +G EKRF+IFKDNLRFIDEHN +N+++K+GLNKFADL+NEEY++M+LG
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
R R+ +S R+ GDELP+SVDWREKGAV PVKDQG CGSCWAFSTVAAV
Sbjct: 71 GRMVRDRKGFES----DRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAV 126
Query: 175 EGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
EGIN+I TG+LISLSEQELVDCD+ N GCNGG MDYAF+FI++NGG+D+E DYPY G +
Sbjct: 127 EGINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVD 186
Query: 235 NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
+CD +R+NAKVV+I+G+EDV DE SLKKAVA QPVSVAIEAGGRAFQ YESG+F G
Sbjct: 187 GQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGL 246
Query: 295 CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
CG+ LDHGVVAVGYGTE+G DYW+VRNSWG +WGENGY++L+RN+ TNTGKCGIAM+ S
Sbjct: 247 CGTDLDHGVVAVGYGTEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPS 306
Query: 355 YPVKNSQN 362
YP K N
Sbjct: 307 YPTKTGVN 314
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 212/314 (67%), Positives = 248/314 (78%), Gaps = 34/314 (10%)
Query: 49 MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
M +Y+ WL KHGK+ N +G E+RF+IFKDNLRFI+EHN++NRTYKVG
Sbjct: 1 MAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVG------------ 48
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
RY+ +AG++LPESVDWREKGAV PVKDQG+CGSCWAF
Sbjct: 49 ----------------------DRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAF 86
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
ST+AAVEGIN+I TG+LISLSEQELVDCD+ N GCNGGLMDYAF+FII NGG+DSE+DY
Sbjct: 87 STIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDY 146
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY A+ CDP+R+NA+VVSIDGYEDV DE SLKKAVA+QPVSVAIEAGGRAFQ Y+S
Sbjct: 147 PYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQS 206
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
GVFTG+CG+ LDHGVVAVGYGTEN VDYW+VRNSWG +WGE+GY+KL+RNL T TGKCG
Sbjct: 207 GVFTGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCG 266
Query: 349 IAMEASYPVKNSQN 362
IA+E SYP+KN QN
Sbjct: 267 IAIEPSYPIKNGQN 280
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 215/347 (61%), Positives = 270/347 (77%), Gaps = 21/347 (6%)
Query: 22 SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLR 81
++AADMSI+ Y R+++EV +Y W+A+H T N +G E+RF+ F++NLR
Sbjct: 20 AAAADMSIVFYGE--------RSEEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLR 71
Query: 82 FIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQRYAC 135
+ID+HN+ ++++GLN+FADLTNEEYR+ YLG R+ D +R+L S RY
Sbjct: 72 YIDQHNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SARYQA 125
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
DELPESVDWR+KGAV VKDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQELVD
Sbjct: 126 ADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVD 185
Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
CD N GCNGGLMDYAF+FII NGG+DSE+DYPY +N+CD +++NAKVV+IDGYEDV
Sbjct: 186 CDTSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDV 245
Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
E SL+KAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYGTENG D
Sbjct: 246 PVNSEKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKD 305
Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
YWLVRNSWGS WGENGY++++RN + ++GKCGIA+E SYP K +N
Sbjct: 306 YWLVRNSWGSVWGENGYIRMERN-IKASSGKCGIAVEPSYPTKTGEN 351
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 210/341 (61%), Positives = 270/341 (79%), Gaps = 17/341 (4%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+++EV +Y W+A++G+T N +G E+RF++F+DNLR++D+
Sbjct: 24 DMSIVSYGE--------RSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQ 75
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTNEEYR YLG R+ + ++ + S RY +EL
Sbjct: 76 HNAAADAGLHSFRLGLNRFADLTNEEYRDTYLGVRT----KPVRERRLSGRYQAADNEEL 131
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWREKGAV VKDQG CGSCWAFS +AAVEGIN+IVTG++I+LSEQELVDCD N
Sbjct: 132 PESVDWREKGAVAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYN 191
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF+FII NGG+DSE+DYPY +N+CD +++NAKVV+IDGYEDV E+
Sbjct: 192 QGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEL 251
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SLKKAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYG+ENG DYW+V+N
Sbjct: 252 SLKKAVANQPISVAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYGSENGKDYWIVKN 311
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG+ WGE+GYV+L+RN+ T +GKCGIA+E SYP+K N
Sbjct: 312 SWGTVWGEDGYVRLERNIKAT-SGKCGIAIEPSYPLKKGAN 351
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 211/337 (62%), Positives = 262/337 (77%), Gaps = 17/337 (5%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+D+E +Y W+A HG+T N +G E+R+Q+F+DNLR+ID
Sbjct: 28 DMSIVSYGE--------RSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA 79
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTN+EYRA YLG R+ R + + RY ++L
Sbjct: 80 HNAAADAGVHSFRLGLNRFADLTNDEYRATYLGART----RPQRERKLGARYHAADNEDL 135
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV VKDQGSCGSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 136 PESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYN 195
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV DE
Sbjct: 196 QGCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEK 255
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAVA+QPVSVAIEA G AFQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+N
Sbjct: 256 SLQKAVANQPVSVAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKN 315
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
SWGS WGE+GYV+++RN + ++GKCGIA+E SYP+K
Sbjct: 316 SWGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLK 351
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 209/343 (60%), Positives = 269/343 (78%), Gaps = 21/343 (6%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+++EV +Y W+++H +T N +G E+RF++F+DNLR+ID+
Sbjct: 23 DMSIVSYGE--------RSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQ 74
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQRYACKAGD 139
HN+ ++++GLN+FADLTNEEYR+ YLG R+ D +R+L S RY +
Sbjct: 75 HNAAADAGLHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SARYQADDNE 128
Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
ELPE+VDWR+KGAV +KDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQELVDCD
Sbjct: 129 ELPETVDWRKKGAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS 188
Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
N GCNGGLMDYAF+FII NGG+DSE+DYPY +N+CD +++NAKVV+IDGYEDV
Sbjct: 189 YNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNS 248
Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
E SL+KAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYGTENG DYWLV
Sbjct: 249 EKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLV 308
Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
RNSWG+ WGE+GY++++RN + ++GKCGIA+E SYP K +N
Sbjct: 309 RNSWGTVWGEDGYIRMERN-IKASSGKCGIAVEPSYPTKTGEN 350
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 213/362 (58%), Positives = 278/362 (76%), Gaps = 16/362 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MAT+ + ++ L+F + S S ++ + + R + E +Y+ WL ++
Sbjct: 1 MATSIKSITLALLIFSVLLISLSLGSVT---------ATETTRNEAEARRMYERWLVENR 51
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K NG+G E+RF+IFKDNL+F++EH+S+ NRTY+VGL +FADLTN+E+RA+YL RS
Sbjct: 52 KNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYL--RSKM 109
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+R + V ++Y K GD LP+++DWR KGAVNPVKDQGSCGSCWAFS + AVEGIN+
Sbjct: 110 ER--TRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQ 167
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE-NKCD 238
I TGELISLSEQELVDCD N GC GGLMDYAF+FII+NGG+D+E+DYPY+ + N C+
Sbjct: 168 IKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCN 227
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
++N +VV+IDGYEDV DE SLKKA+A+QP+SVAIEAGGRAFQ Y SGVFTG CG++
Sbjct: 228 SDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTS 287
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
LDHGVVAVGYG+E G DYW+VRNSWGS+WGE+GY KL+RN+ ++ +GKCG+AM ASYP K
Sbjct: 288 LDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKES-SGKCGVAMMASYPTK 346
Query: 359 NS 360
+S
Sbjct: 347 SS 348
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 218/350 (62%), Positives = 265/350 (75%), Gaps = 15/350 (4%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG-H 68
I L+F FI+ S+A+ SII RTDDEVM +Y W AKHGK N +G
Sbjct: 9 IMALLFFLFIALSAASPSSIIPQ----------RTDDEVMALYDQWRAKHGKLHNNLGAE 58
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
E RF IFKDNL+FIDE N+ N Y++GLN FADLTNEEYR+ YLG + + R ++
Sbjct: 59 PENRFHIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRT-- 116
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
S RY + GD+LP+S+DWR KGAV PVKDQGSCGSCWAFSTVA+VE IN+IVTG+LI+L
Sbjct: 117 -SNRYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIAL 175
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQELVDCDR N GCNGGLMDYAF+FII+NGG+D+E+DYPY G ++ C ++NAKVV+
Sbjct: 176 SEQELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVA 235
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
ID YEDV +E +L+KAV+ Q VSVAIE GGR+FQ Y+SG+FTG CG+ LDHGV VGY
Sbjct: 236 IDSYEDVPVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGY 295
Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
G+E GVDYW+VRNSWG WGE+GYVK+QRN+ + TG CGIAME SYP K
Sbjct: 296 GSEGGVDYWIVRNSWGGSWGESGYVKMQRNIA-SPTGLCGIAMEPSYPTK 344
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 209/340 (61%), Positives = 263/340 (77%), Gaps = 17/340 (5%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
MSI+SY R+++E +Y W+A HG+T N +G E+RF++F+DNLR++D H
Sbjct: 29 MSIVSYGE--------RSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAH 80
Query: 87 NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
N+ ++++GLN+FADLTN+EYRA YLG RS R + + RY ++LP
Sbjct: 81 NAAADAGVHSFRLGLNRFADLTNDEYRATYLGVRS----RPQRERRLGDRYLAGDNEDLP 136
Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
ESVDWR KGAV VKDQGSCGSCWAFST+AAVEGIN+IVTG++ISLSEQELVDCD N
Sbjct: 137 ESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQ 196
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV E S
Sbjct: 197 GCNGGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKS 256
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
L+KAVA+QP+SVAIEAGGRAFQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+NS
Sbjct: 257 LQKAVANQPISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNS 316
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
WGS WGE+GYV+++RN + ++GKCGIA+E SYP+K N
Sbjct: 317 WGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGAN 355
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 211/341 (61%), Positives = 261/341 (76%), Gaps = 17/341 (4%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+++E +Y W A+HGK+ N +G E+R+ F+DNLR+IDE
Sbjct: 22 DMSIVSYGE--------RSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE 73
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTNEEYR YLG R+ +R + S RY + L
Sbjct: 74 HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV +KDQG CGSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 130 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 189
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P E
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 249
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 250 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 309
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG WGE+GYV+++RN + ++GKCGIA+E SYP+K +N
Sbjct: 310 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 349
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 208/340 (61%), Positives = 263/340 (77%), Gaps = 17/340 (5%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
MSI+SY R+++E +Y W+A HG+T N +G E+RF++F+DNLR++D H
Sbjct: 29 MSIVSYGE--------RSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAH 80
Query: 87 NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
N+ ++++GLN+FADLTN+EYRA YLG RS R + + RY ++LP
Sbjct: 81 NAAADAGVHSFRLGLNRFADLTNDEYRATYLGVRS----RPQRERRLGDRYLAGDNEDLP 136
Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
ESVDWR KGAV +KDQGSCGSCWAFST+AAVEGIN+IVTG++ISLSEQELVDCD N
Sbjct: 137 ESVDWRAKGAVAEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQ 196
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV E S
Sbjct: 197 GCNGGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKS 256
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
L+KAVA+QP+SVAIEAGGRAFQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+NS
Sbjct: 257 LQKAVANQPISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNS 316
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
WGS WGE+GYV+++RN + ++GKCGIA+E SYP+K N
Sbjct: 317 WGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGAN 355
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 211/341 (61%), Positives = 261/341 (76%), Gaps = 17/341 (4%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+++E +Y W A+HGK+ N +G E+R+ F+DNLR+IDE
Sbjct: 23 DMSIVSYGE--------RSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE 74
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTNEEYR YLG R+ +R + S RY + L
Sbjct: 75 HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 130
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV +KDQG CGSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 131 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 190
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P E
Sbjct: 191 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 250
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 251 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 310
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG WGE+GYV+++RN + ++GKCGIA+E SYP+K +N
Sbjct: 311 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 350
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 442 bits (1138), Expect = e-122, Method: Compositional matrix adjust.
Identities = 211/341 (61%), Positives = 260/341 (76%), Gaps = 17/341 (4%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+++E +Y W A+HGK N +G E+R+ F+DNLR+IDE
Sbjct: 22 DMSIVSYGE--------RSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDE 73
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTNEEYR YLG R+ +R + S RY + L
Sbjct: 74 HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV +KDQG CGSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 130 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 189
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P E
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 249
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 250 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 309
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG WGE+GYV+++RN + ++GKCGIA+E SYP+K +N
Sbjct: 310 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 349
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 206/324 (63%), Positives = 265/324 (81%), Gaps = 8/324 (2%)
Query: 49 MTIYQTWLAKHGKT---SNGM-GHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFAD 102
M+IY W +HGK+ SNG+ ++RF IFKDNLRFID HN N+ TYK+GL FA+
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD-ELPESVDWREKGAVNPVKDQGS 161
LTN+EYR++YLG R++ RR+ K+K + +Y+ D E+P +VDWR+KGAVN +KDQG+
Sbjct: 61 LTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGT 120
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFST AAVEGINKIVTGEL+SLSEQELVDCD+ N GCNGGLMDYAFQFI++NGG
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+++E+DYPY G KC+ +N++VV+IDGYEDV DE +LK+AV+ QPVSVAI+AGGR
Sbjct: 181 LNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
AFQHY+SG+FTG+CG+ +DH VVAVGYG+ENGVDYW+VRNSWG+ WGE+GY++++RN+
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA- 299
Query: 342 TNTGKCGIAMEASYPVKNSQNSAK 365
+ +GKCGIA+EASYPVK S N +
Sbjct: 300 SKSGKCGIAIEASYPVKYSPNPVR 323
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 206/324 (63%), Positives = 265/324 (81%), Gaps = 8/324 (2%)
Query: 49 MTIYQTWLAKHGKT---SNGM-GHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFAD 102
M+IY W +HGK+ SNG+ ++RF IFKDNLRFID HN N+ TYK+GL FA+
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQGS 161
LTN+EYR++YLG R++ RR+ K+K + +Y+ DE+P +VDWR+KGAVN +KDQG+
Sbjct: 61 LTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGT 120
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFST AAVEGINKIVTGEL+SLSEQELVDCD+ N GCNGGLMDYAFQFI++NGG
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+++E+DYPY G KC+ +N++VV+IDGYEDV DE +LK+AV+ QPVSVAI+AGGR
Sbjct: 181 LNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
AFQHY+SG+FTG+CG+ +DH VVAVGYG+ENGVDYW+VRNSWG+ WGE+GY++++RN+
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA- 299
Query: 342 TNTGKCGIAMEASYPVKNSQNSAK 365
+ +GKCGIA+EASYPVK S N +
Sbjct: 300 SKSGKCGIAIEASYPVKYSPNPVR 323
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 210/337 (62%), Positives = 261/337 (77%), Gaps = 17/337 (5%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+ +E +Y W+A HG+T N +G E+R+Q+F+DNLR+ID
Sbjct: 23 DMSIVSYGE--------RSXEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA 74
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTN+EYRA YLG R+ R + + RY ++L
Sbjct: 75 HNAAADAGVHSFRLGLNRFADLTNDEYRATYLGART----RPQRERKLGARYHAADNEDL 130
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV VKDQGSCGSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 131 PESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYN 190
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV DE
Sbjct: 191 QGCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEK 250
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAVA+QPVSVAIEA G AFQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+N
Sbjct: 251 SLQKAVANQPVSVAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKN 310
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
SWGS WGE+GYV+++RN + ++GKCGIA+E SYP+K
Sbjct: 311 SWGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLK 346
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/341 (61%), Positives = 259/341 (75%), Gaps = 17/341 (4%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+++E +Y W A+HGK+ N +G E+R+ F+DNLR+IDE
Sbjct: 22 DMSIVSYGE--------RSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE 73
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTNEEYR YLG R+ +R + S RY + L
Sbjct: 74 HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV +KDQG CGSCWAFS +AAVE IN+IVTG+LISLSEQELVDCD N
Sbjct: 130 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYN 189
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P E
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 249
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAV +QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 250 SLQKAVRNQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 309
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG WGE+GYV+++RN + ++GKCGIA+E SYP+K +N
Sbjct: 310 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 349
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 208/336 (61%), Positives = 259/336 (77%), Gaps = 17/336 (5%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
MSI+SY RTD+E +Y W+A HG+T N +G E+R+Q+F+DNLR+ID H
Sbjct: 27 MSIVSYGE--------RTDEEARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAH 78
Query: 87 NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
N+ ++++GLN+FADLTN+EY A YLG R+ R + + RY ++LP
Sbjct: 79 NAAADAGVHSFRLGLNRFADLTNDEYPATYLGART----RPQRDRKLGARYHAADNEDLP 134
Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
ESVDWR KGAV VKDQGSCG+CWAFST+AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 135 ESVDWRAKGAVAEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQ 194
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV DE S
Sbjct: 195 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 254
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
L+KAVA+QPVSVAIEA G AFQ Y SG+FTG CG+ LDHGV AVGYGTENG DYW+V+NS
Sbjct: 255 LQKAVANQPVSVAIEAAGTAFQLYSSGIFTGSCGTRLDHGVTAVGYGTENGKDYWIVKNS 314
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
WGS WGE+GYV+++RN + ++GKCGIA+E SYP+K
Sbjct: 315 WGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLK 349
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/337 (62%), Positives = 260/337 (77%), Gaps = 17/337 (5%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+D+E +Y W+A HG+T N +G E+R+Q+F+DNLR+ID
Sbjct: 26 DMSIVSYGE--------RSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA 77
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTN+EYRA YLG R+ R + + RY ++L
Sbjct: 78 HNAAADAGVHSFRLGLNRFADLTNDEYRATYLGART----RPQRERKLGARYHAADNEDL 133
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV VKDQGS GSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 134 PESVDWRAKGAVAEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYN 193
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV DE
Sbjct: 194 QGCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEK 253
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAVA+QPVSVAIEA G FQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+N
Sbjct: 254 SLQKAVANQPVSVAIEAAGTQFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKN 313
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
SWGS WGE+GYV+++RN + ++GKCGIA+E SYP+K
Sbjct: 314 SWGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLK 349
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 436 bits (1122), Expect = e-120, Method: Compositional matrix adjust.
Identities = 210/360 (58%), Positives = 273/360 (75%), Gaps = 16/360 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MAT + ++ L+F + S S ++ + + R + E +Y+ WL ++
Sbjct: 1 MATPIKSITLALLIFSMLLISLSLGSVT---------AADTTRNEAEARRMYEQWLVENR 51
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K NG+G E RF+IF DNL++I+EHNS+ N+T++VGL +FADLTN+E+RA+YL RS
Sbjct: 52 KNYNGLGEKETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYL--RSKM 109
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+R + V +RY K GD LP+ +DWR KGAVNPVKDQG+CGSCWAFS + AVEGIN+
Sbjct: 110 ER--TRVPVKGERYLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQ 167
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA-ENKCD 238
I TGELISLSEQELVDCD N GC GGLMDYAF+FII+NGG+D+E+DYPY +N C+
Sbjct: 168 IKTGELISLSEQELVDCDTSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICN 227
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
++N++VV+IDGYEDV DE SLKKA+A+QP+SVAIEAGGRAFQ Y+SGVFTG CG++
Sbjct: 228 SDKKNSRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTS 287
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
LDHGVVAVGYG+E G DYW+VRNSWGS+WGE+GY KL+RN+ ++ +GKCG+AM ASYP K
Sbjct: 288 LDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKES-SGKCGVAMMASYPTK 346
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 436 bits (1121), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/341 (61%), Positives = 259/341 (75%), Gaps = 17/341 (4%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+++E +Y W A+HGK+ N +G E+R+ F+DNLR+IDE
Sbjct: 22 DMSIVSYGE--------RSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE 73
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTNEEYR YLG R+ +R + S RY + L
Sbjct: 74 HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV +KDQ GSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 130 PESVDWRTKGAVAEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 189
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P E
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 249
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 250 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 309
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG WGE+GYV+++RN + ++GKCGIA+E SYP+K +N
Sbjct: 310 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 349
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 207/313 (66%), Positives = 240/313 (76%), Gaps = 34/313 (10%)
Query: 49 MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
M +Y+ WLAKHGK+ N +G E+RFQIFKDNLRFIDEHN+ NRTYK+
Sbjct: 1 MAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKI------------- 47
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
S RYA + GD LPESVDWR+KGAV VKDQGSCGSCWAF
Sbjct: 48 ---------------------SDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAF 86
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
ST+AAVEGINKIVTG LISLSEQELVDCD N GCNGGLMDYAF+FII NGG+DSE+DY
Sbjct: 87 STIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDY 146
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY ++ +CD R+NAKVV+IDGYEDV DE SL+KAVA+QPVSVAIEAGGR FQ Y+S
Sbjct: 147 PYKASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQS 206
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
G+FTG CG+ALDHGV AVGYGTENGVDYW+V+NSWG+ WGE GY++++R+L + TGKCG
Sbjct: 207 GIFTGRCGTALDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCG 266
Query: 349 IAMEASYPVKNSQ 361
IAMEASYP+K Q
Sbjct: 267 IAMEASYPIKKGQ 279
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 211/353 (59%), Positives = 260/353 (73%), Gaps = 29/353 (8%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+++E +Y W A+HGK N +G E+R+ F+DNLR+IDE
Sbjct: 22 DMSIVSYGE--------RSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDE 73
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTNEEYR YLG R+ +R + S RY + L
Sbjct: 74 HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV +KDQG CGSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 130 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 189
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR------------RNAKVVSI 249
GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R +NAKVV+I
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTI 249
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
D YEDV+P E SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYG
Sbjct: 250 DSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG 309
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
TENG DYW+VRNSWG WGE+GYV+++RN + ++GKCGIA+E SYP+K +N
Sbjct: 310 TENGKDYWIVRNSWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 361
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 206/323 (63%), Positives = 250/323 (77%), Gaps = 3/323 (0%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKF 100
RT++EV +Y+ WL +GK N +G E+RF+IF DNLR+ID+HN N +Y +GL +F
Sbjct: 29 RTEEEVRLLYEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRF 88
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQ 159
ADLTNEEYR+ YLG + R ++ + R GD+LP+ VDWREKGAV P+KDQ
Sbjct: 89 ADLTNEEYRSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQ 148
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
G CGSCWAFSTVAAVEGIN+IVTG+LI LSEQELVDCD N GCNGGLMDYAFQFII N
Sbjct: 149 GGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAFQFIISN 208
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+D+E+DYPY + CDP+R+NAKVVSID YEDV DE +LK AVA QPVSVAIE G
Sbjct: 209 GGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGG 268
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
GR+FQ Y+SG+F G CG LDHGVVAVGYGTE+G DYW+VRNSWG WGE GY++++RNL
Sbjct: 269 GRSFQLYKSGIFDGRCGIDLDHGVVAVGYGTESGKDYWIVRNSWGKSWGEAGYIRMERNL 328
Query: 340 LDTNTGKCGIAMEASYPVKNSQN 362
+++GKCGIA+E SYP+K QN
Sbjct: 329 PSSSSGKCGIAIEPSYPIKKGQN 351
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 208/340 (61%), Positives = 260/340 (76%), Gaps = 7/340 (2%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
MSIISY+ H RT+ E T+Y+ WLA+HG+ N +G ++RF++F DNLRF+D H
Sbjct: 84 MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 143
Query: 87 N--SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPE 143
N + +++G+N+FADLTN+E+RA YLG R A RR + +RY G +ELPE
Sbjct: 144 NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPASRR--RGTAVGERYRHGGGAEELPE 201
Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INA 202
SVDWREKGAV PVK+QG CGSCWAFS V++VE +N+IVTGE+++LSEQELV+C N+
Sbjct: 202 SVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNS 261
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R NAKVVSIDG+EDV DE S
Sbjct: 262 GCNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKS 321
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
L+KAVA QPVSVAIEAGGR FQ Y++GVFTG C + LDHGVVAVGYGTENG DYW+VRNS
Sbjct: 322 LQKAVAHQPVSVAIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNS 381
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
WG+ WGE+GY++++RN ++ TGKCGIAM ASYP K N
Sbjct: 382 WGAKWGEDGYIRMERN-VNATTGKCGIAMMASYPTKKGAN 420
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 433 bits (1113), Expect = e-119, Method: Compositional matrix adjust.
Identities = 208/340 (61%), Positives = 260/340 (76%), Gaps = 7/340 (2%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
MSIISY+ H RT+ E T+Y+ WLA+HG+ N +G ++RF++F DNLRF+D H
Sbjct: 27 MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 86
Query: 87 N--SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPE 143
N + +++G+N+FADLTN+E+RA YLG R A RR + +RY G +ELPE
Sbjct: 87 NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPASRR--RGTAVGERYRHGGGAEELPE 144
Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INA 202
SVDWREKGAV PVK+QG CGSCWAFS V++VE +N+IVTGE+++LSEQELV+C N+
Sbjct: 145 SVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNS 204
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R NAKVVSIDG+EDV DE S
Sbjct: 205 GCNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKS 264
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
L+KAVA QPVSVAIEAGGR FQ Y++GVFTG C + LDHGVVAVGYGTENG DYW+VRNS
Sbjct: 265 LQKAVAHQPVSVAIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNS 324
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
WG+ WGE+GY++++RN ++ TGKCGIAM ASYP K N
Sbjct: 325 WGAKWGEDGYIRMERN-VNATTGKCGIAMMASYPTKKGAN 363
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 207/347 (59%), Positives = 263/347 (75%), Gaps = 7/347 (2%)
Query: 20 SSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDN 79
+++ MSIISY+ H RT+ E T+Y+ WLA+HG+ N +G ++RF++F DN
Sbjct: 17 AAAPGGRMSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDN 76
Query: 80 LRFIDEHN--SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA 137
LRF+D HN + +++G+N+FADLTN+E+RA YLG R A RR + +RY
Sbjct: 77 LRFVDAHNERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPAARR--RGTAVGERYRHGG 134
Query: 138 G-DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
G +ELPESVDWREKGAV PVK+QG CGSCWAFS V++VE +N+IVTGE+++LSEQELV+C
Sbjct: 135 GAEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVEC 194
Query: 197 DRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
N+GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R NAKVVSIDG+EDV
Sbjct: 195 STDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDV 254
Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
DE SL+KAVA QPVSVAIEAGGR FQ Y++GVF+G C + LDHGVVAVGYGTENG D
Sbjct: 255 PENDEKSLQKAVAHQPVSVAIEAGGREFQLYKAGVFSGTCTTNLDHGVVAVGYGTENGKD 314
Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
YW+VRNSWG+ WGE+GY++++RN ++ TGKCGIAM ASYP K N
Sbjct: 315 YWIVRNSWGAKWGEDGYIRMERN-VNATTGKCGIAMMASYPTKKGAN 360
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 214/353 (60%), Positives = 261/353 (73%), Gaps = 12/353 (3%)
Query: 18 FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK-TSNGMGHNEKRFQIF 76
F + ++ DMSIISY+ H RT+ E IY W A+HG SN +G E+RF+ F
Sbjct: 18 FGACAAGPDMSIISYNAEHGARGLERTEAEARAIYGLWRAEHGSGNSNSLGEEERRFRAF 77
Query: 77 KDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
DNLRF+D HN+ +++G+N+FADLTN+E+RA YLG + +RR ++ V +R
Sbjct: 78 WDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFRAAYLGVKGAGQRRSARAGVG-ER 136
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
Y +ELPE+VDWREKGAV PVK+QG CGSCWAFS V+AVE IN++VTGEL++LSEQE
Sbjct: 137 YRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSAVESINQLVTGELVTLSEQE 196
Query: 193 LVDCDRKINA---GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
LV+CD IN GCNGGLMD AF FII NGG+D+E DYPY + KCD +RRNAKVVSI
Sbjct: 197 LVECD--INGQSNGCNGGLMDDAFDFIINNGGIDTEDDYPYKALDGKCDINRRNAKVVSI 254
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
DG+EDV DE SL+KAVA QPVSVAIEAGGR FQ Y SGVFTG CG+ LDHGVVAVGYG
Sbjct: 255 DGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFTGRCGTELDHGVVAVGYG 314
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
TENG DYW+VRNSWG WGE GY++++RN ++ TGKCGIAM +SYP K N
Sbjct: 315 TENGKDYWIVRNSWGPKWGEAGYLRMERN-INATTGKCGIAMMSSYPTKKGAN 366
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 429 bits (1104), Expect = e-118, Method: Compositional matrix adjust.
Identities = 208/360 (57%), Positives = 270/360 (75%), Gaps = 19/360 (5%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M + +L+ I+ S + DMS S R++ EVMT+Y+ WL KH K G
Sbjct: 1 MASILYSLILFGLITLSLSLDMS------------SGRSNKEVMTMYEKWLVKHQKVYYG 48
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
+G +RFQIFKDNL FIDEHN+ N +Y+VGLN+F+D+TN+EYR YL S+ +K
Sbjct: 49 LGEKNQRFQIFKDNLIFIDEHNAPNHSYRVGLNEFSDITNKEYRDTYLSRWSNNN---IK 105
Query: 126 SKVASQRYACKAG--DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
+K+ S RYA KAG ++LP SVDWR GA+ P+K+QGSCG+CWAFS VAAVE INKIVTG
Sbjct: 106 NKITSVRYAYKAGHNNKLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTG 163
Query: 184 ELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN 243
L+SLSEQELVDCDR N GCNGG A++FI++NGG+DS+ DYPYLG ++ C+ +++N
Sbjct: 164 SLVSLSEQELVDCDRTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKN 223
Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
KVVSI+GY++V E +L +AVA+QPVSV IEA G+ FQ Y+SGVFTG CG++LDH V
Sbjct: 224 TKVVSINGYKNVQRNSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAV 283
Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
V VGYG+ENG DYWLV+NSWG++WGE GY+K++RNL +TNTGKCGIAM+A+YP K +NS
Sbjct: 284 VVVGYGSENGKDYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPTKLRENS 343
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 204/316 (64%), Positives = 256/316 (81%), Gaps = 8/316 (2%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
E + +++ WL ++ K NG+G +KRF+IF DNL+F+ EHNS+ N++Y++GL +FADLTN
Sbjct: 32 EEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADLTN 91
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
EE+RA+YL RS +R + V S+RY GD+LP+ VDWR KGAV PVKDQGSCGSC
Sbjct: 92 EEFRAIYL--RSKMER--TRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSC 147
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS + AVEGIN+I TGEL+SLSEQELVDCD N GC GGLMDYAFQFII NGG+D+E
Sbjct: 148 WAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFIISNGGIDTE 207
Query: 226 QDYPYLGA-ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
+DYPY +N C+ ++N +VV+IDGYEDV P +E SLKKA+A+QP+SVAIEAGGR FQ
Sbjct: 208 EDYPYTATDDNICNTDKKNTRVVTIDGYEDV-PENENSLKKALANQPISVAIEAGGRGFQ 266
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y+SGVFTG CG+ALDHGVVAVGYGT G DYW++RNSWGS+WGE+GY+KLQRN+ D+ +
Sbjct: 267 LYKSGVFTGTCGTALDHGVVAVGYGTSEGQDYWIIRNSWGSNWGESGYIKLQRNIKDS-S 325
Query: 345 GKCGIAMEASYPVKNS 360
GKCG+AM ASYP K+S
Sbjct: 326 GKCGVAMMASYPTKSS 341
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 207/340 (60%), Positives = 260/340 (76%), Gaps = 9/340 (2%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSIISY+ H RT+ E Y WLA++G++ N +G +E+RF++F DNLRF D
Sbjct: 28 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADA 87
Query: 86 HNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPE 143
HN+ + +++G+N+FADLTNEE+RA +LG + + +S+ A +RY +ELPE
Sbjct: 88 HNARADDHGFRLGMNRFADLTNEEFRATFLGAKV-----VERSRAAGERYRHDGVEELPE 142
Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INA 202
SVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQELV+C N+
Sbjct: 143 SVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNS 202
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R NAKVVSIDG+EDV DE S
Sbjct: 203 GCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKS 262
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
L+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+NG DYW+VRNS
Sbjct: 263 LQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNS 322
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
WG WGE+GYV+++RN ++ TGKCGIAM ASYP K+ N
Sbjct: 323 WGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGAN 361
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 204/341 (59%), Positives = 260/341 (76%), Gaps = 10/341 (2%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSIISY+ H RT+ E Y WLA++G++ N +G E+RF++F DNL+F+D
Sbjct: 23 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDA 82
Query: 86 HNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
HN+ + +++G+N+FADLTN+E+R+ +LG + + +S+ A +RY +ELP
Sbjct: 83 HNARADEHGGFRLGMNRFADLTNDEFRSTFLGAKV-----VERSRAAGERYRHDGVEELP 137
Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-IN 201
ESVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQELV+C N
Sbjct: 138 ESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQN 197
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
+GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R NAKVVSIDG+EDV DE
Sbjct: 198 SGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEK 257
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+NG DYW+VRN
Sbjct: 258 SLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRN 317
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG WGE+GYV+++RN ++ TGKCGIAM ASYP K+ N
Sbjct: 318 SWGPKWGESGYVRMERN-INATTGKCGIAMMASYPTKSGAN 357
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 206/340 (60%), Positives = 253/340 (74%), Gaps = 8/340 (2%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK-TSNGMGHNEKRFQIFKDNLRFIDE 85
MSIISY+ H RT+ EV +Y+ WL +HG+ SN +G ++ RF++F DNLRF+D
Sbjct: 31 MSIISYNEEHGARGLERTEAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDA 90
Query: 86 HNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPE 143
HN +++G+N+FADLTN+E+RA YLG R A R + Y +ELPE
Sbjct: 91 HNERAGEHGFRLGMNQFADLTNDEFRAAYLGARIPAAR---SGNAVGEMYRHDGAEELPE 147
Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NA 202
SVDWREKGAV PVK+QG CGSCWAFS V++VE IN+IVTGE+++LSEQELV+C N+
Sbjct: 148 SVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNS 207
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GCNGGLMD AF FII+NGG+D+E DYPY + KCD +RRNAKVVSID +EDV DE S
Sbjct: 208 GCNGGLMDAAFNFIIKNGGIDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKS 267
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
L+KAVA QPVSVAIEAGGR FQ Y+SGVF+G C + LDHGVVAVGYGTENG DYW+VRNS
Sbjct: 268 LQKAVAHQPVSVAIEAGGRQFQLYKSGVFSGSCTTNLDHGVVAVGYGTENGKDYWIVRNS 327
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
WG WGE GY++++RN ++ TGKCGIAM ASYP K N
Sbjct: 328 WGPKWGEAGYIRMERN-INATTGKCGIAMMASYPTKKGAN 366
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 207/344 (60%), Positives = 256/344 (74%), Gaps = 12/344 (3%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE----KRFQIFKDNLRF 82
MSII+Y+ H RT+ EV +Y WLA+HG+ N +G E +RF +F DNLRF
Sbjct: 32 MSIITYNEEHGARGLERTEPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRF 91
Query: 83 IDEHNSLN--RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK-AGD 139
+D HN R +++G+N+FADLTN+E+RA YLG A RR V +RY A +
Sbjct: 92 VDAHNERAGARGFRLGMNQFADLTNDEFRAAYLGAMVPAARR---GAVVGERYRHDGAAE 148
Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
ELPESVDWREKGAV PVK+QG CGSCWAFS V++VE +N+IVTGE+++LSEQELV+C
Sbjct: 149 ELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTD 208
Query: 200 I-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
N+GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R+NA+VVSIDG+EDV
Sbjct: 209 GGNSGCNGGLMDAAFDFIIKNGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPEN 268
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
DE SL+KAVA QPVSVAIEAGGR FQ Y+SGVF+G C + LDHGVVAVGYG ENG DYW+
Sbjct: 269 DEKSLQKAVAHQPVSVAIEAGGREFQLYKSGVFSGSCTTNLDHGVVAVGYGAENGKDYWI 328
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
VRNSWG WGE GY++++RN ++ +TGKCGIAM ASYP K N
Sbjct: 329 VRNSWGPKWGEAGYIRMERN-VNASTGKCGIAMMASYPTKKGAN 371
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 205/350 (58%), Positives = 257/350 (73%), Gaps = 15/350 (4%)
Query: 24 AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDN 79
+ADMSII+Y+ H RT+ E +Y WLA+HG S N + E+RF F DN
Sbjct: 24 SADMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDN 83
Query: 80 LRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
LRF+D HN+ +++ +N+FADLTN+E+RA YLG + A+R +V +RY
Sbjct: 84 LRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERN-RAGRVVGERYRH 142
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ELPE+VDWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+
Sbjct: 143 DGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVE 202
Query: 196 CDRKIN---AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
CD IN +GCNGGLMD AF+FII+NGG+D+E DYPY + +CD R+NAKVVSIDG+
Sbjct: 203 CD--INGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGF 260
Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
EDV DE SL+KAVA PVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTEN
Sbjct: 261 EDVPENDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTEN 320
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
G DYW+VRNSWG +WGE GY++++RN ++ +GKCGIAM +SYP K N
Sbjct: 321 GKDYWIVRNSWGPNWGEAGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 369
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 205/350 (58%), Positives = 257/350 (73%), Gaps = 15/350 (4%)
Query: 24 AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDN 79
+ADMSII+Y+ H RT+ E +Y WLA+HG S N + E+RF F DN
Sbjct: 24 SADMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDN 83
Query: 80 LRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
LRF+D HN+ +++ +N+FADLTN+E+RA YLG + A+R +V +RY
Sbjct: 84 LRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERN-RAGRVVGERYRH 142
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ELPE+VDWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+
Sbjct: 143 DGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVE 202
Query: 196 CDRKIN---AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
CD IN +GCNGGLMD AF+FII+NGG+D+E DYPY + +CD R+NAKVVSIDG+
Sbjct: 203 CD--INGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGF 260
Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
EDV DE SL+KAVA PVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTEN
Sbjct: 261 EDVPENDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTEN 320
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
G DYW+VRNSWG +WGE GY++++RN ++ +GKCGIAM +SYP K N
Sbjct: 321 GKDYWIVRNSWGPNWGEAGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 369
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 214/353 (60%), Positives = 259/353 (73%), Gaps = 22/353 (6%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG-H 68
I L+F FI+ S+A+ SII RTDDEVM +Y W AKHGK N +G
Sbjct: 9 IMALLFFLFIALSAASPSSIIPQ----------RTDDEVMALYDQWRAKHGKLHNNLGAE 58
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
E RF IFKDNL+FIDE N+ N Y++GLN FADLTNEEYR+ YLG + + R ++
Sbjct: 59 PENRFHIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRT-- 116
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
S RY + GD+LP+S+DWR KGAV PVKDQGSCGSCWAFSTVA+VE IN+IVTG+LI+L
Sbjct: 117 -SNRYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIAL 175
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQELVDCDR N GCNGGLMDYAF+FII+NGG+D+E+DYPY G ++ C ++NA
Sbjct: 176 SEQELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNA---- 231
Query: 249 IDGYEDVSPFDEMSLKKA---VADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
IDGYEDV +E +L+KA VSVAIE GGR+FQ Y+SG+FTG CG+ LDHGV
Sbjct: 232 IDGYEDVPVNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNV 291
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
VGYG+E GVDYW+VRNSWG WGE+GYVK+QRN+ + TG CGIAME SYP K
Sbjct: 292 VGYGSEGGVDYWIVRNSWGGSWGESGYVKMQRNIA-SPTGLCGIAMEPSYPTK 343
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 416 bits (1070), Expect = e-114, Method: Compositional matrix adjust.
Identities = 201/312 (64%), Positives = 247/312 (79%), Gaps = 11/312 (3%)
Query: 53 QTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-----YKVGLNKFADLTNEE 107
Q+WL KH K N +G EKRF IF+DNL FID+HN+ N +++GLNKFADLTN+E
Sbjct: 6 QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R +Y G KR V S RYA K GDELPESVDWR+KGAV+ VKDQG CGSCWA
Sbjct: 66 FRRIYFGV----KRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWA 121
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
FS + AVEGINKIVTG+LI+LSEQELVDCD N+GC+GGLMDYAF+FII NGG+D+++D
Sbjct: 122 FSAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKD 181
Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
YPY + CD +R+NAKVV+IDG EDV +E +L+KAVA QPV +AIEAGGR FQ Y+
Sbjct: 182 YPYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYK 241
Query: 288 SGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
SGVFTG CG++LDHGVVAVGYG T++G DYW+VRNSWG DWGE+GY++++RN ++ +GK
Sbjct: 242 SGVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERN-TESKSGK 300
Query: 347 CGIAMEASYPVK 358
CGIA+E SYPVK
Sbjct: 301 CGIAIEPSYPVK 312
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 416 bits (1070), Expect = e-114, Method: Compositional matrix adjust.
Identities = 205/350 (58%), Positives = 256/350 (73%), Gaps = 15/350 (4%)
Query: 24 AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDN 79
+ADMSII+Y+ H RT+ E +Y WLA+HG S N + E+RF F DN
Sbjct: 24 SADMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDN 83
Query: 80 LRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
LRF+D HN+ +++ +N+FADLTN+E+RA YLG + A+R +V RY
Sbjct: 84 LRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERN-RAGRVVGDRYRH 142
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ELPE+VDWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+
Sbjct: 143 DGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVE 202
Query: 196 CDRKIN---AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
CD IN +GCNGGLMD AF+FII+NGG+D+E DYPY + +CD R+NAKVVSIDG+
Sbjct: 203 CD--INGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGF 260
Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
EDV DE SL+KAVA PVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTEN
Sbjct: 261 EDVPENDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTEN 320
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
G DYW+VRNSWG +WGE GY++++RN ++ +GKCGIAM +SYP K N
Sbjct: 321 GKDYWIVRNSWGPNWGEAGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 369
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 201/346 (58%), Positives = 255/346 (73%), Gaps = 13/346 (3%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDNLR 81
DMSII+Y+ H RT+ E +Y WLA+HG S N + E+RF+ F DNLR
Sbjct: 24 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSYPNANSIPERERRFRAFWDNLR 83
Query: 82 FIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA 137
F+D HN+ +++ +N+FADLTN+E+RA YLG + +R +V +RY
Sbjct: 84 FVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKG---QRARPGRVVGERYRHDG 140
Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
+ELPE+VDWREKGAV PVK+QG CGSCWAFS ++ VE IN+IVTGE+++LSEQELV+CD
Sbjct: 141 AEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECD 200
Query: 198 RK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
++GCNGGLMD AF+FII+NGG+D+E DYPY + +CD R+NAKVVSIDG+EDV
Sbjct: 201 TNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVP 260
Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDY 316
DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTENG DY
Sbjct: 261 ENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDY 320
Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
W+VRNSWG +WGE GY++++RN ++ +GKCGIAM +SYP K N
Sbjct: 321 WIVRNSWGPNWGEAGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 365
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 201/346 (58%), Positives = 257/346 (74%), Gaps = 13/346 (3%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDNLR 81
DMSII+Y+ H RT+ E +Y WLA++G S N + E+RF+ F DNL
Sbjct: 27 DMSIIAYNAEHGARGLERTEAEARAVYDLWLAENGGGSSPNANSIPERERRFRAFWDNLN 86
Query: 82 FIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA 137
F+D HN+ Y++G+N+FADLTN+E+RA YLG ++ +R ++ +RY
Sbjct: 87 FVDAHNARAAAGEEGYRLGMNRFADLTNDEFRAAYLGVKA---QRARPGRMVGERYRHDG 143
Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
+ELPE+VDWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+CD
Sbjct: 144 AEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECD 203
Query: 198 RK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
++GCNGGLMD AF+FII+NGG+D+E DYPY + +CD R+NAKVVSIDG+EDV
Sbjct: 204 TNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVP 263
Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDY 316
DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTENG DY
Sbjct: 264 ENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDY 323
Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
W+VRNSWG +WGE+GY++++RN ++ +GKCGIAM +SYP K N
Sbjct: 324 WIVRNSWGPNWGESGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 368
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 207/351 (58%), Positives = 261/351 (74%), Gaps = 14/351 (3%)
Query: 20 SSSSAADMSIISYDNNHDHSS--SWRTDDEVMTIYQTWLAKHGKTS-NGMG-HNEKRFQI 75
++++A DMSIISY+ H T+ E Y WLA++G S N +G +E+RF +
Sbjct: 18 AATAAPDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLV 77
Query: 76 FKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
F DNL+F+D HN+ +++G+N+FADLTNEE+RA +LG + +S+ A +R
Sbjct: 78 FWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKV-----AERSRAAGER 132
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
Y +ELPESVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQE
Sbjct: 133 YRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQE 192
Query: 193 LVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
LV+C N+GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R NAKVVSIDG
Sbjct: 193 LVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDG 252
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
+EDV DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+
Sbjct: 253 FEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTD 312
Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
NG DYW+VRNSWG WGE+GYV+++RN ++ TGKCGIAM ASYP K+ N
Sbjct: 313 NGKDYWIVRNSWGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGAN 362
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 207/340 (60%), Positives = 259/340 (76%), Gaps = 9/340 (2%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSIISY+ H RT+ E Y WLA++G++ N +G +E+RF++F DNLRF D
Sbjct: 27 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADA 86
Query: 86 HNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPE 143
HN+ + +++G+N+FADLTNEE+RA +LG + + +S+ A +RY +ELPE
Sbjct: 87 HNARADDHGFRLGMNRFADLTNEEFRATFLGAKV-----VERSRAAGERYRHDGVEELPE 141
Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINA 202
SVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQELV+C N
Sbjct: 142 SVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNG 201
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R NAKVVSIDG+EDV DE S
Sbjct: 202 GCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKS 261
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
L+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+NG DYW+VRNS
Sbjct: 262 LQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNS 321
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
WG WGE+GYV+++RN ++ TGKCGIAM ASYP K+ N
Sbjct: 322 WGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGAN 360
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 205/373 (54%), Positives = 260/373 (69%), Gaps = 42/373 (11%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSIISY+ H RT+ E Y WLA++G++ N +G E+RF++F DNL+F+D
Sbjct: 23 DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDA 82
Query: 86 HNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
HN+ + +++G+N+FADLTN+E+RA +LG + + +S+ A +RY +ELP
Sbjct: 83 HNARADEHGGFRLGMNRFADLTNDEFRATFLGAKF-----VERSRAAGERYRHDGVEELP 137
Query: 143 ESVDWREKGAVNPVKDQGSC--------------------------------GSCWAFST 170
ESVDWREKGAV PVK+QG C GSCWAFS
Sbjct: 138 ESVDWREKGAVAPVKNQGQCVDRIIVWNSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSA 197
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
V+ VE IN++VTGE+I+LSEQELV+C N+GCNGGLMD AF FII+NGG+D+E DYP
Sbjct: 198 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 257
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + KCD +R NAKVVSIDG+EDV DE SL+KAVA QPVSVAIEAGGR FQ Y SG
Sbjct: 258 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 317
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
VF+G CG++LDHGVVAVGYGT+NG DYW+VRNSWG WGE+GYV+++RN ++ TGKCGI
Sbjct: 318 VFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERN-INATTGKCGI 376
Query: 350 AMEASYPVKNSQN 362
AM ASYP K+ N
Sbjct: 377 AMMASYPTKSGAN 389
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 208/347 (59%), Positives = 259/347 (74%), Gaps = 14/347 (4%)
Query: 24 AADMSIISYDNNHDHSS--SWRTDDEVMTIYQTWLAKHGKTS-NGMG-HNEKRFQIFKDN 79
A+DMSIISY+ H T+ E Y WLA++G S N +G +E+RF +F DN
Sbjct: 21 ASDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDN 80
Query: 80 LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
L+F+D HN+ +++G+N+FADLTNEE+RA +LG + A+R S+ A +RY
Sbjct: 81 LKFVDAHNARADEGGGFRLGMNRFADLTNEEFRATFLGAKV-AER----SRAAGERYRHD 135
Query: 137 AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
+ELPESVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQELV+C
Sbjct: 136 GVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVEC 195
Query: 197 DRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
N+GCNGGLM AF FII+NGG+D+E DYPY + KCD +R NAKVVSIDG+EDV
Sbjct: 196 STNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDV 255
Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+NG D
Sbjct: 256 PQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKD 315
Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
YW+VRNSWG WGE+GYV+++RN ++ TGKCGIAM ASYP K+ N
Sbjct: 316 YWIVRNSWGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGAN 361
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 199/359 (55%), Positives = 257/359 (71%), Gaps = 31/359 (8%)
Query: 11 STLVFLFF--ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
+T+ LFF ++ SSA D+SIISYD +H S WR+D+EVM+IY+ LAKHGK N +
Sbjct: 9 ATIFILFFTVLAVSSALDLSIISYDRSHADKSGWRSDEEVMSIYEEXLAKHGKVYNAIDE 68
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
E+RFQI K+NL+F+++HN+ NRTYKVGLN+FAD + R+M
Sbjct: 69 MEERFQISKENLKFVEQHNAGNRTYKVGLNRFAD-----------------RSRMMTR-- 109
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
S RYA + D L ESVDWR++GAV VK Q C SC F+ +AAVEGINKIVTG L +L
Sbjct: 110 PSSRYAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRTFTVIAAVEGINKIVTGNLTAL 169
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
S DCDR +NAGC+GGL DYA +FII NGG+D+E+DYP+ GA CD + NA
Sbjct: 170 S-----DCDRTVNAGCSGGLADYALEFIINNGGIDTEEDYPFQGAVGICDQYKINA---- 220
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVA-IEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
+DGYE V +DE++LKKAVA+QPVSVA IEA G+ FQ YESG+FTG+CG+++DHGV AVG
Sbjct: 221 VDGYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESGIFTGKCGTSIDHGVTAVG 280
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
YGTENG+DYW+V+NSWG +WGE GYV+++RN + GKCGIA+ YP+K+ QN + P
Sbjct: 281 YGTENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCGIAILTLYPIKSGQNPSNP 339
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 196/326 (60%), Positives = 247/326 (75%), Gaps = 10/326 (3%)
Query: 43 RTDDEVMTIYQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNK 99
RT+ +V +Y+ W+A+HGK SN +G +++RF+ F DNLRF+D HN+ R Y++G+N+
Sbjct: 43 RTEAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINR 102
Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
FADLTN E+RA YL S R + +RY + LPE VDWR+KGAV PVK+Q
Sbjct: 103 FADLTNAEFRAAYL---SAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQ 159
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS V AVEGIN+IVTGEL++LSEQELVDC + N GC+GG+MD AF FI+
Sbjct: 160 GQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVG 219
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG+D+++DYPY + KCD ++R+ VVSIDG+E V DE SL+KAVA QPV+VAIEA
Sbjct: 220 NGGIDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEA 279
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQ 336
GGR FQ Y+SGVFTG CG++LDHGVVAVGYGTE G DYWLVRNSWG+DWGE GY++++
Sbjct: 280 GGREFQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRME 339
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQN 362
RN + GKCGIAMEASYPVK+ N
Sbjct: 340 RN-VGARAGKCGIAMEASYPVKSGAN 364
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/372 (54%), Positives = 267/372 (71%), Gaps = 22/372 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M + F+++S L F F+ S A D I S RT+DEVM +Y++WL K+G
Sbjct: 1 MGSPKSFISMSLLFFSTFLIFSFAIDAKI----------SPLRTNDEVMALYESWLVKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E R +IFK+NLRFIDEHN+ NR+Y VGLN+FADLT+EEYR+ YLG +S
Sbjct: 51 KSYNSLGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSS- 109
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+KSKV S RY + G+ LP+ VDWR GAV VK+QG C SCWAF+T+A VE IN+
Sbjct: 110 ----LKSKV-SNRYMPQVGEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQ 164
Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
I+TG+LISLSEQELVDC+R IN GC GG MD A++FII NGG+++E++YPY+G +++CD
Sbjct: 165 IITGDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCD 224
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT-GECGS 297
++N V+ID YE V P DE+++K+AVA QPVSVAI+A F+ Y+SG+FT G CG+
Sbjct: 225 EPKKNQNYVTIDSYEQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGT 284
Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
L+H V +GYGTENG+DYW+V+NS+G+ WGE+GY K+QRN+ G+CGIA YPV
Sbjct: 285 TLNHAVTIIGYGTENGIDYWIVKNSYGTQWGESGYGKVQRNV--GGEGRCGIASYPFYPV 342
Query: 358 KN-SQNSAKPKP 368
KN + AKP P
Sbjct: 343 KNYTSKPAKPHP 354
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/322 (62%), Positives = 251/322 (77%), Gaps = 9/322 (2%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFA 101
R + EV +Y+ WL ++ K NG+G E+RF+IFKDNL+F+DEHNS+ +RT++VGL +FA
Sbjct: 35 RNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFA 94
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DLTNEE+RA+YL R +R K V ++RY K GD LP+ VDWR GAV VKDQG+
Sbjct: 95 DLTNEEFRAIYL--RKKMER--TKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNG 220
CGSCWAFS V AVEGIN+I TGELISLSEQELVDCDR +NAGC+GG+M+YAF+FI++NG
Sbjct: 151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210
Query: 221 GMDSEQDYPYLGAE-NKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
G++++QDYPY + C+ + N +VV+IDGYEDV DE SLKKAVA QPVSVAIEA
Sbjct: 211 GIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEA 270
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+AFQ Y+SGV TG CG +LDHGVV VGYG+ +G DYW++RNSWG +WG++GYVKLQRN
Sbjct: 271 SSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRN 330
Query: 339 LLDTNTGKCGIAMEASYPVKNS 360
+D GKCGIAM SYP K+S
Sbjct: 331 -IDDPFGKCGIAMMPSYPTKSS 351
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 206/366 (56%), Positives = 256/366 (69%), Gaps = 17/366 (4%)
Query: 5 SMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
S IS L+FL S S A D+S I Y +D SS+WRTD+EV IY+ WLAKH K +
Sbjct: 2 STLFIISILLFL--ASFSYAMDISTIEY--KYDKSSAWRTDEEVKEIYELWLAKHDKVYS 57
Query: 65 GMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
G+ EKRF+IFKDNL+FIDEHNS N TYK+GL + DLTNEE++A+YLGTRSD RL
Sbjct: 58 GLVEYEKRFEIFKDNLKFIDEHNSENHTYKMGLTPYTDLTNEEFQAIYLGTRSDTIHRLK 117
Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
++ S+RYA +AGD LPE +DWR+KGAV PVK+QG CGSCWAFSTV+ VE IN+I TG
Sbjct: 118 RTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGN 177
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
LISLSEQ+LVDC++K N GC GG YA+Q+II NGG+D+E +YPY + C R
Sbjct: 178 LISLSEQQLVDCNKK-NHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPC---RAAK 233
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
KVV IDGY+ V +E +LKKAVA QP VAI+A + FQHY+SG+F+G CG+ L+HGVV
Sbjct: 234 KVVRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVV 293
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS--QN 362
VGY DYW+VRNSWG WGE GY++++R G CGIA YP K + +N
Sbjct: 294 IVGYWK----DYWIVRNSWGRYWGEQGYIRMKRV---GGCGLCGIARLPYYPTKAAGDEN 346
Query: 363 SAKPKP 368
S P
Sbjct: 347 SKLETP 352
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/322 (62%), Positives = 251/322 (77%), Gaps = 9/322 (2%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFA 101
R + EV +Y+ WL ++ K NG+G E+RF+IFKDNL+F+DEHNS+ +RT++VGL +FA
Sbjct: 35 RNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFA 94
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DLTNEE+RA+YL R +R K V ++RY K GD LP+ VDWR GAV VKDQG+
Sbjct: 95 DLTNEEFRAIYL--RKKMERN--KDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNG 220
CGSCWAFS V AVEGIN+I TGELISLSEQELVDCDR +NAGC+GG+M+YAF+FI++NG
Sbjct: 151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210
Query: 221 GMDSEQDYPYLGAE-NKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
G++++QDYPY + C+ + N +VV+IDGYEDV DE SLKKAVA QPVSVAIEA
Sbjct: 211 GIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEA 270
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+AFQ Y+SGV TG CG +LDHGVV VGYG+ +G DYW++RNSWG +WG++GYVKLQRN
Sbjct: 271 SSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRN 330
Query: 339 LLDTNTGKCGIAMEASYPVKNS 360
+D GKCGIAM SYP K+S
Sbjct: 331 -IDDPFGKCGIAMMPSYPTKSS 351
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 199/371 (53%), Positives = 260/371 (70%), Gaps = 20/371 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M + ++ S L F + SSA D+ +S RT+D+VM +Y++WL +HG
Sbjct: 1 MGSPKSIISKSLLFFSTLLILSSAIDI----------ENSVQRTNDQVMAMYESWLVEHG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N + E RF+IFK+NLR ID+HN+ NR+Y +GLN+FADLT+EEYR+ YLG +
Sbjct: 51 KSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKRGP 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K + S +Y K GD LP+ VDWR GAV VK+QG C SCWAFS VAAVEGINK
Sbjct: 111 KTDV------SNQYMPKVGDALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINK 164
Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQELVDC R +I GCN GLM AF+FII NGG+++E +YPY + +C+
Sbjct: 165 IVTGNLISLSEQELVDCGRTQITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCN 224
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
S +N K V+ID Y++V +EM+LKKAVA QPVSV +E+ G F+ Y SG+FTG CG+A
Sbjct: 225 LSLKNQKYVTIDSYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTA 284
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DHGV VGYGTE G+DYW+V+NSWG++WGE+GY+++QRN+ GKCGIA SYPVK
Sbjct: 285 VDHGVTIVGYGTERGMDYWIVKNSWGTNWGESGYIRIQRNI--GGAGKCGIAKMPSYPVK 342
Query: 359 NSQNSAKPKPH 369
+ N KP P+
Sbjct: 343 YTSNPLKPYPY 353
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 197/359 (54%), Positives = 263/359 (73%), Gaps = 14/359 (3%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT-SNGM 66
+ I L+ +F +S+ S+A M + + H+ R+++EV I+Q W++KHGKT +N +
Sbjct: 9 MTILFLLIVFVLSAPSSA-MDLPATSGGHN-----RSNEEVEFIFQMWMSKHGKTYTNAL 62
Query: 67 GHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
G E+RFQ FKDNLRFID+HN+ N +Y++GL +FADLT +EYR ++ G+ +R L S
Sbjct: 63 GEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLKTS 122
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+ RY AGD+LPESVDWR++GAV+ +KDQG+C SCWAFSTVAAVEG+NKIVTGELI
Sbjct: 123 R----RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELI 178
Query: 187 SLSEQELVDCDRKINAGCNG-GLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQELVDC+ +N GC G GLMD AFQF+I N G+DSE+DYPY G + C+ + +
Sbjct: 179 SLSEQELVDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLL 237
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
V++ID YEDV DE+SL+KAVA QPVSV ++ + F Y S ++ G CG+ LDH +V
Sbjct: 238 VITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVI 297
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
VGYG+ENG DYW+VRNSWG+ WG+ GY+K+ RN D G CGIAM ASYP+KNS ++A
Sbjct: 298 VGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPK-GLCGIAMLASYPIKNSASNA 355
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/360 (55%), Positives = 264/360 (73%), Gaps = 15/360 (4%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT-SNGM 66
+ I L+ +F +S+ S+A M + + H+ R+++EV I+Q W++KHGKT +N +
Sbjct: 9 MTILFLLIVFVLSAPSSA-MDLPATSGGHN-----RSNEEVEFIFQMWMSKHGKTYTNAL 62
Query: 67 GHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
G E+RFQ FKDNLRFID+HN+ N +Y++GL +FADLT +EYR ++ G+ +R L S
Sbjct: 63 GEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLKTS 122
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+ RY AGD+LPESVDWR++GAV+ +KDQG+C SCWAFSTVAAVEG+NKIVTGELI
Sbjct: 123 R----RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELI 178
Query: 187 SLSEQELVDCDRKINAGCNG-GLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA- 244
SLSEQELVDC+ +N GC G GLMD AFQF+I N G+DSE+DYPY G + C+ + +
Sbjct: 179 SLSEQELVDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSN 237
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
KV++ID YEDV DE+SL+KAVA QPVSV ++ + F Y S ++ G CG+ LDH +V
Sbjct: 238 KVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALV 297
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
VGYG+ENG DYW+VRNSWG+ WG+ GY+K+ RN D G CGIAM ASYP+KNS ++A
Sbjct: 298 IVGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPK-GLCGIAMLASYPIKNSASNA 356
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 202/375 (53%), Positives = 260/375 (69%), Gaps = 28/375 (7%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M F+++S L F + S A D RT+DEV +Y++WL KHG
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLALDAK--------------RTNDEVKAMYESWLIKHG 46
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLG-TRSD 118
K+ N +G E+RF+IFK+ LRFIDEHN+ +R+YKVGLN+FADLTNEE+R+ YLG TR
Sbjct: 47 KSYNSLGERERRFEIFKETLRFIDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRGS 106
Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
K ++ S RY + G LP+ VDWR +GAV +K+QG CGSCWAFS +AAVEGIN
Sbjct: 107 NKTKV------SNRYEPRVGQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGIN 160
Query: 179 KIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
KIVTG LISLSEQELVDC R + GC+GG M F+FII NGG+++E++YPY E +C
Sbjct: 161 KIVTGNLISLSEQELVDCGRTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQC 220
Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
D + +N K V+ID YE+V ++E +L+ AVA QPVSVA+E+ G AFQHY SG+FTG CG+
Sbjct: 221 DLNLQNEKYVTIDNYENVPYYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGT 280
Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
A DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA SYPV
Sbjct: 281 ATDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPV 338
Query: 358 K-NSQNSAKPKPHSS 371
K N+QN PKP+SS
Sbjct: 339 KYNNQN--HPKPYSS 351
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 198/322 (61%), Positives = 239/322 (74%), Gaps = 13/322 (4%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADL 103
D +Y+ W+ HG+ NG+G E+RFQIF+DN +I+EHN +N+TY +GLN FAD+
Sbjct: 27 DRSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADM 86
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
T++E++A+Y GT K L + + RY K LP DWR KGAV VK+QG+CG
Sbjct: 87 THDEFKALYFGT----KVPLSNTIKSGFRY--KDATNLPLDTDWRSKGAVATVKNQGACG 140
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
SCWAFSTVAAVEG+N+IVTGEL+SLSEQELVDCD++ N GCNGGLMD AF+FIIQNGG+D
Sbjct: 141 SCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLD 200
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
SE DYPY CD SRRN+ VV+IDG+EDV E L KAVA+QPVSVAIEA GR F
Sbjct: 201 SEADYPYKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNF 260
Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTE---NGV--DYWLVRNSWGSDWGENGYVKLQRN 338
Q Y GV+TG CG LDHGVVAVGYGT +GV DYW+VRNSWG WGE+GY++LQRN
Sbjct: 261 QLYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRN 320
Query: 339 LLDTNTGKCGIAMEASYPVKNS 360
+ + GKCGIAM ASYPVKNS
Sbjct: 321 VA-SPRGKCGIAMMASYPVKNS 341
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 197/322 (61%), Positives = 240/322 (74%), Gaps = 13/322 (4%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADL 103
D +Y+ W+ HG+ NG+G E+RFQIF+DN +I+EHN +N+TY +GLN FAD+
Sbjct: 27 DGSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADM 86
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
T++E++A+Y GT K L + + RY + LP DWR KGAV VK+QG+CG
Sbjct: 87 THDEFKALYFGT----KVPLSNTIKSGFRY--EDATNLPLDTDWRSKGAVATVKNQGACG 140
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
SCWAFSTVAAVEG+N+IVTGEL+SLSEQELVDCD++ N GCNGGLMD AF+FIIQNGG+D
Sbjct: 141 SCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLD 200
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
SE DYPY CD SRRN+ VV+IDG+EDV E L KAVA+QPVSVAIEA GR F
Sbjct: 201 SEADYPYKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNF 260
Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTE---NGV--DYWLVRNSWGSDWGENGYVKLQRN 338
Q Y GV+TG CG LDHGVVAVGYGT +GV DYW+VRNSWG WGE+GY++LQRN
Sbjct: 261 QLYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRN 320
Query: 339 LLDTNTGKCGIAMEASYPVKNS 360
+ ++ GKCGIAM ASYPVKNS
Sbjct: 321 VA-SSRGKCGIAMMASYPVKNS 341
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/346 (58%), Positives = 257/346 (74%), Gaps = 20/346 (5%)
Query: 27 MSIISYDNNHDHSS---SWRTDDEVMTIYQTWLAKH---GKTSNGM-GHNEKRFQIFKDN 79
MSII Y+ H RT+ E +Y W+A+H G + NG+ G E+RF++F DN
Sbjct: 37 MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGGSHNGLVGEYERRFRVFWDN 96
Query: 80 LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
L+F+D HN+ + +++G+N+FADLTN+E+RA YLGT + R + + Y
Sbjct: 97 LKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHV-----GEAYRHD 151
Query: 137 AGDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ LP+SVDWR+KGAV PVK+QG CGSCWAFS VAAVEGINKIVTGEL+SLSEQELV+
Sbjct: 152 GVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVE 211
Query: 196 CDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
C R N+GCNGG+MD AF FI +NGG+D+E+DYPY + KC+ ++++ KVVSIDG+ED
Sbjct: 212 CARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFED 271
Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--N 312
V DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG++LDHGVVAVGYGT+
Sbjct: 272 VPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAAT 331
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
G DYW VRNSWG DWGENGY++++RN+ TGKCGIAM ASYP+K
Sbjct: 332 GTDYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPIK 376
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/346 (58%), Positives = 257/346 (74%), Gaps = 20/346 (5%)
Query: 27 MSIISYDNNHDHSS---SWRTDDEVMTIYQTWLAKH---GKTSNGM-GHNEKRFQIFKDN 79
MSII Y+ H RT+ E +Y W+A+H G + NG+ G E+RF++F DN
Sbjct: 37 MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGDSHNGLVGEYERRFRVFWDN 96
Query: 80 LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
L+F+D HN+ + +++G+N+FADLTN+E+RA YLGT + R + + Y
Sbjct: 97 LKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHV-----GEAYRHD 151
Query: 137 AGDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ LP+SVDWR+KGAV PVK+QG CGSCWAFS VAAVEGINKIVTGEL+SLSEQELV+
Sbjct: 152 GVEVLPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVE 211
Query: 196 CDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
C R N+GCNGG+MD AF FI +NGG+D+E+DYPY + KC+ ++++ KVVSIDG+ED
Sbjct: 212 CARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFED 271
Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--N 312
V DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG++LDHGVVAVGYGT+
Sbjct: 272 VPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAAT 331
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
G DYW VRNSWG DWGENGY++++RN+ TGKCGIAM ASYP+K
Sbjct: 332 GTDYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPIK 376
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 202/346 (58%), Positives = 259/346 (74%), Gaps = 20/346 (5%)
Query: 27 MSIISYDNNHDHSS---SWRTDDEVMTIYQTWLAKH---GKTSNG-MGHNEKRFQIFKDN 79
MSII Y+ H RT+ E +Y W+A+H G + NG +G E+RF++F DN
Sbjct: 38 MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDN 97
Query: 80 LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
L+F+D HN+ + +++G+N+FADLTN+E+RA YLGT + R + + Y
Sbjct: 98 LKFVDAHNAHADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHV-----GEMYRHD 152
Query: 137 AGDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ LP+SVDWR+KGAV +PVK+QG CGSCWAFS VAAVEGINKIVTGEL+SLSEQELV+
Sbjct: 153 GVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVE 212
Query: 196 CDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
C R + N+GCNGG+MD AF FI +NGG+D+E+DYPY + KCD ++++ KVVSIDG+ED
Sbjct: 213 CARNRGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFED 272
Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--N 312
V DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG++LDHGVVAVGYGT+
Sbjct: 273 VPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAAT 332
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
G DYW VRNSWG DWGENGY++++RN+ TGKCGIAM ASYP+K
Sbjct: 333 GTDYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPIK 377
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 186/327 (56%), Positives = 249/327 (76%), Gaps = 9/327 (2%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFA 101
RT+DEV+ ++++WL ++GK+ N +G E+RF+IFKDNLRF+DEHN+ +NR+YKVGLN+F+
Sbjct: 39 RTNDEVIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFS 98
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DLT+ EY ++YLGT+ + ++ S RY + GD+LP+SVDWR+KGAV VK+QG+
Sbjct: 99 DLTDAEYSSIYLGTKFN-----IRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGN 153
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNG 220
CGSCW F+++AAVEGINKIVTG LISLSEQE+VDC RK N GCNGG + A+QFII NG
Sbjct: 154 CGSCWTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNG 213
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+++E +YPY G + CD +++N K V+ID YE+V +E +L+KAVA QPVSV I +
Sbjct: 214 GINTEANYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNS 273
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
AF+ Y+SG+F G CG +DHGV VGYGTE G DYW+VRNSWG +WGE+GYV++QRN+
Sbjct: 274 TAFKSYKSGIFNGPCGPRIDHGVTIVGYGTEGGKDYWIVRNSWGPNWGESGYVRMQRNV- 332
Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPK 367
+GKC IA YPVK N KP+
Sbjct: 333 -GGSGKCFIARAPVYPVKYGPNPTKPR 358
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 196/371 (52%), Positives = 256/371 (69%), Gaps = 20/371 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M + +++S L F + S A D+ +S RT+D+VM +Y++WL + G
Sbjct: 1 MGSPKSVISMSLLFFSTLLILSLALDI----------ENSVQRTNDQVMAMYESWLVEQG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N + E RF+IFK+NLR ID+HN+ NR+Y +GLN+FADLT+EEYR+ YLG +
Sbjct: 51 KSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGP 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K + S Y K G+ LP+ VDWR GAV VK+QG C SCWAFS V AVEGINK
Sbjct: 111 KTDV------SNEYMPKVGEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINK 164
Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQELVDC R + GCN GLM AFQFII NGG+++E +YPY + +C+
Sbjct: 165 IVTGNLISLSEQELVDCGRTQRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCN 224
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
S +N K V+ID Y++V +EM+LKKAVA QPVSV +E+ G F+ Y SG+FTG CG+A
Sbjct: 225 LSLKNQKYVTIDNYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTA 284
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DHGV VGYGTE G+DYW+V+NSWG++WGENGY+++QRN+ GKCGIA SYPVK
Sbjct: 285 VDHGVTIVGYGTERGMDYWIVKNSWGTNWGENGYIRIQRNI--GGAGKCGIARMPSYPVK 342
Query: 359 NSQNSAKPKPH 369
+ N KP P+
Sbjct: 343 YTTNPLKPYPY 353
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 195/313 (62%), Positives = 233/313 (74%), Gaps = 8/313 (2%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEE 107
++ + W KHGK + RF ++KDNL +I H+ NRTY +GL KFADLTNEE
Sbjct: 50 LLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYI-RHSETNRTYSLGLTKFADLTNEE 108
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R MY GTR D RR + RYA E PESVDWR+ GAV VKDQGSCGSCWA
Sbjct: 109 FRRMYTGTRIDRSRRAKRR--TGFRYA---DSEAPESVDWRKNGAVTSVKDQGSCGSCWA 163
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
FS V +VEGIN I GE +SLSEQELVDCD + N GCNGGLMDYAF FIIQNGG+D+E+D
Sbjct: 164 FSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGIDTEKD 223
Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
YPY G + +CD S++NA VV+IDGYEDV DE +LKKAVA QPVSVAIEAGGR FQ Y
Sbjct: 224 YPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYA 283
Query: 288 SGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK- 346
GVF+GECG+ LDHGV+AVGYGTE+GVDYW+V+NSWG WGE+GY++++RN+ D+N G
Sbjct: 284 QGVFSGECGTDLDHGVLAVGYGTEDGVDYWIVKNSWGEYWGESGYLRMKRNMKDSNDGPG 343
Query: 347 -CGIAMEASYPVK 358
CGI +E SY VK
Sbjct: 344 LCGINIEPSYAVK 356
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 194/310 (62%), Positives = 231/310 (74%), Gaps = 6/310 (1%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+ W KHGK + RF ++KDNL +I H+ N +Y +GL KFADLTNEE+R
Sbjct: 45 FAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEKNLSYWLGLTKFADLTNEEFRRQ 104
Query: 112 YLGTRSDAKRRLMKSKVA--SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
Y GTR D RRL K + A S RYA E P+S+DWREKGAV VKDQGSCGSCWAFS
Sbjct: 105 YTGTRIDRSRRLKKGRNATGSFRYA---NSEAPKSIDWREKGAVTSVKDQGSCGSCWAFS 161
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
V +VEGIN I TG+ ISLS QELVDCD+K N GCNGGLMDYAF F+IQNGG+D+E+DYP
Sbjct: 162 AVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQNGGIDTEKDYP 221
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + +CD ++ NA+VV+ID YEDV DE +LKKAVA QPVSVAIEAGGR FQ Y G
Sbjct: 222 YQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGG 281
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT-GKCG 348
VFTG CG+ LDHGV+AVGYG+E G+DYW+V+NSWG WGE+GY+++QRNL D N G CG
Sbjct: 282 VFTGRCGTDLDHGVLAVGYGSEKGLDYWIVKNSWGEYWGESGYLRMQRNLKDDNGYGLCG 341
Query: 349 IAMEASYPVK 358
I +E SY VK
Sbjct: 342 INIEPSYAVK 351
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 191/361 (52%), Positives = 248/361 (68%), Gaps = 15/361 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMS--IISYDNNHDHSSSWRTDDEVMTIYQTWLAK 58
M L +S ++ + I + A + I+ Y+ N HS DD ++ ++ WL
Sbjct: 1 MGWGRRALGLSLVLLVIAIGQQADAGRANAIVDYEGNQLHS-----DDAILDVFHQWLET 55
Query: 59 HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
H + + RFQIFK+N +I HN ++Y +GLNKF+DLT++E+RA YLGT+
Sbjct: 56 HSRVYRSLSEKHHRFQIFKENFLYIHAHNKQQKSYWLGLNKFSDLTHQEFRAQYLGTKPV 115
Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
++R K A+ Y E VDWR KGAV VKDQG+CGSCWAFS V +VEG+N
Sbjct: 116 NRQR----KEANFMYE---DVEAEPKVDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVN 168
Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
I TGEL+SLSEQELVDCDRK N GCNGGLMDYAF+FII+NGG+D+E+DYPY + +CD
Sbjct: 169 AIKTGELVSLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGRCD 228
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
RRN+KVV ID Y+DV E +L KA+ PVSVAIEAGGR FQHY+ GVFTG CGS
Sbjct: 229 EGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRDFQHYQGGVFTGPCGSE 288
Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV+AVGYGT ++GV+YW+V+NSWG WGE GY++++R D+ GKCGI +EAS+P+
Sbjct: 289 LDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSDSTDGKCGINIEASFPI 348
Query: 358 K 358
K
Sbjct: 349 K 349
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 194/371 (52%), Positives = 256/371 (69%), Gaps = 20/371 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M + +++S L F + SSA D+ +S+ RT+D+V +Y++WL + G
Sbjct: 1 MGSPKSVISMSLLFFSTLLILSSALDIV----------NSAQRTNDQVRDMYESWLVEQG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N + E RF+IFKDNLR ID+HN+ NR++ +GLN+FADLT+EEYR+ YLG +S
Sbjct: 51 KSYNSLDEKEMRFEIFKDNLRIIDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFKSGP 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K ++ S RY K GD LP VDWR GAV VK+QG C SCWAFS VAAVEGINK
Sbjct: 111 KAKV------SNRYVPKVGDVLPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINK 164
Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
I+TG L+SLSEQELVDC R + GCN G M AFQFII NGG+++E +YPY + +C+
Sbjct: 165 IMTGNLLSLSEQELVDCGRTQSTRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCN 224
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K V+ID YE+V +E +L+ AVA QPVSV +E+ G F+ Y SG+FT CG+A
Sbjct: 225 RYLQNQKYVTIDDYENVPSNNEWALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTA 284
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DHGV VGYGTE G+DYW+V+NSWG++WGENGY+++QRN+ GKCGIA ASYPVK
Sbjct: 285 IDHGVTIVGYGTERGLDYWIVKNSWGTNWGENGYIRIQRNI--GGAGKCGIARMASYPVK 342
Query: 359 NSQNSAKPKPH 369
+ N KP P+
Sbjct: 343 YNSNPLKPYPY 353
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 194/346 (56%), Positives = 253/346 (73%), Gaps = 16/346 (4%)
Query: 22 SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT-SNGMGHNEKRFQIFKDNL 80
SSA D+ S +N R+++EV I+Q W++KHGKT +N +G E+RFQ FKDNL
Sbjct: 25 SSAIDLPATSGGHN-------RSNEEVGFIFQMWMSKHGKTYTNALGEKERRFQNFKDNL 77
Query: 81 RFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
RFID+HN+ N +Y++GL +FADLT +EYR ++ G+ +R L S+ RY GD+
Sbjct: 78 RFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLRISR----RYVPLDGDQ 133
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LPESVDWR +GAV+ +KDQG+C SCWAFSTVAAVEGINKIVTGEL+SLSEQELVDC+ +
Sbjct: 134 LPESVDWRNEGAVSAIKDQGTCNSCWAFSTVAAVEGINKIVTGELVSLSEQELVDCNL-V 192
Query: 201 NAGCNG-GLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA-KVVSIDGYEDVSPF 258
N GC G G MD AFQF+I NGG+DS+ DYPY G++ C+ + K+++ID YEDV
Sbjct: 193 NNGCYGSGTMDAAFQFLINNGGLDSDTDYPYQGSQGYCNRKESTSNKIITIDSYEDVPAN 252
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
DE+SL+KAVA QPVSV ++ + F Y SG++ G CG+ LDH +V VGYG+ENG DYW+
Sbjct: 253 DEISLQKAVAHQPVSVGVDKKSQEFMLYRSGIYNGPCGTDLDHALVIVGYGSENGQDYWI 312
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
VRNSWG+ WG+ GY K+ RN + +G CGIAM ASYPVKNS ++A
Sbjct: 313 VRNSWGTTWGDAGYAKMARN-FEYPSGVCGIAMLASYPVKNSASNA 357
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 203/373 (54%), Positives = 261/373 (69%), Gaps = 26/373 (6%)
Query: 5 SMFLAISTLVFLFFISSSSAAD-------MSIISYDNNHDHSSSWRTDDEVMTIYQTWLA 57
S+ A++ FL +++ + MSII Y+ H RT+ E Y WLA
Sbjct: 8 SVAAALAMACFLLILAAFAPPAAAAPPDIMSIIRYNAEHGVRGLERTEAEARAAYDLWLA 67
Query: 58 KHGKTSNG------MGHNEKRFQIFKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEY 108
+H + G +G +E+RF++F DNL+F+D HN+ +++G+N+FADLTN E+
Sbjct: 68 RHRRGGGGGSRNGFIGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEF 127
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV-NPVKDQGSCGSCWA 167
RA YLGT + R + + Y + LP+SVDWR+KGAV PVK+QG CGSCWA
Sbjct: 128 RATYLGTTPAGRGRRV-----GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWA 182
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS VAAVEGINKIVTGEL+SLSEQELV+C R N+GCNGG+MD AF FI +NGG+D+E+
Sbjct: 183 FSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEE 242
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
DYPY + KC+ ++R+ KVVSIDG+EDV DE+SL+KAVA QPVSVAI+AGGR FQ Y
Sbjct: 243 DYPYTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLY 302
Query: 287 ESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
+SGVFTG CG+ LDHGVVAVGYGT+ G YW VRNSWG DWGENGY++++RN+ T
Sbjct: 303 DSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVT-ART 361
Query: 345 GKCGIAMEASYPV 357
GKCGIAM ASYP+
Sbjct: 362 GKCGIAMMASYPI 374
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 203/373 (54%), Positives = 261/373 (69%), Gaps = 26/373 (6%)
Query: 5 SMFLAISTLVFLFFISSSSAAD-------MSIISYDNNHDHSSSWRTDDEVMTIYQTWLA 57
S+ A++ FL +++ + MSII Y+ H RT+ E Y WLA
Sbjct: 8 SVAAALAMACFLLILAAFAPPAAAAPPDIMSIIRYNAEHGVRGLERTEAEARAAYDLWLA 67
Query: 58 KHGKTSNG------MGHNEKRFQIFKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEY 108
+H + G +G +E+RF++F DNL+F+D HN+ +++G+N+FADLTN E+
Sbjct: 68 RHRRGGGGGSRNGFIGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEF 127
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV-NPVKDQGSCGSCWA 167
RA YLGT + R + + Y + LP+SVDWR+KGAV PVK+QG CGSCWA
Sbjct: 128 RATYLGTTPAGRGRRV-----GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWA 182
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS VAAVEGINKIVTGEL+SLSEQELV+C R N+GCNGG+MD AF FI +NGG+D+E+
Sbjct: 183 FSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEE 242
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
DYPY + KC+ ++R+ KVVSIDG+EDV DE+SL+KAVA QPVSVAI+AGGR FQ Y
Sbjct: 243 DYPYTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLY 302
Query: 287 ESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
+SGVFTG CG+ LDHGVVAVGYGT+ G YW VRNSWG DWGENGY++++RN+ T
Sbjct: 303 DSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVT-ART 361
Query: 345 GKCGIAMEASYPV 357
GKCGIAM ASYP+
Sbjct: 362 GKCGIAMMASYPI 374
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 184/280 (65%), Positives = 224/280 (80%), Gaps = 5/280 (1%)
Query: 13 LVFLFFISS---SSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
L+ + ISS S A DMSIISYD H D S+S RT+ EV+T+Y+ WL KHGK+ NG+G
Sbjct: 12 LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK-SK 127
+KRF+IFKDNL+FIDEHN LN TY++GL +FADLTNEEYR+ +LGT+ D RR+ K
Sbjct: 72 KDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKLGG 131
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
S RYA + GD+LPESVDWR++GAV VKDQ SCGSCWAFS +AAVEGINKIVTG+LIS
Sbjct: 132 SKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLIS 191
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQELVDCD N GCNGGLMDYAF+FII NGG+DSE DYPY + +CD +R+NAKVV
Sbjct: 192 LSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVV 251
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
+ID YEDV +DE++L+KAVA+QP++VA+E GGR FQ YE
Sbjct: 252 TIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYE 291
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 197/374 (52%), Positives = 258/374 (68%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M + FL++S L F + S A + ++ RT+DE+ +Y++WL K+G
Sbjct: 1 MGSPKSFLSMSLLFFSTLLVLSLAFNAKNLTK----------RTNDELKAMYESWLTKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E+RF+IFK+ LRFIDEHN+ NR+Y+VGLN+FAD TNEE+++ YLG S +
Sbjct: 51 KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFTSGS 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+ MK S RY + G LP+ VDWR GAV +K QG CGSCWAFS +A VEGINK
Sbjct: 111 NK--MK---VSNRYEPRVGQVLPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINK 165
Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG+LISLSEQELVDC R N GC+GG + FQFII NGG+++E +YPY + +C+
Sbjct: 166 IVTGDLISLSEQELVDCGRTQNTRGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCN 225
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K SID YE+V +E +L+ AVA QPVSVA+EA G AFQHY SG+FTG CG+A
Sbjct: 226 LDLQNEKYASIDTYENVPYNNEWALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTA 285
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA + SYPVK
Sbjct: 286 VDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYIRILRNV--GGAGTCGIATKPSYPVK 343
Query: 359 -NSQNSAKPKPHSS 371
N+QN PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 195/374 (52%), Positives = 256/374 (68%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M + +++S L F + SSA D+ +S RT+D+VM +Y++WL + G
Sbjct: 3 MGSPKSVISMSLLFFSTLLILSSALDI----------KNSVQRTNDQVMAMYESWLVEQG 52
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N + E RF+IFK+NLR ID+HN+ NR+Y +GLN+FADLT+EEYR+ YLG +S
Sbjct: 53 KSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGP 112
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K ++ S RY K G LP VDWR GAV VKDQG C SCWAFS VAAVEGINK
Sbjct: 113 KAKV------SNRYVPKVGVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINK 166
Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQELVDC R + GCN G M+ AFQFII NGG+++E +YPY + +CD
Sbjct: 167 IVTGNLISLSEQELVDCGRTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCD 226
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
R+N + V+ID YE + +E L+ AVA QP++V +E+ G F+ Y SG++TG CG+A
Sbjct: 227 WYRKNQRYVTIDNYEQLPANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTA 286
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DHGV VGYGTE G+DYW+V+NSWG++WGENGY+++QRN+ GKCGIAM SYPVK
Sbjct: 287 IDHGVTIVGYGTERGLDYWIVKNSWGTNWGENGYIRIQRNI--GGAGKCGIAMVPSYPVK 344
Query: 359 NSQNSAKPKPHSSA 372
S + P H S+
Sbjct: 345 YSYQN--PNKHYSS 356
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 186/309 (60%), Positives = 231/309 (74%), Gaps = 7/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+ W KHGK + + + R+ ++KDNL +I H+ NR+Y +GL KFAD+TN+E+R
Sbjct: 46 FGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNRSYWLGLTKFADITNDEFRRQ 105
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
Y GTR D +R + RYA E PESVDWR+KGAV VKDQGSCGSCWAFS +
Sbjct: 106 YTGTRIDRSKR--SKRKTGFRYA---DSEAPESVDWRKKGAVTTVKDQGSCGSCWAFSAI 160
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
+VEGIN I TGE +SLSEQELVDCD + N GCNGGLMDYAF FI++NGG+D+E DYPY
Sbjct: 161 GSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGIDTENDYPYK 220
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
G + +CD +++NA VV+IDGYEDV DE +LKKAVA QPVSVAIEAGGR FQ Y GVF
Sbjct: 221 GLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGVF 280
Query: 292 TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT--GKCGI 349
TGECG+ LDHGV+AVGYG+E +DYW+V+NSWG WGE+GY+++QRN+ D+N G CGI
Sbjct: 281 TGECGTDLDHGVLAVGYGSEGSLDYWIVKNSWGEYWGESGYLRMQRNIKDSNHQFGLCGI 340
Query: 350 AMEASYPVK 358
+E SY VK
Sbjct: 341 NIEPSYAVK 349
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 193/329 (58%), Positives = 243/329 (73%), Gaps = 14/329 (4%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYK 94
S R+++E +Y W A+HG S E R++ F+DNLR+IDEHN+ +++
Sbjct: 30 SGQIRSEEETRRMYAEWTAQHG--SPITNEEEGRYEAFRDNLRYIDEHNAAADAGIHSFR 87
Query: 95 VGLNKFADLTNEEYRAMYLGTR--SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA 152
+GLN+FA LTNEEYRA YLG R S A L K S RY G+ LPESVDWREKGA
Sbjct: 88 LGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRK---PSARYEAADGEALPESVDWREKGA 144
Query: 153 VNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
V VKDQG SCGS WAFS +AAVE IN+IVTGELISLSEQEL+DCD NAGC+GGLMD
Sbjct: 145 VGKVKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCDGGLMDD 204
Query: 212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
AF+FII NGG+D+++DYPY + CD ++RN K V+ID YED+ +E SL+KAV++QP
Sbjct: 205 AFEFIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDLR-MNEKSLQKAVSNQP 263
Query: 272 VSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENG 331
VSVAIEAGGR FQ Y+SG+FTG CG+ LDH VGYG+ENG DYW+V+ S+G+ WGE+G
Sbjct: 264 VSVAIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYGSENGTDYWIVKESYGTSWGESG 323
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
Y +++RN+ +T +GKCGIAM SYPVKN+
Sbjct: 324 YARMERNIKET-SGKCGIAMLPSYPVKNT 351
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/341 (54%), Positives = 250/341 (73%), Gaps = 14/341 (4%)
Query: 24 AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
A D SII+Y + RT+DEVM ++++WL ++GK+ N +G E+RF+IFKDNLRF+
Sbjct: 24 AFDASIITYAKKWEQ----RTNDEVMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFV 79
Query: 84 DEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
DEHN+ +NR+YKVGLN+F+DLT EEY ++YLGT+ D M+ S RY + GD+LP
Sbjct: 80 DEHNADVNRSYKVGLNQFSDLTLEEYSSIYLGTKFD-----MRMTNVSDRYEPRVGDQLP 134
Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-IN 201
S+DWR+KGAV VK+QG+CGSCW F+ +AAVE IN+IVTG LISLSEQ++VDC RK N
Sbjct: 135 NSIDWRKKGAVLGVKNQGNCGSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPN 194
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GC GG A+QFII NGG+++E +YPY + +CD ++N K V+ID YE+V +E
Sbjct: 195 NGCKGGSRAGAYQFIIDNGGINTEANYPYKAQDGECD-EQKNQKYVTIDRYENVPRKNEK 253
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KAV++Q VSV I + F+ Y+SG+FTG CG+ +DH V VGYGTE G+DYW+VRN
Sbjct: 254 ALQKAVSNQLVSVGIASNSSEFKAYKSGIFTGPCGAKIDHAVTIVGYGTEGGMDYWIVRN 313
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWGS+WGENGYV++QRN+ N G C IA +YPVK N
Sbjct: 314 SWGSNWGENGYVRMQRNV--GNAGTCFIATSPNYPVKYGPN 352
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 197/374 (52%), Positives = 258/374 (68%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M F+++S L F + + I+S N + + RT+DEV +Y++WL K+G
Sbjct: 1 MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E+RF+IFK+ LRFIDEHN+ NR+YKVGLN+FADLT+EE+R+ YLG S +
Sbjct: 51 KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K+KV S RY + G LP VDWR GAV +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165
Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQEL+DC R N GCNGG + FQFII NGG+++E++YPY + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K V+ID YE+V +E +L+ AV QPVSVA++A G AF+HY SG+FTG CG+A
Sbjct: 226 LDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA 285
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343
Query: 359 -NSQNSAKPKPHSS 371
N+QN PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/356 (53%), Positives = 248/356 (69%), Gaps = 14/356 (3%)
Query: 3 TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
+ ++ LA S F F S + D SI+ Y S ++ D+++ ++++W++KHGK
Sbjct: 6 SKALVLACS---FCLFASLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSKHGKI 57
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
+ RF+IFKDNL+ IDE N + Y +GLN+FADL+++E++ YLG + D RR
Sbjct: 58 YQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRR 117
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
+ + + + K ELP+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 118 ----RESPEEFTYK-DVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVT 172
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
G L SLSEQEL+DCDR N GCNGGLMDYAF FI++NGG+ E+DYPY+ E C+ ++
Sbjct: 173 GNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKE 232
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
+VV+I GY DV +E SL KA+A+QP+SVAIEA GR FQ Y GVF G CGS LDHG
Sbjct: 233 ETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 292
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
V AVGYGT GVDY +V+NSWGS WGE GY++++RN + G CGI ASYP K
Sbjct: 293 VAAVGYGTAKGVDYIIVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 347
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 197/374 (52%), Positives = 258/374 (68%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M F+++S L F + + I+S N + + RT+DEV +Y++WL K+G
Sbjct: 1 MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E+RF+IFK+ LRFIDEHN+ NR+YKVGLN+FADLT+EE+R+ YLG S +
Sbjct: 51 KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K+KV S RY + G LP VDWR GAV +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRFGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165
Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQEL+DC R N GCNGG + FQFII NGG+++E++YPY + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K V+ID YE+V +E +L+ AV QPVSVA++A G AF+HY SG+FTG CG+A
Sbjct: 226 LDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA 285
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343
Query: 359 -NSQNSAKPKPHSS 371
N+QN PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 185/344 (53%), Positives = 245/344 (71%), Gaps = 25/344 (7%)
Query: 28 SIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN 87
+I+ Y+ + HS DD ++ ++ WL +H + + + ++RFQIFKDNL +I HN
Sbjct: 33 AIMDYEAHELHS-----DDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHN 87
Query: 88 SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL------ 141
++Y +GLNKF+DLT++E+RA+YLG R A + + + GD
Sbjct: 88 KQEKSYWLGLNKFSDLTHDEFRALYLGIRP-----------AGRAHGLRNGDRFIYEDVV 136
Query: 142 -PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
E VDWR+KGAV+ VKDQGSCGSCWAFS + +VEG+N IVTGELISLSEQELVDCDR
Sbjct: 137 AEEMVDWRKKGAVSDVKDQGSCGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQ 196
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR-NAKVVSIDGYEDVSPFD 259
N GCNGGLMDYAF FII+NGG+D+E+DYPY + +CD +R+ +KVV ID Y+DV
Sbjct: 197 NQGCNGGLMDYAFDFIIKNGGIDTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKS 256
Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWL 318
E SL KAV+ PVSVAIEAGGR FQHY+ GVFTG CG+ LDHGV+AVGYGT ++GV+YW+
Sbjct: 257 ESSLLKAVSKNPVSVAIEAGGRDFQHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWI 316
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
V+NSWG WGE GY++++R ++ +GKCGI +E S+P+K N
Sbjct: 317 VKNSWGPSWGEKGYIRMERMGSNSTSGKCGINIEPSFPIKKGAN 360
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/357 (52%), Positives = 248/357 (69%), Gaps = 14/357 (3%)
Query: 2 ATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK 61
++ ++FLA S F F S + A D SI+ Y S ++ D+++ ++++W+++HGK
Sbjct: 5 SSKALFLACS---FCLFASLAVAGDFSIVGYS-----SEDLKSMDKLIELFESWMSRHGK 56
Query: 62 TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
+ RF IFKDNL+ IDE N + Y +GLN+FADL+++E++ YLG + D R
Sbjct: 57 IYQSIEEKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSR 116
Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
R + + + + K ELP+SVDWR+KGAV VK+QGSCGSCWAFSTVAAVEGIN+IV
Sbjct: 117 R----RESPEEFTYKDF-ELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIV 171
Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
TG L SLSEQEL+DCDR N GCNGGLMDYAF FI++NGG+ E+DYPY+ E C+ ++
Sbjct: 172 TGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTK 231
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
+VV+I GY DV +E SL KA+ +QP+SVAIEA GR FQ Y GVF G CGS LDH
Sbjct: 232 EETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDH 291
Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GV AVGYGT GV+Y +V+NSWGS WGE GY++++RN + G CGI ASYP K
Sbjct: 292 GVAAVGYGTSKGVNYIIVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 347
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/358 (52%), Positives = 243/358 (67%), Gaps = 11/358 (3%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA ++ A L FI+ + A D SI+ Y H S D+ + ++++W++KH
Sbjct: 1 MALSTFSKATLILSATLFITYAIAHDFSIVGYSPEHLASM-----DKTIELFESWMSKHS 55
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
KT + RF+IF DNL+ IDE N +Y +GLN+FADL++EE+++ YLG R +
Sbjct: 56 KTYRSIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFP 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R K +S+ ++ ++LPESVDWR KGAV PVK+QGSCGSCWAFSTVAAVEGIN+I
Sbjct: 116 R-----KRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 170
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG L SLSEQEL+DCDR N GC GGLMDYAFQ+I+ N G+ E+DYPYL E +C
Sbjct: 171 VTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIRE 230
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ +VV+I GYEDV DE SL KA++ QPVSVAIEA R FQ Y+ G+FTG CG+ +D
Sbjct: 231 KEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMD 290
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
HGV AVGYG+ G DY +V+NSWG WGENGY++++RN G CGI ASYP K
Sbjct: 291 HGVTAVGYGSSEGTDYIIVKNSWGPKWGENGYIRMKRN-TGKPEGLCGINQMASYPTK 347
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 187/327 (57%), Positives = 232/327 (70%), Gaps = 11/327 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKT--SNGM------GHNEKRFQIFKDNLRFIDEHNSLNRTYKV 95
+++ + ++ +W+ +HGK+ N + G R+ IFKDNLRFI N N+ Y +
Sbjct: 49 SEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQGYFL 108
Query: 96 GLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP 155
GLN FADLTNEE+RA G R D R ++ RY +LP+S+DWREKGAV
Sbjct: 109 GLNAFADLTNEEFRAQRHGGRFDRSRE--RTSYEEFRYGSVQLKDLPDSIDWREKGAVVG 166
Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
VKDQGSCGSCWAFS VAA+EG+NK+ TGEL+SLSEQELVDCD+ + GCNGGLMDYAF F
Sbjct: 167 VKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGF 226
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
+I+NGG+D+E DYPY G +CD S+ NAKVV+IDGYEDV DE +L KAVA QPVSVA
Sbjct: 227 VIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVA 286
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
I+AGG + Q Y SG+FTG CG+ LDHGV VGYG E+G YW+++NSWGS+WGE GY+K+
Sbjct: 287 IDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKNSWGSNWGEKGYIKM 346
Query: 336 QRNLLDTNTGKCGIAMEASYPVKNSQN 362
RN G CGI MEASYP K N
Sbjct: 347 ARN-TGLAAGLCGINMEASYPTKTGAN 372
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 187/327 (57%), Positives = 231/327 (70%), Gaps = 11/327 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKT--------SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKV 95
+++ + ++ +W+ +HGK+ + G R+ IFKDNLRFI N N+ Y +
Sbjct: 49 SEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQGYFL 108
Query: 96 GLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP 155
GLN FADLTNEE+RA G R D R ++ RY +LP+S+DWREKGAV
Sbjct: 109 GLNAFADLTNEEFRAQRHGGRFDRSRE--RTSHEEFRYGSVQLKDLPDSIDWREKGAVVG 166
Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
VKDQGSCGSCWAFS VAA+EG+NK+ TGEL+SLSEQELVDCD+ + GCNGGLMDYAF F
Sbjct: 167 VKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGF 226
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
+I+NGG+D+E DYPY G +CD S+ NAKVV+IDGYEDV DE +L KAVA QPVSVA
Sbjct: 227 VIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVA 286
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
I+AGG + Q Y SG+FTG CG+ LDHGV VGYG E+G YW+++NSWGS+WGE GYVK+
Sbjct: 287 IDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKNSWGSNWGEKGYVKM 346
Query: 336 QRNLLDTNTGKCGIAMEASYPVKNSQN 362
RN G CGI MEASYP K N
Sbjct: 347 ARN-TGLAAGLCGINMEASYPTKTGAN 372
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 187/358 (52%), Positives = 243/358 (67%), Gaps = 11/358 (3%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA ++ A L FI+ ++A D SI+ Y H S D+ + ++++W++KH
Sbjct: 1 MALSTFSKATLILSATLFITYATAHDFSIVGYSPEHLASM-----DKTIELFESWMSKHS 55
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF+IF DNL+ IDE N +Y +GLN+FADL++EE+++ YLG R +
Sbjct: 56 KAYRSIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFP 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R K +S+ ++ ++LPESVDWR KGAV PVK+QGSCGSCWAFSTVAAVEGIN+I
Sbjct: 116 R-----KRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 170
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG L SLSEQEL+DCDR N GC GGLMDYAFQ+I+ N G+ E+DYPYL E +C
Sbjct: 171 VTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIRE 230
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ +VV+I GYEDV DE SL KA++ QPVSVAIEA R FQ Y+ G+FTG CG+ +D
Sbjct: 231 KEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMD 290
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
HGV AVGYG+ G DY +V+NSWG WGENGY++++RN G CGI ASYP K
Sbjct: 291 HGVTAVGYGSSEGTDYIIVKNSWGPKWGENGYIRMKRN-TGKPEGLCGINQMASYPTK 347
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/356 (52%), Positives = 244/356 (68%), Gaps = 11/356 (3%)
Query: 3 TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
++S L + F F S + D SI+ Y S ++ D+++ ++++W+++HGK
Sbjct: 4 SSSKALVLIACSFCLFASLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHGKI 58
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
+ RF+IFKDNL+ IDE N + Y +GLN+FADL++ E+ YLG + D RR
Sbjct: 59 YENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHREFNNKYLGLKVDYSRR 118
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
+ + + + K ELP+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 119 ----RESPEEFTYK-DVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVT 173
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
G L SLSEQEL+DCDR N GCNGGLMDYAF FI++NGG+ E+DYPY+ E C+ ++
Sbjct: 174 GNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKE 233
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
+VV+I GY DV +E SL KA+A+QP+SVAIEA GR FQ Y GVF G CGS LDHG
Sbjct: 234 ETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 293
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
V AVGYGT GVDY V+NSWGS WGE GY++++RN + G CGI ASYP K
Sbjct: 294 VAAVGYGTAKGVDYITVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 348
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 195/374 (52%), Positives = 257/374 (68%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M F+++S L F + + I+S N + + RT+DEV +Y++WL K+G
Sbjct: 1 MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E+RF+IFK+ LRFIDEHN+ NR+YKVGLN+FADLT+EE+R+ YLG S +
Sbjct: 51 KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K+KV S RY + G LP VDWR GAV +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165
Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQEL+DC R N GCNGG + FQFII NGG+++E++YPY + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K V+ID YE+V +E +L+ AV QPVSVA++A G AF+ Y SG+FTG CG+A
Sbjct: 226 VELQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA 285
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343
Query: 359 -NSQNSAKPKPHSS 371
N+QN P+P+SS
Sbjct: 344 YNNQN--YPEPYSS 355
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 173/228 (75%), Positives = 203/228 (89%)
Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
G+ LPESVDWRE GAVNPVKDQ SCGSCWAFSTVAAVEGIN+IVTGELISLSEQELVDCD
Sbjct: 3 GEVLPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCD 62
Query: 198 RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
+ + GCNGGLMDYAF FII+NGG+D+E+DYPY G + +C+ S +++KVVSIDGYEDV P
Sbjct: 63 TEYDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPP 122
Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
FDE +L+KAVA QPVSVA+EAGGRA Q Y SG+FTGECG+ALDHG+VAVGYGTENG DYW
Sbjct: 123 FDEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGTENGTDYW 182
Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
+VRNSWGS WGENGY++++RN+ D +GKCGIAMEASYP+KN +N +K
Sbjct: 183 IVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPIKNGENPSK 230
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 191/314 (60%), Positives = 237/314 (75%), Gaps = 21/314 (6%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
+Y+ WL ++ K NG+G E+R +IFK+NL+FIDEHNSL N+T++VGL +FADLTN+E
Sbjct: 1 MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE-- 58
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
+ MK+ RY K GD LP+ +DWR KGAV PVKDQG+CGSCWAFS
Sbjct: 59 ----------PKDFMKA----DRYLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFS 104
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
V AVEGIN+I TGELISLS+QEL+DCDR +NAGC GG+M+YAF+FII NGG++S+QDY
Sbjct: 105 AVGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDY 164
Query: 229 PYLGAE-NKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
PY + C+ ++N +VV IDGYE V+ DE SLKKAVA QPV VAIEA +AF+ Y
Sbjct: 165 PYTATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLY 224
Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
+SGVFTG CG LDHGVV VGYGT +G DYW++RNSWG +WGENGYVKLQRN +D + GK
Sbjct: 225 KSGVFTGTCGIYLDHGVVVVGYGTSSGEDYWIIRNSWGLNWGENGYVKLQRN-IDDSFGK 283
Query: 347 CGIAMEASYPVKNS 360
CG+AM SYP K+S
Sbjct: 284 CGVAMMPSYPTKSS 297
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/346 (56%), Positives = 252/346 (72%), Gaps = 20/346 (5%)
Query: 27 MSIISYDNNHDHSS---SWRTDDEVMTIYQTWLAKH---GKTSNG-MGHNEKRFQIFKDN 79
MSII Y+ H RT+ E +Y W+A+H G + NG +G E+RF++F DN
Sbjct: 38 MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDN 97
Query: 80 LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
L+F+D HN+ + +++G+N+FADLTN+E+RA YLGT + R + + Y
Sbjct: 98 LKFVDAHNAHADGHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHV-----GEMYRHD 152
Query: 137 AGDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ LP+SVDWR+KGAV +PVK+QG CGSCWAFS VAAVEGINKIVTGEL+SLSEQELV+
Sbjct: 153 GVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVE 212
Query: 196 CDRKINAGCNGG-LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
C R G +MD AF FI +NGG+D+E+DYPY + KCD ++++ KVVSIDG+ED
Sbjct: 213 CARNGGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFED 272
Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--N 312
V DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG++LDHGVVAVGYGT+
Sbjct: 273 VPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAAT 332
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
G DYW VRNSWG DWGENGY++++RN+ TGKCGIAM ASYP+K
Sbjct: 333 GTDYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPIK 377
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 187/356 (52%), Positives = 244/356 (68%), Gaps = 11/356 (3%)
Query: 3 TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
++S L + F F S + D SI+ Y S ++ D+++ ++++W+++HGK
Sbjct: 4 SSSKALVLIACSFCLFASLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHGKI 58
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
+ RF+IFKDNL+ IDE N + Y +GL++FADL++ E+ YLG + D RR
Sbjct: 59 YENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHREFNNKYLGLKVDYSRR 118
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
+ + + + K ELP+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 119 ----RESPEEFTYK-DVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVT 173
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
G L SLSEQEL+DCDR N GCNGGLMDYAF FI++NGG+ E+DYPY+ E C+ ++
Sbjct: 174 GNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGACEMTKE 233
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
+VV+I GY DV +E SL KA+A+QP+SVAIEA GR FQ Y GVF G CGS LDHG
Sbjct: 234 ETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 293
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
V AVGYGT GVDY V+NSWGS WGE GY++++RN + G CGI ASYP K
Sbjct: 294 VAAVGYGTAKGVDYITVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 348
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 195/374 (52%), Positives = 256/374 (68%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M F+++S L F + + I+S N + + RT+DEV +Y++WL K+G
Sbjct: 1 MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E+RF+IFK+ LRFIDEHN+ NR+YKVGLN+FADLT+EE+R+ YL S +
Sbjct: 51 KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGS 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K+KV S RY + G LP VDWR GAV +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165
Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQEL+DC R N GCNGG + FQFII NGG+++E++YPY + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K V+ID YE+V +E +L+ AV QPVSVA++A G AF+ Y SG+FTG CG+A
Sbjct: 226 VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA 285
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 286 VDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343
Query: 359 -NSQNSAKPKPHSS 371
N+QN PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/336 (55%), Positives = 235/336 (69%), Gaps = 6/336 (1%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H T+D + +Y+ W +H K + G +RF +FK N+ + E N +++ YK+ L
Sbjct: 26 HEKELETEDNLWDMYERW--RH-KVATNHGEKLRRFNVFKSNVLHVHETNKMDKPYKLKL 82
Query: 98 NKFADLTNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
NKFAD+TN E+R++Y G++ R L + S+ + + +P SVDWR+KGAV PV
Sbjct: 83 NKFADMTNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPV 142
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
KDQG CGSCWAFSTVAAVEGINKI T EL+SLSEQELVDCD N GCNGGLMD AF FI
Sbjct: 143 KDQGQCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFI 202
Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
+ GG+ E YPY + KCD ++ N+ VVSIDG+EDV DE SL KAVA+QPV+VAI
Sbjct: 203 KKTGGLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAI 262
Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKL 335
+AG FQ Y GVFTG+CG+ LDHGV AVGYGT +G YW+VRNSWGS+WGE GY+++
Sbjct: 263 DAGSSDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIRM 322
Query: 336 QRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
+R + D G CGIAMEASYP+KNS N+ K P SS
Sbjct: 323 ERGISDKR-GLCGIAMEASYPIKNSSNNPKSSPTSS 357
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 195/374 (52%), Positives = 256/374 (68%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M F+++S L F + + I+S N + + RT+DEV +Y++WL K+G
Sbjct: 1 MGLPKSFVSMSLLFF---------STLLILSLAFNTKNLTQ-RTNDEVKAMYESWLIKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E+RF+IFK+ LRFIDEHN+ NR+YKVGLN+FADLT+EE+R+ YLG S +
Sbjct: 51 KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K+KV S RY + G LP VDWR GAV +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165
Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQEL+DC R N GCNGG + FQFII NGG+++E++YPY + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K V+ID YE+V +E +L+ AV QPVSVA++A G AF+ Y SG+FTG CG+A
Sbjct: 226 VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA 285
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343
Query: 359 -NSQNSAKPKPHSS 371
N+QN PK +SS
Sbjct: 344 YNNQN--HPKSYSS 355
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/356 (52%), Positives = 243/356 (68%), Gaps = 11/356 (3%)
Query: 3 TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
+ S L + F F S + D SI+ Y S ++ D+++ ++++W+++HGK
Sbjct: 4 STSKALRVLACSFCLFASFTFGRDFSIVGYS-----SEDLKSMDKLIELFESWISRHGKI 58
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
+ RF+IFKDNL+ IDE N + Y +GLN+FADL+++E++ YLG + D RR
Sbjct: 59 YQSIEEKLHRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRR 118
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
+ + + + K ELP+SVDWR+KGAV VK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 119 ----RESPEEFTYK-DVELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVT 173
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
G L SLSEQEL+DCDR N GCNGGLMDYAF FI++N G+ E+DYPY+ E C+ ++
Sbjct: 174 GNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEEDYPYIMEEGTCEMAKE 233
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
+VV+I GY DV +E SL KA+A+QP+SVAIEA GR FQ Y GVF G CGS LDHG
Sbjct: 234 ETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 293
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
V AVGYGT GVDY V+NSWGS WGE GY++++RN + G CGI ASYP K
Sbjct: 294 VAAVGYGTAKGVDYITVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 348
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 186/330 (56%), Positives = 244/330 (73%), Gaps = 17/330 (5%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEK-RFQIFKDNLRFIDEHNSLN----RTYKVGL 97
R DDEV +Y+ W ++HG +G G +++ R ++F+DNLR+ID HN+ T+++GL
Sbjct: 43 RADDEVRRMYEAWKSEHG---HGHGSDDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGL 99
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKV---ASQRYACKAGDELPESVDWREKGAVN 154
FADLT EEYR LG R+ RR S+V +S R + GD LP+++DWRE GAV
Sbjct: 100 TPFADLTLEEYRGRALGFRA---RRGGASRVGSGSSYRPRPRGGD-LPDAIDWRELGAVT 155
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQ 214
VK+Q CG CWAFS VAA+EGIN+IVTG L+SLSEQE++DCD + + GCNGG M AFQ
Sbjct: 156 GVKNQEQCGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQ-DGGCNGGEMQNAFQ 214
Query: 215 FIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSV 274
F+I NGG+D+E DYPYLG + CD +R N +VV+IDG+ V+ +E +L++AVA+QPVSV
Sbjct: 215 FVINNGGIDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSV 274
Query: 275 AIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
AI+A GR FQHY SG+F G CG+ LDHGV AVGYG+ENG DYW+V+NSW S WGE GY++
Sbjct: 275 AIDASGRKFQHYTSGIFNGPCGTQLDHGVTAVGYGSENGKDYWIVKNSWSSSWGEAGYIR 334
Query: 335 LQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
++RN+ TGKCGIAM+ASYPVK+S N A
Sbjct: 335 IRRNVA-AATGKCGIAMDASYPVKSSSNPA 363
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 181/358 (50%), Positives = 244/358 (68%), Gaps = 11/358 (3%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA + + + T F+ S A D SI+ Y H S D+++ ++++W++ HG
Sbjct: 1 MALSVLKTSFLTFFASLFVCSVLAHDFSIVGYSPEHLTSV-----DKLVELFESWISGHG 55
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K N + RF++FK+NL+ ID+ N +Y +GLN+FADL++EE+++ +LG +
Sbjct: 56 KAYNSLEEKLHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKFLGLYPEFP 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R K +S+ ++ + +LP+S+DWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+I
Sbjct: 116 R-----KKSSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 170
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
V G L SLSEQ+L+DCD N GCNGGLMDYAF+FI+ NGG+ E+DYPYL E CD
Sbjct: 171 VAGNLTSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEK 230
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
R +VV+I GY DV DE SL KA+A QP+SVAI+A GR FQ Y GVF+G CG+ LD
Sbjct: 231 REEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFSGPCGTDLD 290
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
HGV AVGYG+ +G+DY +V+NSWG WGE GY++++RN G CGI ASYP K
Sbjct: 291 HGVAAVGYGSSSGIDYIIVKNSWGPKWGERGYLRMKRN-TGKPEGLCGINKMASYPTK 347
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 373 bits (957), Expect = e-101, Method: Compositional matrix adjust.
Identities = 193/374 (51%), Positives = 254/374 (67%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M F+++S L F + + I+S N + + RT+DEV +Y++WL K+G
Sbjct: 1 MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E+RF+IFK+ LRFIDEHN+ NR+YKVGLN+FADLT+EE+R+ YLG S +
Sbjct: 51 KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K+KV S RY + G LP VDWR GAV +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165
Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQEL+DC R N GCNG + F FII NGG+++E++YPY + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECN 225
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K V+ID YE+V +E +L+ AV QPVSVA++A G AF+ Y SG+FTG CG+A
Sbjct: 226 VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA 285
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343
Query: 359 -NSQNSAKPKPHSS 371
N+QN PK +SS
Sbjct: 344 YNNQN--HPKSYSS 355
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/358 (50%), Positives = 247/358 (68%), Gaps = 10/358 (2%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA S + T F+S + D SI+ Y S ++ D+++ ++++W+++HG
Sbjct: 1 MAFFSPKTLVLTCSLCLFLSLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHG 55
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF++FKDNL+ ID+ N + Y +GLN+FADL+++E++ YLG + D
Sbjct: 56 KIYETIEEKLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDLS 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+R + +S+ +LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+I
Sbjct: 116 QR----RESSEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQI 171
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG L SLSEQEL+DCD N GCNGGLMDYAF FI++NGG+ E+DYPY+ E+ C+
Sbjct: 172 VTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYIMEESTCEMK 231
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ ++VV+I+GY DV +E SL KA+A+QP+SVAIEA GR FQ Y GVF G CGS LD
Sbjct: 232 KEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSELD 291
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
HGV AVGYGT G+DY +V+NSWG+ WGE G+++++RN + + G CG+ ASYP K
Sbjct: 292 HGVSAVGYGTSKGLDYIIVKNSWGAKWGEKGFIRMKRN-IGKSEGICGLYKMASYPTK 348
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 183/368 (49%), Positives = 240/368 (65%), Gaps = 20/368 (5%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
+FL + TL + + S +D H T+++ +Y+ W + H S
Sbjct: 4 LFLVLFTLALVLRLGES---------FDF---HEKELETEEKFWELYERWRSHH-TVSRS 50
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
+ KRF +FK N+ ++ N ++ YK+ LNKFAD+TN E+R Y G++ R L+
Sbjct: 51 LDEKHKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLG 110
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+ A+ + D +P S+DWR+KGAV PVKDQG CGSCWAFSTV AVEGIN+I T +L
Sbjct: 111 ASRANGTFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKL 170
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
+SLSEQELVDCD N GCNGGLMD AF FI + GG+ +E+ YPY ++KCD +RN
Sbjct: 171 VSLSEQELVDCDTTENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDKCDIQKRNTP 230
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VVSIDG+EDV P DE +L KAVA+QP+SVAI+A G FQ Y GVFTGECG+ LDHGV
Sbjct: 231 VVSIDGHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAI 290
Query: 306 VGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN-- 362
VGYGT +G YW+V+NSWG+ WGE GY+++QR +D G CGIAM+ SYP+K S N
Sbjct: 291 VGYGTTVDGTKYWIVKNSWGAGWGEKGYIRMQRK-VDAEEGLCGIAMQPSYPIKTSSNPT 349
Query: 363 ---SAKPK 367
+A PK
Sbjct: 350 GSPAATPK 357
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 175/334 (52%), Positives = 229/334 (68%), Gaps = 3/334 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H T++ + +Y+ W + H S + KRF +FK+N+ F+ E N + YK+ L
Sbjct: 24 HQKELETEESLWNLYERWRSHH-TVSRSLDEKHKRFNVFKENVNFVHEFNKKDEPYKLKL 82
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + R S+ A+ + + +P SVDWR+KGAV P+K
Sbjct: 83 NKFADMTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIK 142
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFSTV AVEGIN I T +L+SLSEQELVDCD N GCNGGLM YAF+FI
Sbjct: 143 DQGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIK 202
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+ GG+ +EQ YPY + CD S+ N+ VVSIDG+E V P +E +L KA A+QP+SVAI+
Sbjct: 203 EKGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAID 262
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG AFQ Y GVF G CG+ LDHGV VGYGT +G YW+V+NSWG+DWGENGY++++
Sbjct: 263 AGGSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMK 322
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
R + G CGIA+EASYP+KNS + P S
Sbjct: 323 RG-ISAKEGLCGIAVEASYPIKNSSTNPVGAPSS 355
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 174/322 (54%), Positives = 228/322 (70%), Gaps = 3/322 (0%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
+Y+ W + H S + +KRF +FK N+ ++ N ++ YK+ LNKFAD+TN E+R
Sbjct: 37 LYERWRSHH-TVSRSLDEKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRH 95
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
Y G++ R + + A+ + +++P SVDWR+KGAV PVKDQG CGSCWAFST
Sbjct: 96 HYAGSKIKHHRSFLGASRANGTFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSCWAFST 155
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
V AVEGIN+I T EL+SLSEQELVDCD N GCNGGLMD AF+FI + GG+++E++YPY
Sbjct: 156 VVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYPY 215
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+ +CD +RN+ VVSIDGYEDV P DE SL KAVA+QPVSVAI+A G FQ Y GV
Sbjct: 216 MAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQFYSEGV 275
Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG+CG+ LDHGV VGYGT +G YW+VRNSWG +WGE GY+++QR +D G CGI
Sbjct: 276 FTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRE-IDAEEGLCGI 334
Query: 350 AMEASYPVKNSQNSAKPKPHSS 371
AM+ SYP+K S ++ P ++
Sbjct: 335 AMQPSYPIKTSSSNPTGSPATA 356
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 369 bits (948), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 188/381 (49%), Positives = 251/381 (65%), Gaps = 22/381 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSS-------------SWRTDDE 47
M +A L I L+ + S ++A DMS+++YD+NH ++ + D E
Sbjct: 1 MGSAKSALLI-LLLAMVIASCATAMDMSVVTYDDNHHVTAGPGHHVTAGPGRRNGVFDVE 59
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEE 107
I+++W+ KHGK + + E+R IFKDNLRFI NS N Y++GLN+FADL+ E
Sbjct: 60 ASLIFESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSENLGYRLGLNRFADLSLHE 119
Query: 108 YRAMYLGTRSDAKRR--LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
Y+ + G R M S S RY AGD LP+SVDWR +GAV VKDQG C SC
Sbjct: 120 YKEICHGADPKPPRNHVFMSS---SDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSC 176
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFSTV AVEG+NKIVTGEL++LSEQ+L++C+++ N GC GG ++ A++FI+ NGG+ ++
Sbjct: 177 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIVSNGGLGTD 235
Query: 226 QDYPYLGAENKCDPS-RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
DYPY CD + N K V IDGYE++ DE++L KAVA QPV+ I++ R FQ
Sbjct: 236 NDYPYKAVNGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQ 295
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
YESGVF G CG+ L+HGVV VGYGTENG +YW+VRNSWG+ WGE GY+K+ RN+ +
Sbjct: 296 LYESGVFDGRCGTNLNHGVVVVGYGTENGRNYWIVRNSWGNTWGEAGYMKMARNIANPR- 354
Query: 345 GKCGIAMEASYPVKNSQNSAK 365
G CGIAM SYP+KNS + K
Sbjct: 355 GLCGIAMRVSYPLKNSFTTGK 375
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 184/361 (50%), Positives = 244/361 (67%), Gaps = 23/361 (6%)
Query: 8 LAISTLVFL-----FFISSSSAADMSIISYDNNHDHSSSWRTD----DEVMTIYQTWLAK 58
+A+S L+ L FF+ +S D SI+ Y W D D ++ +++ W++
Sbjct: 1 MALSKLLPLAMCMSFFVVTSFGKDFSIVGY---------WPEDLTSMDRLIELFEEWISN 51
Query: 59 HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
HGK + RF++FKDNL+ IDE N +Y +G+N+FADLT++E++ MYLG + +
Sbjct: 52 HGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVE 111
Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
+ R ++ + + + K +LP+SVDWR+KGAV VK+QGSCGSCWAFSTVAAVEGIN
Sbjct: 112 SSR----TRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGIN 167
Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
KIV G L SLSEQEL+DCDR N GC+GGLMDYAF FI+ +GG+ E+DYPYL E+ CD
Sbjct: 168 KIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCD 227
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ +VV+I GY+DV +E SL KA+A QP+SVAIEA GR FQ Y GVF G CG+
Sbjct: 228 NKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQ 287
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
LDHGV AVGYG+ GVDY +V+NSWG WGE GY++++RN G CGI ASYP K
Sbjct: 288 LDHGVTAVGYGSSKGVDYIIVKNSWGPKWGEKGYIRMKRN-TGKPAGLCGINKMASYPTK 346
Query: 359 N 359
+
Sbjct: 347 S 347
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 184/360 (51%), Positives = 245/360 (68%), Gaps = 16/360 (4%)
Query: 6 MFLAISTLVFLFFIS------SSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
M L+ + FL FIS S+ A D SI+ Y + D +S D++ ++++W++KH
Sbjct: 1 MALSPFSNFFLLFISMAVFAYSAFARDFSIVGYSPD-DLTSM----DKLTDLFESWMSKH 55
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
GK+ RF++F+DNL+ IDE N +Y +GLN+FADL++EE++ YLG + +
Sbjct: 56 GKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLKIEL 115
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+R + + + ++ K +LP+SVDWR+KGAV VK+QG+CGSCWAFSTVAAVEGIN+
Sbjct: 116 PKR----RDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQ 171
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
IVTG L +LSEQEL+DCD+ N GCNGGLMDYAF FII NGG+ E+DYPY+ E C
Sbjct: 172 IVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEGTCGE 231
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
+ +VV+I GY DV +E S KA+A+QP+SVAIEA R FQ Y G+F G CG+ L
Sbjct: 232 KKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHCGTEL 291
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
DHGV AVGYGT GVDY V+NSWGS WGE GY++++RN + G CGI ASYP KN
Sbjct: 292 DHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRN-VGKPEGICGIYKMASYPTKN 350
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 367 bits (943), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 179/358 (50%), Positives = 242/358 (67%), Gaps = 9/358 (2%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA S + T F+S + D SI+ Y S ++ D+++ ++++W+++HG
Sbjct: 1 MAFFSSKTLVLTCSLCLFLSLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHG 55
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF++FKDNL+ IDE N + Y +GLN+FADL+++E++ YLG + +
Sbjct: 56 KIYETIEEKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVNLS 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+R S + + + D LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+I
Sbjct: 116 QRRESSN--EEEFTYRDVD-LPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQI 172
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG L SLSEQEL+DCD N GCNGGLMDYAF FI+QNGG+ E DYPY+ E+ C+
Sbjct: 173 VTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKEDDYPYIMEESTCEMK 232
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ +VV+I+GY DV +E SL KA+A+QP+SVAIEA R FQ Y GVF G CGS LD
Sbjct: 233 KEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLD 292
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
HGV AVGYGT +DY +V+NSWG+ WGE G+++++RN + G CG+ ASYP K
Sbjct: 293 HGVSAVGYGTSKNLDYIIVKNSWGAKWGEKGFIRMKRN-IGKPEGICGLYKMASYPTK 349
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 367 bits (942), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 181/357 (50%), Positives = 239/357 (66%), Gaps = 18/357 (5%)
Query: 7 FLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTD----DEVMTIYQTWLAKHGKT 62
F + FF+ +S D SI+ Y W D D ++ +++ W++ HGK
Sbjct: 8 FYFFLAMCMSFFVVTSFGKDFSIVGY---------WPEDLTSMDRLIELFEEWISNHGKI 58
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
+ RF++FKDNL+ IDE N +Y +G+N+FADLT++E++ MYLG + ++ R
Sbjct: 59 YETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSR- 117
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
++ + + + K +LP+SVDWR+KGAV VK+QGSCGSCWAFSTVAAVEGINKIV
Sbjct: 118 ---TRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVG 174
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
G L SLSEQEL+DCDR N GC+GGLMDYAF FI+ +GG+ E+DYPYL E+ CD +
Sbjct: 175 GNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKG 234
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
+VV+I GY+DV +E SL KA+A QP+SVAIEA GR FQ Y GVF G CG+ LDHG
Sbjct: 235 ELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQLDHG 294
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
V AVGYG+ GVDY +V+NSWG WGE GY++++RN G CGI ASYP K+
Sbjct: 295 VTAVGYGSSKGVDYIIVKNSWGPKWGEKGYIRMKRN-TGKPAGLCGINKMASYPTKS 350
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 185/355 (52%), Positives = 244/355 (68%), Gaps = 15/355 (4%)
Query: 3 TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
+ ++ LA S F F S + D SI+ Y S ++ D+++ ++++W++KHGK
Sbjct: 6 SKALVLACS---FCLFASLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSKHGKI 57
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
+ RF+IFKDNL+ IDE N + Y +GLN+FADL+++E++ YLG + D RR
Sbjct: 58 YQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRR 117
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
+ + + + K ELP+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 118 ----RESPEEFTYK-DVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVT 172
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
G L SLSEQEL+DCDR + GCNGGLMDYAF FI++NGG+ E+DYPY+ E C+ ++
Sbjct: 173 GNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKE 232
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
+VV+I GY DV +E SL KA+A+Q +SVAIEA GR FQ Y GVF G CGS LDHG
Sbjct: 233 ETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 292
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V AVGYGT GVDY +V+NSWGS WGE GY+++ R L+T G ASYP+
Sbjct: 293 VAAVGYGTAKGVDYIIVKNSWGSKWGEKGYIRM-RGTLETR-GNLRYLQMASYPL 345
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 367 bits (941), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 180/326 (55%), Positives = 233/326 (71%), Gaps = 9/326 (2%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
R+D+EV +Y W K+ + NE R ++FK+NL+F+DEHN+ T+ +G+N
Sbjct: 44 RSDEEVRMLYLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHNAAADRGEHTFLLGMN 103
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
+FADLTNEEYR +L R ++ R S S RY + GD+LP+S+DWRE GAV PVK+
Sbjct: 104 RFADLTNEEYRTRFL--RDFSRLRRSASGKISSRYRLREGDDLPDSIDWRENGAVVPVKN 161
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG CGSCWAFSTVAAVEGIN+IVTG+LISLSEQ+LVDC N GC GG M+ AFQFI+
Sbjct: 162 QGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCT-TANHGCRGGWMNPAFQFIVN 220
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG++SE+ YPY G C+ S NA VVSID YE+V +E SL+KAVA+QPVSV ++A
Sbjct: 221 NGGINSEETYPYRGQNGICN-STVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDA 279
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
GR FQ Y SG+FTG C + +H + VGYGTEN D+W+V+NSWG +WGE+GY++ +RN
Sbjct: 280 AGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVKNSWGKNWGESGYIRAERN 339
Query: 339 LLDTNTGKCGIAMEASYPVKNSQNSA 364
+ + N GKCGI ASYPVK N+A
Sbjct: 340 IENPN-GKCGITRFASYPVKKGANTA 364
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 366 bits (940), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 184/360 (51%), Positives = 239/360 (66%), Gaps = 9/360 (2%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+ L+ +F S YD+ S ++ + T+Y W + H +
Sbjct: 1 MKKLLLIFLFSLVILQTACGFDYDDKEIES-----EEGLSTLYDRWRSHHS-VPRSLNER 54
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
EKRF +F+ N+ + N NR+YK+ LNKFADLT E++ Y G+ R L K
Sbjct: 55 EKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRG 114
Query: 130 SQR--YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
S++ Y + +LP SVDWR+KGAV +K+QG CGSCWAFSTVAAVEGINKI T +L+S
Sbjct: 115 SKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVS 174
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQELVDCD K N GCNGGLM+ AF+FI +NGG+ +E YPY G + KCD S+ N +V
Sbjct: 175 LSEQELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLV 234
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
+IDG+EDV DE +L KAVA+QPVSVAI+AG FQ Y GVFTG CG+ L+HGV AVG
Sbjct: 235 TIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVG 294
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPK 367
YG+E G YW+VRNSWG++WGE GY+K++R +D G+CGIAMEASYP+K S ++ PK
Sbjct: 295 YGSERGKKYWIVRNSWGAEWGEGGYIKIERE-IDEPEGRCGIAMEASYPIKLSSSNPTPK 353
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 366 bits (940), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 181/326 (55%), Positives = 235/326 (72%), Gaps = 9/326 (2%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
R+D+EV +Y W AK+ + NE R ++FK+NL+F+D+HN+ T+++G+N
Sbjct: 42 RSDEEVRMLYLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNAAADRGEHTFRLGMN 101
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
+FADLTNEEYR +L R ++ R S S RY + GD+LP+S+DWREKGAV PVK+
Sbjct: 102 RFADLTNEEYRTRFL--RDFSRLRRSASGKISSRYRLREGDDLPDSIDWREKGAVVPVKN 159
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG CGSCWAFSTVAAVEGIN+IVTG+LISLSEQ+LVDC N GC GG M+ AFQFI+
Sbjct: 160 QGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCT-TANHGCRGGWMNPAFQFIVN 218
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG++SE+ YPY G C+ S NA VVSID YE+V +E SL+KAVA+QPVSV ++A
Sbjct: 219 NGGINSEETYPYRGQNGICN-STVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDA 277
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
GR FQ Y SG+FTG C + +H + VGYGTEN DY V+NSWG +WGE+GY++++RN
Sbjct: 278 AGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDYRTVKNSWGKNWGESGYIRVERN 337
Query: 339 LLDTNTGKCGIAMEASYPVKNSQNSA 364
+ + N GKCGI ASYPVK N+A
Sbjct: 338 IGNPN-GKCGITRFASYPVKKGTNTA 362
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 173/314 (55%), Positives = 224/314 (71%), Gaps = 5/314 (1%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
D+++ +++W++KHGK M RF++F++NL IDE N +Y +GLN+FADL++
Sbjct: 398 DKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSH 457
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
EE+++ YLG R++ R S+ S + + +LPESVDWR+KGAV VK+QG+CGSC
Sbjct: 458 EEFKSKYLGLRAEFPR----SRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSC 513
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFSTVAAVEGIN+IVTG L +LSEQEL+DCD N+GCNGGLMDYAF FI NGG+ E
Sbjct: 514 WAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKE 573
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
DYPYL E C+ + + +V+I GYEDV DE SL KA+A QP+SVAIEA GR FQ
Sbjct: 574 DDYPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQF 633
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y GVF G CG+ LDHGV AVGYG+ G+DY +V+NSWG WGE GY++++RN T G
Sbjct: 634 YSGGVFNGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRNTGKTE-G 692
Query: 346 KCGIAMEASYPVKN 359
CGI ASYP K+
Sbjct: 693 LCGINKMASYPTKD 706
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 185/342 (54%), Positives = 234/342 (68%), Gaps = 10/342 (2%)
Query: 17 FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIF 76
FF +S A D SI+ Y D +S D+++ ++++W++KHGK + RF+IF
Sbjct: 3 FFANSGLARDFSIVGY-TPEDLTSG----DKIIDLFESWISKHGKIYESIEEKWLRFEIF 57
Query: 77 KDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
KDNL IDE N Y +GLN+F+DL++EE++ YLG + D R + SQ + K
Sbjct: 58 KDNLFHIDETNKKVVNYWLGLNEFSDLSHEEFKNKYLGLKVDMSER----RECSQEFNYK 113
Query: 137 AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
+P+SVDWR+KGAV VK+QGSCGSCWAFSTVAAVEGIN+IVTG L SLSEQELVDC
Sbjct: 114 DVMSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDC 173
Query: 197 DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
D N GCNGGLMDYAF +II NGG+ E DYPY+ E C+ + ++VV+I GY DV
Sbjct: 174 DTTNNYGCNGGLMDYAFSYIISNGGLHKEVDYPYIMEEGTCEMRKEESEVVTISGYHDVP 233
Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDY 316
E SL KA+A+QP+SVAIEA GR FQ Y GVF G CG+ LDHGV AVGYG+ NG+DY
Sbjct: 234 QNSEESLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGTQLDHGVAAVGYGSTNGLDY 293
Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+V+NSWGS WGE GY++++RN G CGI ASYP K
Sbjct: 294 IIVKNSWGSKWGEKGYIRMKRN-TGKPAGLCGINKMASYPTK 334
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 365 bits (936), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 171/319 (53%), Positives = 224/319 (70%), Gaps = 3/319 (0%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
+Y+ W + H S + +KRF +FK N+ ++ N ++ YK+ LNKFAD+TN E+R
Sbjct: 37 LYERWRSHH-TVSRSLDEKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRH 95
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
Y G++ R + + A+ + D +P +VDWR+KGAV PVKDQG CGSCWAFST
Sbjct: 96 HYAGSKIKHHRTFLGASRANGTFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGSCWAFST 155
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
V AVEGIN+I T EL+SLSEQELVDCD N GCNGGLMD AF+FI + GG+++E++YPY
Sbjct: 156 VVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYPY 215
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+ +CD +RN+ VVSIDG+EDV P DE SL KAVA+QPVSVAI+A G FQ Y GV
Sbjct: 216 MAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQFYSEGV 275
Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG+CG+ LDHGV VGYGT + YW+V+NSWG +WGE GY+++QR +D G CGI
Sbjct: 276 FTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQRE-IDAEEGLCGI 334
Query: 350 AMEASYPVKNSQNSAKPKP 368
AM+ SYP+K S ++ P
Sbjct: 335 AMQPSYPIKTSSSNPTGSP 353
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 365 bits (936), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 184/353 (52%), Positives = 242/353 (68%), Gaps = 11/353 (3%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M +AI L +F +SS A DMSIIS+DN H ++ RTDDEVM++++ WL KH K N
Sbjct: 1 MNMAIVLLFMVFAVSS--ALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNA 58
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
+G EKRFQIFK+NLRFIDE NSLNRTYK+GLN FADLTN EYRAMYL T D R +
Sbjct: 59 LGEKEKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLD 118
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGE 184
+ RY + GD +P+SVDWR++GAV PVK+QG +C SCWAF+ V AVE + KI TG+
Sbjct: 119 TP-PRNRYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGD 177
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
LISLSEQE+VDC + GC GG + + + +I +N G+ E+DYPY G E KCD +++NA
Sbjct: 178 LISLSEQEVVDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNA 236
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
+V+IDG+ V E +LK+ +A+QPV+V I A FQ+Y SGVF G+CG+ L+H ++
Sbjct: 237 -IVTIDGHGWVPTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGVFKGKCGTELNHALL 295
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
VGYG E DYW+ +NS+ WGENGY+++QR L C YP+
Sbjct: 296 LVGYGAEKDGDYWIAKNSYSDKWGENGYIRIQRKL-----STCKFGNGGYYPI 343
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 182/326 (55%), Positives = 235/326 (72%), Gaps = 7/326 (2%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
+++ + ++Y+ W A H S + +KRF +FK+N++FI E N + TYK+ LNKF D
Sbjct: 33 SEESLWSLYEKWRAHHA-VSRDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGD 91
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
+TN+E+R+ Y G++ D L K A + ++ + +LP SVDWREKGAV VKDQG C
Sbjct: 92 MTNQEFRSTYAGSKIDHHMTLRGVKDAGE-FSYEKFHDLPTSVDWREKGAVTGVKDQGQC 150
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFSTV AVEGIN+I T EL+SLSEQ+LVDCD K N+GCNGGLMDYAF FI NGG+
Sbjct: 151 GSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTK-NSGCNGGLMDYAFDFIKNNGGL 209
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
SE YPYL + C S N+ VV+IDGY+DV +E +L KAVA+QPVSVAIEA G A
Sbjct: 210 SSEDSYPYLAEQKSCG-SEANSAVVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYA 268
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y GVF+G CG+ LDHGV AVGYG ++G YW+V+NSWG WGE+GY++++R + D
Sbjct: 269 FQFYSQGVFSGHCGTELDHGVAAVGYGVDDDGKKYWIVKNSWGEGWGESGYIRMERGIKD 328
Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPK 367
GKCGIAMEASYP+K+S N K +
Sbjct: 329 KR-GKCGIAMEASYPIKSSPNPKKAE 353
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 364 bits (934), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 178/358 (49%), Positives = 242/358 (67%), Gaps = 9/358 (2%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA S + T F+S + D SI+ Y S ++ D+++ ++++W+++HG
Sbjct: 1 MAFFSSKTLVLTCSLCLFLSLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHG 55
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF++FKDNL+ ID+ N + Y +GLN+FADL+++E++ YLG + D
Sbjct: 56 KIYETIEEKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVDLS 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+R S + + + D LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+I
Sbjct: 116 QRRESSN--EEEFTYRDVD-LPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQI 172
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG L SLSEQEL+DCD N GCNGGLMDYAF FI QNGG+ E+DYPY+ E+ C+
Sbjct: 173 VTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEEDYPYIMEESTCEMK 232
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ +VV+I+GY DV +E SL KA+A+QP+SVAIEA R FQ Y GVF G CGS LD
Sbjct: 233 KEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLD 292
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
HGV AVGYGT +DY +V+NSWG+ WGE G+++++R+ + G CG+ ASYP K
Sbjct: 293 HGVSAVGYGTSKNLDYIIVKNSWGAKWGEKGFIRMKRD-IGKPEGICGLYKMASYPTK 349
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 169/243 (69%), Positives = 199/243 (81%), Gaps = 1/243 (0%)
Query: 121 RRLMK-SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
RR+ K S RYA + GD+LPESVDWR++GAV VKDQ SCGSCWAFS +AAVEGINK
Sbjct: 3 RRMKKFGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINK 62
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
IVTG+LISLSEQELVDCD N GCNGGLMDYAF+FII NGG+DSE DYPY + +CD
Sbjct: 63 IVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQ 122
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
+R+NAKVV+ID YEDV +DE++L+KAVA+QP++VA+E GGR FQ YE GV TG CG+AL
Sbjct: 123 NRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTAL 182
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
DHGV AVGYGTENG DYW+VRNSWG WGE GY++L+RNL + GKCGIA+E SYP+KN
Sbjct: 183 DHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKN 242
Query: 360 SQN 362
QN
Sbjct: 243 GQN 245
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 182/331 (54%), Positives = 233/331 (70%), Gaps = 18/331 (5%)
Query: 43 RTDDEVMTIYQTWLAKH------GKTSNGMGHNE----KRFQIFKDNLRFIDEHNSLN-- 90
RTD+EV +Y+ W ++H G T +G E +R ++F+ NLR+ID HN+
Sbjct: 44 RTDEEVRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRYIDAHNAEADA 103
Query: 91 --RTYKVGLNKFADLTNEEYRA-MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDW 147
+++GL +FADLT EEYRA + LG+R + V S+RY AG++LP++VDW
Sbjct: 104 GLHGFRLGLTRFADLTLEEYRARLLLGSR--GRNGTAVGVVGSRRYLPLAGEQLPDAVDW 161
Query: 148 REKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGG 207
RE+GAV VKDQG CG+CWAFS VAAVEGINKIVTG LISLSEQEL+DCD+ + GC+GG
Sbjct: 162 RERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKFQDQGCDGG 221
Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
LMD AF F+I+NGG+D+E DYP+ G + CD +N +VVSID +E V E +L+KAV
Sbjct: 222 LMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQKAV 281
Query: 268 ADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
A QPVS +IEA RAFQ Y SG+F G CG+ LDHGV VGYG+E G DYW+V+NSWG+ W
Sbjct: 282 AHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGSEGGKDYWIVKNSWGTQW 341
Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GE GYV++ RN + GKCGIAME YPVK
Sbjct: 342 GEAGYVRMARN-VRVRAGKCGIAMEPLYPVK 371
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 176/324 (54%), Positives = 228/324 (70%), Gaps = 5/324 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNE--KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
+++ + +Y+TW + H + G+G +RF +FK+N+R+I E N +R +++ LNKFA
Sbjct: 32 SEESLRGLYETWRSHHTVSRRGLGAEAEARRFNVFKENVRYIHEANKKDRPFRLALNKFA 91
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQG 160
D+T +E+R Y G+R R L + A E LP +VDWR+KGAV P+KDQG
Sbjct: 92 DMTTDEFRRTYAGSRVRHHRSLSGGRRQGGGSFMYADAENLPAAVDWRQKGAVTPIKDQG 151
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFST+ AVEGINKI TG L+SLSEQEL+DC+ N GCNGGLMD AFQFI QNG
Sbjct: 152 QCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQFIQQNG 211
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E YPY G +N CD S+ N+ VSIDGYEDV DE +L+KAVA+QPVSVAI+A G
Sbjct: 212 GITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAIDASG 271
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y GVFT + G+ LDHGV AVGYG T +G YW+V+NSWG DWGE GY+++QR +
Sbjct: 272 NDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGV 331
Query: 340 LDTNTGKCGIAMEASYPVKNSQNS 363
G CGIAMEASYP K++ ++
Sbjct: 332 KQAE-GLCGIAMEASYPTKSAPHA 354
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 163/234 (69%), Positives = 200/234 (85%)
Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
+ LPE+VDWR+KGAVN +K+QG+CGSCWAFST A VEGINKIVTGELISLSEQELVDCD+
Sbjct: 2 EALPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCDK 61
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
N GCNGGLMDYAFQFI++NGG+++EQDYPY G++ KC+ +N+KVV+IDGYEDV
Sbjct: 62 SYNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTN 121
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
DE +LK+AV+ QPVSVAI+AGGR FQHY+SG+FTGECG+ +DH VVAVGYG+ENGVDYW+
Sbjct: 122 DETALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGSENGVDYWI 181
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSSA 372
VRNSWG WGE+GY++++RNL + +GKCGIA+EASYPVK S N + SS
Sbjct: 182 VRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPVKYSPNPIRGNTISSV 235
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 174/334 (52%), Positives = 232/334 (69%), Gaps = 13/334 (3%)
Query: 44 TDDEVMTIYQTWLAKH--------GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYK 94
+++ + +Y+ W +++ G N G +RF +F +N R+I E N R ++
Sbjct: 34 SEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFR 93
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRL---MKSKVASQRYACKAGDELPESVDWREKG 151
+ LNKFAD+T +E+R Y G+R+ R L + S RY D LP +VDWRE+G
Sbjct: 94 LALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRERG 153
Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
AV +KDQG CGSCWAFSTVAAVEG+NKI TG L++LSEQELVDCD N GC+GGLMDY
Sbjct: 154 AVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDY 213
Query: 212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
AFQFI +NGG+ +E +YPY + +C+ ++ ++ V+IDGYEDV DE +L+KAVA+QP
Sbjct: 214 AFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQP 273
Query: 272 VSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGEN 330
V+VA+EA G+ FQ Y GVFTGECG+ LDHGV AVGYG T +G YW+V+NSWG DWGE
Sbjct: 274 VAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGER 333
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
GY+++QR + + G CGIAMEASYPVK+ +A
Sbjct: 334 GYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNA 367
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 250/350 (71%), Gaps = 17/350 (4%)
Query: 18 FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE--KRFQI 75
++ S+SA+D + D + + S R+ +Y W +H ++S + E +RF+I
Sbjct: 18 WVLSASASDFTPGFTDEDLESEKSLRS------LYDNWALQH-RSSRSLDSEEHAERFEI 70
Query: 76 FKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
FK+N+++ID N + YK+GLNKFADL+NEE++A+Y+GT+ D + +V S +
Sbjct: 71 FKENVKYIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMDLRG---DREVQSGSFMY 127
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ + LP S+DWR+KGAV VK+QG CGSCWAFSTVA+VEGIN I TG L+SLSEQ+LVD
Sbjct: 128 QNSEPLPASIDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLSEQQLVD 187
Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV--VSIDGYE 253
C + N+GCNGGLMD AFQ+II NGG+ +E +YPY +C ++ N++ V IDG+E
Sbjct: 188 CSTE-NSGCNGGLMDTAFQYIINNGGIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFE 246
Query: 254 DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-N 312
DV +E +LK+AVA QPVSVAIEA G+ FQ Y +GVFTG+CG+ALDHGVVAVGYGT
Sbjct: 247 DVPANNEQALKEAVAHQPVSVAIEASGQDFQFYSTGVFTGKCGTALDHGVVAVGYGTSPE 306
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
G++YW+VRNSWG WGE GY+++Q+ ++ GKCGIAM+ASYP K +Q+
Sbjct: 307 GINYWIVRNSWGPKWGEEGYIRMQQG-IEAAEGKCGIAMQASYPTKKTQD 355
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 179/359 (49%), Positives = 240/359 (66%), Gaps = 12/359 (3%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
++ S+ +AIS L + A D SI+ Y H ++ D+++ ++++W+++H
Sbjct: 8 LSKFSLLVAISASALL---CCAFARDFSIVGYTPEHLTNT-----DKLLELFESWMSEHS 59
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF++F++NL ID+ N+ +Y +GLN+FADLT+EE++ YLG AK
Sbjct: 60 KAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGL---AK 116
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+ + + S + + +LP+SVDWR+KGAV PVKDQG CGSCWAFSTVAAVEGIN+I
Sbjct: 117 PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQI 176
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG L SLSEQEL+DCD N+GCNGGLMDYAFQ+II GG+ E DYPYL E C
Sbjct: 177 TTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQ 236
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ + + V+I GYEDV D+ SL KA+A QPVSVAIEA GR FQ Y+ GVF G+CG+ LD
Sbjct: 237 KEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGKCGTDLD 296
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
HGV AVGYG+ G DY +V+NSWG WGE G+++++RN G CGI ASYP K
Sbjct: 297 HGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRN-TGKPEGLCGINKMASYPTKT 354
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 361 bits (927), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 179/359 (49%), Positives = 239/359 (66%), Gaps = 12/359 (3%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
+ S+ +AIS L S+ A D SI+ Y S+ ++++ ++++W+++H
Sbjct: 8 LTKFSLLVAISASALL---CSALARDFSIVGYTPEQLTST-----EKLLELFESWMSEHS 59
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF++F++NL ID+ N+ +Y +GLN+FADLT+EE++ YLG AK
Sbjct: 60 KVYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGL---AK 116
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+ + + S + + +LP+SVDWR+KGAV PVKDQG CGSCWAFSTVAAVEGIN+I
Sbjct: 117 PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQI 176
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG L SLSEQEL+DCD N+GCNGGLMDYAFQ+II GG+ E DYPYL E C
Sbjct: 177 TTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQ 236
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ + + V+I GYEDV D+ SL KA+A QPVSVAIEA GR FQ Y+ GVF G+CG+ LD
Sbjct: 237 KEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGQCGTDLD 296
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
HGV AVGYG+ G DY +V+NSWG WGE G+++++RN G CGI ASYP K
Sbjct: 297 HGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRN-TGKPEGLCGINKMASYPTKT 354
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 360 bits (925), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 181/354 (51%), Positives = 243/354 (68%), Gaps = 6/354 (1%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LV + S ++A DMS++SYD+N+ S + D E I+++W+ KHGK + E+R
Sbjct: 12 LVAMVIASCATAIDMSVVSYDDNNRLHSVF--DAEASLIFESWMVKHGKVYGSVAEKERR 69
Query: 73 FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
IF+DNLRFI+ N+ N +Y++GL FADL+ EY+ + G R + +S R
Sbjct: 70 LTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHV-FMTSSDR 128
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
Y A D LP+SVDWR +GAV VKDQG C SCWAFSTV AVEG+NKIVTGEL++LSEQ+
Sbjct: 129 YKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 188
Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKVVSIDG 251
L++C+++ N GC GG ++ A++FI++NGG+ ++ DYPY CD + N K V IDG
Sbjct: 189 LINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDG 247
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
YE++ DE +L KAVA QPV+ I++ R FQ YESGVF G CG+ L+HGVV VGYGTE
Sbjct: 248 YENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYGTE 307
Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
NG DYWLV+NS G WGE GY+K+ RN+ + G CGIAM ASYP+KNS ++ K
Sbjct: 308 NGRDYWLVKNSRGITWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 360
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 360 bits (925), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 173/331 (52%), Positives = 231/331 (69%), Gaps = 12/331 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHN--EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
+++ + +Y+ W + + + G+G + E+RF +FK+N R++ E N +R +++ LNKFA
Sbjct: 33 SEESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYVHEGNKRDRPFRLALNKFA 92
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
D+T +E+R Y G+R L + + D LP +VDWR+KGAV +KDQG
Sbjct: 93 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQ 152
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFST+ AVEGINKI TG+L+SLSEQEL+DCD N GC GGLMDYAFQFI +N G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN-G 211
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E +YPY G + CD ++ NA+ V+IDGYEDV DE +L+KAVA QPVSVAI+A G+
Sbjct: 212 ITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQ 271
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y GVFTGEC + LDHGV AVGYG T +G YW+V+NSWG DWGE GY+++QR +
Sbjct: 272 DFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 331
Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
T G CGIAM+ASYP K++ PH+S
Sbjct: 332 QTE-GLCGIAMQASYPTKSA-------PHAS 354
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 360 bits (925), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 181/354 (51%), Positives = 243/354 (68%), Gaps = 6/354 (1%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LV + S ++A DMS++SYD+N+ S + D E I+++W+ KHGK + E+R
Sbjct: 5 LVAMVIASCATAIDMSVVSYDDNNRLHSVF--DAEASLIFESWMVKHGKVYGSVAEKERR 62
Query: 73 FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
IF+DNLRFI+ N+ N +Y++GL FADL+ EY+ + G R + +S R
Sbjct: 63 LTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHV-FMTSSDR 121
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
Y A D LP+SVDWR +GAV VKDQG C SCWAFSTV AVEG+NKIVTGEL++LSEQ+
Sbjct: 122 YKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 181
Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKVVSIDG 251
L++C+++ N GC GG ++ A++FI++NGG+ ++ DYPY CD + N K V IDG
Sbjct: 182 LINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDG 240
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
YE++ DE +L KAVA QPV+ I++ R FQ YESGVF G CG+ L+HGVV VGYGTE
Sbjct: 241 YENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYGTE 300
Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
NG DYWLV+NS G WGE GY+K+ RN+ + G CGIAM ASYP+KNS ++ K
Sbjct: 301 NGRDYWLVKNSRGITWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 353
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 360 bits (924), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 187/371 (50%), Positives = 241/371 (64%), Gaps = 19/371 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MAT SM LA+ +V L F+ + N D +S ++ + +Y+ W + H
Sbjct: 1 MATKSMLLAL--VVALAFVGVARTIPF------NEKDLAS----EESLWGLYERWRSHH- 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
S + KRF +FK+N +FI E N + YK+GLNKFAD+TN+E+R+ Y G++
Sbjct: 48 TVSRDLSEKNKRFNVFKENAKFIHEFNKKDAPYKLGLNKFADMTNQEFRSTYAGSKIHHH 107
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R + A+ + + +P SVDWR +GAV PVKDQG CGSCWAFST+A+VEGINKI
Sbjct: 108 RTQRGTPRATGSFMYENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKI 167
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
T +L+ LS Q+LVDCD N GCNGGLMDYAF+FI NGG+ SE YPY + C S
Sbjct: 168 KTNQLVPLSGQQLVDCDTDQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSC-AS 226
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+A VV+IDGYEDV +E +L KAVA+Q VSVAIEA G AFQ Y GVFTG CG+ LD
Sbjct: 227 ESSAPVVTIDGYEDVPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELD 286
Query: 301 HGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK- 358
HGV VGYG T +G YW+VRNSWG++WGE GY+++QR + G CGIAME SYP+K
Sbjct: 287 HGVAVVGYGATRDGTKYWIVRNSWGAEWGEKGYIRMQRG-IRARHGLCGIAMEPSYPLKT 345
Query: 359 --NSQNSAKPK 367
N +N+ PK
Sbjct: 346 SPNPKNNISPK 356
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 168/254 (66%), Positives = 203/254 (79%), Gaps = 4/254 (1%)
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
R Y G R +R +AS RY +AGD LP+SVDWREKGAV P+KDQG CGSCWAF
Sbjct: 12 RTTYFGVRGAGRR---TPGLASDRYRYRAGDALPDSVDWREKGAVVPIKDQGGCGSCWAF 68
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
ST+A+VEGINKIVTG+LISLSEQELVDCD+ N GCNGGLMDYAFQFII NGG+D+E+DY
Sbjct: 69 STIASVEGINKIVTGDLISLSEQELVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTEKDY 128
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY + +CD R+NAKVVSI+ YEDV DE +LKKA A QP++VAI+ GGR+FQ Y S
Sbjct: 129 PYTEQDGRCDSYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQLYNS 188
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
G+FTG+CG++LDHGV VGYG+E+G DYW+VRNSWG WGE GY+++ RN +D+ +G CG
Sbjct: 189 GIFTGKCGTSLDHGVTVVGYGSESGKDYWIVRNSWGESWGEKGYIRMARN-IDSPSGICG 247
Query: 349 IAMEASYPVKNSQN 362
IAMEASYP+K QN
Sbjct: 248 IAMEASYPIKKGQN 261
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 360 bits (924), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 173/334 (51%), Positives = 231/334 (69%), Gaps = 13/334 (3%)
Query: 44 TDDEVMTIYQTWLAKH--------GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYK 94
+++ + +Y+ W +++ G N G +RF +F +N R+I E N R ++
Sbjct: 34 SEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFR 93
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA---SQRYACKAGDELPESVDWREKG 151
+ LNKFAD+T +E+R Y G+R+ R L + S RY D LP +VDWRE+G
Sbjct: 94 LALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWRERG 153
Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
AV +KDQG CGSCWAFS VAAVEG+NKI TG L++LSEQELVDCD N GC+GGLMDY
Sbjct: 154 AVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDY 213
Query: 212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
AFQFI +NGG+ +E +YPY + +C+ ++ ++ V+IDGYEDV DE +L+KAVA+QP
Sbjct: 214 AFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQP 273
Query: 272 VSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGEN 330
V+VA+EA G+ FQ Y GVFTGECG+ LDHGV AVGYG T +G YW+V+NSWG DWGE
Sbjct: 274 VAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGER 333
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
GY+++QR + + G CGIAMEASYPVK+ +A
Sbjct: 334 GYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNA 367
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 360 bits (923), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 181/359 (50%), Positives = 243/359 (67%), Gaps = 9/359 (2%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRT-----DDEVMTIYQTWLAKHGKTSNGMG 67
LV + S ++A DMS++S +NNH ++S D E I+ +W+ KHGK +
Sbjct: 12 LVAMVITSCATAMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLIFDSWMVKHGKVYGSVA 71
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
E+R IF+DNLRFI N+ N +Y++GL +FADL+ EY + G R +
Sbjct: 72 EKERRLTIFEDNLRFISNRNAENLSYRLGLTQFADLSLHEYGEVCHGADPRPPRNHV-FM 130
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+S RY AGD LP+SVDWR +GAV VKDQG C SCWAFSTV AVEG+NKIVTGEL++
Sbjct: 131 TSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVT 190
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKV 246
LSEQ+L++C+++ N GC GG ++ A++FI++NGG+ ++ DYPY CD + N K
Sbjct: 191 LSEQDLINCNKE-NNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKN 249
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V IDG+E++ DE +L KAVA QPV+ I++ R FQ YESGVF G CG+ L+HGVV V
Sbjct: 250 VMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVV 309
Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
GYGTENG DYWLV+NS G+ WGE GY+K+ RN+ + G CGIAM ASYP+KNS ++ K
Sbjct: 310 GYGTENGRDYWLVKNSRGNTWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 367
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 177/320 (55%), Positives = 230/320 (71%), Gaps = 8/320 (2%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
R+D+EV IYQ W KH N + R ++FK+NLRF+DEHN+ Y++G+N
Sbjct: 43 RSDEEVRIIYQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMN 102
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
+FADLTNEEYRA +L S R S S +Y + GD LP+S+DWREKGAV VK+
Sbjct: 103 RFADLTNEEYRARFLRDLSRLGRS--TSGEISNQYRLREGDVLPDSIDWREKGAVVAVKN 160
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG CGSCWAF+ +AAVEGIN+IVTG+LISLSEQ+LVDC + N GC GG AFQ+II
Sbjct: 161 QGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCSTR-NYGCEGGWPYRAFQYIIN 219
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG++SE+ YPY G C+ ++ NA VVSID Y +V DE SL+KA A+QP+SV I+A
Sbjct: 220 NGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRNVPSNDEKSLQKAAANQPISVGIDA 279
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
GR FQ Y SG+FTG C ++L+HGV VGYGTENG DYW+V+NSWG +WG +GY+ ++RN
Sbjct: 280 SGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGTENGNDYWIVKNSWGENWGNSGYILMERN 339
Query: 339 LLDTNTGKCGIAMEASYPVK 358
+ ++ +GKCGIA+ SYP+K
Sbjct: 340 IAES-SGKCGIAISPSYPIK 358
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 173/326 (53%), Positives = 229/326 (70%), Gaps = 4/326 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+++ + +Y W + H + EKRF +F+ N+ + N NR+YK+ LNKFADL
Sbjct: 30 SEEGLSKLYDRWRSHH-SVPRSLHEREKRFNVFRHNVMHVHNSNKKNRSYKLKLNKFADL 88
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQR--YACKAGDELPESVDWREKGAVNPVKDQGS 161
T E++ Y G++ R L K S++ Y + +LP SVDWR+KGAV +K+QG
Sbjct: 89 TIHEFKNAYTGSKIKHHRMLQGPKRGSKQFMYDHENVSKLPSSVDWRKKGAVTEIKNQGK 148
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFSTVAAVEGINKI T +L+SLSEQELVDCD N GCNGGLM+ AF+FI +NGG
Sbjct: 149 CGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTNQNEGCNGGLMEIAFEFIKKNGG 208
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E YPY G + KCD S+ N +V+IDG+E+V DE +L KAVA+QPVSVAI+AG
Sbjct: 209 ITTEDSYPYEGIDGKCDASKDNGVLVTIDGHENVPENDENALLKAVANQPVSVAIDAGSS 268
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y GVFTG+CG+ L+HGV VGYG++ G YW+VRNSWG++WGE GY+K++R +D
Sbjct: 269 DFQFYSEGVFTGDCGTELNHGVATVGYGSQGGKKYWIVRNSWGTEWGEGGYIKIERG-ID 327
Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPK 367
G+CGIAMEASYP+K S ++ PK
Sbjct: 328 EPEGRCGIAMEASYPIKLSSSNPTPK 353
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 179/342 (52%), Positives = 229/342 (66%), Gaps = 10/342 (2%)
Query: 17 FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIF 76
FF SS A D SI+ Y + D ++ ++++W++KH K + RF+IF
Sbjct: 3 FFASSCLARDFSIVGY-----APEDLTSRDRIIDLFESWISKHQKIYESIEEKWHRFEIF 57
Query: 77 KDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
KDNL IDE N Y +GLN+FADL++EE++ YLG D R + S+ + K
Sbjct: 58 KDNLFHIDETNKKVVNYWLGLNEFADLSHEEFKNKYLGLNVDLSNR----RECSEEFTYK 113
Query: 137 AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
+P+SVDWR+KGAV VK+QGSCGSCWAFSTVAAVEGIN+IVTG L SLSEQELVDC
Sbjct: 114 DVSSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDC 173
Query: 197 DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
D N GCNGGLMDYAF +II NGG+ E+DYPY+ E C+ + ++VV+I GY DV
Sbjct: 174 DTTYNNGCNGGLMDYAFAYIISNGGLHKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVP 233
Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDY 316
E SL KA+A+QP+SVAI+A GR FQ Y GVF G CG+ LDHGV AVGYG+ G+D+
Sbjct: 234 QNSEESLLKALANQPLSVAIDASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGSAKGLDF 293
Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+V+NSWGS WGE G+++++RN G CGI ASYP K
Sbjct: 294 IVVKNSWGSKWGEKGFIRMKRN-TGKPAGLCGINKMASYPTK 334
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 177/368 (48%), Positives = 240/368 (65%), Gaps = 16/368 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M T + L + ++ + +S S + HD S +D+ + +Y+ W + H
Sbjct: 1 MTTKKLLLIVLSIALVLVVSESF----------DFHDKDVS--SDESLWDLYERWRSHHT 48
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
+ N + +KRF +FK N+ + N +++ YK+ LNKFAD+TN E++ Y G++ +
Sbjct: 49 VSRN-LNEKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHH 107
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R + S + + + P SVDWR+KGAV VKDQG CGSCWAFSTV AVEGIN+I
Sbjct: 108 RMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
T L+ LSEQEL+DCD + N GCNGGLM+YAF++I Q GG+ +E YPY + CD +
Sbjct: 168 KTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDAT 227
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ N VSIDG+E V DE +L KAVA+QPVSVAI+AGG FQ Y GVFTG+CG L+
Sbjct: 228 KENVPAVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELN 287
Query: 301 HGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
HGV VGYGT +G +YW+VRNSWG++WGE GY++++RN + G CGIAMEASYPVKN
Sbjct: 288 HGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGYIRMKRN-VSNKEGLCGIAMEASYPVKN 346
Query: 360 -SQNSAKP 366
S+N A P
Sbjct: 347 SSKNPAGP 354
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 357 bits (917), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 170/330 (51%), Positives = 231/330 (70%), Gaps = 5/330 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHN--EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
+++ + +Y+ W + + + G+G + E+RF +FK N R++ E N + +++ LNKFA
Sbjct: 33 SEESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKRDMPFRLALNKFA 92
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
D+T +E+R Y G+R L + + D LP +VDWR+KGAV +KDQG
Sbjct: 93 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQ 152
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFST+ AVEGINKI TG+L+SLSEQEL+DCD N GC+GGLMDYAFQFI +N G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN-G 211
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E +YPY G + CD ++ NA+ V+IDGYEDV DE +L+KAVA QPVSVAI+A G+
Sbjct: 212 ITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQ 271
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y GVFTGEC + LDHGV AVGYG T +G YW+V+NSWG DWGE GY+++QR +
Sbjct: 272 DFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 331
Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
T G CGIAM+ASYP K++ +++ + S
Sbjct: 332 QTE-GLCGIAMQASYPTKSAPHASTVREES 360
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 357 bits (917), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 170/326 (52%), Positives = 239/326 (73%), Gaps = 9/326 (2%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNE--KRFQIFKDNLRFIDEHNSLNRTYKVGLNKF 100
+D+ + +Y W +H +++ + +E +RF+IFK+N++ ID N + YK+GLNKF
Sbjct: 36 ESDESLRGLYDKWALQH-RSTRSLDSDEHARRFEIFKENVKHIDSVNKKDGPYKLGLNKF 94
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSK-VASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
ADL+NEE++AM++ T+ + + L + V S + + LP S+DWR+KGAV PVK+Q
Sbjct: 95 ADLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQ 154
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
G CGSCWAFST+A+VEGIN I TG+L+SLSEQ+LVDC ++ NAGCNGGLMD AFQ+II N
Sbjct: 155 GQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAFQYIIDN 213
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVS--IDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
GG+ +E +YPY +C ++ +K ++ IDG+EDV +E +LKKAVA QPVS+AIE
Sbjct: 214 GGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIE 273
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQ 336
A G FQ Y +GVFTG+CG+ LDHGVV VGYG + G++YW+VRNSWG +WGE GY+++Q
Sbjct: 274 ASGHDFQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIRMQ 333
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQN 362
R ++ GKCGI+M+ASYP K +Q+
Sbjct: 334 RG-IEATEGKCGISMQASYPTKKTQD 358
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 170/330 (51%), Positives = 231/330 (70%), Gaps = 5/330 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHN--EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
+++ + +Y+ W + + + G+G + E+RF +FK N R++ E N + +++ LNKFA
Sbjct: 33 SEESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKQNARYVHEGNKRDMPFRLALNKFA 92
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
D+T +E+R Y G+R L + + D LP +VDWR+KGAV +KDQG
Sbjct: 93 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQ 152
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFST+ AVEGINKI TG+L+SLSEQEL+DCD N GC+GGLMDYAFQFI +N G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN-G 211
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E +YPY G + CD ++ NA+ V+IDGYEDV DE +L+KAVA QPVSVAI+A G+
Sbjct: 212 ITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQ 271
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y GVFTGEC + LDHGV AVGYG T +G YW+V+NSWG DWGE GY+++QR +
Sbjct: 272 DFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 331
Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
T G CGIAM+ASYP K++ +++ + S
Sbjct: 332 QTE-GLCGIAMQASYPTKSAPHASTVREES 360
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 357 bits (915), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 177/333 (53%), Positives = 225/333 (67%), Gaps = 14/333 (4%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+++ + +Y+ W +H + +G +RF +FK N+R I E N + YK+ LN+F D+
Sbjct: 148 SEEALWALYERWRGRHA-LARDLGDKARRFNVFKANVRLIHEFNRRDEPYKLRLNRFGDM 206
Query: 104 TNEEYRAMYLGTRSDAKRRLMK-----SKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
T +E+R Y G+R A R+ + S ++ + ++P SVDWR+KGAV VKD
Sbjct: 207 TADEFRRHYAGSRV-AHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 265
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG CGSCWAFST+AAVEGIN I T L SLSEQ+LVDCD K NAGCNGGLMDYAFQ+I +
Sbjct: 266 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 325
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
+GG+ +E YPY + C S A VV+IDGYEDV DE +LKKAVA QPVSVAIEA
Sbjct: 326 HGGVAAEDAYPYRARQASCKKS--PAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEA 383
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
G FQ Y GVF+G CG+ LDHGV AVGYG T +G YWLV+NSWG +WGE GY+++ R
Sbjct: 384 SGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMAR 443
Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
++ G CGIAMEASYPVK S N PK H+
Sbjct: 444 DVA-AKEGHCGIAMEASYPVKTSPN---PKVHA 472
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 357 bits (915), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 177/353 (50%), Positives = 234/353 (66%), Gaps = 15/353 (4%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
FLA+S L F++ S A SI+ Y ++D+++ ++++W+++ G+
Sbjct: 10 FFLAVS----LSFLAYSGFARDSIVGY-----APEDLTSNDKLIDLFESWISRFGRVYES 60
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
+RF+IFKDNL ID+ N R Y +GLN+FADL++EE++ YLG + D +R
Sbjct: 61 AEEKLERFEIFKDNLFHIDDTNKKVRNYWLGLNEFADLSHEEFKNKYLGLKPDLSKRAQC 120
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+ + + +P+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVTG L
Sbjct: 121 PEEFTYKDVA-----IPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 175
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQEL+DCD N GCNGGLMDYAF +I+ NGG+ E+DYPY+ E CD + +
Sbjct: 176 TSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGLHKEEDYPYIMEEGTCDMRKEESD 235
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
V+I GY DV E SL KA+A+QP+S+AIEA GR FQ Y GVF G CG+ LDHGV A
Sbjct: 236 AVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYSGGVFDGHCGTELDHGVAA 295
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
VGYGT G+DY +V+NSWG WGE GY++++R G CGI ASYP K
Sbjct: 296 VGYGTSKGLDYIIVKNSWGPKWGEKGYIRMKRK-TSKPEGICGIYKMASYPTK 347
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 356 bits (914), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 169/327 (51%), Positives = 227/327 (69%), Gaps = 3/327 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ + +Y+ W + H S +G KRF +FK N+ + N +++ YK+ L
Sbjct: 26 HEKDLESEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLKL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + + S+ S + + +P SVDWR+KGAV VK
Sbjct: 85 NKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVK 144
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD++ N GCNGGLM+ AF+FI
Sbjct: 145 DQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIK 204
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ +E +YPY E CD S+ N VSIDG+E+V DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAID 264
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG+C + L+HGV VGYGT +G +YW+VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQ 324
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
RN + G CGIAM ASYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMMASYPIKNSSDN 350
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 177/368 (48%), Positives = 239/368 (64%), Gaps = 16/368 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M T + L + ++ + +S S + HD S +D+ + +Y+ W + H
Sbjct: 1 MTTKKLLLIVLSIALVLVVSESF----------DFHDKDVS--SDESLWDLYERWRSHHT 48
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
+ N + +KRF +FK N+ + N +++ YK+ LNKFAD+TN E++ Y GT+ +
Sbjct: 49 VSRN-LNEKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGTKVNHH 107
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R + S + + + P SVDWR+KGAV VKDQG CGSCWAFSTV AVEGIN+I
Sbjct: 108 RMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
T L+ LSEQEL+DCD + N GCNGGLM+YAF++I Q GG+ +E YPY + CD +
Sbjct: 168 KTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDAT 227
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ N VSIDG+E V DE +L KAVA+QPVSVAI+AGG FQ Y GVFTG+CG L+
Sbjct: 228 KENVPTVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELN 287
Query: 301 HGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
HGV VGYGT +G +YW+VRNSWG++WGE G ++++RN + G CGIAMEASYPVKN
Sbjct: 288 HGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGCIRMKRN-VSNKEGLCGIAMEASYPVKN 346
Query: 360 -SQNSAKP 366
S+N A P
Sbjct: 347 SSKNPAGP 354
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 169/327 (51%), Positives = 227/327 (69%), Gaps = 3/327 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ + +Y+ W + H S +G KRF +FK N+ + N +++ YK+ L
Sbjct: 26 HEKDLESEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLKL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + + S+ S + + +P SVDWR+KGAV VK
Sbjct: 85 NKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVK 144
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD++ N GCNGGLM+ AF+FI
Sbjct: 145 DQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIK 204
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ +E +YPY E CD S+ N VSIDG+E+V DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAID 264
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG+C + L+HGV VGYGT +G +YW+VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQ 324
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
RN + G CGIAM ASYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMMASYPIKNSSDN 350
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 177/320 (55%), Positives = 228/320 (71%), Gaps = 8/320 (2%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
R+D+EV IYQ W AKH N + R ++FK+NLRF+DEHN+ Y++G+N
Sbjct: 34 RSDEEVRIIYQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMN 93
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
+FADLTNEEYRA +L S R S S +Y + GD LP+S+DWREKGAV VK
Sbjct: 94 RFADLTNEEYRARFLRDLSRLGRS--TSGEISNQYRLREGDVLPDSIDWREKGAVVAVKS 151
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG CGSCWAF+ +A VEGIN+IVTG+LISLSEQ+LVDC + N GC GG AFQ+II
Sbjct: 152 QGRCGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTR-NHGCEGGWPYRAFQYIIN 210
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG++SE+ YPY G C+ ++ NA VVSID Y +V DE SL+KAVA+QP+SV I A
Sbjct: 211 NGGVNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINA 270
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
GR FQ Y SG+FTG C ++L+HGV VGYGT NG DYW+V+NSWG WG++GY+ ++RN
Sbjct: 271 SGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGTVNGNDYWIVKNSWGESWGDSGYILMERN 330
Query: 339 LLDTNTGKCGIAMEASYPVK 358
+ ++ +GKCGIA+ SYP+K
Sbjct: 331 IAES-SGKCGIAISPSYPIK 349
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 169/320 (52%), Positives = 225/320 (70%), Gaps = 5/320 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHN--EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
+++ + +Y+ W + + + G+G + E+RF +FK+N R+I E N +R +++ LNKFA
Sbjct: 32 SEENLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKKDRPFRLALNKFA 91
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
D+T +E+R Y G+R L + + D LP +VDWR+KGAV +KDQG
Sbjct: 92 DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDADNLPPAVDWRQKGAVTAIKDQGQ 151
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFST+ AVEGINKI TG+L+SLSEQEL+DCD N GC+GGLMDYAFQFI +N G
Sbjct: 152 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIHKN-G 210
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E +YPY G + CD ++ A V+IDGYEDV DE +L+KAVA QPVSVAI+A G
Sbjct: 211 ITTESNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGN 270
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y GVFTGEC + LDHGV AVGYG T +G YW+V+NSWG DWGE GY+++QR +
Sbjct: 271 DFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 330
Query: 341 DTNTGKCGIAMEASYPVKNS 360
G+CGIAM+ASYP K++
Sbjct: 331 QAE-GQCGIAMQASYPTKSA 349
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 171/335 (51%), Positives = 232/335 (69%), Gaps = 12/335 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMG---------HNE-KRFQIFKDNLRFIDEHNSLNRTY 93
+++ + +Y+ W +++ + + G H+ +RF +FK+N+++I E N +R +
Sbjct: 30 SEESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIHEANKKDRPF 89
Query: 94 KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV 153
++ LNKFAD+T +E R Y G+R R L + A + + LP +VDWREKGAV
Sbjct: 90 RLALNKFADMTTDELRHSYAGSRVRHHRALSGGRRAQGNFTYSDAENLPPAVDWREKGAV 149
Query: 154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
+KDQG CGSCWAFST+AAVE INKI TG+L+SLSEQEL+DCD + GC+GGLMDYAF
Sbjct: 150 TGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVNDQGCDGGLMDYAF 209
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
QFI +NGG+ SE +YPY G +N CD ++ N V+IDGYEDV DE +L+KAVA QPVS
Sbjct: 210 QFIQKNGGVTSEANYPYQGQQNTCDQAKENTHDVAIDGYEDVPANDESALQKAVAYQPVS 269
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGY 332
VAIEA G+ FQ Y GVFTG+C + LDHGV AVGYGT +G YW+V+NSWG DWGE GY
Sbjct: 270 VAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSWGLDWGEKGY 329
Query: 333 VKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPK 367
+++QR + G CGIAM+ASYP+K + ++ +
Sbjct: 330 IRMQRGVSQAE-GLCGIAMQASYPIKAAPHATTAR 363
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 355 bits (910), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 168/324 (51%), Positives = 219/324 (67%), Gaps = 3/324 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ + +Y+ W + H S + KRF +FK+N+ + + N + + YK+ L
Sbjct: 26 HEKDLESEESLWDLYERWRSHH-TVSTSLDEKHKRFNVFKENVMHVHKTNKMGKPYKLKL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R++Y G++ R + + + +++P SVDWR+KGAV VK
Sbjct: 85 NKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYGKVEKVPTSVDWRKKGAVTAVK 144
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFST+ AVEGIN I T EL+SLSEQELVDCD N GCNGGLM+YAF+FI
Sbjct: 145 DQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDCDTTENQGCNGGLMEYAFEFIK 204
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+ G+ +E YPY + CD ++ N VSIDGYE V DE +L KA A+QPVSVAI+
Sbjct: 205 KKRGITTESTYPYKAEDGHCDAAKENNPAVSIDGYEKVPENDEDALLKAAANQPVSVAID 264
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVF GECG+ LDHGV VGYGT +G YW+VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFIGECGTELDHGVAVVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQ 324
Query: 337 RNLLDTNTGKCGIAMEASYPVKNS 360
R + D G CGIAMEASYP+KNS
Sbjct: 325 RGISDKE-GLCGIAMEASYPIKNS 347
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 355 bits (910), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 165/224 (73%), Positives = 193/224 (86%), Gaps = 1/224 (0%)
Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
D +PESVDWR++GAV VKDQGSCGSCWAFST+ AVEGINKIVTG+LISLSEQELVDCD
Sbjct: 1 DAIPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDT 60
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
N GCNGGLMDYAF+FII+NGG+D+E+DYPY A+ +CD +R+NAKVV+ID YEDV
Sbjct: 61 SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPEN 120
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
+E +LKKA+A+QP+SVAIEAGGRAFQ Y SGVF G CG+ LDHGVVAVGYGTENG DYW+
Sbjct: 121 NEAALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGTENGKDYWI 180
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
VRNSWG WGE+GY+K+ RN+ + TGKCGIAMEASYP+K QN
Sbjct: 181 VRNSWGGSWGESGYIKMARNIAEA-TGKCGIAMEASYPIKKGQN 223
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 175/355 (49%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 7 FLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGM 66
F+ ++ L L + ++ + D H ++D + +Y+ W + H + +
Sbjct: 4 FIVLA-LCMLMVLETTKSLDF----------HEKDVESEDSLWELYERWKSHH-TIARSL 51
Query: 67 GHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
KRF +FK N++ I E N +YK+ LNKF D+T+EE+R Y G+ R
Sbjct: 52 EEKAKRFNVFKHNVKHIHETNKKENSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGE 111
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+ ++ + D LP SVDWR+ GAV PVK+QG CGSCWAFSTV AVEGIN+I T +L
Sbjct: 112 RQTTKSFMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLT 171
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SLSEQELVDCD N GCNGGLMD AF+FI + GG+ SE YPY ++ CD ++ NA V
Sbjct: 172 SLSEQELVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPV 231
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
VSIDG+EDV E+ L KAVA QPVSVAI+AGG FQ Y GVFTG CG+ L+HGV V
Sbjct: 232 VSIDGHEDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVV 291
Query: 307 GYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
GYGT +G YW+V+NSWG +WGE GY+++QR + G CGIAMEASYP+KNS
Sbjct: 292 GYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE-GLCGIAMEASYPLKNS 345
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 171/324 (52%), Positives = 221/324 (68%), Gaps = 3/324 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H+ +++ + +Y+ W + H + + KRF +FK N++ I E N +++YK+ L
Sbjct: 24 HNKDVESENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKL 82
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKF D+T+EE+R Y G+ R K A++ + + LP SVDWR+ GAV PVK
Sbjct: 83 NKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVK 142
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
+QG CGSCWAFSTV AVEGIN+I T +L SLSEQELVDCD N GCNGGLMD AF+FI
Sbjct: 143 NQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIK 202
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+ GG+ SE YPY ++ CD ++ NA VVSIDG+EDV E L KAVA+QPVSVAI+
Sbjct: 203 EKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAID 262
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG CG+ L+HGV VGYGT +G YW+V+NSWG +WGE GY+++Q
Sbjct: 263 AGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQ 322
Query: 337 RNLLDTNTGKCGIAMEASYPVKNS 360
R + G CGIAMEASYP+KNS
Sbjct: 323 RGIRHKE-GLCGIAMEASYPLKNS 345
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 176/368 (47%), Positives = 239/368 (64%), Gaps = 16/368 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M T + L + ++ + +S S + HD S +D+ + +Y+ W + H
Sbjct: 1 MTTKKLLLIVLSIALVLVVSESF----------DFHDKDVS--SDESLWDLYERWRSHHT 48
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
+ N + +KRF +FK N+ + N +++ YK+ LNKFAD+TN E++ Y G++ +
Sbjct: 49 VSRN-LNEKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHH 107
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R + S + + + P SVDWR+KGAV VKDQG CGSCWAFSTV AVEGIN+I
Sbjct: 108 RMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
T L+ LSEQEL+DCD + N GCNGGLM+YAF++I Q GG+ +E YPY + CD +
Sbjct: 168 KTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDAT 227
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ N VSIDG+E V DE +L KAVA+QPVSVAI+AGG FQ Y GVFTG+CG L+
Sbjct: 228 KENVPTVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELN 287
Query: 301 HGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
HGV VGYGT +G +YW+VRNSWG++WGE G ++++RN + G CGIAMEASYPVKN
Sbjct: 288 HGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGCIRMKRN-VSNKEGLCGIAMEASYPVKN 346
Query: 360 -SQNSAKP 366
S+N A P
Sbjct: 347 SSKNPAGP 354
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 178/332 (53%), Positives = 224/332 (67%), Gaps = 13/332 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+++ + +Y+ W +H + +G +RF +FK N+R I E N + YK+ LN+F D+
Sbjct: 41 SEEALWALYERWRGRHALARD-LGDKARRFNVFKANVRLIHEFNRRDEPYKLRLNRFGDM 99
Query: 104 TNEEYRAMYLGTRSDAKRRLMKS----KVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
T +E+R Y G+R A R+ + AS + ++P SVDWR+KGAV VKDQ
Sbjct: 100 TADEFRRHYAGSRV-AHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQ 158
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
G CGSCWAFST+AAVEGIN I T L SLSEQ+LVDCD K NAGCNGGLMDYAFQ+I ++
Sbjct: 159 GQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKH 218
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+ +E YPY + C S A VV+IDGYEDV DE +LKKAVA QPVSVAIEA
Sbjct: 219 GGVAAEDAYPYRARQASCKKS--PAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEAS 276
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRN 338
G FQ Y GVF+G CG+ LDHGV AVGYG T +G YWLV+NSWG +WGE GY+++ R+
Sbjct: 277 GSHFQFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARD 336
Query: 339 LLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
+ G CGIAMEASYPVK S N PK H+
Sbjct: 337 VA-AKEGHCGIAMEASYPVKTSPN---PKVHA 364
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 353 bits (907), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 170/314 (54%), Positives = 216/314 (68%), Gaps = 3/314 (0%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
+Y+ W + H S + +KRF +FK N + N +++ YK+ LNKFAD+TN E+R
Sbjct: 37 LYERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRN 95
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
Y G++ R + + + D +P SVDWR+KGAV VKDQG CGSCWAFST
Sbjct: 96 TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
+ AVEGIN+I T +L+SLSEQELVDCD N GCNGGLMDYAF+FI Q GG+ +E +YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+ CD S+ NA VSIDG+E+V DE +L KAVA+QPVSVAI+AGG FQ Y GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG CG+ LDHGV VGYGT +G YW V+NSWG +WGE GY++++R + D G CGI
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKE-GLCGI 334
Query: 350 AMEASYPVKNSQNS 363
AMEASYP+K S N+
Sbjct: 335 AMEASYPIKKSSNN 348
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 353 bits (906), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 180/360 (50%), Positives = 237/360 (65%), Gaps = 13/360 (3%)
Query: 15 FLFFISSSSAADMSIISYDNNHD-HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRF 73
FL+ + S S ++ N+ D H +++ + +Y+ W + H S +G KRF
Sbjct: 6 FLWVVLSLSL----VLGVANSFDFHDKDLESEESLWDLYERWRSHH-TVSRSLGDKHKRF 60
Query: 74 QIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY 133
+FK N+ + N +++ YK+ LNKFAD+TN E+R+ Y G++ + R + +
Sbjct: 61 NVFKANMMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNGTF 120
Query: 134 ACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQEL 193
+ +P SVDWR+KGAV VKDQG CGSCWAFSTV AVEGIN+I T +L+SLSEQEL
Sbjct: 121 MYEKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQEL 180
Query: 194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYE 253
VDCD + NAGCNGGLM+ AFQFI Q GG+ +E YPY + CD S+ N VSIDG+E
Sbjct: 181 VDCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSIDGHE 240
Query: 254 DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TEN 312
+V DE +L KAVA+QPVSVAI+AGG FQ Y GVFTG+C + L+HGV VGYG T +
Sbjct: 241 NVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGATVD 300
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN-----SAKPK 367
G YW+VRNSWG +WGE GY+++QRN + G CGIAM ASYP+KNS N S+ PK
Sbjct: 301 GTSYWIVRNSWGPEWGELGYIRMQRN-ISKKEGLCGIAMLASYPIKNSSNNPTGPSSSPK 359
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 353 bits (905), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 180/364 (49%), Positives = 244/364 (67%), Gaps = 10/364 (2%)
Query: 9 AISTLVFLFFISS-SSAADMSIISYDNNHDHSS-----SWRTDDEVMTIYQTWLAKHGKT 62
A+ L+ ISS ++A DMSI+S ++NH ++ D E ++++W+ KHGK
Sbjct: 7 AMLVLLLAMVISSCATAMDMSIVSSNDNHHVTNGPGRRQGVFDAEATLMFESWMVKHGKV 66
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
+ E+R IF+DNLRFI N+ N +Y++GLN+FADL+ EY + G R
Sbjct: 67 YESVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYAQICHGADPRPPRN 126
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
+ +S RY GD LP+SVDWR +GAV VKDQG C SCWAFSTV AVEG+NKIVT
Sbjct: 127 HV-FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVGAVEGLNKIVT 185
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC-DPSR 241
GEL++LSEQ+L++C+++ N GC GG ++ A++FI+ NGG+ ++ DYPY C D +
Sbjct: 186 GELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCNDRLK 244
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
N K V IDGYE++ DE +L KAVA QPV+ +++ R FQ Y SGVF G CG+ L+H
Sbjct: 245 ENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFDGTCGTNLNH 304
Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
GVV VGYGTENG DYW+VRNS G+ WGE GY+K+ RN+ + G CGIAM ASYP+KNS
Sbjct: 305 GVVVVGYGTENGRDYWIVRNSRGNTWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSF 363
Query: 362 NSAK 365
++ K
Sbjct: 364 STDK 367
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 353 bits (905), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 172/336 (51%), Positives = 226/336 (67%), Gaps = 8/336 (2%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ +Y+ W + H S +G KRF +FK N+ + N +++ YK+ L
Sbjct: 26 HDKDLASEESFWDLYERWRSHH-TVSRSLGDKHKRFNVFKANVMHVHNTNKMDKPYKLKL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + R + + + + +P SVDWR+ GAV VK
Sbjct: 85 NKFADMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSVDWRKNGAVTGVK 144
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFSTV AVEGIN+I T +L+SLSEQELVDCD K NAGCNGGLM+ AF+FI
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIK 204
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ +E +YPY + CD S+ N VSIDG+E+V DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAID 264
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG+C + L+HGV VGYGT +G +YW VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQ 324
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQN-----SAKPK 367
R+ + G CGIAM ASYP+KNS N S+ PK
Sbjct: 325 RS-ISKKEGLCGIAMMASYPIKNSSNNPTGPSSSPK 359
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 170/324 (52%), Positives = 233/324 (71%), Gaps = 8/324 (2%)
Query: 44 TDDEVMTIYQTWLAKHGK---TSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNK 99
+D ++ Y +W AK GK +SN +G ++RF+ FK+N R+I+EHN + +Y++GLN+
Sbjct: 5 SDSDLSGEYASWCAKFGKECASSNSLG--DRRFETFKENFRYIEEHNRAGKHSYRLGLNQ 62
Query: 100 FADLTNEEYRAMYLGTRSD-AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
F+DLT+EE+R +LG R D ++K S +LP SVDWR+ GAV KD
Sbjct: 63 FSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPKD 122
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QGSCG CWAF+T A+EGIN+IVTG+L+SLSEQEL+DCD+K + GC+GGLM+ A+QFI++
Sbjct: 123 QGSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQFIVE 182
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG+D+E DYPY +E+ C+ + N++VV+IDGYE + DE +L +AVA QPVSVAIE
Sbjct: 183 NGGLDTETDYPYHASESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEG 242
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+ FQHY SGVFTG CG ++HGV+ VGYGTE+G+DYW+V+NSW + WG+ G+VK+QRN
Sbjct: 243 ASKDFQHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRN 302
Query: 339 LLDTNTGKCGIAMEASYPVKNSQN 362
G C I ASYPVK+ N
Sbjct: 303 -TGKRGGLCSINTLASYPVKSGGN 325
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 351 bits (901), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 168/327 (51%), Positives = 226/327 (69%), Gaps = 3/327 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ + +Y+ W + H S +G KRF +FK NL + N +++ YK+ L
Sbjct: 26 HDKDLASEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLKL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + R + + + + +P SVDWR+KGAV VK
Sbjct: 85 NKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVK 144
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFSTV AVEGIN+I T +L++LSEQELVDCD++ N GCNGGLM+ AF+FI
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIK 204
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ +E +YPY E CD S+ N VSIDG+E+V DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAID 264
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG+C + L+HGV VGYGT +G +YW+VRNSWG +WGE+GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQ 324
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
RN + G CGIAM SYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMLPSYPIKNSSDN 350
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 351 bits (901), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 168/327 (51%), Positives = 226/327 (69%), Gaps = 3/327 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ + +Y+ W + H S +G KRF +FK NL + N +++ YK+ L
Sbjct: 25 HDKDLASEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLKL 83
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + R + + + + +P SVDWR+KGAV VK
Sbjct: 84 NKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVK 143
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFSTV AVEGIN+I T +L++LSEQELVDCD++ N GCNGGLM+ AF+FI
Sbjct: 144 DQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIK 203
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ +E +YPY E CD S+ N VSIDG+E+V DE +L KAVA+QPVSVAI+
Sbjct: 204 QKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAID 263
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG+C + L+HGV VGYGT +G +YW+VRNSWG +WGE+GY+++Q
Sbjct: 264 AGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQ 323
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
RN + G CGIAM SYP+KNS ++
Sbjct: 324 RN-ISKKEGLCGIAMLPSYPIKNSSDN 349
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 350 bits (899), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 171/333 (51%), Positives = 229/333 (68%), Gaps = 8/333 (2%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
D SI+ Y + D+++ +++ W++ K + RF++FKDNL+ IDE
Sbjct: 30 DYSIVGYS-----PEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDE 84
Query: 86 HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
N ++Y +GLN+FADL++EE++ MYLG ++D RR + A +A + + +P+SV
Sbjct: 85 TNKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYA--EFAYRDVEAVPKSV 142
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DWR+KGAV VK+QGSCGSCWAFSTVAAVEGINKIVTG L +LSEQEL+DCD N GCN
Sbjct: 143 DWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCN 202
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GGLMDYAF++I++NGG+ E+DYPY E C+ + ++ V+I+G++DV DE SL K
Sbjct: 203 GGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLK 262
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
A+A QP+SVAI+A GR FQ Y GVF G CG LDHGV AVGYG+ G DY +V+NSWG
Sbjct: 263 ALAHQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGP 322
Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
WGE GY++L+RN G CGI AS+P K
Sbjct: 323 KWGEKGYIRLKRN-TGKPEGLCGINKMASFPTK 354
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 350 bits (898), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 173/328 (52%), Positives = 228/328 (69%), Gaps = 19/328 (5%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFAD 102
++D + ++Y+ W + H S + +KRF +FK+N++FI E N + + T+K+ LNKF D
Sbjct: 30 SEDSLWSLYERWRSHHA-VSRDLDQKQKRFNVFKENVKFIHEFNKNKDVTFKLALNKFGD 88
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL-------PESVDWREKGAVNP 155
+TN+E+RA Y G++ R + S R+ +G + P S+DWRE+GAV
Sbjct: 89 MTNQEFRAKYAGSKVHHHRTMKGS-----RHGSGSGAKFMYENAVAPPSIDWRERGAVAA 143
Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
VK+QG CGSCWAFS +AAVEGIN+IVT EL+ LSEQEL+DCD N GC+GGLMDYAF+F
Sbjct: 144 VKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQELIDCDTDQNQGCSGGLMDYAFEF 203
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
I NGG+ +E YPY + C ++N+ V IDGYEDV DE +L KAVA+QPV+VA
Sbjct: 204 IKNNGGITTEDVYPYQAEDATC---KKNSPAVVIDGYEDVPTNDEDALMKAVANQPVAVA 260
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVK 334
IEA G FQ Y GVFTG CG+ LDHGV VGYG T++G YW VRNSWG+DWGE+GYV+
Sbjct: 261 IEASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQDGTKYWTVRNSWGADWGESGYVR 320
Query: 335 LQRNLLDTNTGKCGIAMEASYPVKNSQN 362
+QR + T+ G CGIAM+ASYP+K S N
Sbjct: 321 MQRGIKATH-GLCGIAMQASYPIKTSLN 347
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 176/360 (48%), Positives = 240/360 (66%), Gaps = 14/360 (3%)
Query: 4 ASMFLAISTLVFLFFIS----SSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
A +F + T +FL F+S S+ A + SI+ Y + +V+ ++++WLAKH
Sbjct: 2 AFIFSSKKTSLFLVFVSVLACSALANEFSILGYA-----PEDLTSIHKVIHLFESWLAKH 56
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K + RF+IF DNL+ ID+ N Y +GLN+FADLT+EE++ +LG + +
Sbjct: 57 SKIYESLDEKLHRFEIFMDNLKHIDDTNKKVSNYWLGLNEFADLTHEEFKNKFLGLKGEL 116
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
R +S + ++ + +LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+
Sbjct: 117 PERKDES---IEEFSYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQ 173
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
IVTG L LSEQEL+DCD N GCNGGLMDYAF +++++G + E++YPY+ +E CD
Sbjct: 174 IVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSG-LHKEEEYPYIMSEGTCDE 232
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
+ ++ V+I GY DV +E S KA+A+QP+SVAIEA GR FQ Y GVF G CG+ L
Sbjct: 233 KKDVSETVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTEL 292
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
DHGV AVGYGT G+DY +VRNSWG WGE GY++++R + G CG+ M ASYP K
Sbjct: 293 DHGVAAVGYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRKTGKPH-GMCGLYMMASYPTKQ 351
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 166/327 (50%), Positives = 226/327 (69%), Gaps = 3/327 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ + +Y+ W + H S + KRF +FK+N+ + N +++ YK+ L
Sbjct: 26 HEKDLASEESLWDLYERWRSHH-TVSRSLTEKHKRFNVFKENVMHVHNTNKMDKPYKLKL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + + ++ + + + +P SVDWR+KGAV VK
Sbjct: 85 NKFADMTNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVK 144
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFSTV AVEGIN+I T +L+SLSEQELVDCD++ N GCNGGLM+ AF+FI
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIK 204
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ +E +YPY E CD S+ N VSIDG+E+V DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAID 264
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GV TG+C + L+HGV VGYGT +G +YW+VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQ 324
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
RN + G CGIAM ASYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMMASYPIKNSSDN 350
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 175/352 (49%), Positives = 235/352 (66%), Gaps = 13/352 (3%)
Query: 11 STLVFLF---FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
++L+FLF S+ A + SI+ Y + +V+ ++++WL KH K +
Sbjct: 10 TSLLFLFVSILACSALAHEFSILGYA-----PEDLTSIHKVIHLFESWLVKHSKFYESLD 64
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
RF+IF DNL+ IDE N Y +GLN+FADLT+EE++ +LG + + R +S
Sbjct: 65 EKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDES- 123
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
S+ + + +LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+IVTG L
Sbjct: 124 --SKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTM 181
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQEL+DCD N GCNGGLMDYAF +++++G + E++YPY+ +E CD + ++ V
Sbjct: 182 LSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSG-LHKEEEYPYIMSEGTCDEKKDVSEKV 240
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
+I GY DV DE S KA+A+QP+SVAIEA GR FQ Y GVF G CG+ LDHGV AVG
Sbjct: 241 TISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVG 300
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
YGT G+DY +VRNSWG WGE GY++++R + G CG+ M ASYP K
Sbjct: 301 YGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH-GMCGLYMMASYPTKQ 351
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 175/359 (48%), Positives = 242/359 (67%), Gaps = 9/359 (2%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRT-----DDEVMTIYQTWLAKHGKTSNGMG 67
L+ L S ++A DMS++S ++NH ++ D E ++++W+ KHGK + +
Sbjct: 12 LLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDSVA 71
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
E+R IF+DNLRFI N+ N +Y++GLN+FADL+ EY + G R +
Sbjct: 72 EKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV-FM 130
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+S RY GD LP+SVDWR +GAV VKDQG C SCWAFSTV AVEG+NKIVTGEL++
Sbjct: 131 TSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGELVT 190
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKV 246
LSEQ+L++C+++ N GC GG ++ A++FI+ NGG+ ++ DYPY C+ + + K
Sbjct: 191 LSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDNKN 249
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V IDGYE++ DE +L KAVA QPV+ +++ R FQ YESGVF G CG+ L+HGVV V
Sbjct: 250 VMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVVVV 309
Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
GYGTENG DYW+V+NS G WGE GY+K+ RN+ + G CGIAM ASYP+KNS ++ K
Sbjct: 310 GYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 367
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 165/277 (59%), Positives = 210/277 (75%), Gaps = 16/277 (5%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
MSI+SY R+++E +Y W+A HG+T N +G E+RF++F+DNLR++D H
Sbjct: 29 MSIVSYGE--------RSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAH 80
Query: 87 NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
N+ ++++GLN+FADLTN+EYRA YLG RS R + + RY ++LP
Sbjct: 81 NAAADAGVHSFRLGLNRFADLTNDEYRATYLGVRS----RPQRERRLGDRYLAGDNEDLP 136
Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
ESVDWR KGAV VKDQGSCGSCWAFST+AAVEGIN+IVTG++ISLSEQELVDCD N
Sbjct: 137 ESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQ 196
Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV E S
Sbjct: 197 GCNGGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKS 256
Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
L+KAVA+QP+SVAIEAGGRAFQ Y SG+FTG CG+++
Sbjct: 257 LQKAVANQPISVAIEAGGRAFQLYNSGIFTGTCGNSV 293
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 168/304 (55%), Positives = 216/304 (71%), Gaps = 5/304 (1%)
Query: 56 LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGT 115
++KHGK+ RF++F+DNL+ IDE N +Y +GLN+FADL++EE++ YLG
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGL 60
Query: 116 RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
+ + L K + + + ++ K +LP+SVDWR+KGAV VK+QG+CGSCWAFSTVAAVE
Sbjct: 61 KIE----LPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116
Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
GIN+IVTG L +LSEQEL+DCD+ N GCNGGLMDYAF FII NGG+ E+DYPY+ E
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEG 176
Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
C + +VV+I GY DV +E S KA+A+QP+SVAIEA R FQ Y G+F G C
Sbjct: 177 TCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHC 236
Query: 296 GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
G+ LDHGV AVGYGT GVDY V+NSWGS WGE GY++++RN + G CGI ASY
Sbjct: 237 GTELDHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRN-VGKPEGICGIYKMASY 295
Query: 356 PVKN 359
P KN
Sbjct: 296 PTKN 299
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 177/352 (50%), Positives = 238/352 (67%), Gaps = 11/352 (3%)
Query: 15 FLFFISSSSAADMSIISYDNNHDHS------SSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
L F + SAA +S+ S +HD+S + D+++ +++ W++ K +
Sbjct: 9 ILCFPLALSAATLSL-SVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEE 67
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
RF++FKDNL+ IDE N ++Y +GLN+FADL++EE++ MYLG ++D RR +
Sbjct: 68 KLLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSY 127
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
A +A + + +P+SVDWR+KGAV VK+QGSCGSCWAFSTVAAVEGINKIVTG L +L
Sbjct: 128 A--EFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTL 185
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQEL+DCD N GCNGGLMDYAF++I++NGG+ E+DYPY E C+ + ++ V+
Sbjct: 186 SEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVT 245
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES-GVFTGECGSALDHGVVAVG 307
IDG++DV DE SL KA+A QP+SVAI+A GR FQ Y VF G CG LDHGV AVG
Sbjct: 246 IDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYSGVSVFDGRCGVDLDHGVAAVG 305
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
YG+ G DY +V+NSWG WGE GY++L+RN G CGI AS+P K
Sbjct: 306 YGSSKGSDYIIVKNSWGPKWGEKGYIRLKRN-TGKPEGLCGINKMASFPTKT 356
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 169/324 (52%), Positives = 231/324 (71%), Gaps = 8/324 (2%)
Query: 44 TDDEVMTIYQTWLAKHGK---TSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNK 99
+D ++ Y +W AK GK +SN +G + RF+ FK+N R+I+EHN + +Y++GLN+
Sbjct: 5 SDSDLSGEYASWCAKFGKECASSNSLG--DHRFETFKENFRYIEEHNRAGKHSYRLGLNQ 62
Query: 100 FADLTNEEYRAMYLGTRSD-AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
F+DLT+EE+R +LG R D ++K S +LP SVDWR+ GAV KD
Sbjct: 63 FSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPKD 122
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QGSCG CWAF+T A+EGIN+IVTG+L+SLSEQEL+DCD+K + GC+GGLM+ A+QFI++
Sbjct: 123 QGSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQFIVE 182
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG+D+E DYPY +E+ C+ + N++VV+IDGY+ + DE +L AVA QPVSVAIE
Sbjct: 183 NGGLDTETDYPYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEG 242
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+ FQHY SGVFTG CG ++HGV+ VGYGTE+G+DYW+V+NSW + WG+ G+VK+QRN
Sbjct: 243 ASKDFQHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRN 302
Query: 339 LLDTNTGKCGIAMEASYPVKNSQN 362
G C I ASYPVK+ N
Sbjct: 303 -TGKRGGLCSINTLASYPVKSGGN 325
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 168/315 (53%), Positives = 221/315 (70%), Gaps = 6/315 (1%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLT 104
D++ ++ W KHGKT ++R QIFKDN F+ +HN + N TY + LN FADLT
Sbjct: 26 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLT 85
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
+ E++A LG A +M SK S + K +P+SVDWR+KGAV VKDQGSCG+
Sbjct: 86 HHEFKASRLGLSVSAPSVIMASKGQSLGGSVK----VPDSVDWRKKGAVTNVKDQGSCGA 141
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CW+FS A+EGIN+IVTG+LISLSEQEL+DCD+ NAGCNGGLMDYAF+F+I+N G+D+
Sbjct: 142 CWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDT 201
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E+DYPY + C + KVV+ID Y V DE +L +AVA QPVSV I RAFQ
Sbjct: 202 EKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQ 261
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y SG+F+G C ++LDH V+ VGYG++NGVDYW+V+NSWG WG +G++ +QRN +++
Sbjct: 262 LYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSD- 320
Query: 345 GKCGIAMEASYPVKN 359
G CGI M ASYP+K
Sbjct: 321 GVCGINMLASYPIKT 335
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 172/366 (46%), Positives = 239/366 (65%), Gaps = 15/366 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M + L +LV +F ++ S D ++ +++ + +Y+ W + H
Sbjct: 1 MKMEKVILVALSLVLVFGLAESFDFDEKDLA------------SEESLWDLYERWRSYH- 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
S + KRF +FK+N + + + N +++ YK+ LNKFAD+TN E+R+ Y G++
Sbjct: 48 TVSRDLEEKNKRFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHY 107
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R L + + + + LP SVDWR+KGAV +KDQG CGSCWAFSTV VEGIN+I
Sbjct: 108 RMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQI 167
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
T EL+SLSEQ+L+DCDR + GCNGGLM+ AF+FI +NGG+ +E +YPY + +CD
Sbjct: 168 KTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDML 227
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ NA VV+IDG+E V DE +L KAVA QPVSVAI+AGG Q Y GVF GECG+ LD
Sbjct: 228 KMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELD 287
Query: 301 HGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
HGV VGYGT +G YW+V+NSWG++WGE GY+++ R + G+CGIAMEASYPVK+
Sbjct: 288 HGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARG-IQAAEGQCGIAMEASYPVKS 346
Query: 360 SQNSAK 365
S N+ +
Sbjct: 347 SNNTRR 352
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 179/356 (50%), Positives = 241/356 (67%), Gaps = 25/356 (7%)
Query: 31 SYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG--MGHNEKRF--QIFKDNLRFIDEH 86
SY + + R D+EV +Y+ W +KHG+ M +E R ++F+DNLR+ID H
Sbjct: 33 SYTTTIVPAPAERADEEVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAH 92
Query: 87 NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAK----RRLMKSKVAS-------- 130
N+ T+++GL FADLT EEYR LG R+ + R S+V S
Sbjct: 93 NAEADAGLHTFRLGLTPFADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHR 152
Query: 131 -QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
R + GD LP+++DWR+ GAV VK+Q CG CWAFS VAA+EGIN IVTG L+SLS
Sbjct: 153 RPRPRPRCGD-LPDAIDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLS 211
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN-AKVVS 248
EQE++DCD + ++GCNGG M+ AFQF+I NGG+DSE DYP++ + CD ++ N KV +
Sbjct: 212 EQEIIDCDTQ-DSGCNGGQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAA 270
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
IDG+ +V+ +E +L++AVA QPVSVAI+AGGRAFQHY SG+F G CG+ LDHGV VGY
Sbjct: 271 IDGFVEVASNNETALQEAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGY 330
Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
G+ENG YW+V+NSW WGE GY++++RN+ GKCGIAM+ASYPVK++ A
Sbjct: 331 GSENGKAYWIVKNSWSDSWGEAGYIRIRRNVF-LPVGKCGIAMDASYPVKDTYGPA 385
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 174/316 (55%), Positives = 221/316 (69%), Gaps = 7/316 (2%)
Query: 46 DEVMTIYQTWLAKHGKT--SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
DE ++ W+++HG+ H KRF +FK+N+ I+E N +T+K+ +N+FADL
Sbjct: 31 DEDSMRHEEWMSQHGRVYADEQEDHKNKRFNVFKENVERIEEFND-GKTFKLAINQFADL 89
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TNEE+RA Y G + +K RY LP SVDWR+KGAV PVK+QG CG
Sbjct: 90 TNEEFRASYNGFKGPMVLSSQITKPTPFRYE-NVSSALPVSVDWRKKGAVTPVKNQGQCG 148
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VAA+EGI +I TG+LISLSEQELVDCD K I+ GC GGLMD AF+FII NGG+
Sbjct: 149 CCWAFSAVAAIEGITQISTGKLISLSEQELVDCDTKGIDHGCEGGLMDTAFEFIINNGGL 208
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E +YPY G + C+ ++ N VSI GYEDV DE +L KAVA QPVSVAIEAGG
Sbjct: 209 TTESNYPYKGEDGTCNFNKTNPIAVSITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSD 268
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y SGVFTGECG+ LDH V AVGYG +E+G YW+V+NSWG+ WGE+GY+++Q++ +
Sbjct: 269 FQFYSSGVFTGECGTELDHAVTAVGYGESEDGSKYWIVKNSWGTKWGESGYIEMQKD-IK 327
Query: 342 TNTGKCGIAMEASYPV 357
G CGIAM+ASYP
Sbjct: 328 VKQGLCGIAMQASYPT 343
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 172/326 (52%), Positives = 228/326 (69%), Gaps = 15/326 (4%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNE-----KRFQIFKDNLRFIDEHNSLNRTYKVGLN 98
+++ + +Y+ W + H S G E + F +FK+N+R+I E N R++++ LN
Sbjct: 34 SEESLRALYEQWRS-HYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKGRSFRLALN 92
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKV-----ASQRYACKAGDELPESVDWREKGAV 153
KFAD+T +E+R Y R + S + S YA +AG+ LP +VDWR++GAV
Sbjct: 93 KFADMTTDEFRRAYAAGSRTRHHRALSSGIRRHGDGSFMYA-QAGN-LPLAVDWRQRGAV 150
Query: 154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
+KDQG CGSCWAFST+AAVEGINKI TG+L+SLSEQELVDCD N GCNGGLMDYAF
Sbjct: 151 TGIKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAF 210
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
Q+I +NGG+ +E +YPYL + C+ ++ + V+IDGYEDV +E +L+KAVA+QPVS
Sbjct: 211 QYIKRNGGITTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPVS 270
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGY 332
+AIEA G+ FQ Y GVFTG CG+ LDHGV AVGYG T +G YW+V+NSWG DWGE GY
Sbjct: 271 IAIEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERGY 330
Query: 333 VKLQRNLLDTNTGKCGIAMEASYPVK 358
+++QR + D+ G CGIAME SYP K
Sbjct: 331 IRMQRGISDSQ-GLCGIAMEPSYPTK 355
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 166/324 (51%), Positives = 218/324 (67%), Gaps = 3/324 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ + +Y+ W + H S + KRF +F+ N+ + N +++ YK+ L
Sbjct: 24 HEKDLESEESLWDLYEKWRSHH-TVSTSLDEKRKRFNVFRANVLHVHNTNKMDKPYKLKL 82
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R Y ++ + + + + D++P S+DWR+KGAV PVK
Sbjct: 83 NKFADMTNHEFRTAYASSKVKHHTMFRGAPLGNGSFMYGNIDKVPASIDWRKKGAVTPVK 142
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFST+ AVEGIN I T +LISLSEQELVDC+ N GCNGGLMDYAF+FI
Sbjct: 143 DQGKCGSCWAFSTIVAVEGINFIKTNKLISLSEQELVDCNTGENHGCNGGLMDYAFEFIT 202
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+ G+ +E +YPY + CD ++ N VSIDG+EDV +E +L KAVA+QPVSVAI+
Sbjct: 203 KQKGITTEANYPYRAQDGHCDANKANQPAVSIDGHEDVLHNNENALLKAVANQPVSVAID 262
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTGECG LDHGV VGYGT +G YW+VRNSWG +WGE GY+++Q
Sbjct: 263 AGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTTVDGTKYWIVRNSWGPEWGERGYIRMQ 322
Query: 337 RNLLDTNTGKCGIAMEASYPVKNS 360
R + D G CGIAMEASYP+K S
Sbjct: 323 RGISD-RRGLCGIAMEASYPIKKS 345
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 172/337 (51%), Positives = 237/337 (70%), Gaps = 21/337 (6%)
Query: 43 RTDDEVMTIYQTWLAKHGK--TSN-----GMGHNEK------RFQIFKDNLRFIDEHNSL 89
R D+EV +Y+ W +KHG+ +SN G +E+ R ++F+DNLR+ID+HN+
Sbjct: 75 RADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAE 134
Query: 90 N----RTYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPES 144
T+++GL FADLT +EYR LG + R + GD LP++
Sbjct: 135 ADAGLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDA 194
Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGC 204
+DWR+ GAV VKDQ CG CWAFS VAA+EGIN I TG L+SLSEQE++DCD + ++GC
Sbjct: 195 IDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ-DSGC 253
Query: 205 NGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN-AKVVSIDGYEDVSPFDEMSL 263
+GG M+ AF+F+I NGG+D+E DYP++G + CD S+ N KV +IDG +V+ +E +L
Sbjct: 254 DGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETAL 313
Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSW 323
++AVA QPVSVAI+A GRAFQHY SG+F G CG++LDHGV AVGYG+E+G DYW+V+NSW
Sbjct: 314 QEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVKNSW 373
Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
+ WGE GY++++RN + TGKCGIAM+ASYPVK++
Sbjct: 374 SASWGEAGYIRMRRN-VPRPTGKCGIAMDASYPVKDT 409
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 173/337 (51%), Positives = 240/337 (71%), Gaps = 25/337 (7%)
Query: 43 RTDDEVMTIYQTWLAKHGK--TSN-----GMGHNEK-------RFQIFKDNLRFIDEHNS 88
R D+EV +Y+ W +KHG+ +SN G +E+ R ++F+DNLR+ID HN+
Sbjct: 45 RADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNA 104
Query: 89 LN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES 144
T+++GL FADLT EEYR LG R+ + Y+ + GD LP++
Sbjct: 105 EADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGR---RSGARYGSGYSVRGGD-LPDA 160
Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGC 204
+DWR+ GAV VKDQ CG CWAFS VAA+EG+N I TG L+SLSEQE++DCD + ++GC
Sbjct: 161 IDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGC 219
Query: 205 NGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSL 263
+GG M+ AF+F+I NGG+D+E DYP++G + CD S+ +N KV +IDG +V+ +E +L
Sbjct: 220 DGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVEVASNNETAL 279
Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSW 323
++AVA QPVSVAI+A GRAFQHY SG+F G CG++LDHGV AVGYG+E+G DYW+V+NSW
Sbjct: 280 QEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVKNSW 339
Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
+ WGE GY++++RN + TGKCGIAM+ASYPVK++
Sbjct: 340 SASWGEAGYIRMRRN-VPRPTGKCGIAMDASYPVKDT 375
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 183/364 (50%), Positives = 239/364 (65%), Gaps = 14/364 (3%)
Query: 1 MATASMFLAISTLVFLFFISSSSA--ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAK 58
MA+ + +S + L + + A +D SI+ Y + D SS+ R ++ +++ WLAK
Sbjct: 1 MASPQHLMKLSGALLLLCVGACVARNSDFSIVGY-SEEDLSSNER----LVELFEKWLAK 55
Query: 59 HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
H K RF++FKDNL+ ID+ N +Y +GLN+FADLT++E++A YLG +
Sbjct: 56 HQKAYASFEEKLHRFEVFKDNLKHIDKINREVTSYWLGLNEFADLTHDEFKAAYLGLDAA 115
Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
RR S RY + +LP+SVDWR+KGAV VK+QG CGSCWAFSTVAAVEGIN
Sbjct: 116 PARR---GSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGIN 172
Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC- 237
IVTG L +LSEQEL+DC N+GCNGGLMDYAF +I +GG+ +E+ YPYL E C
Sbjct: 173 AIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYLMEEGSCG 232
Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
D + ++ V+I GYEDV DE +L KA+A QPVSVAIEA GR FQ Y GVF G CG+
Sbjct: 233 DGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGA 292
Query: 298 ALDHGVVAVGYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
LDHGV AVGYG++ G DY +VRNSWG+ WGE GY++++R G CGI ASY
Sbjct: 293 QLDHGVAAVGYGSDKGKGHDYIIVRNSWGAQWGEKGYIRMKRG-TSNGEGLCGINKMASY 351
Query: 356 PVKN 359
P K+
Sbjct: 352 PTKD 355
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 174/352 (49%), Positives = 234/352 (66%), Gaps = 13/352 (3%)
Query: 11 STLVFLF---FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
++L+FLF S A + SI+ Y + +V+ ++++WL KH K +
Sbjct: 10 TSLLFLFVSILACSPLAHEFSILGYA-----PEDLTSIHKVIHLFESWLVKHSKFYESLD 64
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
RF+IF DNL+ IDE N Y +GLN+FADLT+EE++ +LG + + R +S
Sbjct: 65 EKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDES- 123
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
S+ + + +LP+SVDWR+KGAV PVK+QG CG+CWAFSTVAAVEGIN+IVTG L
Sbjct: 124 --SKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVAAVEGINQIVTGNLTM 181
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQEL+DCD N GCNGGLMDYAF +++++G + E++YPY+ +E CD + ++ V
Sbjct: 182 LSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSG-LHKEEEYPYIMSEGTCDEKKDVSEKV 240
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
+I GY DV DE S KA+A+QP+SVAIEA GR FQ Y GVF G CG+ LDHGV AVG
Sbjct: 241 TISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVG 300
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
YGT G+DY +VRNSWG WGE GY++++R + G CG+ M ASYP K
Sbjct: 301 YGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH-GMCGLYMMASYPTKQ 351
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 171/361 (47%), Positives = 238/361 (65%), Gaps = 15/361 (4%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
+ L +LV +F ++ S D ++ +++ + +Y+ W + H S
Sbjct: 4 VILVALSLVLVFGLAESFDFDEKDLA------------SEESLWDLYERWRSYH-TVSRD 50
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
+ KRF +FK+N + + + N +++ YK+ LNKFAD+TN E+R+ Y G++ R L
Sbjct: 51 LEEKNKRFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRG 110
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+ + + + LP SVDWR+KGAV +KDQG CGSCWAFSTV VEGIN+I T EL
Sbjct: 111 DRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKEL 170
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
+SLSEQ+L+DCDR + GCNGGLM+ AF+FI +NGG+ +E +YPY + +CD + NA
Sbjct: 171 LSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAP 230
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VV+IDG+E V DE +L KAVA QPVSVAI+AGG Q Y GVF GECG+ LDHGV
Sbjct: 231 VVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAI 290
Query: 306 VGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
VGYGT +G YW+V+NSWG++WGE GY+++ R + G+CGIAMEASYPVK+S N+
Sbjct: 291 VGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARG-IQAAEGQCGIAMEASYPVKSSNNTR 349
Query: 365 K 365
+
Sbjct: 350 R 350
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 178/340 (52%), Positives = 229/340 (67%), Gaps = 27/340 (7%)
Query: 43 RTDDEVMTIYQTWLAKH------GKTSNGMGHNE-------------KRFQIFKDNLRFI 83
RTD+EV +Y+ W ++H G T +G + +R ++F+DNLR+I
Sbjct: 44 RTDEEVRRLYEEWRSEHDAGPRRGATGGSLGPGDADAGAGAGEDDDARRLEVFRDNLRYI 103
Query: 84 DEHNSLN----RTYKVGLNKFADLTNEEYRA-MYLGTRSDAKRRLMKSKVASQRYACKAG 138
D HN+ +++GL +FADLT EEYRA + LG+R + V +RY AG
Sbjct: 104 DAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSR--GRNGTAVGVVGRRRYLPLAG 161
Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
++LP++VDWRE+GAV VKDQG CG CWAFS VAAVEGINKIVTG LISLSEQEL+DCD+
Sbjct: 162 EQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLISLSEQELIDCDK 221
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
+ GC+GGLMD AF F+I+NGG+D+E DYP+ G + CD +N +VVSID +E V
Sbjct: 222 FQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPIN 281
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
E +L+KAVA QPVS +IEA RAFQ Y SG+F G CG+ LDHGV VGYG+E G DYW+
Sbjct: 282 YERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGSEGGKDYWI 341
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
V+NSWG+ WGE GYV++ RN + GIAME YPVK
Sbjct: 342 VKNSWGTQWGEAGYVRMARN-VRVRPPSAGIAMEPLYPVK 380
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 178/368 (48%), Positives = 242/368 (65%), Gaps = 18/368 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M T + LA+ ++V +F ++ S +D + +S ++ + +Y+ W + H
Sbjct: 1 MDTRKVILAVFSVVLVFRLADS---------FDYTEEDLAS---EERLRDLYERWRSHH- 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
S + ++RF +FK+NL+ I + N +R YK+ LN FAD+TN E+ Y G++ +
Sbjct: 48 TVSRSLAEKQERFNVFKENLKHIHKVNHKDRPYKLKLNSFADMTNHEFLQHYGGSKV-SH 106
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R+++ + + +LP SVDWR+ GAV +KDQG CGSCWAFSTVAAVEGINKI
Sbjct: 107 YRVLRGQRQGTGSMHEDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKI 166
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TGELISLSEQELVDCD N GCNGGLM+ AF FI Q GG+ SE YPY E CD +
Sbjct: 167 KTGELISLSEQELVDCDSD-NHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSN 225
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ N+ VV+IDGYE V DE +L KAVA+QPV++A++AGG+ Q Y +FTG+CG+ L+
Sbjct: 226 KMNSPVVNIDGYEMVPENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELN 285
Query: 301 HGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK- 358
HGV VGYG T++G YW+V+NSWG+DWGE GY+++QR +D G CGI MEASYPVK
Sbjct: 286 HGVALVGYGTTQDGTKYWIVKNSWGTDWGEKGYIRMQRG-IDAEEGLCGITMEASYPVKL 344
Query: 359 NSQNSAKP 366
S N P
Sbjct: 345 RSDNKKAP 352
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 167/315 (53%), Positives = 220/315 (69%), Gaps = 6/315 (1%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLT 104
D++ ++ W KHGKT ++R QIFKDN F+ +HN + N TY + LN FADLT
Sbjct: 26 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLT 85
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
+ E++A LG A +M SK S + K +P+SVDWR+KGAV VKDQGSCG+
Sbjct: 86 HHEFKASRLGLSVSAPSVIMASKGQSLGGSVK----VPDSVDWRKKGAVTNVKDQGSCGA 141
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CW+FS A+EGIN+IVTG+LISLSEQEL+DCD+ NAGCNGGLMDYAF+F+I+N G+D+
Sbjct: 142 CWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDT 201
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E+DYPY + C + KVV+ID Y V DE +L +AVA QPVSV I RAFQ
Sbjct: 202 EKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQ 261
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y G+F+G C ++LDH V+ VGYG++NGVDYW+V+NSWG WG +G++ +QRN +++
Sbjct: 262 LYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSD- 320
Query: 345 GKCGIAMEASYPVKN 359
G CGI M ASYP+K
Sbjct: 321 GVCGINMLASYPIKT 335
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 347 bits (889), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 181/361 (50%), Positives = 235/361 (65%), Gaps = 18/361 (4%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
S ++ F++S +A + I D T+D + +Y+ W + H S +
Sbjct: 4 FSLILVASFLASVAATAIDIADKD--------LETEDSLWNLYERWRSHH-TVSRDLDEK 54
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK- 127
+KRF +FK+N R+I + N + YK+ LNKFADLTN E+R+ Y G+R + R L S+
Sbjct: 55 QKRFNVFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRR 114
Query: 128 ---VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
S Y LP S+DWR+KGAV VKDQG CGSCWAFSTVAAVEGIN+I T +
Sbjct: 115 GGATNSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKK 174
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
L+SLSEQEL+DCD N GCNGGLMDYAF FI +NGG+ SE +YPY ++ C + + +
Sbjct: 175 LLSLSEQELIDCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYC-ATEKKS 233
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
VVSIDG+EDV DE SL KAVA+QPVS+AIEA G FQ Y GVFTG G+ LDHGV
Sbjct: 234 HVVSIDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVA 293
Query: 305 AVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
VGYG T+ G YW+VRNSWG++WGE GY+++ + + CG+AMEASYP+K S N
Sbjct: 294 IVGYGKTQQGTKYWIVRNSWGAEWGEKGYIRI--SAASDSKRLCGLAMEASYPIKTSPNP 351
Query: 364 A 364
+
Sbjct: 352 S 352
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 346 bits (888), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 175/358 (48%), Positives = 239/358 (66%), Gaps = 10/358 (2%)
Query: 16 LFFISSSSAADMSII-SYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
+FF++ S A + + S++ N S ++ + +Y+ W + H S + RF
Sbjct: 6 VFFVALSFALVLRVAESFEFNEKDLES---EEGLWDLYERWRSHH-TVSRSLDEKHNRFN 61
Query: 75 IFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA 134
+FK N+ + N +++ YK+ LN+FAD+TN E+R++Y G++ + R + + +
Sbjct: 62 VFKGNVMHVHSSNKMDKPYKLKLNRFADMTNHEFRSIYAGSKVNHHRMFRGTPRGNGTFM 121
Query: 135 CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELV 194
+ D +P SVDWR+KGAV VKDQG CGSCWAFST+ AVEGIN+I T +L+ LSEQELV
Sbjct: 122 YQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTHKLVPLSEQELV 181
Query: 195 DCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
DCD N GCNGGLM+ AF+FI Q G+ + +YPY + CD S+ N VSIDG+E+
Sbjct: 182 DCDTTQNQGCNGGLMESAFEFIKQY-GITTASNYPYEAKDGTCDASKVNEPAVSIDGHEN 240
Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENG 313
V +E +L KAVA QPVSVAIEAGG FQ Y GVFTG CG+ALDHGV VGYG T++G
Sbjct: 241 VPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSEGVFTGNCGTALDHGVAIVGYGTTQDG 300
Query: 314 VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
YW V+NSWGS+WGE GY++++R+ + G CGIAMEASYP+K S S+KP+ HSS
Sbjct: 301 TKYWTVKNSWGSEWGEKGYIRMKRS-ISVKKGLCGIAMEASYPIKKS--SSKPREHSS 355
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 346 bits (888), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 167/327 (51%), Positives = 222/327 (67%), Gaps = 3/327 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ +Y+ W + + S +G KRF +FK N+ + N +++ YK+ L
Sbjct: 26 HDKDLASEESFWDLYERWRS-YRTVSRSLGDKHKRFNVFKANVMHVHNTNKMDKPYKLKL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + R + + + + +P S DWR+ GAV VK
Sbjct: 85 NKFADMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSADWRKNGAVTGVK 144
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFSTV AVEGIN+I T +L+SLSEQELVDCD K NAGCNGGLM+ AF+FI
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIK 204
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ +E +YPY + CD S+ N VSIDG+E+V DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAID 264
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG+C + L+HGV VGYGT +G +YW VRNSWG +WGE GY+++Q
Sbjct: 265 AGGFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQ 324
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
R++ G CGIAM ASYP+KNS N+
Sbjct: 325 RSIFKKE-GLCGIAMMASYPIKNSSNN 350
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 346 bits (888), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 163/308 (52%), Positives = 224/308 (72%), Gaps = 8/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++GK EKRF+IFK+N+ +I+ +N+ N+ YK+ +N+FADLTNEE+
Sbjct: 586 HEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEF-- 643
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + + +P +VDWR+KGAV P+KDQG CG CWAFS
Sbjct: 644 --IAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 701
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + +G+LISLSEQELVDCD K ++ GC GGLMD AF+F+IQN G+++E +YP
Sbjct: 702 VAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYP 761
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + KC+ + VV+I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y+SG
Sbjct: 762 YKGVDGKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSG 821
Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG N G +YWLV+NSWG++WGE GY+++QR +D+ G CG
Sbjct: 822 VFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRG-VDSEEGLCG 880
Query: 349 IAMEASYP 356
IAM+ASYP
Sbjct: 881 IAMQASYP 888
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 173/315 (54%), Positives = 215/315 (68%), Gaps = 5/315 (1%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
D ++ +++ W+AK+ K RF++FKDNL IDE N TY +GLN FADLT+
Sbjct: 60 DRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTTYWLGLNAFADLTH 119
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
+E++A YLG R ++ S+ RY A D++P SVDWR+KGAV VK+QG CGSC
Sbjct: 120 DEFKATYLGLRQPETKKTTDSRF---RYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSC 176
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFSTVAAVEGIN+IVTG L SLSEQELVDC N GCNGG+MD AF +I +GG+ +E
Sbjct: 177 WAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSGGLRTE 236
Query: 226 QDYPYLGAENKCDPSRRNA-KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
+ YPYL E CD R+ +VV+I GYEDV DE +L KA+A QP+SVAIEA GR FQ
Sbjct: 237 EAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGRHFQ 296
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y GVF G CGS LDHGV AVGYG+ G DY +V+NSWGS WGE GY++++R
Sbjct: 297 FYSGGVFNGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGSHWGEKGYIRMKRG-TGKPE 355
Query: 345 GKCGIAMEASYPVKN 359
G CGI ASYP K+
Sbjct: 356 GLCGINKMASYPTKD 370
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 171/321 (53%), Positives = 216/321 (67%), Gaps = 6/321 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+++ + +Y+ W +H + +G +RF +FK+N+R I + N + YK+ LN+F D+
Sbjct: 39 SEEALWALYERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQRDEPYKLRLNRFGDM 97
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQGSC 162
T +E+R Y G+R R + S AG +LP SVDWR+KGAV VKDQG C
Sbjct: 98 TADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQC 157
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFST+AAVEGIN I T L SLSEQ+LVDCD K NAGC+GGLMDYAFQ+I ++GG+
Sbjct: 158 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHGGV 217
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E YPY + C S A V+IDGYEDV DE +LKKAVA QPVSVAIEA G
Sbjct: 218 AAEDAYPYKARQASCKKS--PAPAVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSH 275
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y GVF G CG+ LDHGV AVGYG +G YW+V+NSWG +WGE GY+++ R++
Sbjct: 276 FQFYSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVA- 334
Query: 342 TNTGKCGIAMEASYPVKNSQN 362
G CGIAMEASYPVK S N
Sbjct: 335 AKEGHCGIAMEASYPVKTSPN 355
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 180/350 (51%), Positives = 235/350 (67%), Gaps = 43/350 (12%)
Query: 12 TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT-SNGMGHNE 70
+L+ +F + SSA D+S+ S R+++EV I+QTW++KHGKT +N +G E
Sbjct: 13 SLLIIFLLPPSSAMDLSVTS--------GGLRSNEEVGFIFQTWMSKHGKTYTNALGDKE 64
Query: 71 KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
+RFQ FKDNLRFID+HN+ N +Y++GL +FADLT +EY+ ++ G R K++ ++ +
Sbjct: 65 QRFQNFKDNLRFIDQHNAKNLSYRLGLTQFADLTVQEYQDLFSG-RPIQKQKALR---VT 120
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
RY A D+LP+SVDWR+KGAV+ +KDQG C VE INKIVTGELISLSE
Sbjct: 121 HRYVPLAEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSE 170
Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK-VVSI 249
QELVDC N GCNGGLMD AFQF+I N G++ + DYPY + C+ ++ +K V+ I
Sbjct: 171 QELVDCSID-NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKI 229
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
DGYEDV +E SL+KAVA QP G++TG CG+ LDH VV VGYG
Sbjct: 230 DGYEDVPANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYG 272
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
TENG DYW+VRNSWG+ WGE GY K+ RN + TG CGIAM ASYP+KN
Sbjct: 273 TENGQDYWIVRNSWGTVWGEAGYAKIARN-FENPTGVCGIAMVASYPIKN 321
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 178/338 (52%), Positives = 227/338 (67%), Gaps = 12/338 (3%)
Query: 25 ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID 84
+D SI+ Y + D SS +D ++ +++ WLAKH K RF++FKDNL+ ID
Sbjct: 128 SDFSIVGY-SEEDLSS----NDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHID 182
Query: 85 EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES 144
+ N +Y +GLN+FADLT+EE++A YLG A R + S +Y + D+LP+S
Sbjct: 183 KVNREVTSYWLGLNEFADLTHEEFKATYLGLAPPAPARESR---GSFKYEDVSADDLPKS 239
Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGC 204
VDWR KGAV VK+QG CGSCWAFSTVAAVEGIN IVTG L +LSEQEL+DC N GC
Sbjct: 240 VDWRTKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGC 299
Query: 205 NGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC-DPSRRNAKVVSIDGYEDVSPFDEMSL 263
NGGLMDYAF +I +GG+ +E+ YPYL E C D + ++ V+I GYEDV +E +L
Sbjct: 300 NGGLMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQAL 359
Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGV--DYWLVRN 321
KA+A QPVSVAIEA GR FQ Y GVF G CG+ LDHGV AVGYG++ G DY +VRN
Sbjct: 360 IKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRN 419
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
SWG+ WGE GY++++R G CGI ASYP K+
Sbjct: 420 SWGAKWGEKGYIRMKRG-TGKGEGLCGINKMASYPTKD 456
>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 167/276 (60%), Positives = 211/276 (76%), Gaps = 20/276 (7%)
Query: 19 ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
+S ++AADMSI+SY R+++EV +Y W+A+HG T N +G E+RF+ F+D
Sbjct: 18 VSLAAAADMSIVSYGE--------RSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRD 69
Query: 79 NLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQR 132
NLR+ID+HN+ ++++GLN+FADLTNEEYR+ YLG R+ D +R+L S R
Sbjct: 70 NLRYIDQHNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SAR 123
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
Y DELPESVDWR+KGAV VKDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQE
Sbjct: 124 YQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQE 183
Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
LVDCD N GCNGGLMDYAF+FII NGG+DSE+DYPY +N+CD +++NAKVV+IDGY
Sbjct: 184 LVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGY 243
Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
EDV E SL+KAVA+QP+SVAIEAGGRAFQ Y+S
Sbjct: 244 EDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKS 279
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 167/315 (53%), Positives = 214/315 (67%), Gaps = 26/315 (8%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
D+++ +++W++KHGK M RF++F++NL IDE N +Y +GLN+FADL++
Sbjct: 43 DKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSH 102
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
EE+++ K +LPESVDWR+KGAV VK+QG+CGSC
Sbjct: 103 EEFKS-------------------------KDVADLPESVDWRKKGAVTHVKNQGACGSC 137
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFSTVAAVEGIN+IVTG L +LSEQEL+DCD N+GCNGGLMDYAF FI NGG+ E
Sbjct: 138 WAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKE 197
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
DYPYL E C+ + + +V+I GYEDV DE SL KA+A QP+SVAIEA GR FQ
Sbjct: 198 DDYPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQF 257
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y GVF G CG+ LDHGV AVGYG+ G+DY +V+NSWG WGE GY++++RN T G
Sbjct: 258 YSGGVFNGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRNTGKTE-G 316
Query: 346 KCGIAMEASYPVKNS 360
CGI ASYP K++
Sbjct: 317 LCGINKMASYPTKDN 331
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 163/309 (52%), Positives = 224/309 (72%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++GK EKRF+IFK+N+ +I+ +N+ N+ YK+ +N+FADLTNEE+
Sbjct: 57 HEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEF-- 114
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + + +P +VDWR+KGAV P+KDQG CG CWAFS
Sbjct: 115 --IAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 172
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + +G+LISLSEQELVDCD K ++ GC GGLMD AF+F+IQN G+++E +YP
Sbjct: 173 VAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYP 232
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + KC+ + VV+I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y+SG
Sbjct: 233 YKGVDGKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSG 292
Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG N G +YWLV+NSWG++WGE GY+++QR +D+ G CG
Sbjct: 293 VFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRG-VDSEEGLCG 351
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 352 IAMQASYPT 360
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 168/325 (51%), Positives = 219/325 (67%), Gaps = 11/325 (3%)
Query: 41 SWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNK 99
SW D + +YQ W+ +HGK N +KRFQIFK+N+ +I+ HN+ N ++ +GLNK
Sbjct: 27 SWHFIDPLWQVYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNK 86
Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
FADLTN E+R +Y+G RL + + + SVDWR+KG V +KDQ
Sbjct: 87 FADLTNSEFRGLYVG-------RLQRPAPFHEVGDIALVADTATSVDWRKKGGVTEIKDQ 139
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
G CGSCWAFS VAAVEG+ + TG L+SLSEQELVDCD +N GC+GG+MDYAFQ++I+N
Sbjct: 140 GDCGSCWAFSAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRN 199
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+ S+ +YPY CD + +I+G++ + P E L +AVA+QPVSVAIEAG
Sbjct: 200 GGITSQSNYPYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAG 259
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRN 338
G+ FQ Y SGVFTGECGS LDHGV VGYGT+ G YWLV+NSWGS WGE+GYV+++R
Sbjct: 260 GQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQ 319
Query: 339 LLDTNTGKCGIAMEASYPVKNSQNS 363
G CGI ++ASYP K Q +
Sbjct: 320 --GPGAGVCGINLDASYPTKIQQRT 342
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 172/332 (51%), Positives = 225/332 (67%), Gaps = 11/332 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
++D + +Y+ W H + + +RF +FK+N++FI E N + YK+ LNKF D
Sbjct: 32 SEDSLWNLYEKWRTHH-TVARDLDEKNRRFNVFKENVKFIHEFNQKKDAPYKLALNKFGD 90
Query: 103 LTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
+TN+E+R+ Y G++ R R ++ S Y G S+DWR KGAV VKDQG
Sbjct: 91 MTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYE-NVGSLPAASIDWRAKGAVTGVKDQG 149
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFST+A+VEGIN+I TGEL+SLSEQELVDCD N GCNGGLMDYAF+F IQ
Sbjct: 150 QCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEF-IQKN 208
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E YPY + C + N+ VVSIDG++DV +E +L +AVA+QP+SV+IEA G
Sbjct: 209 GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASG 268
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y GVFTG CG+ LDHGV VGYG T +G YW+V+NSWG +WGE+GY+++QR +
Sbjct: 269 YGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGI 328
Query: 340 LDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
D GKCGIAMEASYP+K S N PK S+
Sbjct: 329 SDKR-GKCGIAMEASYPIKTSAN---PKNSST 356
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 168/322 (52%), Positives = 221/322 (68%), Gaps = 13/322 (4%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLT 104
D++ ++ W KHGKT ++R QIFKDN F+ +HN + N TY + LN FADLT
Sbjct: 24 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLT 83
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
+ E++A LG A +M SK S + K +P+SVDWR+KGAV VKDQGSCG+
Sbjct: 84 HHEFKASRLGLSVSAPSVIMASKGQSLGGSVK----VPDSVDWRKKGAVTNVKDQGSCGA 139
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CW+FS A+EGIN+IVTG+LISLSEQEL+DCD+ NAGCNGGLMDYAF+F+I+N G+D+
Sbjct: 140 CWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDT 199
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E+DYPY + C + KVV+ID Y V DE +L +AVA QPVSV I RAFQ
Sbjct: 200 EKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQ 259
Query: 285 HYES-------GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
Y S G+F+G C ++LDH V+ VGYG++NGVDYW+V+NSWG WG +G++ +QR
Sbjct: 260 LYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQR 319
Query: 338 NLLDTNTGKCGIAMEASYPVKN 359
N +++ G CGI M ASYP+K
Sbjct: 320 NTENSD-GVCGINMLASYPIKT 340
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 164/309 (53%), Positives = 217/309 (70%), Gaps = 6/309 (1%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++TW+ ++G+ G EKRF+IFK+N+ FI+ +N+ N+ YK+G+N F DLTNEE+RA
Sbjct: 38 HKTWMTQYGRVYKGNVEKEKRFKIFKENVEFIESFNNNGNKPYKLGINAFTDLTNEEFRA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ G + S RY +P S+DWR KGAV +KDQG CG CWAFS
Sbjct: 98 SHNGYTMSMSSHQSSYRTKSFRYENVTA--VPPSLDWRTKGAVTHIKDQGQCGCCWAFSA 155
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGI K+ TG LISLSEQELVDCD ++ GC GGLMD AF+FII+N G+ +E +YP
Sbjct: 156 VAAMEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIENNGLTTEANYP 215
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + I GYE+V +DE +L+KAVA+QPVSVAI+AG AFQHY SG
Sbjct: 216 YEGVDGSCNTRKAANHAAKITGYENVPAYDEEALRKAVANQPVSVAIDAGESAFQHYSSG 275
Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
+FTG+CG+ LDHGV VGYGT ++G YWLV+NSWG+ WGE+GY++++R+ +D G CG
Sbjct: 276 IFTGDCGTELDHGVTVVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERD-IDAKEGLCG 334
Query: 349 IAMEASYPV 357
IAME SYP
Sbjct: 335 IAMEPSYPT 343
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 169/336 (50%), Positives = 232/336 (69%), Gaps = 17/336 (5%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNE----KRFQIFKDNLRFIDEHNSLN-RTYKVGLN 98
+++ + +Y+ W + + + S G ++ +RF +FK+N R++ E N + R +++ LN
Sbjct: 33 SEESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALN 92
Query: 99 KFADLTNEEYRAMYLGTRSDAKR-RLMKSKVASQRYACKAGD---ELPESVDWREKGAVN 154
KFAD+T +E+R Y G+R+ R +L +++ + + G LP +VDWR +GAV
Sbjct: 93 KFADMTTDEFRRTYAGSRTRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAVT 152
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQ 214
VKDQG CGSCWAFS +AAVEG+NKI+TG+L+SLSEQELVDCD N GC+GGLMDYAFQ
Sbjct: 153 GVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQ 212
Query: 215 FIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSV 274
+I +NGG+ +E +YPYL + C+ ++ + V+IDGYEDV +E +L+KAVA QPV+V
Sbjct: 213 YIQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAV 272
Query: 275 AIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYV 333
AIEA G+ FQ Y GVFTG CG+ LDHGV AVGYGT +G YW V+NSWG DWGE GY+
Sbjct: 273 AIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYI 332
Query: 334 KLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPH 369
++QR + D+ G CGIAME SYP K KP H
Sbjct: 333 RMQRGVPDSR-GLCGIAMEPSYPTK------KPAGH 361
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 169/326 (51%), Positives = 220/326 (67%), Gaps = 4/326 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+++ + +Y+ W +H + + +G +RF +FKDN+R I E N + YK+ LN+F D+
Sbjct: 40 SEEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLRLNRFGDM 98
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
T +E+R Y +R R + +LP +VDWREKGAV VKDQG CG
Sbjct: 99 TADEFRRAYASSRVSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQGQCG 158
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
SCWAFST+AAVEGIN I T L +LSEQ+LVDCD K NAGC+GGLMD AFQ+I ++GG+
Sbjct: 159 SCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGV 218
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+ YPY ++ C S ++ V+IDGYEDV E +LKKAVA+QPVSVAIEAGG
Sbjct: 219 AASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 278
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y GVF G+CG+ LDHGV AVGYGT +G YW+VRNSWG+DWGE GY++++R+ +
Sbjct: 279 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRD-VS 337
Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPK 367
G CGIAMEASYP+K S N A K
Sbjct: 338 AKEGLCGIAMEASYPIKTSPNPAPKK 363
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 168/315 (53%), Positives = 215/315 (68%), Gaps = 4/315 (1%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
E+ +++TW +HGKT R ++F+DN F+ EHNS N +Y + LN FADLT+
Sbjct: 25 EIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTH 84
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
E++A LG S A L + S R ++P SVDWR+ GAV VKDQG+CG+C
Sbjct: 85 HEFKASRLGLSSAASASLNVDR--SNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGAC 142
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
W+FS A+EGINKIVTG L+SLSEQELVDCD+ N GC GG+MDYAFQF+I N G+D+E
Sbjct: 143 WSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTE 202
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
+DYPY G + C+ + VV+IDGY DV +E L KAVA+QPVSV I RAFQ
Sbjct: 203 EDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQL 262
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y G+FTG C ++LDH V+ VGYG+ENGVDYW+V+NSWGS WG +GY+ +QRN ++ G
Sbjct: 263 YSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRN-SGSSRG 321
Query: 346 KCGIAMEASYPVKNS 360
CGI M ASYP K S
Sbjct: 322 LCGINMLASYPKKTS 336
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 166/323 (51%), Positives = 224/323 (69%), Gaps = 10/323 (3%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVG 96
S + + D + ++ W+ +GK + E R +IFK+N+ +I+ N+ N+ YK+G
Sbjct: 28 SRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLG 87
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N+FADLTNEE+ + +R+ K + S + + + +P +VDWR+KGAV PV
Sbjct: 88 INQFADLTNEEF----IASRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTPV 142
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQF 215
K+QG CG CWAFS VAA EGI+K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+F
Sbjct: 143 KNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKF 202
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
IIQN G+++E YPY G + C ++ + V+I GYEDV +E +L+KAVA+QP+SVA
Sbjct: 203 IIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVA 262
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVK 334
I+A G FQ Y+SGVFTG CG+ LDHGV AVGYG N G YWLV+NSWG+DWGE GY+K
Sbjct: 263 IDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIK 322
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
+QR +D G CGIAMEASYP
Sbjct: 323 MQRG-VDAAEGLCGIAMEASYPT 344
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 164/324 (50%), Positives = 227/324 (70%), Gaps = 5/324 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+++ + +Y+ W + H S + +RF +FK+NL+ I + N +R YK+ LNKFAD+
Sbjct: 32 SEESLWNLYERWRSHH-TVSRSLTEKNQRFNVFKENLKHIHKVNQKDRPYKLRLNKFADM 90
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TN E+ Y G++ R S+ + +A + LP S+DWR++GAV VKDQG CG
Sbjct: 91 TNHEFLQHYGGSKVSHYRMFHGSRRQTG-FAHENTSNLPSSIDWRKQGAVTGVKDQGKCG 149
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
SCWAFS+VAAVEGINKI TGELISLSEQELVDC+ +N GC+GGLM+ AF FI + GG+
Sbjct: 150 SCWAFSSVAAVEGINKIKTGELISLSEQELVDCN-SVNHGCDGGLMEQAFSFIEKTGGLT 208
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
+E +YPY + CD ++ N +V+IDGYE V DE +L +AVA+QPVS+AI+AGG+ F
Sbjct: 209 TENNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQDF 268
Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Q Y GV+TG+CG+ L+HGV VGYG T++G YW+V+NSWGS+WGENG++++QR D
Sbjct: 269 QFYSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQRE-NDV 327
Query: 343 NTGKCGIAMEASYPVKNSQNSAKP 366
G CGI +EASYP+K + +P
Sbjct: 328 EEGLCGITLEASYPIKQRSDIKQP 351
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 166/323 (51%), Positives = 224/323 (69%), Gaps = 10/323 (3%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVG 96
S + + D + ++ W+ +GK + E R +IFK+N+ +I+ N+ N+ YK+G
Sbjct: 28 SRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLG 87
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N+FADLTNEE+ + +R+ K + S + + + +P +VDWR+KGAV PV
Sbjct: 88 INQFADLTNEEF----IASRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTPV 142
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQF 215
K+QG CG CWAFS VAA EGI+K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+F
Sbjct: 143 KNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKF 202
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
IIQN G+++E YPY G + C ++ + V+I GYEDV +E +L+KAVA+QP+SVA
Sbjct: 203 IIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVA 262
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVK 334
I+A G FQ Y+SGVFTG CG+ LDHGV AVGYG N G YWLV+NSWG+DWGE GY+K
Sbjct: 263 IDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIK 322
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
+QR +D G CGIAMEASYP
Sbjct: 323 MQRG-VDAAEGLCGIAMEASYPT 344
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 172/302 (56%), Positives = 207/302 (68%), Gaps = 10/302 (3%)
Query: 73 FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM---KSKVA 129
F +FK N+R I E N + YK+ LN+F D+T +E+R Y G+R R + A
Sbjct: 70 FNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSSA 129
Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
S + ++P SVDWR+KGAV VKDQG CGSCWAFST+AAVEGIN I T L SLS
Sbjct: 130 SASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLS 189
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
EQ+LVDCD K NAGCNGGLMDYAFQ+I ++GG+ +E YPY + C S A VV+I
Sbjct: 190 EQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCKKS--PAPVVTI 247
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
DGYEDV DE +LKKAVA QPVSVAIEA G FQ Y GVF+G CG+ LDHGV AVGYG
Sbjct: 248 DGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYG 307
Query: 310 -TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
T +G YWLV+NSWG +WGE GY+++ R++ G CGIAMEASYPVK S N PK
Sbjct: 308 VTADGTKYWLVKNSWGPEWGEKGYIRMARDVA-AKEGHCGIAMEASYPVKTSPN---PKV 363
Query: 369 HS 370
H+
Sbjct: 364 HA 365
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 164/317 (51%), Positives = 224/317 (70%), Gaps = 9/317 (2%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFAD 102
DD + + W++++GK E RF+IFK+N+ +I+ N+ + ++YK+G+N+FAD
Sbjct: 32 DDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFAD 91
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LTNEE+ + +R+ K + S + + + + +P +VDWR+KGAV PVK+QG C
Sbjct: 92 LTNEEF----IASRNKFKGHMCSSIMRTTSFKYENVSGIPSTVDWRKKGAVTPVKNQGQC 147
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
G CWAFS VAA EGI+K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G
Sbjct: 148 GCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 207
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E YPY G + C+ ++ + + V+I GYEDV E +L+KAVA+QP+SVAI+A G
Sbjct: 208 LSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGS 267
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y+SGVFTG CG+ LDHGV AVGYG N G YWLV+NSWG+DWGE GY+ +QR +
Sbjct: 268 DFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRG-I 326
Query: 341 DTNTGKCGIAMEASYPV 357
+ G CGIAM+ASYP
Sbjct: 327 EAAEGICGIAMQASYPT 343
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 171/312 (54%), Positives = 221/312 (70%), Gaps = 11/312 (3%)
Query: 49 MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEE 107
M ++TW+A++G+ G E+R IFK+N+ FI+ N + + YK+ +N+FADLTNEE
Sbjct: 1 MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
++A G + A L S RY + +P ++DWR+KGAV P+KDQG CG CWA
Sbjct: 61 FQASRNGYKMSA--HLSSSSTKPFRYENVSA--VPSTMDWRKKGAVTPIKDQGQCGCCWA 116
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS VAA EGI ++ TG+LISLSEQELVDCD + GCNGGLMD AF FIIQN G+ +E
Sbjct: 117 FSAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEA 176
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
+YPY GA+ C+ + AK I GYEDV E +L KAVA+QPVSVAI+AGG AFQ Y
Sbjct: 177 NYPYQGADGACNSGKAAAK---ITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFY 233
Query: 287 ESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
SGVFTG+CG+ LDHGV AVGYG +++G YWLV+NSWG+ WGENGY++++R+ +D G
Sbjct: 234 SSGVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERD-IDAQEG 292
Query: 346 KCGIAMEASYPV 357
CGIAMEASYP
Sbjct: 293 LCGIAMEASYPT 304
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 161/309 (52%), Positives = 225/309 (72%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++GK EKRF++FK+N+ +I+ +N+ N++YK+G+N+FADLTN+E+
Sbjct: 39 HEQWMTRYGKVYKDPQEREKRFRVFKENVNYIEAFNNAANKSYKLGINQFADLTNKEF-- 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + + P +VDWR+KGAV P+KDQG CG CWAFS
Sbjct: 97 --IAPRNGFKGHMCSSIIRTTTFKFENVTATPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + G+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALSAGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEANYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + KC+ + +I GYEDV +EM+L+KAVA+QPVSVAI+A G FQ Y+SG
Sbjct: 215 YKGVDGKCNANEAAKNAATITGYEDVPANNEMALQKAVANQPVSVAIDASGSDFQFYKSG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG +++G +YWLV+NSWG++WGE GY+++QR +D+ G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSWGTEWGEEGYIRMQRG-VDSEEGLCG 333
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 334 IAMQASYPT 342
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 170/324 (52%), Positives = 224/324 (69%), Gaps = 29/324 (8%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEK-RFQIFKDNLRFIDEHNSLN----RTYKVGL 97
R D+EV +Y+TW ++HG+ +G+ + R ++F+DNLR+ID HN+ T+++GL
Sbjct: 42 RADEEVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGL 101
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
F DLT EE+RA LG + R VAS RY +AGD+LP++VDWR++GAV VK
Sbjct: 102 TPFTDLTLEEFRAHALGFLNSTLPR-----VASDRYLPRAGDDLPDAVDWRQQGAVTGVK 156
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
+Q CG CWAFS VAA+EGINKIVT LISLSEQEL+DCD + + GC GG M AFQF+I
Sbjct: 157 NQLDCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTE-DYGCQGGEMQKAFQFVI 215
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
NGG+D+E DYP++G CD R KVVSID YE+V DE +L+KAVA+QP
Sbjct: 216 DNGGIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP------ 269
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
G+F G CG LDHGV AVGYG++NG D+W+V+NSWG++WGE+GY++++R
Sbjct: 270 -----------GIFNGPCGFILDHGVTAVGYGSDNGEDFWIVKNSWGAEWGESGYIRMKR 318
Query: 338 NLLDTNTGKCGIAMEASYPVKNSQ 361
N+L GKCGIAM ASYPVKN +
Sbjct: 319 NVL-LPMGKCGIAMYASYPVKNGR 341
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 178/356 (50%), Positives = 232/356 (65%), Gaps = 12/356 (3%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
L+++ L+ + +D SI+ Y + D SS D ++ +++ WLAKH K
Sbjct: 5 LSVAVLLLCVGACVARNSDFSIVGY-SEEDLSSH----DRLVELFEKWLAKHQKAYASFE 59
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
RF++FKDNL+ IDE N +Y +GLN+FADLT++E++ YLG RR
Sbjct: 60 EKLHRFEVFKDNLKLIDEINREVTSYWLGLNEFADLTHDEFKTTYLGLSPPPARRSSSRS 119
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
RY A +LP++VDWR+KGAV VK+QG CGSCWAFSTVAAVEGIN IVTG L +
Sbjct: 120 F---RYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTA 176
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC-DPSRRNAKV 246
LSEQEL+DC N+GCNGG+MDYAF +I +GG+ +E+ YPYL E C D + ++
Sbjct: 177 LSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEA 236
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
VSI GYEDV DE +L KA+A QPVSVAIEA GR FQ Y GVF G CG+ LDHGV AV
Sbjct: 237 VSISGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAV 296
Query: 307 GYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
GYG++ G DY +V+NSWG WGE GY++++R + G CGI ASYP K++
Sbjct: 297 GYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRG-TGKSEGLCGINKMASYPTKDN 351
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 163/317 (51%), Positives = 220/317 (69%), Gaps = 10/317 (3%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFAD 102
DD + ++ W+ +GK EKR +IF +NL++I+ N+ N+ YK+G+N+FAD
Sbjct: 32 DDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNNKPYKLGINQFAD 91
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LTNEE+ + +R+ K + S + + + + +P +VDWR+KGAV PVK+QG C
Sbjct: 92 LTNEEF----IASRNKFKGHMCSSIIRTTTFKYE-NTSVPSTVDWRKKGAVTPVKNQGQC 146
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
G CWAFS +AA EGI+KI TG+L+SLSEQELVDCD ++ GC GGLMD AF+FIIQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E YPY G + C + + +I GYEDV +E +L+KAVA+QP+SVAI+A G
Sbjct: 207 ISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPISVAIDASGS 266
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y+SGVFTG CG+ LDHGV AVGYG N G YWLV+NSWG+DWGE GY+++QR+ +
Sbjct: 267 DFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRS-I 325
Query: 341 DTNTGKCGIAMEASYPV 357
D G CGIAM+ASYP
Sbjct: 326 DAAEGLCGIAMQASYPT 342
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 165/311 (53%), Positives = 217/311 (69%), Gaps = 8/311 (2%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
++ W +HGKT ++R QIFKDN F+ +HN + N TY + LN FADLT+ E++
Sbjct: 31 LFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFK 90
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
A LG A +M SK S ++P+SVDWR+KGAV VKDQGSCG+CW+FS
Sbjct: 91 ASRLGLSVSASSLIMASKGQSL----GGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
A+EGIN+IVTG+LISLSEQEL+DCD+ NAGCNGGLMDYAF+F+I+N G+D+E+DYP
Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE-- 287
Y + C + KVV+ID Y V DE +L++AVA QPVSV I RAFQ Y
Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266
Query: 288 SGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
SG+F+G C ++LDH V+ VGYG++NGVDYW+V+NSWG WG +G++ +QRN ++ G C
Sbjct: 267 SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSE-GIC 325
Query: 348 GIAMEASYPVK 358
GI M ASYP+K
Sbjct: 326 GINMLASYPIK 336
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/316 (51%), Positives = 223/316 (70%), Gaps = 9/316 (2%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS--LNRTYKVGLNKFADL 103
D++ ++ W++++GK EKRF+IF +N+ +I+ N N+ Y +G+N+FADL
Sbjct: 32 DDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADL 91
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TN+E+ + +R+ K + S + + + +P SVDWR+KGAV PVK+QG CG
Sbjct: 92 TNDEFTS----SRNKFKGHMCSSITRTSTFKYENASAIPSSVDWRKKGAVTPVKNQGQCG 147
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VAA EGI+K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+
Sbjct: 148 CCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGL 207
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
++E +YPY G + C+ ++ + V+I GYEDV +E +L+KAVA+QP+SVAI+A G
Sbjct: 208 NTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPISVAIDASGSD 267
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y+SGVFTG CG+ LDHGV AVGYG N G YWLV+NSWG++WGE GY+ +QR +D
Sbjct: 268 FQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEGYIMMQRG-VD 326
Query: 342 TNTGKCGIAMEASYPV 357
G CGIAM+ASYP
Sbjct: 327 AAEGLCGIAMQASYPT 342
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 162/310 (52%), Positives = 221/310 (71%), Gaps = 9/310 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS--LNRTYKVGLNKFADLTNEEYR 109
++ W+ +GK EKRF+IF +N+++I+ N+ N +YK+G+N+FADLTNEE+
Sbjct: 39 HERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTNEEFV 98
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
A +R+ K + S + + + + +P +VDWR+KGAV PVK+QG CG CWAFS
Sbjct: 99 A----SRNKFKGHMCSSIIRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFS 154
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
VAA EGI+K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+++E Y
Sbjct: 155 AVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQY 214
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY G + C+ ++ + + +I GYEDV +E +L+KAVA+QP+SVAI+A G FQ Y+S
Sbjct: 215 PYQGVDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKS 274
Query: 289 GVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
GVFTG CG+ LDHGV AVGYG N G YWLV+NSWG+DWGE GY+ +QR ++ G C
Sbjct: 275 GVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRG-VEAAEGLC 333
Query: 348 GIAMEASYPV 357
GIAM+ASYP
Sbjct: 334 GIAMQASYPT 343
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/316 (51%), Positives = 222/316 (70%), Gaps = 8/316 (2%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
DD + + W++++GK E RF+IF +N+ +++ N+ + ++YK+G+N+FADL
Sbjct: 32 DDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLGINQFADL 91
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TNEE+ A +R+ K + S + + + +P +VDWR+KGAV PVK+QG CG
Sbjct: 92 TNEEFVA----SRNKFKGHMCSSITRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCG 147
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VAA EGI+K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+
Sbjct: 148 CCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGL 207
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E YPY G + C+ ++ + + V+I GYEDV E +L+KAVA+QP+SVAI+A G
Sbjct: 208 STEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSD 267
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y+SGVFTG CG+ LDHGV AVGYG N G YWLV+NSWG+DWGE GY+ +QR ++
Sbjct: 268 FQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRG-VE 326
Query: 342 TNTGKCGIAMEASYPV 357
G CGIAM+ASYP
Sbjct: 327 AAEGLCGIAMQASYPT 342
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 160/309 (51%), Positives = 222/309 (71%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++GK EKRF+IFK+N+ +I+ +N+ N+ YK+ +N+FADLTNEE+
Sbjct: 39 HEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEF-- 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + + +P +VDWR+KGAV P+KDQG CG CWAFS
Sbjct: 97 --IAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + +G+LISLSEQELVDCD K ++ GC GGLMD AF+F+IQN G+++E +YP
Sbjct: 155 VAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + KC+ + +I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y+SG
Sbjct: 215 YKGVDGKCNVNEAANDAATITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSG 274
Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG N G +YWLV+NSWG++WGE GY+++QR +++ G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRG-VNSEEGLCG 333
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 334 IAMQASYPT 342
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 176/360 (48%), Positives = 243/360 (67%), Gaps = 23/360 (6%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ + + I L LF + AA S N H+ S R +D W+A++G
Sbjct: 1 MASVNQYRYI-CLALLFVL----AAWASHAKARNLHEASMYERHED--------WMAQYG 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+ G KR++IFKDN+ I+ N ++N++YK+ +N+FADLTNEE+RA +R+
Sbjct: 48 RVYKDAGEKSKRYKIFKDNVARIESFNKAMNKSYKLSINEFADLTNEEFRA----SRNRF 103
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K + ++ S +Y + +P +VDWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--EHVXAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161
Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TG+LISLSEQELVDCD + GC+GGLMD AF+FI QN G+ +E +YPY G + C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCN 221
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ I+GYEDV +E +L+KAVA QP++VAI+AGG FQ Y SGVFTG+CG+
Sbjct: 222 RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTE 281
Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++ + G CGIAM+ASYP
Sbjct: 282 LDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTEKE-GLCGIAMQASYPT 340
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 164/309 (53%), Positives = 217/309 (70%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
+Q W+ ++ K N EKRFQIFK+N+ +I+ N R YK+G+N+F DLTNEE+
Sbjct: 39 HQQWMGQYAKIYNDHQEWEKRFQIFKENVNYIETSNKEGGRFYKLGVNQFVDLTNEEF-- 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + Y + +P +VDWR+KGAV PVKDQG CG CWAFS
Sbjct: 97 --IAPRNRFKGHMCSSIIRTNTYKYENVTTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+++ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+D+E YP
Sbjct: 155 VAATEGIHQLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + + +I YEDV +E +L+KAVA+QP+SVAI+A G FQ Y SG
Sbjct: 215 YQGVDGTCNANEASINAATITSYEDVPTNNEQALQKAVANQPISVAIDASGSDFQFYTSG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG +++G YWLV+NSWG+ WGE GY+++QR +D G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSDDGTKYWLVKNSWGTSWGEEGYIRMQRG-VDAVEGLCG 333
Query: 349 IAMEASYPV 357
IAM+ASYP+
Sbjct: 334 IAMQASYPI 342
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 337 bits (864), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 162/317 (51%), Positives = 219/317 (69%), Gaps = 10/317 (3%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFAD 102
DD + ++ W+ +GK EKR +IF +NL++I+ N+ + YK+G+N+FAD
Sbjct: 32 DDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNKKPYKLGINQFAD 91
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LTNEE+ + +R+ K + S + + + + +P +VDWR+KGAV PVK+QG C
Sbjct: 92 LTNEEF----IASRNKFKGHMCSSIIRTTTFKYE-NTSVPSTVDWRKKGAVTPVKNQGQC 146
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
G CWAFS +AA EGI+KI TG+L+SLSEQELVDCD ++ GC GGLMD AF+FIIQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E YPY G + C + + +I GYEDV +E +L+KAVA+QP+SVAI+A G
Sbjct: 207 ISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPISVAIDASGS 266
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y+SGVFTG CG+ LDHGV AVGYG N G YWLV+NSWG+DWGE GY+++QR+ +
Sbjct: 267 DFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRS-I 325
Query: 341 DTNTGKCGIAMEASYPV 357
D G CGIAM+ASYP
Sbjct: 326 DAAEGLCGIAMQASYPT 342
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 337 bits (864), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 164/312 (52%), Positives = 221/312 (70%), Gaps = 10/312 (3%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNE 106
++ ++ W+A+HG+ M EKR+ IFK+N+ I+ +N +R YK+G+NKFADLTNE
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
E+RAMY G + + SK+ S + + ++P S+DWR GAV PVKDQG+CG CW
Sbjct: 61 EFRAMYHGYKRQS------SKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCW 114
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
AFSTVAA+EGI K+ TG LISLSEQ+LVDC N GC GGLMD AFQ+II+NGG+ SE
Sbjct: 115 AFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAG-NKGCQGGLMDTAFQYIIRNGGLTSED 173
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
+YPY G + C + + I GYEDV +E +L +AVA QPVSVA++ GG F+ Y
Sbjct: 174 NYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFY 233
Query: 287 ESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
+SGVF G+CG+ L+HGV A+GYGT+ +G DYWLV+NSWG+ WGE+GY ++QR + + G
Sbjct: 234 KSGVFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRG-IGASEG 292
Query: 346 KCGIAMEASYPV 357
CG+AM+ASYP
Sbjct: 293 LCGVAMDASYPT 304
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 337 bits (863), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 177/358 (49%), Positives = 221/358 (61%), Gaps = 36/358 (10%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA + + + T+ I S A D SI+ Y H S T+ ++++W++KHG
Sbjct: 1 MAPSVSSIFLFTIFTSLVICSVVAHDFSIVGYSPEHLTSMHKLTE-----LFESWMSKHG 55
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
KT + R ++FKDNL ID N TY + LN+FADL++EE+
Sbjct: 56 KTYESIEEKLHRLEVFKDNLMHIDRRNRDVTTYWLALNEFADLSHEEF------------ 103
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
KSK+A R EKGAV PVK+QGSCGSCWAFSTVAAVEGIN+I
Sbjct: 104 ----KSKLAQIRRL--------------EKGAVAPVKNQGSCGSCWAFSTVAAVEGINQI 145
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG L SLSEQEL+DCD N+GCNGGLMDYAF +I+ NGG+ E+DYPYL E CD
Sbjct: 146 VTGNLTSLSEQELIDCDTSFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYLMEEGTCDEK 205
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
R +VV+I GY DV +E SL KA+A QP+S+AIEA GR FQ Y GVF G CG+ LD
Sbjct: 206 REEMEVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQFYGRGVFNGPCGTDLD 265
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
HGV AVGYG+ G+DY +V+NSWG WGE GY++++RN G CGI ASYP K
Sbjct: 266 HGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPTK 322
>gi|42563538|gb|AAS20467.1| cysteine protease-like protein [Pelargonium x hortorum]
Length = 234
Score = 336 bits (862), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 154/197 (78%), Positives = 172/197 (87%)
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CG CWAFST+AAVEGIN IVTGELISLSEQELVDCDR N GCNGGLMDYAF+FII+NGG
Sbjct: 1 CGRCWAFSTIAAVEGINHIVTGELISLSEQELVDCDRSYNQGCNGGLMDYAFEFIIKNGG 60
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+DSE+DYPY + CDP R+NAKVV+IDGYEDV DE SLKKAVA QPVSVAIEAGGR
Sbjct: 61 IDSEEDYPYKAVDGTCDPIRKNAKVVTIDGYEDVPENDENSLKKAVAYQPVSVAIEAGGR 120
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y+SG+FTG CG+ALDHGV AVGYGTENG+DYW+VRNSWGS WGENGY++++RN+
Sbjct: 121 EFQLYQSGIFTGRCGTALDHGVAAVGYGTENGIDYWIVRNSWGSSWGENGYIRMERNVKT 180
Query: 342 TNTGKCGIAMEASYPVK 358
T TGKCGIAMEASYP K
Sbjct: 181 TKTGKCGIAMEASYPTK 197
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 164/313 (52%), Positives = 221/313 (70%), Gaps = 10/313 (3%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNE 106
++ ++ W+A+HG+ M EKR+ IFK+N+ I+ +N +R YK+G+NKFADLTNE
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
E+RAM+ G + + SK+ S + + +P S+DWR+ GAV PVKDQG+CG CW
Sbjct: 61 EFRAMHHGYKRQS------SKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCW 114
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSE 225
AFS VAA+EGI K+ TG+LISLSEQ+LVDCD K ++ GC GGLMD AFQFI++NGG+ SE
Sbjct: 115 AFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSE 174
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
YPY G + C + + I GYEDV +E +L +AVA QPVSVA+E GG FQ
Sbjct: 175 ATYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQF 234
Query: 286 YESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y+SGVF G+CG+ LDH V A+GYGT +G +YWLV+NSWG+ WGE+GY+++QR +
Sbjct: 235 YKSGVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRG-IGARE 293
Query: 345 GKCGIAMEASYPV 357
G CG+AM+ASYP
Sbjct: 294 GLCGVAMDASYPT 306
>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
Precursor
gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
Length = 346
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 158/233 (67%), Positives = 192/233 (82%), Gaps = 1/233 (0%)
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
SK S RY K GD LPES+DWREKG + VKDQGSCGSCWAFS VAA+E IN IVTG L
Sbjct: 3 SKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNL 62
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
ISLSEQELVDCDR N GC+GGLMDYAF+F+I+NGG+D+E+DYPY CD R+NAK
Sbjct: 63 ISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAK 122
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VV ID YEDV +E +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVV
Sbjct: 123 VVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVI 182
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GYGTENG+DYW+VRNSWG++ ENGY+++QRN + +++G CG+A+E SYPVK
Sbjct: 183 AGYGTENGMDYWIVRNSWGANCRENGYLRVQRN-VSSSSGLCGLAIEPSYPVK 234
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 164/312 (52%), Positives = 215/312 (68%), Gaps = 7/312 (2%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
++++W +HGKT RF+IF++N F+ +HNS N +Y + LN FADLT+ E++
Sbjct: 31 LFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFK 90
Query: 110 AMYLGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
A LG + +L + + GD +P S+DWR+KGAV+ VKDQG+CG+CW+F
Sbjct: 91 ASRLGLSAFSTSGKLSRRNFPLHDFV---GD-VPISIDWRKKGAVSQVKDQGNCGACWSF 146
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
S A+EGINKIVTG L+SLSEQELVDCDR N GC GGLMDYA+QF+I+N G+D+E+DY
Sbjct: 147 SATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDY 206
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY E C+ + VV+IDGY DV +E L KAVA QPVSV I RAFQ Y
Sbjct: 207 PYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSK 266
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
G+FTG C ++LDH V+ VGYG+ENGVDYW+V+NSWG+ WG NGY+ + RN ++ G CG
Sbjct: 267 GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQ-GLCG 325
Query: 349 IAMEASYPVKNS 360
I M AS+PVK S
Sbjct: 326 INMLASFPVKTS 337
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 163/308 (52%), Positives = 216/308 (70%), Gaps = 7/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+ K+G+ E+RF+IF++N+ FI+ N NR YK+ +N+FADLTNEE++A
Sbjct: 38 HEMWMVKYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDINEFADLTNEEFKA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G + + L S+ +S RY +P S+DWR+KGAV P+KDQG CG CWAFS
Sbjct: 98 SRNGYKRSSNVGL--SEKSSFRYGNVTA--VPTSMDWRQKGAVTPIKDQGQCGCCWAFSA 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGI K+ TG+LISLSEQELVDCD + GC GGLMD AF+FI QNGG+ +E +YP
Sbjct: 154 VAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ ++ I GYEDV E +L KAVA QPVSVAI+A G AFQ Y G
Sbjct: 214 YQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGG 273
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
VFTG+CG+ LDHGV AVGYGT +G YWLV+NSWG+ WGE+GY++++R+ ++ G CGI
Sbjct: 274 VFTGDCGTELDHGVTAVGYGTSDGTKYWLVKNSWGTSWGEDGYIRMERD-IEAKEGLCGI 332
Query: 350 AMEASYPV 357
AM++SYP
Sbjct: 333 AMQSSYPT 340
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 173/339 (51%), Positives = 220/339 (64%), Gaps = 8/339 (2%)
Query: 23 SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRF 82
S + SI+ Y + D +S D ++ +++ W+AK+ K +RF++FKDNL
Sbjct: 27 SGGEFSIVGY-SEEDLASH----DRLIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNH 81
Query: 83 IDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDEL 141
ID+ N +Y +GLN+FADLT++E++A YLG R K + + RY + E+
Sbjct: 82 IDDINKKVTSYWLGLNEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEV 141
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
P+ +DWR+K AV VK+QG CGSCWAFSTVAAVEGIN IVTG L SLSEQEL+DC N
Sbjct: 142 PKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGN 201
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF +I GG+ +E+ YPY E CD + A VV+I GYEDV DE
Sbjct: 202 NGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDCDEG-KGAAVVTISGYEDVPANDEQ 260
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L KA+A QPVSVAIEA GR FQ Y GVF G CG LDHGV AVGYGT G DY +V+N
Sbjct: 261 ALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSKGQDYIIVKN 320
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
SWG WGE GY++++R G CGI ASYP K++
Sbjct: 321 SWGPHWGEKGYIRMKRG-TGKGEGLCGINKMASYPTKDN 358
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 171/322 (53%), Positives = 223/322 (69%), Gaps = 11/322 (3%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
E+ ++ W+AKHGK +RFQIFK N+ FI+ N+ N++Y +G+NKFADLTN
Sbjct: 34 EMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKSYMLGINKFADLTN 93
Query: 106 EEYRAMYLGTRSDAKRRLMKS-KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
EE+RA + G KR L S K+ +Y + LP S+DWR KGAV P+KDQG CGS
Sbjct: 94 EEFRAFWNGY----KRPLGASRKITPFKY--ENVTALPSSIDWRSKGAVTPIKDQGVCGS 147
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMD 223
CWAFS VAA EGI+K+ TG+L+SLSEQELVDCD K + GC GGLM AF+FI ++GGM
Sbjct: 148 CWAFSAVAATEGIHKLRTGKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKFIKRHGGMT 207
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
SE +YPY G + KCD + ++ V I GY+ V E +L KAVA+QPVSVAI+AG +F
Sbjct: 208 SEANYPYQGRDGKCDTKKEASRAVKITGYQAVPKNSEAALLKAVANQPVSVAIDAGSLSF 267
Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Q Y SG+FTG CG ++HGV AVGYG N G YW+V+NSWG++WGE GY++++R+ + +
Sbjct: 268 QFYRSGIFTGICGKDINHGVAAVGYGRSNSGSKYWIVKNSWGTEWGEKGYIRMKRD-VRS 326
Query: 343 NTGKCGIAMEASYPVKNSQNSA 364
G CGIAME SYP Q S+
Sbjct: 327 KEGLCGIAMECSYPTAQVQASS 348
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 176/360 (48%), Positives = 242/360 (67%), Gaps = 23/360 (6%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ + + I L LFF+ AA S + N + S R +D W+A++G
Sbjct: 1 MASVNQYQYI-CLALLFFL----AAWASQATARNLLEASMYERHED--------WMAQYG 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+ KR++IFKDN+ I+ N +++++YK+ +N+FADLTNEE+RA +R+
Sbjct: 48 RVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA----SRNRF 103
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K + ++ S +Y A +P +VDWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKYEHVAA--VPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161
Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TG+LISLSEQELVDCD + GCNGGLMD AF+FI QN G+ +E +YPY G + C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCN 221
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ I+GYEDV +E +L+KAVA QP++VAI+AGG FQ Y SGVFTG+CG+
Sbjct: 222 RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTE 281
Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++ G CGIAM+ASYP
Sbjct: 282 LDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEVGYIRMQRDVT-AKEGLCGIAMQASYPT 340
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 162/312 (51%), Positives = 220/312 (70%), Gaps = 10/312 (3%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNE 106
++ ++ W+A+HG+ M EKR+ IFK+N+ I+ +N +R YK+G+NKFADLTNE
Sbjct: 36 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 95
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
E+RAMY G + + SK+ S + + ++P S+DWR GAV PVKDQG+CG CW
Sbjct: 96 EFRAMYHGYKRQS------SKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCW 149
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
AFSTVAA+EGI K+ TG LISLSEQ+LVDC N GC GGLMD AFQ+II+NGG+ SE
Sbjct: 150 AFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAG-NKGCQGGLMDTAFQYIIRNGGLTSED 208
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
+YPY G + C + + I GYEDV +E +L +AVA QPVSV ++ GG FQ Y
Sbjct: 209 NYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVGVDGGGNDFQFY 268
Query: 287 ESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
+SGVF G+CG+ +H V A+GYGT+ +G DYWLV+NSWG+ WGENGY++++R + ++ G
Sbjct: 269 KSGVFNGDCGTQQNHAVTAIGYGTDIDGTDYWLVKNSWGTSWGENGYMRMRRG-IGSSEG 327
Query: 346 KCGIAMEASYPV 357
CG+AM+ASYP
Sbjct: 328 LCGVAMDASYPT 339
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 334 bits (856), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 160/322 (49%), Positives = 226/322 (70%), Gaps = 9/322 (2%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID-EHNSLNRTYKVGL 97
SS D + ++ W+A++G+ + EKRF IFK+N+ +I+ +N+ ++ YK+G+
Sbjct: 26 SSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNYIEASNNAGDKPYKLGV 85
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
N+FADLTNEE+ + TR+ K + S + + + P +VDWR++GAV PVK
Sbjct: 86 NQFADLTNEEF----IATRNKFKGHMSSSITRTTTFKYE-NVTAPSTVDWRQEGAVTPVK 140
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFI 216
+QG+CG CWAFS VAA EGI+K+ TG L+SLSEQELVDCD + GC GGLMD AF+FI
Sbjct: 141 NQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAFKFI 200
Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
IQNGG+++E YPY G + C+ + V +I GYEDV +E +L++AVA+QP+S+AI
Sbjct: 201 IQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPSNNEQALQQAVANQPISIAI 260
Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKL 335
+A G FQ+Y+SGVFTG CG+ LDHGV VGYG +++G YWLV+NSWG+DWGE GY+++
Sbjct: 261 DASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLVKNSWGADWGEEGYIRM 320
Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
QR+ +D G CG+AM+ SYP
Sbjct: 321 QRD-VDAPEGLCGLAMQPSYPT 341
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 334 bits (856), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 170/366 (46%), Positives = 240/366 (65%), Gaps = 34/366 (9%)
Query: 1 MATASMFLAISTLVFL------FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQT 54
MAT + F IS + L F +SS + D S+ H+ ++
Sbjct: 1 MATKNQFYQISFALVLCLGLWAFQVSSRTLQDASM------HER-------------HEQ 41
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFID-EHNSLNRTYKVGLNKFADLTNEEYRAMYL 113
W+A++GK + EKRF IF++N+++I+ +N+ N+ YK+G+N+F DLTN+E+ +
Sbjct: 42 WMARYGKVYKDLQEKEKRFNIFQENVKYIEASNNAGNKPYKLGVNQFTDLTNKEF----I 97
Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
TR+ K + S + + + P +VDWR++GAV PVK+QG+CG CWAFS VAA
Sbjct: 98 ATRNKFKGHMSSSITRTTTFKYE-NVTAPSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAA 156
Query: 174 VEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
EGI+K+ TG L+SLSEQELVDCD + GC GGLMD AF+FIIQNGG+++E YPY G
Sbjct: 157 TEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQG 216
Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
+ C+ + V +I GYEDV +E +L++AVA+QP+SVAI+A G FQ+Y+SGVFT
Sbjct: 217 VDGTCNTNEEVTHVATITGYEDVPSNNEQALQQAVANQPISVAIDASGSDFQNYQSGVFT 276
Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
G CG+ LDHGV VGYG +++G YWLV+NSWG DWGE GY+++QR+ ++ G CGIAM
Sbjct: 277 GSCGTQLDHGVAVVGYGVSDDGTKYWLVKNSWGEDWGEEGYIRMQRD-VEAPEGLCGIAM 335
Query: 352 EASYPV 357
+ SYP
Sbjct: 336 QPSYPT 341
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 161/309 (52%), Positives = 215/309 (69%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+ GK E+RF+IFKDN+ +I+ N+ N+ YK+ +NKFADLTNEE +
Sbjct: 38 HEQWMETFGKVYADAAEKERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFADLTNEELKV 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G R + R MK V S +Y +P ++DWR+KGAV P+KDQG CGSCWAFST
Sbjct: 98 ARNGYRRPLQTRPMK--VTSFKYENVTA--VPATMDWRKKGAVTPIKDQGQCGSCWAFST 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGIN++ TG+L+SLSEQELVDCD + + GC GGLM+ F+FII+N G+ +E +YP
Sbjct: 154 VAATEGINQLTTGKLVSLSEQELVDCDTQGEDQGCEGGLMEDGFEFIIKNHGITTEANYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y A+ C+ + +++ I GYE V E +L KAVA QP+SV+I+AGG FQ Y SG
Sbjct: 214 YQAADGTCNSKKEASRIAKITGYESVPANSEAALLKAVASQPISVSIDAGGSDFQFYSSG 273
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG+CG+ LDHGV AVGYG T +G YWLV+NSWG+ WGE GY+++QR+ + G CG
Sbjct: 274 VFTGQCGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRD-TEAEEGLCG 332
Query: 349 IAMEASYPV 357
IAM++SYP
Sbjct: 333 IAMDSSYPT 341
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 333 bits (854), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 165/327 (50%), Positives = 220/327 (67%), Gaps = 5/327 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFAD 102
+D+ + +Y+ W +H G +RF FKDN+R+I EHN R Y++ LN+F D
Sbjct: 38 SDEALWDLYERW-QEHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRGGRGYRLRLNRFGD 96
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
+ EE+RA + G+ ++ RR + + + +LP +VDWR KGAV VKDQG C
Sbjct: 97 MGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKC 156
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD N+GC GGLM+ AF++I +GG+
Sbjct: 157 GSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGI 216
Query: 223 DSEQDYPYLGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+E YPY A CD R R A +V IDG+++V E +L KAVA+QPVSVAI+AG +
Sbjct: 217 TTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQ 276
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
+FQ Y GVF G+CG+ LDHGV VGYG T +G +YW+V+NSWG+ WGE GY+++QR+
Sbjct: 277 SFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRD-S 335
Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPK 367
+ G CGIAMEASYPVK S N P+
Sbjct: 336 GYDGGLCGIAMEASYPVKFSPNRVTPR 362
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 333 bits (854), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 177/356 (49%), Positives = 222/356 (62%), Gaps = 26/356 (7%)
Query: 24 AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
+ D SI+ Y + D SS + + +++ WL++H + + +RFQ+FKDNL I
Sbjct: 36 SGDFSIVGY-SEEDLSS----HESLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHI 90
Query: 84 DEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD---- 139
DE N +Y +GLN+FADLT++E++A YLG RS + + +
Sbjct: 91 DETNRKVSSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDG 150
Query: 140 -ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
LP+SVDWR KGAV VK+QG CGSCWAFSTVAAVEGIN+IVTG L +LSEQEL+DCD
Sbjct: 151 ASLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDT 210
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR--------------NA 244
N GCNGGLMDYAF +I NGG+ +E+ YPYL E C S +A
Sbjct: 211 DGNNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDA 270
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
VV+I GYEDV +E +L KA+A QPVSVAIEA GR FQ Y GVF G CG+ LDHGV
Sbjct: 271 AVVTISGYEDVPRNNEQALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVA 330
Query: 305 AVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
AVGYGT G DY +V+NSWG WGE GY++++R G CGI ASYP KN
Sbjct: 331 AVGYGTAAKGHDYIIVKNSWGPSWGEKGYIRMRRG-TGKRQGLCGINKMASYPTKN 385
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 333 bits (854), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 162/308 (52%), Positives = 219/308 (71%), Gaps = 10/308 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W++++GK EKRF IFKDN+ FI+ N+ N+ YK+ +N ADLT +E++A
Sbjct: 40 HEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKA 99
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K+ + + A+ + + +PE+VDWR KGAV P+KDQG CGSCWAFST
Sbjct: 100 ----SRNGYKK--IDREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAFST 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGIN+I TG+LISLSEQELVDCD K + GC GGLM+ F+FII+NGG+ SE +YP
Sbjct: 154 VAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETNYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y A+ C+ + A V I GYE V E+SL KAVA+QP+SV+I+A +F Y SG
Sbjct: 214 YKAADGSCN-TATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYSSG 272
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
++TGECG+ LDHGV AVGYG+ NG DYW+V+NSWG+ WGE GY+++QR + D G CGI
Sbjct: 273 IYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE-GLCGI 331
Query: 350 AMEASYPV 357
AM++SYP
Sbjct: 332 AMDSSYPT 339
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 162/323 (50%), Positives = 221/323 (68%), Gaps = 10/323 (3%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVG 96
S + + D + ++ W+ +GK + E R +IFK+N+ +I+ N+ N+ YK+G
Sbjct: 28 SRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLG 87
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N+FAD+TNEE+ + +R+ K + S + + + +P +VDWR+KGAV PV
Sbjct: 88 INQFADITNEEF----IASRNKFKGHMCSSITKTSTFKYENA-SVPSTVDWRKKGAVTPV 142
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQF 215
K+QG CG CWAFS VAA EGI+K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+F
Sbjct: 143 KNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKF 202
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
IIQN G+ +E YPY G + C + + +I GYEDV +E +L+KAVA+QP+SVA
Sbjct: 203 IIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISVA 262
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVK 334
I+A G FQ Y+SGVFTG CG+ LDHGV AVGYG N G YWLV+NSWG+DWGE GY++
Sbjct: 263 IDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYIR 322
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
+QR+ +D G CGIAM ASYP
Sbjct: 323 MQRS-VDAAQGLCGIAMMASYPT 344
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 162/308 (52%), Positives = 218/308 (70%), Gaps = 10/308 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W++++GK EKRF IFKDN+ FI+ N+ N+ YK+ +N ADLT +E++A
Sbjct: 40 HEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKA 99
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K+ + + A+ + + +PE+VDWR KGAV P+KDQG CGSCWAFST
Sbjct: 100 ----SRNGYKK--IDREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAFST 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGIN+I TG+LISLSEQELVDCD K + GC GGLM+ F+FII+NGG+ SE +YP
Sbjct: 154 VAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETNYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y A+ C + A V I GYE V E+SL KAVA+QP+SV+I+A +F Y SG
Sbjct: 214 YKAADGSCSAAT-TAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYSSG 272
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
++TGECG+ LDHGV AVGYG+ NG DYW+V+NSWG+ WGE GY+++QR + D G CGI
Sbjct: 273 IYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE-GLCGI 331
Query: 350 AMEASYPV 357
AM++SYP
Sbjct: 332 AMDSSYPT 339
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 158/308 (51%), Positives = 217/308 (70%), Gaps = 12/308 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++GK +KRFQIFKDN+ FI+ N+ N+ YK+G+N ADLT EE++A
Sbjct: 38 HEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNHLADLTVEEFKA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ KR + ++ + + +P ++DWR KGAV P+KDQG CGSCWAFST
Sbjct: 98 ----SRNGFKR---PHEFSTTTFKYENVTAIPAAIDWRTKGAVTPIKDQGQCGSCWAFST 150
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
+AA EGI++I TG+L+SLSEQELVDCD K ++ GC GG M+ F+FII+NGG+ SE +YP
Sbjct: 151 IAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSETNYP 210
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + KC+ + + V I GYE V P E +L+KAVA+QPVSV+I+A G F Y SG
Sbjct: 211 YKAVDGKCN--KATSPVAQIKGYEKVPPNSETALQKAVANQPVSVSIDADGAGFMFYSSG 268
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
++ GECG+ LDHGV AVGYGT NG DYW+V+NSWG+ WGE GYV++QR + + G CGI
Sbjct: 269 IYNGECGTELDHGVTAVGYGTANGTDYWIVKNSWGTQWGEKGYVRMQRGIAAKH-GLCGI 327
Query: 350 AMEASYPV 357
A+++SYP
Sbjct: 328 ALDSSYPT 335
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 158/307 (51%), Positives = 217/307 (70%), Gaps = 5/307 (1%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ W+A++G+ + + +R ++FK N+ FI+ N+ N + + N+FAD+T +E+RAM
Sbjct: 33 HEQWMARYGRVYSDVAEKARRLEVFKANVGFIESVNAGNHKFWLEANQFADITKDEFRAM 92
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
+ G + K++ RYA + D+LP SVDWR GAV PVKDQG CG CWAFSTV
Sbjct: 93 HKGYKMQVIGS--KARATGFRYANVSIDDLPASVDWRANGAVTPVKDQGQCGCCWAFSTV 150
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
A++EGI K+ TG+LISLSEQELVDCD + N GC GGLMD AF+FI+ NGG+D+E DYPY
Sbjct: 151 ASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNNGGLDTEADYPY 210
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
GA+ C+ ++ + SI GYEDV DE SL+KAVA QPVS+A++ G F+ Y+ GV
Sbjct: 211 TGADGTCNSNKESNIAASIKGYEDVPANDEASLQKAVAAQPVSIAVDGGDDLFRFYKGGV 270
Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
TG CG+ LDHGV AVGYG +G YWLV+NSWG+ WGE+G+++L+R++ D G CG+
Sbjct: 271 LTGACGTELDHGVAAVGYGVAGDGTKYWLVKNSWGTSWGEDGFIRLERDVAD-EAGMCGL 329
Query: 350 AMEASYP 356
AM+ SYP
Sbjct: 330 AMKPSYP 336
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 176/350 (50%), Positives = 228/350 (65%), Gaps = 16/350 (4%)
Query: 19 ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
++ + +++SI+ Y + D +S R +M +++ ++AK+ K + + +RF++FKD
Sbjct: 24 VAVAMPSELSIVGY-SEEDLASHER----LMELFEKFMAKYRKAYSSLEEKLRRFEVFKD 78
Query: 79 NLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG 138
NL IDE N Y +GLN+FADLT++E++A YLG RR ++ RY
Sbjct: 79 NLNHIDEENKKITGYWLGLNEFADLTHDEFKAAYLGLTLTPARRNSNDQLF--RYEEVEA 136
Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
LP+ VDWR+KGAV VK+QG CGSCWAFSTVAAVEGIN IVTG L LSEQEL+DCD
Sbjct: 137 ASLPKEVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDT 196
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC-------DPSRRNAKVVSIDG 251
N GC+GGLMDYAF +I NGG+ +E+ YPYL E C D A V+I G
Sbjct: 197 DGNNGCSGGLMDYAFSYIAANGGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISG 256
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
YEDV +E +L KA+A QPVSVAIEA GR FQ Y GVF G CG+ LDHGV AVGYGT
Sbjct: 257 YEDVPRNNEQALLKALAHQPVSVAIEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTA 316
Query: 312 N-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
+ G DY +V+NSWGS WGE GY++++R + G CGI ASYP KN+
Sbjct: 317 SKGHDYIIVKNSWGSHWGEKGYIRMRRG-TGKHDGLCGINKMASYPTKNA 365
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 163/309 (52%), Positives = 217/309 (70%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+AK+G+ E+RF+IF++N+ FI+ N L NR YK+ +N+FADLTNEE++
Sbjct: 38 HEMWMAKYGRVYKDNSEKERRFEIFRNNVEFIESFNKLGNRPYKLDINEFADLTNEEFKV 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G + + L ++ +S RYA +P S+DWR+ GAV P+KDQG CG CWAFS
Sbjct: 98 SKNGYKRSSGVGL--TEKSSFRYANVTA--VPTSMDWRQNGAVTPIKDQGQCGCCWAFSA 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGI K+ TG+LISLSEQELVDCD + GC GGLMD AF+FI QNGG+ +E +YP
Sbjct: 154 VAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ ++ I GYEDV E +L KAVA QPVSVAI+A G AFQ Y G
Sbjct: 214 YQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGG 273
Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG+CG+ LDHGV AVGYGT ++G YWLV+NSWG+ WGE+GY++++R+ ++ G CG
Sbjct: 274 VFTGDCGTELDHGVTAVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERD-IEAKEGLCG 332
Query: 349 IAMEASYPV 357
IAM+ SYP
Sbjct: 333 IAMQPSYPT 341
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 173/360 (48%), Positives = 241/360 (66%), Gaps = 23/360 (6%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ + + I L LF + AA S + N H+ S R +D W+ ++G
Sbjct: 1 MASVNQYQYI-CLALLFVL----AAWASQATARNLHEASMYERHED--------WMVQYG 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+ KR++IFKDN+ I+ N +++++YK+ +N+FADLTNEE+RA +R+
Sbjct: 48 REYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA----SRNRF 103
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K + ++ S +Y + +P +VDWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161
Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TG+LISLSEQELVDCD + GC+GGLMD AF+FI QN G+ +E +YPY G + C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCN 221
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ I+GYEDV +E +L+KAVA QP++VAI+AGG FQ Y SGVFTG+CG+
Sbjct: 222 RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTE 281
Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++ G CGIAM+ASYP
Sbjct: 282 LDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 340
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 332 bits (850), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 168/328 (51%), Positives = 220/328 (67%), Gaps = 8/328 (2%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
+D+ + +Y+ W H + G +RF FK+N RFI HN +R Y++ LN+F D
Sbjct: 34 SDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGD 92
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
+ EE+R+ + +R + RR + A + +LP SVDWR+KGAV VK+QG C
Sbjct: 93 MGREEFRSGFADSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRC 152
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFSTV AVEGIN I TG L+SLSEQEL+DCD N GC GGLM+ AF+FI +GG+
Sbjct: 153 GSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTDEN-GCQGGLMENAFEFIKSHGGI 211
Query: 223 DSEQDYPYLGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+E YPY + CD +R R +VV+IDG++ V E +L KAVA QPVSVAI+AGG+
Sbjct: 212 TTESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQ 271
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
A Q Y GVFTG+CG+ LDHGV AVGYG +++G YW+V+NSWG WGE GY+++QR
Sbjct: 272 ALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYIRMQRGT- 330
Query: 341 DTNTGKCGIAMEASYPVKNSQN-SAKPK 367
N G CGIAMEAS+P+K S N S KP+
Sbjct: 331 -GNGGLCGIAMEASFPIKTSPNPSRKPR 357
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 168/359 (46%), Positives = 229/359 (63%), Gaps = 26/359 (7%)
Query: 1 MATASMFLAISTLVFL-FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
+ S F+ ++ L L + S S+A + +S H+ W+A++
Sbjct: 3 LTKQSQFICLALLFVLGAWPSKSAARTLQDVSMYERHEQ----------------WMAQY 46
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
G+ E R+ IFK+N+ ID NS ++YK+G+N+FADL+NEE++A +R+
Sbjct: 47 GRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKA----SRNR 102
Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
K + + RY + +P ++DWR+KGAV PVKDQG CG CWAFS VAA+EGIN
Sbjct: 103 FKGHMCSPQAGPFRY--ENVSAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGIN 160
Query: 179 KIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
++ TG+LISLSEQE+VDCD K + GCNGGLMD AF+FI QN G+ +E +YPY G + C
Sbjct: 161 QLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTC 220
Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
+ + I G+EDV E +L KAVA QPVSVAI+AGG FQ Y SG+FTG CG+
Sbjct: 221 NTQKEATHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGT 280
Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
LDHGV AVGYG +G YWLV+NSWG+ WGE GY+++Q++ + G CGIAM+ASYP
Sbjct: 281 QLDHGVTAVGYGISDGTKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGIAMQASYP 338
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 175/360 (48%), Positives = 235/360 (65%), Gaps = 24/360 (6%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA S + I+ L+ + S + + + H+ S S R +D W+ +G
Sbjct: 1 MALESKIICITLLIMGVWASQALSRTL--------HEVSMSERHED--------WMGLYG 44
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+T + E+RF+IFK+N+ +I+ NS NR YK+ +N+FAD TNEE++A G +
Sbjct: 45 RTYKDIAEKERRFKIFKENVEYIESVNSAGNRRYKLSINEFADQTNEEFKASRNGYNMSS 104
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+ R S++ S RY A +P S+DWR+KGAV P+KDQG CG CWAFS VAA+EG+ +
Sbjct: 105 RPR--SSEITSFRYENVAA--VPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQ 160
Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TGELISLSEQELVDCD + GC GGLMD AF+FII NGG+ +E +YPY G + C+
Sbjct: 161 LKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCN 220
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ + I YEDV E +L KAVA PVSVAI+AGG FQ Y SGVFTG+CG+
Sbjct: 221 KKKAASSAAKIKNYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTE 280
Query: 299 LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV AVGYG T++G YWLV+NSWG+ WGE+GY+ ++R+ + + G CGIAMEASYP
Sbjct: 281 LDHGVTAVGYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERD-IGADEGLCGIAMEASYPT 339
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 158/308 (51%), Positives = 213/308 (69%), Gaps = 8/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK E R +IFK+N++ I+ N+ N++YK+G+N+FADLTNEE++A
Sbjct: 39 HEQWMAQYGKVYKDSYEKELRSKIFKENVQRIEAFNNAGNKSYKLGINQFADLTNEEFKA 98
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
R+ K + + + + + +P S+DWR+KGAV P+KDQG CG CWAFS
Sbjct: 99 -----RNRFKGHMCSNSTRTPTFKYEHVTSVPASLDWRQKGAVTPIKDQGQCGCCWAFSA 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FI+QN G+++E YP
Sbjct: 154 VAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + SI G+EDV E +L KAVA+QP+SVAI+A G FQ Y SG
Sbjct: 214 YQGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSG 273
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
VFTG CG+ LDHGV AVGYG++ G YWLV+NSWG WGE GY+++QR++ G CG
Sbjct: 274 VFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVA-AEEGLCGF 332
Query: 350 AMEASYPV 357
AM+ASYP
Sbjct: 333 AMQASYPT 340
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 158/292 (54%), Positives = 210/292 (71%), Gaps = 9/292 (3%)
Query: 70 EKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
EKR +IF N+ +I+ NS N+ YK+ +NKFADLTNEE+ + +R+ K + S
Sbjct: 5 EKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEF----IASRNKFKGHMCSSI 60
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ + + + +P +VDWR+KGAV PVK+QG CGSCWAFS VAA EGI+++ TG+L+S
Sbjct: 61 IRTTTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVS 120
Query: 188 LSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQEL+DCD K ++ GC GGLMD AF+FIIQN G+ +E YPY G + C+ ++ +
Sbjct: 121 LSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKASIHA 180
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V+I GYEDV +E++L+KAVA+QP+SVAI+A G FQ Y SGVFTG CG+ LDHGV AV
Sbjct: 181 VTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAV 240
Query: 307 GYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GYG N G YWLV+NSWG+DWGE GY+++QR + G CGIAM+ASYP
Sbjct: 241 GYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAE-GLCGIAMQASYPT 291
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 175/365 (47%), Positives = 227/365 (62%), Gaps = 16/365 (4%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M L L FL + +S D T++ V +Y+ W H T
Sbjct: 1 MKLFFIVLSFLCLLQASKGFDFD----------EKELETEENVWKLYERWRDHHSVTR-- 48
Query: 66 MGHNE-KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
H KRF +F+ N+ + N N+ YK+ +N+FAD+T+ E+R+ Y G+ R L
Sbjct: 49 ASHEALKRFNVFRHNVLHVHRTNKKNKPYKLKVNRFADITHHEFRSSYAGSNVKHHRMLR 108
Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
K S + + +P SVDWREKGAV VK+Q CGSCWAFSTVAAVEGINKI T +
Sbjct: 109 GPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNK 168
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK-CDPSRRN 243
L+SLSEQELVDCD + N GC GGLM+ AF+FI NGG+ +E+ YPY + + C +
Sbjct: 169 LVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSID 228
Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
+ V+IDG+E V DE +L KAVA QPVSVAI+AG FQ Y GVF GECG+ L+HGV
Sbjct: 229 GETVTIDGHEHVPENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGV 288
Query: 304 VAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
V VGYG T+NG YW+VRNSWG +WGE GYV+++R + + N G+CGIAMEASYP K S
Sbjct: 289 VIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISE-NEGRCGIAMEASYPTKVSST 347
Query: 363 SAKPK 367
+ P+
Sbjct: 348 PSTPE 352
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 159/289 (55%), Positives = 208/289 (71%), Gaps = 5/289 (1%)
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR-LMKSKV 128
E++F ++ DNL F+ HN + T+K+GL FADLT++EYR LG R + K L K
Sbjct: 67 ERKFSVWLDNLEFVHSHNEKDSTFKLGLTNFADLTHDEYRQHALGYRPELKGTGLGTGKS 126
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
+YA E P S+DWR+KGAV VK+Q CGSCWAFST +VEG N I +GEL+SL
Sbjct: 127 TGFQYA---DYEAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTTGSVEGANAIYSGELVSL 183
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQELVDCD + GC+GGLMD+AF FII+NGG+D+E+DY Y + C+ ++ VV+
Sbjct: 184 SEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYKAQDGVCNIAKEKRHVVT 243
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
ID YEDV P DE +LKKA A+QP+SVAIEA R FQ Y GVF CG+ALDHGV+ VGY
Sbjct: 244 IDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVFDAPCGTALDHGVLVVGY 303
Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
G++NG DYW+V+NSWG WG++GY++L R + ++ G+CGIAM+ASYP+
Sbjct: 304 GSDNGTDYWIVKNSWGDFWGDSGYIRLARGISNS-AGQCGIAMQASYPI 351
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 176/359 (49%), Positives = 227/359 (63%), Gaps = 8/359 (2%)
Query: 16 LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQI 75
LFFI S + S + D T++ V +Y+ W H S KRF +
Sbjct: 3 LFFIVLISFLSLLQASKGFDFD-EKELETEENVWKLYERWRGHH-SVSRASHEAIKRFNV 60
Query: 76 FKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
F+ N+ + N N+ YK+ +N+FAD+T+ E+R+ Y G+ R L K S +
Sbjct: 61 FRHNVLHVHRTNKKNKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMY 120
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ +P SVDWREKGAV VK+Q CGSCWAFSTVAAVEGINKI T +L+SLSEQELVD
Sbjct: 121 ENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVD 180
Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK-CDPSRRNAKVVSIDGYED 254
CD + N GC GGLM+ AF+FI NGG+ +E+ YPY ++ + C + + V+IDG+E
Sbjct: 181 CDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEH 240
Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENG 313
V DE L KAVA QPVSVAI+AG FQ Y GVF GECG+ L+HGVV VGYG T+NG
Sbjct: 241 VPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNG 300
Query: 314 VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSSA 372
YW+VRNSWG +WGE GYV+++R + + N G+CGIAMEASYP K S+ P H S
Sbjct: 301 TKYWIVRNSWGPEWGEGGYVRIERGISE-NEGRCGIAMEASYPTK---LSSTPSTHESV 355
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 330 bits (847), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 163/356 (45%), Positives = 234/356 (65%), Gaps = 14/356 (3%)
Query: 3 TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
T++ +A +L FL I + + A ++ + D D S R ++ W+AK+G+
Sbjct: 70 TSTPTMASRSLGFLIAILACTCAVSALAARDLTDDLSMVAR--------HEQWMAKYGRV 121
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
N + +R ++FK N+ FI+ N+ N + + N+FAD+T +E+RA + G +
Sbjct: 122 YNDVAEKAQRLEVFKANVAFIELVNAGNDKFSLEANQFADMTVDEFRAAHTGYKPVPAN- 180
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
K + +YA + D LP S+DWR KGAV P+KDQG CG CWAFSTVA+VEGI K+ T
Sbjct: 181 --KGRTTQFKYANVSLDALPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLST 238
Query: 183 GELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
G+LISLSEQELVDCD ++ GC GGLMD AF+FII NGG+ +E +YPY G ++ C+ ++
Sbjct: 239 GKLISLSEQELVDCDVDGMDQGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNK 298
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
+ V SI GYEDV DE SL KAVA QPVS+A++ G F+ Y+ GV +G CG+ LDH
Sbjct: 299 ESNDVASIKGYEDVPSNDETSLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDH 358
Query: 302 GVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
G+ AVGYG T +G +WL++NSWG+ WGE G+++++R++ D G CG+AM+ SYP
Sbjct: 359 GIAAVGYGITSDGTKFWLMKNSWGTSWGEKGFIRMERDIAD-EEGLCGLAMQPSYP 413
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 330 bits (846), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 180/340 (52%), Positives = 232/340 (68%), Gaps = 19/340 (5%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFA 101
+++ + +Y+ W A+H S + +RF +F++N R + E N L R YK+ LN+FA
Sbjct: 41 SEESLWALYERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFN-LRRDAPYKLRLNRFA 98
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA-------GDELPESVDWREKGAVN 154
DLT++E+R Y +R + R+ K + A+ G LP SVDWREKGAV
Sbjct: 99 DLTSDEFRRSYASSRV-SHHRMFKPRAANNNDDDDDKGSSFTHGGALPTSVDWREKGAVT 157
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQ 214
VKDQG CGSCWAFST+AAVEGIN I T L SLSEQ+LVDCD K NAGC+GGLMD AF
Sbjct: 158 GVKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFS 217
Query: 215 FIIQNGGMDSEQDYPYLGAE-NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
+I ++GG+ +E+ YPY + + C+ + A VVSIDGYEDV DE +LKKAVA QPV+
Sbjct: 218 YIAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVA 277
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGY 332
VAIEAGG FQ Y GVF G+CG+ LDHGV AVGYG T +G YW+V+NSWG +WGE GY
Sbjct: 278 VAIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKGY 337
Query: 333 VKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSSA 372
++++R++ D G CGIAMEASYPVK S N PK H++A
Sbjct: 338 IRMKRDVADKE-GLCGIAMEASYPVKTSPN---PK-HAAA 372
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 330 bits (846), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 177/368 (48%), Positives = 233/368 (63%), Gaps = 17/368 (4%)
Query: 1 MATASMFLAI-STLVFLF----FISSSSAA---DMSIISYDNNHDHSSSWRTDDEVMTIY 52
MA + LA+ S L LF F++ S+ A D S++ Y ++++ ++
Sbjct: 1 MAGNNSLLAMDSKLSMLFLLLGFVACSATASHHDPSVVGYSQE-----DLALPNKLVGLF 55
Query: 53 QTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMY 112
+W KH K KR++IFK NLR I E N N +Y +GLN FAD+ +EE++A Y
Sbjct: 56 TSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASY 115
Query: 113 LGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
LG + RR + ++ RYA LP +VDWR+KGAV PVK+QG CGSCWAFSTV
Sbjct: 116 LGLKPGLARRDAQPHGSTTFRYANAV--NLPWAVDWRKKGAVTPVKNQGECGSCWAFSTV 173
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
AAVEGIN+IVTG+L+SLSEQEL+DCD N GC GGLMD+AF +I+ N G+ +E+DYPYL
Sbjct: 174 AAVEGINQIVTGKLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYL 233
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
E C + ++KV++I GYEDV E SL KA+A QPVSV I AG R FQ Y+ G+F
Sbjct: 234 MEEGYCREKQPHSKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIF 293
Query: 292 TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
GECG DH + AVGYG+ G DY +++NSWG +WGE GY +++R G C I
Sbjct: 294 DGECGIQPDHALTAVGYGSYYGQDYIIMKNSWGKNWGEQGYFRIRRG-TGKPEGVCDIYK 352
Query: 352 EASYPVKN 359
ASYP KN
Sbjct: 353 IASYPTKN 360
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 159/308 (51%), Positives = 217/308 (70%), Gaps = 9/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
++ W+++ G+ N E R++IFK+N++ I+ N + ++YK+G+N+FADLTNEE++
Sbjct: 39 HEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIESFNKASGKSYKLGINQFADLTNEEFKT 98
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K + S+ RY P S+DWR+KGAV +KDQG CGSCWAFS
Sbjct: 99 ----SRNRFKGHMCSSQAGPFRYENLTA--APSSMDWRKKGAVTAIKDQGQCGSCWAFSA 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAAVEGI ++ T +LISLSEQELVDCD K + GC GGLMD AF+FI QN G+ +E +YP
Sbjct: 153 VAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYP 212
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G++ C+ + I+G+EDV +E +L KAVA QPVSVAI+AGG FQ Y SG
Sbjct: 213 YEGSDGTCNTKQEANHAAKINGFEDVPANNEGALMKAVAKQPVSVAIDAGGFGFQFYSSG 272
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
+FTG+CG+ LDHGV AVGYG NG++YWLV+NSWG+ WGE GY+++Q++ +D G CGI
Sbjct: 273 IFTGDCGTELDHGVAAVGYGESNGMNYWLVKNSWGTQWGEEGYIRMQKD-IDAKEGLCGI 331
Query: 350 AMEASYPV 357
AM+ASYP
Sbjct: 332 AMQASYPT 339
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 330 bits (845), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 159/316 (50%), Positives = 218/316 (68%), Gaps = 9/316 (2%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN--SLNRTYKVGLNKFADL 103
D + ++ W++++ K E+R +IF N+ +I+ N + N+ YK+G+N+FADL
Sbjct: 34 DSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYIEVFNNDANNKLYKLGINQFADL 93
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TNEE+ + +R+ K + S + + + +P +VDWR+KGAV PVK+QG CG
Sbjct: 94 TNEEF----IASRNKFKGHMCSSIAKTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCG 149
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VAA EGI K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+
Sbjct: 150 CCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGL 209
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E YPY G + C+ ++ + +I GYEDV +E +L+KAVA+QP+SVAI+A G
Sbjct: 210 STEAAYPYQGVDGTCNANKASIHAATITGYEDVPANNEQALQKAVANQPISVAIDASGSD 269
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y+SGVF+G CG+ LDHGV AVGYG N G YWLV+NSWG+DWGE GY+++QR +D
Sbjct: 270 FQFYKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIRMQRG-VD 328
Query: 342 TNTGKCGIAMEASYPV 357
G CGIAM+ASYP
Sbjct: 329 AAEGLCGIAMQASYPT 344
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 160/315 (50%), Positives = 209/315 (66%), Gaps = 15/315 (4%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
+++TW +HGK+ R ++F+DN F+ +HNS N +Y + LN FADLT+ E++
Sbjct: 28 LFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFK 87
Query: 110 AMYLGTRSD----AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
LG + A R L + V ++P S+DWR KG V VKDQGSCG+C
Sbjct: 88 TSRLGLSAAPLNLAHRNLEITGVVG---------DIPASIDWRNKGVVTNVKDQGSCGAC 138
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
W+FS A+EGINKIVTG L+SLSEQEL++CD+ N GC GGLMDYAFQF+I N G+D+E
Sbjct: 139 WSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTE 198
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
+DYPY + C+ R +VV+ID Y DV +E L +AVA QPVSV I RAFQ
Sbjct: 199 EDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQM 258
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y G+FTG C ++LDH V+ VGYG+ENGVDYW+V+NSWG+ WG GY+ +QRN ++ G
Sbjct: 259 YSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQ-G 317
Query: 346 KCGIAMEASYPVKNS 360
CGI M ASYPVK S
Sbjct: 318 VCGINMLASYPVKTS 332
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 163/316 (51%), Positives = 215/316 (68%), Gaps = 10/316 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
T+ ++ ++ W+AK+ K EKRF IFKDN+ FI+ N+ N+ YK+G+N AD
Sbjct: 33 TETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFNAAGNKPYKLGVNHLAD 92
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LT EE++A +R+ KR +V + + + +P SVDWR+KGAV P+KDQG C
Sbjct: 93 LTIEEFKA----SRNGLKRSY-DYEVGTTSFKYENVTAIPASVDWRKKGAVTPIKDQGQC 147
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
GSCWAFSTVAA EGI+KI TG+L+SLSEQELVDCDRK + GC GG M+ F+FII+NGG
Sbjct: 148 GSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQGCEGGYMEDGFEFIIKNGG 207
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E +YPY + C + A I GYE V E +L KAVA+QPVSV+I+A
Sbjct: 208 ITTEANYPYKAVDGSCKNA--TAPAAQIKGYEKVPVNSEKALLKAVANQPVSVSIDAADG 265
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
+F Y SG+FTGECG+ LDHGV AVGYG NG DYW+V+NSWG+ WGE GY+++QR +
Sbjct: 266 SFMFYSSGIFTGECGTELDHGVTAVGYGRANGTDYWIVKNSWGTVWGEQGYIRMQRGIA- 324
Query: 342 TNTGKCGIAMEASYPV 357
G CGIAM++SYP
Sbjct: 325 AKEGLCGIAMDSSYPT 340
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 170/351 (48%), Positives = 225/351 (64%), Gaps = 12/351 (3%)
Query: 13 LVFLFFISSSSAA---DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+ L F++ S+ A D S++ Y ++++ ++ +W KH K
Sbjct: 9 FLLLGFVACSATASHHDPSVVGYSQE-----DLALPNKLVGLFTSWSVKHSKIYASPKEK 63
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
KR++IFK NLR I E N N +Y +GLN FAD+ +EE++A YLG + RR + +
Sbjct: 64 VKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGS 123
Query: 130 SQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
+ RYA LP +VDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+IVTG+L+SL
Sbjct: 124 TTFRYANAV--NLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSL 181
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQEL+DCD N GC GGLMD+AF +I+ N G+ +E+DYPYL E C + ++KV++
Sbjct: 182 SEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSKVIT 241
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
I GYEDV E SL KA+A QPVSV I AG R FQ Y+ G+F GECG DH + AVGY
Sbjct: 242 ITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTAVGY 301
Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
G+ G DY +++NSWG +WGE GY +++R G C I ASYP KN
Sbjct: 302 GSYYGQDYIIMKNSWGKNWGEQGYFRIRRG-TGKPEGVCDIYKIASYPTKN 351
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 156/308 (50%), Positives = 218/308 (70%), Gaps = 8/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+ +HGK EKRF+IF +N+ +++ +N+ N+ YK+G+N+F DLTN+E+
Sbjct: 135 HEQWMTRHGKVYKDPREREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEF-- 192
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + + +P +VDWR+ GAV PVKDQG CG CWAFS
Sbjct: 193 --IAPRNRFKGHMCSSIIRTTTFKYENVTTVPSTVDWRQNGAVTPVKDQGQCGCCWAFSA 250
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + G+LISLSEQELVDCD K ++ GC GGLMD A++FIIQN G+++E +YP
Sbjct: 251 VAATEGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHGLNTEANYP 310
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + KC+ + +I GYEDV +E +L+KAVA+QPVSVAI+A FQ Y+SG
Sbjct: 311 YKGVDGKCNANEAANHAATITGYEDVPANNEKALQKAVANQPVSVAIDASSSDFQFYKSG 370
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
FTG CG+ LDHGV AVGYG +++G YWLV+NSWG++WGE GY+++QR +D+ G CG
Sbjct: 371 AFTGSCGTELDHGVTAVGYGVSDHGTKYWLVKNSWGTEWGEEGYIRMQRG-VDSEEGVCG 429
Query: 349 IAMEASYP 356
IAM+ASYP
Sbjct: 430 IAMQASYP 437
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 171/360 (47%), Positives = 239/360 (66%), Gaps = 23/360 (6%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ + + I L LF + AA S + N H+ S R +D W+A++G
Sbjct: 1 MASVNQYQYI-CLALLFVL----AAWASQATARNLHEASMYERHED--------WMAQYG 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+ KR++IFKDN+ I+ N +++++YK+ +N+FADLTNEE+ +R+
Sbjct: 48 RVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGT----SRNRF 103
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K + ++ S +Y + +P ++DWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--ENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161
Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TG+LISLSEQELVDCD + GCNGGLMD AF+FI QN G+ +E +YPY G + C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCN 221
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ I+GYEDV +E +L+KAV QP++VAI+AGG FQ Y SGVFTG+CG+
Sbjct: 222 RKKAAHPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTE 281
Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++ G CGIAM+ASYP
Sbjct: 282 LDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 340
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 174/364 (47%), Positives = 234/364 (64%), Gaps = 14/364 (3%)
Query: 12 TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
TL+ + ++ S+ I +D S D+ + +Y+ W H + G +
Sbjct: 7 TLLLVALVAMSAVELCRAIEFDERDLAS-----DEALWDLYERWQTHHHVHRH-HGEKGR 60
Query: 72 RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTR-SDAKRRLMKSKVA 129
RF FK+N+RFI HN +R Y++ LN+F D+ EE+R+ + +R +D +R + A
Sbjct: 61 RFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFADSRINDLRRAESPAAPA 120
Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
+ +LP SVDWR++GAV VKDQG CGSCWAFSTV +VEGIN I TG L+SLS
Sbjct: 121 VPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSLVSLS 180
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR-RNAKVVS 248
EQEL+DCD N GC GGLM+ AF+FI GG+ +E YPY + CD R R ++VS
Sbjct: 181 EQELIDCDTDEN-GCQGGLMENAFEFIKSYGGVTTESAYPYRASNGTCDSVRSRRGQIVS 239
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
IDG++ V E +L KAVA+QPVSVAI+AGG+AFQ Y GVFTG+CG+ LDHGV AVGY
Sbjct: 240 IDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 299
Query: 309 G-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPK 367
G +++G YW+V+NSWG WGE GY+++QR N G CGIAMEAS+P+K S N A+ K
Sbjct: 300 GVSDDGTAYWIVKNSWGPSWGEGGYIRMQRGA--GNGGLCGIAMEASFPIKTSPNPAR-K 356
Query: 368 PHSS 371
P +
Sbjct: 357 PRRA 360
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 155/309 (50%), Positives = 204/309 (66%), Gaps = 8/309 (2%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
+++ W ++GKT + R ++F++N F+ +HNS+ N +Y + LN FADLT+ E++
Sbjct: 28 LFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEFK 87
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
A LG + + Q +P +VDWR+ GAV VKDQG+CG CW+FS
Sbjct: 88 ASRLGFSPGRAQSIRSVGTPVQEL------HVPPAVDWRKSGAVTGVKDQGNCGGCWSFS 141
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
T A+EGINKIVTG L+SLSEQELVDCDR N+GC GGLMDYA+QF+I+N G+DSE DYP
Sbjct: 142 TTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKNQGIDSEADYP 201
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y+G + C+ + +V+IDGY D+ P DE L + VA QPVSV I + FQ Y G
Sbjct: 202 YVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKG 261
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
V+TG C S LDH V+ VGYGTE+GVD+W+V+NSWG WG GY+ + RN T G CGI
Sbjct: 262 VYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRN-NGTAEGICGI 320
Query: 350 AMEASYPVK 358
M ASYP K
Sbjct: 321 NMLASYPAK 329
>gi|449532567|ref|XP_004173252.1| PREDICTED: oryzain alpha chain-like [Cucumis sativus]
Length = 321
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 151/200 (75%), Positives = 175/200 (87%)
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFS+VAAVEGIN+IVTGELI LSEQELVDCD+ N GCNGGLMDYAFQFII NGG+
Sbjct: 13 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGI 72
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
D+E+DYPY G + CDP+R+NAKVV+IDGYEDV DE SLKKAVA+QPVSVAIEAGGRA
Sbjct: 73 DTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRA 132
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
FQ Y+SGVFTG CG+ LDHGVVAVGYGT+NG DYW+VRNSWG DWGE+GY++L+RN+ +
Sbjct: 133 FQLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANI 192
Query: 343 NTGKCGIAMEASYPVKNSQN 362
TGKCGIA++ SYP K+ N
Sbjct: 193 TTGKCGIAVQPSYPTKSGAN 212
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 158/309 (51%), Positives = 216/309 (69%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A +GK E+RF+IFK+N+ +I+ N+ N+ YK+ +NKFAD TNE+++
Sbjct: 38 HEQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKG 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G R + R MK V S +Y +P ++DWR+KGAV P+KDQG CGSCWAFST
Sbjct: 98 ARNGYRRPFQTRPMK--VTSFKYENVTA--VPATMDWRKKGAVTPIKDQGQCGSCWAFST 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGIN++ TG+L+SLSEQELVDCD + + GC GGLM+ F+FII+N G+ +E +YP
Sbjct: 154 VAATEGINQLTTGKLVSLSEQELVDCDNQGEDQGCEGGLMEDGFEFIIKNHGITTEANYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y A+ C+ ++ + + I GYE V E L K VA+QP+SV+I+AGG FQ Y SG
Sbjct: 214 YQAADGTCNSKKQASHIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSDFQFYSSG 273
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG+CG+ LDHGV AVGYG T +G YWLV+NSW + WGE GY+++QR+ +D G CG
Sbjct: 274 VFTGKCGTELDHGVTAVGYGETSDGTKYWLVKNSWXTSWGEEGYIRMQRD-IDAEEGLCG 332
Query: 349 IAMEASYPV 357
IAM++SYP
Sbjct: 333 IAMDSSYPT 341
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 162/325 (49%), Positives = 215/325 (66%), Gaps = 4/325 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+D+ + +Y+ W +H G +RF FKDN+R+I EHN Y LN+F D+
Sbjct: 38 SDEALWDLYERW-QEHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYPP-LNRFGDM 95
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
EE+RA + G+ ++ RR + + + +LP +VDWR KGAV VKDQG CG
Sbjct: 96 GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
SCWAFSTV +VEGIN I TG L+SLSEQEL+DCD N+GC GGLM+ AF++I +GG+
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGIT 215
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
+E YPY A CD R +V IDG+++V E +L KAVA+QPVSVAI+AG ++F
Sbjct: 216 TESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSF 275
Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Q Y GVF G+CG+ LDHGV VGYG T +G +YW+V+NSWG+ WGE GY+++QR+
Sbjct: 276 QFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRD-SGY 334
Query: 343 NTGKCGIAMEASYPVKNSQNSAKPK 367
+ G CGIAMEASYPVK S N P+
Sbjct: 335 DGGLCGIAMEASYPVKFSPNRVTPR 359
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 162/325 (49%), Positives = 215/325 (66%), Gaps = 4/325 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+D+ + +Y+ W +H G +RF FKDN+R+I EHN Y LN+F D+
Sbjct: 38 SDEALWDLYERW-QEHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYAP-LNRFGDM 95
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
EE+RA + G+ ++ RR + + + +LP +VDWR KGAV VKDQG CG
Sbjct: 96 GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
SCWAFSTV +VEGIN I TG L+SLSEQEL+DCD N+GC GGLM+ AF++I +GG+
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGIT 215
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
+E YPY A CD R +V IDG+++V E +L KAVA+QPVSVAI+AG ++F
Sbjct: 216 TESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSF 275
Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Q Y GVF G+CG+ LDHGV VGYG T +G +YW+V+NSWG+ WGE GY+++QR+
Sbjct: 276 QFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRD-SGY 334
Query: 343 NTGKCGIAMEASYPVKNSQNSAKPK 367
+ G CGIAMEASYPVK S N P+
Sbjct: 335 DGGLCGIAMEASYPVKFSPNRVTPR 359
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 217/309 (70%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A +GK E+RF+IFK+N+ +I+ N+ N+ YK+ +NKFAD TNE+++
Sbjct: 38 HEQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKG 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G R + R MK V S +Y +P ++DWR+KGAV +KDQG CGSCWAFST
Sbjct: 98 ARNGYRRPFQTRPMK--VTSFKYENVTA--VPATMDWRKKGAVTLIKDQGQCGSCWAFST 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGIN++ TG+L+SLSEQELVDCD + + GC GGLM+ F+FII+N G+ +E +YP
Sbjct: 154 VAATEGINQLTTGKLVSLSEQELVDCDIQGEDQGCEGGLMEDGFEFIIKNHGITTEANYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y A+ C+ ++ + + I GYE V E L K VA+QP+SV+I+AGG FQ Y SG
Sbjct: 214 YQAADGTCNSKKQASHIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSDFQFYSSG 273
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG+CG+ LDHGV AVGYG T +G YWLV+NSWG+ WGE GY+++QR+ +DT G CG
Sbjct: 274 VFTGKCGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRD-IDTEEGLCG 332
Query: 349 IAMEASYPV 357
IAM++SYP
Sbjct: 333 IAMDSSYPT 341
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 160/321 (49%), Positives = 219/321 (68%), Gaps = 12/321 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
TD + +Y+ W ++H S +KRF +FK N+ I+ N L + YK+ LN+FAD+
Sbjct: 32 TDKSLWDLYERWGSQH-MVSRAPDEKKKRFNVFKYNVNHINRVNQLGKPYKLKLNEFADM 90
Query: 104 TNEEYRAMYLGTRSDAK---RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
TN E++A + D+K R++K K + + P S+DWR GAVNP+K+QG
Sbjct: 91 TNHEFKAGF-----DSKILHFRMLKGKRRQTPFTHAKTTDPPPSIDWRTNGAVNPIKNQG 145
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFST+ VEGINKI T +L+SLSEQELVDC+ GCNGGLM+ ++FI + G
Sbjct: 146 RCGSCWAFSTIVGVEGINKIKTNQLVSLSEQELVDCETDCE-GCNGGLMENGYEFIKETG 204
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +EQ YPY +CD S+RN+ VV IDG+E+V DE ++ +AVA+QPVS+AI+AGG
Sbjct: 205 GVTTEQIYPYFARNGRCDISKRNSPVVKIDGFENVPANDESAMLRAVANQPVSIAIDAGG 264
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y GVF G CG+ L+HGV VGYG T++G +YW+VRNSWG+ WGE GYV++QR
Sbjct: 265 LNFQFYSQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVRNSWGTGWGEQGYVRMQRG- 323
Query: 340 LDTNTGKCGIAMEASYPVKNS 360
++ G CG+AM+ASYP+K S
Sbjct: 324 VNVPEGLCGLAMDASYPIKAS 344
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 162/310 (52%), Positives = 217/310 (70%), Gaps = 8/310 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK EKRFQIFKDN+ FI+ N+ N+ YK+G+N ADLT EE++
Sbjct: 38 HENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKD 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
G + + K+ +Y + ++PE++DWR KGAV P+KDQG CGSCWAFS
Sbjct: 98 SRNGLKRTYEFSTTTFKLNGFKY--ENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFS 155
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
TVAA EGI +I TG L+SLSEQELVDCD ++ GC+GGLM+ F+FII+NGG+ SE +YP
Sbjct: 156 TVAATEGIYQISTGMLMSLSEQELVDCD-SVDHGCDGGLMEDGFEFIIKNGGISSEANYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + CD S+ + I GYE V E +L++AVA+QPVSV+I+AGG FQ Y SG
Sbjct: 215 YTAVDGTCDASKEASPAAQIKGYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGV-DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
VFTG+CG+ LDHGV VGYG T++G +YW+V+NSWG+ WGE GY+++QR +D G C
Sbjct: 275 VFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVKNSWGTQWGEEGYIRMQRG-IDALEGLC 333
Query: 348 GIAMEASYPV 357
GIAM+ASYP
Sbjct: 334 GIAMDASYPT 343
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 159/292 (54%), Positives = 207/292 (70%), Gaps = 4/292 (1%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNE 106
+V+ ++++ L KH K RF+IF DNL+ IDE N Y +GLN+FADLT+E
Sbjct: 44 KVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHE 103
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
E++ +LG + + R +S + RY + +LP+SVDWR+KGAV+PVK+QG CGSCW
Sbjct: 104 EFKNKFLGFKGELAERKDES-IEQFRY--RDFVDLPKSVDWRKKGAVSPVKNQGQCGSCW 160
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
AFSTVAAVEGIN+IVTG L LSEQEL+DCD N GCNGGLMDYAF ++ +NG + E+
Sbjct: 161 AFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRNG-LHKEE 219
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
+YPY+ +E CD R ++ V+I GY DV +E S KA+A+QP+SVAIEA GR FQ Y
Sbjct: 220 EYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFY 279
Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
GVF G CG+ LDHGV AVGYGT G+DY +VRNSWG WGE GY++++RN
Sbjct: 280 SGGVFDGHCGTELDHGVAAVGYGTSKGLDYVIVRNSWGPKWGEKGYIRMKRN 331
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 174/359 (48%), Positives = 223/359 (62%), Gaps = 25/359 (6%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ I LV L I +S + H+ S S R ++ W+ K+G
Sbjct: 1 MASIGKKQHILALVLLLSICTSQVMSRYL------HEASMSER--------HEQWMKKYG 46
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K +KR IFKDN+ FI+ N+ N+ YK+G+N AD TNEE+ A + G + A
Sbjct: 47 KVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLGINHLADQTNEEFVASHNGYKHKA 106
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K Y G +P +VDWRE GAV VKDQG CGSCWAFSTVAA EGI +
Sbjct: 107 SHSQTPFK-----YENVTG--VPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQ 159
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
I T L+SLSEQELVDCD ++ GC+GG M+ F+FII+NGG+ SE +YPY + CD
Sbjct: 160 ITTSMLMSLSEQELVDCD-SVDHGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDA 218
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
++ + I GYE V E +L+KAVA+QPVSV I+AGG AFQ Y SGVFTG+CG+ L
Sbjct: 219 NKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQL 278
Query: 300 DHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
DHGV AVGYG T++G YW+V+NSWG+ WGE GY+++QR D G CGIAM+ASYP
Sbjct: 279 DHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRG-TDAQEGLCGIAMDASYPT 336
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 160/308 (51%), Positives = 210/308 (68%), Gaps = 7/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
Y+ WL +HG+ ++ F I++ N+RFI+ N+ N ++ + N+FAD+TNEEY+A+
Sbjct: 45 YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEEYKAL 104
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
Y+G + R +S +R LP SVDWR+ GAV PV++QG CGSCWAFSTV
Sbjct: 105 YMGLGTSETSRKNQSSFKRERSKV-----LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 159
Query: 172 AAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
AAVEGINKI TG+L+SLSEQEL+DCD N GCNGG M AF+FI QNGG+ + ++YPY
Sbjct: 160 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 219
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+G + C+ + VV I GYE V P +E L+ AVA QPVSVAI+AGG FQ Y G+
Sbjct: 220 IGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI 279
Query: 291 FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
F G CG L+H V +GYG +NG YWLV+NSWG+ WGE GY ++ R+ D + G CGIA
Sbjct: 280 FNGFCGKQLNHAVTVIGYGEDNGKKYWLVKNSWGTGWGEAGYARMIRDSRD-DEGICGIA 338
Query: 351 MEASYPVK 358
MEASYP+K
Sbjct: 339 MEASYPIK 346
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 171/371 (46%), Positives = 233/371 (62%), Gaps = 22/371 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M A +F + ++ + +A M I D +++ + +Y+ W + H
Sbjct: 3 MGKAFLFAVVLAVILV------AAMSMEITERD--------LASEESLWDLYERWRSHH- 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
S + KRF +FK N+ I + N ++ YK+ LN FAD+TN E+R Y S K
Sbjct: 48 TVSRDLSEKRKRFNVFKANVHHIHKVNQKDKPYKLKLNSFADMTNHEFREFY---SSKVK 104
Query: 121 R-RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
R++ A+ + + LP SVDWR++GAV VK+QG CGSCWAFSTV VEGINK
Sbjct: 105 HYRMLHGSRANTGFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINK 164
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
I TG+L+SLSEQELVDC+ N GCNGGLM+ A++FI ++GG+ +E+ YPY + CD
Sbjct: 165 IKTGQLVSLSEQELVDCETD-NEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDS 223
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSA 298
S+ NA V+IDG+E V DE +L KAVA+QPVSVAI+A G Q Y GV+ G+ CG+
Sbjct: 224 SKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNE 283
Query: 299 LDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV VGYGT +G YW+V+NSWG+ WGE GY+++QR + G CGIAMEASYP+
Sbjct: 284 LDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPL 343
Query: 358 KNSQNSAKPKP 368
K S ++ KP P
Sbjct: 344 KLSSHNPKPSP 354
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 327 bits (839), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 167/315 (53%), Positives = 216/315 (68%), Gaps = 11/315 (3%)
Query: 46 DEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
D +M + ++ W+A++G+ KRF IFK+N+ +I+ N + YK+G+N FADL
Sbjct: 32 DSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADL 91
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TN+E++A +R+ K S RY + +P +VDWR KGAV PVKDQG CG
Sbjct: 92 TNQEFKA----SRNGYKLPHDCSSNTPFRY--ENVSSVPTTVDWRTKGAVTPVKDQGQCG 145
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VAA+EGI K+ TG LISLSEQELVDCD K I+ GC GGLMD AF FII N G+
Sbjct: 146 CCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSFIINNKGL 205
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E +YPY G + C S+ + I GYEDV E +L+KAVA+QPVSVAI+AGG
Sbjct: 206 TTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSD 265
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y SGVFTGECG+ LDHGV AVGYG E+G YWLV+NSWG+ WGE GY+++Q++ ++
Sbjct: 266 FQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKD-IE 324
Query: 342 TNTGKCGIAMEASYP 356
G CGIAM++SYP
Sbjct: 325 AKEGLCGIAMQSSYP 339
>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 351
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 174/353 (49%), Positives = 229/353 (64%), Gaps = 15/353 (4%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M +AI L +F +SS A DMSIIS+DN H ++ RTDDEVM++++ WL KH K N
Sbjct: 1 MNMAIVLLFMVFAVSS--ALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNA 58
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
+G EKRFQIFK+NLRFIDE NSLNRTYK+GLN FADLTN EYRAMYL T D R +
Sbjct: 59 LGEKEKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLD 118
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGE 184
+ Y + GD +P+SVDWR++GAV PVK+QG +C SCWAF+ V AVE + KI TG+
Sbjct: 119 TP-PRNHYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGD 177
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
LISLSEQE+VDC + GC GG + + + +I +N G+ E+DYPY G E KCD +++NA
Sbjct: 178 LISLSEQEVVDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNA 236
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
+V+IDG+ V E +L +A+ Q GVF G+CG+ L+H ++
Sbjct: 237 -IVTIDGHGWVPTQLEEALNRALFCYCAYFLYVDKFFLCQ----GVFKGKCGTELNHALL 291
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
VGYGTE DYW+ +NS+ WGENGY+++QR L C YP+
Sbjct: 292 LVGYGTEKDGDYWIAKNSYSDKWGENGYIRIQRKL-----STCKFGNGGYYPI 339
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 177/337 (52%), Positives = 233/337 (69%), Gaps = 14/337 (4%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGL 97
+ S R + EV+T+Y+ WL ++GK NG+G E+RF+IFKDNL+ I+EHNS NR+Y+ GL
Sbjct: 28 TESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP-V 156
NKF+DLT +E++A YLG + + K S VA +RY K GD LP+ VDWRE+GAV P V
Sbjct: 88 NKFSDLTADEFQASYLGGKMEKKSL---SDVA-ERYQYKEGDVLPDEVDWRERGAVVPRV 143
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K QG CGSCWAF+ AVEGIN+I TGEL+SLSEQEL+DCDR N GC GG +AF+F
Sbjct: 144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203
Query: 216 IIQNGGMDSEQDYPYLGAEN-KCDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
I +NGG+ S++ Y Y G + C + +VV+I+G+E V DEMSLKKAVA QP+S
Sbjct: 204 IKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPIS 263
Query: 274 VAIEAGGRAFQHYESGVFTGECGSAL-DHGVVAVGYGTENGV-DYWLVRNSWGSDWGENG 331
V I A Y+SGV+ G C + DH V+ VGYGT + DYWL+RNSWG +WGE G
Sbjct: 264 VMISAAN--MSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGG 321
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
Y++LQRN + TGKC +A+ YP+K++ +S P
Sbjct: 322 YLRLQRNFHEP-TGKCAVAVAPVYPIKSNSSSHLLSP 357
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 327 bits (838), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 157/308 (50%), Positives = 210/308 (68%), Gaps = 8/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ WL H K G RF I++ N++ ID NSL+ +K+ N+FAD+TN E++A
Sbjct: 43 FEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAH 102
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
+LG + + R K QR C +P++VDWR +GAV P+++QG CG CWAFS V
Sbjct: 103 FLGLNTSSLRLHKK-----QRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAV 157
Query: 172 AAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
AA+EGINKI TG L+SLSEQ+L+DCD N GC+GGLM+ AF+FI NGG+ +E DYPY
Sbjct: 158 AAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPY 217
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
G E CD + KVV+I GY+ V+ +E SL+ A A QPVSV I+AGG FQ Y SGV
Sbjct: 218 TGIEGTCDQEKAKNKVVTIQGYQKVAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGV 276
Query: 291 FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
FT CG+ L+HGV VGYG E YW+V+NSWG+ WGE GY++++R + + +TGKCGIA
Sbjct: 277 FTSYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGISE-DTGKCGIA 335
Query: 351 MEASYPVK 358
M ASYP++
Sbjct: 336 MLASYPLQ 343
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 327 bits (838), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 160/308 (51%), Positives = 210/308 (68%), Gaps = 7/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
Y+ WL +HG+ ++ F I++ N+RFI+ N+ N ++ + N+FAD+TNEEY+A+
Sbjct: 41 YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEEYKAL 100
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
Y+G + R +S +R LP SVDWR+ GAV PV++QG CGSCWAFSTV
Sbjct: 101 YMGLGTSETSRKNQSSFKRERSKV-----LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 155
Query: 172 AAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
AAVEGINKI TG+L+SLSEQEL+DCD N GCNGG M AF+FI QNGG+ + ++YPY
Sbjct: 156 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 215
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+G + C+ + VV I GYE V P +E L+ AVA QPVSVAI+AGG FQ Y G+
Sbjct: 216 IGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI 275
Query: 291 FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
F G CG L+H V +GYG +NG YWLV+NSWG+ WGE GY ++ R+ D + G CGIA
Sbjct: 276 FNGFCGKQLNHAVTVIGYGEDNGKKYWLVKNSWGTGWGEAGYARMIRDSRD-DEGICGIA 334
Query: 351 MEASYPVK 358
MEASYP+K
Sbjct: 335 MEASYPIK 342
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 327 bits (837), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 157/307 (51%), Positives = 210/307 (68%), Gaps = 6/307 (1%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMY 112
W+ KHG+ + R+ +FK N+ I+ N++ RT+K+ +N+FADLTN+E+R+MY
Sbjct: 41 WMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMY 100
Query: 113 LGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
G + + ++K S RY + LP SVDWR KGAV P+K+QGSCG CWAFS V
Sbjct: 101 TGFKGVSSLSSQSQTKTTSFRYQNVSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAV 160
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
AA+EG +I G+LISLSEQ+LVDCD + GC GGLMD AF+ I+ GG+ +E +YPY
Sbjct: 161 AAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIMATGGLTTESNYPYK 219
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
G + C+ + N K SI GYEDV DE +L KAVA QPVSV IE GG FQ Y SGVF
Sbjct: 220 GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVF 279
Query: 292 TGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
TGEC + LDH V A+GYG + NG YW+++NSWG+ WGE+GY+++Q+++ D G CG+A
Sbjct: 280 TGECTTYLDHAVTAIGYGQSTNGSKYWIIKNSWGTKWGESGYMRIQKDIKDKQ-GLCGLA 338
Query: 351 MEASYPV 357
M+ASYP
Sbjct: 339 MKASYPT 345
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 327 bits (837), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 158/307 (51%), Positives = 212/307 (69%), Gaps = 6/307 (1%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMY 112
W+ KHG+ + R+ +FK+N+ I+ NS+ RT+K+ +N+FADLTN+E+R+MY
Sbjct: 41 WMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMY 100
Query: 113 LGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
G + A ++K++ RY + LP SVDWR+KGAV P+K+QGSCG CWAFS V
Sbjct: 101 TGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAV 160
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
AA+EG +I G+LISLSEQ+LVDCD + GC GGLMD AF+ I GG+ +E +YPY
Sbjct: 161 AAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESNYPYK 219
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
G + C+ + N K SI GYEDV DE +L KAVA QPVSV IE GG FQ Y SGVF
Sbjct: 220 GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVF 279
Query: 292 TGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
TGEC + LDH V A+GYG + NG YW+++NSWG+ WGE+GY+++Q+++ D G CG+A
Sbjct: 280 TGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQ-GLCGLA 338
Query: 351 MEASYPV 357
M+ASYP
Sbjct: 339 MKASYPT 345
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 327 bits (837), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 157/308 (50%), Positives = 210/308 (68%), Gaps = 8/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ WL H K G RF I++ N++ ID NSL+ +K+ N+FAD+TN E++A
Sbjct: 43 FEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAH 102
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
+LG + + R K QR C +P++VDWR +GAV P+++QG CG CWAFS V
Sbjct: 103 FLGLNTSSLRLHKK-----QRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAV 157
Query: 172 AAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
AA+EGINKI TG L+SLSEQ+L+DCD N GC+GGLM+ AF+FI NGG+ +E DYPY
Sbjct: 158 AAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPY 217
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
G E CD + KVV+I GY+ V+ +E SL+ A A QPVSV I+AGG FQ Y SGV
Sbjct: 218 TGIEGTCDQEKSKNKVVTIQGYQKVAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGV 276
Query: 291 FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
FT CG+ L+HGV VGYG E YW+V+NSWG+ WGE GY++++R + + +TGKCGIA
Sbjct: 277 FTNYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSE-DTGKCGIA 335
Query: 351 MEASYPVK 358
M ASYP++
Sbjct: 336 MMASYPLQ 343
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 327 bits (837), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 212/309 (68%), Gaps = 10/309 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
++ W+A +G+ + +KR++IF++N+ I+ N N+ YK+ +N+FADLTNEE++A
Sbjct: 38 HEEWMASYGRVYKDINEKQKRYKIFEENVALIESSNKDANKPYKLSVNQFADLTNEEFKA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K + +K S +Y + +P ++DWR KGAV PVKDQG CG CWAFS
Sbjct: 98 ----SRNRFKGHICSTKSTSFKYGNVSA--VPSAMDWRMKGAVTPVKDQGQCGCCWAFSA 151
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI K+ TGELISLSEQELVDCD ++ GC GGLMD AF FI N G+ SE +YP
Sbjct: 152 VAATEGITKLTTGELISLSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYP 211
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ +++ I+G+EDV E +L AVA QPVSVAI+AGG FQ Y G
Sbjct: 212 YKGVDGTCNTNKQAIHAAEINGFEDVPANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKG 271
Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VF G CG+ LDHGV AVGYGT ++G YWLV+NSWG+ WGE GY+++QR+ +D G CG
Sbjct: 272 VFIGACGTQLDHGVTAVGYGTSDDGTKYWLVKNSWGTQWGEEGYIRMQRD-VDAKEGLCG 330
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 331 IAMKASYPT 339
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 327 bits (837), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 166/315 (52%), Positives = 215/315 (68%), Gaps = 11/315 (3%)
Query: 46 DEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
D +M + ++ W+A++G+ KRF IFK+N+ +I+ N + YK+G+N FADL
Sbjct: 30 DSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADL 89
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TN+E++A +R+ K S RY + +P +VDWR KGAV PVKDQG CG
Sbjct: 90 TNQEFKA----SRNGYKLPHDCSSNTPFRY--ENVSSVPTTVDWRTKGAVTPVKDQGQCG 143
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VAA+EGI K+ TG LISLSEQELVDCD K + GC GGLMD AF FII N G+
Sbjct: 144 CCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSFIINNKGL 203
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E +YPY G + C S+ + I GYEDV E +L+KAVA+QPVSVAI+AGG
Sbjct: 204 TTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSD 263
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y SGVFTGECG+ LDHGV AVGYG E+G YWLV+NSWG+ WGE GY+++Q++ ++
Sbjct: 264 FQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKD-IE 322
Query: 342 TNTGKCGIAMEASYP 356
G CGIAM++SYP
Sbjct: 323 AKEGLCGIAMQSSYP 337
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 327 bits (837), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 156/308 (50%), Positives = 213/308 (69%), Gaps = 6/308 (1%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK EKRFQIFKDN+ FI+ N+ N+ YK+G+N ADLT EE++
Sbjct: 38 HENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKD 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
G + + K+ +Y + ++PE++DWR KGAV P+KDQG CGSCWAFS
Sbjct: 98 SRNGLKRTYEFSTTTFKLNGFKY--ENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFS 155
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
T+AA EGI++I TG L+SLSEQELVDCD ++ GC GG M+ F+FII+NGG+ SE +YP
Sbjct: 156 TIAATEGIHQISTGNLVSLSEQELVDCD-SVDDGCEGGFMEDGFEFIIKNGGITSETNYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + + V I GYE V + E +L+KAVA+QPVSV+I A F Y SG
Sbjct: 215 YKGVDGTCNTTIAASPVAQIKGYEIVPSYSEEALQKAVANQPVSVSIHATNATFMFYSSG 274
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
++ GECG+ LDHGV AVGYGTENG DYW+V+NSWG+ WGE GY+++ R + + G CGI
Sbjct: 275 IYNGECGTDLDHGVTAVGYGTENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKH-GICGI 333
Query: 350 AMEASYPV 357
A+++SYP
Sbjct: 334 ALDSSYPT 341
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 157/309 (50%), Positives = 220/309 (71%), Gaps = 10/309 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++G+ KR++IFKDN+ I+ N +++++YK+ +N+FADLTNEE+RA
Sbjct: 39 HEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA 98
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K + ++ S +Y + +P +VDWR+KGAV P+KDQG CGSCWAFS
Sbjct: 99 ----SRNRFKAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSA 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGI ++ TG+LISLSEQELVDCD + GC+GGLMD AF+FI QN G+ +E +YP
Sbjct: 153 VAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYP 212
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + I+GYEDV +E +L+KAVA QP++VAI+A G FQ Y SG
Sbjct: 213 YAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSG 272
Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG+CG+ LDHGV AVGYGT ++G+ YWLV+NSW + WGE GY+++QR++ G CG
Sbjct: 273 VFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVT-AKEGLCG 331
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 332 IAMQASYPT 340
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 156/309 (50%), Positives = 214/309 (69%), Gaps = 13/309 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ W+A+HG+ RF+IF+ N+ I+ N+ N +K+G+N+FADLTNEE++
Sbjct: 41 HEQWMAQHGRVYKNAAEKAHRFEIFRANVERIESFNAENHKFKLGVNQFADLTNEEFKT- 99
Query: 112 YLGTRSDAKRRLMKSKVASQR-YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ L SK+AS + + + +P ++DWR KGAV P+KDQG CGSCWAFS
Sbjct: 100 --------RNTLKPSKMASTKSFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSA 151
Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI K+ TG+LISLSEQE+VDCD + GCNGG MD AF++II+N G+ +E +YP
Sbjct: 152 VAATEGITKLSTGKLISLSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYP 211
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y A+ C+ + + SI GYEDV+ E +L KA A+QP++VAI+AG AFQ Y SG
Sbjct: 212 YKAADGTCNTKKAASHAASITGYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSG 271
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG+CG+ LDHGV VGYG T +G YWLV+NSWG+ WGE+GY++++R+ +D G CG
Sbjct: 272 VFTGDCGTDLDHGVTLVGYGATSDGTKYWLVKNSWGTSWGEDGYIRMERD-VDAKEGLCG 330
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 331 IAMDASYPT 339
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 157/309 (50%), Positives = 220/309 (71%), Gaps = 10/309 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++G+ KR++IFKDN+ I+ N +++++YK+ +N+FADLTNEE+RA
Sbjct: 39 HEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA 98
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K + ++ S +Y + +P +VDWR+KGAV P+KDQG CGSCWAFS
Sbjct: 99 ----SRNRFKAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSA 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGI ++ TG+LISLSEQELVDCD + GC+GGLMD AF+FI QN G+ +E +YP
Sbjct: 153 VAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYP 212
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + I+GYEDV +E +L+KAVA QP++VAI+A G FQ Y SG
Sbjct: 213 YAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSG 272
Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG+CG+ LDHGV AVGYGT ++G+ YWLV+NSW + WGE GY+++QR++ G CG
Sbjct: 273 VFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVT-VKEGLCG 331
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 332 IAMQASYPT 340
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 178/337 (52%), Positives = 230/337 (68%), Gaps = 14/337 (4%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGL 97
+ S R + EV TIY+ WL +HGK NG+G E+RF+IFKDNL+ I+EHNS NR+Y GL
Sbjct: 28 TESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSYDRGL 87
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP-V 156
N+F+DLT +E++A YLG + + K S VA +RY K GD LP+ VDWRE+GAV P V
Sbjct: 88 NQFSDLTVDEFQASYLGGKIEKKSL---SDVA-ERYQYKEGDILPDEVDWRERGAVVPRV 143
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K QG CGSCWAF+ AVEGIN+I TGEL+SLSEQEL+DCDR K N GC GG +AF+F
Sbjct: 144 KRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWAFEF 203
Query: 216 IIQNGGMDSEQDYPYLGAEN-KCDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
I +NGG+ +++DY Y G + C + +VV+I+G+E V DEMSLKKAV+ QP+S
Sbjct: 204 IKENGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQPIS 263
Query: 274 VAIEAGGRAFQHYESGVFTGECGSAL-DHGVVAVGYGTENGV-DYWLVRNSWGSDWGENG 331
V I A Y+SGV+ G C + DH V+ VGYGT + DYWL+RNSWG WGE G
Sbjct: 264 VMISAAN--MSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPGWGEGG 321
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
Y++LQRN + TGKC +A+ YP+K + S P
Sbjct: 322 YLRLQRN-FNEPTGKCAVAVAPVYPIKTNSASNLLSP 357
>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 151/217 (69%), Positives = 183/217 (84%), Gaps = 1/217 (0%)
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
P SVDWR+KG + VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+ N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GC+GGLMDYAF+F+I NGG+DSE+DYPY CD R+NAKVV ID YEDV +E
Sbjct: 62 QGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNGVCDQYRKNAKVVVIDSYEDVPVNNEK 121
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGLDYWIVRN 181
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
SWG+DWGE GY+++QRN+ +++G CG+A+E SYPVK
Sbjct: 182 SWGADWGEKGYLRVQRNVA-SSSGLCGLAIEPSYPVK 217
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 174/359 (48%), Positives = 222/359 (61%), Gaps = 25/359 (6%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ I LV L I +S + N H+ S S R ++ W+ K+G
Sbjct: 1 MASIGKKQHILALVLLLSICTSQ------VMSRNLHEASMSER--------HEQWMKKYG 46
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K +KR IFKDN+ FI+ N+ NR YK+ +N AD TNEE+ A + G +
Sbjct: 47 KVYKDAAEKQKRLLIFKDNVEFIESFNAAGNRPYKLSINHLADQTNEEFVASHNGYKHKG 106
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K Y G +P +VDWRE GAV VKDQG CGSCWAFSTVAA EGI +
Sbjct: 107 SHSQTPFK-----YENVTG--VPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQ 159
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
I T L+SLSEQELVDCD ++ GC+GG M+ F+FII+NGG+ SE +YPY + CD
Sbjct: 160 ITTSMLMSLSEQELVDCD-SVDHGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDA 218
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
++ + I GYE V E +L+KAVA+QPVSV I+AGG AFQ Y SGVFTG+CG+ L
Sbjct: 219 NKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQL 278
Query: 300 DHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
DHGV AVGYG T++G YW+V+NSWG+ WGE GY+++QR D G CGIAM+ASYP
Sbjct: 279 DHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRG-TDAQEGLCGIAMDASYPT 336
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 164/348 (47%), Positives = 222/348 (63%), Gaps = 25/348 (7%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVF F ++A + +S H+ W+ ++GK E R
Sbjct: 16 LVFGFLAFEANARTLEDVSLKERHEQ----------------WMTQYGKVYTDSYEKELR 59
Query: 73 FQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
IFK+N++ I+ N+ N+ YK+G+N+FADLTNEE++A R+ K + + +
Sbjct: 60 SNIFKENVQRIEAFNNAGNKPYKLGINQFADLTNEEFKA-----RNRFKGHMCSNSTRTP 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ + +P S+DWR+KGAV P+KDQG CG CWAFS VAA EGI K+ TG+LISLSEQ
Sbjct: 115 TFKYEDVSSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQ 174
Query: 192 ELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
ELVDCD K ++ GC GGLMD AF+FI+QN G+++E YPY G + C+ + SI
Sbjct: 175 ELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIK 234
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG- 309
G+EDV E +L KAVA+QP+SVAI+A G FQ Y SG+FTG CG+ LDHGV AVGYG
Sbjct: 235 GFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGV 294
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+++G YWLV+NSWG WGE GY+++QR++ G CGIAM+ASYP
Sbjct: 295 SDDGTKYWLVKNSWGEQWGEEGYIRMQRDVA-AEEGLCGIAMQASYPT 341
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 173/359 (48%), Positives = 230/359 (64%), Gaps = 30/359 (8%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
I LV L I +S + N H+ S S R ++ W+ K+GK
Sbjct: 10 ILALVLLLSICTSQ------VMSRNLHEASMSER--------HEQWMKKYGKVYKDAAEK 55
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+KR IFKDN+ FI+ N+ N+ YK+ +N AD TNEE+ A + G K K
Sbjct: 56 QKRLLIFKDNVEFIESFNAAGNKPYKLSINHLADQTNEEFVASHNG---------YKYKG 106
Query: 129 ASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+ + K G+ ++P +VDWR+ GAV VKDQG CGSCWAFSTVAA EGI +I TG L+
Sbjct: 107 SHSQTPFKYGNVTDIPTAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLM 166
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SLSEQELVDCD ++ GC+GGLM+ F+FII+NGG+ SE +YPY + CD S+ +
Sbjct: 167 SLSEQELVDCD-SVDHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPA 225
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
I GYE V E +L++AVA+QPVSV+I+AGG FQ Y SGVFTG+CG+ LDHGV V
Sbjct: 226 AQIKGYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVV 285
Query: 307 GYG-TENGV-DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
GYG T++G +YW+V+NSWG+ WGE GY+++QR +D G CGIAM+ASYP+ S +S
Sbjct: 286 GYGTTDDGTHEYWIVKNSWGTQWGEEGYIRMQRG-IDAQEGLCGIAMDASYPMGKSSDS 343
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 175/345 (50%), Positives = 226/345 (65%), Gaps = 24/345 (6%)
Query: 35 NHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYK 94
+HD +S +D + +Y+ W +H + +G +RF +F++N+R I E N + YK
Sbjct: 34 DHDLAS----EDSLWALYERWREQH-TVARDLGEKARRFNVFRENVRLIHEFNRGDAPYK 88
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRL---------MKSKVASQRYACKAGDELPESV 145
+ LN+F D+T +E+R Y +R R M AS R ++P SV
Sbjct: 89 LRLNRFGDMTADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVR-------DVPPSV 141
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DWR+KGAV VKDQG CGSCWAFST+AAVEGIN I + L SLSEQ+LVDCD K NAGCN
Sbjct: 142 DWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCN 201
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GGLMDYAFQ+I ++GG+ +E YPY A +++ + VV+IDGYEDV DE +LKK
Sbjct: 202 GGLMDYAFQYIAKHGGVAAEDAYPYK-ARQASSCNKKPSAVVTIDGYEDVPANDETALKK 260
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWG 324
AVA QPV+VAIEA G FQ Y GVF G+CG+ LDHGV AVGYGT +G YW+V+NSWG
Sbjct: 261 AVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWG 320
Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPH 369
+WGE GY++++R++ D G CGIAMEASYPVK S N H
Sbjct: 321 PEWGEKGYIRMKRDVKDKE-GLCGIAMEASYPVKTSANPKHAGAH 364
>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 150/217 (69%), Positives = 183/217 (84%), Gaps = 1/217 (0%)
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
P SVDWR+KG + VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+ N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GC+GGLMDYAF+F+I NGG+DSE+DYPY + CD R+NAKVV ID YEDV +E
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
SWG++WGE GY+++QRN+ +++G CG+A E SYPVK
Sbjct: 182 SWGANWGEKGYLRVQRNIA-SSSGLCGLATEPSYPVK 217
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 170/351 (48%), Positives = 225/351 (64%), Gaps = 11/351 (3%)
Query: 13 LVFLFF--ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
++FL F S+S D S++ Y + ++ ++++W KH K
Sbjct: 9 VLFLAFAACSASHHRDPSVVGYSQE-----DLALPNRLVNLFKSWSVKHRKIYVSPKEKL 63
Query: 71 KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
KR+ IFK NL I E N N +Y +GLN+FAD+T+EE++A +LG + R +++ +
Sbjct: 64 KRYGIFKQNLMHIAETNRKNGSYWLGLNQFADITHEEFKANHLGLKQGLSRMGAQTRTPT 123
Query: 131 Q-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
RYA A LP SVDWR KGAV PVK+QG CGSCWAFS+VAAVEGIN+IVTG+L+SLS
Sbjct: 124 TFRYAAAA--NLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLS 181
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
EQEL+DCD ++ GC GGLMD+AF +I+ + G+ +E DYPYL E C + A VV+I
Sbjct: 182 EQELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTI 241
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
GYEDV E+SL KA+A QPVSV I AG R FQ Y+ GVF G C LDH + AVGYG
Sbjct: 242 TGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKGGVFDGSCSDELDHALTAVGYG 301
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
+ G +Y ++NSWG +WGE GYV+++ G CGI ASYPVKN+
Sbjct: 302 SSYGQNYITMKNSWGKNWGEQGYVRIKMG-TGKPEGVCGIYTMASYPVKNA 351
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 160/308 (51%), Positives = 211/308 (68%), Gaps = 10/308 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+ +HGK EKRF IFKDN+ FI+ N+ N+ YK+ +N ADLT +E++A
Sbjct: 40 HEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLADLTLDEFKA 99
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K+ + S +Y +P +VDWR KGAV P+KDQG CGSCWAFST
Sbjct: 100 ----SRNGYKKIDREFTTTSFKYENVTA--IPAAVDWRVKGAVTPIKDQGQCGSCWAFST 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGIN+I TG+L+SLSEQELVDCD K + GC GGLM+ F+FII+NGG+ SE +YP
Sbjct: 154 VAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETNYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y A+ C+ + V I GYE V E SL KAVA+QP+SV+I+A +F Y SG
Sbjct: 214 YKAADGSCNTAT-TTPVAKITGYEKVPVNSEKSLLKAVANQPISVSIDASDSSFMFYSSG 272
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
++TGECG+ LDHGV AVGYG+ NG DYW+V+NSWG+ WGE GY+++QR + G CGI
Sbjct: 273 IYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIA-AKEGLCGI 331
Query: 350 AMEASYPV 357
AM++SYP
Sbjct: 332 AMDSSYPT 339
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 158/307 (51%), Positives = 211/307 (68%), Gaps = 6/307 (1%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMY 112
W+ KHG+ + R+ +FK+N+ I+ NS+ RT+K+ +N+FADLTN+E+ +MY
Sbjct: 41 WMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMY 100
Query: 113 LGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
G + A ++K++ RY + LP SVDWR+KGAV P+K+QGSCG CWAFS V
Sbjct: 101 TGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAV 160
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
AA+EG +I G+LISLSEQ+LVDCD + GC GGLMD AF+ I GG+ +E DYPY
Sbjct: 161 AAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESDYPYK 219
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
G + C+ + N K SI GYEDV DE +L KAVA QPVSV IE GG FQ Y SGVF
Sbjct: 220 GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVF 279
Query: 292 TGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
TGEC + LDH V A+GYG + NG YW+++NSWG+ WGE+GY+++Q+++ D G CG+A
Sbjct: 280 TGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQ-GLCGLA 338
Query: 351 MEASYPV 357
M+ASYP
Sbjct: 339 MKASYPT 345
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 167/362 (46%), Positives = 236/362 (65%), Gaps = 30/362 (8%)
Query: 4 ASMFLAISTLV-FLFFISSSSAA-DMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHG 60
A++ +IS ++ F FF ++ AA D+S DD VM ++ W+A++
Sbjct: 95 ATLKASISAIIGFAFFCGAAMAARDLS----------------DDSVMVARHEQWMAQYS 138
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+ +RF++FK N++FI+ N+ N + +G+N+FADLTN+E+R+ T+++
Sbjct: 139 RVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQFADLTNDEFRS----TKTNK 194
Query: 120 KRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
+ K+ + RY + D LP ++DWR KGAV P+KDQG CG CWAFS VAA EGI
Sbjct: 195 GLKSSNMKIPTGFRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIV 254
Query: 179 KIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
KI TG+L+SL+EQELVDCD + GC GGLMD AF+FII+NGG+ +E YPY A+ KC
Sbjct: 255 KISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC 314
Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
+A +I GYEDV DE +L KAVA+QPVSVA++ G FQ Y GV TG CG+
Sbjct: 315 KSGSNSA--ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGT 372
Query: 298 ALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
LDHG+ A+GYG T +G YWL++NSWG+ WGENGY+++++++ D G CG+AME SYP
Sbjct: 373 DLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKR-GMCGLAMEPSYP 431
Query: 357 VK 358
+
Sbjct: 432 TE 433
>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
Length = 217
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 150/217 (69%), Positives = 182/217 (83%), Gaps = 1/217 (0%)
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
P SVDWR+KG + VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+ N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GC+GGLMDYAF+F+I NGG+DSE+DYPY + CD R+NAKVV ID YEDV +E
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
SWG+ WGE GY+++QRN+ +++G CG+A E SYPVK
Sbjct: 182 SWGAKWGEKGYLRVQRNIA-SSSGLCGLATEPSYPVK 217
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 158/309 (51%), Positives = 215/309 (69%), Gaps = 13/309 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK EKRF IFK N+ FI+ N+ N+ YK+G+N ADLT EE++A
Sbjct: 38 HEQWMAEYGKVYKDAAEKEKRFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC-GSCWAFS 169
+R+ KR ++++ + + +P ++DWR KGAV +KDQG C GSCWAFS
Sbjct: 98 ----SRNGLKRPY---ELSTTPFKYENVTAIPAAIDWRTKGAVTSIKDQGQCAGSCWAFS 150
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
TVAA EGI++I TG+L+SLSEQELVDCD K ++ GC GG M+ F+FII+NGG+ SE +Y
Sbjct: 151 TVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSEANY 210
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY + KC+ + + V I GYE V P E +L+KAVA+QPVSV+I+A G F Y S
Sbjct: 211 PYKAVDGKCN--KATSPVAQIKGYEKVPPNSEKTLQKAVANQPVSVSIDANGEGFMFYSS 268
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
G++ GECG+ LDHGV AVGYG NG DYWLV+NSWG+ WGE GYV++QR + + G CG
Sbjct: 269 GIYNGECGTELDHGVTAVGYGIANGTDYWLVKNSWGTQWGEKGYVRMQRGVAAKH-GLCG 327
Query: 349 IAMEASYPV 357
IA+++SYP
Sbjct: 328 IALDSSYPT 336
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 157/315 (49%), Positives = 218/315 (69%), Gaps = 9/315 (2%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
D + ++ W+ + + + E R++IFK+N++ I+ N + ++YK+G+N+FADL
Sbjct: 32 DASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFADL 91
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TNEE++ +R+ K + S+ RY +P S+DWR++GAV +KDQG CG
Sbjct: 92 TNEEFKT----SRNRFKGHMCSSQAGPFRYENITA--VPSSMDWRKEGAVTAIKDQGQCG 145
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
SCWAFS VAAVEGI ++ T +LISLSEQELVDCD K + GC GGLMD AF+FI QN G+
Sbjct: 146 SCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQGL 205
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E +YPY G++ C+ + I+G+EDV +E +L KAVA QPVSVAI+AGG
Sbjct: 206 TTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGALMKAVAKQPVSVAIDAGGFE 265
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
FQ Y SG+FTG+CG+ LDHGV AVGYG NG++YWLV+NSWG+ WGE GY+++Q++ +D
Sbjct: 266 FQFYSSGIFTGDCGTELDHGVAAVGYGESNGMNYWLVKNSWGTQWGEEGYIRMQKD-IDA 324
Query: 343 NTGKCGIAMEASYPV 357
G CGIAM+ASYP
Sbjct: 325 KEGLCGIAMQASYPT 339
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 158/308 (51%), Positives = 211/308 (68%), Gaps = 9/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++G+ R+ IFK+N+ ID NS ++YK+G+N+FADLTNEE++A
Sbjct: 39 HEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKA 98
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K + + RY + +P +VDWR++GAV PVKDQG CG CWAFS
Sbjct: 99 ----SRNRFKGHMCSPQAGPFRY--ENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSA 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGINK+ TG+LISLSEQE+VDCD K + GCNGGLMD AF+FI QN G+ +E +YP
Sbjct: 153 VAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 212
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ ++ I G+EDV E +L KAVA QPVSVAI+AGG FQ Y SG
Sbjct: 213 YKGTDGTCNTNKAAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSG 272
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
+FTG C + LDHGV AVGYG +G YWLV+NSWG+ WGE GY+++Q++ + G CGI
Sbjct: 273 IFTGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGI 331
Query: 350 AMEASYPV 357
AM+ASYP
Sbjct: 332 AMQASYPT 339
>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 149/217 (68%), Positives = 183/217 (84%), Gaps = 1/217 (0%)
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
P SVDWR+KG + VKDQGSCGSCWAFS VAA+E IN IVTG+LISLSEQELVDCD+ N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCDKSYN 61
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GC+GGLMDYAF+F+I NGG+D+E+DYPY + CD R+NAKVV ID YEDV +E
Sbjct: 62 QGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
SWG+ WGE GY+++QRN+ +++G CG+A E SYPVK
Sbjct: 182 SWGAKWGEKGYLRVQRNIA-SSSGLCGLATEPSYPVK 217
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 215/309 (69%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK EKRF++FK+N+ +I+ +N+ N+ YK+G+N+FADLT+EE+
Sbjct: 39 HEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLGINQFADLTSEEF-- 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ S + + + LP+S+DWR+KGAV P+K+QGSCG CWAFS
Sbjct: 97 --IVPRNRFNGHTRSSNTRTTTFKYENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
+AA EGI+KI TG+L+SLSEQE+VDCD K + GC GG MD AF+FIIQN G+++E YP
Sbjct: 155 IAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + KC+ +I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y+SG
Sbjct: 215 YKGVDGKCNIKEEAVHAATITGYEDVPINNEKALQKAVANQPVSVAIDASGADFQFYKSG 274
Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
+FTG CG+ LDHGV AVGYG N G YWLV+NSWG++WGE GY+ +QR + G CG
Sbjct: 275 IFTGSCGTELDHGVTAVGYGENNEGTKYWLVKNSWGTEWGEEGYIMMQRGVKAVE-GICG 333
Query: 349 IAMEASYPV 357
IAM ASYP
Sbjct: 334 IAMMASYPT 342
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 176/337 (52%), Positives = 232/337 (68%), Gaps = 14/337 (4%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGL 97
+ S R + V+T+Y+ WL ++GK NG+G E+RF+IFKDNL+ I+EHNS NR+Y+ GL
Sbjct: 28 TESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP-V 156
NKF+DLT +E++A YLG + + K S VA +RY K GD LP+ VDWRE+GAV P V
Sbjct: 88 NKFSDLTADEFQASYLGGKMEKKSL---SDVA-ERYQYKEGDVLPDEVDWRERGAVVPRV 143
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K QG CGSCWAF+ AVEGIN+I TGEL+SLSEQEL+DCDR N GC GG +AF+F
Sbjct: 144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203
Query: 216 IIQNGGMDSEQDYPYLGAEN-KCDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
I +NGG+ S++ Y Y G + C + +VV+I+G+E V DEMSLKKAVA QP+S
Sbjct: 204 IKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPIS 263
Query: 274 VAIEAGGRAFQHYESGVFTGECGSAL-DHGVVAVGYGTENGV-DYWLVRNSWGSDWGENG 331
V I A Y+SGV+ G C + DH V+ VGYGT + DYWL+RNSWG +WGE G
Sbjct: 264 VMISAAN--MSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGG 321
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
Y++LQRN + TGKC +A+ YP+K++ +S P
Sbjct: 322 YLRLQRNFHEP-TGKCAVAVAPVYPIKSNSSSHLLSP 357
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 158/308 (51%), Positives = 210/308 (68%), Gaps = 9/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++G+ R+ IFK+N+ ID NS ++YK+G+N+FADLTNEE++A
Sbjct: 5 HEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKA 64
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K + + RY + +P +VDWR++GAV PVKDQG CG CWAFS
Sbjct: 65 ----SRNRFKGHMCSPQAGPFRYENVSA--VPSTVDWRKEGAVTPVKDQGQCGCCWAFSA 118
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGINK+ TG+LISLSEQE+VDCD K + GCNGGLMD AF+FI QN G+ +E +YP
Sbjct: 119 VAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 178
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + I G+EDV E +L KAVA QPVSVAI+AGG FQ Y SG
Sbjct: 179 YKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSG 238
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
+FTG C + LDHGV AVGYG +G YWLV+NSWG+ WGE GY+++Q++ + G CGI
Sbjct: 239 IFTGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGI 297
Query: 350 AMEASYPV 357
AM+ASYP
Sbjct: 298 AMQASYPT 305
>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 150/217 (69%), Positives = 181/217 (83%), Gaps = 1/217 (0%)
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
P SVDWR+KG + VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+ N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GC+GGLMDYAF+F+I NGG+DSE+DYPY + CD R+NAKVV ID YEDV +E
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
SWG+ WGE GY+++QRN+ + +G CG+A E SYPVK
Sbjct: 182 SWGAKWGEKGYLRVQRNIARS-SGLCGLATEPSYPVK 217
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 323 bits (829), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 159/306 (51%), Positives = 213/306 (69%), Gaps = 8/306 (2%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYL 113
W+A++ K EKRF+IFK+N+ +I+ NS N++YK+ +N+FADLTNEE+ +
Sbjct: 42 WMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLDINQFADLTNEEF----I 97
Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
R+ K + S + + + +P +VDWR+KGAV P+KDQG CG CWAFS VAA
Sbjct: 98 APRNRFKGHMCSSITRTTTFKYENVTVIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAA 157
Query: 174 VEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
EGI+ + G+LISLSEQE+VDCD K + GC GG MD AF+FIIQN G+++E +YPY
Sbjct: 158 TEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFIIQNHGLNTEPNYPYKA 217
Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
A+ KC+ +I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y+SGVFT
Sbjct: 218 ADGKCNAKAAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFT 277
Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
G CG+ LDHGV AVGYG + +G +YWLV+NSWG++WGE GY+++QR + G CGIAM
Sbjct: 278 GSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRG-VKAEEGLCGIAM 336
Query: 352 EASYPV 357
ASYP
Sbjct: 337 MASYPT 342
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 323 bits (829), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 156/309 (50%), Positives = 214/309 (69%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++ K E+RF+IFK+N+ +I+ +N+ N+ Y +G+N+FADLTNEE+
Sbjct: 39 HEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEF-- 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + +P +VDWR+KGAV P+KDQG CG CWAFS
Sbjct: 97 --IAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + G+LISLSEQE+VDCD K + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + KC+ V +I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y+SG
Sbjct: 215 YKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG + +G +YWLV+NSWG++WGE GY+++QR + G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRG-VKAEEGLCG 333
Query: 349 IAMEASYPV 357
IAM ASYP
Sbjct: 334 IAMMASYPT 342
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 323 bits (829), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 164/326 (50%), Positives = 218/326 (66%), Gaps = 15/326 (4%)
Query: 37 DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG 96
D+S ++ YQ W+ K+G+ E+RF I++ N+++ID NS+N ++ +
Sbjct: 4 DYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLA 63
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVN 154
N FADLTNEE++A YLG ++ V+ + G+ LP +VDWR++GAV
Sbjct: 64 ENNFADLTNEEFKATYLGYKT----------VSIPDTCFRYGNMVNLPTNVDWRQEGAVT 113
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAF 213
P+K+QG CGSCWAFS VAAVEGINKI G+LISLSEQELVDCD N GCNGG M AF
Sbjct: 114 PIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAF 173
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
+FI + G+ +E +YPY GAE+ C+ + + VSI GYE V DE SLK AVA+QPVS
Sbjct: 174 EFI-KRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVS 232
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYV 333
VAI+A G FQ Y G+F+G CG+ L+HGV VGYG + YWLV+NSWG+DWGE+GY+
Sbjct: 233 VAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYI 292
Query: 334 KLQRNLLDTNTGKCGIAMEASYPVKN 359
+++R+ D G CGIAM ASYP K+
Sbjct: 293 RMKRDSTD-RQGTCGIAMMASYPTKD 317
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 323 bits (829), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 158/309 (51%), Positives = 215/309 (69%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+A++ K EKRF+IFK+N+ +I+ +N+ N+ YK+G+N+FADLTNEE+
Sbjct: 39 HEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLGINQFADLTNEEF-- 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + LP +VDWR+KGAV P+KDQG CG CWAFS
Sbjct: 97 --IAPRNRFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + +G+LISLSEQE+VDCD K + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEANYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + KC+ + +I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y++G
Sbjct: 215 YKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKTG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG + +G YWLV+NSWG++WGE GY+ +QR + G CG
Sbjct: 275 VFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRG-VKAQEGLCG 333
Query: 349 IAMEASYPV 357
IAM ASYP
Sbjct: 334 IAMMASYPT 342
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 323 bits (828), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 165/311 (53%), Positives = 208/311 (66%), Gaps = 12/311 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
Y WL ++G+ + RF I+ N++FI+ NS N ++K+ NKFADLTN+E+ ++
Sbjct: 46 YDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQNLSFKLTDNKFADLTNDEFNSI 105
Query: 112 YLG--TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
YLG RS +R L S +LP++VDWRE GAV P+KDQG CGSCWAFS
Sbjct: 106 YLGYQIRSYKRRNLSHMHENST--------DLPDAVDWRENGAVTPIKDQGQCGSCWAFS 157
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
VAAVEGINKI TG L+SLSEQELVDCD N GCNGG M+ AF FI GG+ +E DY
Sbjct: 158 AVAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDY 217
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY G + C+ ++ + V I GYE V +E SLK AV+ QPVSVAI+A G FQ Y
Sbjct: 218 PYKGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSE 277
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
GVF+G CG L+HGV VGYG NG YWLV+NSWG WGE+GY++++R+ DT G CG
Sbjct: 278 GVFSGYCGIQLNHGVTIVGYGDNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTK-GMCG 336
Query: 349 IAMEASYPVKN 359
IAME SYP+K+
Sbjct: 337 IAMEPSYPIKD 347
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 323 bits (828), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 156/309 (50%), Positives = 214/309 (69%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++ K E+RF+IFK+N+ +I+ +N+ N+ Y +G+N+FADLTNEE+
Sbjct: 39 HEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEF-- 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + +P +VDWR+KGAV P+KDQG CG CWAFS
Sbjct: 97 --IAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + G+LISLSEQE+VDCD K + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + KC+ V +I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y+SG
Sbjct: 215 YKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG + +G +YWLV+NSWG++WGE GY+++QR + G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRG-VKAEEGLCG 333
Query: 349 IAMEASYPV 357
IAM ASYP
Sbjct: 334 IAMMASYPT 342
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 323 bits (828), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 171/320 (53%), Positives = 213/320 (66%), Gaps = 15/320 (4%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLT 104
D ++ +++ W+AK+ K +RF++FKDNL IDE N T Y +GLN FADLT
Sbjct: 66 DRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLT 125
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRY----ACKAGDELPESVDWREKGAVNPVKDQG 160
++E++A YLG L+ + + R+ GDE+P SVDWR+KGAV VK+QG
Sbjct: 126 HDEFKATYLG--------LLPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 177
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFSTVAAVEGIN+IVTG L SLSEQ+LVDC N GC+GG+MD AF FI
Sbjct: 178 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 237
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKV-VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
G+ SE+ YPYL E CD R+ +V V+I GYEDV DE +L KA+A QPVSVAIEA
Sbjct: 238 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 297
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
GR FQ Y GVF G CGS LDHGV AVGYG+ G DY +V+NSWG+ WGE GY++++R
Sbjct: 298 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGEKGYIRMKRG- 356
Query: 340 LDTNTGKCGIAMEASYPVKN 359
G CGI ASYP K+
Sbjct: 357 TGKPEGLCGINKMASYPTKD 376
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 323 bits (828), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 162/291 (55%), Positives = 198/291 (68%), Gaps = 3/291 (1%)
Query: 71 KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
+RF++FKDNL ID+ N +Y +GLN+FADLT++E++A YLG R K +
Sbjct: 48 RRFEVFKDNLNHIDDINKKVTSYWLGLNEFADLTHDEFKATYLGLTPPPTRSNSKHYSSE 107
Query: 131 Q-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
+ RY + E+P+ +DWR+K AV VK+QG CGSCWAFSTVAAVEGIN IVTG L SLS
Sbjct: 108 EFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLS 167
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
EQEL+DC N GCNGGLMDYAF +I GG+ +E+ YPY E CD + A VV+I
Sbjct: 168 EQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDCDEG-KGAAVVTI 226
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
GYEDV DE +L KA+A QPVSVAIEA GR FQ Y GVF G CG LDHGV AVGYG
Sbjct: 227 SGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYG 286
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
T G DY +V+NSWG WGE GY++++R G CGI ASYP K++
Sbjct: 287 TSKGQDYIIVKNSWGPHWGEKGYIRMKRG-TGKGEGLCGINKMASYPTKDN 336
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 323 bits (828), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 225/362 (62%), Gaps = 17/362 (4%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LV L F+SS++ I +D +D+ + +Y+ W H + G +R
Sbjct: 54 LVALVFVSSAAVELCRAIDFDER-----DLASDEALWDLYERWQTHH-RVHRHHGEKGRR 107
Query: 73 FQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV--- 128
F FK+N+RFI HN +R Y++ LN+F D+ EE+R+ + +R + RR
Sbjct: 108 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAARAG 167
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
A + + + P SVDWR++GAV VKDQG CGSCWAFSTV AVEGIN I TG L SL
Sbjct: 168 AVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASL 227
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD---PSRRNAK 245
SEQEL+DCD N GC GGLM+ AF+FI GG+ +E YPY + CD R
Sbjct: 228 SEQELIDCDTDEN-GCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGV 286
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VV IDG++ V E +L KAVA QPVSVA++AGG+AFQ Y GVFTG+CG+ LDHGV A
Sbjct: 287 VVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAA 346
Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
VGYG ++G YW+V+NSWG+ WGE GY+++QR N G CGIAMEAS+P+K S N A
Sbjct: 347 VGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGA--GNGGLCGIAMEASFPIKTSPNPA 404
Query: 365 KP 366
P
Sbjct: 405 DP 406
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 171/320 (53%), Positives = 213/320 (66%), Gaps = 15/320 (4%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLT 104
D ++ +++ W+AK+ K +RF++FKDNL IDE N T Y +GLN FADLT
Sbjct: 80 DRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLT 139
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRY----ACKAGDELPESVDWREKGAVNPVKDQG 160
++E++A YLG L+ + + R+ GDE+P SVDWR+KGAV VK+QG
Sbjct: 140 HDEFKATYLG--------LLPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 191
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFSTVAAVEGIN+IVTG L SLSEQ+LVDC N GC+GG+MD AF FI
Sbjct: 192 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 251
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKV-VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
G+ SE+ YPYL E CD R+ +V V+I GYEDV DE +L KA+A QPVSVAIEA
Sbjct: 252 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 311
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
GR FQ Y GVF G CGS LDHGV AVGYG+ G DY +V+NSWG+ WGE GY++++R
Sbjct: 312 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGEKGYIRMKRG- 370
Query: 340 LDTNTGKCGIAMEASYPVKN 359
G CGI ASYP K+
Sbjct: 371 TGKPEGLCGINKMASYPTKD 390
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 173/360 (48%), Positives = 232/360 (64%), Gaps = 27/360 (7%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKH 59
MA+ S+ L I+ L +F S+ A +++ D +M + ++ W+A++
Sbjct: 1 MASNSLKLLIA-LALVFATSAYLATSRTLL---------------DSLMAVRHEQWMAQY 44
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
G+ KR+ IFK+N+ +I+ N + YK+G+N FADLTN+E+ + +R+
Sbjct: 45 GRVYKNEVEKTKRYNIFKENVEYIESFNKAGTKPYKLGINAFADLTNKEF----IASRNG 100
Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
S RY + +P +VDWR+KGAV PVKDQG CG CWAFS VAA+EGI
Sbjct: 101 YILPHECSSNTPFRY--ENVSAVPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGIT 158
Query: 179 KIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
K+ TG LISLSEQELVDCD K I+ GC GGLMD AF FII N G+ +E +YPY G + C
Sbjct: 159 KLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSC 218
Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
S+ + I GYEDV E +L+KAVA+QPVSVAI+AGG FQ Y SGVFTGECG+
Sbjct: 219 KKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGT 278
Query: 298 ALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
LDHGV AVGYG E+G YWLV+NSWG+ WGE GY+++Q++ ++ G CGIAM++SYP
Sbjct: 279 ELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKD-IEAKEGLCGIAMQSSYP 337
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 167/354 (47%), Positives = 236/354 (66%), Gaps = 22/354 (6%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGM 66
+ +S ++ +I +S+ ++ H +S T+ VM Y+TWL ++G+
Sbjct: 5 ITLSIVILNLWIIASACPEI--------HTKNS---TNPAVMKKRYETWLKRYGRHYRDR 53
Query: 67 GHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
E RF I++ N+++I+ +NS N +YK+ N+FAD+TNEE+++ YLG L +
Sbjct: 54 EEWEVRFDIYQSNVQYIEFYNSQNYSYKLIDNRFADITNEEFKSTYLGY-------LPRF 106
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+V ++ K G ELP+S+DWR+KGAV VKDQG CGSCWAFS VAAVEGINKI T L+
Sbjct: 107 RVQTEFRYHKHG-ELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLV 165
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQ+L+DCD K N GC GG M AF +I ++GG+ + ++YPY G + C+ S+
Sbjct: 166 SLSEQQLIDCDIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNN 225
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
V+I GYE V +E LK AVA QPVS+A +AGG AFQ Y G+F+G CG L+HG+
Sbjct: 226 AVTISGYESVPARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTI 285
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
VGYG ENG YW+V+NSW +DWGE+GYV+++R+ D + G CGIAM+A+YPVK+
Sbjct: 286 VGYGEENGDKYWIVKNSWANDWGESGYVRMKRDTKDKD-GTCGIAMDATYPVKH 338
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 175/369 (47%), Positives = 228/369 (61%), Gaps = 19/369 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LV L F+SS++ I +D +D+ + +Y+ W H + G +R
Sbjct: 10 LVALVFVSSAAVELCRAIDFDER-----DLASDEALWDLYERWQTHH-RVHRHHGEKGRR 63
Query: 73 FQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV--- 128
F FK+N+RFI HN +R Y++ LN+F D+ EE+R+ + +R + RR
Sbjct: 64 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAARAG 123
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
A + + + P SVDWR++GAV VKDQG CGSCWAFSTV AVEGIN I TG L SL
Sbjct: 124 AVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASL 183
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD---PSRRNAK 245
SEQEL+DCD N GC GGLM+ AF+FI GG+ +E YPY + CD R
Sbjct: 184 SEQELIDCDTDEN-GCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGV 242
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VV IDG++ V E +L KAVA QPVSVA++AGG+AFQ Y GVFTG+CG+ LDHGV A
Sbjct: 243 VVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAA 302
Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
VGYG ++G YW+V+NSWG+ WGE GY+++QR N G CGIAMEAS+P+K S N A
Sbjct: 303 VGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGA--GNGGLCGIAMEASFPIKTSPNPA 360
Query: 365 KP--KPHSS 371
P KP +
Sbjct: 361 DPPRKPRRA 369
>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 148/217 (68%), Positives = 182/217 (83%), Gaps = 1/217 (0%)
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
P SVDWR+KG + VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+ N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GC+GGLMDYAF+F+I NGG+D+E+DYPY CD R+NAKVV+ID YEDV +E
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNEK 121
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVV GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGTENGMDYWIVRN 181
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
SWG+ WGE GY+++QRN+ +++G CG+A+E SYPVK
Sbjct: 182 SWGAKWGEKGYLRVQRNVA-SSSGLCGLAIEPSYPVK 217
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 173/358 (48%), Positives = 233/358 (65%), Gaps = 13/358 (3%)
Query: 16 LFFISSSSAADMSII-SYD-NNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRF 73
L FIS S A ++ ++D N HD S + + +Y+ W + H T N + RF
Sbjct: 6 LLFISLSLALIFTVANTFDFNEHDLES----EKSLWNLYERWRSHHTVTRN-LDEKHNRF 60
Query: 74 QIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY 133
+FK N+ + N L++ YK+ LNKF D+TN E+R +Y ++ R + +
Sbjct: 61 NVFKANVMHVHNTNKLDKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENGTF 120
Query: 134 ACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQEL 193
+ ++P S+DWR KGAV VKDQG CGSCWAFST+AAVEGIN+I T +L+SLSEQ+L
Sbjct: 121 MYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQL 180
Query: 194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYE 253
VDCD + N GCNGGLM+YAF+FI QN G+ +E +YPY + CD + + K VSIDG+E
Sbjct: 181 VDCDTEENEGCNGGLMEYAFEFIKQN-GITTESNYPYAAKDGTCDVEKED-KAVSIDGHE 238
Query: 254 DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TEN 312
+V +E +L KA A QPVSVAI+AGG FQ Y GVFTG C + L+HGV VGYG T++
Sbjct: 239 NVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGYGVTQD 298
Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
YW+++NSWGS+WGE GY+++QR + + G CGIAMEASYP+K S S KP S
Sbjct: 299 RTKYWIMKNSWGSEWGEQGYIRMQRG-ISSREGLCGIAMEASYPIKKS--STKPTESS 353
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 153/273 (56%), Positives = 200/273 (73%), Gaps = 7/273 (2%)
Query: 87 NSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVD 146
N N+ YK+G+NKFADLTNEE++A +R+ K + S + + + + +P +VD
Sbjct: 4 NVNNKLYKLGINKFADLTNEEFKA----SRNKFKGHMCSSIIRTTTFKYENASAIPSTVD 59
Query: 147 WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCN 205
WR+KGAV PVK+QG CGSCWAFS VAA EGI+++ TG+L+SLSEQEL+DCD K ++ GC
Sbjct: 60 WRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCE 119
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GGLMD AF+FIIQN G+ +E YPY G + C+ + + V+I GYEDV +E++L+K
Sbjct: 120 GGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQK 179
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWG 324
AVA+QP+SVAI+A G FQ Y SGVFTG CG+ LDHGV AVGYG N G YWLV+NSWG
Sbjct: 180 AVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWG 239
Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+DWGE GY+++QR +D G CGIAM+ASYP
Sbjct: 240 ADWGEEGYIRMQRG-IDAAEGLCGIAMQASYPT 271
>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
Length = 480
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 180/361 (49%), Positives = 234/361 (64%), Gaps = 20/361 (5%)
Query: 20 SSSSAADMSIISYDNNHDHSS--SWRTDDEVMTIYQTWLAKHGKTS-NGMG-HNEKRFQI 75
++++A DMSIISY+ H T+ E Y WLA++G S N +G +E+RF +
Sbjct: 18 AATAAPDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLV 77
Query: 76 FKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
F DNL+F+D HN+ +++G+N+ + + + +V +R
Sbjct: 78 FWDNLKFVDAHNARADERGGFRLGMNRLRRSHQRGVPRDLPRRQGRREEPRRRGEVPPRR 137
Query: 133 YACKAG----DELPESVDWREKGAVNP------VKDQGSCGSCWAFSTVAAVEGINKIVT 182
AG + +E G + VK G GSCWAFS V+ VE IN++VT
Sbjct: 138 GGGAAGVRRLEGEGRRRPRQEPGPMRSFSVHLSVKYFGQ-GSCWAFSAVSTVESINQLVT 196
Query: 183 GELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
GE+I+LSEQELV+C N+GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R
Sbjct: 197 GEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINR 256
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
NAKVVSIDG+EDV DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDH
Sbjct: 257 ENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDH 316
Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
GVVAVGYGT+NG DYW+VRNSWG WGE+GYV+++RN ++ TGKCGIAM ASYP K+
Sbjct: 317 GVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGA 375
Query: 362 N 362
N
Sbjct: 376 N 376
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 157/309 (50%), Positives = 215/309 (69%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+A++ K EKRF+IFK+N+ +I+ +N+ ++ YK+G+N+FADLTNEE+
Sbjct: 39 HEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLGINQFADLTNEEF-- 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + LP +VDWR+KGAV P+KDQG CG CWAFS
Sbjct: 97 --IAPRNKFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + +G+LISLSEQE+VDCD K + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEANYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + KC+ + +I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y++G
Sbjct: 215 YKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKTG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG + +G YWLV+NSWG++WGE GY+ +QR + G CG
Sbjct: 275 VFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRG-VKAQEGLCG 333
Query: 349 IAMEASYPV 357
IAM ASYP
Sbjct: 334 IAMMASYPT 342
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/353 (46%), Positives = 230/353 (65%), Gaps = 25/353 (7%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L F FF ++ AA D N D + R ++ W+A++ +
Sbjct: 9 LAVLSFAFFCGAALAA------RDLNEDSAMVAR--------HEQWMAQYSRVYKDAAEK 54
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF++FK N++FI+ N+ NR + +G+N+FADLTN+E+R T+++ + KV
Sbjct: 55 ARRFEVFKANVKFIESFNTGGNRKFWLGINQFADLTNDEFRT----TKTNKGFKPSLDKV 110
Query: 129 ASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
++ RY + D +P ++DWR GAV P+KDQG CG CWAFS VAA EGI KI TG+LIS
Sbjct: 111 STGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLIS 170
Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQELVDCD + GC GGLMD AF+FII+NGG+ +E +YPY A+ KC +A
Sbjct: 171 LSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNSA-- 228
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
+I GYEDV DE +L KAVA+QPVSVA++ G FQ Y GV TG CG+ LDHG+ A+
Sbjct: 229 ANIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 288
Query: 307 GYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GYG T +G YWL++NSWG+ WGENGY+++++++ D G CG+AME SYP +
Sbjct: 289 GYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISD-KKGMCGLAMEPSYPTE 340
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 172/363 (47%), Positives = 235/363 (64%), Gaps = 31/363 (8%)
Query: 1 MATASMFLAIS-TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
MA+ + + +S L+F+ +S A S+ H+ S R +D W+A++
Sbjct: 1 MASTNQYQYVSMALLFILAAWASQATSRSL------HEASMYERHED--------WMARY 46
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
G+ EKRF+IFKDN+ I+ N ++++TYK+ +N+FADLTNEE+R++
Sbjct: 47 GRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSL------- 99
Query: 119 AKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
R K+ + S+ K + +P ++DWR+KGAV P+KDQ CG CWAFS VAA EG
Sbjct: 100 --RNRFKAHICSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEG 157
Query: 177 INKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
I +I TG+LISLSEQELVDCD N GC+GGLMD AF+FI + G+ SE YPY G +
Sbjct: 158 ITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI-KIHGLASEATYPYEGDDG 216
Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
C+ + I GYEDV +E +L+KAVA QPV+VAI+AGG FQ Y SGVFTG+C
Sbjct: 217 TCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQC 276
Query: 296 GSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
G+ LDHGV AVGYG ++G+ YWLV+NSWG+ WGE GY+++QR++ G CGIAM+AS
Sbjct: 277 GTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQAS 335
Query: 355 YPV 357
YP
Sbjct: 336 YPT 338
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 166/350 (47%), Positives = 225/350 (64%), Gaps = 13/350 (3%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGMGH 68
++T +F+ + ++ S H SS D E M + W+ +HG+
Sbjct: 6 LTTTIFILLMLCNTCVIASESECPPTHKQKSS---DVEAMKKRFDGWVKRHGRKYKHNDE 62
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
E RF I++ N+++I N+ +Y + NKFADLTNEE+++ Y+G + ++S
Sbjct: 63 REVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLSTR-----LRSHN 117
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
RY + GD LPES DWR++GAV + DQG CG CWAF+ VAAVEGINKI +G+LISL
Sbjct: 118 TGFRYD-EHGD-LPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISL 175
Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
SEQEL+DCD K N GC GGLM+ A+ FII+NGG+ +EQDYPY G + C +
Sbjct: 176 SEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAA 235
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
SI GYE+V +E LK A A QPVSVAI+AGG +FQ Y GVF+G CG L+HGV VG
Sbjct: 236 SISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVG 295
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
YG E YW+V+NSWG+DWGE+GY++++R+ L + G CGIAM+ASYP+
Sbjct: 296 YGKETINKYWIVKNSWGADWGESGYIRMKRDTL-SKEGMCGIAMQASYPL 344
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 158/309 (51%), Positives = 209/309 (67%), Gaps = 8/309 (2%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
W+A+HG+T E+R IFK N+ +I+ N+ R Y++ N+FADLT+EE++AM+ G
Sbjct: 38 WMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNAGKRKYQLAANQFADLTHEEFKAMHTG 97
Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
+ +K A + + +P+SVDWR KGAV PVKDQG CGSCWAF+ VAAV
Sbjct: 98 FKPSGT----GAKKAGNGFRHGSLSSVPDSVDWRSKGAVTPVKDQGLCGSCWAFTVVAAV 153
Query: 175 EGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
EGI KIVTG+LISLSEQ+LVDCD + GC GG MD AF+FI+ NGG+ SE +YPY
Sbjct: 154 EGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGGITSEANYPYEEV 213
Query: 234 ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA-FQHYESGVFT 292
+ C+ + V +I+ +EDV DE +L+KAVA+QPVSV I+AG FQ Y GVF+
Sbjct: 214 QRLCNAHNASFVVATIESHEDVPTNDEKALRKAVANQPVSVGIDAGSSLDFQLYSGGVFS 273
Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
GECG+ LDH V VGYG T +G YWL +NSWG WGENGY++++R++ G CGIAM
Sbjct: 274 GECGTDLDHAVTVVGYGTTSDGTKYWLAKNSWGETWGENGYIRMERDVA-AKEGLCGIAM 332
Query: 352 EASYPVKNS 360
+ASYP +
Sbjct: 333 QASYPTAGT 341
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 155/308 (50%), Positives = 211/308 (68%), Gaps = 6/308 (1%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK EKRFQIFKDN+ FI+ N+ N+ YK+G+N ADLT EE++
Sbjct: 38 HENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKD 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
G + + K+ +Y + ++PE++DWR KGAV P+KDQG CG WAFS
Sbjct: 98 SRNGLKRTYEFSTTTFKLNGFKY--ENVTDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFS 155
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
T+AA EGI++I TG L+SLSEQELVDCD ++ GC GG M+ F+FII+NGG+ SE +YP
Sbjct: 156 TIAATEGIHQISTGNLVSLSEQELVDCD-SVDDGCEGGFMEDGFEFIIKNGGITSETNYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + + V I GYE V + E +LKKAVA+QPVSV+I A F Y SG
Sbjct: 215 YKGVDGTCNTTIAASPVAQIKGYEIVPSYSEEALKKAVANQPVSVSIHATNATFMFYSSG 274
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
++ GECG+ LDHGV AVGYGTENG DYW+V+NSWG+ WGE GY+++ R + + G CGI
Sbjct: 275 IYNGECGTDLDHGVTAVGYGTENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKH-GICGI 333
Query: 350 AMEASYPV 357
A+++SYP
Sbjct: 334 ALDSSYPT 341
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 322 bits (824), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 163/323 (50%), Positives = 216/323 (66%), Gaps = 15/323 (4%)
Query: 37 DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG 96
D+S ++ YQ W+ K+G+ E+RF I++ N+++ID NS+N ++ +
Sbjct: 4 DYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLA 63
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVN 154
N FADLTNEE++A YLG ++ V+ + G+ LP +VDWR++GAV
Sbjct: 64 ENNFADLTNEEFKATYLGYKT----------VSIPDTCFRYGNMVNLPTNVDWRQEGAVT 113
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAF 213
P+K+QG CGSCWAFS VAAVEGINKI G+LISLSEQELVDCD N GCNGG M AF
Sbjct: 114 PIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAF 173
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
+FI + G+ +E +YPY GAE+ C+ + + VSI GYE V DE SLK AVA+QPVS
Sbjct: 174 EFI-KRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVS 232
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYV 333
VAI+A G FQ Y G+F+G CG+ L+HGV VGYG + YWLV+NSWG+DWGE+GY+
Sbjct: 233 VAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYI 292
Query: 334 KLQRNLLDTNTGKCGIAMEASYP 356
+++R+ D G CGIAM ASYP
Sbjct: 293 RMKRDSTDKQ-GTCGIAMMASYP 314
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 152/269 (56%), Positives = 193/269 (71%), Gaps = 2/269 (0%)
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
+TN E+R+ Y G++ + R S+ A+ + + +P SVDWR+KGAV P+KDQG C
Sbjct: 1 MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFSTV AVEGIN I T +L+SLSEQELVDCD N GCNGGLM YAF+FI + GG+
Sbjct: 61 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 120
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+EQ YPY + CD S+ N+ VVSIDG+E V P +E +L KA A+QP+SVAI+AGG A
Sbjct: 121 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 180
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y GVF G CG+ LDHGV VGYGT +G YW+V+NSWG+DWGENGY++++R +
Sbjct: 181 FQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRG-IS 239
Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
G CGIA+EASYP+KNS + P S
Sbjct: 240 AKEGLCGIAVEASYPIKNSSTNPVGAPSS 268
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 152/317 (47%), Positives = 217/317 (68%), Gaps = 8/317 (2%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
D ++ ++ W+AK + +RF++FK N+ FI+ N+ NR + +G+N+F DLT
Sbjct: 30 DTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFNAENRKFWLGVNQFTDLT 89
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
N+E+RA T+++ ++ + + +Y+ + D LP +VDWR KG V P+KDQG CG
Sbjct: 90 NDEFRA----TKTNKGLKMSGGRAPTGFKYSNVSIDALPTAVDWRTKGVVTPIKDQGQCG 145
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGM 222
CWAFS V A EGI K+ TG+LISLSEQELVDCD ++ GC GG MD AF+FII+NGG+
Sbjct: 146 CCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVDQGCEGGEMDDAFKFIIKNGGL 205
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E +YPY + +C S + V +I GYEDV DE SL KAVA+QPVSVA++ G
Sbjct: 206 TTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVI 265
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQHY GV TG CG+ LDHG+ A+GYG T +G YWL++NSWG+ WGE+GY+++++++ D
Sbjct: 266 FQHYSGGVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLKNSWGTTWGESGYLRMEKDISD 325
Query: 342 TNTGKCGIAMEASYPVK 358
+G CG+AM+ SYP +
Sbjct: 326 -KSGMCGLAMQPSYPTE 341
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 164/295 (55%), Positives = 205/295 (69%), Gaps = 12/295 (4%)
Query: 80 LRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG 138
LRFIDEHN+ NR+YKVGLN+FADLT EE+R+ YLG + K+KV S RY +
Sbjct: 1 LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFTGGSN----KTKV-SNRYEPRVS 55
Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
LP VDWR GAV +K QG CG CWAFS +A VEGINKIVTG LISLSEQEL+ C
Sbjct: 56 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGG 115
Query: 199 KINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
N GCNGG + FQFII NGG+++ ++YPY + +C+ +N K V+ID Y +V
Sbjct: 116 TQNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPY 175
Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
+E +L+ AV QPVSVA++A G AF+HY SG+FTG CG+A+DH V VGYGTE G+DYW
Sbjct: 176 NNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYW 235
Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK-NSQNSAKPKPHSS 371
+V NSW + WGE GY+++ RN+ G CGIA SYPVK N+QN PKP+SS
Sbjct: 236 IVENSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVKYNNQN--YPKPYSS 286
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 320 bits (819), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 168/362 (46%), Positives = 228/362 (62%), Gaps = 15/362 (4%)
Query: 1 MATAS-MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
MA S + L +L F+ + SS+S D S++ Y + D + ++ D ++ +W KH
Sbjct: 1 MAMGSKLSLFFLSLGFVAYSSSASHNDPSVVGY-SQEDLALPYKLVD----LFSSWSVKH 55
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS-- 117
K KR+++FK NL+ I E N N +Y +GLN+FAD+ +EE+++ YLG ++
Sbjct: 56 SKIYVSPEEKVKRYEVFKQNLKHIVETNRRNGSYWLGLNQFADVAHEEFKSTYLGLKTGM 115
Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
D R A + + LP SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGI
Sbjct: 116 DGPAR------APTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGI 169
Query: 178 NKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
N+I TG+L SLSEQEL+DCD + GC GG MD+AF +I+ N G+ ++ DYPYL E C
Sbjct: 170 NQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTDDDYPYLMEEGYC 229
Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
+ +KVV+I GYEDV E+SL KA+A QP+SV I AG + FQ Y+ GVF G CG+
Sbjct: 230 KEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYKRGVFEGSCGT 289
Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDH + AVGYG+ +G DY +++NSWG WGE GY +++R G C I ASYP
Sbjct: 290 ELDHALTAVGYGSSDGQDYIIMKNSWGKSWGEQGYFRIKRG-TGKPEGVCSIYSMASYPT 348
Query: 358 KN 359
K
Sbjct: 349 KT 350
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 320 bits (819), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 167/354 (47%), Positives = 227/354 (64%), Gaps = 27/354 (7%)
Query: 10 ISTLVFLFFISSSSAA-DMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGMG 67
++ L F FF ++ AA D+S DD M ++ W+A++ +
Sbjct: 9 LAILGFAFFCGAALAARDLS----------------DDSAMVARHEQWMAQYSRVYKDAS 52
Query: 68 HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
+RF++FK N++FI+ N+ N + +G+N+FADLTN+E+R+ + T K MK
Sbjct: 53 EKARRFEVFKANVKFIESFNAGGNNKFWLGVNQFADLTNDEFRS--IKTNKGFKSSNMKI 110
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
RY + D LP ++DWR KGAV P+KDQG CG CWAFS VAA EGI KI TG+L+
Sbjct: 111 PTGF-RYENVSVDALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLV 169
Query: 187 SLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SL+EQELVDCD + GC GGLMD AF+FII NGG+ +E YPY A+ KC +A
Sbjct: 170 SLAEQELVDCDVHGEDQGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKCKSGSNSA- 228
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
+I GYEDV DE +L KAVA+QPVSVA++ G FQ Y SGV TG CG+ LDHG+ A
Sbjct: 229 -ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAA 287
Query: 306 VGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+GYG T +G YWL++NSWG+ WGENGY+++++++ D G CG+AME SYP +
Sbjct: 288 IGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKR-GMCGLAMEPSYPTE 340
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 163/333 (48%), Positives = 213/333 (63%), Gaps = 13/333 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
+++ + +Y+ W H + +RF FK N+ FI HN +R Y++ LN+F D
Sbjct: 38 SEEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGD 96
Query: 103 LTNEEYRAMYLGTRSDAKRR---LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
++ E+RA + G+R +RR V YA +LP SVDWR+KGAV VK+Q
Sbjct: 97 MSQAEFRATFAGSRVSDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQ 156
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
G CGSCWAFSTV +VEGIN I TG+L+SLSEQEL+DCD N GC GGLMD AF++I +N
Sbjct: 157 GKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKN 216
Query: 220 GGMDSEQDYPYLGAENKCDP---SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
GG+ +E YPY A C ++ + VV IDG++DV E +L KAVA+QPVSV I
Sbjct: 217 GGLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGI 276
Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKL 335
+A G+AF Y GVFTGECG+ LDHGV VGYG E+G YW V+NSWG WGE GY+++
Sbjct: 277 DASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYIRV 336
Query: 336 QRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
+++ G CGIAMEASY VK +KPKP
Sbjct: 337 EKD-SGAEGGLCGIAMEASYAVK---TDSKPKP 365
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 165/357 (46%), Positives = 224/357 (62%), Gaps = 23/357 (6%)
Query: 6 MFLAISTLV-FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
+FL +S + F I+ S D + + HD W+AKHG+
Sbjct: 8 IFLIVSLISSFCLSITLSRPLDDNELIMQKRHDE----------------WMAKHGRVYA 51
Query: 65 GMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
M R+ +FK N+ I+ N++ RT+K+ +N+FADLTN+E+R+MY G + +
Sbjct: 52 DMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLS 111
Query: 123 LMK-SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
+K +S RY + LP SVDWR+KGAV P+K+QG+CG CWAFS VAA+EG KI
Sbjct: 112 SQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIK 171
Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
G+LISLSEQ+LVDCD + GC+GGLMD AF+ I+ GG+ +E +YPY G + C
Sbjct: 172 KGKLISLSEQQLVDCDTN-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKN 230
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
SI GYEDV DE +L KAVA QPVS+ IE GG FQ Y SGVFTGEC + LDH
Sbjct: 231 TKPTATSITGYEDVPVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDH 290
Query: 302 GVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V AVGYG + NG YW+++NSWG+ WGE+GY+++++++ D G CG+AM+ASYP
Sbjct: 291 AVTAVGYGQSSNGSKYWIIKNSWGTKWGESGYMRIKKDVKD-KKGLCGLAMKASYPT 346
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 161/348 (46%), Positives = 226/348 (64%), Gaps = 21/348 (6%)
Query: 15 FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
LF I S ++++ DH++ ++ ++ W+ ++G+ +RF+
Sbjct: 7 LLFAILSCLCLCSAVLAAREQSDHAA-------MVARHERWMEQYGRVYKDATEKARRFE 59
Query: 75 IFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV---ASQ 131
IFK N+ FI+ N+ N + +G+N+FADLTN E+RA + + + S V +
Sbjct: 60 IFKANVAFIESFNAGNHKFWLGVNQFADLTNYEFRA------TKTNKGFIPSTVRVPTTF 113
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
RY + D LP +VDWR KGAV P+KDQG CG CWAFS VAA+EGI K+ TG+LISLSEQ
Sbjct: 114 RYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 173
Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
ELVDCD + GC GGLMD AF+FII+NGG+ +E YPY A+ KC+ +A +I
Sbjct: 174 ELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSA--ATIK 231
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
GYEDV +E +L KAVA+QPVSVA++ G FQ Y GV TG CG+ LDHG+VA+GYG
Sbjct: 232 GYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291
Query: 311 E-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ +G YWL++NSWG+ WGENG++++++++ D G CG+AME SYP
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKR-GMCGLAMEPSYPT 338
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 165/348 (47%), Positives = 224/348 (64%), Gaps = 24/348 (6%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
L+F + +S AA S+ + +S T D+ W+A++G+ +R
Sbjct: 14 LLFTIGVLASLAAARSL-------NEASMTETHDQ-------WMARYGRVYKTANEKNRR 59
Query: 73 FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
IF++NL++I N N + YK+G+N+FADLTNEE+ +R+ K + +
Sbjct: 60 STIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTT----SRNKFKSHVCATVTNVF 115
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
RY + +P ++DWR+KGAV P+K+QG CG CWAFS VAA+EGI ++ TG+LISLSEQ
Sbjct: 116 RY--ENVTAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQ 173
Query: 192 ELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
ELVDCD + GC GGLMDYAF FI QN G+ +E +YPY G + C+ ++ +I
Sbjct: 174 ELVDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATIT 233
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
G+EDV E +L KAVA+QP+SVAI+A G FQ Y SGVFTGECG+ LDHGV AVGYGT
Sbjct: 234 GHEDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGT 293
Query: 311 -ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+G YWLV+NSWG+ WGE GY+++QR + G CGIAM+ASYP
Sbjct: 294 AADGTKYWLVKNSWGTSWGEEGYIQMQRGVAAAE-GLCGIAMQASYPT 340
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 174/369 (47%), Positives = 227/369 (61%), Gaps = 19/369 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LV L F+SS++ I +D +D+ + +Y+ W H + G +R
Sbjct: 10 LVALVFVSSAAVELCRAIDFDER-----DLASDEALWDLYERWQTHH-RVHRHHGEKGRR 63
Query: 73 FQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV--- 128
F FK+N+RFI HN +R Y++ LN+F D+ EE+R+ + +R + RR
Sbjct: 64 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAARAG 123
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
A + + + P SVDWR++GAV VK QG CGSCWAFSTV AVEGIN I TG L SL
Sbjct: 124 AVPGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSLASL 183
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD---PSRRNAK 245
SEQEL+DCD N GC GGLM+ AF+FI GG+ +E YPY + CD R
Sbjct: 184 SEQELIDCDTDEN-GCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGV 242
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VV IDG++ V E +L KAVA QPVSVA++AGG+AFQ Y GVFTG+CG+ LDHGV A
Sbjct: 243 VVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAA 302
Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
VGYG ++G YW+V+NSWG+ WGE GY+++QR N G CGIAMEAS+P+K S N A
Sbjct: 303 VGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGA--GNGGLCGIAMEASFPIKTSPNPA 360
Query: 365 KP--KPHSS 371
P KP +
Sbjct: 361 DPPRKPRRA 369
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 155/309 (50%), Positives = 213/309 (68%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++ K E+RF+IFK+N+ +I+ +N+ N+ Y +G+N+FADLTNEE+
Sbjct: 39 HEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEF-- 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R+ K + S + + + +P +VDWR+KGAV P+KDQG CG CWAFS
Sbjct: 97 --IAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI+ + G+LISLSEQE+VDCD K + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + KC+ V +I GYEDV +E +L+KAVA+QPVSVAI+A G FQ Y+SG
Sbjct: 215 YKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG + +G +YWLV+NSWG++WGE GY+++QR + G G
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRG-VKAEEGLXG 333
Query: 349 IAMEASYPV 357
IAM ASYP
Sbjct: 334 IAMMASYPT 342
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 208/309 (67%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRA 110
++ W+ ++G+ RFQIF DN++FI+E N R +YK+ +N+FAD TNEE++A
Sbjct: 57 HEQWMIQYGRVYKDEAEKSVRFQIFMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEEFQA 116
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G + R S+ RY + +P S+DWR+KGAV PVKDQG CGSCWAFST
Sbjct: 117 SRNGYKMAVSSR--PSQTTLFRY--ENVTAVPSSMDWRKKGAVTPVKDQGQCGSCWAFST 172
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
+AA EGI K+ TG+LISLSEQELVDCD+ + GC GG M+ F+FI++N G+ E YP
Sbjct: 173 IAATEGITKLKTGKLISLSEQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKGIALEASYP 232
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y A+ C+ ++ I GYE V E +L KAVA+QPVSV+I+A G AFQ Y SG
Sbjct: 233 YTAADGTCNSKEEASRAAKISGYEKVPANSETALLKAVANQPVSVSIDASGVAFQFYSSG 292
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTGECG+ LDHGV AVGYG T +G YWLV+NSWG+ WG++GY+ +QR + G CG
Sbjct: 293 VFTGECGTDLDHGVTAVGYGKTSDGTKYWLVKNSWGASWGDSGYIMMQRGVA-AKGGLCG 351
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 352 IAMDASYPT 360
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 155/308 (50%), Positives = 210/308 (68%), Gaps = 6/308 (1%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+ K+GK EKRF IF++N+ FI+ N+ N+ YK+ +N AD TNEE+ A
Sbjct: 38 HEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNEEFMA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ G + + L + +Y + ++P +VDWR+KG +KDQG CG CWAFS
Sbjct: 98 SHKGYKGSHWQGLRITTQTPFKY--ENVTDIPWAVDWRQKGDATSIKDQGQCGICWAFSA 155
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VAA EGI +I TG L+SLSEQELVDCD ++ GC+GGLM++ F+FII+NGG+ SE +YPY
Sbjct: 156 VAATEGIYQITTGNLVSLSEQELVDCD-SVDHGCDGGLMEHGFEFIIKNGGISSEANYPY 214
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
CD ++ + I GYE V E L+KAVA+QPVSV+I+AGG AFQ Y SGV
Sbjct: 215 TAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQPVSVSIDAGGSAFQFYSSGV 274
Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG+CG+ LDHGV AVGYG T++G+ YW+V+NSWG+ WGE GY+++ R +D G CGI
Sbjct: 275 FTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGEEGYIRMLRG-IDAQEGLCGI 333
Query: 350 AMEASYPV 357
AM+ASYP
Sbjct: 334 AMDASYPT 341
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 166/339 (48%), Positives = 220/339 (64%), Gaps = 10/339 (2%)
Query: 34 NNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY 93
N HD S + + +Y+ W + H T + + RF +FK N+ + N L++ Y
Sbjct: 26 NEHDLDS----EKSLWDLYERWRSHHTVTRS-LDEKHNRFNVFKANVMHVHNTNKLDKPY 80
Query: 94 KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV 153
K+ LNKFAD+TN E+R +Y ++ R + + + +P S+DWR+KGAV
Sbjct: 81 KLKLNKFADMTNYEFRRIYADSKVSHHRMFRGMSNENGTFMYENVKNVPSSIDWRKKGAV 140
Query: 154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
VKDQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD N GCNGGLM+YAF
Sbjct: 141 TDVKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAF 200
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
+FI QN G+ +E +YPY + CD + + VSIDGYE+V +E +L KA A QPVS
Sbjct: 201 EFIKQN-GITTESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVS 259
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGY 332
VAI+AGG FQ Y GVF+G CG+ L+HGV VGYG T++ YW+V+NSWGS+WGE GY
Sbjct: 260 VAIDAGGYNFQFYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGY 319
Query: 333 VKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
+++QR + G CGIAMEASYP+K S S P S+
Sbjct: 320 IRMQRG-ISHKEGLCGIAMEASYPIKKS--STNPTESST 355
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 158/318 (49%), Positives = 219/318 (68%), Gaps = 11/318 (3%)
Query: 44 TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
+DD M ++ W+A++G+ +RF++FK N+ FI+ N+ N + +G+N+FAD
Sbjct: 28 SDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVNQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
LTN+E+R M +++ ++V + RY D LP +VDWR KGAV P+KDQG
Sbjct: 88 LTNDEFRWM----KTNKGFIPSTTRVPTGFRYENVNIDALPATVDWRTKGAVTPIKDQGQ 143
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA+EGI K+ TG+LISLSEQELVDCD + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 203
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E +YPY A++KC + V SI GYEDV +E +L KAVA+QPVSVA++ G
Sbjct: 204 GLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 261
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y+ GV TG CG+ LDHG+VA+GYG +G YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDI 321
Query: 340 LDTNTGKCGIAMEASYPV 357
D G CG+AME SYP
Sbjct: 322 SDKR-GMCGLAMEPSYPT 338
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 167/334 (50%), Positives = 215/334 (64%), Gaps = 14/334 (4%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFA 101
+D+ + +Y+ W H + G +RF FK+N+RFI HN +Y++ LN+F
Sbjct: 38 SDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFG 96
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQR---YACKAGDELPESVDWREKGAVNPVKD 158
D+ EE+R+ + +R + RR +S A+ + ++P SVDWR+ GAV VK+
Sbjct: 97 DMGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKN 156
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG CGSCWAFSTV AVEGIN I TG L+SLSEQELVDCD N GC GGLM+ AF FI
Sbjct: 157 QGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTAEN-GCQGGLMENAFDFIKS 215
Query: 219 NGGMDSEQDYPYLGAENKCD--PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
GG+ +E YPY + CD +RR VSIDG++ V E +L KAVA QPVSVAI
Sbjct: 216 YGGITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVAI 275
Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVK 334
+AGG+AFQ Y GVFTG+CG+ LDHGV VGYG +G YW+V+NSWG WGE GY++
Sbjct: 276 DAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSWGEGGYIR 335
Query: 335 LQRNLLDTNTGKCGIAMEASYPVKNSQNSA-KPK 367
+QR N G CGIAMEAS+P+K S N A KP+
Sbjct: 336 MQRGA--GNGGLCGIAMEASFPIKTSHNPARKPR 367
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 158/318 (49%), Positives = 219/318 (68%), Gaps = 11/318 (3%)
Query: 44 TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
+DD M ++ W+A++G+ +RF++FK N+ FI+ N+ N + +G+N+FAD
Sbjct: 28 SDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVNQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
LTN+E+R T+++ ++V + RY D LP +VDWR KGAV P+KDQG
Sbjct: 88 LTNDEFR----WTKTNKGFIPSTTRVPTGFRYENVNIDALPATVDWRTKGAVTPIKDQGQ 143
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA+EGI K+ TG+LISLSEQELVDCD + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 203
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E +YPY A++KC + V SI GYEDV +E +L KAVA+QPVSVA++ G
Sbjct: 204 GLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 261
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y+ GV TG CG+ LDHG+VA+GYG +G YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDI 321
Query: 340 LDTNTGKCGIAMEASYPV 357
D G CG+AME SYP
Sbjct: 322 SDKR-GMCGLAMEPSYPT 338
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 167/349 (47%), Positives = 229/349 (65%), Gaps = 22/349 (6%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMT-IYQTWLAKHGKTSNGMGHNEK 71
L+ LFF+ + A D +S+ + M ++ W+AKHGK +
Sbjct: 11 LIALFFVLAMWA------------DQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLR 58
Query: 72 RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
RFQIFK+N+ FI+ N+ N +Y +G+N+FADLTNEE+RA + G KR L S++ +
Sbjct: 59 RFQIFKNNVEFIESSNAAGNNSYMLGINRFADLTNEEFRASWNGY----KRPLDASRIVT 114
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
+ + LP S+DWR KGAV +KDQ CGSCWAFS VAA EG++K+ TG+L+SLSE
Sbjct: 115 P-FKYENVTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSE 173
Query: 191 QELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
QELVDCD K + GC GGLM+ AF+FI +NGG+ +E +Y Y G + KCD + + V I
Sbjct: 174 QELVDCDVKGEDKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKI 233
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
GY+ V E +L KAVA QPVSV+I+AG +FQ Y+SG++ G CGS L+HGV AVGYG
Sbjct: 234 TGYQVVPENSEAALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYG 293
Query: 310 T-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
T +G YW+V+NSWG +WGE GYV+++R+ + + G CGIAM+ SYP
Sbjct: 294 TSSSGSKYWIVKNSWGPEWGERGYVRMKRD-ITSRKGLCGIAMDCSYPT 341
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 160/348 (45%), Positives = 226/348 (64%), Gaps = 21/348 (6%)
Query: 15 FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
LF I S ++++ DH++ ++ ++ W+ ++G+ +RF+
Sbjct: 7 LLFAILSCLCLCSAVLAAREQSDHAA-------MVARHERWMEQYGRVYKDATEKARRFE 59
Query: 75 IFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV---ASQ 131
IFK N+ FI+ N+ N + +G+N+FADLTN E+RA + + + S V +
Sbjct: 60 IFKANVAFIESFNAGNHKFWLGVNQFADLTNYEFRA------TKTNKGFIPSTVRVPTTF 113
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
RY + D LP +VDWR KGAV P+KDQG CG CWAFS VAA+EGI K+ TG+LISLSEQ
Sbjct: 114 RYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 173
Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
ELVDCD + GC GGLMD AF+FII+NGG+ +E YPY A+ KC+ +A +I
Sbjct: 174 ELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSA--ATIK 231
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
GYE+V +E +L KAVA+QPVSVA++ G FQ Y GV TG CG+ LDHG+VA+GYG
Sbjct: 232 GYEEVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291
Query: 311 E-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ +G YWL++NSWG+ WGENG++++++++ D G CG+AME SYP
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKR-GMCGLAMEPSYPT 338
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 159/308 (51%), Positives = 208/308 (67%), Gaps = 9/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A HGK E+++Q FK+N++ I+ N N+ YK+G+N FADLTNEE++A
Sbjct: 40 HEQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNEEFKA 99
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R ++ + RY + +P ++DWR++GAV P+KDQG CG CWAFS
Sbjct: 100 I---NRFKGHVCSKITRTPTFRY--ENMTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FI+QN G+ +E YP
Sbjct: 155 VAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ SI GYEDV E +L KAVA+QPVSVAIEA G FQ Y G
Sbjct: 215 YEGVDGTCNAKAEGNHATSIKGYEDVPANSESALLKAVANQPVSVAIEASGFEFQFYSGG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV AVGYG +++G YWLV+NSWG WG+ GY+++QR++ G CG
Sbjct: 275 VFTGSCGTNLDHGVTAVGYGVSDDGTKYWLVKNSWGVKWGDKGYIRMQRDVA-AKEGLCG 333
Query: 349 IAMEASYP 356
IAM ASYP
Sbjct: 334 IAMLASYP 341
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 166/316 (52%), Positives = 223/316 (70%), Gaps = 5/316 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
T++ + +Y+ W KH S + KRF +FK+N+ + N +++ YK+ LNKFAD+
Sbjct: 33 TEESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQMDKPYKLKLNKFADM 91
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
+N E+ Y + R+L + + + + + +LP SVDWRE+GAVN VK+QG CG
Sbjct: 92 SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDWRERGAVNAVKEQGRCG 151
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
SCWAFS+VAAVEGINKI T +L+SLSEQEL+DC+ + N GCNGG M+ AF FI +NGG+
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKRNGGIA 210
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
+E YPY G+ C SR ++ +V IDGYE V P +E +L +AVA+QPVSVAI+A GR F
Sbjct: 211 TENSYPYHGSRGLCRSSRISSPIVKIDGYESV-PENEDALMQAVANQPVSVAIDAAGRDF 269
Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Q Y GVF G CG+ L+HGVVA+GYG TE+G DYWLVRNSWG WGE+GYV+++R ++
Sbjct: 270 QFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRG-VEQ 328
Query: 343 NTGKCGIAMEASYPVK 358
G CGIAMEASYP+K
Sbjct: 329 AEGLCGIAMEASYPIK 344
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 160/348 (45%), Positives = 225/348 (64%), Gaps = 21/348 (6%)
Query: 15 FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
LF I S ++++ DH++ ++ ++ W+ ++G+ +RF+
Sbjct: 7 LLFAILSCLCLCSAVLAAREQSDHAA-------MVARHERWMEQYGRVYKDATEKARRFE 59
Query: 75 IFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV---ASQ 131
IFK N+ FI+ N+ N + + +N+FADLTN E+RA + + + S V +
Sbjct: 60 IFKANVAFIESFNAGNHKFWLSVNQFADLTNYEFRA------TKTNKGFIPSTVRVPTTF 113
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
RY + D LP +VDWR KGAV P+KDQG CG CWAFS VAA+EGI K+ TG+LISLSEQ
Sbjct: 114 RYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 173
Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
ELVDCD + GC GGLMD AF+FII+NGG+ +E YPY A+ KC+ +A +I
Sbjct: 174 ELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSA--ATIK 231
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
GYEDV +E +L KAVA+QPVSVA++ G FQ Y GV TG CG+ LDHG+VA+GYG
Sbjct: 232 GYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291
Query: 311 E-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ +G YWL++NSWG+ WGENG++++++++ D G CG+AME SYP
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKR-GMCGLAMEPSYPT 338
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 158/299 (52%), Positives = 210/299 (70%), Gaps = 14/299 (4%)
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA--KRRLMKSK 127
E+RF ++ DNLRF+ E+N+ + ++ + + +ADL+ +EYR+ LG +D +R L +
Sbjct: 58 ERRFDVWLDNLRFVHEYNAGHTSHWLSMGVYADLSQDEYRSKALGYNADLHEERPLRAAP 117
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ G P+ VDW KGAV PVK+Q CGSCWAFST AVEG + I TG+L S
Sbjct: 118 FLYE------GTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLAS 171
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQ LVDCDR+ + GC+GGLMD+AF+FI++NGG+D+E DYPY E C ++ VV
Sbjct: 172 LSEQMLVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVV 231
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
+ID Y+DV P DE +L KAVA+QPVSVAIEA RAFQ Y GVF ECG+ALDHGV+ VG
Sbjct: 232 TIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVG 291
Query: 308 YGT-ENG---VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
YGT NG + YWLV+NSWG++WG+ GY++L RNL G+CG+AM+AS+P+K N
Sbjct: 292 YGTASNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNL--GEEGQCGVAMQASFPIKKGAN 348
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 160/308 (51%), Positives = 208/308 (67%), Gaps = 9/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
++ W+A HGK E+++QIF +N++ I+ N+ + YK+G+N FADLTNEE++A
Sbjct: 38 HEQWMATHGKVYKHSYEKEQKYQIFMENVQRIEAFNNAGXKPYKLGINHFADLTNEEFKA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ R +++ + RY + +P S+DWR+KGAV P+KDQG CG CWAFS
Sbjct: 98 I---NRFKGHVCSKRTRTTTFRY--ENVTAVPASLDWRQKGAVTPIKDQGQCGCCWAFSA 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FI+QN G+ +E YP
Sbjct: 153 VAATEGITKLRTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLATEAIYP 212
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ SI GYEDV E +L KAVA+QPVSVAIEA G FQ Y G
Sbjct: 213 YEGFDGTCNAKADGNHAGSIKGYEDVPANSESALLKAVANQPVSVAIEASGFKFQFYSGG 272
Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG CG+ LDHGV +VGYG ++G YWLV+NSWG WGE GY+++QR++ G CG
Sbjct: 273 VFTGSCGTNLDHGVTSVGYGVGDDGTKYWLVKNSWGVKWGEKGYIRMQRDVA-AKEGLCG 331
Query: 349 IAMEASYP 356
IAM ASYP
Sbjct: 332 IAMLASYP 339
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 156/318 (49%), Positives = 219/318 (68%), Gaps = 11/318 (3%)
Query: 44 TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
+DD M ++ W+A++G+ +RF++FK N+ FI+ N+ N + +G+N+FAD
Sbjct: 28 SDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVNQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
LTN+E+R+ T+++ ++V + RY D LP ++DWR KG V P+KDQG
Sbjct: 88 LTNDEFRS----TKTNKGFIPSTTRVPTGFRYENVNIDALPATMDWRTKGVVTPIKDQGQ 143
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA+EGI K+ TG+LISLSEQELVDCD + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 203
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E +YPY A++KC + V SI GYEDV +E +L KAVA+QPVSVA++ G
Sbjct: 204 GLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 261
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y+ GV TG CG+ LDHG+VA+GYG +G YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDI 321
Query: 340 LDTNTGKCGIAMEASYPV 357
D G CG+AME SYP
Sbjct: 322 SDKR-GMCGLAMEPSYPT 338
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 317 bits (811), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 159/319 (49%), Positives = 216/319 (67%), Gaps = 12/319 (3%)
Query: 45 DDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
DD M ++ W+A++ + +RF++FK N++FI+ N+ NR + +G+N+FAD
Sbjct: 29 DDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKFIESFNAGGNRKFWLGVNQFAD 88
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
LTN+E+RA T+++ + KV + RY + D LP S+DWR KGAV P+KDQG
Sbjct: 89 LTNDEFRA----TKTNKGFKPSPVKVPTGFRYENVSVDALPASIDWRTKGAVTPIKDQGQ 144
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA EGI KI T +LISLSEQELVDCD + GC GGLMD AF+FII+NG
Sbjct: 145 CGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 204
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E YPY + KC +A +I G+EDV DE +L KAVA+QPVSVA++ G
Sbjct: 205 GLTTESSYPYTATDGKCKSGTNSA--ANIKGFEDVPANDEAALMKAVANQPVSVAVDGGD 262
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y GV TG CG+ LDHG+ A+GYG T +G YWL++NSWG+ WGENGY+++++++
Sbjct: 263 MTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDI 322
Query: 340 LDTNTGKCGIAMEASYPVK 358
D G CG+AME SYP +
Sbjct: 323 SDKR-GMCGLAMEPSYPTE 340
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 164/331 (49%), Positives = 211/331 (63%), Gaps = 13/331 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
+++ + +Y+ W + H + +RF FK N FI HN + Y++ LN+F D
Sbjct: 38 SEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGD 96
Query: 103 LTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
+ E+RA ++G R D + V YA +LP SVDWR+KGAV VKDQG
Sbjct: 97 MDQAEFRATFVGDLRRDTPSK--PPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD N GC GGLMD AF++I NGG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 222 MDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
+ +E YPY A C+ +R + VV IDG++DV E L +AVA+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
G+AF Y GVFTGECG+ LDHGV VGYG E+G YW V+NSWG WGE GY+++++
Sbjct: 275 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334
Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
+ + G CGIAMEASYPVK +KPKP
Sbjct: 335 D-SGASGGLCGIAMEASYPVK---TYSKPKP 361
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 215/316 (68%), Gaps = 10/316 (3%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
+D + +++ W+ +HGK +KRF IFK+N+ +I+ N++ N++YK+GLN FADL
Sbjct: 32 NDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIEAFNNVGNKSYKLGLNHFADL 91
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TN E+ + R+ L S + + +Y K ++P +VDWR++GAV PVK+QG CG
Sbjct: 92 TNHEF----IAARNKFNGYLHGSIITTFKY--KNVSDVPSAVDWRQEGAVTPVKNQGQCG 145
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VA+ EGI+K+ TG L+SLSEQELVDCD + GC GGLMD AF+FIIQN G+
Sbjct: 146 CCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCEGGLMDDAFEFIIQNNGL 205
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E +YPY G + C+ + + +I GYE+V DE +L+KAVA+QPVSVAI+A G
Sbjct: 206 STEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVNDEQALQKAVANQPVSVAIDASGSD 265
Query: 283 FQHYESGVFTGECGSALDH-GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y+SGVFTG CG+ LDH V E+ +YWLV+NSWG+ WGE GY+++QR +D
Sbjct: 266 FQFYKSGVFTGSCGTELDHGVAVVGYGVGEDETEYWLVKNSWGTQWGEEGYIRMQRG-VD 324
Query: 342 TNTGKCGIAMEASYPV 357
+ G CGIAM+ SYP
Sbjct: 325 ASEGLCGIAMQPSYPT 340
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 152/311 (48%), Positives = 206/311 (66%), Gaps = 11/311 (3%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNE 106
V +++ W +HGK+ + R +F DN F+ HN+L N +Y + LN +ADLT+
Sbjct: 25 VSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHH 84
Query: 107 EYRAMYLGTRSDAK--RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
E++ LG + R ++ + + R ++P+S+DWR+KGAV VKDQGSCG+
Sbjct: 85 EFKVSRLGFSPALRNFRPVLPQEPSLPR-------DVPDSLDWRKKGAVTAVKDQGSCGA 137
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CW+FS A+EGIN+I+TG LISLSEQEL+DCDR N+GC GGLMDYA+QF+I N G+D+
Sbjct: 138 CWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDT 197
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E DYPY + C + VV+IDGY D+ DE L +AVA QPVSV I RAFQ
Sbjct: 198 ENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQ 257
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y G+F+G C ++LDH V+ VGYG+ENGVDYW+V+NSWG WG +GY+ +QRN ++
Sbjct: 258 LYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSE- 316
Query: 345 GKCGIAMEASY 355
G CGI ASY
Sbjct: 317 GVCGINKLASY 327
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 160/299 (53%), Positives = 203/299 (67%), Gaps = 11/299 (3%)
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA-KRRLMKSKV 128
E+RF I+ DNLRF E+N+ + ++ + + +ADL+ +EYR+ LG + K+R +++
Sbjct: 69 ERRFNIWLDNLRFAHEYNARHTSHWLSMGVYADLSQDEYRSKALGYNAHLHKKRPLRAAP 128
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
+ G PE VDW GAV PVKDQ CGSCWAFST AVEG N I TG+L+SL
Sbjct: 129 FLYK-----GTVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSL 183
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQ LVDCDR+ + GC GG MD AF FI+ NGG+D+E DYPY + C +R VV+
Sbjct: 184 SEQMLVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQDNRTRRHVVT 243
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
IDGY+DV P DE +L KAVA QPVSVAIEA AFQ Y GVF ECG+ALDH V+ VGY
Sbjct: 244 IDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVGY 303
Query: 309 GT-ENG---VDYWLVRNSWGSDWGENGYVKLQRNL-LDTNTGKCGIAMEASYPVKNSQN 362
GT NG + YWLV+NSWG++WGE GY++L RNL D G+CG+AM AS+P+K N
Sbjct: 304 GTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPIKKGAN 362
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 162/326 (49%), Positives = 210/326 (64%), Gaps = 20/326 (6%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+++ + +Y+ W +H + + +G +RF +FKDN+R I E N + YK+ LN+F D+
Sbjct: 40 SEEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLRLNRFGDM 98
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
T +E Y +R R +QR GAV VKDQG CG
Sbjct: 99 TADESAGAYASSRVSHHRMFRGRGEKAQRL----------------HGAVGAVKDQGQCG 142
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
SCWAFST+AAVEGIN I T L +LSEQ+LVDCD K NAGC+GGLMD AFQ+I ++GG+
Sbjct: 143 SCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGV 202
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+ YPY ++ C S ++ V+IDGYEDV E +LKKAVA+QPVSVAIEAGG
Sbjct: 203 AASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 262
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y GVF G+CG+ LDHGV AVGYGT +G YW+VRNSWG+DWGE GY++++R+ +
Sbjct: 263 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRD-VS 321
Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPK 367
G CGIAMEASYP+K S N A K
Sbjct: 322 AKEGLCGIAMEASYPIKTSPNPAPKK 347
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 153/309 (49%), Positives = 212/309 (68%), Gaps = 10/309 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++TW+A++G+ ++F++FK N RFID N+ N + +G+N+FADLTNEE++A
Sbjct: 37 HETWMAQYGRVYKDAAEKAQKFEVFKANARFIDSFNAENHKFWLGINQFADLTNEEFKA- 95
Query: 112 YLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
T+++ K++V++ +Y + LP S+DWR KGAV PVKDQG CG CWAFS
Sbjct: 96 ---TKTNKGFISNKARVSTGFKYENLKIEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSA 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA EGI K+ TG+L+SLSEQELVDCD + GC GGLMD AF+FII NGG+ E YP
Sbjct: 153 VAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGGLTQESSYP 212
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + KC ++A +I YEDV +E +L KAVA+QPVSVA++ G FQ Y G
Sbjct: 213 YDAEDGKCKSGSKSAG--TIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGG 270
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
V TG CG+ LDHG+ A+GYG T +G +WL++NSWG+ WGENG++++++++ D G CG
Sbjct: 271 VMTGSCGTDLDHGIAAIGYGVTSDGTKFWLMKNSWGTTWGENGFLRMEKDIAD-KKGMCG 329
Query: 349 IAMEASYPV 357
+AME SYP
Sbjct: 330 LAMEPSYPT 338
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 158/322 (49%), Positives = 214/322 (66%), Gaps = 16/322 (4%)
Query: 45 DDEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
DDE+ + ++ W+ +HG+ RF +FK N++FI+ N+ NR + +G+N
Sbjct: 32 DDELAMVARHEQWMVQHGRVYKDETDKAHRFLVFKANVKFIESFNAAAAAGNRKFWLGVN 91
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVK 157
+FADLTN+E+RA T+++ KV + RY + D LP++VDWR KGAV P+K
Sbjct: 92 QFADLTNDEFRA----TKTNKGFNPNVVKVPTGFRYQNLSIDALPQTVDWRTKGAVTPIK 147
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFI 216
DQG CG CWAFS VAA EGI KI TG+L SLSEQELVDCD + GCNGG MD AF+FI
Sbjct: 148 DQGQCGCCWAFSAVAATEGIVKISTGKLTSLSEQELVDCDVHGEDQGCNGGEMDDAFKFI 207
Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
I+NGG+ +E +YPY + +C A +I GYEDV DE +L KAVA QPVSVA+
Sbjct: 208 IKNGGLTTESNYPYTAQDGQCKSGSNGA--ATIKGYEDVPANDEAALMKAVASQPVSVAV 265
Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKL 335
+ G FQ Y GV TG CG+ LDHG+ A+GYG T +G YWL++NSWG+ WGENG++++
Sbjct: 266 DGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGFLRM 325
Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
++++ D G CG+AM+ SYP
Sbjct: 326 EKDIAD-KKGMCGLAMQPSYPT 346
>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
Length = 1039
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 143/206 (69%), Positives = 171/206 (83%), Gaps = 1/206 (0%)
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD N GCNGGLMDYAF+FII NGG+
Sbjct: 713 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGI 772
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
D+E+DYPY G + +CD +R+NAKVV+ID YEDV DE SL+KAVA+QPVSVAIEA G
Sbjct: 773 DTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTT 832
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
FQ Y SG+FTG CG+ALDHGV VGYGTENG DYW+++NSWGS WGE+GYV+++RN +
Sbjct: 833 FQLYSSGIFTGSCGTALDHGVTVVGYGTENGKDYWIMKNSWGSSWGESGYVRMERN-IKA 891
Query: 343 NTGKCGIAMEASYPVKNSQNSAKPKP 368
++GKCGIA+E SYP+K N P P
Sbjct: 892 SSGKCGIAVEPSYPLKEGANPPNPGP 917
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 150/321 (46%), Positives = 213/321 (66%), Gaps = 8/321 (2%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFAD 102
D ++ ++ W+A+HG+ +RF+ F++N+ FI+ N+ R + +G+N+F D
Sbjct: 30 DAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRNNVVFIESFNAAGNRRKFWLGVNQFTD 89
Query: 103 LTNEEYRAMYLG---TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
LTN+E+RA + +A S + RY+ + D LP +VDWR KGAV P+K+Q
Sbjct: 90 LTNDEFRATKTNKGFIKRNAAAVNKASPTGTFRYSNVSADALPAAVDWRAKGAVTPIKNQ 149
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQ 218
G CG CWAFS VAA EGI ++ TG+L+ LSEQELVDCD + GC GG MD AF+FII+
Sbjct: 150 GQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAFEFIIK 209
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG+ SE +YPY + +C V +I GYEDV DE SL KAVA QPVSVA++
Sbjct: 210 NGGLTSETNYPYTAQDGQCKAKNTINSVATIKGYEDVPANDEASLMKAVAAQPVSVAVDG 269
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
G FQHY GV +G CG++LDHG+VAVGYG ++G +WL++NSWG+ WGE+GY+++++
Sbjct: 270 GDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAADDGTKFWLMKNSWGTTWGEDGYIRMEK 329
Query: 338 NLLDTNTGKCGIAMEASYPVK 358
++ D G CG+AM+ SYP +
Sbjct: 330 DVADAG-GMCGLAMQPSYPTE 349
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 159/307 (51%), Positives = 212/307 (69%), Gaps = 16/307 (5%)
Query: 56 LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLG 114
+A++G+ EKRF+IFKDN+ I+ N ++++TYK+ +N+FADLTNEE+R++
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSL--- 57
Query: 115 TRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
R K+ + S+ K + +P ++DWR+KGAV P+KDQ CG CWAFS VA
Sbjct: 58 ------RNRFKAHICSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVA 111
Query: 173 AVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
A EGI +I TG+LISLSEQELVDCD N GC+GGLMD AF+FI + G+ SE YPY
Sbjct: 112 ATEGITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI-KIHGLASEATYPYE 170
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
G + C+ + I GYEDV +E +L+KAVA QPV+VAI+AGG FQ Y SGVF
Sbjct: 171 GDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVF 230
Query: 292 TGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
TG+CG+ LDHGV AVGYG ++G+ YWLV+NSWG+ WGE GY+++QR++ G CGIA
Sbjct: 231 TGQCGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIA 289
Query: 351 MEASYPV 357
M+ASYP
Sbjct: 290 MQASYPT 296
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 156/316 (49%), Positives = 212/316 (67%), Gaps = 7/316 (2%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
D ++ ++ W+A HG+ + RFQIFK+N+ +ID HN+ +++Y + +NKFADL
Sbjct: 48 DPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLEVNKFADL 107
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TN+E+RA G + K+ S V S + +P+ VDWR++GAV PVKDQG CG
Sbjct: 108 TNDEFRASRNGYK---KQPDSDSHVVSGLFRYANVSAVPDEVDWRKEGAVTPVKDQGDCG 164
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VAA+EGINK+ G+L+SLSEQELVDCD I+ GC GGLM+ AFQFI + G+
Sbjct: 165 CCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFIEKRKGL 224
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E YPY G + C+ + I G+E V +E +L +AVA+QPVS+AI+A G
Sbjct: 225 AAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAIDASGYE 284
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y GVFTG CG+ LDH + AVGYG T +G YWL++NSWG+ WGENGY++++R+ L
Sbjct: 285 FQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRIKRDSL- 343
Query: 342 TNTGKCGIAMEASYPV 357
G CGIAM+ SYPV
Sbjct: 344 AKEGLCGIAMDPSYPV 359
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 160/312 (51%), Positives = 208/312 (66%), Gaps = 9/312 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
++ W+AKHG+ +R ++F+DN+ FI+ N+ +K L N+FADLTN E+R
Sbjct: 40 HERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFR 99
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
A G R + R S RYA + +LP SVDWR KGAVNPVKDQG CG CWAFS
Sbjct: 100 ATRTGLRPSSSRG--NRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFS 157
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
VAA+EG K+ TG+L+SLSEQ+LV CD K + GC GGLMD AF FII+NGG+ +E DY
Sbjct: 158 AVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDY 217
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY +++KC + A +I GYEDV DE +L KAVA+QPVSVAI+ G R FQ Y+
Sbjct: 218 PYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKG 277
Query: 289 GVFTGE--CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV +G C + LDH + AVGYG +G YWL++NSWG+ WGE+GYV+++R + D G
Sbjct: 278 GVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE-G 336
Query: 346 KCGIAMEASYPV 357
CG+AM ASYP
Sbjct: 337 VCGLAMMASYPT 348
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 150/321 (46%), Positives = 213/321 (66%), Gaps = 17/321 (5%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVG 96
H +S R + E W+A++G+ + ++ FQIFK+N+ FI+ N+ N+ YK+G
Sbjct: 30 HETSLREEHE------NWIARYGQVYK-VAAEKETFQIFKENVEFIESFNAAANKPYKLG 82
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N FADLT EE++ G + + + K + ++PE++DWREKGAV P+
Sbjct: 83 VNLFADLTLEEFKDFRFGLKKTHEFSITPFKYENVT-------DIPEALDWREKGAVTPI 135
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQF 215
KDQG CGSCWAFSTVAA EGI++I TG L+SL EQELV CD K ++ GC GG M+ F+F
Sbjct: 136 KDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQGCEGGYMEDGFEF 195
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
II+NGG+ ++ +YPY G C+ + + V I GYE V + E +L+KAVA+QPVSV+
Sbjct: 196 IIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYSEEALQKAVANQPVSVS 255
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
I+A F Y G++TGECG+ LDHGV AVGYGT N DYW+V+NSWG+ W E G++++
Sbjct: 256 IDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNETDYWIVKNSWGTGWDEKGFIRM 315
Query: 336 QRNLLDTNTGKCGIAMEASYP 356
QR + G CG+A+++SYP
Sbjct: 316 QRG-ITVKHGLCGVALDSSYP 335
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 158/319 (49%), Positives = 211/319 (66%), Gaps = 8/319 (2%)
Query: 45 DDEVMT--IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKF 100
DDE++ + W+A+HG+T M R+ +FK N+ I+ N++ RT+K+ +N+F
Sbjct: 29 DDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQF 88
Query: 101 ADLTNEEYRAMYLGTRSD-AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
ADLTN+E+R MY G + D ++K S RY LP +VDWR+KGAV P+K+Q
Sbjct: 89 ADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFFGALPIAVDWRKKGAVTPIKNQ 148
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
GSCG CWAFS VAA+EG +I G+LISLSEQ+LVDCD + GC+GGLMD AF+ I+
Sbjct: 149 GSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLMDTAFEHIMAT 207
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+ +E +YPY G + C SI GYEDV DE +L KAVA QPVSV IE G
Sbjct: 208 GGLTTESNYPYKGEDANCKIKSTKPSAASITGYEDVPVNDENALMKAVAHQPVSVGIEGG 267
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRN 338
G FQ Y SGVFTGEC + LDH V AVGY + G YW+++NSWG+ WGE GY++++++
Sbjct: 268 GFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIKKD 327
Query: 339 LLDTNTGKCGIAMEASYPV 357
+ D G CG+AM+ASYP
Sbjct: 328 IKDKE-GLCGLAMKASYPT 345
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 157/319 (49%), Positives = 204/319 (63%), Gaps = 14/319 (4%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR---------TYKVGLNKFA 101
+++ W A+HGK G R F DN F+ HN+ +Y + LN FA
Sbjct: 41 LFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFA 100
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQG 160
DLT+ E+RA LG + R S+ +A G +PE++DWR+ GAV VKDQG
Sbjct: 101 DLTHAEFRAARLGRLAVGGARAPPSEGG---FAGSVGVGAVPEALDWRQSGAVTKVKDQG 157
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
SCG+CW+FS A+EGINKI TG LISLSEQEL+DCDR NAGC GGLMDYA++F+I+NG
Sbjct: 158 SCGACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNG 217
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+D+E DYPY A+ C+ ++ VV+IDGY DV E SL +AVA QP+SV I
Sbjct: 218 GIDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSA 277
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
RAFQ Y G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG WG GY+ + RN
Sbjct: 278 RAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRN-T 336
Query: 341 DTNTGKCGIAMEASYPVKN 359
+++G CGI M AS+P K
Sbjct: 337 GSSSGICGINMMASFPTKT 355
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 152/308 (49%), Positives = 210/308 (68%), Gaps = 5/308 (1%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+ + GK+ EKRFQIFK+N+ FI+ N++ N+ + + +N FADLTNEE++A
Sbjct: 37 HEKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHFADLTNEEFKA 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G + + + ++ S RY +P S+DWR++GAV P+K+QGSCGSCWAFST
Sbjct: 97 SLNGNKKLHDKFDILNETTSFRYHNVT--SVPASMDWRKRGAVTPIKNQGSCGSCWAFST 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VA++EGI++I TGEL+SLSEQEL+DC R ++GC+GG ++ AF+FI + GGM SE +YPY
Sbjct: 155 VASIEGIHQITTGELVSLSEQELIDCVRGNSSGCSGGYLEDAFKFIAKKGGMASETNYPY 214
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+ KC + + V I GYE V E L KAVA+QPVSV ++AG FQ Y G+
Sbjct: 215 KETDEKCKFKKESKHVAEIKGYEKVPSNSENDLLKAVANQPVSVYVDAGDYVFQFYSGGI 274
Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG+CG+ DH V VGYG + +YWLV+NSWG+ WGE GY+KL+RN +D+ G CGI
Sbjct: 275 FTGKCGTDTDHVVTIVGYGVSLDYTEYWLVKNSWGTGWGEKGYMKLKRN-VDSKKGLCGI 333
Query: 350 AMEASYPV 357
A SYPV
Sbjct: 334 ATNPSYPV 341
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 160/312 (51%), Positives = 208/312 (66%), Gaps = 9/312 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
++ W+AKHG+ +R ++F+DN+ FI+ N+ +K L N+FADLTN E+R
Sbjct: 5 HERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFR 64
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
A G R + R S RYA + +LP SVDWR KGAVNPVKDQG CG CWAFS
Sbjct: 65 ATRTGLRPSSSRG--NRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFS 122
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
VAA+EG K+ TG+L+SLSEQ+LV CD K + GC GGLMD AF FII+NGG+ +E DY
Sbjct: 123 AVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDY 182
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY +++KC + A +I GYEDV DE +L KAVA+QPVSVAI+ G R FQ Y+
Sbjct: 183 PYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKG 242
Query: 289 GVFTGE--CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV +G C + LDH + AVGYG +G YWL++NSWG+ WGE+GYV+++R + D G
Sbjct: 243 GVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE-G 301
Query: 346 KCGIAMEASYPV 357
CG+AM ASYP
Sbjct: 302 VCGLAMMASYPT 313
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 177/360 (49%), Positives = 219/360 (60%), Gaps = 32/360 (8%)
Query: 25 ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG-MGHNEKRFQIFKDNLRFI 83
D SI+ Y + D SS + + +++ WL++H K + + +RF++FKDNL I
Sbjct: 26 GDFSIVGY-SEEDLSS----HESLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHI 80
Query: 84 DEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR------------------LMK 125
DE N +Y +GLN+FADLT++E++A YLG
Sbjct: 81 DETNRKVSSYWLGLNEFADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSS 140
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
S RY LP+SVDWR KGAV VK+QG CGSCWAFSTVAAVEGIN+IVTG L
Sbjct: 141 SSSFRFRYEGVDAARLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNL 200
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
+LSEQELVDCD N GCNGGLMDYAF +I NGG+ +E+ YPYL E C +A
Sbjct: 201 TALSEQELVDCDTDGNNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRG-SSAA 259
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VV+I GYEDV +E +L KA+A QPVSVAIEA GR Q Y GVF G CG+ LDHGV A
Sbjct: 260 VVTISGYEDVPRNNEQALLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAA 319
Query: 306 VGYGT---ENG---VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
VGYGT +NG DY +V+NSWG WGE GY++++R G CGI SYP KN
Sbjct: 320 VGYGTAGKDNGHVVADYIIVKNSWGPSWGEKGYIRMRRG-TGKRQGLCGINKMPSYPTKN 378
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 160/312 (51%), Positives = 208/312 (66%), Gaps = 9/312 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
++ W+AKHG+ +R ++F+DN+ FI+ N+ +K L N+FADLTN E+R
Sbjct: 5 HERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFR 64
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
A G R + R S RYA + +LP SVDWR KGAVNPVKDQG CG CWAFS
Sbjct: 65 ATRTGLRPSSSRG--NRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFS 122
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
VAA+EG K+ TG+L+SLSEQ+LV CD K + GC GGLMD AF FII+NGG+ +E DY
Sbjct: 123 AVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDY 182
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY +++KC + A +I GYEDV DE +L KAVA+QPVSVAI+ G R FQ Y+
Sbjct: 183 PYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKG 242
Query: 289 GVFTGE--CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV +G C + LDH + AVGYG +G YWL++NSWG+ WGE+GYV+++R + D G
Sbjct: 243 GVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE-G 301
Query: 346 KCGIAMEASYPV 357
CG+AM ASYP
Sbjct: 302 VCGLAMMASYPT 313
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 162/331 (48%), Positives = 209/331 (63%), Gaps = 13/331 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
+++ + +Y+ W + H + +RF FK N FI HN + Y++ LN+F D
Sbjct: 38 SEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGD 96
Query: 103 LTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
+ E+RA ++G R D + V YA +LP SVDWR+KGAV VKDQG
Sbjct: 97 MDQAEFRATFVGDLRRDTPAK--PPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD N GC GGLMD AF++I NGG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 222 MDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
+ +E YPY A C+ +R + VV IDG++DV E L +AVA+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
G+AF Y GVFTG+CG+ LDHGV VGYG E+G YW V+NSWG WGE GY+++++
Sbjct: 275 SGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334
Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
+ + G CGIAMEASYPVK KP P
Sbjct: 335 D-SGASGGLCGIAMEASYPVKTYN---KPMP 361
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 165/316 (52%), Positives = 222/316 (70%), Gaps = 5/316 (1%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
T++ + +Y+ W KH S + KRF +FK+N+ + N +++ YK+ LNKFAD+
Sbjct: 33 TEESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQMDKPYKLKLNKFADM 91
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
+N E+ Y + R+L + + + + + +LP SVD RE+GAVN VK+QG CG
Sbjct: 92 SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDGRERGAVNAVKEQGRCG 151
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
SCWAFS+VAAVEGINKI T +L+SLSEQEL+DC+ + N GCNGG M+ AF FI +NGG+
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKRNGGIA 210
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
+E YPY G+ C SR ++ +V IDGYE V P +E +L +AVA+QPVSVAI+A GR F
Sbjct: 211 TENSYPYHGSRGLCRSSRISSPIVKIDGYESV-PENEDALMQAVANQPVSVAIDAAGRDF 269
Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Q Y GVF G CG+ L+HGVVA+GYG TE+G DYWLVRNSWG WGE+GYV+++R ++
Sbjct: 270 QFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRG-VEQ 328
Query: 343 NTGKCGIAMEASYPVK 358
G CGIAMEASYP+K
Sbjct: 329 AEGLCGIAMEASYPIK 344
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 151/306 (49%), Positives = 204/306 (66%), Gaps = 6/306 (1%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMY 112
W+ +HG+ R+ +FK N+ I+ N + T+K+ +N+FADLTNEE+R+MY
Sbjct: 41 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 100
Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
G + ++ ++K S RY + D LP SVDWR+KGAV P+KDQG CGSCWAFS VA
Sbjct: 101 TGFKGNSVLS-SRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVA 159
Query: 173 AVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
A+EG+ +I G+LISLSEQELVDCD + GC GGLMD AF + I GG+ SE +YPY
Sbjct: 160 AIEGVAQIKKGKLISLSEQELVDCDTN-DGGCMGGLMDTAFNYTITIGGLTSESNYPYKS 218
Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
C+ ++ SI G+EDV DE +L KAVA PVS+ I G FQ Y SGVF+
Sbjct: 219 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFS 278
Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
GEC + LDHGV AVGYG ++NG+ YW+++NSWG WGE GY++++++ + G+CG+AM
Sbjct: 279 GECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKD-IKPKHGQCGLAM 337
Query: 352 EASYPV 357
ASYP
Sbjct: 338 NASYPT 343
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 160/337 (47%), Positives = 216/337 (64%), Gaps = 18/337 (5%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRT--------YK 94
+++ + +Y W + H +RF FK N+ FI HN+ LN T Y+
Sbjct: 34 SEEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYR 93
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ LN+F D+ E+R+ + G R + + + ++P++VDWR+KGAV
Sbjct: 94 LRLNRFGDMDQAEFRSTFAGPL----HRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVT 149
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAF 213
VKDQG CGSCWAFS VA+VEG+N I TG L+SLSEQEL+DCD + GC GGLM+ AF
Sbjct: 150 GVKDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAF 209
Query: 214 QFIIQN-GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPV 272
+FI + GG+ +E YPY + C+ +R ++ V IDG++ V +E +L KAVA QPV
Sbjct: 210 EFIAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPV 269
Query: 273 SVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGEN 330
SVAI+AGG+AFQ Y GVFTG+CGS LDHGV VGYG E+G +YW+V+NSWG WGE+
Sbjct: 270 SVAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWGEH 329
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPK 367
GYV++QR+ + G CGIAMEASYPVKN Q KP+
Sbjct: 330 GYVRMQRD-SGVDGGLCGIAMEASYPVKNEQTKKKPR 365
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 310 bits (795), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 204/322 (63%), Gaps = 17/322 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--------------TYKVGL 97
+ W A+HGK R +F DN F+ HN+ +Y + L
Sbjct: 36 FDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLAL 95
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
N FADLT+EE+RA LG A ++S+ A + G +P+++DWR+ GAV VK
Sbjct: 96 NAFADLTHEEFRAARLGRI--APGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKVK 153
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQGSCG+CW+FS A+EGINKI TG L+SLSEQEL+DCDR N+GC GGLMDYA++F+I
Sbjct: 154 DQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVI 213
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+NGG+D+E+DYPY A+ C+ ++ +VV+IDGY DV E L +AVA QPVSV I
Sbjct: 214 KNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGIC 273
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
RAFQ Y G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG WG GY+ + R
Sbjct: 274 GSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHR 333
Query: 338 NLLDTNTGKCGIAMEASYPVKN 359
N D+ G CGI M AS+P K
Sbjct: 334 NTGDSK-GVCGINMMASFPTKT 354
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 158/274 (57%), Positives = 191/274 (69%), Gaps = 13/274 (4%)
Query: 103 LTNEEYRAMYLGTRSDAKRRLMK-----SKVASQRYACKAGDELPESVDWREKGAVNPVK 157
+T +E+R Y G+R A R+ + S ++ + ++P SVDWR+KGAV VK
Sbjct: 1 MTADEFRRHYAGSRV-AHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVK 59
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFST+AAVEGIN I T L SLSEQ+LVDCD K NAGCNGGLMDYAFQ+I
Sbjct: 60 DQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIA 119
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
++GG+ +E YPY + C S A VV+IDGYEDV DE +LKKAVA QPVSVAIE
Sbjct: 120 KHGGVAAEDAYPYRARQASCKKS--PAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIE 177
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQ 336
A G FQ Y GVF+G CG+ LDHGV AVGYG T +G YWLV+NSWG +WGE GY+++
Sbjct: 178 ASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMA 237
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
R++ G CGIAMEASYPVK S N PK H+
Sbjct: 238 RDVA-AKEGHCGIAMEASYPVKTSPN---PKVHA 267
>gi|255626679|gb|ACU13684.1| unknown [Glycine max]
Length = 229
Score = 310 bits (794), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 148/234 (63%), Positives = 174/234 (74%), Gaps = 12/234 (5%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
L STL+FL F S + +I +Y TD+EVMT+Y+ WL KH K NG+
Sbjct: 7 LVTSTLLFLSFTLSCAIDTSTITNY-----------TDNEVMTMYEEWLVKHQKVYNGLR 55
Query: 68 HNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
+KRFQ+FKDNL FI EHN+ N TYK+GLN+FAD+TNEEYR MY GT+SDAKRRLMK+
Sbjct: 56 EKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKT 115
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
K RYA AGD LP VDWR KGAV P+KDQGSCGSCWAFSTVA VE NKIVTG+ +
Sbjct: 116 KSTGHRYAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEATNKIVTGKFV 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
SLSEQELVDCDR N CNGGLMDYAF+FIIQNGG+D+++DYPY G + CDP+
Sbjct: 176 SLSEQELVDCDRAYNERCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPT 229
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 310 bits (794), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 149/316 (47%), Positives = 208/316 (65%), Gaps = 5/316 (1%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
D+ + ++ W+A+ G+ R ++FK N+ FI+ N+ N + +G N+FADLT
Sbjct: 34 DNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANVAFIESFNAENHEFWLGANQFADLT 93
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
N+E+RA T K+ ++ +Y+ + D LP SVDWR KGAV P+K+QG CGS
Sbjct: 94 NDEFRASK--TNKGIKQGGVRDAPTGFKYSDVSIDALPASVDWRTKGAVTPIKNQGQCGS 151
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMD 223
CWAFS VAA EG+ K+ TG+L+SLSEQELVDCD ++ GC GG MD AF+FII+NGG+
Sbjct: 152 CWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHGVDQGCMGGWMDDAFKFIIKNGGLT 211
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
+E +YPY G ++KC + +I GYEDV DE +L KAVA QPVSV ++ G F
Sbjct: 212 TEANYPYTGEDDKCKSNETVNVAATIKGYEDVPANDESALMKAVAHQPVSVVVDGGDMTF 271
Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Q Y GV TG CG +DHG+ A+GYG T NG YWL++NSWG+ WGE G++++ +++ D
Sbjct: 272 QLYAGGVMTGSCGVEMDHGIAAIGYGATSNGTKYWLMKNSWGTTWGEKGFLRMAKDIPDK 331
Query: 343 NTGKCGIAMEASYPVK 358
G CG+AM+ SYP +
Sbjct: 332 R-GMCGLAMKPSYPTE 346
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 212/318 (66%), Gaps = 14/318 (4%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
D ++ +++W++++G++ +++F++FK N FID N+ N + +G+N+FAD+T
Sbjct: 30 DLSMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAFIDSFNAKNHKFWLGINQFADIT 89
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQ---RYACKAGDELPESVDWREKGAVNPVKDQGS 161
NEE++ + + + +KV + Y + D LP ++DWR KGAV PVKDQG
Sbjct: 90 NEEFKV------TKTNKGFISNKVRASTGFSYENVSIDALPATIDWRTKGAVTPVKDQGQ 143
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA EGI K+ TG+L+SLSEQELVDCD + GC GGLMD AF+FII NG
Sbjct: 144 CGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNG 203
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ E YPY + KC ++A +I YEDV +E +L KAVA+QPVSVA++ G
Sbjct: 204 GLTQESSYPYDAEDGKCKSGSKSAG--TIKSYEDVPANNEGALMKAVANQPVSVAVDGGD 261
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y GV TG CG+ LDHG+ A+GYG T +G YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFLRMEKDI 321
Query: 340 LDTNTGKCGIAMEASYPV 357
D G CG+AME SYP
Sbjct: 322 AD-KKGMCGLAMEPSYPT 338
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 150/293 (51%), Positives = 203/293 (69%), Gaps = 7/293 (2%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
D SI+ Y + D+++ +++ W++ K + RF++FKDNL+ IDE
Sbjct: 30 DYSIVGYS-----PEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDE 84
Query: 86 HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
N ++Y +GLN+FADL++EE++ MYLG ++D RR + A +A + + +P+SV
Sbjct: 85 TNKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYA--EFAYRDVEAVPKSV 142
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DWR+KGAV VK+QGSCGSCWAFSTVAAVEGINKIVTG L +LSEQEL+DCD N GCN
Sbjct: 143 DWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCN 202
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GGLMDYAF++I++NGG+ E+DYPY E C+ + ++ V+I+G++DV DE SL K
Sbjct: 203 GGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLK 262
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
A+A QP+SVAI+A GR FQ Y GVF G CG LDHGV AVGYG+ G DY +
Sbjct: 263 ALAHQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYII 315
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/320 (47%), Positives = 207/320 (64%), Gaps = 8/320 (2%)
Query: 43 RTDDEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLN 98
R DEV + W+ +HG+ R+ +FK N+ I+ N + T+K+ +N
Sbjct: 26 RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVN 85
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
+FADLTNEE+R+MY G + ++ ++K S RY + D LP SVDWR+KGAV P+KD
Sbjct: 86 QFADLTNEEFRSMYTGYKGNSVLS-SRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKD 144
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QGSCGSCWAFS VAA+EG+ +I G+LISLSEQELVDCD + GC GG M+ AF + +
Sbjct: 145 QGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-DDGCMGGYMNSAFNYTMT 203
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
GG+ SE +YPY + C+ ++ SI G+EDV DE +L KAVA PVS+ I
Sbjct: 204 TGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAG 263
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
GG FQ Y SGVF+GEC + LDHGV VGYG + NG YW+++NSWG WGE GY+++++
Sbjct: 264 GGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKK 323
Query: 338 NLLDTNTGKCGIAMEASYPV 357
+ G+CG+AM ASYP
Sbjct: 324 D-TKAKHGQCGLAMNASYPT 342
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 203/317 (64%), Gaps = 13/317 (4%)
Query: 50 TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---------LNRTYKVGLNKF 100
++ W A+HGK R +F DN F+ HN+ +Y + LN F
Sbjct: 39 ALFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAF 98
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQ 159
ADLT+EE+RA LG R A ++S A G +P+++DWRE GAV VKDQ
Sbjct: 99 ADLTHEEFRAARLG-RIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQ 157
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
GSCG+CW+FS A+EGINKI TG L+SLSEQEL+DCDR N+GC GGLMDYA++F+++N
Sbjct: 158 GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKN 217
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+D+E+DYPY A+ C+ ++ ++V+IDGY DV E L +AVA QPVSV I
Sbjct: 218 GGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGS 277
Query: 280 GRAFQHY-ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
RAFQ Y + G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG WG GY+ + RN
Sbjct: 278 ARAFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRN 337
Query: 339 LLDTNTGKCGIAMEASY 355
D+ G CGI M AS+
Sbjct: 338 TGDSK-GVCGINMMASF 353
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 200/315 (63%), Gaps = 9/315 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-------NRTYKVGLNKFADLT 104
++ W A+HGK G R F +N F+ HN +Y + LN FADLT
Sbjct: 39 FEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLT 98
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
++E+RA LG + L + + + G +P+++DWR+ GAV VKDQGSCG+
Sbjct: 99 HDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVG-AVPDALDWRQSGAVTKVKDQGSCGA 157
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CW+FS A+EGINKI TG L+SLSEQEL+DCDR N GC GGLM YA++F+I+NGG+D+
Sbjct: 158 CWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDT 217
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E DYP+ A+ C+ ++ VV+IDGY++V E L +AVA QP+SV I RAFQ
Sbjct: 218 EDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQ 277
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG WG GY+ + RN +++
Sbjct: 278 LYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRN-TGSSS 336
Query: 345 GKCGIAMEASYPVKN 359
G CGI M AS+P K
Sbjct: 337 GICGINMMASFPTKT 351
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 152/309 (49%), Positives = 209/309 (67%), Gaps = 7/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+ K+GK +KRF IF++N+ FI+ N+ N+ YK+ +N AD TNEE+ A
Sbjct: 38 HEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNEEFMA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ G + + L + +Y + ++P +VDWR+KG V +KDQ CG+CWAFS
Sbjct: 98 SHKGYKGSHWQGLRITTQTPFKY--ENVTDIPWAVDWRQKGDVTSIKDQAQCGNCWAFSA 155
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VAA EGI +I TG L+SLSE+ELVDCD ++ GC+GGLM++ F+FII+NGG+ SE +YPY
Sbjct: 156 VAATEGIYQITTGNLVSLSEKELVDCD-SVDHGCDGGLMEHGFEFIIKNGGISSEANYPY 214
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESG 289
CD ++ + V I GYE V E L+KAVA+Q +SV+I+AGG AFQ Y SG
Sbjct: 215 TAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQLTMSVSIDAGGSAFQFYPSG 274
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG+CG+ LDHGV AVGYG T+ G YW+V+NSWG+ WGE GY+++ R +D G CG
Sbjct: 275 VFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQWGEEGYIRMLRG-IDAQEGLCG 333
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 334 IAMDASYPT 342
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 166/361 (45%), Positives = 216/361 (59%), Gaps = 17/361 (4%)
Query: 8 LAISTLVFLFFISSSSA---ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
LA++ V ++ SA D S++ Y S +++++W KHGK
Sbjct: 5 LAVAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPS--------SLFRSWSVKHGKLYA 56
Query: 65 GMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR-- 122
+R++IFK NL I E N N +Y +GLN+FAD+ +EE++A YLG + R
Sbjct: 57 SPTEKLERYEIFKQNLMHIAETNRKNGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGA 116
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
+ RYA A LP SVDWR KGAV PVK+QG CGSCWAFS+VAAVEGIN+IVT
Sbjct: 117 PQTRTPTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVT 176
Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC---DP 239
G+L+SLSEQELVDCD ++ GC GG MD AF +++ + G+ +E DYPYL E C P
Sbjct: 177 GKLVSLSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQP 236
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
+ G+EDV E+SL KA+A QPVSV I AG R FQ Y GVF G C L
Sbjct: 237 CVLGITEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVEL 296
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
DH + AVGYG+ G +Y ++NSWG +WGE GYV+++ G CGI ASYPVKN
Sbjct: 297 DHALTAVGYGSSYGQNYITMKNSWGKNWGEQGYVRIKMG-TGKPEGVCGIYTMASYPVKN 355
Query: 360 S 360
+
Sbjct: 356 A 356
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 169/365 (46%), Positives = 232/365 (63%), Gaps = 29/365 (7%)
Query: 1 MATASMFLAISTLVFL--FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLA 57
M T +AI L+ L +I++S+ H+ +SS D EVM + Y++WL
Sbjct: 1 MKTTITLVAIINLLVLCNLWITASACPA--------KHNDNSS---DSEVMRMRYESWLK 49
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYL--GT 115
K+G+ E RF+I++ N++FI+ +NS N +YK+ NKF DLTNEE+R MYL
Sbjct: 50 KYGQKYRNKDEWEFRFEIYRANVQFIEVYNSQNYSYKLMDNKFVDLTNEEFRRMYLVYQP 109
Query: 116 RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
RS + R M K GD LP+ +DWR +GAV +KDQG CGSCW+FS VA VE
Sbjct: 110 RSHLQTRFMYQK---------HGD-LPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVE 159
Query: 176 GINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
INKI TG+L+SLSEQ+L+DCD R N GCNGG M+ F FI + GG+ ++++YPY G++
Sbjct: 160 DINKIKTGKLVSLSEQQLIDCDNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSD 218
Query: 235 NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
+ ++ V+I GYE++ +E LK AVA QP SVA +AGG AFQ Y G F+G
Sbjct: 219 GDXNKAKVRNHAVAICGYENLPAHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGS 278
Query: 295 CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
CG L+H + VGYG ENG YWLV+NSW +D G +GY++++R+ D + G CG AMEAS
Sbjct: 279 CGKDLNHRMTIVGYGEENGEKYWLVKNSWANDXGVSGYIRMKRDPKDKD-GTCGTAMEAS 337
Query: 355 YPVKN 359
YP K+
Sbjct: 338 YPDKH 342
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 148/316 (46%), Positives = 213/316 (67%), Gaps = 10/316 (3%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
D ++ ++ W+ ++G+ ++F++FK N FI+ N+ N + +G+N+FAD+T
Sbjct: 30 DLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEFINSFNAGNHKFWLGINQFADIT 89
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
NEE++A T+++ K +V + Y + D LP ++DWR KGAV P+KDQG CG
Sbjct: 90 NEEFKA----TKTNKGFISNKVRVPTGFMYENMSFDALPATIDWRTKGAVTPIKDQGQCG 145
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VAA+EGI K+ TG+L+SLSEQELVDCD + GC GGLMD AF+FII+NGG+
Sbjct: 146 CCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGL 205
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
E +YPY A+ KC +A +I YEDV +E +L KAVA+QPVSVA++ G
Sbjct: 206 TQESNYPYDAADGKCKSGSSSA--ATIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMT 263
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y GV TG CG+ LDHG+ A+GYG T +G +W+++NSWG+ WGENG++++++++ D
Sbjct: 264 FQFYSGGVMTGSCGTDLDHGIAAIGYGTTSDGTKFWIMKNSWGTSWGENGFLRMEKDIAD 323
Query: 342 TNTGKCGIAMEASYPV 357
G CG+AME SYP
Sbjct: 324 -KKGMCGLAMEPSYPT 338
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 306 bits (785), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 209/318 (65%), Gaps = 14/318 (4%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
D ++ +++W+ ++G+ +F++FK N FID N+ N + +G+N+FAD+T
Sbjct: 30 DLSMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGFIDSFNAGNHKFWLGINQFADIT 89
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQR---YACKAGDELPESVDWREKGAVNPVKDQGS 161
N+E++A + + + +KV + Y + D LP S+DWR KGAV PVKDQG
Sbjct: 90 NKEFKA------TKTNKGFISNKVRAPTGFSYENVSFDALPASIDWRTKGAVTPVKDQGQ 143
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA EGI K+ TG+L+SLSEQELVDCD + GC GGLMD AF+FII NG
Sbjct: 144 CGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIISNG 203
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ E YPY + KC ++A +I YEDV +E +L KAVA+QPVSVA++ G
Sbjct: 204 GLTQESSYPYDAEDGKCKSGSKSAG--TIKSYEDVPANNEGALMKAVANQPVSVAVDGGD 261
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y GV TG CG+ LDHG+ A+GYG T +G YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFLRMEKDI 321
Query: 340 LDTNTGKCGIAMEASYPV 357
D G CG+AME SYP
Sbjct: 322 AD-KKGMCGLAMEPSYPT 338
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 151/284 (53%), Positives = 201/284 (70%), Gaps = 8/284 (2%)
Query: 77 KDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
K+N+ +I+ +N+ N+ YK+G+N+FADLT+EE+ + R + R ++ + +Y
Sbjct: 5 KENVNYIEAFNNAANKPYKLGINQFADLTSEEF--IVPRNRFNGHMRFSNTRTTTFKY-- 60
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ LP+S+DWR+KGAV P+K+QGSCG CWAFS +AA EGI+KI TG+L+SLSEQE+VD
Sbjct: 61 ENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVD 120
Query: 196 CDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
CD K + GC GG MD AF+FIIQN G+++E YPY G + KC+ +I GYED
Sbjct: 121 CDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGYED 180
Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-G 313
V +E +L+KAVA+QPVSVAI+A G FQ Y+SG+FTG CG+ LDHGV AVGYG N G
Sbjct: 181 VPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEG 240
Query: 314 VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
YWLV+NSWG++WGE GY +QR + G CGIAM ASYP
Sbjct: 241 TKYWLVKNSWGTEWGEEGYTMMQRGVKAVE-GICGIAMLASYPT 283
>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 290
Score = 305 bits (781), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 155/256 (60%), Positives = 195/256 (76%), Gaps = 14/256 (5%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFA 101
R + EV +Y+ WL ++ K NG+G E+RF+IFKDNL+F+DEHNS+ +RT++VGL +FA
Sbjct: 35 RNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFA 94
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DLTNEE+RA+YL R +R K V ++RY K GD LP+ VDWR GAV VKDQG+
Sbjct: 95 DLTNEEFRAIYL--RKKMER--TKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNG 220
CGSCWAFS V AVEGIN+I TGELISLSEQELVDCDR +NAGC+GG+M+YAF+FI++NG
Sbjct: 151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210
Query: 221 GMDSEQDYPY----LGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
G++++QDYPY LG C+ + N +VV+IDGYEDV DE SLKKAVA QPVSVA
Sbjct: 211 GIETDQDYPYNANDLGL---CNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVA 267
Query: 276 IEAGGRAFQHYESGVF 291
IEA +AFQ Y+S F
Sbjct: 268 IEASSQAFQLYKSVNF 283
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 151/308 (49%), Positives = 207/308 (67%), Gaps = 17/308 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
++ W+A++G+ E R+ IFK+N+ ID NS ++Y +G+N+FADL+NEE++A
Sbjct: 5 HEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNEEFKA 64
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K + + RY + +P ++DWR+KGAV PVKDQG C
Sbjct: 65 ----SRNRFKGHMCSPQAGPFRYENVSA--VPATMDWRKKGAVTPVKDQGQC-------- 110
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGIN++ TG+LISLSEQE+VDCD K + GCNGGLMD AF+FI QN G+ +E +YP
Sbjct: 111 VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 170
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + + I G++DV E +L KAVA QPVSVAI+AGG FQ Y SG
Sbjct: 171 YTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSG 230
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
+FTG CG+ LDHGV AVGYG +G YWLV+NSWG+ WGE GY+++Q++ + G CGI
Sbjct: 231 IFTGSCGTELDHGVTAVGYGGSDGTKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGI 289
Query: 350 AMEASYPV 357
AM+ASYP
Sbjct: 290 AMQASYPT 297
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 158/358 (44%), Positives = 229/358 (63%), Gaps = 19/358 (5%)
Query: 5 SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
+ +IS L+F L S+AD SI+ Y + D +S+ R ++ ++++W+ KH
Sbjct: 2 TTICSISKLIFVATCLIVHVGLSSADFSIVGYSQD-DLTSTER----LIRLFESWMLKHD 56
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
+ N + RF+IFKDNL +IDE N N +Y +GLN+F DLT++E++ Y+G+ +
Sbjct: 57 RVYNNIEEKIHRFEIFKDNLMYIDETNKKNNSYWLGLNEFVDLTHDEFKEKYVGSIGEDF 116
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+ +S + + K + PES+DWR+KGAV PVK CGSCWAFSTVA VEGINKI
Sbjct: 117 VTIEQSN--DEEFPYKHVVDYPESIDWRDKGAVTPVKPN-PCGSCWAFSTVATVEGINKI 173
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
VTG+LISLSEQEL+DCDR+ + GC GG + Q+++ NG + +E++YPY + KC
Sbjct: 174 VTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVVDNG-VHTEKEYPYEKKQGKCRAK 231
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ V I GY+ V DE+SL +A+A+QPVSV +E+ GRAFQ Y+ G+F G CG+ LD
Sbjct: 232 EKKGTKVQITGYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKGGIFNGPCGTKLD 291
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
H V A+GYG Y L++NSWG +WGE GY+K++R + G CG+ + +P K
Sbjct: 292 HAVTAIGYGK----TYILIKNSWGPNWGEKGYLKIKR-ASGKSEGTCGVYKSSYFPTK 344
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 156/360 (43%), Positives = 225/360 (62%), Gaps = 17/360 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
A S +L ++ LFFI + +S N++ + R D W+ H
Sbjct: 3 FANLSQYLCLA----LFFICLGLWSSQVALSRPINYEATMRARHDQ--------WIVHHE 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K + E RFQIFK+N+ I+ N+ ++ YK+G NKF+DLTNEE+R ++ G +
Sbjct: 51 KVYKDLNEKEVRFQIFKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSH 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+ + SK + D +P ++DWR+KGAV P+KDQ CG CWAFS VAA+EG+++
Sbjct: 111 PKVMTSSKGKTHFRYTNVTD-IPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQ 169
Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TGELI LSEQELVDCD + + GC+GGL+D AF FI++N G+ +E +YPY G + C+
Sbjct: 170 LKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCN 229
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ I GYEDV E +L +AVA+QPVSVAI+ FQ Y SGVF+G C +
Sbjct: 230 KKKSALSAAKITGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTW 289
Query: 299 LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
L+H V AVGYG T +G YW+++NSWGS WG++GY++++R++ + G CG+AM+ASYP
Sbjct: 290 LNHAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKE-GLCGLAMDASYPT 348
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 303 bits (777), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 143/300 (47%), Positives = 205/300 (68%), Gaps = 7/300 (2%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
D ++ ++ W+AK + +RF+ FK N+ FI+ N+ N + +G+N+F DLT
Sbjct: 30 DAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAFIESFNTGNHKFWLGVNQFTDLT 89
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
N+E+RA T+++ + ++ ++ +Y + D LP +VDWR KG V P+KDQG CG
Sbjct: 90 NDEFRA----TKTNKGLKRNGARAPTRFKYNNVSTDALPAAVDWRTKGVVTPIKDQGQCG 145
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGM 222
CWAFS VAA EGI K+ TG+L+SLSEQELVDCD ++ GC GG MD AF+FII+NGG+
Sbjct: 146 CCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVDQGCEGGEMDNAFKFIIKNGGL 205
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E +YPY + +C S + V +I GYEDV DE SL KAVA+QPVSVA++ G
Sbjct: 206 TTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVI 265
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQHY GV TG CG+ LDHG+VA+GYG T +G +WL++NSWG+ WGE+GY+++++++ D
Sbjct: 266 FQHYSGGVMTGSCGTDLDHGIVAIGYGMTSDGTKFWLLKNSWGTTWGESGYLRMEKDISD 325
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 303 bits (776), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 154/310 (49%), Positives = 203/310 (65%), Gaps = 6/310 (1%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ W+ KHG+ G ++RF+++K+NL I+E NS Y + NKFADLTNEE+RA
Sbjct: 119 FEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHGYTLTDNKFADLTNEEFRAK 178
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGD---ELPESVDWREKGAVNPVKDQGSCGSCWAF 168
LG R +++ AS D +LP+ VDWR+KGAV VK+QGSCGSCWAF
Sbjct: 179 MLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCWAF 238
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
S VAA+EG+N+I G+L+SLSEQELVDCD + GC GG M +AF+F++ N G+ +E Y
Sbjct: 239 SAVAAMEGLNQIKNGKLVSLSEQELVDCDAEA-VGCAGGFMSWAFEFVMANHGLTTEASY 297
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY G C ++ N VSI GY +V+ E L K A QPVSVA++AGG FQ Y
Sbjct: 298 PYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYAG 357
Query: 289 GVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
GVF+G C + ++HGV VGYG T+ YW+V+NSWG +WGE GY+ +QR+ TG C
Sbjct: 358 GVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRD-AGVPTGLC 416
Query: 348 GIAMEASYPV 357
GIAM ASYPV
Sbjct: 417 GIAMLASYPV 426
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 209/326 (64%), Gaps = 16/326 (4%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
D ++ ++ W+ +HG+ G ++RF++++ N+ ++ NS++ YK+ NKFADLTN
Sbjct: 25 DLMLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTN 84
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACK---AGDELPESVDWREKGAVNPVKDQGSC 162
EE+RA LG R + S S A + D LP+SVDWR+KGAV VK+QG C
Sbjct: 85 EEFRAKMLGFRPHVTIPQI-SNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDC 143
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFS VAA+EGIN+I GEL+SLSEQELVDCD + GC GG M +AF+F++ N G+
Sbjct: 144 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHGL 202
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E YPY A C ++ N V+I GY +V+P E L +A A QPVSVA++ G
Sbjct: 203 TTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFM 262
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVD----------YWLVRNSWGSDWGENG 331
FQ Y SGV+TG C + ++HGV VGYG +E D YW+V+NSWG++WG+ G
Sbjct: 263 FQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAG 322
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
Y+ +QR++ +G CGIA+ SYPV
Sbjct: 323 YILMQRDVAGLASGLCGIALLPSYPV 348
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 148/302 (49%), Positives = 201/302 (66%), Gaps = 6/302 (1%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMY 112
W+ +HG+ R+ +FK N+ I+ N + T+K+ +N+FADLTNEE+R+MY
Sbjct: 35 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 94
Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
G + ++ ++K S RY + D LP SVDWR+KGAV P+KDQG CGSCWAFS VA
Sbjct: 95 TGFKGNSVLS-SRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVA 153
Query: 173 AVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
A+EG+ +I G+LISLSEQELVDCD + GC GGLMD AF + I GG+ SE +YPY
Sbjct: 154 AIEGVAQIKKGKLISLSEQELVDCDTN-DGGCMGGLMDTAFNYTITIGGLTSESNYPYKS 212
Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
C+ ++ SI G+EDV DE +L KAVA PVS+ I G FQ Y SGVF+
Sbjct: 213 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFS 272
Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
GEC + LDHGV AVGYG ++NG+ YW+++NSWG WGE GY++++++ + G+CG+AM
Sbjct: 273 GECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKD-IKPKHGQCGLAM 331
Query: 352 EA 353
A
Sbjct: 332 NA 333
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 209/326 (64%), Gaps = 16/326 (4%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
D ++ ++ W+ +HG+ G ++RF++++ N+ ++ NS++ YK+ NKFADLTN
Sbjct: 26 DLMLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTN 85
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACK---AGDELPESVDWREKGAVNPVKDQGSC 162
EE+RA LG R + S S A + D LP+SVDWR+KGAV VK+QG C
Sbjct: 86 EEFRAKMLGFRPHVTIPQI-SNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDC 144
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFS VAA+EGIN+I GEL+SLSEQELVDCD + GC GG M +AF+F++ N G+
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHGL 203
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E YPY A C ++ N V+I GY +V+P E L +A A QPVSVA++ G
Sbjct: 204 TTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFM 263
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVD----------YWLVRNSWGSDWGENG 331
FQ Y SGV+TG C + ++HGV VGYG +E D YW+V+NSWG++WG+ G
Sbjct: 264 FQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAG 323
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
Y+ +QR++ +G CGIA+ SYPV
Sbjct: 324 YILMQRDVAGLASGLCGIALLPSYPV 349
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 150/313 (47%), Positives = 214/313 (68%), Gaps = 5/313 (1%)
Query: 49 MTIYQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEE 107
M +Q W+ ++ K +N + E RF ++ +NL +I +N+ ++ + LN FADLT +E
Sbjct: 42 MAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTSHWLHLNAFADLTTDE 101
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R LG A++ + + + Y ++LP +DWR+KGAV VK+QG CGSCWA
Sbjct: 102 FRNR-LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWA 160
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
F+T +VEGIN IVTGEL SLSEQELVDCD + GC+GGLMDYA+Q+II+NGG+D+E D
Sbjct: 161 FATTGSVEGINAIVTGELASLSEQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDD 220
Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
YPY + C +++N +VV+IDGY D+ DE++LKKA A QP++VAIEA ++FQ Y
Sbjct: 221 YPYTAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYG 280
Query: 288 SGVFTGE-CGSALDHGVVAVGYGTENGV-DYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV+ CG++L+HGV+ VGYG + +YW+V+NSWG +WG+NGY++L+ D G
Sbjct: 281 GGVYDDPTCGTSLNHGVLVVGYGKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQ-G 339
Query: 346 KCGIAMEASYPVK 358
CGIAM S+P K
Sbjct: 340 MCGIAMAPSFPTK 352
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 152/308 (49%), Positives = 198/308 (64%), Gaps = 11/308 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W K+GK +KR IFKDN+ FI+ N+ N+ YK+ +N D TNEE+ A
Sbjct: 40 HEQWTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNEEFVA 99
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ G + K Y G +P +VDWRE GAV +KDQG CG+CWAFST
Sbjct: 100 SHNGYKHKGSHSQTPFK-----YENITG--VPNAVDWRENGAVXAMKDQGQCGNCWAFST 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VA EGI +I T L+SLSEQELVDCD ++ GC+GG M+ F+FI +NGG+ SE +YPY
Sbjct: 153 VATTEGIYQITTSMLMSLSEQELVDCD-SVDHGCDGGYMEGGFEFIXKNGGISSEANYPY 211
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+ D ++ + I GYE V E +L+KAVA+QPVSV I+ GG AFQ SGV
Sbjct: 212 TAVDGTYDANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDVGGSAFQFNSSGV 271
Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG+CG+ LDHGV AVGYG T++G YW+V+NSWG+ WGE GY+++QR D G CGI
Sbjct: 272 FTGQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRG-TDAQEGLCGI 330
Query: 350 AMEASYPV 357
AM+ASYP
Sbjct: 331 AMDASYPT 338
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 141/308 (45%), Positives = 201/308 (65%), Gaps = 3/308 (0%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK +KRFQIFK+N+ FI+ N+ ++ + + +N+FADL +EE++A
Sbjct: 38 HENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLSINQFADLHDEEFKA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ + + + + +L ++DWR++GAV P+KDQ CGSCWAFS
Sbjct: 98 LLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVTPIKDQRRCGSCWAFSA 157
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VAA+EGI++I T +L+SLSEQELVDC + + GCNGG M+ AF+F+ + GG+ SE YPY
Sbjct: 158 VAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFEFVAKKGGIASESYYPY 217
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
G + C + V I GYE V E +L+KAVA QPVSV +EAGG AFQ Y SG+
Sbjct: 218 KGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSVYVEAGGNAFQFYSSGI 277
Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG+CG+ DH + VGYG + G YWLV+NSWG+ WGE GY++++R+ + G CGI
Sbjct: 278 FTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYIRMKRD-IRAKEGLCGI 336
Query: 350 AMEASYPV 357
AM A YP
Sbjct: 337 AMNAFYPT 344
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 145/311 (46%), Positives = 198/311 (63%), Gaps = 9/311 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-------NRTYKVGLNKFADLT 104
++ W A+HGK G R F +N F+ HN +Y + LN FADLT
Sbjct: 39 FEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLT 98
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
++E+RA LG + L + + + G +P+++DWR+ GAV VKDQGSCG+
Sbjct: 99 HDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVG-AVPDALDWRQSGAVTKVKDQGSCGA 157
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CW+FS A+EGINKI TG L+SLSEQEL+DCDR N GC GGLM YA++F+I+NGG+D+
Sbjct: 158 CWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDT 217
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E DYP+ A+ C+ ++ VV+IDGY++V E L +AVA QP+SV I RAFQ
Sbjct: 218 EDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQ 277
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG WG GY+ + RN +++
Sbjct: 278 LYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRN-TGSSS 336
Query: 345 GKCGIAMEASY 355
G CGI M AS+
Sbjct: 337 GICGINMMASF 347
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 158/360 (43%), Positives = 219/360 (60%), Gaps = 14/360 (3%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M S L ++ L + + SSS + + + D + + R ++ W+A+HG
Sbjct: 1 MGAISKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAAR--------HERWMAQHG 52
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLTNEEYRAMYLGTRSDA 119
+ +R ++FK N+ FI+ N+ + Y +G+N+FADLT+EE++A ++ +
Sbjct: 53 RVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFS 112
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+Y + D LP SVDWR KGAV +KDQG CG CWAFS VAA+EGI K
Sbjct: 113 TPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVK 172
Query: 180 IVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TG+LISLSEQELVDCD N GC GG +D AFQFI+ NGG+ +E +YPY + +C
Sbjct: 173 LSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCK 232
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ SI GYEDV DE SL KAVA QPVSVA++A FQ Y GV GECG++
Sbjct: 233 TTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTS 290
Query: 299 LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV +GYG +G YWLV+NSWG+ WGE GY++++++ +D G CG+AM+ SYP
Sbjct: 291 LDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKD-IDDKRGMCGLAMQPSYPT 349
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 301 bits (771), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 151/313 (48%), Positives = 211/313 (67%), Gaps = 11/313 (3%)
Query: 44 TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
+DD M ++ W+A++G+ +RF++FK N FI+ N+ N + +G+N+FAD
Sbjct: 28 SDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAFIESFNAGNHKFWLGVNQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
LTN+E+R T+++ ++V + RY D LP ++DWR KG V P+KDQG
Sbjct: 88 LTNDEFRL----TKTNKGFIPSTTRVPTGFRYENVNIDALPATMDWRTKGVVTPIKDQGQ 143
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA+EGI K+ TG+LISLSEQELVDCD + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 203
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E +YPY A++KC + V SI GYEDV +E +L KAVA+QPVSVA++
Sbjct: 204 GLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGDD 261
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y+ GV G CG+ LDHG+VA+GYG +G YWL++NSWG WGENG++++++++
Sbjct: 262 MTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDI 321
Query: 340 LDTNTGKCGIAME 352
D G CG+AME
Sbjct: 322 SDKR-GMCGLAME 333
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 150/316 (47%), Positives = 204/316 (64%), Gaps = 8/316 (2%)
Query: 43 RTDDEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLN 98
R DEV + W+ +HG+ R+ +FK N+ I+ N + T+K+ +N
Sbjct: 20 RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVN 79
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
+FADLTNEE+R+MY G + ++ ++K S RY + D LP SVDWR+KGAV P+KD
Sbjct: 80 QFADLTNEEFRSMYTGYKGNSVLS-SRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKD 138
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QGSCGSCWAFS VAA+EG+ +I G+LISLSEQELVDCD + GC GG M+ AF + +
Sbjct: 139 QGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-DDGCMGGYMNSAFNYTMT 197
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
GG+ SE +YPY + C+ ++ SI G+EDV DE +L KAVA PVS+ I
Sbjct: 198 TGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAG 257
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
GG FQ Y SGVF+GEC + LDHGV VGYG + NG YW+++NSWG WGE GY+++++
Sbjct: 258 GGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKK 317
Query: 338 NLLDTNTGKCGIAMEA 353
+ G+CG+AM A
Sbjct: 318 D-TKAKHGQCGLAMNA 332
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 152/361 (42%), Positives = 226/361 (62%), Gaps = 19/361 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
A S +L ++ + S A I+Y+ + + + W+A H
Sbjct: 3 FANLSQYLCLALFFIFLGVWRSQVASSRPINYEAS------------MRARHDQWIAHHD 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K + E RF+IFK+N+ I+ N+ ++ YK+G+NKF+DLTNE++R ++ G +
Sbjct: 51 KVYKDLNEKEMRFKIFKENVERIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSH 110
Query: 120 KRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
+ + SK + RYA ++P ++DWR+KGAV P+KDQ CG CWAFS VAA EG++
Sbjct: 111 PKVMSSSKPKTHFRYANVT--DIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLH 168
Query: 179 KIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
++ TG+LI LSEQELVDCD + + GC+GGL+D AF FI++N G+ +E +YPY G + C
Sbjct: 169 QLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVC 228
Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
+ + I GYEDV E +L +AVA+QPVSVAI+ FQ Y SGVF+G C +
Sbjct: 229 NKKKSALSAAKIAGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCST 288
Query: 298 ALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
L+H V AVGYG T +G YW+++NSWGS WG++GY++++R++ + G CG+AM+ASYP
Sbjct: 289 WLNHAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKE-GLCGLAMDASYP 347
Query: 357 V 357
Sbjct: 348 T 348
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 198/315 (62%), Gaps = 13/315 (4%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR------TYKVGLNKFADLT 104
+++ W +H KT + R ++F+DN F+ +HN +Y + LN FADLT
Sbjct: 32 LFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADLT 91
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
+ E++ LG L++ K Q + +P +DWR+ GAV PVKDQ SCG+
Sbjct: 92 HHEFKTTRLG----LPLTLLRFK-RPQNQQSRDLLHIPSQIDWRQSGAVTPVKDQASCGA 146
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CWAFS A+EGINKIVTG L+SLSEQEL+DCD N+GC GGLMD+A+QF+I N G+D+
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDT 206
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E DYPY + C + + V+I+ Y DV P +E L KAVA QPVSV I R FQ
Sbjct: 207 EDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEIL-KAVASQPVSVGICGSEREFQ 265
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y G+FTG C + LDH V+ VGYG+ENGVDYW+V+NSWG WG NGY+ + RN ++
Sbjct: 266 LYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNSK- 324
Query: 345 GKCGIAMEASYPVKN 359
G CGI ASYPVK
Sbjct: 325 GICGINTLASYPVKT 339
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 148/313 (47%), Positives = 217/313 (69%), Gaps = 9/313 (2%)
Query: 52 YQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
++ W H ++ N + E RF+++ +NL ++ +N+ ++ + LN ADL+ EY++
Sbjct: 13 FKEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNARTTSHWLTLNHLADLSTPEYKS 72
Query: 111 MYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
LG D + R+ ++K+ + RY + LP ++DWR+K AV VK+QG CGSCWAF+
Sbjct: 73 KLLGF--DNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFA 130
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
T +VEGIN IVTG L+SLSEQELVDCD + + GC+GGLMDYA+ +II+N G+++E+DYP
Sbjct: 131 TTGSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYP 190
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + +CD ++ +VV+ID YEDV DE++LKKA A QPV+VAIEA ++FQ Y G
Sbjct: 191 YTAMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGG 250
Query: 290 VFTGE-CGSALDHGVVAVGYG---TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
V+ CG++L+HGV+ VGYG T +G +YW+V+NSWG++WG+ GY++L+ D G
Sbjct: 251 VYDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAE-G 309
Query: 346 KCGIAMEASYPVK 358
CGIAM SYPVK
Sbjct: 310 LCGIAMAPSYPVK 322
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 146/307 (47%), Positives = 208/307 (67%), Gaps = 8/307 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK EKRFQIFK+N++FI+ N+ ++ + + +N+FADL NEE++A
Sbjct: 37 HEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKA 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ + + + + S RY ++ ++P ++DWR++GAV P+KDQG+CGSCWAFST
Sbjct: 97 SLINVQKK-ESGVETATETSFRY--ESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFST 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VAA+EGI++I TG+L+SLSEQELVDC + + GCN G + AF+F+ +NGG+ SE YPY
Sbjct: 154 VAAIEGIHQITTGKLVSLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPY 213
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
C + V I GYE+V E +L KAVA+QPVSV I+AG A Q Y SG+
Sbjct: 214 KANNKTCMVKKETQGVAQIKGYENVPSNSEKALLKAVANQPVSVYIDAG--ALQFYSSGI 271
Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG+CG+A +H V +GYG G YWLV+NSWG+ WGE GY+K++R+ + G CGI
Sbjct: 272 FTGKCGTAPNHAVTVIGYGKARGGAKYWLVKNSWGTKWGEKGYIKMKRD-IRAKEGLCGI 330
Query: 350 AMEASYP 356
A ASYP
Sbjct: 331 ATNASYP 337
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 149/331 (45%), Positives = 201/331 (60%), Gaps = 21/331 (6%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
D ++ ++ W+ +HG+ G ++R ++++ N+ ++ NS+ Y++ NKFADLTN
Sbjct: 48 DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFADLTN 107
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYAC--------KAGDELPESVDWREKGAVNPVK 157
EE+RA LG A AC + +LP+SVDWREKGAV PVK
Sbjct: 108 EEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVK 167
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
QG CGSCWAFS VAA+EGIN+I G+L+SLSEQELVDCD K GC GG M +AF+F++
Sbjct: 168 SQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFEFVM 226
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+N G+ +E++YPY G C + VSI GY +V+P E L +A A QPVSVA++
Sbjct: 227 KNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVD 286
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-----------GVDYWLVRNSWGSD 326
AG +Q Y GVFTG C + L+HGV VGYG G YW+V+NSWG +
Sbjct: 287 AGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPE 346
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG+ GY+ +QR +G CGIAM SYPV
Sbjct: 347 WGDAGYILMQRE-ASVASGLCGIAMLPSYPV 376
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 157/361 (43%), Positives = 219/361 (60%), Gaps = 14/361 (3%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M S L ++ L + + SSS + + + D + + R ++ W+A+HG
Sbjct: 1 MGAISKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAAR--------HERWMAQHG 52
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLTNEEYRAMYLGTRSDA 119
+ +R ++FK N+ FI+ N+ + Y +G+N+FADLT+EE++A ++ +
Sbjct: 53 RVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFS 112
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+Y + D LP SVDWR KGAV +KDQG CG CWAFS VAA+EG K
Sbjct: 113 TPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVK 172
Query: 180 IVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TG+LISLSEQELVDCD N GC GG +D AFQFI+ NGG+ +E +YPY + +C
Sbjct: 173 LSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCK 232
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ SI GYEDV DE SL KAVA QPVSVA++A FQ Y GV GECG++
Sbjct: 233 TTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTS 290
Query: 299 LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV +GYG +G YWLV+NSWG+ WGE GY++++++ +D G CG+AM+ SYP
Sbjct: 291 LDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKD-IDDKRGMCGLAMQPSYPT 349
Query: 358 K 358
+
Sbjct: 350 E 350
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 149/331 (45%), Positives = 201/331 (60%), Gaps = 21/331 (6%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
D ++ ++ W+ +HG+ G ++R ++++ N+ ++ NS+ Y++ NKFADLTN
Sbjct: 27 DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFADLTN 86
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYAC--------KAGDELPESVDWREKGAVNPVK 157
EE+RA LG A AC + +LP+SVDWREKGAV PVK
Sbjct: 87 EEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVK 146
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
QG CGSCWAFS VAA+EGIN+I G+L+SLSEQELVDCD K GC GG M +AF+F++
Sbjct: 147 SQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFEFVM 205
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+N G+ +E++YPY G C + VSI GY +V+P E L +A A QPVSVA++
Sbjct: 206 KNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVD 265
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-----------GVDYWLVRNSWGSD 326
AG +Q Y GVFTG C + L+HGV VGYG G YW+V+NSWG +
Sbjct: 266 AGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPE 325
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG+ GY+ +QR +G CGIAM SYPV
Sbjct: 326 WGDAGYILMQRE-ASVASGLCGIAMLPSYPV 355
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 214/312 (68%), Gaps = 9/312 (2%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
E+ +++ W AKHGK+ + +R IF D L +I++HN+ N T+ +GLNKF+DLTN
Sbjct: 32 EIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTN 91
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
E+RAM++G KR + ++ ++ LP S+DWR+KGAV P+KDQG CGSC
Sbjct: 92 AEFRAMHVGKF---KRPRYQDRLPAEDEDVDV-SSLPTSLDWRQKGAVTPIKDQGDCGSC 147
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS +A++E + + T EL+SLSEQ+L+DCD ++AGC+GGLM+ AF+F+++NGG+ +E
Sbjct: 148 WAFSAIASIESAHFLATKELVSLSEQQLMDCD-TVDAGCDGGLMETAFKFVVKNGGVTTE 206
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
YPY G+ C+ ++ KV I G++ V+ +L KAV+ PV+V+I FQ+
Sbjct: 207 AAYPYTGSVGSCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQN 266
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y+SG+ +G+C +LDHGV+ +GYGTE G+ YW+++NSWG+ WGE+G++K++R D G
Sbjct: 267 YKSGILSGKCDDSLDHGVLLIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIERKDGD---G 323
Query: 346 KCGIAMEASYPV 357
CG+ ++SYP
Sbjct: 324 MCGMNGDSSYPT 335
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 158/323 (48%), Positives = 206/323 (63%), Gaps = 14/323 (4%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYK----VGLNK 99
+++ V+ I+Q W KH K EKRF+ FK NL++I E N+ + K VGLNK
Sbjct: 41 SEERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNK 100
Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD-ELPESVDWREKGAVNPVKD 158
FAD++NEE+R YL S K+ + K S+ K + P S+DWR G V VKD
Sbjct: 101 FADMSNEEFRKAYL---SKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKD 157
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QGSCGSCWAFS+ A+EGIN +VTG+LISLSEQELV+CD N GC GG MDYAF+++I
Sbjct: 158 QGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS-NYGCEGGYMDYAFEWVIN 216
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG+DSE DYPY G + C+ ++ KVVSIDGY+DV D +L AVA QPVSV I+
Sbjct: 217 NGGIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSDS-ALLCAVAQQPVSVGIDG 275
Query: 279 GGRAFQHYESGVFTGECG---SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
FQ Y G++ G C +DH V+ VGYG+E+ +YW+V+NSWG+ WG +GY L
Sbjct: 276 SAIDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGSEDSEEYWIVKNSWGTSWGIDGYFYL 335
Query: 336 QRNLLDTNTGKCGIAMEASYPVK 358
+R+ D G C + ASYP K
Sbjct: 336 KRD-TDLPYGVCAVNAMASYPTK 357
>gi|110739710|dbj|BAF01762.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
Length = 300
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 142/195 (72%), Positives = 164/195 (84%), Gaps = 1/195 (0%)
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
AFST+ AVEGINKIVTG+LISLSEQELVDCD N GCNGGLMDYAF+FII+NGG+D+E
Sbjct: 1 AFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEA 60
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
DYPY A+ +CD +R+NAKVV+ID YEDV E SLKKA+A QP+SVAIEAGGRAFQ Y
Sbjct: 61 DYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLY 120
Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
SGVF G CG+ LDHGVVAVGYGTENG YW+VRNSWG+ WGE+GY+K+ RN ++ TGK
Sbjct: 121 SSGVFDGLCGTELDHGVVAVGYGTENGKGYWIVRNSWGNRWGESGYIKMARN-IEAPTGK 179
Query: 347 CGIAMEASYPVKNSQ 361
CGIAMEASYP+K Q
Sbjct: 180 CGIAMEASYPIKKGQ 194
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 143/224 (63%), Positives = 171/224 (76%), Gaps = 2/224 (0%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
+P SVDWR+KGAV VKDQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
N GCNGGLMDYAF+FI Q GG+ +E +YPY + CD S+ NA VSIDG+E+V DE
Sbjct: 62 NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLV 319
+L KAVA+QPVSVAI+AGG FQ Y GVFTG CG+ LDHGV VGYGT +G YW V
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTV 181
Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
+NSWG +WGE GY++++R + D G CGIAMEASYP+K S N+
Sbjct: 182 KNSWGPEWGEKGYIRMERGISDKE-GLCGIAMEASYPIKKSSNN 224
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 158/362 (43%), Positives = 226/362 (62%), Gaps = 17/362 (4%)
Query: 1 MATASMFLAISTLVFL----FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
MAT S +IS ++FL S S+AD + Y + D +S R ++ ++ +W+
Sbjct: 1 MATMS---SISKIIFLATCLIIHMSLSSADFYTVGYSQD-DLTSIER----LIQLFDSWM 52
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
KH K + RF+IF+DNL +IDE N N +Y +GLN FADL+N+E++ Y+G+
Sbjct: 53 LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGSV 112
Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
++ L ++ + K P+S+DWR KGAV PVK+QGSCGSCWAFST+A VEG
Sbjct: 113 AEDFTGL--EHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEG 170
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
+NKIVTG L+ LSEQELVDCD+ + GC GG + Q++ NG + + + YPY +
Sbjct: 171 VNKIVTGNLLELSEQELVDCDKN-SHGCKGGYQTTSLQYVADNG-VHTSKVYPYQAKAMQ 228
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C + + V I GY+ V E S A+A+QP+SV +EAGG+ FQ Y+SGVF G CG
Sbjct: 229 CRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCG 288
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+ LDH V AVGYGT +G +Y +++NSWG +WGE GY++L+R ++ G CG+ + YP
Sbjct: 289 TKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYP 347
Query: 357 VK 358
K
Sbjct: 348 FK 349
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 153/324 (47%), Positives = 205/324 (63%), Gaps = 16/324 (4%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH---NSLNRTYKVGLNKF 100
+++ ++ I+Q W +H K +EKR++ FK NL++I E + + VGLNKF
Sbjct: 42 SEESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKF 101
Query: 101 ADLTNEEYRAMYLGTRS---DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
ADL+NEE++ +YL + KR + C A P S+DWR+KG V VK
Sbjct: 102 ADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDA----PSSLDWRKKGVVTAVK 157
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCW+FST A+EGIN IVTG+LISLSEQELVDCD N GC GG MDYAF+++I
Sbjct: 158 DQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCD-TTNYGCEGGYMDYAFEWVI 216
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
NGG+D+E +YPY G + C+ ++ KVVSIDGY DV D +L A QP+SV ++
Sbjct: 217 NNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETDS-ALLCATVQQPISVGMD 275
Query: 278 AGGRAFQHYESGVFTGECG---SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
FQ Y G++ G+C + +DH V+ VGYG+ENG DYW+V+NSWG++WG GY
Sbjct: 276 GSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSWGTEWGMEGYFY 335
Query: 335 LQRNLLDTNTGKCGIAMEASYPVK 358
++RN D G C I EASYP K
Sbjct: 336 IKRN-TDLPYGVCAINAEASYPTK 358
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 212/320 (66%), Gaps = 14/320 (4%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFAD 102
+D ++ ++ W+ ++G+ +RF++FKDN+ F++ N+ N + +G+N+FAD
Sbjct: 28 SDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDNVAFVESFNTNKNNKFWLGINQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQG 160
LT EE++A + + + KV + +Y + LP +VDWR KGAV P+K+QG
Sbjct: 88 LTIEEFKA------NKGFKPISAEKVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQG 141
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQN 219
CG CWAFS VAA+EGI K+ TG LISLSEQELVDCD ++ GC GG MD AF+F+I+N
Sbjct: 142 QCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKN 201
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+ + YPY + KC ++A +I G+EDV DE +L KAVA+QPVSVA++A
Sbjct: 202 GGLATVSSYPYKAVDGKCKGGSKSA--ATIKGHEDVPVNDEAALMKAVANQPVSVAVDAS 259
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRN 338
R F Y GV TG CG+ LDHG+ A+GYG E +G YW+++NSWG+ WGE G+++++++
Sbjct: 260 DRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKGFLRMEKD 319
Query: 339 LLDTNTGKCGIAMEASYPVK 358
+ D G CG+AM+ SYP +
Sbjct: 320 ISDKQ-GMCGLAMKPSYPTE 338
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 163/359 (45%), Positives = 221/359 (61%), Gaps = 42/359 (11%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA S + I+ L+ + S + + + H+ S S R +D W+ +G
Sbjct: 1 MALESKIICITLLIMGVWASQALSRTL--------HEVSMSERHED--------WMGLYG 44
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
+T + E+RF+IFK+N+ +I+ S+N+ +K N + +
Sbjct: 45 RTYKDIAEKERRFKIFKENVEYIE---SVNK-FKASRNGY-----------------NMS 83
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
R S++ S RY A +P S+DWR+KGAV P+KDQG CG CWAFS VAA+EG+ ++
Sbjct: 84 SRPRSSEITSFRYENVAA--VPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQL 141
Query: 181 VTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
TGELISLSEQELVDCD + GC GGLMD AF+FII NGG+ +E +YPY G + C+
Sbjct: 142 KTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNK 201
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
+ + I YEDV E +L KAVA PVSVAI+AGG FQ Y SGVFTG+CG+ L
Sbjct: 202 KKAASSAAKIKNYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTEL 261
Query: 300 DHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
DHGV AVGYG T++G YWLV+NSWG+ WGE+GY+ ++R+ + + G CGIAMEASYP
Sbjct: 262 DHGVTAVGYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERD-IGADEGLCGIAMEASYPT 319
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 158/362 (43%), Positives = 225/362 (62%), Gaps = 17/362 (4%)
Query: 1 MATASMFLAISTLVFL----FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
MAT S +IS ++FL S+AD + Y + D +S R ++ ++ +W+
Sbjct: 1 MATMS---SISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIER----LIQLFDSWM 52
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
KH K + RF+IF+DNL +IDE N N +Y +GLN FADL+N+E++ Y+G
Sbjct: 53 LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFV 112
Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
++ L ++ + K P+S+DWR KGAV PVK+QG+CGSCWAFST+A VEG
Sbjct: 113 AEDFTGL--EHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEG 170
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
INKIVTG L+ LSEQELVDCD+ + GC GG + Q++ NG + + + YPY + K
Sbjct: 171 INKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANNG-VHTSKVYPYQAKQYK 228
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C + + V I GY+ V E S A+A+QP+SV +EAGG+ FQ Y+SGVF G CG
Sbjct: 229 CRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCG 288
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+ LDH V AVGYGT +G +Y +++NSWG +WGE GY++L+R ++ G CG+ + YP
Sbjct: 289 TKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYP 347
Query: 357 VK 358
K
Sbjct: 348 FK 349
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 205/308 (66%), Gaps = 8/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK EKRFQIFK+N+ FI+ H + ++ + + +N+FADL +++A
Sbjct: 38 HEKWMAQYGKVYKDAAEKEKRFQIFKNNVHFIESFHAAGDKPFNLSINQFADL--HKFKA 95
Query: 111 MYL-GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
+ + G + + R + AS +Y + +P S+DWR++GAV P+KDQG+C SCWAFS
Sbjct: 96 LLINGQKKEHNVRTATATEASFKY--DSVTRIPSSLDWRKRGAVTPIKDQGTCRSCWAFS 153
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
TVA +EG+++I GEL+SLSEQELVDC + + GC GG ++ AF+FI + GG+ SE YP
Sbjct: 154 TVATIEGLHQITKGELVSLSEQELVDCVKGDSEGCYGGYVEDAFEFIAKKGGVASETHYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G C + VV I GYE V E +L KAVA QPVS +EAGG AFQ Y SG
Sbjct: 214 YKGVNKTCKVKKETHGVVQIKGYEQVPSNSEKALLKAVAHQPVSAYVEAGGYAFQFYSSG 273
Query: 290 VFTGECGSALDHGVVAVGYGTENGVD-YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
+FTG+CG+ +DH V VGYG G + YWLV+NSWG++WGE GY++++R+ + G CG
Sbjct: 274 IFTGKCGTDIDHSVTVVGYGKARGGNKYWLVKNSWGTEWGEKGYIRMKRD-IRAKEGLCG 332
Query: 349 IAMEASYP 356
IA A YP
Sbjct: 333 IATGALYP 340
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 140/314 (44%), Positives = 215/314 (68%), Gaps = 11/314 (3%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
E+ +++ W AKHGK+ + +R IF D L +I++HN+ N T+ +GLNKF+DLTN
Sbjct: 36 EIKNMFEDWAAKHGKSYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTN 95
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
E+RAM++G KR + ++ ++ LP S+DWR+KGAV P+KDQG CGSC
Sbjct: 96 AEFRAMHVGKF---KRPRYQDRLPAEDEDVDV-SSLPTSLDWRQKGAVTPIKDQGDCGSC 151
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS +A++E + + T EL+SLSEQ+L+DCD ++AGC+GGLM+ AF+F+++NGG+ +E
Sbjct: 152 WAFSAIASIESAHFLATKELVSLSEQQLMDCD-TVDAGCDGGLMETAFKFVVKNGGVTTE 210
Query: 226 QDYPYLGAENKCDPSRRNA--KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
YPY G+ C+ ++ KV I G++ V+ +L KAV+ PV+V+I F
Sbjct: 211 ASYPYTGSVGSCNANKVAIINKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENF 270
Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Q+Y+SG+ +G+CG +LDHGV+ +GYGTE G+ YW+++NSWG+ WGE+G++K++R D
Sbjct: 271 QNYKSGILSGQCGDSLDHGVLLIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIERKDGD-- 328
Query: 344 TGKCGIAMEASYPV 357
G CG+ ++SYP
Sbjct: 329 -GICGMNGDSSYPT 341
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 151/312 (48%), Positives = 200/312 (64%), Gaps = 18/312 (5%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNE 106
++ + K K +RF +F N+ FI+ HN+ T+ V +N+FADLTNE
Sbjct: 29 LFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTNE 88
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
EYR +YL R L + + AG SVDWR+KGAV P+K+QG CGSCW
Sbjct: 89 EYRQLYL--RPYPTELLGRERQEVWLDGPNAG-----SVDWRQKGAVTPIKNQGQCGSCW 141
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
+FST +VEG + I TG L+SLSEQ+LVDC N GCNGGLMD AF++II NGG+D+E
Sbjct: 142 SFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTE 201
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
QDYPY + CD S+ + VSI GY+DV +E L AV PVSVAIEA ++FQ
Sbjct: 202 QDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQM 261
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y SGVF+G CG+ LDHGV+ VGY + DYW+V+NSWG+ WG+ GY+ ++R + ++ G
Sbjct: 262 YSSGVFSGPCGTNLDHGVLVVGYTS----DYWIVKNSWGASWGDQGYIMMKRGV--SSAG 315
Query: 346 KCGIAMEASYPV 357
CGIAM+ SYP+
Sbjct: 316 ICGIAMQPSYPI 327
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 157/362 (43%), Positives = 224/362 (61%), Gaps = 17/362 (4%)
Query: 1 MATASMFLAISTLVFL----FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
MAT S +IS ++FL S+AD + Y + D +S R ++ ++ +W+
Sbjct: 1 MATMS---SISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIER----LIQLFDSWM 52
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
KH K + RF+IF+DNL +IDE N N +Y +GLN FADL+N+E++ Y+G
Sbjct: 53 LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFV 112
Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
++ L ++ + K P+S+DWR KGAV PVK+QG+CGSCWAFST+A VEG
Sbjct: 113 AEDFTGL--EHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEG 170
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
INKIVTG L+ LSEQELVDCD+ + GC GG + Q++ NG + + + YPY + K
Sbjct: 171 INKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANNG-VHTSKVYPYQAKQYK 228
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C + + V I GY+ V E S A+A+QP+S +EAGG+ FQ Y+SGVF G CG
Sbjct: 229 CRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCG 288
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+ LDH V AVGYGT +G +Y +++NSWG +WGE GY++L+R ++ G CG+ + YP
Sbjct: 289 TKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYP 347
Query: 357 VK 358
K
Sbjct: 348 FK 349
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 145/306 (47%), Positives = 195/306 (63%), Gaps = 6/306 (1%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ W A+HG++ G R F DN F+ HN +Y + LN FADLT++E+RA
Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFRAA 97
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
+ Y G +P++VDWR+ GAV VKDQGSCG+CW+FS
Sbjct: 98 ---RLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFS 154
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
A+EGINKI TG LISLSEQEL+DCDR N+GC GGLMDYA++F+++NGG+D+E DYP
Sbjct: 155 ATGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + C+ ++ +VV+IDGY+DV +E L +AVA QPVSV I RAFQ Y G
Sbjct: 215 YRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKG 274
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
+F G C ++LDH ++ VGYG+E G DYW+V+NSWG WG GY+ + RN ++N G CGI
Sbjct: 275 IFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSN-GVCGI 333
Query: 350 AMEASY 355
S+
Sbjct: 334 NQMPSF 339
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 146/306 (47%), Positives = 196/306 (64%), Gaps = 7/306 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ W A+HG++ G R F DN F+ HN +Y + LN FADLT++E+RA
Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFRA- 96
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
R + Y G +P++VDWR+ GAV VKDQGSCG+CW+FS
Sbjct: 97 ---ARLGRLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFS 153
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
A+EGINKI TG LISLSEQEL+DCDR N+GC GGLMDYA++F+++NGG+D+E DYP
Sbjct: 154 ATGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYP 213
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + C+ ++ +VV+IDGY+DV +E L +AVA QPVSV I RAFQ Y G
Sbjct: 214 YRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKG 273
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
+F G C ++LDH ++ VGYG+E G DYW+V+NSWG WG GY+ + RN ++N G CGI
Sbjct: 274 IFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSN-GVCGI 332
Query: 350 AMEASY 355
S+
Sbjct: 333 NQMPSF 338
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 144/319 (45%), Positives = 210/319 (65%), Gaps = 11/319 (3%)
Query: 44 TDDEVMT-IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFA 101
+DD M ++ W+A +G+ +RF++FKDNL F++ N+ + + +G+N+FA
Sbjct: 32 SDDAAMAERHERWMAVYGRVYKDAAEKARRFEVFKDNLAFVESFNADKKNKFWLGVNQFA 91
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DLT EE++A G + + + + + + A LP +VDWR KGAV P+K+QG
Sbjct: 92 DLTTEEFKANK-GFKPISAEEVPTTGFKYENLSVSA---LPTAVDWRTKGAVTPIKNQGQ 147
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA+EGI K+ T L+SLSEQELVDCD ++ GC GG MD AF+F+I+NG
Sbjct: 148 CGCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNG 207
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E YPY + KC ++A +I G+EDV P +E +L KAVA QPVSVA++A
Sbjct: 208 GLATESSYPYKAVDGKCKGGSKSA--ATIKGHEDVPPNNEAALMKAVASQPVSVAVDASD 265
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNL 339
R F Y GV TG CG+ LDHG+ A+GYG E +G YW+++NSWG+ WGE ++++++++
Sbjct: 266 RTFMLYSGGVMTGSCGTQLDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKRFLRMEKDI 325
Query: 340 LDTNTGKCGIAMEASYPVK 358
D G CG+AM+ SYP +
Sbjct: 326 SDKQ-GMCGLAMKPSYPTE 343
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 208/318 (65%), Gaps = 11/318 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFAD 102
+D ++ ++ W+ ++G+ +RF+ FK N+ F++ N+ + + +G+N+FAD
Sbjct: 28 SDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LT EE++A K K +Y + LP +VDWR KGAV P+K+QG C
Sbjct: 88 LTTEEFKA-----NKGFKPTAEKVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQC 142
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
G CWAFS VAA+EGI K+ TG LISLSEQELVDCD ++ GC GG MD AF+F+I+NGG
Sbjct: 143 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 202
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E +YPY + KC ++A +I G+EDV +E +L KAVA+QPVSVA++A R
Sbjct: 203 LATESNYPYKAVDGKCKGGSKSA--ATIKGHEDVPVNNEAALMKAVANQPVSVAVDASDR 260
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
F Y GV TG CG+ LDHG+ A+GYG E +G YW+++NSWG+ WGE G++++++++
Sbjct: 261 TFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNSWGTTWGEKGFLRMEKDIT 320
Query: 341 DTNTGKCGIAMEASYPVK 358
D G CG+AM+ SYP +
Sbjct: 321 DKR-GMCGLAMKPSYPTE 337
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 153/323 (47%), Positives = 203/323 (62%), Gaps = 17/323 (5%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKF 100
T++ + +++ W KH K E+R FK NL++I E N ++ +KVGLNKF
Sbjct: 42 TEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKF 101
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY--ACKAGDELPESVDWREKGAVNPVKD 158
ADL+NEE+R MYL S K+ + + R+ C A P S+DWR KG V VKD
Sbjct: 102 ADLSNEEFREMYL---SKVKKPITIEEKRKHRHLQTCDA----PSSLDWRNKGVVTAVKD 154
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG CGSCW+FST A+E IN IVTG+LISLSEQELVDCD N GC GG MD AFQ++I
Sbjct: 155 QGDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIG 214
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG+D+E DYPY G + C+ ++ KVVSI+GY DV P D +L A QP+SV ++
Sbjct: 215 NGGIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSDS-ALLCATVQQPISVGMDG 273
Query: 279 GGRAFQHYESGVFTGECG---SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
FQ Y G++ G+C + +DH ++ VGYG+EN DYW+V+NSWG++WG GY +
Sbjct: 274 SALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGMEGYFYI 333
Query: 336 QRNLLDTNTGKCGIAMEASYPVK 358
+RN G C I +ASYP K
Sbjct: 334 RRN-TSKPYGVCAINADASYPTK 355
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/325 (45%), Positives = 211/325 (64%), Gaps = 11/325 (3%)
Query: 40 SSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLN 98
+S ++ ++ W+ +HGK E+RFQIFK+NL FI+ N+ + + + +N
Sbjct: 23 TSLVISSRLLEKHEQWMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNAAGDNGFNLSIN 82
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR----YACKAGDELPESVDWREKGAVN 154
+F D TN+E++A YL + K+ L+ +A+ + + E+P ++DWRE+GAV
Sbjct: 83 QFGDQTNDEFKANYLNGK---KKPLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVT 139
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
P+K Q CGSCWAF+TVAA+EGI++I TG L+SLSEQELVDC + GCNGG ++ A
Sbjct: 140 PIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDAC 199
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
FI++ GG+ SE +YPY + KC+ + V I GYE V +E +L KAVA+QP++
Sbjct: 200 DFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYEHVPANNEKALLKAVANQPIA 259
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGY 332
V I A RAFQ Y SG+ G+CG LDH V VGYGT ++GV YWLV+NSWG+ WGE GY
Sbjct: 260 VYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDDGVKYWLVKNSWGTKWGEKGY 319
Query: 333 VKLQRNLLDTNTGKCGIAMEASYPV 357
+K++R+ + G CGIAM +YP+
Sbjct: 320 IKIKRD-VHAKEGSCGIAMVPTYPI 343
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 139/303 (45%), Positives = 206/303 (67%), Gaps = 6/303 (1%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
D ++ ++ W+AK+ + + +RF++FK N+ I+ N+ N + + N+FADLT
Sbjct: 34 DQAMVARHEEWMAKYDRVYSDAAEKARRFEVFKANMALIESVNAGNHKFWLEANRFADLT 93
Query: 105 NEEYRAMYLGTR--SDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQG 160
++E+RA + G R + A +S+ A+ +YA + D++P SVDWR KGAV P+K+QG
Sbjct: 94 DDEFRATWTGYRPKTAAASSKGRSRTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQG 153
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQN 219
CG CWAFS VA++EG+ K+ TG+L+SLSEQELVDCD ++ GC GG MD AF FI+ N
Sbjct: 154 ECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGN 213
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+ +E YPY ++ C+ + + SI GYEDV DE SL+KAVA+QPVSVA++ G
Sbjct: 214 GGLTTESRYPYTASDGTCNSNEASGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGG 273
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRN 338
F+ Y+ GV +G CG+ LDHG+ AVGYG +G YW+++NSWG+ WGE GY++++R+
Sbjct: 274 DSHFRFYKGGVLSGACGTELDHGIAAVGYGVASDGTKYWVMKNSWGTSWGEAGYIRMERD 333
Query: 339 LLD 341
+ D
Sbjct: 334 IAD 336
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 139/254 (54%), Positives = 186/254 (73%), Gaps = 5/254 (1%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
D+++ ++++W+++HGK + RF+IFKDNL+ IDE N + Y +GLN+FADL++
Sbjct: 2 DKLIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVSNYWLGLNEFADLSH 61
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
E++ YLG + D R + +S+ + + D LP+SVDWR+KGAV +K+QGSCGSC
Sbjct: 62 HEFKKQYLGLKVDFSTR----RESSEEFTYRDVD-LPKSVDWRKKGAVTNIKNQGSCGSC 116
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFSTVAAVEGIN+IVTG L SLSEQEL+DCDR N+GCNGGLMDYAF FI++NGG+ E
Sbjct: 117 WAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKE 176
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
DYPY+ E C+ S+ ++VV+I GY DV +E SL KA+A+QP+SVAIEA GR FQ
Sbjct: 177 DDYPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 236
Query: 286 YESGVFTGECGSAL 299
Y GVF G CG+ L
Sbjct: 237 YSGGVFDGHCGTQL 250
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 156/356 (43%), Positives = 216/356 (60%), Gaps = 20/356 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
L + FI +S A S + + + + +++ V ++ W +H + KR
Sbjct: 8 LALVLFIWASLACLSSSLPTEF-YITGEEFASEERVRELFHLWKERHKRVYKHAEETAKR 66
Query: 73 FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK-------RRLMK 125
F+IFK+NL+++ E NS + +G+NKFAD++NEE++ YL RR M+
Sbjct: 67 FEIFKENLKYVIERNSKGHRHTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRSMQ 126
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
K + A E P S+DWR+KG V +KDQG CGSCWAFS+ A+EGIN IVTG+L
Sbjct: 127 QKKGT------ASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDL 180
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
ISLSEQELVDCD N GC GG MDYAF+++I NGG+DSE DYPY G + C+ ++ + K
Sbjct: 181 ISLSEQELVDCD-TTNYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTK 239
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTG---ECGSALDHG 302
VVSIDGY+DV D L AV +QP+SV ++ FQ Y SG++ G + +DH
Sbjct: 240 VVSIDGYKDVDESDSALLCAAV-NQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHA 298
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
V+ VGYG+E+ DYW+ +NSWG+ WG GY ++RN D G+C I ASYP K
Sbjct: 299 VLIVGYGSEDSEDYWICKNSWGTSWGMEGYFYIKRN-TDLPYGECAINAMASYPTK 353
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 143/308 (46%), Positives = 206/308 (66%), Gaps = 8/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK EKRFQIFK+N++FI+ N+ ++ + + +N+FADL NEE++A
Sbjct: 37 HEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKA 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ + + + + S RY ++ ++P ++DWR++GAV P+KDQG+CGSCWAFS
Sbjct: 97 SLINVQKK-ESGVETATETSFRY--ESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSI 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VAA+EGI++I TG+L+SLSEQELVDC + + GCN G + AF+F+ +NGG+ SE YPY
Sbjct: 154 VAAIEGIHQITTGKLVSLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPY 213
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
C + V I GYE+V E +L KAVA+QPVSV I+AG A Q Y SG+
Sbjct: 214 KANNKTCMVKKETQGVAQIKGYENVPSNSEKALLKAVANQPVSVYIDAG--ALQFYSSGI 271
Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG+CG+A +H +GYG G YWLV+NSWG+ WGE GY++++R+ + G CGI
Sbjct: 272 FTGKCGTAPNHAATVIGYGKARGGAKYWLVKNSWGTKWGEKGYIRMKRD-IRAKEGLCGI 330
Query: 350 AMEASYPV 357
A ASYP
Sbjct: 331 ATNASYPT 338
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 210/318 (66%), Gaps = 10/318 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFAD 102
+D ++ ++ W+ ++G+ +RF+ FK N+ F++ N+ + + +G+N+FAD
Sbjct: 28 SDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LT EE++A G + + + + + + A LP +VDWR KGAV P+K+QG C
Sbjct: 88 LTTEEFKANK-GFKPISAEMVPTTGFKYENLSVSA---LPTAVDWRTKGAVTPIKNQGQC 143
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
G CWAFS VAA+EGI K+ TG LISLSEQELVDCD ++ GC GG MD AF+F+I+NGG
Sbjct: 144 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 203
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E YPY + KC ++A +I G+EDV DE +L KAVA+QPVSVA++A R
Sbjct: 204 LATESSYPYKAVDGKCKGGSKSA--ATIKGHEDVPVNDEAALMKAVANQPVSVAVDASDR 261
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
F Y GV TG CG+ LDHG+ A+GYG E +G YW+++NSWG+ WGE G++++++++
Sbjct: 262 TFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKGFLRMEKDIS 321
Query: 341 DTNTGKCGIAMEASYPVK 358
D G CG+AM+ SYP +
Sbjct: 322 DKQ-GMCGLAMKPSYPTE 338
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 149/333 (44%), Positives = 218/333 (65%), Gaps = 14/333 (4%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
D +I+ Y + D +S R ++ ++++W ++ K + RF+IFKDNL +IDE
Sbjct: 1 DFAIVGYSQD-DLTSIER----LVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDE 55
Query: 86 HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
N N +Y +GLN+FADLT++E++A Y+G+ + + +S + + K + PES+
Sbjct: 56 TNKKNSSYWLGLNEFADLTHDEFKAKYVGSLGEDSTIIEQSD--DEEFPYKHVVDYPESI 113
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DWR+KGAV PVK+Q CGSCWAFSTVA VEGINKIVTG+LISLSEQEL+DCDR+ + GC
Sbjct: 114 DWRQKGAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCK 172
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GG + Q++ N G+ +E++YPY + KC + V I GY+ V +E+SL +
Sbjct: 173 GGYQTTSLQYVADN-GVHTEKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQ 231
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
A+A+QPVSV +E+ GRAFQ Y+ G+F G CG+ +DH V AVGYG +Y L++NSWG
Sbjct: 232 AIANQPVSVVVESKGRAFQFYKGGIFEGPCGTKVDHAVTAVGYGK----NYILIKNSWGP 287
Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
WGE GY++++R + G CG+ + +P K
Sbjct: 288 KWGEKGYIRIKR-ASGKSKGTCGVYSSSYFPTK 319
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 165/380 (43%), Positives = 227/380 (59%), Gaps = 26/380 (6%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA L IS + + +S++ A S D N + + ++ ++ WL +HG
Sbjct: 1 MANPLHLLLISATI-ICLVSAAKAVQHSYEVGDIN--------SGNGLVRLFDRWLGRHG 51
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K +R QIF+ NL++I HN + N ++++GLNKFADLTNEE++ Y G S
Sbjct: 52 KLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFKTRYFGKNSKQ 111
Query: 120 --KRRLMKSKVASQRYACK-------AGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
RR + + A R K + + S+DWR+KGAV VKDQ CGSCWAFST
Sbjct: 112 WRDRRRTELEGAELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFST 171
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
A+EG+N I TG+L+SLSEQELV CD N GC GG MDYAF ++IQNGG+D+E+DY Y
Sbjct: 172 TGAIEGVNFISTGKLVSLSEQELVACD-ATNYGCEGGDMDYAFTWVIQNGGIDTEKDYSY 230
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
G ++ C+ ++ K+VSIDGY DVSP D+ +L A QPVSV I+ FQ Y G+
Sbjct: 231 TGVDSTCNTNKEAKKIVSIDGYTDVSP-DDSALLCAAGSQPVSVGIDGSAIDFQLYTGGI 289
Query: 291 FTGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
+ G+C +DH V+ VGY +NG DYW+V+NSWG+DWG GY + RN + G C
Sbjct: 290 YDGDCSGNPDDIDHAVLVVGYSAKNGKDYWIVKNSWGTDWGLEGYFYILRN-TELPYGVC 348
Query: 348 GIAMEASYPVKNSQNSAKPK 367
I ASYP K +++S + K
Sbjct: 349 AINAMASYPTK-TESSVQSK 367
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 162/350 (46%), Positives = 219/350 (62%), Gaps = 20/350 (5%)
Query: 19 ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
+SSS ++ SI+ D S D+ ++ I+Q W +H K EKRF FK
Sbjct: 15 VSSSLPSEYSIVGND-----FSELPPDESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKR 69
Query: 79 NLRFIDEHNSLNRT--YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ---RY 133
NL++I E T ++VGLNKFADL+NEE++ +YL S K+ + K+++ ++ R
Sbjct: 70 NLKYIIEKTGKETTLRHRVGLNKFADLSNEEFKQLYL---SKVKKPINKTRIDAEDRSRR 126
Query: 134 ACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQEL 193
++ D P S+DWR+KG V VKDQG CGSCW+FST A+EGIN IVT +LISLSEQEL
Sbjct: 127 NLQSCDA-PSSLDWRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQEL 185
Query: 194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYE 253
VDCD N GC GG MDYAF+++I NGG+D+E +YPY G + C+ ++ KVVSIDGY+
Sbjct: 186 VDCD-TTNYGCEGGYMDYAFEWVINNGGIDTEANYPYTGVDGTCNTAKEEIKVVSIDGYK 244
Query: 254 DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF---TGECGSALDHGVVAVGYGT 310
DV D +L A A QP+SV I+ FQ Y G++ + +DH V+ VGYG+
Sbjct: 245 DVDETDS-ALLCAAAQQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGYGS 303
Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
ENG DYW+V+NSWG+ WG GY ++RN D G C I ASYP K +
Sbjct: 304 ENGEDYWIVKNSWGTSWGIEGYFYIKRN-TDLPYGVCAINAMASYPTKEA 352
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 150/319 (47%), Positives = 209/319 (65%), Gaps = 24/319 (7%)
Query: 45 DDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
DD M ++ W+ ++ + +RF++FK N++FI+ N+ NR + +G+N+FAD
Sbjct: 29 DDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFAD 88
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
LTN+E+RA T+++ + KV + RY + D LP ++DWR KGAV P+KDQG
Sbjct: 89 LTNDEFRA----TKTNKGFKPSPVKVPTGFRYENVSVDALPATIDWRTKGAVTPIKDQGQ 144
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
C EGI KI TG+LISLSEQELVDCD + GC GGLMD AFQFII+NG
Sbjct: 145 C------------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFQFIIKNG 192
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E YPY A+ KC +A ++ G+EDV DE +L KAVA+QPVSVA++ G
Sbjct: 193 GLTTESSYPYTAADGKCKSGSNSA--ATVKGFEDVPANDEAALMKAVANQPVSVAVDGGD 250
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y GV TG CG+ LDHG+ A+GYG T +G YWL++NSWG+ WGENGY+++++++
Sbjct: 251 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDI 310
Query: 340 LDTNTGKCGIAMEASYPVK 358
D G CG+AME SYP++
Sbjct: 311 SDKR-GMCGLAMEPSYPIE 328
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 159/360 (44%), Positives = 224/360 (62%), Gaps = 42/360 (11%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ + + I L LF + AA S + N H+ S R +D W+A++G
Sbjct: 1 MASVNQYQYI-CLALLFVL----AAWASQATARNLHEASMYERHED--------WMAQYG 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+ KR++IFKDN+ I+ N +++++YK+ +N+FADLTNEE+ +R+
Sbjct: 48 RVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGT----SRNRF 103
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K + ++ S +Y + +P ++DWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--ENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161
Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TG+LISLSEQELVDCD + GCNG +YPY G + C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCNGA-------------------NYPYAGTDGTCN 202
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ I+GYEDV +E +L+KAV QP++VAI+AGG FQ Y SGVFTG+CG+
Sbjct: 203 RKKAAHPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTE 262
Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++ G CGIAM+ASYP
Sbjct: 263 LDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 321
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 160/360 (44%), Positives = 224/360 (62%), Gaps = 44/360 (12%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ + + I L LF + AA S + N H+ S R +D W+ ++G
Sbjct: 1 MASVNQYQYI-CLALLFVL----AAWASQATARNLHEASMYERHED--------WMVQYG 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+ KR++IFKDN+ I+ N +++++YK+ +N+FADLTNEE+RA +R+
Sbjct: 48 REYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA----SRNRF 103
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K + ++ S +Y + +P +VDWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161
Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+ TG+LISLSEQELVDCD + GC +YPY G + C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCT---------------------NYPYAGTDGTCN 200
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ I+GYEDV +E +L+KAVA QP++VAI+AGG FQ Y SGVFTG+CG+
Sbjct: 201 RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTE 260
Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++ G CGIAM+ASYP
Sbjct: 261 LDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 319
>gi|125592011|gb|EAZ32361.1| hypothetical protein OsJ_16571 [Oryza sativa Japonica Group]
Length = 416
Score = 294 bits (752), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 164/343 (47%), Positives = 215/343 (62%), Gaps = 46/343 (13%)
Query: 27 MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG------MGHNEKRFQIFKDNL 80
MSII + H RT+ + Y WLA+H + G +G +E+RF++F DNL
Sbjct: 1 MSIIRNNAEHGVRGLERTEAQARAAYDLWLARHRRGGGGGSRNGFIGEHERRFRVFWDNL 60
Query: 81 RFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA 137
+F+D HN+ +++G+N+FADLTN E+RA YLGT + R + + Y
Sbjct: 61 KFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGRRV-----GEAYRHDG 115
Query: 138 GDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
+ LP+SVDWR+KGAV PVK+QG CG+ G +EQ L
Sbjct: 116 VEALPDSVDWRDKGAVVAPVKNQGQCGA-----------------GGVREERAEQRL--- 155
Query: 197 DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
+MD AF FI +NGG+D+E+DYPY + KC+ ++R+ KVVSIDG+EDV
Sbjct: 156 --------QRWIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRKVVSIDGFEDVP 207
Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGV 314
DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG+ LDHGVVAVGYGT+ G
Sbjct: 208 ENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVAVGYGTDAATGA 267
Query: 315 DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
YW VRNSWG DWGENGY++++RN+ TGKCGIAM ASYP+
Sbjct: 268 AYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPI 309
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 293 bits (751), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 159/365 (43%), Positives = 228/365 (62%), Gaps = 46/365 (12%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+ L FL F+S+ + S++WR+DDEV+ +Y+ WL KH K + +G
Sbjct: 5 VLILSFLLFVSAITCI-------------STNWRSDDEVIALYEEWLVKHQKLYSSLGEK 51
Query: 70 EKRFQIFKDNLRFIDEHNSLNRT----YKVGLNKFADLTNEEYRAMYLGT---------- 115
KRF+IFKDNLR+ID+ N N+ + +GLN+FADLT +E+ ++YLGT
Sbjct: 52 IKRFEIFKDNLRYIDQQNHYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISS 111
Query: 116 ---RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
D + ++K V ELP+SVDWREKG V P+++QG CGSCW FS VA
Sbjct: 112 NPNHDDVEEDILKEDVV----------ELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVA 161
Query: 173 AVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
++E +N I G +I+LSEQEL+DC+ I+ GC GG + AF ++ +NG + SE+ YPY+
Sbjct: 162 SIETLNGIKKGHMIALSEQELLDCE-TISQGCKGGHYNNAFAYVAKNG-ITSEEKYPYIF 219
Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
+ +C + KVV I GY+ V + L+ AVA Q VSVA++ + FQ Y+ G+F+
Sbjct: 220 RQGQC---YQKEKVVKISGYKRVPRNNGGQLQSAVAQQVVSVAVKCESKDFQFYDRGIFS 276
Query: 293 GECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
G CG LDH V VGYG++ G +YW++RNSWG++WGENGY+++Q+N G CGIAM+
Sbjct: 277 GACGPILDHAVNIVGYGSKGGANYWIMRNSWGTNWGENGYMRIQKN-SKHYEGHCGIAMQ 335
Query: 353 ASYPV 357
SYPV
Sbjct: 336 PSYPV 340
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 149/362 (41%), Positives = 226/362 (62%), Gaps = 25/362 (6%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+ ++F S + L F + +A+ + + H+ ++ W+A+HG
Sbjct: 1 MASENLFHCTSLALLLLFGFWAFSANTRTLEDASMHER-------------HEQWMAQHG 47
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K E R++IF+ N++ I+ N+ N+++K+G+N+FADLT EE++A+ +
Sbjct: 48 KVYKDHHEKELRYKIFQQNVKGIEGFNNAGNKSHKLGVNQFADLTEEEFKAI------NK 101
Query: 120 KRRLMKSKVA-SQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFSTVAAVEGI 177
+ M SK++ + + + ++P ++DWR+KGAV P+K QG CGSCWAF+ VAA EGI
Sbjct: 102 LKGYMWSKISRTSTFKYEHVTKVPATLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGI 161
Query: 178 NKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
K+ TGELISLSEQEL+DCD N GC G++ AF+FI+QN G+ +E YPY +
Sbjct: 162 TKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGT 221
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C+ + V SI GYEDV +E +L AVA+QPVSV +++ F+ Y SGV +G CG
Sbjct: 222 CNAKVESKHVASIKGYEDVPANNETALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCG 281
Query: 297 SALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
+ DH V VGYG +++G YWL++NSWG WGE GY++++R++ G CGIAM+ASY
Sbjct: 282 TTFDHAVTVVGYGVSDDGTKYWLIKNSWGVYWGEQGYIRIKRDVA-AKEGMCGIAMQASY 340
Query: 356 PV 357
P+
Sbjct: 341 PI 342
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 154/341 (45%), Positives = 217/341 (63%), Gaps = 20/341 (5%)
Query: 1 MATASMFLAISTLVFLFFISSS----SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
MAT +IS L+F+ S S+AD SI+ Y + D +S + + ++++W+
Sbjct: 1 MAT---IFSISKLIFVVTCLSLHLGLSSADFSIVGY--SQDDLTSIESS---IRLFESWM 52
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
KH K + RF+ FKDNL +IDE N N +Y +GLN+FADLT++E++ Y+G+
Sbjct: 53 LKHDKVYKTIDEKIYRFETFKDNLMYIDETNKKNNSYWLGLNEFADLTHDEFKEKYVGSI 112
Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
+ + +S + K + PES+DWR+KGAV PVK+Q CGSCWAFSTVA VEG
Sbjct: 113 PEDSMIIEQSD--DVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFSTVATVEG 170
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
INKIVTG LISLSEQEL+DCDR+ + GC GG + ++++ N G+ +E++YPY +
Sbjct: 171 INKIVTGNLISLSEQELLDCDRR-SHGCKGGYQTTSLKYVVDN-GVHTEKEYPYEKKQGN 228
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C + V I+GY+ V DE+SL K ++ QPVSV +E+ GR FQ Y+ GVF G CG
Sbjct: 229 CRAKNKKGLKVYINGYKRVPSNDEISLIKTISIQPVSVLVESKGRPFQFYKGGVFGGPCG 288
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
+ LDH V AVGYG DY L++NSWG WG+ GY+K++R
Sbjct: 289 TKLDHAVTAVGYGK----DYILIKNSWGPKWGDKGYIKIKR 325
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 156/362 (43%), Positives = 223/362 (61%), Gaps = 17/362 (4%)
Query: 1 MATASMFLAISTLVFL----FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
MAT S +IS ++FL S+AD + Y + D +S R ++ ++ +W+
Sbjct: 1 MATMS---SISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIER----LIQLFDSWM 52
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
KH K + RF+IF+DNL +IDE N N +Y +GLN FADL+N+E++ Y+G
Sbjct: 53 LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFV 112
Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
++ L ++ + K P+S+DWR KGAV PVK+QG+CGSCWAFST+A VEG
Sbjct: 113 AEDFTGL--EHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEG 170
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
INKIVTG L+ LSEQELVDCD+ + GC GG + Q++ NG + + + YP + K
Sbjct: 171 INKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANNG-VHTSKVYPCQAKQYK 228
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C + + V I GY+ V E S A+A+QP+S +EAGG+ FQ Y+SGVF G CG
Sbjct: 229 CRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCG 288
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+ LDH V AVGYGT +G +Y +++NSWG +WGE GY++L+R ++ G CG+ + YP
Sbjct: 289 TKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYP 347
Query: 357 VK 358
K
Sbjct: 348 FK 349
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 149/319 (46%), Positives = 209/319 (65%), Gaps = 24/319 (7%)
Query: 45 DDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
DD M ++ W+ ++ + +RF++FK N++FI+ N+ NR + +G+N+FAD
Sbjct: 29 DDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFAD 88
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
LTN+E+RA T+++ + KV++ RY + D LP ++DWR KGAV P+KDQG
Sbjct: 89 LTNDEFRA----TKTNKGFKPSPVKVSTGFRYENVSVDALPATIDWRTKGAVTPIKDQGQ 144
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
C EGI KI TG+LISLSEQELVDCD + GC GGLMD AF+FII+NG
Sbjct: 145 C------------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 192
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E YPY A+ KC +A ++ G+EDV DE +L KAVA+QPVSVA++ G
Sbjct: 193 GLTTESSYPYTAADGKCKSGSNSA--ATVKGFEDVPANDEAALMKAVANQPVSVAVDGGD 250
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y GV TG CG+ LDHG+ A+GYG T +G YWL++NSWG+ WGENGY+++++++
Sbjct: 251 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDI 310
Query: 340 LDTNTGKCGIAMEASYPVK 358
D G CG+AME SYP +
Sbjct: 311 SDKR-GMCGLAMEPSYPTE 328
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 155/355 (43%), Positives = 217/355 (61%), Gaps = 18/355 (5%)
Query: 11 STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
ST++F+ I +SY + S + + ++ W+A+ + +
Sbjct: 3 STIIFILTI---------FLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKR 53
Query: 71 KRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKSKV 128
RF IFK NL F+ N N+ TYKV +N+F+DLT+EE+RA + G +A R+
Sbjct: 54 NRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSS 113
Query: 129 ASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+ G+ + ES+DWR++GAV PVK QG CG CWAFS VAAVEGI KI GEL+
Sbjct: 114 GKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELV 173
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA-- 244
SLSEQ+L+DCDR N GC GG+M AF++II+N G+ +E +YPY ++ C S +
Sbjct: 174 SLSEQQLLDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSS 233
Query: 245 -KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
+ +I GYE V +E +L +AV+ QPVSV IE G AF+HY GVF GECG+ L H V
Sbjct: 234 FRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAV 293
Query: 304 VAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
VGYG +E G YW+V+NSWG WGENGY++++R+ +D G CG+A+ A YP+
Sbjct: 294 TIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRD-VDAPQGMCGLAILAFYPL 347
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 160/366 (43%), Positives = 223/366 (60%), Gaps = 24/366 (6%)
Query: 5 SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
+M +IS L+F LF S S D SI+ Y + D +S+ R ++ ++ +W+ H
Sbjct: 2 AMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQD-DLTSTER----LIQLFNSWMLNHN 56
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF+IFKDNL +IDE N N +Y++GLN+FADL+N+E+ Y+G+ DA
Sbjct: 57 KFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYRLGLNEFADLSNDEFNEKYVGSLIDA- 115
Query: 121 RRLMKSKVASQRYACKAGDE----LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
Q Y + +E LPE+VDWR+KGAV PV+ QGSCGSCWAFS VA VEG
Sbjct: 116 -------TIEQSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEG 168
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
INKI TG+L+ LSEQELVDC+R+ + GC GG YA +++ +NG + YPY +
Sbjct: 169 INKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKNG-IHLRSKYPYKAKQGT 226
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C + +V G V P +E +L A+A QPVSV +E+ GR FQ Y+ G+F G CG
Sbjct: 227 CRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCG 286
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+ +DH V AVGYG G Y L++NSWG+ WGE GY++++R + G CG+ + YP
Sbjct: 287 TKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKR-APGNSPGVCGLYKSSYYP 345
Query: 357 VKNSQN 362
+KN N
Sbjct: 346 IKNRDN 351
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 152/359 (42%), Positives = 217/359 (60%), Gaps = 19/359 (5%)
Query: 5 SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
+M +IS L+F LF S D SI+ Y N D +S+ R ++ ++++W+ KH
Sbjct: 2 AMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQN-DLTSTER----LIQLFESWMLKHN 56
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF+IFKDNL++IDE N N +Y +GLN FAD++N+E++ Y G+ +
Sbjct: 57 KIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAG-- 114
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
++++ + +PE VDWR+KGAV PVK+QGSCGSCWAFS V +EGI KI
Sbjct: 115 -NYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKI 173
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG L SEQEL+DCDR+ + GCNGG A Q + Q G+ YPY G + C
Sbjct: 174 RTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQLVAQY-GIHYRNTYPYEGVQRYCRSR 231
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ DG V P++E +L ++A+QPVSV +EA G+ FQ Y G+F G CG+ +D
Sbjct: 232 EKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVD 291
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
H V AVGYG +Y L++NSWG+ WGENGY++++R ++ G CG+ + YPVKN
Sbjct: 292 HAVAAVGYGP----NYILIKNSWGTGWGENGYIRIKRGTGNS-YGVCGLYTSSFYPVKN 345
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 290 bits (743), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 151/314 (48%), Positives = 201/314 (64%), Gaps = 10/314 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ + AK G++ NG +R +F N++ I+E NS TY +G+N+FADLT EE+
Sbjct: 19 WEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEEFSKT 78
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
Y+G + A++ + + Y G+ LP SVDW +GAV PVK+QG CGSCW+FST
Sbjct: 79 YMGFKKPAQKYGDAAYLGRHVYN---GEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTT 135
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
++EG N+I TG+L+SLSEQ+ VDC N GCNGGLMD AF++ N + +EQ YPY
Sbjct: 136 GSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEANA-LCTEQSYPY 194
Query: 231 LGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
G + C S + + S+ GY+DVS E + AVA QPVS+AIEA FQ Y
Sbjct: 195 KGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVFQLYSG 254
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
GV TG CG++LDHGV+AVGYGT +G DYW V+NSWGS WG +GYV LQR +G+CG
Sbjct: 255 GVLTGACGASLDHGVLAVGYGTLSGTDYWKVKNSWGSTWGMSGYVLLQRG--KGGSGECG 312
Query: 349 IAMEASYP-VKNSQ 361
+ E SYP V SQ
Sbjct: 313 LLSEPSYPQVTGSQ 326
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 144/310 (46%), Positives = 205/310 (66%), Gaps = 8/310 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++GK EKRFQ+FK+N++FI+ N+ ++ + + +N+FADL +EE++A
Sbjct: 35 HEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEFKA 94
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
+ + A R + + S RY + ++P ++DWR++GAV P+KDQG +CGSCWAF+
Sbjct: 95 LLNNVQKKASR-VETATETSFRY--ENVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAFA 151
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
TVA VE +++I TGEL+SLSEQELVDC R + GC GG ++ AF+FI GG+ SE YP
Sbjct: 152 TVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYYP 211
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C + V I GYE V E +L KAVA+QPVSV I+AG AF+ Y SG
Sbjct: 212 YKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSSG 271
Query: 290 VFTGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
+F CG+ LDH V VGYG +G YWLV+NSW + WGE GY++++R+ + G C
Sbjct: 272 IFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRD-IRAKKGLC 330
Query: 348 GIAMEASYPV 357
GIA ASYP+
Sbjct: 331 GIASNASYPI 340
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 148/312 (47%), Positives = 203/312 (65%), Gaps = 16/312 (5%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYL 113
W+A+HG+T +RF++FK N+ ID N+ N+ Y++ N+F DLT+ E+ AMY
Sbjct: 45 WMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYT 104
Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
G + + + A+ R + + D+ P VDWR++GAV VK+Q SCG CWAFSTVAA
Sbjct: 105 GY--NPANTMYAAANATTRLSSE-DDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAA 161
Query: 174 VEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
VEGI++I TGEL+SLSEQ+L+DC N GC GG +D AFQ++ +GG+ +E Y Y GA
Sbjct: 162 VEGIHQITTGELVSLSEQQLLDC--ADNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 219
Query: 234 ENKCD---PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+ C S + +I GY+ V+P DE SL AVA QPVSVAIE G F+HY SGV
Sbjct: 220 QGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGV 279
Query: 291 FTGE-CGSALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
FT + CG+ LDH V VGYG E G YW+++NSWG+ WG+ GY+KL++++ + G
Sbjct: 280 FTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDV--GSQG 337
Query: 346 KCGIAMEASYPV 357
CG+AM SYPV
Sbjct: 338 ACGVAMAPSYPV 349
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 148/312 (47%), Positives = 203/312 (65%), Gaps = 16/312 (5%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYL 113
W+A+HG+T +RF++FK N+ ID N+ N+ Y++ N+F DLT+ E+ AMY
Sbjct: 35 WMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYT 94
Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
G + + + A+ R + + D+ P VDWR++GAV VK+Q SCG CWAFSTVAA
Sbjct: 95 GY--NPANTMYAAANATTRLSSE-DDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAA 151
Query: 174 VEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
VEGI++I TGEL+SLSEQ+L+DC N GC GG +D AFQ++ +GG+ +E Y Y GA
Sbjct: 152 VEGIHQITTGELVSLSEQQLLDC--ADNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 209
Query: 234 ENKCD---PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+ C S + +I GY+ V+P DE SL AVA QPVSVAIE G F+HY SGV
Sbjct: 210 QGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGV 269
Query: 291 FTGE-CGSALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
FT + CG+ LDH V VGYG E G YW+++NSWG+ WG+ GY+KL++++ + G
Sbjct: 270 FTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDV--GSQG 327
Query: 346 KCGIAMEASYPV 357
CG+AM SYPV
Sbjct: 328 ACGVAMAPSYPV 339
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 203/309 (65%), Gaps = 31/309 (10%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
++ W+ ++G+ KR++IFKDN+ I+ N +++++YK+ +N+FADLTNEE+RA
Sbjct: 39 HEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA 98
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+R+ K + ++ S +Y + +P +VDWR+KGAV P+KDQG CGSCWAFS
Sbjct: 99 ----SRNRFKAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSA 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGI ++ TG+LISLSEQELVDCD + GC +YP
Sbjct: 153 VAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCT---------------------NYP 191
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C+ + I+GYEDV +E +L+KAVA QP++VAI+A G FQ Y SG
Sbjct: 192 YAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSG 251
Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
VFTG+CG+ LDHGV AVGYGT ++G+ YWLV+NSW + WGE GY+++QR++ G CG
Sbjct: 252 VFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVT-AKEGLCG 310
Query: 349 IAMEASYPV 357
IAM+ASYP
Sbjct: 311 IAMQASYPT 319
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 203/308 (65%), Gaps = 7/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++G+ EKRFQ+FK+N+ FI+ N+ ++ + + +N+FADL +EE++A
Sbjct: 37 HEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKA 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ + + A + S S RY ++ ++P ++DWR++GAV P+KDQG CGSCWAFS
Sbjct: 97 LLINVQKKASW-VETSTETSFRY--ESVTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSA 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VAA EGI++I TG+L+ LSEQELVDC + + GC GG +D AF+FI + GG+ SE YPY
Sbjct: 154 VAATEGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPY 213
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
G C + V I GYE V +E +L KAVA+QPVSV I+AG AF++Y SG+
Sbjct: 214 KGVNKTCKVKKETHGVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGI 273
Query: 291 FTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
F CG+ +H V VGYG +G YWLV+NSWG++WGE GY++++R+ + G CG
Sbjct: 274 FNARNCGTDPNHAVAVVGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRD-IRAKEGLCG 332
Query: 349 IAMEASYP 356
IA YP
Sbjct: 333 IAKYPYYP 340
>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 137/219 (62%), Positives = 167/219 (76%), Gaps = 3/219 (1%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LP+ VDWR GAV +KDQG CGSCWAFST+AAVEGINKI TG+LISLSEQELVDC R
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 201 NA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
N GC+GG M FQFII NGG+++E +YPY E +C+ + K VSID YE+V +
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
E +L+ AVA QPVSVA+EA G FQHY SG+FTG CG+A+DH V VGYGTE G+DYW+V
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+NSWG+ WGE GY+++QRN+ G+CGIA +ASYPVK
Sbjct: 181 KNSWGTTWGEEGYMRIQRNV--GGVGQCGIAKKASYPVK 217
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 203/308 (65%), Gaps = 7/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++G+ EKRFQ+FK+N+ FI+ N+ ++ + + +N+FADL +EE++A
Sbjct: 37 HEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKA 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ + + A + S S RY ++ ++P ++DWR++GAV P+KDQG CGSCWAFS
Sbjct: 97 LLINVQKKASW-VETSTQTSFRY--ESVTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSA 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VAA EGI++I TG+L+ LSEQELVDC + + GC GG +D AF+FI + GG+ SE YPY
Sbjct: 154 VAATEGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPY 213
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
G C + V I GYE V +E +L KAVA+QPVSV I+AG AF++Y SG+
Sbjct: 214 KGVNKTCKVKKETHGVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGI 273
Query: 291 F-TGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
F CG+ +H V VGYG +G YWLV+NSWG++WGE GY++++R+ + G CG
Sbjct: 274 FNVRNCGTDPNHAVAVVGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRD-IRAKEGLCG 332
Query: 349 IAMEASYP 356
IA YP
Sbjct: 333 IAKYPYYP 340
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 146/311 (46%), Positives = 204/311 (65%), Gaps = 23/311 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+ ++ + +RF++FK N++FI+ N+ NR + +G+N+FADLTN+E+RA
Sbjct: 5 HEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTNDEFRA 64
Query: 111 MYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
T+++ + KV + RY + D LP ++DWR KGAV P+KDQG C
Sbjct: 65 ----TKTNKGFKPSPVKVPTGFRYENISVDALPATIDWRTKGAVTPIKDQGQC------- 113
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
EGI KI TG+LISLSEQELVDCD + GC GGLMD AF+FII+ GG+ +E Y
Sbjct: 114 -----EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTTESSY 168
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY A+ KC + V ++ G+EDV DE SL KAVA+QPVSVA++ G FQ Y
Sbjct: 169 PYTAADGKCKSGSNS--VATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQFYSG 226
Query: 289 GVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
GV TG CG+ LDHG+ A+GYG T +G YWL++NSWG+ WGENGY+++++++ D G C
Sbjct: 227 GVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKR-GMC 285
Query: 348 GIAMEASYPVK 358
G+AME SYP +
Sbjct: 286 GLAMEPSYPTE 296
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 204/312 (65%), Gaps = 19/312 (6%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
+++ W AKHGK+ + +R IF D L +I++HN+ N T+ +GLNKF+DLTN E+R
Sbjct: 1 MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD----ELPESVDWREKGAVNPVKDQGSCGSC 165
A Y+G KS R K D LP S+DWR++GAV P+KDQG CGSC
Sbjct: 61 ANYVGK--------FKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSC 112
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS +A++E + + T EL+SLSEQ+L+DCD ++ GC GG + AF+F+++NGG+ +E
Sbjct: 113 WAFSAIASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTE 171
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
+ YPY G C+ ++ KVV I GY+DV+ +L KAV+ PV+V I + FQ+
Sbjct: 172 EAYPYTGFAGSCNANKN--KVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQN 229
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y SG+ +G+C ++ DH V+ +GYGTE G+ YW+++NSWG+ WGENG++K+++ G
Sbjct: 230 YRSGILSGQCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGENGFMKIKKK---DGEG 286
Query: 346 KCGIAMEASYPV 357
CG+ ++SYP
Sbjct: 287 MCGMNGQSSYPT 298
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 153/326 (46%), Positives = 203/326 (62%), Gaps = 27/326 (8%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYK-----VGLNKFADLTN 105
+++ W+ KH K G +R+ F NL F+ + N+ R VG+N FADL+N
Sbjct: 50 LFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLSN 109
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACK--------AGDELPESVDWREKGAVNPVK 157
EE+R +Y R+++ K A R A + AG + P S+DWR++GAV VK
Sbjct: 110 EEFREVY-------SSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVK 162
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
+QG CGSCWAFS+ A+EGIN I TGELISLSEQELVDCD N GC+GG MDYAF+++I
Sbjct: 163 NQGDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCD-TTNEGCDGGYMDYAFEWVI 221
Query: 218 QNGGMDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
NGG+DSE +YPY G A++ C+ ++ KVVSIDGYEDV+ E +L A QPVSV I
Sbjct: 222 NNGGIDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVAT-SESALLCAAVQQPVSVGI 280
Query: 277 EAGGRAFQHYESGVFTGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYV 333
+ FQ Y G++ G+C +DH V+ VGYG + G DYW+V+NSWG+DWG GY+
Sbjct: 281 DGSSLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGGTDYWIVKNSWGTDWGMQGYI 340
Query: 334 KLQRNLLDTNTGKCGIAMEASYPVKN 359
++RN G C I ASYP K
Sbjct: 341 YIRRN-TGLPYGVCAIDAMASYPTKQ 365
>gi|357437721|ref|XP_003589136.1| Cysteine proteinase [Medicago truncatula]
gi|355478184|gb|AES59387.1| Cysteine proteinase [Medicago truncatula]
Length = 295
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 129/183 (70%), Positives = 153/183 (83%)
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
IVTG+LISLSEQELVDCD N GCNGGLMDYAF+FII NGG+DSE DYPY + +CD
Sbjct: 5 IVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQ 64
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
+R+NAKVV+ID YEDV +DE++L+KAVA+QP++VA+E GGR FQ YE GVFTG CG+AL
Sbjct: 65 NRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTAL 124
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
DHGV AVGYGTENG DYW+VRNSWG WGE GY++L+RNL + GKCGIA+E SYP+KN
Sbjct: 125 DHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKN 184
Query: 360 SQN 362
QN
Sbjct: 185 GQN 187
>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
Length = 221
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 3/224 (1%)
Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
D+LP+S+DWRE GAV PVK+QG CGSCWAFSTVAAVEGIN+IVTG+LISLSEQ+LVDC
Sbjct: 1 DDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-T 59
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
N GC GG M+ AFQFI+ NGG++SE+ YPY G + C+ S NA VVSID YE+V
Sbjct: 60 TANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICN-STVNAPVVSIDSYENVPSH 118
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
+E SL+KAVA+QPVSV ++A GR FQ Y SG+FTG C + +H + VGYGTEN D+W+
Sbjct: 119 NEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWI 178
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
V+NSWG +WGE+GY++ +RN+ + + GKCGI ASYPVK N
Sbjct: 179 VKNSWGKNWGESGYIRAERNIENPD-GKCGITRFASYPVKKGTN 221
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 203/312 (65%), Gaps = 12/312 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
++ W+++ + + RF+IFK NL+F++ N + N+TY + +N+F+DLT+EE++A
Sbjct: 35 HEQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNKTYTLDVNEFSDLTDEEFKA 94
Query: 111 MYLGTRSDAKRRLMKS----KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
Y G M + + S RY + E ES+DWRE+GAV VK Q CG CW
Sbjct: 95 RYTGLVVPEGMTRMSTTDSHETVSFRY--ENVGETGESMDWREEGAVTSVKHQQQCGCCW 152
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
AFS VAAVEG+ KI GEL+SLSEQ+L+DC + N GC+GG+M AF +I++N G+ +E
Sbjct: 153 AFSAVAAVEGMTKIAKGELVSLSEQQLLDCSTE-NDGCDGGIMWKAFDYIVENQGITAED 211
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
+YPY GA+ C+ + A +S GYE V DE +L KAV+ QPVSVAIE G F HY
Sbjct: 212 NYPYQGAQQTCESNHVAAATIS--GYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHY 269
Query: 287 ESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
G+F GECG+ L+H V VGYG +E G+ YWL++NSWG WGE+GY+++ R+ +D G
Sbjct: 270 SGGIFNGECGTHLNHAVTIVGYGVSEEGIKYWLLKNSWGESWGEDGYMRIMRD-VDAPQG 328
Query: 346 KCGIAMEASYPV 357
CG+A A YPV
Sbjct: 329 MCGLASLAYYPV 340
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/327 (44%), Positives = 204/327 (62%), Gaps = 17/327 (5%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
D ++ ++ W+ +HG+ G ++RF++++ N+ ++ NS++ YK+ NKFADLTN
Sbjct: 25 DLMLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTN 84
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACK---AGDELPESVDWREKGAV-NPVKDQGS 161
EE+RA LG R + S S A + D LP+SVDWR KGAV N K
Sbjct: 85 EEFRAKMLGFRPHVTIPQI-SNTCSADIAMPGESSDDILPKSVDWRNKGAVINRWKICVD 143
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
GSCWAFS VAA+EGIN+I GEL+SLSEQELVDCD + GC GG M +AF+F++ N G
Sbjct: 144 AGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHG 202
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E YPY A C ++ N V+I GY +V+P E L +A A QPVSVA++ G
Sbjct: 203 LTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSF 262
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVD----------YWLVRNSWGSDWGEN 330
FQ Y SGV+TG C + ++HGV VGYG +E D YW+V+NSWG++WG+
Sbjct: 263 MFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDA 322
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
GY+ +QR++ +G CGIA+ SYPV
Sbjct: 323 GYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 210/318 (66%), Gaps = 14/318 (4%)
Query: 44 TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
+DD M ++ W+A++G+ +RF++FK N+ FI+ N+ N + +G+N+FAD
Sbjct: 28 SDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVNQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
LTN+E+R+ T+++ ++V + R D LP ++DWR KG V P+KDQG
Sbjct: 88 LTNDEFRS----TKTNKGFIPSTTRVPTGFRNENVNIDALPATMDWRTKGVVTPIKDQGQ 143
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLS-EQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA+EGI K+ TG+LIS S + L+ ++ GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSLLTV---MSMGCEGGLMDDAFKFIIKNG 200
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E +YPY ++K + V SI GYEDV +E +L KAVA+QPVSVA++ G
Sbjct: 201 GLTTESNYPYAAVDDKFKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 258
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y+ GV TG CG+ LDHG+VA+GYG +G YWL++NSWG WGENG++++++++
Sbjct: 259 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDI 318
Query: 340 LDTNTGKCGIAMEASYPV 357
D G CG+AME SYP
Sbjct: 319 SDKR-GMCGLAMEPSYPT 335
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 151/310 (48%), Positives = 197/310 (63%), Gaps = 12/310 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+ +W A HG + +G R I++ NL FI++HNS +YK+ +NKFADLT E+ A
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
YLG R DA KS AS LP+SVDWR G V P+KDQG CGSCW+FST
Sbjct: 82 YLGLRFDATNA-TKSFAASTYLPRMV--SLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTT 138
Query: 172 AAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
+VEG + TG+L+SLSEQ LVDC + NAGCNGGLMD AFQ+II N G+D+E YPY
Sbjct: 139 GSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPY 198
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
+ C + N ++ Y+D++ E L+ AVA P+SVAI+A +FQ Y SG
Sbjct: 199 TAQDGTCQFNSANVG-ATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSG 257
Query: 290 VFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
V+ C S+ LDHGV+AVGYGT DYWLV+NSWG+ WG++GY+ + RN + +C
Sbjct: 258 VYNEPACSSSQLDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQSGYIWMTRN----SNNQC 313
Query: 348 GIAMEASYPV 357
GIA ASYP+
Sbjct: 314 GIATAASYPL 323
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 156/359 (43%), Positives = 219/359 (61%), Gaps = 16/359 (4%)
Query: 5 SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
+M +IS L+F LF S S D SI+ Y + D +S+ R ++ ++ +W+ H
Sbjct: 2 AMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQD-DLTSTER----LIQLFNSWMLNHN 56
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF+IFKDNL +IDE N N +Y +GLN+FADL+N+E+ Y+G+ DA
Sbjct: 57 KFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKYVGSLIDA- 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+ + + + LPE+VDWR+KGAV PV+ QGSCGSCWAFS VA VEGINKI
Sbjct: 116 ---TIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKI 172
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG+L+ LSEQELVDC+R+ + GC GG YA +++ +NG + YPY + C
Sbjct: 173 RTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKNG-IHLRSKYPYKAKQGTCRAK 230
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ +V G V P +E +L A+A QPVSV +E+ GR FQ Y+ G+F G CG+ +D
Sbjct: 231 QVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD 290
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
H V AVGYG G Y L++NSWG+ WGE GY++++R + G CG+ + YP KN
Sbjct: 291 HAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKR-APGNSPGVCGLYKSSYYPTKN 348
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/336 (44%), Positives = 214/336 (63%), Gaps = 28/336 (8%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
+D+ + +Y+ W + + ++ G + RF +FK+N+++I+E N +++ YK+ LN+F DL
Sbjct: 36 SDETLWDLYERWRSVY-TSARSFGEKQNRFHVFKENVKYINEVNKMDKPYKLRLNQFGDL 94
Query: 104 TNEEYRAMYL------GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
T E+ Y GTR+++ + ++ E+P S+DWR KGAV PVK
Sbjct: 95 TPSEFARTYANSKIIEGTRNESGGFMYENV------------EVPRSIDWRVKGAVTPVK 142
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
+QG CG CWAFS AAVEGIN+I TG+LISLSEQ+L+DCD + N+GC GG M AF++I
Sbjct: 143 NQGRCGGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ-NSGCRGGTMGRAFEYIK 201
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ SE +YPY C + VSIDGY ++ ++ L K +A QPVSVA++
Sbjct: 202 QRGGITSEANYPYKAQAGMCKNNLIQRPTVSIDGYYNIRRSEDAVL-KILAHQPVSVAVD 260
Query: 278 AGGRA---FQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYV 333
A + + Y GVFTG CG+ L+HGV AVGYGT N G DYW+++NSWG WGE GY+
Sbjct: 261 ATTWSSLDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYM 320
Query: 334 KLQRNLLDTNTGKCGIAMEASYPVKN-SQNSAKPKP 368
++ R + + G CGIAM+AS+P+K S AK +P
Sbjct: 321 RMLRGV--SPYGLCGIAMQASFPIKRVSAGKAKFEP 354
>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 136/219 (62%), Positives = 166/219 (75%), Gaps = 3/219 (1%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LP+ VDWR GAV +KDQG CGS WAFST+AAVEGINKI TG+LISLSEQELVDC R
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 201 NA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
N GC+GG M FQFII NGG+++E +YPY E +C+ + K VSID YE+V +
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
E +L+ AVA QPVSVA+EA G FQHY SG+FTG CG+A+DH V VGYGTE G+DYW+V
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+NSWG+ WGE GY+++QRN+ G+CGIA +ASYPVK
Sbjct: 181 KNSWGTTWGEEGYMRIQRNV--GGVGQCGIAKKASYPVK 217
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 151/311 (48%), Positives = 201/311 (64%), Gaps = 12/311 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ WL ++ + E RF I++ NL +I+ NS +Y + NKFADLTNEE+ +
Sbjct: 5 FERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQEXSYNLTDNKFADLTNEEFVSP 64
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
YLG R L + + ++LPES DWR++GAV+ +KDQG+CGSCWAFS V
Sbjct: 65 YLGF---GTRFLPHTGFMYHEH-----EDLPESKDWRKEGAVSDIKDQGNCGSCWAFSAV 116
Query: 172 AAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
AAVEGINKI +G+L+SLSEQE DCD N GC GGLMD AF FI +NGG+ + +DYPY
Sbjct: 117 AAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPY 176
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA--DQPVSVAIEAGGRAFQHYES 288
G + C+ + +I G+ V DE LK A +Q SVAI+AGG AFQ Y
Sbjct: 177 EGVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLYLK 236
Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
GVF+G CG L+HGV VGYG YW+V+NSWG+DWGE+GY++++R+ D G CG
Sbjct: 237 GVFSGICGKQLNHGVTIVGYGKGTSDKYWIVKNSWGADWGESGYIRMKRDAFD-KAGTCG 295
Query: 349 IAMEASYPVKN 359
IAM+ASYP+K+
Sbjct: 296 IAMQASYPLKD 306
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 203/310 (65%), Gaps = 10/310 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLTNEEYRA 110
++ W+A++ + +RF++FKDN F++ N+ + + +G+N+FADLT EE++A
Sbjct: 5 HERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEEFKA 64
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G + + + + + + A LP +VDWR KGAV P+K+QG CG CWAFS
Sbjct: 65 NK-GFKPISAEEVPTTGFKYENLSVSA---LPTAVDWRTKGAVTPIKNQGQCGCCWAFSA 120
Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
+AA+EGI K+ TG L+SLSEQE VDCD ++ GC GG MD AF+F+I+NGG+ +E YP
Sbjct: 121 IAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATESSYP 180
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + KC ++A +I G+EDV P +E +L K VA QPVSVA++A R F Y G
Sbjct: 181 YKVVDGKCKGGSKSA--ATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFMLYSGG 238
Query: 290 VFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
V TG CG+ LDHG+ A+GYG E + YW+++NSWG+ WGE G++++++++ D G C
Sbjct: 239 VMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDKR-GMCD 297
Query: 349 IAMEASYPVK 358
+AM+ SYP +
Sbjct: 298 LAMKPSYPTE 307
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 144/334 (43%), Positives = 206/334 (61%), Gaps = 24/334 (7%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---LNRTYKVGLNKFAD 102
D + ++ W A+H +T R +++ N+R+I+ N TY++G + D
Sbjct: 36 DPMAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTD 95
Query: 103 LTNEEYRAMYLG---TRSDAKRRLMKSKVAS--------------QRYACKAGDELPESV 145
LT++E+ AMY SD L + + + Q Y ++ P SV
Sbjct: 96 LTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNESAGA-PASV 154
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DWRE+GAV VK+QG CGSCWAFSTVA +EGI++I TG+L SLSEQELVDCD K++ GCN
Sbjct: 155 DWRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCD-KLDHGCN 213
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GG+ A Q+I NGG+ S+ DYPY ++ CD + + SI G++ V+ E+SL
Sbjct: 214 GGVSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTN 273
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSW 323
AVA QPV+V+IEAGG FQHY +GV+ G CG+ L+HGV VGYG + G YW+V+NSW
Sbjct: 274 AVAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDEVTGESYWIVKNSW 333
Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
G WG+NGY+++++ ++D G CGIA+ S+P+
Sbjct: 334 GEKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 204/312 (65%), Gaps = 19/312 (6%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
+++ W AKHGK+ + +R IF D L +I++HN+L N T+ +GLNKF+DLTN E+R
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD----ELPESVDWREKGAVNPVKDQGSCGSC 165
A Y+G K R K D LP S+DWR++GAV P+KDQG CGSC
Sbjct: 61 ANYVGK--------FKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSC 112
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS +A++E + + T EL+SLSEQ+L+DCD ++ GC GG + AF+F+++NGG+ +E
Sbjct: 113 WAFSAIASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTE 171
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
+ YPY G C+ ++ KVV I GY+DV+ +L KAV+ PV+V I + FQ+
Sbjct: 172 EAYPYTGFAGSCNANKN--KVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQN 229
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y SG+ +G C ++ DH V+ +GYGTE G+ YW+++NSWG+ WGE+G++++++ + G
Sbjct: 230 YRSGILSGHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKK---EDGEG 286
Query: 346 KCGIAMEASYPV 357
CG+ ++SYP
Sbjct: 287 MCGMNGQSSYPT 298
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 151/360 (41%), Positives = 203/360 (56%), Gaps = 50/360 (13%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLT 104
D ++ ++ W+ +HG+ G ++R ++++ N+ ++ NS+ N Y++ NKFADLT
Sbjct: 26 DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLT 85
Query: 105 NEEYRAMYLG-TRSDAKRRLMKSKVASQRYAC-------KAGDELPESVDWREKGAVNPV 156
NEE+RA LG R R AC + DELP+SVDWREKGAV PV
Sbjct: 86 NEEFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSDELPKSVDWREKGAVAPV 145
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
K+QG CGSCWAFS VAA+EGIN+I G+L+SLSEQELVDCD K GC GG M +AF+F+
Sbjct: 146 KNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFEFV 204
Query: 217 IQNGGMDSEQDYPYLGA----------------------------ENKCDPSRRNAKVVS 248
+ N G+ +E++YPY G C + VS
Sbjct: 205 MNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAVS 264
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
I GY +V+ E L +A A QPVSVA++AG +Q Y GVFTG C + L+HGV VGY
Sbjct: 265 ISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVGY 324
Query: 309 GTEN-----------GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
G G YW+V+NSWG +WG+ GY+ +QR +G CGIA+ SYPV
Sbjct: 325 GETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQRE-ASVASGLCGIALLPSYPV 383
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 283 bits (725), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 149/358 (41%), Positives = 220/358 (61%), Gaps = 25/358 (6%)
Query: 5 SMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
S+ + ++ LF S S A +++ H+ SS + ++ W+A+ +
Sbjct: 3 SIMVLVTIFTILFTTFSISQATSRTVTF---HEPSS--------LEKHEQWMARFSRVYR 51
Query: 65 GMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRL 123
+ R +FK NL+FI+ N N++YK+G+N+FAD TNEE+ A++ G +
Sbjct: 52 DELEKQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKG------ 105
Query: 124 MKSKVASQRYACKA---GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+ SKV + + ++ D + S DWR +GAV PVK QG CG CWAFS VAAVEG+ KI
Sbjct: 106 LSSKVVDETISSRSWNISDMVGVSKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKI 165
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
G L+SLSEQ+L+DCDR+ + GC+GG+M AF +IIQN G+ SE DY Y G++ +C S
Sbjct: 166 AGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRCRSS 225
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
R A +S G++ V +E +L +AV+ QPVSV+++A G F HY GV+ G CG++ +
Sbjct: 226 ARPAARIS--GFQTVPSNNEQALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSN 283
Query: 301 HGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
H V VGYGT ++G YWL +NSWG WGE GY++++R++ G CG+A A YPV
Sbjct: 284 HAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQ-GMCGVAQYAFYPV 340
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 203/312 (65%), Gaps = 19/312 (6%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
+++ W AKHGK+ + +R IF D L +I++HN+L N T+ +GLNKF+DLTN E+R
Sbjct: 1 MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD----ELPESVDWREKGAVNPVKDQGSCGSC 165
A Y+G K R K D LP S+DWR++GAV P+KDQG CGSC
Sbjct: 61 ANYVGK--------FKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSC 112
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS +A++E + + T EL+SLSEQ+L+DCD ++ GC GG + AF+F+++NGG+ +E
Sbjct: 113 WAFSAIASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTE 171
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
+ YPY G C+ ++ KVV I GY+DV+ +L KAV+ PV+V I + FQ+
Sbjct: 172 EAYPYTGFAGSCNANKN--KVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQN 229
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y SG+ +G C ++ DH V+ +GYGTE G+ YW+++NSWG+ WGE+G++++++ G
Sbjct: 230 YRSGILSGHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKK---DGEG 286
Query: 346 KCGIAMEASYPV 357
CG+ ++SYP
Sbjct: 287 MCGMNGQSSYPT 298
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/354 (41%), Positives = 217/354 (61%), Gaps = 17/354 (4%)
Query: 11 STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
ST++F+ I +SY + S + + ++ W+A+ + +
Sbjct: 3 STIIFILTI---------FLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKR 53
Query: 71 KRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
RF IFK NL F+ N + N TYK+ +N+F+DLT+EE+RA + G + + + +
Sbjct: 54 NRFNIFKKNLEFVQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSS 113
Query: 130 SQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ + G+ + ES+DWR++GAV PVK QG CG CWAFS VAAVEGI KI GEL+S
Sbjct: 114 DKTVPFRYGNVSDTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVS 173
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA--- 244
LSEQ+L+DCD N GC+GG+M AF++II+N G+ +E +YPY ++ C S +
Sbjct: 174 LSEQQLLDCDTDYNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSF 233
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
+ +I GYE V +E +L +AV+ QPVSV IE G F+HY G+F GECG+ L H V
Sbjct: 234 RAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVT 293
Query: 305 AVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
VGYG +E G YW+V+NSWG WGE+G+++++R+ +D G CG+AM A YP+
Sbjct: 294 IVGYGMSEEGTKYWVVKNSWGETWGEDGFMRIKRD-VDAPQGMCGLAMLAFYPL 346
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 145/317 (45%), Positives = 198/317 (62%), Gaps = 14/317 (4%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNE 106
++ W KHGKT + E R +IF DN F+ +HN+ T+ VGLN ADLT +
Sbjct: 67 LFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLADLTKD 126
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
E++ M LG + + ++ YA PE +DW GAV PVK+Q CGSCW
Sbjct: 127 EFKKM-LGYNAALRASRAPVDASTWEYADVT---PPEEIDWVASGAVTPVKNQKQCGSCW 182
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
AFST AVEG+N I TG+LISLSE+EL+ C N GCNGGLMD F++I+ N G+D+E
Sbjct: 183 AFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGIDTED 242
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
+ Y+ E KC RR+ + V+IDG++DV DE SL KAV+ QPVSVAIEA ++FQ Y
Sbjct: 243 GWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQSFQLY 302
Query: 287 ESGVFTG-ECGSALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
GV++ +CG+ LDHGV+ VGYG + +W ++NSWG WGE+GY+++ +
Sbjct: 303 AGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAKGGSG 362
Query: 342 TNTGKCGIAMEASYPVK 358
G+CG+AM+ SYP K
Sbjct: 363 VE-GQCGVAMQPSYPTK 378
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 202/309 (65%), Gaps = 7/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++G+ EKRFQ+FK+N+ FI+ N+ ++ + + +N+FADL +EE++A
Sbjct: 37 HEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKA 96
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ + + A + S S RY ++ ++P ++D R++GAV P+KDQG CGSCWAFS
Sbjct: 97 LLINVQKKASW-VETSTETSFRY--ESVTKIPATIDRRKRGAVTPIKDQGRCGSCWAFSA 153
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
VAA EGI++I TG+L+ LSEQELVDC + + GC GG +D AF+FI + GG+ SE YPY
Sbjct: 154 VAATEGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPY 213
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
G C + V I GYE V +E +L KAVA+QPVSV I+AG AF++Y SG+
Sbjct: 214 KGVNKTCKVKKETHGVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGI 273
Query: 291 FTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
F CG+ +H V VGYG + YWLV+NSWG++WGE GY++++R+ + G CG
Sbjct: 274 FNARNCGTDPNHAVAVVGYGKALDDSKYWLVKNSWGTEWGERGYIRIKRD-IRAKEGLCG 332
Query: 349 IAMEASYPV 357
IA YP+
Sbjct: 333 IAKYPYYPI 341
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 160/362 (44%), Positives = 225/362 (62%), Gaps = 34/362 (9%)
Query: 2 ATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHG 60
A A M LA+ T+V A D+S +S+ +E M + +Q W+A+HG
Sbjct: 15 AAALMILAVMTMVV-------EARDLS----------TSTGGYGEEAMKVRHQQWMAEHG 57
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+T +RFQ+FK N F+D N+ ++Y++ +N+FAD+TN+E+ AMY G +
Sbjct: 58 RTYKDEAEKARRFQVFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGLKPVP 117
Query: 120 KRRLMKSKVASQRYA-CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
K+A +Y D ++VDWR+KGAV +K+QG CG CWAF+ VAAVE I+
Sbjct: 118 AG---PKKMAGFKYENLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIH 174
Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+I TG L+SLSEQ+++DCD N GCNGG +D AFQ+II NGG+ +E YPY A+ C
Sbjct: 175 QITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQ 234
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGS 297
S + A V+I Y+DV DE +L AVA+QPV+VAI+A FQ Y SGV T + CG+
Sbjct: 235 SSVQPA--VTISSYQDVPSGDEAALAAAVANQPVAVAIDAHNN-FQFYSSGVLTADTCGT 291
Query: 298 -ALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
+L+H V AVGY T E+G YWL++N WG +WGE GY++++R T CG+A +ASY
Sbjct: 292 PSLNHAVTAVGYSTAEDGTPYWLLKNQWGQNWGEGGYLRVER-----GTNACGVAQQASY 346
Query: 356 PV 357
PV
Sbjct: 347 PV 348
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 154/353 (43%), Positives = 208/353 (58%), Gaps = 51/353 (14%)
Query: 10 ISTLVFLFFISSSSAA-DMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGMG 67
++ L F FF ++ AA D+S DD M ++ W+A++ +
Sbjct: 9 LAILGFAFFCGAALAARDLS----------------DDSAMVARHEQWMAQYSRVYKDAS 52
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
+RF KFADLTN E+R+ + T K MK
Sbjct: 53 EKARRF-------------------------KFADLTNHEFRS--VKTNKGFKSSNMKI- 84
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ RY + D LP ++DWR KG V P+KDQG CG C AFS VAA EGI KI TG+L+S
Sbjct: 85 LTGFRYENVSADALPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVS 144
Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
L++QELVDCD + GC GGLMD AF+FII+NGG+ +E YPY A+ KC+ +A
Sbjct: 145 LADQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSNSA-- 202
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
+I GYEDV DE +L KA+A+QPVSVA++ G F+ Y GV TG CG+ LDHG+ A+
Sbjct: 203 ATIKGYEDVPANDEAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAI 262
Query: 307 GYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GYG T +G YWL++NSWG+ WGENGY+++++++ D G CG+AME SYP K
Sbjct: 263 GYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKR-GMCGLAMEPSYPTK 314
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 142/291 (48%), Positives = 192/291 (65%), Gaps = 12/291 (4%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
++ S+ +AIS L + A D SI+ Y H D+++ ++++W+++H
Sbjct: 8 LSKFSLLVAISASALL---CCAFARDFSIVGYTPEH-----LTNTDKLLELFESWMSEHS 59
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF++F++NL ID+ N+ +Y +GLN+FADLT+EE++ YLG AK
Sbjct: 60 KAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGL---AK 116
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+ + + S + + +LP+SVDWR+KGAV PVKDQG CGSCWAFSTVAAVEGIN+I
Sbjct: 117 PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQI 176
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG L SLSEQEL+DCD N+GCNGGLMDYAFQ+II GG+ E DYPYL E C
Sbjct: 177 TTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQ 236
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
+ + + V+I GYEDV D+ SL KA+A QPVSVAIEA GR FQ Y+ GV+
Sbjct: 237 KEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK-GVY 286
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 156/320 (48%), Positives = 206/320 (64%), Gaps = 11/320 (3%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKF 100
T V +Q W+ ++G++ EKRF+IF +NL +I++ N+ N++YK+ LN+F
Sbjct: 29 ETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMENLEYIEKFNNAPGNKSYKLDLNQF 88
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
+DLTNEE+ A + G D + SK AS + + P S+DWRE+GAV VK+QG
Sbjct: 89 SDLTNEEFIASHTGLMIDPSKPSSSSKRASPASLDLS--DTPTSLDWREQGAVTDVKNQG 146
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQN 219
+CGSCWAFS VAAVEGI KI G LISLSEQ+LVDC + N GC GG MD AF +I +N
Sbjct: 147 NCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCASNEQNQGCGGGFMDNAFSYITEN 206
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
G+ SE DY Y G C + I GYEDV P E L AV+ QPVSVAI A
Sbjct: 207 -GIASENDYQYRGGAGTCQNNEMITPAARISGYEDV-PAGEDQLLLAVSQQPVSVAI-AV 263
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQR 337
G++F Y+ G+++G CGS+L+HGV VGYGT E+G YWL++NSWG WGENGY++L R
Sbjct: 264 GQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTKYWLIKNSWGESWGENGYMRLLR 323
Query: 338 NLLDTNTGKCGIAMEASYPV 357
+ G CGIA++AS+P
Sbjct: 324 E-SGQSEGHCGIAVKASHPT 342
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 202/312 (64%), Gaps = 19/312 (6%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
+++ W AKH K+ + +R +F D L +I++HN+ N T+ +GLNKF+DLTN E+R
Sbjct: 1 MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD----ELPESVDWREKGAVNPVKDQGSCGSC 165
A Y+G K R K D LP S+DWR++GAV P+KDQG CGSC
Sbjct: 61 ANYVGK--------FKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSC 112
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS +A++E + + T EL+SLSEQ+L+DCD ++ GC GG D AF+F+++NGG+ +E
Sbjct: 113 WAFSAIASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPDDAFKFVVENGGVTTE 171
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
+ YPY G C+ ++ KVV I GY+DV+ +L KAV+ PV+V I + FQ+
Sbjct: 172 EAYPYTGFAGSCNTNKN--KVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQN 229
Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
Y SG+ +G+C ++ DH V+ +GYGTE G+ YW+++NSWG+ WGE+G++K+++ G
Sbjct: 230 YRSGILSGQCCNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIKKK---DGEG 286
Query: 346 KCGIAMEASYPV 357
CG+ ++SYP
Sbjct: 287 MCGMNGQSSYPT 298
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 280 bits (717), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 156/315 (49%), Positives = 204/315 (64%), Gaps = 22/315 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
++ W+A+HG+T + E+RFQIFK+NL +I+ N + N+TYK+GLNKF+DL+ EE+
Sbjct: 40 HEQWMARHGRTYHDNAEKERRFQIFKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEEFVT 99
Query: 111 MYLG----TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
Y G T +K S Y DE+PES+DWRE G V VK+QG CG CW
Sbjct: 100 TYNGYEMPTTLPTANTTVKPTFFSNYYN---QDEVPESIDWRENGVVTSVKNQGECGCCW 156
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
AFS VAAVEGI G SLS Q+L+DC N+GC GG M AF++I+QN G+ S+
Sbjct: 157 AFSAVAAVEGI----AGNGASLSAQQLLDCVGD-NSGCGGGTMIKAFEYIVQNQGIVSDT 211
Query: 227 DYPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEA-GGRAFQ 284
DYPY + C R + V + I GYE V +E +LK+AVA QP+SVAI+A G F+
Sbjct: 212 DYPYEQTQEMC---RSGSNVAARITGYESVIQSEE-ALKRAVAKQPISVAIDASSGPNFK 267
Query: 285 HYESGVFTGE-CGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Y SGVF+ E CG+ L H V VGYG TE+G YWLV+NSWG +WGE+GY++LQR+ +
Sbjct: 268 SYISGVFSAEDCGTHLTHAVTLVGYGTTEDGTKYWLVKNSWGEEWGESGYMRLQRD-VGA 326
Query: 343 NTGKCGIAMEASYPV 357
G CGIAM+ASYP
Sbjct: 327 MEGPCGIAMQASYPT 341
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 207/324 (63%), Gaps = 13/324 (4%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---LNRTYKVGLNKFA 101
++ V+ +++ W KHGK EK+FQ F+DNLR++ E N + + VGLNKFA
Sbjct: 44 EERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFA 103
Query: 102 DLTNEEYRAMYLGT--RSDAKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVK 157
D++NEE+R +Y+ + +KR ++ + + A KA P S+DWR+ G V VK
Sbjct: 104 DMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVK 163
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFS+ A+EGIN + G+LISLSEQELVDCD N GC GG MDYAF++++
Sbjct: 164 DQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCD-STNDGCEGGYMDYAFEWVM 222
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
NGG+D+E DYPY G + C+ ++ K VSIDGYEDV+ +E +L AV QP+SV I+
Sbjct: 223 SNGGIDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAE-EESALFCAVLKQPISVGID 281
Query: 278 AGGRAFQHYESGVF---TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
G FQ Y G++ + +DH V+ VGYG E+G +YW+++NSWG+DWG GY
Sbjct: 282 GGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYGAESGEEYWIIKNSWGTDWGMKGYAY 341
Query: 335 LQRNLLDTNTGKCGIAMEASYPVK 358
++RN + G C I ASYP K
Sbjct: 342 IKRN-TSKDYGVCAINAMASYPTK 364
>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
Length = 430
Score = 280 bits (716), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 150/335 (44%), Positives = 205/335 (61%), Gaps = 28/335 (8%)
Query: 52 YQTWLAKHGKTSNGMGHNE--KRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTN 105
++ W ++HG E KR F +N ++ EHN+L ++ VGLN A T
Sbjct: 98 FERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSLAATTR 157
Query: 106 EEYRAMY-----LGTRSDAKRRLMKSKVASQRYACK---AGDELPESVDWREKGAVNPVK 157
EEYRA+ L + DA+ S ++Y A + PE++DW E GAV P K
Sbjct: 158 EEYRALLGYKPELRSSGDAEMLEATSTDKVEQYKASWEYASVDPPEAIDWVELGAVTPPK 217
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
+QG CGSCWAFST AVEGI KI TG L+SLSEQE+V C ++ N GCNGGLMDYAF++I+
Sbjct: 218 NQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQ-NMGCNGGLMDYAFRWIV 276
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+NGG+DSE YPY C+ + V +IDG++DV P DE L+KAV+ QPVS+AIE
Sbjct: 277 KNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELEKAVSQQPVSIAIE 336
Query: 278 AGGRAFQHYESGVF-TGECGSALDHGVVAVGYGTENG-----------VDYWLVRNSWGS 325
A ++FQ Y+ GV+ + ECGS +DHGV+ VGYG ++ +W V+NSWG
Sbjct: 337 ADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRHFWKVKNSWGG 396
Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
WGE G++++ R + D TG+CGI SYP K++
Sbjct: 397 TWGEGGFIRMARRISD-ETGQCGITTAPSYPTKSA 430
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 161/329 (48%), Positives = 207/329 (62%), Gaps = 17/329 (5%)
Query: 46 DEVMTI---YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLN 98
D V TI + WLA HGK KR IF DN F+ HN + +++ + LN
Sbjct: 61 DPVATIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLN 120
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
ADLT EE++ M LG + KR S A PE++DW +GAV PVK+
Sbjct: 121 HLADLTREEFKHM-LGYDASKKRVESSSPPVDAANWEYADVTPPETMDWVSRGAVTPVKN 179
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI--NAGCNGGLMDYAFQFI 216
QG CGSCWAFSTV AVEG+ + TG+LISLSEQELV C KI N GC GGLMD F++I
Sbjct: 180 QGQCGSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSC-AKIGGNNGCKGGLMDNGFEWI 238
Query: 217 IQNGGMDSEQDYPYLGAENKCDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
++N G+D E+D+ YL + +C+ +R AK SIDG++DV DE +LKKAV+ QPV+VA
Sbjct: 239 VENRGVDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVA 298
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG----TENGVDYWLVRNSWGSDWGENG 331
IEA R FQ Y GVF GECG+ LDHGV+ VGYG + YW V+NSWG+ WGE G
Sbjct: 299 IEADHREFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEG 358
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
Y+++ R + G+CG+AM+ASYP K+S
Sbjct: 359 YIRIARGGMGP-AGQCGVAMQASYPTKSS 386
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 148/321 (46%), Positives = 203/321 (63%), Gaps = 10/321 (3%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGL 97
SS ++ + T ++ W+A H + ++R QIFK+NL FI++HN+ + Y + L
Sbjct: 25 SSRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQIFKENLEFIEKHNNEGKKRYNLSL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR--YACKAGDELPESVDWREKGAVNP 155
N FADLTNEE+ A + G +L K+ + GD + S+DWR++GAVN
Sbjct: 85 NSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGFHKMSVGD-IEASLDWRKRGAVND 143
Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
+K+QG CGSCWAFS VAAVEGIN+I G+L+SLSEQ LVDC N GC+G ++ AF +
Sbjct: 144 IKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLVDCAS--NDGCHGQYVEKAFDY 201
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
I ++ G+ +E++YPY+ C + A + I GY+ V+P +E L AVA QPVSV
Sbjct: 202 I-RDYGLANEEEYPYVETVGTCSGNSNPA--IQIRGYQSVTPQNEEQLLTAVASQPVSVL 258
Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
+EA G+ FQ Y GVF+GECG+ L+H V VGYG E YWL+RNSWG WGE GY+KL
Sbjct: 259 LEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEAEGKYWLIRNSWGKSWGEGGYMKL 318
Query: 336 QRNLLDTNTGKCGIAMEASYP 356
R+ + G CGI M+ASYP
Sbjct: 319 MRDTGNPQ-GLCGINMQASYP 338
>gi|413956349|gb|AFW88998.1| hypothetical protein ZEAMMB73_678859 [Zea mays]
Length = 1140
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 128/177 (72%), Positives = 149/177 (84%)
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD N GCNGGLMDYAF+FII NGG+
Sbjct: 780 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGI 839
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
D+E+DYPY G + +CD +R+NAKVV+ID YEDV DE SL+KAVA+QPVSVAIEA G
Sbjct: 840 DTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTT 899
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+++NSWGS WGE+G +R L
Sbjct: 900 FQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIMKNSWGSSWGESGRAPTRRTL 956
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 209/339 (61%), Gaps = 26/339 (7%)
Query: 44 TDDEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGL 97
TDD I +Q W A + K+ + + +RF ++ N+ +I+ N+ TY++G
Sbjct: 42 TDDNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTYELGE 101
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSK-------VASQRYACKAGDELP-------- 142
+ DLTN+E+ AMY S A+ + + + ++ A +LP
Sbjct: 102 TAYTDLTNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYVNLSTA 161
Query: 143 --ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
SVDWR GAV PVK+QG CGSCWAFSTVA VEGI +I TG+L+SLSEQELVDCD +
Sbjct: 162 APASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD-TL 220
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
+AGC+GG+ A ++I NGG+ +E+DYPY G + C+ ++ SI G V+ E
Sbjct: 221 DAGCDGGISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLRRVATRSE 280
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT--ENGVDYWL 318
SL AVA QPV+V+IEAGG FQHY+ GV+ G CG++L+HGV VGYG E+G YW+
Sbjct: 281 ASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQEEEDGDKYWI 340
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
++NSWG+ WG+ GY+K+++++ G CGIA+ S+P+
Sbjct: 341 IKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 202/312 (64%), Gaps = 12/312 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
++ W+++ + + RF+IF +NL+F++ N + N+TY + +N+F+DLT+EE++A
Sbjct: 35 HEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEEFKA 94
Query: 111 MYLG-TRSDAKRRLMKS---KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
Y G + R+ + + S RY + E ES+DW ++GAV VK Q CG CW
Sbjct: 95 RYTGLVVPEGMTRISTTDSHETVSFRY--ENVGETGESMDWIQEGAVTSVKHQQQCGCCW 152
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
AFS VAAVEG+ KI GEL+SLSEQ+L+DC + N GC GG+M AF +I +N G+ +E
Sbjct: 153 AFSAVAAVEGMTKIANGELVSLSEQQLLDCSTE-NNGCGGGIMWKAFDYIKENQGITTED 211
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
+YPY GA+ C+ + A +S GYE V DE +L KAV+ QPVSVAIE G F HY
Sbjct: 212 NYPYQGAQQTCESNHLAAATIS--GYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHY 269
Query: 287 ESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
G+F GECG+ L H V VGYG +E G+ YWL++NSWG WGENGY+++ R+ +D+ G
Sbjct: 270 SGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRD-VDSPQG 328
Query: 346 KCGIAMEASYPV 357
CG+A A YPV
Sbjct: 329 MCGLASLAYYPV 340
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 203/318 (63%), Gaps = 20/318 (6%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFAD 102
+D ++ ++ W+ ++G+ +RFQ+FKDN+ F++ N+ N + +G+N+FAD
Sbjct: 28 SDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAFVESFNTNKNNKFWLGVNQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LT EE++A K K +Y + LP +VDWR KGAV P+K+QG C
Sbjct: 88 LTTEEFKA-----NKGFKPTAEKVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQC 142
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
AA+EGI K+ TG LISLSEQELVDCD ++ GC GG MD AF+F+I+NGG
Sbjct: 143 ---------AAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 193
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E +YPY + KC ++A +I G+EDV +E +L KAVA+QPVSVA++A R
Sbjct: 194 LATESNYPYKAVDGKCKGGSKSA--ATIKGHEDVPVNNEAALMKAVANQPVSVAVDASDR 251
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
F Y GV TG CG+ LDHG+ A+GYG E +G YW+++NSWG+ WGE G++++++++
Sbjct: 252 TFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNSWGTTWGEKGFLRMEKDIT 311
Query: 341 DTNTGKCGIAMEASYPVK 358
D G CG+AM+ SYP +
Sbjct: 312 DKR-GMCGLAMKPSYPTE 328
>gi|46576360|sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray
Crystal Structure Of A Plant Cysteine Protease Ervatamin
B: Insight Into The Structural Basis Of Its Stability
And Substrate Specificity
Length = 215
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 131/218 (60%), Positives = 164/218 (75%), Gaps = 4/218 (1%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LP VDWR KGAVN +K+Q CGSCWAFS VAAVE INKI TG+LISLSEQELVDCD
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
+ GCNGG M+ AFQ+II NGG+D++Q+YPY + C P R +VVSI+G++ V+ +E
Sbjct: 60 SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYR--LRVVSINGFQRVTRNNE 117
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
+L+ AVA QPVSV +EA G FQHY SG+FTG CG+A +HGVV VGYGT++G +YW+VR
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVR 177
Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NSWG +WG GY+ ++RN+ + G CGIA SYP K
Sbjct: 178 NSWGQNWGNQGYIWMERNVASS-AGLCGIAQLPSYPTK 214
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 160/384 (41%), Positives = 223/384 (58%), Gaps = 32/384 (8%)
Query: 6 MFLAISTLVFL---FFISSSSAADMSIISYDNNHDHSSSW--RTDDEVMTI--------Y 52
FL ++TL L F ++++A + + N R DD+ + +
Sbjct: 13 FFLLLTTLAILSLSFLPTATTAIRLEPENTINEKTDEVELVLRNDDDKRVLRESKIEDAF 72
Query: 53 QTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEY 108
WL K+ K KR +IF +N F+ EHN+ ++ V +NKFA T EEY
Sbjct: 73 DAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVEMNKFAAHTREEY 132
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACK-AGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
R M LG + +R+ + A + G E PES+DW ++G + K+QGSCGSCWA
Sbjct: 133 RKM-LGFKKSLRRKKDSGEAAKDVSLWEYEGVEAPESIDWVDEGVITTPKNQGSCGSCWA 191
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS + AVEGIN I TG+L+SLSEQELV C R+ N GCNGGLMD AF++I++NGG+DSE+
Sbjct: 192 FSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFEWIVENGGVDSEK 251
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
Y Y + + C + + SIDG+ DV DE +LKKAV+ QPVSVAIEA R+FQ Y
Sbjct: 252 QYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVSVAIEADQRSFQLY 311
Query: 287 ESGVFTGE-CGSALDHGVVAVGYGTENGVD----------YWLVRNSWGSDWGENGYVKL 335
GV+ E CG+ LDHGV+ VGYG ++ YW ++NSW WGE GY+++
Sbjct: 312 GGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNSWSEQWGEGGYIRI 371
Query: 336 QRNLLDTNTGKCGIAMEASYPVKN 359
R+ +++ +G CG+A ASYP K
Sbjct: 372 ARD-VESPSGMCGVAEMASYPEKT 394
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 195/312 (62%), Gaps = 18/312 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
+ W+ H + + KR + + N +I EHN N V L N+F+ ++ EE++
Sbjct: 29 FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFK 88
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD-----ELPESVDWREKGAVNPVKDQGSCGS 164
G +M QR A + + ++P+SVDW++KG V PVK+QG CGS
Sbjct: 89 FKMTG-------YVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGS 141
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CWAFST AVEG + +G+L+SLSEQELVDCD + GCNGGLMD+AF +I NGG+ S
Sbjct: 142 CWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICS 201
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E DY Y C R KVV I G++DV+P DE +LK AVA QPVSVAIEA +AFQ
Sbjct: 202 EDDYEYKAKAQVC---RDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 258
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y+SGVF CG+ LDHGV+AVGYG+ENG +W V+NSWGS WGE GY++L R +
Sbjct: 259 FYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLARE-ENGPA 317
Query: 345 GKCGIAMEASYP 356
G+CGIA SYP
Sbjct: 318 GQCGIASVPSYP 329
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 157/366 (42%), Positives = 218/366 (59%), Gaps = 33/366 (9%)
Query: 3 TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGK 61
T F A++ + + A D+S +S+ +E M + +Q W+A+HG+
Sbjct: 10 TVITFTAVALTILAVTTMMAEARDLS---------STSTGGYGEEAMKVRHQQWMAEHGR 60
Query: 62 TSNGMGHNEKRFQIFKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
T RFQ+FK N F+D N+ ++Y++ LN+FAD+TN+E+ AMY G R
Sbjct: 61 TYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGLRPV 120
Query: 119 AKRRLMKSKVASQRYA---CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
K+A +Y D+ ++VDWR+KGAV +K+QG CG CWAF+ VAAVE
Sbjct: 121 PAG---AKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVE 177
Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
GI++I TG L+SLSEQ+++DCD N GCNGG +D AFQ+I+ NGG+ +E YPY A+
Sbjct: 178 GIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQA 237
Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
C + V +I GY+DV DE +L AVA+QPVSVAI+A FQ Y GV T
Sbjct: 238 MCQSVQ---PVAAISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAAS 292
Query: 296 GSA---LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
S L+H V AVGYGT E+G YWL++N WG +WGE GY++L+R CG+A
Sbjct: 293 CSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLER-----GANACGVAQ 347
Query: 352 EASYPV 357
+ASYPV
Sbjct: 348 QASYPV 353
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 144/231 (62%), Positives = 173/231 (74%), Gaps = 3/231 (1%)
Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
++P SVDWR+KGAV VKDQG CGSCWAFST+AAVEGIN I T L SLSEQ+LVDCD K
Sbjct: 60 DVPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCDTK 119
Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
NAGCNGGLMDYAFQ+I ++GG+ +E YPY A +++ + VV+IDGYEDV D
Sbjct: 120 SNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYK-ARQASSCNKKPSAVVTIDGYEDVPAND 178
Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWL 318
E +LKKAVA QPV+VAIEA G FQ Y GVF G+CG+ LDHGV AVGYGT +G YW+
Sbjct: 179 ETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWI 238
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPH 369
V+NSWG +WGE GY++++R++ D G CGIAMEASYPVK S N H
Sbjct: 239 VKNSWGPEWGEKGYIRMKRDVEDKE-GLCGIAMEASYPVKTSTNPKHAGAH 288
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 195/312 (62%), Gaps = 18/312 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
+ W+ H + + KR + + N +I EHN N V L N+F+ ++ EE++
Sbjct: 29 FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFK 88
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD-----ELPESVDWREKGAVNPVKDQGSCGS 164
G +M QR A + + ++P+SVDW++KG V PVK+QG CGS
Sbjct: 89 FKMTG-------YVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGS 141
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CWAFST AVEG + +G+L+SLSEQELVDCD + GCNGGLMD+AF +I NGG+ S
Sbjct: 142 CWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICS 201
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E DY Y C R KVV I G++DV+P DE +LK AVA QPVSVAIEA +AFQ
Sbjct: 202 EDDYEYKAKAQVC---RDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 258
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y+SGVF CG+ LDHGV+AVGYG+ENG +W V+NSWGS WGE GY++L R +
Sbjct: 259 FYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLARE-ENGPA 317
Query: 345 GKCGIAMEASYP 356
G+CGIA SYP
Sbjct: 318 GQCGIASVPSYP 329
>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
Length = 501
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 159/376 (42%), Positives = 229/376 (60%), Gaps = 33/376 (8%)
Query: 1 MATASMFLAISTLVFLFFISSSSA-----ADMSIISYDNNHDHSSSWRTDDEVMTIYQTW 55
MAT++ + I L+FL ++S S + ++ SI+ N SS+ +V ++ W
Sbjct: 1 MATSNSMITI--LIFLTYVSYSISTKTLPSEFSILEGQENDILSSA-----KVSDLFGKW 53
Query: 56 LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMY 112
HGKT R + FK +++F+ E NS ++ + VGLNKFADL+NEE++ MY
Sbjct: 54 KELHGKTYQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNEEFKEMY 113
Query: 113 L----GTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ G+RS + K +K ++ C A P S+DWR+KG V P+KDQG CGSCWA
Sbjct: 114 MSKVKGSRSNELKMGGVKRNMSVSSRTCDA----PTSLDWRDKGVVTPMKDQGQCGSCWA 169
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
FS ++E N I TG+LI LSEQELVDCD + GC+GG MD A+++II+NGG+DSE D
Sbjct: 170 FSVSGSIESANAIATGDLIRLSEQELVDCDT-YDYGCDGGNMDTAYRWIIKNGGLDSEDD 228
Query: 228 YPYL---GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
YPY G + KCD ++ VVS+D Y +V +E ++ AVA PV++ I FQ
Sbjct: 229 YPYTSSNGRDGKCDKTKSAKSVVSLDSYVEVES-NEDAVLCAVATTPVTIGIVGSAYDFQ 287
Query: 285 HYESGVFTGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Y GV+ G+C S +DH V+ VGYG+++G DYW+V+NSWG+ WG GY+ ++RN D
Sbjct: 288 LYTGGVYNGQCSSKPYDIDHAVLIVGYGSQDGKDYWIVKNSWGTYWGLEGYILMERN-TD 346
Query: 342 TNTGKCGIAMEASYPV 357
G CG+ +E YP+
Sbjct: 347 IKNGVCGMYLEPVYPI 362
>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
Length = 363
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 143/334 (42%), Positives = 205/334 (61%), Gaps = 15/334 (4%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
D SI+ Y N D +S+ R ++ ++++W+ KH K + RF+IFKDNL++IDE
Sbjct: 45 DFSIVGYSQN-DLTSTER----LIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDE 99
Query: 86 HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
N N +Y +GLN FAD++N+E++ Y G+ + ++++ + +PE V
Sbjct: 100 TNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAG---NYTTTELSYEEVLNDGDVNIPEYV 156
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DWR+KGAV PVK+QGSCGS WAFS V+ +E I KI TG L SEQEL+DCDR+ + GCN
Sbjct: 157 DWRQKGAVTPVKNQGSCGSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRR-SYGCN 215
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GG A Q + Q G+ YPY G + C + DG V P++E +L
Sbjct: 216 GGYPWSALQLVAQY-GIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLY 274
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
++A+QPVSV +EA G+ FQ Y G+F G CG+ +DH V AVGYG +Y L+RNSWG+
Sbjct: 275 SIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGP----NYILIRNSWGT 330
Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
WGENGY++++R ++ G CG+ + YPVKN
Sbjct: 331 GWGENGYIRIKRGTGNS-YGVCGLYTSSFYPVKN 363
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 153/359 (42%), Positives = 220/359 (61%), Gaps = 16/359 (4%)
Query: 5 SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
++ + S L+F LF S S D SI+ Y + D +S+ R ++ ++ +W+ KH
Sbjct: 2 AIICSFSKLLFVAICLFGHMSLSYCDFSIVGYSQD-DLTSTER----LIQLFNSWMLKHN 56
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF+IFKDNL++IDE N + Y +GLN+F+DL+N+E++ Y+G+ +
Sbjct: 57 KNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYWLGLNEFSDLSNDEFKEKYVGSLPED- 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
++ + + + +LPESVDWR KGAV PVK QG C SCWAFSTVA VEGINKI
Sbjct: 116 ---YTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKI 172
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG L+ LSEQELVDCD++ + GCN G + Q++ QNG + YPY+ + C +
Sbjct: 173 KTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQNG-IHLRAKYPYIAKQQTCRAN 230
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ V +G V +E SL A+A QPVSV +E+ GR FQ+Y+ G+F G CG+ +D
Sbjct: 231 QVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVD 290
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
H V AVGYG G Y L++NSWG WGENGY++++R + G CG+ + YP+KN
Sbjct: 291 HAVTAVGYGKSGGKGYILIKNSWGPGWGENGYIRIRR-ASGNSPGVCGVYRSSYYPIKN 348
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 147/341 (43%), Positives = 206/341 (60%), Gaps = 21/341 (6%)
Query: 37 DHSSSWRTDDEVMT-IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR---- 91
D S TDD M +Q W A + K+ + +RF++ N+ +I+ N+
Sbjct: 34 DMERSMSTDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAAGL 93
Query: 92 TYKVGLNKFADLTNEEYRAMYLG---TRSDAKRRLMKSKVASQRYACKAGDELP------ 142
TY++G + DLTN+E+ AMY + A ++ ++ A +LP
Sbjct: 94 TYELGETAYTDLTNQEFMAMYTAPAPAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLS 153
Query: 143 ----ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
SVDWR GAV PVK+QG CGSCWAFSTVA VEGI +I TG+L+SLSEQELVDCD
Sbjct: 154 TSAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD- 212
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
++ GC+GG+ A ++I NGG+ +E DYPY G + C+ ++ + VSI G V+
Sbjct: 213 TLDDGCDGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGLRRVATR 272
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDY 316
E SL AVA QPV+V+IEAGG FQHY+ GV+ G CG+ L+HGV VGYG E G Y
Sbjct: 273 SEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAGGDRY 332
Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
W+V+NSWG WG++GY+++++++ G CGIA+ SYP+
Sbjct: 333 WIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 138/233 (59%), Positives = 167/233 (71%), Gaps = 8/233 (3%)
Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
+LP SVDWR+KGAV VKDQG CGSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD
Sbjct: 3 DLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTA 62
Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVS 256
N GC GGLMD AF++I NGG+ +E YPY A C+ +R + VV IDG++DV
Sbjct: 63 DNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVP 122
Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVD 315
E L +AVA+QPVSVA+EA G+AF Y GVFTGECG+ LDHGV VGYG E+G
Sbjct: 123 ANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKA 182
Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
YW V+NSWG WGE GY++++++ + G CGIAMEASYPVK +KPKP
Sbjct: 183 YWTVKNSWGPSWGEQGYIRVEKD-SGASGGLCGIAMEASYPVK---TYSKPKP 231
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 217/350 (62%), Gaps = 34/350 (9%)
Query: 16 LFFISSSS-AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
L F+S ++ SI+++D N + ++++V+ ++Q W +H K R +
Sbjct: 19 LTFLSCYGIPSEYSILAFDLN-----KFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLE 73
Query: 75 IFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FK NL++I E N++ + + +GLN+FAD++NEE++ ++ SKV S
Sbjct: 74 NFKRNLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFI------------SKVES- 120
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
D+ P S+DWR+KG V VKDQG+CGSCW+FS+ A+EG+N IVTG+LISLSEQ
Sbjct: 121 ------CDDAPYSLDWRKKGVVTGVKDQGNCGSCWSFSSTGAIEGVNAIVTGDLISLSEQ 174
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
ELVDCD N GC GG MDYAF+++I NGG+D+E DYPY+G C+ ++ KVV+IDG
Sbjct: 175 ELVDCD-TTNDGCEGGYMDYAFEWVINNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDG 233
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA---LDHGVVAVGY 308
Y DV+ D +L A QP+SV I+ FQ Y G++ G+C S +DH V+ VGY
Sbjct: 234 YTDVTQSDS-ALFCATVKQPISVGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGY 292
Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
G++ DYW+V+NSWG+ WG G++ ++RN + G C I AS+P K
Sbjct: 293 GSDGNQDYWIVKNSWGTSWGIEGFIYIRRN-TNLKYGVCAINYMASFPTK 341
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 150/350 (42%), Positives = 208/350 (59%), Gaps = 19/350 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVFLF A S S D D +M ++ W+A++G+ +R
Sbjct: 7 LVFLFLFLCVMWASPSAASRD---------EPSDPMMKRFEEWMAEYGRVYKDNDEKMRR 57
Query: 73 FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N+ I+ N+ N +Y +G+NKF D+TN E+ A Y G S R L K
Sbjct: 58 FQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGIS---RPLNIEKEPVV 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ + +S+DWR+ GAV VKDQ CGSCWAFS +A VEGI KIVTG L+SLSEQ
Sbjct: 115 SFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQ 174
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
E++DC ++ GC+GG +D A+ FII N G+ SE DYPY + C + I G
Sbjct: 175 EVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDC-AANSWPNSAYITG 231
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
Y V DE S+K AV +QP++ AI+A G FQ+Y GVF+G CG++L+H + +GYG +
Sbjct: 232 YSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQD 291
Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
+G YW+V+NSWGS WGE GY+++ R + +++G CGIAM+ YP S
Sbjct: 292 SSGTQYWIVKNSWGSSWGERGYIRMARGV--SSSGLCGIAMDPLYPTLQS 339
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 157/366 (42%), Positives = 218/366 (59%), Gaps = 33/366 (9%)
Query: 3 TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGK 61
T F A++ + + A D+S +S+ +E M + +Q W+A+HG+
Sbjct: 10 TVIAFTAVALTILAVKTMMAEARDLS---------STSTGGYGEEAMKVRHQQWMAEHGR 60
Query: 62 TSNGMGHNEKRFQIFKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
T RFQ+FK N F+D N+ ++Y++ LN+FAD+TN+E+ AMY G R
Sbjct: 61 TYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGLRPV 120
Query: 119 AKRRLMKSKVASQRYA---CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
K+A +Y D+ ++VDWR+KGAV +K+QG CG CWAF+ VAAVE
Sbjct: 121 PAG---AKKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVE 177
Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
GI++I TG L+SLSEQ+++DCD + N GCNGG +D AFQ+I NGG+ +E YPY A+
Sbjct: 178 GIHQITTGNLVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQA 237
Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
C + V +I GY+DV DE +L AVA+QPVSVAI+A FQ Y GV T
Sbjct: 238 MCQSVQ---PVAAISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAAS 292
Query: 296 GSA---LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
S L+H V AVGYGT E+G YWL++N WG +WGE GY++L+R CG+A
Sbjct: 293 CSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLER-----GANACGVAQ 347
Query: 352 EASYPV 357
+ASYPV
Sbjct: 348 QASYPV 353
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 142/332 (42%), Positives = 204/332 (61%), Gaps = 20/332 (6%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKF 100
D ++ +Q W A + K+ + +RF+++ N+ +I+ N+ TY++G +
Sbjct: 43 DSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAAGLTYELGETAY 102
Query: 101 ADLTNEEYRAMYLG---TRSDAKRRLMKSKVASQRYACKAGDELP----------ESVDW 147
DLTN+E+ AMY + A ++ ++ A +LP SVDW
Sbjct: 103 TDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSASAPASVDW 162
Query: 148 REKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGG 207
R GAV PVK+QG CGSCWAFSTVA VEGI +I TG+L+SLSEQELVDCD ++ GC+GG
Sbjct: 163 RASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD-TLDDGCDGG 221
Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
+ A ++I NGG+ +E DYPY G + C+ ++ + VSI G V+ E SL AV
Sbjct: 222 ISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLANAV 281
Query: 268 ADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGS 325
A QPV+V+IEAGG FQHY+ GV+ G CG+ L+HGV VGYG E G YW+V+NSWG
Sbjct: 282 AGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAGDRYWIVKNSWGQ 341
Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG++GY+++++++ G CGIA+ SYP+
Sbjct: 342 GWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 205/315 (65%), Gaps = 21/315 (6%)
Query: 52 YQT----WLAKHGKTSNGMGHNE--KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
YQT W+ KH + + H E R+Q FK+N+ FI + NS +GL KFADLTN
Sbjct: 29 YQTSFIGWMRKHDRAYS---HEEFTDRYQAFKENMDFIHKWNSQESDTVLGLTKFADLTN 85
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
EEY+ YLG + + K+ L ++ + + P+S+DWREKGAV+ VKDQG CGSC
Sbjct: 86 EEYKKHYLGIKVNVKKNLNAAQKGLKFFKFTG----PDSIDWREKGAVSQVKDQGQCGSC 141
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
W+FST AVEG ++I +G ++SLSEQ LVDC + N GC GGLM AF++II NGG+ +
Sbjct: 142 WSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIAT 201
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E YPY A+ +C + ++ +I GY+++ +E SL A+A QPVSVAI+A +FQ
Sbjct: 202 ESSYPYTAAQGRCKFT-KSMNGANIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQ 260
Query: 285 HYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Y SGV+ C S ALDHGV+AVGYGT G DY++++NSWG WG++GY+ + RN +
Sbjct: 261 LYSSGVYDEPACSSEALDHGVLAVGYGTLEGKDYYIIKNSWGPTWGQDGYIFMSRNAQN- 319
Query: 343 NTGKCGIAMEASYPV 357
+CG+A ASYP+
Sbjct: 320 ---QCGVATMASYPI 331
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 144/307 (46%), Positives = 197/307 (64%), Gaps = 8/307 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFADLTNEEYR 109
+ W++ HG T + +R + + N +I EHN+ N K+G N F+ ++ +E++
Sbjct: 28 FSAWMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSHMSFDEFK 87
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
G ++ ++AS+ + E+P +VDW +KG V PVK+QG CGSCWAFS
Sbjct: 88 FKMTGLV--LPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFS 145
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
T AVEG + +G+L+SLSEQELVDCD + GCNGGLMD+AFQ+I +GG+ SE DY
Sbjct: 146 TTGAVEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYE 205
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y C R+ VV + G++DV+P DE +LK AVA QPVSVAIEA +AFQ Y+SG
Sbjct: 206 YKAKAQVC---RKCDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSG 262
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
VF CG+ LDHGV+AVGYG +NG +W V+NSWG+ WGE GY++L R + G+CGI
Sbjct: 263 VFNLTCGTRLDHGVLAVGYGNDNGQKFWKVKNSWGASWGEQGYIRLARE-ENGPAGQCGI 321
Query: 350 AMEASYP 356
A SYP
Sbjct: 322 ASVPSYP 328
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 152/313 (48%), Positives = 199/313 (63%), Gaps = 5/313 (1%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNE 106
+++ ++ W+A+HG+T +R +IF+ N FID N + ++++ N+FADLT+E
Sbjct: 43 MVSRHEKWMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDE 102
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
E+RA G R RY + + +SVDWR GAV VKDQG CG CW
Sbjct: 103 EFRAARTGFRPRPAPAAAAGSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCW 162
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSE 225
AFS VAAVEG+NKI TG L+SLSEQELVDCD + GC GGLMD AFQFI + GG+ SE
Sbjct: 163 AFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASE 222
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
YPY G + C S A+ SI G+EDV +E +L AVA+QPVSVAI AF+
Sbjct: 223 SGYPYQGDDGSCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRF 282
Query: 286 YESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
Y+SGV GECG+ L+H + AVGYGT +G YWL++NSWG+ WGE GYV+++R +
Sbjct: 283 YDSGVLGGECGTDLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGV--RGE 340
Query: 345 GKCGIAMEASYPV 357
G CG+A SYPV
Sbjct: 341 GVCGLAKLPSYPV 353
>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
Length = 260
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 139/275 (50%), Positives = 183/275 (66%), Gaps = 23/275 (8%)
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
LNKFAD+TN E+R++Y ++ + R + + + + +P S+DWR+ GAV V
Sbjct: 2 LNKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNGPFMYENVEGVPSSIDWRKIGAVTGV 61
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
KDQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD ++N GCNGGLM+YAF+FI
Sbjct: 62 KDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTEVNQGCNGGLMEYAFEFI 121
Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
QN G+ +E +YPY + C+ + N VSIDG+E+V +E +L KA A+QP+SVAI
Sbjct: 122 KQN-GITTETNYPYAAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPISVAI 180
Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
+AGG FQ Y GVFTG CG+ L+HGV NSWGS+WGE GY+++Q
Sbjct: 181 DAGGSDFQFYSEGVFTGHCGTELNHGV-----------------NSWGSEWGEQGYIRMQ 223
Query: 337 RNLLDTNTGKCGIAMEASYPV----KNSQNSAKPK 367
R + G CGIAMEASYP+ KN S+ PK
Sbjct: 224 R-AISHKQGLCGIAMEASYPIKKSSKNPTKSSLPK 257
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 149/309 (48%), Positives = 197/309 (63%), Gaps = 8/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A+HG+ +R ++F+ N ID N+ ++++ N+FADLT EE+RA
Sbjct: 38 HEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVEEFRA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G R R + RY + + +SVDWR GAV VKDQG+CG CWAFS
Sbjct: 98 ARTGLR---PRPAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGACGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAAVEG+NKI TG L+SLSEQELVDCD ++ GC+GGLMD AFQF+ + GG+ SE YP
Sbjct: 155 VAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G + C S A+ SI G+EDV +E +L AVA+QPVSVAI AF+ Y+SG
Sbjct: 215 YQGRDGPCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDMAFRFYDSG 274
Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
V G CG+ L+H + AVGYGT N G YWL++NSWG+ WGE GYV+++R + G CG
Sbjct: 275 VLGGACGTDLNHAITAVGYGTANDGTRYWLMKNSWGASWGEGGYVRIRRGV--RGEGVCG 332
Query: 349 IAMEASYPV 357
+A SYPV
Sbjct: 333 LAKLPSYPV 341
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 144/359 (40%), Positives = 219/359 (61%), Gaps = 21/359 (5%)
Query: 4 ASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS 63
AS+ + ++ L+ LF S A + + + ++ ++ W+A+ +
Sbjct: 2 ASIMVLVTVLIILFTGFRISQATSRTVIF-----------REQSMVDKHEQWMARFSREY 50
Query: 64 NGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAK-- 120
R +FK NL+FI+ N N++YK+G+N+FAD TNEE+ A++ G + +
Sbjct: 51 RDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVS 110
Query: 121 -RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+++ ++SQ + D + ES DWR +GAV PVK QG CG CWAFS VAAVEG+ K
Sbjct: 111 PSKVVAKTISSQTW--NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAK 168
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
I G L+SLSEQ+L+DCDR+ + GC+GG+M AF +++QN G+ SE DY Y G++ C
Sbjct: 169 IAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRS 228
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
+ R A +S G++ V +E +L +AV+ QPVSV+++A G F HY GV+ G CG++
Sbjct: 229 NARPAARIS--GFQTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSS 286
Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+H V VGYGT ++G YWL +NSWG WGE GY++++R++ G CG+A A YPV
Sbjct: 287 NHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQ-GMCGVAQYAFYPV 344
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/229 (57%), Positives = 165/229 (72%), Gaps = 5/229 (2%)
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
RY + D LP ++DWR KGAV P+KDQG CG CWAFS VAA EGI KI TG+L+SL+EQ
Sbjct: 8 RYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQ 67
Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
ELVDCD + GC GGLMD AF+FII+NGG+ +E YPY A+ KC +A +I
Sbjct: 68 ELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNSA--ATIK 125
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG- 309
GYEDV DE +L KAVA+QPVSVA++ G FQ Y GV TG CG+ LDHG+ A+GYG
Sbjct: 126 GYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 185
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
T +G YWL++NSWG+ WGENGY+++++++ D G CG+AME SYP K
Sbjct: 186 TSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKR-GMCGLAMEPSYPTK 233
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 144/310 (46%), Positives = 201/310 (64%), Gaps = 9/310 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A++ + E+RF +FKDN+ FI ++ N K+G+N AD+T+EE+RA
Sbjct: 35 HEQWMARYSRNYKDDAEEERRFXMFKDNVDFIQTFDTAGNMPNKLGVNALADMTHEEFRA 94
Query: 111 MYLGTRSDAKRRL-MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
G L ++S+ S R+ + +P ++DWR+K V +K+Q CG CWAFS
Sbjct: 95 S--GNTFKIPPNLGLRSETTSFRH--QNVTRIPSTMDWRKKRTVTHIKNQLQCGGCWAFS 150
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
VAA+EGI K+ T + ISLSEQELVDCD N GC GG MD AF+FIIQN G++SE Y
Sbjct: 151 AVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDAFKFIIQNRGLNSEARY 210
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
Y G E C+ + +++ I+ YE++ F E +L K VA QP+SVAI+AGG AFQ YE
Sbjct: 211 LYKGVEGHCNKKKESSRAARINDYENMPEFSEKALLKVVAHQPISVAIDAGGSAFQFYEI 270
Query: 289 GVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
G+ T E G+ LD+GV GYG + +G +WLV+NSWG+DWGENGY +++R + T TG C
Sbjct: 271 GIITXESGNDLDYGVTTDGYGRSADGKKHWLVKNSWGTDWGENGYTRMERGVKAT-TGLC 329
Query: 348 GIAMEASYPV 357
G M+ASYP
Sbjct: 330 GFTMQASYPT 339
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 147/291 (50%), Positives = 186/291 (63%), Gaps = 19/291 (6%)
Query: 72 RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
RF FK N+ I HN+L N +Y +GLN+FADL+ EE++ Y G + +R +S
Sbjct: 61 RFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYFGYK-HVEREFARSNNLH 119
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE--LISL 188
Q + P S+DWR AV P+KDQG CGSCWAFS ++EG ++ G+ L SL
Sbjct: 120 QEV-----EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGA-WVLQGKHTLTSL 173
Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
SEQ+LVDC NAGCNGGLMDYAF++II N G+ +E YPY G C S KVV
Sbjct: 174 SEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQKS--CTKVV 231
Query: 248 SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
+I GY+DV+ DE SL AV PVSVAIEA FQ Y SGVF+G CG LDHGV+AV
Sbjct: 232 TISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAV 291
Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GYGT DYW+V+NSWG+ WGE+GY+++ R N +CGIA++ SYP
Sbjct: 292 GYGTTGSQDYWIVKNSWGTSWGESGYIRMIR-----NKNQCGIAIQPSYPT 337
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 141/297 (47%), Positives = 187/297 (62%), Gaps = 16/297 (5%)
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
S+ +G E F+ NLR I+ HN+ N ++ +G+ +FADLT E+ A KR
Sbjct: 38 SSQLGLCEPAFRCHLANLRVIEAHNAGNSSFTMGITQFADLTAAEFSAY-------VKRF 90
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
M V R + + VDWR+K AV +K+QG CGSCW+FST +VEG + I T
Sbjct: 91 PMN--VTRPRNEVWITEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIAT 148
Query: 183 GELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
G+L+SLSEQ+L+DC R N GCNGGLMDYAF+++I NGG+D+E+DYPY + KC+ +
Sbjct: 149 GKLVSLSEQQLMDCSTRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEK 208
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
I G+ +V E L AV+ PVSVAIEA FQHY SGVF G+CG++LDH
Sbjct: 209 EKKHAAEIHGFRNVPKEHEDQLAAAVSIGPVSVAIEADQAGFQHYTSGVFDGKCGTSLDH 268
Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GV+ VGY DYW+V+NSWG WGE GY++L+R + G CGI M+ASYP K
Sbjct: 269 GVLVVGYSD----DYWIVKNSWGKSWGEEGYIRLKRGV--DKKGMCGITMQASYPEK 319
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 157/359 (43%), Positives = 214/359 (59%), Gaps = 18/359 (5%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
+A+ST V + S +AA ++ D +++ D + + ++ W+AKHGKT
Sbjct: 1 MALSTFVLAVLVMSGAAALGRELAGDGAAAAAAA---DVAMASRHEKWMAKHGKTYKDEE 57
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRT-----YKVGLNKFADLTNEEYRAMYLG-TRSDAKR 121
+R ++F+ N + ID N+ +++ N+FADLT++E+RA G R A
Sbjct: 58 EKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFRAARTGYQRPPAAV 117
Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
+ ++ A P+S+DWR GAV VKDQGSCG CWAFS VAAVEG+ KI
Sbjct: 118 AGAGGGFLYENFSLAAA---PQSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIR 174
Query: 182 TGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG+L+SLSEQELVDCD R + GC GGLMD AFQ+I + GG+ +E YPY G + +
Sbjct: 175 TGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSYPYRGVDGA-CRA 233
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSAL 299
SI G++DV DE +L AVA QPVSVAI G F+ Y+ GV G CG+ L
Sbjct: 234 AAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTEL 293
Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+H V AVGYGT +G YWL++NSWG+ WGE GYV+++R + G CGIA ASYPV
Sbjct: 294 NHAVTAVGYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGV--GREGACGIAQMASYPV 350
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 156/360 (43%), Positives = 206/360 (57%), Gaps = 66/360 (18%)
Query: 1 MATASMFLAIS-TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
MA+ + + +S L+F+ +S A S+ H+ S R +D W+A++
Sbjct: 1 MASTNQYQYVSMALLFILAAWASQATSRSL------HEASMYERHED--------WMARY 46
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
G+ EKRF+IFKDN+ A T +Y +
Sbjct: 47 GRMYKDANEKEKRFKIFKDNV--------------------AQATTFKYENV-------- 78
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+P ++DWR+KGAV P+KDQ CGSCWAFS VAA EGI +
Sbjct: 79 -------------------TAVPSTIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQ 119
Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
I TG+LISLSEQELVDCD N GC+GGL D AF+FI +G + SE YPY G + C+
Sbjct: 120 ITTGKLISLSEQELVDCDTGGENQGCSGGLXDDAFRFIXIHG-LASEATYPYEGDDGTCN 178
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+ I GYEDV +E +L+KAVA QPV+VAI+AGG FQ Y SGVFTG+CG+
Sbjct: 179 SKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTE 238
Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
LDHGV AVGYG ++G+ YWLV+NSWG+ WGE GY+++QR++ G CGIAM+ASYP
Sbjct: 239 LDHGVAAVGYGIGDDGMXYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 297
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 161/356 (45%), Positives = 211/356 (59%), Gaps = 47/356 (13%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
LAI+ LV +F +S A +I+ +D ++ ++ W+A+HG+T
Sbjct: 9 LAIALLV-VFSTWASQAMARQLIN-------------EDALVEKHEQWMARHGRTYQDSE 54
Query: 68 HNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
E+RFQIFK NL +ID N + N+TY++GLN FADL++EEY A Y
Sbjct: 55 EKERRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEEYVATYT------------- 101
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
A K E+PES+DWR+ GAV P+K+Q CG CWAFS AAVEGI + G +
Sbjct: 102 -------ARKMPVEVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI--VANG--V 150
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SLS Q+L+DC N GC GG M+ AF +IIQN G+ E DYPY + C SR A
Sbjct: 151 SLSAQQLLDCVSD-NQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMCS-SRMAA-- 206
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA-FQHYESGVFTGE-CGSALDHGVV 304
I G+EDV+P DE +L +AVA QPVSV I+A F+ Y+ GVFT CG+ H V
Sbjct: 207 AQISGFEDVTPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVT 266
Query: 305 AVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
VGYGT E+G YWL +NSWG WGE+GY++LQR+ + G CGIA+ ASYP N
Sbjct: 267 LVGYGTSEDGTKYWLAKNSWGETWGESGYMRLQRD-IGLEGGPCGIALYASYPTIN 321
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 149/350 (42%), Positives = 206/350 (58%), Gaps = 20/350 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVFLF A S S D D +M ++ W+A++G+ +R
Sbjct: 7 LVFLFLFLCVMWASPSAASRD---------EPSDPMMKRFEEWMAEYGRVYKDNDEKMRR 57
Query: 73 FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N+ I+ N+ N +Y +G+NKF D+TN E+ Y G + V S
Sbjct: 58 FQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGV--SLPLNFKREPVVS- 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ + +S+DWR+ GAV VKDQ CGSCWAFS +A VEGI KIVTG L+SLSEQ
Sbjct: 115 -FDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQ 173
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
E++DC ++ GC+GG +D A+ FII N G+ SE DYPY E C + I G
Sbjct: 174 EVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSW-PNSAYITG 230
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
Y V DE S+K AV +QP++ AI+A G FQ+Y GVF+G CG++L+H + +GYG +
Sbjct: 231 YSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQD 290
Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
+G YW+V+NSWGS WGE GYV++ R + +++G CGIAM+ YP S
Sbjct: 291 SSGTQYWIVKNSWGSSWGERGYVRMARGV--SSSGLCGIAMDPLYPTLQS 338
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 154/310 (49%), Positives = 202/310 (65%), Gaps = 24/310 (7%)
Query: 59 HGKTSNGMGHNEKRF--QIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMY 112
HGK+ GH+E+ F Q+F ++ I+ HN + TY++GLNKF D+T+EE+R +
Sbjct: 26 HGKS---YGHDEEHFRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRN-F 81
Query: 113 LGTRSDAKRRLMKSKVASQRYACKA-GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
G + DA K+K R+ + G+ LP VDWREKG V PVK+QG CGSCWAFST
Sbjct: 82 KGLKFDA----TKTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTT 137
Query: 172 AAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
++EG + TG+L+SLSEQ LVDC R + N GCNGGLMD F +I QNGG+D+E+ YPY
Sbjct: 138 GSLEGQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPY 197
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
G + C N+ + G+ DV DE +L+ AVA PVSVAI+A +FQ+Y+ G
Sbjct: 198 TGKDGDC-AFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEG 256
Query: 290 VF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
V+ C S LDHGV+ VGYGTENGVDYWLV+NSWG WG++GY+K+ RN +C
Sbjct: 257 VYDEPSCSFSQLDHGVLVVGYGTENGVDYWLVKNSWGPTWGQDGYIKMMRN----KENQC 312
Query: 348 GIAMEASYPV 357
GIA ASYP
Sbjct: 313 GIASMASYPT 322
>gi|428170119|gb|EKX39047.1| hypothetical protein GUITHDRAFT_154556 [Guillardia theta CCMP2712]
Length = 352
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 143/326 (43%), Positives = 202/326 (61%), Gaps = 13/326 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNK 99
DDE+ + +W K K +G H RF +FK N+ I HN+L T+ + N+
Sbjct: 27 VDDEIHLAFISWKNKFEKVYDGAEH-LARFAVFKANMEIIRAHNALYELGEETFSMAANQ 85
Query: 100 FADLTNEEYRAMYLGTRSDAK-RRLMKSKVASQRYACKAGDEL-PESVDWREKGAVNPVK 157
FAD+T EE++ LG + + K +RL++ + + ++ + P+++DWR K AV PVK
Sbjct: 86 FADMTAEEFKRTVLGYKPELKGKRLLQGLNSGKNCTHRSNNSTRPKAIDWRTKSAVTPVK 145
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
+QG CGSCW+FST AVEG + LISLSE+ELV CD K + GCNGGLMD A+ +II
Sbjct: 146 NQGQCGSCWSFSTTGAVEGAWVVAGHPLISLSEEELVQCDTKSDQGCNGGLMDNAYAWII 205
Query: 218 QNGGMDSEQDYPYL---GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSV 274
QNGG+ +E YPY+ G C + + KV SI + D+ P DE L+ A+ QPV+V
Sbjct: 206 QNGGIAAEDVYPYISGNGTTGVCHVAFLSKKVASISDWCDLKPEDESDLELALVQQPVAV 265
Query: 275 AIEAGGRAFQHYESGVFTG-ECGSALDHGVVAVGYG--TENGVDYWLVRNSWGSDWGENG 331
AIEA +FQ Y GV +CG+ LDHGV+AVGYG ++ + YW+V+NSWG++WG+ G
Sbjct: 266 AIEADQSSFQFYNGGVLPAKKCGTKLDHGVLAVGYGYDKKHKMHYWIVKNSWGAEWGDEG 325
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
Y++L++ T CGIA ASYP
Sbjct: 326 YIRLEKMPKKTKHSACGIAKAASYPT 351
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 199/332 (59%), Gaps = 23/332 (6%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
+++ + +Y+ W A H + G +RF +FK+N R I EHN N TY +GLN+F+D
Sbjct: 40 SEESLWALYERWCA-HYNMARDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSD 98
Query: 103 LTNEEY-RAMYLGTRS------DAKRRLMKSKVASQRYAC------KAGDEL--PESVDW 147
+T+EE+ R+ Y G + D L + G +L P +VDW
Sbjct: 99 MTDEEFNRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDW 158
Query: 148 REKGAVNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNG 206
R + AV VKDQG +CGSCWAFS +AAVEGIN I T L+ LSEQ+LVDCD K+N GCNG
Sbjct: 159 RGR-AVTRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCD-KLNHGCNG 216
Query: 207 GLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKA 266
GLM AF F+++N G+ E YPY+G E +C A V+I GY+ V FD +L A
Sbjct: 217 GLMTTAFSFVVRNRGVVPEGAYPYMGREGRC--KHVMAPPVTIYGYQRVPRFDANALMNA 274
Query: 267 VADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSD 326
VA QPVSVAIEA F+HY+ GVF G CG L H AVGYG + G +W+V+NSWG
Sbjct: 275 VAAQPVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYGADAGGPFWIVKNSWGPG 334
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
WGE GYV++ RN G CGI E SYPVK
Sbjct: 335 WGEGGYVRISRN-TPVRQGVCGILTENSYPVK 365
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 148/317 (46%), Positives = 205/317 (64%), Gaps = 24/317 (7%)
Query: 52 YQT----WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNE 106
YQT W+ KH ++ + N K +Q FKDN+ FI N+ N +GL +FADLTNE
Sbjct: 29 YQTSFLGWMKKHDRSYHHHEFNNK-YQAFKDNMDFIHNWNTNKNSKTVLGLTQFADLTNE 87
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL-PESVDWREKGAVNPVKDQGSCGSC 165
EYR +YLGT K VA +++ P+S+DWR KGAV+ VKDQG CGSC
Sbjct: 88 EYRKIYLGT---------KVNVAPEKHNFNMIHFTGPDSIDWRTKGAVSHVKDQGQCGSC 138
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
W+FST +VEG ++I TG +++LSEQ LVDC K N GC+GGLM AF+FI+ GG+ +
Sbjct: 139 WSFSTTGSVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVAT 198
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E YPY + KC ++ +I GY++++ E+ L+ A+ QPVS+AI+A ++FQ
Sbjct: 199 EDSYPYNAVQGKCKFTKSMVG-ANISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQ 257
Query: 285 HYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Y+SGV+ EC S LDHGV+AVGYGTENG DY++V+NSW WG++GY+ + RN +
Sbjct: 258 LYKSGVYDEPECSSYQLDHGVLAVGYGTENGKDYYIVKNSWADSWGQDGYIFMSRNAKN- 316
Query: 343 NTGKCGIAMEASYPVKN 359
+CG+A ASYP+ N
Sbjct: 317 ---QCGVATMASYPISN 330
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 140/296 (47%), Positives = 189/296 (63%), Gaps = 28/296 (9%)
Query: 71 KRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
KR F+ NL FI++HN+ + +Y VG+N+FADLT +E+ A+Y+ ++ + R + +
Sbjct: 17 KRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALYVPSKFN--RTMPYN 74
Query: 127 KVASQRYACKAGDELP----ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
V LP +SVDWR KGAV P+K+QG CGSCW+FST + EG + I T
Sbjct: 75 TVY-----------LPATSEDSVDWRTKGAVTPIKNQGQCGSCWSFSTTGSTEGAHAIAT 123
Query: 183 GELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
G L+SLSEQ+LVDC N GCNGGLMD AF++II N G+D+E+DYPY + C+ +
Sbjct: 124 GNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTAQDGTCNKEK 183
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
+I Y DV +E L AVA PVSVAIEA FQ Y+SGVF G CG+ LDH
Sbjct: 184 EAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVFDGNCGTNLDH 243
Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GV+ VGY DYW+V+NSWG+ WG GY+ ++R + + +G CGIAM+ SYP+
Sbjct: 244 GVLVVGYTD----DYWIVKNSWGTTWGVEGYINMKRGV--SASGICGIAMQPSYPI 293
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 157/361 (43%), Positives = 211/361 (58%), Gaps = 37/361 (10%)
Query: 5 SMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
S+ L + TL+ L +++S+ Y+N D D M +++ W+AK GKT
Sbjct: 3 SIVLLVCTLMALQAMAASA-------YYNNGSD-------DGVTMQMFEEWMAKFGKTYK 48
Query: 65 GMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
G E RF IF+DN+ FI + TY VG+N+FADLTN+E+ A Y G
Sbjct: 49 CHGEKEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQFADLTNDEFVATYTG-------- 99
Query: 123 LMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+K + A + D + P +DWR +GAV VKDQG+CGSCWAF+ VAA+EG+ KI
Sbjct: 100 ---AKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKI 156
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--D 238
TG+L LSEQELVDCD N GC GG D AF+ + GG+ +E DY Y G + KC D
Sbjct: 157 RTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVD 215
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
N SI GY V P DE L AVA QPV+V I+A G AFQ Y+SGVF G CG++
Sbjct: 216 DMLFN-HAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGAS 274
Query: 299 LDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+H V VGY + +G YWL +NSWG WG+ GY+ L+++++ + G CG+A+ YP
Sbjct: 275 SNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPH-GTCGLAVSPFYP 333
Query: 357 V 357
Sbjct: 334 T 334
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 146/291 (50%), Positives = 186/291 (63%), Gaps = 19/291 (6%)
Query: 72 RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
RF FK N+ I HN+L N +Y +GLN+FADL+ EE++ Y G + +R +S
Sbjct: 61 RFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYFGYK-HVEREFARSNNLH 119
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE--LISL 188
Q + P S+DWR AV P+KDQG CGSCWAFS ++EG ++ G+ L SL
Sbjct: 120 QEV-----EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGA-WVLQGKHTLTSL 173
Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
SEQ+LVDC +AGCNGGLMDYAF++II N G+ +E YPY G C S KVV
Sbjct: 174 SEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQKS--CTKVV 231
Query: 248 SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
+I GY+DV+ DE SL AV PVSVAIEA FQ Y SGVF+G CG LDHGV+AV
Sbjct: 232 TISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAV 291
Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GYGT DYW+V+NSWG+ WGE+GY+++ R N +CGIA++ SYP
Sbjct: 292 GYGTTGSQDYWIVKNSWGTSWGESGYIRMIR-----NKNQCGIAIQPSYPT 337
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 147/365 (40%), Positives = 221/365 (60%), Gaps = 23/365 (6%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHS------SSWRTDDEVMTIYQTWLAKH 59
M + T +FL F+ S + + Y ++S + +++ V+ ++Q W ++
Sbjct: 1 MGCQLKTQLFLLFLVWGS---WTFLCYGLPSEYSILALEIDKFPSEEGVIELFQRWKEEN 57
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTR 116
K + RF+ FK NL++I E NS + +GLN+FAD++NEE+++ + T
Sbjct: 58 KKIYRSPDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFKSKF--TS 115
Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
K ++ ++ + ++C ++ P S+DWR+KG V VKDQG CG CWAFS+ A+EG
Sbjct: 116 KVKKPFSKRNGLSGKDHSC---EDAPYSLDWRKKGVVTAVKDQGYCGCCWAFSSTGAIEG 172
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
IN IV+G+LISLSE ELVDCDR N GC+GG MDYAF++++ NGG+D+E +YPY GA+
Sbjct: 173 INAIVSGDLISLSEPELVDCDR-TNDGCDGGHMDYAFEWVMHNGGIDTETNYPYSGADGT 231
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C+ ++ KV+ IDGY +V D SL A QP+S I+ FQ Y G++ G+C
Sbjct: 232 CNVAKEETKVIGIDGYYNVEQSDR-SLLCATVKQPISAGIDGSSWDFQLYIGGIYDGDCS 290
Query: 297 S---ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEA 353
S +DH ++ VGYG+E DYW+V+NSWG+ WG GY+ ++RN + G C I A
Sbjct: 291 SDPDDIDHAILVVGYGSEGDEDYWIVKNSWGTSWGMEGYIYIRRN-TNLKYGVCAINYMA 349
Query: 354 SYPVK 358
SYP K
Sbjct: 350 SYPTK 354
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 159/365 (43%), Positives = 213/365 (58%), Gaps = 38/365 (10%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MA+A L + TL+ L +++S+ Y+N D D M +++ W+AK G
Sbjct: 1 MASA-FLLVVCTLMALQAMAASA-------YYNNGSD-------DGVTMQMFEEWMAKFG 45
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKFADLTNEEYRAMYLGTRSD 118
KT G E RF IF+DN+ FI + TY VG+N+FADLTN+E+ A Y G
Sbjct: 46 KTYKCHGEKEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQFADLTNDEFVATYTG---- 100
Query: 119 AKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
+K + A + D + P +DWR +GAV VKDQG+CGSCWAF+ VAA+EG
Sbjct: 101 -------AKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEG 153
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
+ KI TG+L LSEQELVDCD N GC GG D AF+ + GG+ +E DY Y G + K
Sbjct: 154 LTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRYEGFQGK 212
Query: 237 C--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
C D N SI GY V P DE L AVA QPV+V I+A G AFQ Y+SGVF G
Sbjct: 213 CRVDDMLFN-HAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGP 271
Query: 295 CGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
CG++ +H V VGY + +G YW+ +NSWG WG+ GY+ L++++L + G CG+A+
Sbjct: 272 CGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPH-GTCGLAVS 330
Query: 353 ASYPV 357
YP
Sbjct: 331 PFYPT 335
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 134/260 (51%), Positives = 178/260 (68%), Gaps = 4/260 (1%)
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVK 157
+FA++TN+E+R+MY G + D+ ++ RY + LP +VDWR+KGAV P+K
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
+QGSCG CWAFS VAA+EG +I G+LISLSEQ+LVDCD + GC+GGL+D AF+ I+
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLIDTAFEHIM 119
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
GG+ +E +YPY G + C SI GYEDV DE +L KAVA QPVSV IE
Sbjct: 120 ATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIE 179
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQ 336
GG FQ Y SGVFTGEC + LDH V AVGY + G YW+++NSWG+ WGE GY++++
Sbjct: 180 GGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIK 239
Query: 337 RNLLDTNTGKCGIAMEASYP 356
+++ D G CG+AM+ASYP
Sbjct: 240 KDIKDKE-GLCGLAMKASYP 258
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 140/262 (53%), Positives = 174/262 (66%), Gaps = 16/262 (6%)
Query: 71 KRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
+RF IF DNL FI HN+ T+ VG+N+FADLTNEEYR +YL R L +
Sbjct: 39 RRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEEYRQLYL--RPYPTELLGRE 96
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+ AG SVDWR+KGAV P+K+QG CGSCW+FST +VEG + I TG L+
Sbjct: 97 RQEVWLDGPNAG-----SVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLV 151
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQ+LVDC N GCNGGLMD AF++II NGG+D+EQDYPY + CD S+ +
Sbjct: 152 SLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARDGVCDKSKESKH 211
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VSI GY+DV +E L AV PVSVAIEA ++FQ Y SGVF+G CG+ LDHGV+
Sbjct: 212 AVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLV 271
Query: 306 VGYGTENGVDYWLVRNSWGSDW 327
VGY + DYW+V+NSWG+ W
Sbjct: 272 VGYTS----DYWIVKNSWGASW 289
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 201/318 (63%), Gaps = 27/318 (8%)
Query: 44 TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
+DD M ++ W+A++G+ +RF++FK N+ FI+ N+ N + +G+N+FAD
Sbjct: 28 SDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVNQFAD 87
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
LTN+E+R+ T+++ ++V + R D LP ++DWR KG V P+KDQG
Sbjct: 88 LTNDEFRS----TKTNKGFIPSTTRVPTGFRNENVNIDALPATMDWRTKGVVTPIKDQGQ 143
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG CWAFS VAA+E ELVDCD + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAME----------------ELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 187
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E +YPY ++K + V SI GYEDV +E +L KAVA+QPVSVA++ G
Sbjct: 188 GLTTESNYPYAAVDDKFKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 245
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y+ GV TG CG+ LDHG+VA+GYG +G YWL++NSWG WGENG++++++++
Sbjct: 246 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDI 305
Query: 340 LDTNTGKCGIAMEASYPV 357
D G CG+AME SYP
Sbjct: 306 SDKR-GMCGLAMEPSYPT 322
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 155/385 (40%), Positives = 214/385 (55%), Gaps = 32/385 (8%)
Query: 1 MATASMFLAISTLVFL--FFISSSSAADMSIIS-YDNNHDHSSSWRTDDEVMTIYQTWLA 57
MA AS F L+ L FFI SS + S N D + T +M ++Q W A
Sbjct: 1 MAAASFFSMPCLLILLGVFFIGCSSGTARRVTSDTAANTDGEPAATT---MMEMFQRWKA 57
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGT- 115
++ ++ +R +++ N+R+I+ N+ Y++G + DLTN+E+ AMY
Sbjct: 58 EYNRSYATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTNDEFMAMYTAPP 117
Query: 116 -RSDAKRRLMKSKVASQRYACKAGDE-------------LPESVDWREKGAVNPVKDQGS 161
RS A + DE P SVDWR GAV VKDQG
Sbjct: 118 LRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWRASGAVTEVKDQGR 177
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFSTVA VEGI KI G+L+SLSEQELVDCD +++GC+GG+ A ++I NGG
Sbjct: 178 CGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCD-TLDSGCDGGVSYRALEWITANGG 236
Query: 222 MDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
+ + DYPY G A CD ++ +I G V+ E SL+ A A QPV+V+IEAGG
Sbjct: 237 ITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAAAQPVAVSIEAGG 296
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTEN--------GVDYWLVRNSWGSDWGENGY 332
FQHY GV+ G CG+ L+HGV VGYG E G YW+++NSWG +WG+ GY
Sbjct: 297 DNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKNSWGKNWGDQGY 356
Query: 333 VKLQRNLLDTNTGKCGIAMEASYPV 357
+K+++++ G CGIA+ S+P+
Sbjct: 357 IKMKKDVAGKPEGLCGIAIRPSFPL 381
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 139/320 (43%), Positives = 199/320 (62%), Gaps = 16/320 (5%)
Query: 46 DEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFAD 102
DE M + Y+ W+A++ + RFQ+FK N FID N+ + Y +G N+FAD
Sbjct: 51 DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFAD 110
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKV--ASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
LT++E+ AMY G R A ++ A +Y + VDWR++GAV PVK+QG
Sbjct: 111 LTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQG 170
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
CG CWAFS V A+EG+ I TG L+SLSEQ+++DCD N GCNGG MD AFQ++I N
Sbjct: 171 QCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINN 230
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+ +E YPY + C + A +I G++D+ DE +L AVA+QPVSV ++ G
Sbjct: 231 GGVTTEDAYPYSAVQGTCQNVQPAA---TISGFQDLPSGDENALANAVANQPVSVGVDGG 287
Query: 280 GRAFQHYESGVFTGE-CGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQR 337
FQ Y+ G++ G+ CG+ ++H V A+GYG ++ G YW+++NSWG+ WGENG+++LQ
Sbjct: 288 SSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQM 347
Query: 338 NLLDTNTGKCGIAMEASYPV 357
+ G CGI+ ASYP
Sbjct: 348 GV-----GACGISTMASYPT 362
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 140/310 (45%), Positives = 198/310 (63%), Gaps = 12/310 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
++ W+ HG+ E RF+ FK+N+ FI+ N + + YK+ +NK+ADLT EE+
Sbjct: 41 HENWMVHHGRVYKDDIEKEHRFKTFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTT 100
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
++G + + +S + + + E+P S+DWR++G+V VKDQG CG CWAFS
Sbjct: 101 SFMGLDTSLLSQ-QESTATTTSFKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSA 159
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN--GGMDSEQDY 228
AA+EG +I ELISLSEQ+L+DC + N GC GGLM A+ F++QN GG+ +E +Y
Sbjct: 160 AAAIEGAYQIANNELISLSEQQLLDCSTQ-NKGCEGGLMTVAYDFLLQNNGGGITTETNY 218
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY A+N C + A V+I+GYE V P DE SL KAV +QP+SV I A F Y S
Sbjct: 219 PYEEAQNVCKTEQPAA--VTINGYE-VVPSDESSLLKAVVNQPISVGI-AANDEFHMYGS 274
Query: 289 GVFTGECGSALDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
G++ G C S L+H V +GYGT E+G YW+V+NSWGSDWGE GY+++ R+ + + G
Sbjct: 275 GIYDGSCNSRLNHAVTVIGYGTSEEDGTKYWIVKNSWGSDWGEEGYMRIARD-VGVDGGH 333
Query: 347 CGIAMEASYP 356
CGIA AS+P
Sbjct: 334 CGIAKVASFP 343
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 203/318 (63%), Gaps = 22/318 (6%)
Query: 44 TDDEVMTIYQTWLAKHGK--TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
+ + T +Q W+ KH K T++ G R+ +F+DN+ + + N +GLN A
Sbjct: 24 SQKQYQTAFQNWMVKHQKSYTNDEFG---SRYSVFQDNMDIVAKWNQKGSNTILGLNVMA 80
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DLTNEE++ +YLGT K+ V ++ LP SVDWR GAV VK+QG
Sbjct: 81 DLTNEEFKKLYLGT---------KANVTYKKKTLVGVSGLPASVDWRANGAVTAVKNQGQ 131
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
CG C+AFST +VEGI++I + +L+ LSEQ+++DC + N GC+GGLM +F++II G
Sbjct: 132 CGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVG 191
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+D+E YPY G KC +++N +I GY++V E L+ AVA QPVSVAI+A
Sbjct: 192 GLDTEASYPYTGEVGKCKFNKKNIG-ATITGYKNVESGSESDLQTAVAAQPVSVAIDASQ 250
Query: 281 RAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+FQ Y SGV + EC S LDHGV+AVGYG+++G DYW+V+NSWG+DWGENG++ + RN
Sbjct: 251 SSFQLYASGVYYEPECSSTQLDHGVLAVGYGSQSGQDYWIVKNSWGADWGENGFILMARN 310
Query: 339 LLDTNTGKCGIAMEASYP 356
D N CGIA AS+P
Sbjct: 311 -KDNN---CGIATMASFP 324
>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
Length = 214
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 127/216 (58%), Positives = 160/216 (74%), Gaps = 3/216 (1%)
Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAG 203
SVDWR+KG V +KDQG CG+CWAFS +AAVEG+ + TG L+SLSEQELVDCD +N G
Sbjct: 1 SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQG 60
Query: 204 CNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSL 263
C+GG+MDYAFQ++I+NGG+ S+ +YPY CD + +I+G++ + P E L
Sbjct: 61 CDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEELL 120
Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNS 322
+AVA+QPVSVAIEAGG+ FQ Y SGVFTGECGS LDHGV VGYGT+ G YWLV+NS
Sbjct: 121 LRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNS 180
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
WGS WGE+GYV+++R G CGI ++ASYP K
Sbjct: 181 WGSGWGESGYVRMERQ--GPGAGVCGINLDASYPTK 214
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 149/338 (44%), Positives = 207/338 (61%), Gaps = 20/338 (5%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
D SI+ Y + D +S+ R ++ ++ +W+ H K + RF+IFKDNL +IDE
Sbjct: 1 DFSIVGYSQD-DLTSTER----LIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE 55
Query: 86 HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE----L 141
N N +Y +GLN+FADL+N+E+ Y+G+ DA Q Y + +E L
Sbjct: 56 TNKKNNSYWLGLNEFADLSNDEFNEKYVGSLIDA--------TIEQSYDEEFINEDIVNL 107
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PE+VDWR+KGAV PV+ QGSCGSCWAFS VA VEGINKI TG+L+ LSEQELVDC+R+ +
Sbjct: 108 PENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-S 166
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GC GG YA +++ +NG + YPY + C + +V G V P +E
Sbjct: 167 HGCKGGYPPYALEYVAKNG-IHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEG 225
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
+L A+A QPVSV +E+ GR FQ Y+ G+F G CG+ +D V AVGYG G Y L++N
Sbjct: 226 NLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYILIKN 285
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
SWG+ WGE GY++++R + G CG+ + YP KN
Sbjct: 286 SWGTAWGEKGYIRIKR-APGNSPGVCGLYKSSYYPTKN 322
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 191/312 (61%), Gaps = 11/312 (3%)
Query: 49 MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
M ++ + K+GK NG+ + RF IFK N+ I N+ N T+ +G+N+F DLT EE
Sbjct: 24 MMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEEL 83
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
A Y G + + + ++++ Y G L SVDW +G V PVK+QG CGSCW+F
Sbjct: 84 AASYTGLKPASLWSGLP-RLSTHEYN---GAPLASSVDWTTQGVVTPVKNQGQCGSCWSF 139
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
ST A+EG + TG L+SLSEQ+ VDCD ++GCNGG MD AF F +N + +E Y
Sbjct: 140 STTGALEGAWALSTGNLVSLSEQQFVDCDTT-DSGCNGGWMDNAFSFAKKNS-ICTEGSY 197
Query: 229 PYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
PY + C+ S + + GY DVS E ++ AVA QPVS+AIEA +FQ Y
Sbjct: 198 PYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLY 257
Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
SGV T CG+ LDHGV+AVGYG+E G DYW V+NSWGS WGE GYV+LQR G+
Sbjct: 258 SSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRG--KGGAGE 315
Query: 347 CG-IAMEASYPV 357
CG +A SYPV
Sbjct: 316 CGLLAGPPSYPV 327
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 193/307 (62%), Gaps = 8/307 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFADLTNEEYR 109
+ W+ HG T + +R + + N +I EHN+ N +G N F+ ++ +E++
Sbjct: 28 FSAWMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSHMSFDEFK 87
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
G ++ ++AS+ + E+P +VDW +KG V PVK+QG CGSCWAFS
Sbjct: 88 FKMTGLV--LPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFS 145
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
T AVEG + +G+L SLSEQELVDCD + GCNGGLMD+AFQ+I +GG+ SE DY
Sbjct: 146 TTGAVEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYE 205
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y C R VV + G++DV+P DE +LK AVA QPVSVAIEA +AFQ Y+SG
Sbjct: 206 YKAKAQVC---RECDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSG 262
Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
VF CG+ LDHGV+AVGYG +NG +W V+NSWG+ WGE GY++L R + G+CGI
Sbjct: 263 VFNLTCGTRLDHGVLAVGYGNDNGHKFWKVKNSWGASWGEQGYIRLARE-ENGPAGQCGI 321
Query: 350 AMEASYP 356
A SYP
Sbjct: 322 ASVPSYP 328
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 202/322 (62%), Gaps = 21/322 (6%)
Query: 46 DEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
DE M + Y+ W+A++ + RFQ+FK N FID N+ + Y +G N+FAD
Sbjct: 51 DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFAD 110
Query: 103 LTNEEYRAMYLGTRSDAK----RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
LT++E+ AMY G R A + + + Q + + D++ VDWR++GAV PVK+
Sbjct: 111 LTSKEFAAMYTGLRKPAAVPSGAKQIPAGFKYQNFT-RLDDDV--QVDWRQQGAVTPVKN 167
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFII 217
QG CG CWAFS V A+EG+ I TG L+SLSEQ+++DCD N GCNGG MD AFQ+++
Sbjct: 168 QGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVV 227
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
NGG+ +E YPY + C + A +I G++D+ DE +L AVA+QPVSV ++
Sbjct: 228 NNGGVTTEDAYPYSAVQGTCQNVQPAA---TISGFQDLPSGDENALANAVANQPVSVGVD 284
Query: 278 AGGRAFQHYESGVFTGE-CGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKL 335
G FQ Y+ G++ G+ CG+ ++H V A+GYG ++ G YW+++NSWG+ WGENG+++L
Sbjct: 285 GGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQL 344
Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
Q + G CGI+ ASYP
Sbjct: 345 QMGV-----GACGISTMASYPT 361
>gi|18202414|sp|P82473.1|CPGP1_ZINOF RecName: Full=Zingipain-1; AltName: Full=Cysteine proteinase GP-I
Length = 221
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 130/220 (59%), Positives = 166/220 (75%), Gaps = 3/220 (1%)
Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
D LP+S+DWREKGAV PVK+QG CGSCWAF +AAVEGIN+IVTG+LISLSEQ+LVDC
Sbjct: 1 DVLPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCST 60
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
+ N GC GG AFQ+II NGG++SE+ YPY G CD ++ NA VVSID Y +V
Sbjct: 61 R-NHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCD-TKENAHVVSIDSYRNVPSN 118
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
DE SL+KAVA+QPVSV ++A GR FQ Y +G+FTG C + +H G TEN DYW
Sbjct: 119 DEKSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETENDKDYWT 178
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
V+NSWG +WGE+GY++++RN+ ++ +GKCGIA+ SYP+K
Sbjct: 179 VKNSWGKNWGESGYIRVERNIAES-SGKCGIAISPSYPIK 217
>gi|222632170|gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
Length = 1105
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 122/218 (55%), Positives = 160/218 (73%), Gaps = 1/218 (0%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
+P++VDWR+ GAV VKDQGSCG+CW+FS A+EGINKI TG LISLSEQEL+DCDR
Sbjct: 129 VPDAVDWRQSGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSY 188
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
N+GC GGLMDYA++F+++NGG+D+E DYPY + C+ ++ +VV+IDGY+DV +E
Sbjct: 189 NSGCGGGLMDYAYKFVVKNGGIDTEADYPYRETDGTCNKNKLKRRVVTIDGYKDVPANNE 248
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
L +AVA QPVSV I RAFQ Y G+F G C ++LDH ++ VGYG+E G DYW+V+
Sbjct: 249 DMLLQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAILIVGYGSEGGKDYWIVK 308
Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NSWG WG GY+ + RN ++N G CGI S+P K
Sbjct: 309 NSWGESWGMKGYMYMHRNTGNSN-GVCGINQMPSFPTK 345
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 151/352 (42%), Positives = 210/352 (59%), Gaps = 34/352 (9%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK--TSNGMG 67
I LVF F I + +A + + T +Q W+ KH K T++ G
Sbjct: 4 ILALVFCFLIVNCISAARVF--------------SQKQYQTAFQNWMVKHQKSYTNDEFG 49
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R+ IF+DN+ F+ + N +GLN ADLTN+EY+ +YLGT++ K+ +
Sbjct: 50 ---SRYTIFQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRIYLGTKTTVKKPNLIIG 106
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
V A P SVDWR GAV VK+QG CG C++FST +VEGI++I + +L+S
Sbjct: 107 VTDVSKA-------PASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVS 159
Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ+++DC + N GC+GGLM +F++II GG+D+E YPY G KC ++ N
Sbjct: 160 LSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANIG- 218
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVV 304
+I GY++V E L+ AVA QPVSVAI+A +FQ Y SGV + C S LDHGV+
Sbjct: 219 ATITGYKNVKSGSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVL 278
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
AVGYG+++G DYW+V+NSWG+DWGE G++ + RN CGIA ASYP
Sbjct: 279 AVGYGSQSGQDYWIVKNSWGADWGEKGFILMARN----KHNNCGIATMASYP 326
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 142/359 (39%), Positives = 217/359 (60%), Gaps = 21/359 (5%)
Query: 4 ASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS 63
AS+ + ++ L+ LF S A + + + ++ ++ W+A+ +
Sbjct: 2 ASIMVLVTVLIILFTGFRISQATSRTVIF-----------REQSMVDKHEQWMARFSREY 50
Query: 64 NGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAK-- 120
R +FK NL+FI+ N N++YK+G+N+FAD TNEE+ A++ G + +
Sbjct: 51 RDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVS 110
Query: 121 -RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+++ ++SQ + D + ES DWR +GAV PVK QG CG CWAFS VAAVEG+ K
Sbjct: 111 PSKVVAKTISSQTW--NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAK 168
Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
I G L+SLSEQ+L+DCDR+ + C+GG+M AF +++QN G+ SE DY Y G++ C
Sbjct: 169 IAGGNLVSLSEQQLLDCDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRS 228
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
+ R A +S G++ V +E +L +AV+ QPVSV+++A G F HY GV+ G CG++
Sbjct: 229 NARPAARIS--GFQTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSS 286
Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+H V VGYGT ++G YWL +NSWG W E GY++++R++ G CG+A A YPV
Sbjct: 287 NHAVTFVGYGTSQDGTKYWLAKNSWGETWEEKGYIRIRRDVA-WPQGMCGVAQYAFYPV 344
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 146/320 (45%), Positives = 198/320 (61%), Gaps = 19/320 (5%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEK---RFQIFKDNLRFIDEHNSLNRTYKVGLNKF 100
T D + ++ W+ + K+ + NE+ R+ ++++N + I+EHN N+T + +NKF
Sbjct: 22 THDPLTGVFAEWMRDNSKSYS----NEEFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKF 77
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
DLTN E+ ++ G D +K A+++ G L DWR+KGAV VK+QG
Sbjct: 78 GDLTNAEFNKLFKGLAFD--YSFHANKAAAEKAVPAPG--LSADFDWRQKGAVTHVKNQG 133
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
CGSCW+FST + EG N + TG L SLSEQ L+DC N GCNGGLMDYAF++II N
Sbjct: 134 QCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINN 193
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
G+D+E YPY A+ C + N+ S+ Y DVS DE +L AVA +P SVAI+A
Sbjct: 194 KGIDTEASYPYQTAQYTCQYNPANSG-GSLTSYTDVSSGDENALLNAVATEPTSVAIDAS 252
Query: 280 GRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
+FQ Y GV + C S LDHGV+AVG+GTE+G DYWLV+NSWG+DWG GY+K+ R
Sbjct: 253 HNSFQFYSGGVYYESACSSTQLDHGVLAVGWGTEDGQDYWLVKNSWGADWGLAGYIKMAR 312
Query: 338 NLLDTNTGKCGIAMEASYPV 357
N + CGIA ASYP
Sbjct: 313 N----RSNNCGIATSASYPT 328
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 148/290 (51%), Positives = 196/290 (67%), Gaps = 8/290 (2%)
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
EKR +IFK+NL +I+ N+ N++YK+GLN+++DLT++E+ A + G + ++L SK+
Sbjct: 80 EKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLK--VSKQLSSSKM 137
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
S D++P + DWR++GAV VKDQGSCG CWAFS VAAVEG KI TGELISL
Sbjct: 138 RSAAVPFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGELISL 197
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQ+LVDCD + N+GC+GG MD AF++IIQ G+ SE DYPY C + +
Sbjct: 198 SEQQLVDCDER-NSGCHGGNMDSAFKYIIQK-GIVSEADYPYQEGSQTCQLNDQMKFEAQ 255
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
I + DV DE L +AVA QPVSV IE G FQHY V++G CG +++H V AVGY
Sbjct: 256 ITNFIDVPANDEQQLLQAVAQQPVSVGIEVGDE-FQHYMGDVYSGTCGQSMNHAVTAVGY 314
Query: 309 G-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
G +E+G YWL++NSWG WGE GY+KL R + G+CGIA ASYP+
Sbjct: 315 GVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPG-GQCGIAAHASYPI 363
>gi|308082013|ref|NP_001183396.1| uncharacterized protein LOC100501813 [Zea mays]
gi|238011208|gb|ACR36639.1| unknown [Zea mays]
Length = 291
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 124/178 (69%), Positives = 150/178 (84%), Gaps = 1/178 (0%)
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
+ISLSEQELVDCD N GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NA
Sbjct: 1 MISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNA 60
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
KVV+ID YEDV E SL+KAVA+QP+SVAIEAGGRAFQ Y SG+FTG CG+ALDHGV
Sbjct: 61 KVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVT 120
Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
AVGYGTENG DYW+V+NSWGS WGE+GYV+++RN + ++GKCGIA+E SYP+K N
Sbjct: 121 AVGYGTENGKDYWIVKNSWGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGAN 177
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 150/323 (46%), Positives = 197/323 (60%), Gaps = 24/323 (7%)
Query: 44 TDDEV-MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKF 100
+DD V M +++ W+AK GKT G E RF IF+DN+ FI + TY VG+N+F
Sbjct: 11 SDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQF 69
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKD 158
ADLTN+E+ A Y G +K + A + D + P +DWR +GAV VKD
Sbjct: 70 ADLTNDEFVATYTG-----------AKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKD 118
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG+CGSCWAF+ VAA+EG+ KI TG+L LSEQELVDCD N GC GG D AF+ +
Sbjct: 119 QGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVAS 177
Query: 219 NGGMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
GG+ +E DY Y G + KC D N SI GY V P DE L AVA QPV+V I
Sbjct: 178 KGGITAESDYRYEGFQGKCRVDDMLFN-HAASIGGYRAVPPNDERQLATAVARQPVTVYI 236
Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVK 334
+A G AFQ Y+SGVF G CG++ +H V VGY + +G YWL +NSWG WG+ GY+
Sbjct: 237 DASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYIL 296
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
L+++++ + G CG+A+ YP
Sbjct: 297 LEKDIVQPH-GTCGLAVSPFYPT 318
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 144/312 (46%), Positives = 191/312 (61%), Gaps = 11/312 (3%)
Query: 49 MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
M ++ + K+GK NG+ + RF IFK N+ I N+ N T+ +G+N+F DLT EE+
Sbjct: 24 MMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEEF 83
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
A Y G + + + ++++ Y G L SVDW +G V PVK+QG CGSCW+F
Sbjct: 84 AASYTGLKPASLWSGLP-RLSTHEYN---GAPLASSVDWTTQGVVTPVKNQGQCGSCWSF 139
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
ST A+EG + TG L+SLSEQ+ DCD ++GCNGG MD AF F +N + +E Y
Sbjct: 140 STTGALEGAWALSTGNLVSLSEQQFEDCD-TTDSGCNGGWMDNAFSFAKKNS-ICTEGSY 197
Query: 229 PYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
PY + C+ S + + GY DVS E ++ AVA QPVS+AIEA +FQ Y
Sbjct: 198 PYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLY 257
Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
SGV T CG+ LDHGV+AVGYG+E G DYW V+NSWGS WGE GYV+LQR G+
Sbjct: 258 SSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRG--KGGAGE 315
Query: 347 CG-IAMEASYPV 357
CG +A SYPV
Sbjct: 316 CGLLAGPPSYPV 327
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 155/357 (43%), Positives = 208/357 (58%), Gaps = 31/357 (8%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEV-MTIYQTWLAKHGKTSNGMGH 68
+++ V L + + M +Y NN +DD V M +++ W+AK GKT G
Sbjct: 7 MASAVLLVVCTLMALQAMGADAYYNNG-------SDDGVTMQMFEEWMAKFGKTYKCHGE 59
Query: 69 NEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
E RF IF+DN+ FI + TY VG+N+FADLTN+E+ A Y G +
Sbjct: 60 KEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQFADLTNDEFVATYTG-----------A 107
Query: 127 KVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
K + A + D + P +DWR +GAV VKDQG+CGSCWAF+ VAA+EG+ KI TG+
Sbjct: 108 KPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQ 167
Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRR 242
L LSEQELVDCD N GC GG D AF+ + GG+ +E DY Y G + KC D
Sbjct: 168 LTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLF 226
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
N I GY V P DE L AVA QPV+V I+A G AFQ Y+SGVF G CG++ +H
Sbjct: 227 N-HAARIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHA 285
Query: 303 VVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V VGY + +G YW+ +NSWG WG+ GY+ L++++L + G CG+A+ YP
Sbjct: 286 VTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPH-GTCGLAVSPFYPT 341
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 150/323 (46%), Positives = 197/323 (60%), Gaps = 24/323 (7%)
Query: 44 TDDEV-MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKF 100
+DD V M +++ W+AK GKT G E RF IF+DN+ FI + TY VG+N+F
Sbjct: 11 SDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQF 69
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKD 158
ADLTN+E+ A Y G +K + A + D + P +DWR +GAV VKD
Sbjct: 70 ADLTNDEFVATYTG-----------AKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKD 118
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG+CGSCWAF+ VAA+EG+ KI TG+L LSEQELVDCD N GC GG D AF+ +
Sbjct: 119 QGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVAS 177
Query: 219 NGGMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
GG+ +E DY Y G + KC D N SI GY V P DE L AVA QPV+V I
Sbjct: 178 KGGITAESDYRYEGFQGKCRVDDMLFN-HAASIGGYRAVPPNDERQLATAVARQPVTVYI 236
Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVK 334
+A G AFQ Y+SGVF G CG++ +H V VGY + +G YW+ +NSWG WG+ GY+
Sbjct: 237 DASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYIL 296
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
L++++L + G CG+A+ YP
Sbjct: 297 LEKDVLQPH-GTCGLAVSPFYPT 318
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 141/362 (38%), Positives = 215/362 (59%), Gaps = 19/362 (5%)
Query: 3 TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
T FL + ++ F + + ++ + ++ +M +Y+ W + H +
Sbjct: 2 TVMKFLIVPLVLIAFLCNICESFELE----------RKDFESEKSLMQLYKRW-SSHHRI 50
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
S RF++FK+N + + + N + ++ K+ LN+FAD++++E+R MY + K
Sbjct: 51 SRNANEMHNRFKVFKNNAKHVFKVNLMGKSLKLKLNQFADMSDDEFRNMYSSNITYYKDL 110
Query: 123 LMKSKVASQR----YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
K A+ + + + +P S+DWR+KGAVN +K+QG CGSCWAF+ VAAVE I+
Sbjct: 111 HAKKIEATGGRIGGFMYEHANNIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIH 170
Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
+I T EL+SLSE+E++DCD + + GC GG + AF+F++ N G+ E +YPY C
Sbjct: 171 QIKTNELVSLSEEEVLDCDYR-DGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCR 229
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE--CG 296
K V IDGYE+V +E +L KAVA QPV+VAI +GG F+ Y G+FT CG
Sbjct: 230 RRGGRNKRVRIDGYENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCG 289
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+DH VV VGYGT+ DYW++RN +G WG NGY+K+QR + G CG+AM+ +YP
Sbjct: 290 FNIDHTVVVVGYGTDEDGDYWIIRNQYGHRWGMNGYMKMQRG-AHSPQGVCGMAMQPAYP 348
Query: 357 VK 358
VK
Sbjct: 349 VK 350
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 152/359 (42%), Positives = 217/359 (60%), Gaps = 25/359 (6%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDE--VMTIYQTWLAKHGKTSNGMG 67
+++++F+F ++I+S ++S T E V +Q W+ + + +
Sbjct: 1 MTSILFMF-------VSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDEL 53
Query: 68 HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRS---DAKRRL 123
+ RF +FK NL+FI++ N +RTYK+G+N+FAD T EE+ A + G +
Sbjct: 54 EKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEF 113
Query: 124 MKSKVASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
+ + S + AG PE DWR +GAV PVK QG CG CWAFS+VAAVEG+ KIV
Sbjct: 114 VDEMIPSWNWNVSDVAG---PEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIV 170
Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
G L+SLSEQ+L+DCDR+ + GCNGG+M AF +II+N G+ SE YPY E C R
Sbjct: 171 GGNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTC---R 227
Query: 242 RNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSAL 299
NAK + I G++ V +E +L +AV+ QPVSV+I+A G F HY GV+ CG+ +
Sbjct: 228 YNAKPSAWIRGFQTVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDV 287
Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+H V VGYGT G+ YWL +NSWG WGENGY++++R++ G CG+A A YPV
Sbjct: 288 NHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQ-GMCGVAQYAFYPV 345
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 129/229 (56%), Positives = 165/229 (72%), Gaps = 5/229 (2%)
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
RY + D +P ++DWR GAV P+KDQG CG CWAFS VAA EGI KI TG+LISLSEQ
Sbjct: 7 RYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQ 66
Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
ELVDCD + GC GGLMD AF+FII+NGG+ +E +YPY A+ KC +A +I
Sbjct: 67 ELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNSA--ANIK 124
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG- 309
GYEDV DE +L KAVA+QPVSVA++ G FQ Y GV TG CG+ LDHG+ A+GYG
Sbjct: 125 GYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 184
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
T +G YWL++NSWG+ WGENGY+++++++ D G CG+A+E SYP +
Sbjct: 185 TSDGTKYWLMKNSWGTTWGENGYLRMEKDISD-KKGMCGLAIEPSYPTE 232
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 197/312 (63%), Gaps = 20/312 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+ W HGKT G + +R I+ DNL + +HN+ N +YK+ +N FADLT E++
Sbjct: 27 WHAWKDFHGKTYTGEEEDLRR-AIWNDNLEIVKKHNAENHSYKLDMNHFADLTVTEFKQR 85
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
++G R+ + + + + +LP VDWR+KG V VK+QG CGSCWAFS+
Sbjct: 86 FMGYRAAS------NSTGGSTFLPLSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSST 139
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
++EG + TG+L+SLSEQ LVDC +K N GC GGLMDYAF++I N G+D+EQ YPY
Sbjct: 140 GSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNNGCEGGLMDYAFKYIKNNDGIDTEQSYPY 199
Query: 231 LGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
+ +C P A V GY DV E L+ AVA P+SVAI+AG +FQ Y+
Sbjct: 200 TARDGQCHFKPGSVGATVT---GYTDVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYK 256
Query: 288 SGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
+GV++ +C S LDHGV+AVGYG E+G DYWLV+NSWG WG NGY+K+ RN
Sbjct: 257 TGVYSEPDCSSTQLDHGVLAVGYGAEDGKDYWLVKNSWGEGWGMNGYIKMSRN----KDN 312
Query: 346 KCGIAMEASYPV 357
+CGIA +ASYP+
Sbjct: 313 QCGIATQASYPL 324
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 192/311 (61%), Gaps = 12/311 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
++ W GK+ + R +++ N +D HN +Y +G+N FADLT+EE++
Sbjct: 30 FEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKR 89
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
YLGT+ D R +S +S LP+SVDWR G V PVKDQG CGSCW+FST
Sbjct: 90 FYLGTKVDLNRP--RSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFST 147
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
+VEG + TG+L+SLSEQ LVDC + + N GCNGGLMD AFQ+II N G+D+E YP
Sbjct: 148 TGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYP 207
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
Y + C + N ++ ++D++ E L+ AVA PVSVAI+A +FQ Y S
Sbjct: 208 YTAKDGTCKFNAANVG-ATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTS 266
Query: 289 GVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
GV+ +C S +LDHGV+A GYGT NG YWLV+NSWGS WG+ GY+ + RN +
Sbjct: 267 GVYNEKKCSSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAGYIWMSRNA----NNQ 322
Query: 347 CGIAMEASYPV 357
CGIA ASYP+
Sbjct: 323 CGIATSASYPI 333
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 143/324 (44%), Positives = 199/324 (61%), Gaps = 13/324 (4%)
Query: 45 DDEVMT-IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-----RTYKVGLN 98
DD+ M Y+ W+A+ G+T +RF++FK N FID HN+ K+ N
Sbjct: 12 DDKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTN 71
Query: 99 KFADLTNEEYRAMYL-GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
KFADLT +E+R +Y+ G R + + + + + A D +P S+DWR +GAV VK
Sbjct: 72 KFADLTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSD-VPPSIDWRARGAVTSVK 130
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQ C CWAFS+ AAVEGI++I TG +SLS Q+LVDC N C G +D A+++I
Sbjct: 131 DQHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYEYIA 190
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
++GG+ ++QDYPY G C + A V I G++ V +E +L AVA QPVSVA++
Sbjct: 191 RSGGLVADQDYPYEGHSGTCRVYGKQA-VARISGFQYVPARNETALLLAVAHQPVSVALD 249
Query: 278 AGGRAFQHYESGVF--TGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
RA QH +G+F GE C + L+H + VGYGT E+G YWL++NSWGSDWG+ GYV
Sbjct: 250 GLSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYV 309
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
K R++ G CG+A+EASYPV
Sbjct: 310 KFARDVASEINGVCGLALEASYPV 333
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 197/318 (61%), Gaps = 18/318 (5%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEK---RFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
D + ++ W+ H K+ + NE+ R+ ++++N FI E N N +Y + +NKF D
Sbjct: 24 DPLTGVFADWMRTHTKSYS----NEEFVFRWNVWRENYNFIQEENRKNNSYYLTMNKFGD 79
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LTN E+ +Y G D ++K+K A+ LP + DWR+KGAV VK+QG C
Sbjct: 80 LTNAEFNKVYKGLAFDYSAHILKAKAATPA---APAPGLPANFDWRQKGAVTHVKNQGQC 136
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
GSCW+FST + EG N + G L+SLSEQ L+DC N GCNGGLMDYAF++II N G
Sbjct: 137 GSCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKG 196
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+D+E YPY A+ C + N+ S+ Y DVS DE +L AVA +P SVAI+A
Sbjct: 197 IDTEASYPYETAQYNCRYNPANSG-GSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHN 255
Query: 282 AFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
+FQ Y GV + C S LDHGV+AVG+GTENG DYWLV+NSWG+DWG GY+K+ RN
Sbjct: 256 SFQFYSGGVYYESSCSSTQLDHGVLAVGWGTENGQDYWLVKNSWGADWGLQGYIKMARNR 315
Query: 340 LDTNTGKCGIAMEASYPV 357
+ CGIA ASYP
Sbjct: 316 HN----NCGIATAASYPT 329
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 192/316 (60%), Gaps = 12/316 (3%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
DE +Q W H K + R I++DNL+ I +HN+ ++ + +N DLT
Sbjct: 22 DEDEQQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEGHSFTLAMNHLGDLTQ 81
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
+E+R Y G RS K A + + ++P++VDWR++G V PVK+QG CGSC
Sbjct: 82 DEFRYFYTGMRSHYSNYTKKQGSA---FLAPSHVQVPDTVDWRKEGYVTPVKNQGQCGSC 138
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
WAFST ++EG N TG+L+SLSEQ LVDC N GC GGLMDYAF++I +NGG+D+
Sbjct: 139 WAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQGGLMDYAFKYIKENGGIDT 198
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAF 283
E+ YPY ++C + N V G+ DV+ DE +LK A P+SVAI+AG +F
Sbjct: 199 EESYPYEARNDRCRFQKSNIGAVDT-GFVDVTHGDEEALKTAAGTVGPISVAIDAGHMSF 257
Query: 284 QHYESGVF--TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Q Y SGV+ G ++LDHGV+ VGYGT G DYWLV+NSWG WG GY+ + RN
Sbjct: 258 QFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQGSDYWLVKNSWGERWGMEGYIMMSRN--- 314
Query: 342 TNTGKCGIAMEASYPV 357
+CG+A +ASYP+
Sbjct: 315 -KNNQCGVATQASYPL 329
>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
C-169]
Length = 387
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 153/325 (47%), Positives = 192/325 (59%), Gaps = 41/325 (12%)
Query: 72 RFQIFKDNLRFIDEHNSLNRTYK------------------------------VGLNKFA 101
R IFK N+ +I NS ++Y+ +GLN+FA
Sbjct: 20 RLNIFKTNVDYITSVNSAQQSYQASKHFSENTQQTALSSLFLSQLAHTDLLPQLGLNEFA 79
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP-ESVDWREKGAVNPVKDQG 160
D T EE+ + +LG + S R+A D P S++W E GAV PVK+Q
Sbjct: 80 DQTWEEFSSTHLGLNAGEDGSFRSSANTGFRHA----DVTPANSINWVEAGAVTPVKNQA 135
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFST +VEG N + TG+L+SLSEQ+LVDCD K + GC GGLMDYAF +II+NG
Sbjct: 136 FCGSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTKKDQGCGGGLMDYAFDYIIKNG 195
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+D+E+DY Y C+ R VVSIDGYEDV DE++L KAV+ QPVSVAI A
Sbjct: 196 GLDTEEDYSYWSVGGFCNKLREERTVVSIDGYEDVPVNDEVALAKAVSKQPVSVAICA-S 254
Query: 281 RAFQHYESGVFT--GECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
A Q Y SGV G C L+HGV+A GY E+G YWLV+NSWG WG GY+KL++
Sbjct: 255 EAMQFYSSGVIAAKGSC-IGLNHGVLAAGYDVDESGKPYWLVKNSWGGTWGMQGYMKLEK 313
Query: 338 NLLDTNTGKCGIAMEASYPVKNSQN 362
+ G CGIAM ASYPVK+S N
Sbjct: 314 D-SSVKEGACGIAMAASYPVKSSPN 337
>gi|110743577|dbj|BAE98346.1| RD21A-like cysteine protease [Triticum aestivum]
Length = 184
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 134/184 (72%), Positives = 156/184 (84%), Gaps = 1/184 (0%)
Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-R 198
ELPES+DWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+CD
Sbjct: 1 ELPESIDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDIN 60
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
++GCNGGLMD AF+FII+NGG+D+E DYPY + +CD R+NAKVVSIDG+EDV
Sbjct: 61 GGSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPEN 120
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTENG DYW+
Sbjct: 121 DEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWI 180
Query: 319 VRNS 322
VRNS
Sbjct: 181 VRNS 184
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 199/313 (63%), Gaps = 18/313 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W K+G + +K FQIFK N+ +ID N+ N+ YK+ +N+F D E+
Sbjct: 42 FEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPIED--- 98
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
SD + + + + ++P +VDWR++GAV P+K+QG CGSCWAFS
Sbjct: 99 ------SDDGFERTTTTTPTTTFKYENVTDIPATVDWRKRGAVTPIKNQGKCGSCWAFSA 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAA+EGI KI +G L+SLSEQ+LVDCDR GC+ G M AF+FI++NGG+ +E +YP
Sbjct: 153 VAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIATEANYP 212
Query: 230 YLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
Y + C ++ + V I YE+V E SL KAVA+QPVSV I+ G F+ Y S
Sbjct: 213 YKRVVKGTC---KKVSHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRG-MFKFYSS 268
Query: 289 GVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
G+FTGECG+ +H + VGYGT ++G+ YWLV+NSW WGE GY++++R+ +D G C
Sbjct: 269 GIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRD-IDAKEGLC 327
Query: 348 GIAMEASYPVKNS 360
GIAM+ SYP+ N+
Sbjct: 328 GIAMKPSYPIINN 340
>gi|356557743|ref|XP_003547170.1| PREDICTED: LOW QUALITY PROTEIN: xylem cysteine proteinase 1-like
[Glycine max]
Length = 400
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 154/367 (41%), Positives = 223/367 (60%), Gaps = 26/367 (7%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHS------SSWRTDDEVMTIYQTWLAKH 59
M + T +FL FI S S + YD ++S + +++ V+ ++Q W ++
Sbjct: 1 MGCQLKTHLFLLFIVWGS---WSFLCYDLPSEYSILALEIDKFPSEEGVVELFQRWKEEN 57
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTR 116
K + RF+ FK NL++I E NS + +GLN+FAD++NEE+++ ++
Sbjct: 58 KKIYRNPEEEKLRFENFKRNLKYIVEKNSKRISPYGQSLGLNQFADMSNEEFKSKFM--- 114
Query: 117 SDAKRRLMKSK-VASQRYACKAGDELPESVDWREKGAVN-PVKDQGSCGSCWAFSTVAAV 174
S K+ K V+S+ ++C+ DE P S+DWR+KG V VKDQG CGS WAFS+ A+
Sbjct: 115 SKVKKPFSKRNGVSSKDHSCE--DE-PYSLDWRKKGVVTLAVKDQGYCGSYWAFSSTDAI 171
Query: 175 EGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
EGIN IVT +LISLSEQELVDCD N GC+GG MDYAF++++ NGG+D+E +YPY+GA+
Sbjct: 172 EGINAIVTADLISLSEQELVDCD-STNDGCDGGXMDYAFEWVMYNGGIDTETNYPYIGAD 230
Query: 235 NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
C+ ++ KV+ IDGY DV D SL A QP+S I+ FQ Y G++ G+
Sbjct: 231 GTCNVTKEKTKVIGIDGYYDVGQSDS-SLLCATVKQPISAGIDGTSWDFQLYIGGIYDGD 289
Query: 295 CGS---ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
C S +DH ++ VGYG+E DYW+V+NSW + WG G + L++N + G C I
Sbjct: 290 CSSDPDDIDHAILVVGYGSEGDDDYWIVKNSWRTSWGMEGCIYLRKN-TNLKYGXCAINY 348
Query: 352 EASYPVK 358
ASYP K
Sbjct: 349 MASYPTK 355
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 148/339 (43%), Positives = 203/339 (59%), Gaps = 19/339 (5%)
Query: 9 AISTLVFL-FFISSSSAADMSIISYDNNHDHSS-----SWRTDDEVMTIYQTWLAKHGKT 62
A+S LVF F I D + DH + W+ ++ + ++ A +GK+
Sbjct: 71 AVSLLVFASFLIQWQGDDDRGVFPPSPVEDHKTPVNIWEWK-EEHFQNAFGSFRATYGKS 129
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
+KR+ IFK+NL +I HN +Y + +N F DL+ EE+R YLG K R
Sbjct: 130 YATEEETQKRYAIFKNNLAYIHTHNQQGYSYSLKMNHFGDLSREEFRRKYLGYN---KSR 186
Query: 123 LMKSK---VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
+KS VA++ D +P +VDWREKG V PVKDQ CGSCWAFS A+EG +
Sbjct: 187 NLKSNNLGVATELLKVSPSD-VPSAVDWREKGCVTPVKDQRDCGSCWAFSATGALEGAHC 245
Query: 180 IVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
TGEL+SLSEQELVDC + N GC+GG M+ AFQ+++ +GG+ SE+ YPYL + +C
Sbjct: 246 AKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYLARDGEC- 304
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
R KVV+I G++DV E ++K A+A PVS+AIEA FQ Y GVF CG+
Sbjct: 305 -KRACKKVVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGVFDASCGTD 363
Query: 299 LDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKL 335
LDHGV+ VGYGT E D+W+++NSWGS WG +GY+ +
Sbjct: 364 LDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYM 402
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 198/314 (63%), Gaps = 15/314 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
++ + HGK R IF+DN + I EHN R+Y +G+N+F DL + E
Sbjct: 20 WEAFKLTHGKQYKSPDEENVRRAIFRDNNQMIKEHNQEAAMGRRSYFMGMNQFGDLAHSE 79
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
Y + +G L S + + G ++ ++VDWR+KGAV P+KDQG CGSCWA
Sbjct: 80 YLELVVGP---GLLPLNLSTPSENVFESTPGLQVDDTVDWRQKGAVTPIKDQGHCGSCWA 136
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FST ++EG + + TG+L+SLSEQ L+DC R+ N GC GGLMD AF++I NGG+D+E+
Sbjct: 137 FSTTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEE 196
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
YPY+ + K + + ++ Y D+ DEM+L +AV PVSVAI+A ++ +
Sbjct: 197 CYPYMAKDEKVCDYKTSCSGATLSSYTDIKAMDEMALMQAVGTVGPVSVAIDASHKSLRF 256
Query: 286 YESGVF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y+SG++ EC + LDHGV+AVGYG+ +G+DYWLV+NSWGS WG+ GYVK+ RN
Sbjct: 257 YKSGIYDEPECSRTKLDHGVLAVGYGSMDGMDYWLVKNSWGSAWGDMGYVKMTRN----K 312
Query: 344 TGKCGIAMEASYPV 357
+CGIA +ASYPV
Sbjct: 313 NNQCGIATKASYPV 326
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 141/287 (49%), Positives = 191/287 (66%), Gaps = 20/287 (6%)
Query: 76 FKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA 134
FK+N+ +I+ +N+ N+ YK G+N+FA R+ K + S + +
Sbjct: 58 FKENVNYIEACNNAANKPYKRGINQFA-------------PRNRFKGHMCSSIIRITTFK 104
Query: 135 CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELV 194
+ P +VD R+KGAV P+KDQG CG CWAFS VAA EGI+ + G+LISLSEQELV
Sbjct: 105 FENVTATPSTVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELV 164
Query: 195 DCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP-YLGAENKCDPSRRNAKVVS-IDG 251
DCD K ++ GC GGLMD AF+FIIQN G+ P Y+G + KC+ + + I G
Sbjct: 165 DCDTKGVDXGCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITG 224
Query: 252 YEDVSPFDEMS-LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG- 309
YEDV +E + L+KAVA+ PVS AI+A G FQ Y+SGVFTG CG+ LDHGV AVGYG
Sbjct: 225 YEDVPANNEKAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGV 284
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+++G +YWLV+NSWG++WGE GY+++QR +D+ CGIA++ASYP
Sbjct: 285 SDDGTEYWLVKNSWGTEWGEEGYIRMQRG-VDSEEALCGIAVQASYP 330
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 144/358 (40%), Positives = 210/358 (58%), Gaps = 14/358 (3%)
Query: 9 AISTLVFL-FFISSSSAADMSIISYDNNHDHSSS-----WRTDDEVMTIYQTWLAKHGKT 62
A+S LVF F I D ++ DH W+ + + ++ A + K+
Sbjct: 69 AVSLLVFASFLIQWQGEDDRAVFPPSPVEDHQPPANIWEWK-EAHFQDAFSSFQAMYAKS 127
Query: 63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
++R+ IFK+NL +I HN +Y + +N F DL+ +E+R YLG + +
Sbjct: 128 YATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLGFKKSRNLK 187
Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
VA++ ELP VDWR +G V PVKDQ CGSCWAFST A+EG + T
Sbjct: 188 SHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKT 246
Query: 183 GELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
G+L+SLSEQEL+DC R + N C+GG M+ AFQ+++ +GG+ SE YPYL + +C ++
Sbjct: 247 GKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEECR-AQ 305
Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
KVV I G++DV E ++K A+A PVS+AIEA FQ Y GVF CG+ LDH
Sbjct: 306 SCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCGTDLDH 365
Query: 302 GVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GV+ VGYGT E+ D+W+++NSWG+ WG +GY+ + + G+CG+ ++AS+PV
Sbjct: 366 GVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMA--MHKGEEGQCGLLLDASFPV 421
>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 492
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 145/326 (44%), Positives = 199/326 (61%), Gaps = 37/326 (11%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
+T E + + +WL H T + KR + + N +I HN ++K+G N F+
Sbjct: 24 KTFKEYESDFVSWLKTHHLTFSDAFEYAKRLETYIANDIYILTHNLQESSFKLGHNAFSH 83
Query: 103 LTNEEYRAMYLGTRSD---AKRRLMKSKVASQ---RYACKAGDELPESVDWREKGAVNPV 156
LTNEE+R + G ++ +RL +S VAS +Y +LPESVDW EKGAV V
Sbjct: 84 LTNEEFRQRFNGFKASDDYLTKRLAQSNVASSTNFQYI-----DLPESVDWVEKGAVTGV 138
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
K+QG CGSCWAFST A+EG I +G+L+SLSEQELVDCD + GCNGGLMD+AF +I
Sbjct: 139 KNQGMCGSCWAFSTTGAIEGATFISSGKLVSLSEQELVDCDHNGDHGCNGGLMDHAFSWI 198
Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
++ G+ SE+DY Y+ +++ C R VVS PV+VAI
Sbjct: 199 SEHDGICSEEDYAYIHSQSLC---RSCKPVVS----------------------PVAVAI 233
Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
+AG R+FQ Y+SGV+ CG+ LDHGV+ VGYG E+G YW V+NSWG+ WGE GY++L
Sbjct: 234 DAGDRSFQFYQSGVYNKTCGTQLDHGVLTVGYGVEDGQKYWKVKNSWGNSWGEKGYIRLS 293
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQN 362
R+ + +G+CGIAM SYP + +N
Sbjct: 294 RD-QNGRSGQCGIAMVPSYPTASLRN 318
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 157/355 (44%), Positives = 214/355 (60%), Gaps = 11/355 (3%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
+++STL+ L ++ SSA + D +++ ++ W+A+HG+T
Sbjct: 1 MSLSTLI-LALLAMSSAVAAPRALAARQLAGDEAITVDSAMVSRHEKWMAEHGRTYANEE 59
Query: 68 HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
+R ++F+ N + ID NS + T+++ N+FADLT+EE+RA G R
Sbjct: 60 EKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAARTGLRRPPAAAAGAG 119
Query: 127 KVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
A RY + + S+DWR GAV VKDQGSCG CWAFS VAAVEG+ KI TG L
Sbjct: 120 SGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLTKIRTGRL 179
Query: 186 ISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
+SLSEQ+LVDCD + GC GGLMD AF+++I GG+ +E YPY G + C RR+A
Sbjct: 180 VSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGLTTESSYPYRGTDGSC---RRSA 236
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSALDHGV 303
SI GYEDV +E +L AVA QPVSVAI G F+ Y+SGV G CG+ L+H +
Sbjct: 237 SAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDSVFRFYDSGVLGGSGCGTELNHAI 296
Query: 304 VAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
AVGYGT +G YW+++NSWG WGE GYV+++R + G CG+A ASYPV
Sbjct: 297 TAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGV--RGEGVCGLAQLASYPV 349
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 133/286 (46%), Positives = 181/286 (63%), Gaps = 4/286 (1%)
Query: 72 RFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
RF++F N + I+ HN + ++ +G N+++ LT +E++ + G R ++K A
Sbjct: 47 RFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRTGLRVSPSYIQSRAKYAL 106
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
A D +P +DW E+G V PVK+QG CGSCWAFST A+EG + + +L+S+SE
Sbjct: 107 MAPAVNMTD-VPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSKQLVSVSE 165
Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
QELVDCD + GCNGGLMD AF+++ + G+ E+DYPY E C ++ V +
Sbjct: 166 QELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPYHAKEGTC-ALKKCKPVTKVT 224
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
+ DV DE +LK AVA QPVSVAIEA FQ Y+SGVF CG+ LDHGV+ VGYG
Sbjct: 225 AFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSGVFDKSCGTKLDHGVLVVGYGE 284
Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
E G YW V+NSWG+DWG+ GY+KL R TG+CG+AM SYP
Sbjct: 285 EGGKKYWKVKNSWGADWGDKGYIKLARE-FGPETGQCGVAMVPSYP 329
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 138/307 (44%), Positives = 193/307 (62%), Gaps = 10/307 (3%)
Query: 56 LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLG 114
+A++G+ +RFQIFK+N+ I+ N+ N +Y +G+NKF D+TN E+ A Y G
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
S R L K + + +S+DWR+ GAV VKDQ CGSCWAFS +A V
Sbjct: 61 GIS---RPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATV 117
Query: 175 EGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
EGI KIVTG L+SLSEQE++DC ++ GC+GG +D A+ FII N G+ SE DYPY +
Sbjct: 118 EGIYKIVTGYLVSLSEQEVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQ 175
Query: 235 NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
C + I GY V DE S+K AV +QP++ AI+A G FQ+Y GVF+G
Sbjct: 176 GDC-AANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGP 234
Query: 295 CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEA 353
CG++L+H + +GYG + +G YW+V+NSWGS WGE GY+++ R + +++G CGIAM+
Sbjct: 235 CGTSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGV--SSSGLCGIAMDP 292
Query: 354 SYPVKNS 360
YP S
Sbjct: 293 LYPTLQS 299
>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
Length = 218
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 126/219 (57%), Positives = 158/219 (72%), Gaps = 3/219 (1%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LP VDWR GAV +K QG CG CWAFS +A VEGINKIVTG LISLSEQEL+DC R
Sbjct: 1 LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQ 60
Query: 201 NA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
N GCNGG + FQFII NGG+++E++YPY + +C+ +N K V+ID YE+V +
Sbjct: 61 NTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNN 120
Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
E +L+ AV QPVSVA++A G AF+ Y SG+FTG CG+A+DH V VGYGTE G+DYW+V
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIV 180
Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 181 KNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 217
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 153/323 (47%), Positives = 203/323 (62%), Gaps = 22/323 (6%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADL 103
+D V ++ W+A+HG+T E+RF IFK NL+ I+ +N+ NRTYK+GLN FADL
Sbjct: 31 EDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNLKHIENFNNAFNRTYKLGLNHFADL 90
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL-----PESVDWREKGAVNPVKD 158
T+EE+ A Y G + + L + + ++ ++ D L PES+DWR +G V PVK+
Sbjct: 91 TDEEFLATYTGYK--MPKVLPTANITTK--TTQSSDVLYEANVPESIDWRTRGVVTPVKN 146
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
QG CG CWAFS AAVEGI G +SLS Q+L+DC N GCNGG MD AF++IIQ
Sbjct: 147 QGRCGCCWAFSAAAAVEGI----IGNGVSLSAQQLLDCVPDSN-GCNGGFMDNAFRYIIQ 201
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
N G+ S YPY C PS A+ I GY DV+P DE +LK AVA QPVS A++A
Sbjct: 202 NQGLASATYYPYQLMREMCRPSNNAAR---ISGYVDVTPADEETLKSAVARQPVSAAVDA 258
Query: 279 GGRA-FQHYESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKL 335
F++Y G+F + CGS L H + VGYGT G YWL++NSWG WGE GY++L
Sbjct: 259 TSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAEGTKYWLIKNSWGEGWGEGGYMRL 318
Query: 336 QRNLLDTNTGKCGIAMEASYPVK 358
QR+ + + G CGIA+ ASYP +
Sbjct: 319 QRD-VGSYGGACGIALRASYPTR 340
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/355 (40%), Positives = 213/355 (60%), Gaps = 20/355 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVFLF + A S S D +D +M ++ W+A++G+ +R
Sbjct: 7 LVFLFLFLCAMWASPSAASRD---------EPNDPMMKRFEEWMAEYGRVYKDDDEKMRR 57
Query: 73 FQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N++ I+ NS N +Y +G+N+F D+T E+ A Y G + + V S
Sbjct: 58 FQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGV--SLPLNIEREPVVS- 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ +P+S+DWR+ GAVN VK+Q CGSCW+F+ +A VEGI KI TG L+SLSEQ
Sbjct: 115 -FDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQ 173
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
E++DC ++ GC GG ++ A+ FII N G+ +E++YPYL + C+ + I G
Sbjct: 174 EVLDC--AVSYGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCN-ANSFPNSAYITG 230
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
Y V DE S+ AV++QP++ I+A FQ+Y GVF+G CG++L+H + +GYG +
Sbjct: 231 YSYVRRNDERSMMYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQD 289
Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
+G YW+VRNSWGS WGE GYV++ R + +++G CGIAM +P S +A+
Sbjct: 290 SSGTKYWIVRNSWGSSWGEGGYVRMARG-VSSSSGVCGIAMAPLFPTLQSGANAE 343
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 193/314 (61%), Gaps = 7/314 (2%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNE 106
++ +Q W+ + + + + R Q+ +NL+FI+ N++ N++YK+G+N+F D T E
Sbjct: 35 IVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKE 94
Query: 107 EYRAMYLGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
E+ A Y G R + + D L + DWR +GAV PVK QG CG C
Sbjct: 95 EFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGGC 154
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS +AAVEG+ KI G LISLSEQ+L+DC R+ N GC GG AF +II++ G+ SE
Sbjct: 155 WAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISSE 214
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
+YPY E C + R A + I G+E+V +E +L +AV+ QPV+VAI+A F H
Sbjct: 215 NEYPYQVKEGPCRSNARPA--ILIRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFVH 272
Query: 286 YESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y GV+ CG++++H V VGYGT G+ YWL +NSWG WGENGY++++R+ ++
Sbjct: 273 YSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRD-VEWP 331
Query: 344 TGKCGIAMEASYPV 357
G CG+A ASYPV
Sbjct: 332 QGMCGVAQYASYPV 345
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 133/271 (49%), Positives = 181/271 (66%), Gaps = 28/271 (10%)
Query: 89 LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWR 148
++++YK+ +N+FADLTNEE+ +R+ K + ++ S +Y + +P + DWR
Sbjct: 1 MDKSYKLSINEFADLTNEEFGT----SRNRFKAHICSTEATSFKY--ENVTAVPSTXDWR 54
Query: 149 EKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGG 207
+KGAV P+KDQG CGSCWAFS VAA+EGI ++ TG+LISLSEQELVDCD + GC G
Sbjct: 55 KKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGA 114
Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
+YPY G + C+ + I+GYEDV +E +L+KAV
Sbjct: 115 -------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAV 155
Query: 268 ADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSD 326
A QP++VAI+AGG FQ Y SGVFTG+CG+ LDHGV AVGYGT ++G+ YWLV+NSWG+
Sbjct: 156 AHQPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTG 215
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WGE GY+++QR++ G CGIAM+ASYP
Sbjct: 216 WGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 245
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 206/322 (63%), Gaps = 11/322 (3%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLT 104
D +M ++ W+A++G+ N +RFQIFK+N+ I+ N+ + +Y +G+N+F D+T
Sbjct: 4 DPMMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMT 63
Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
N E+ A Y G + + + V S + +P+S+DWR+ GAV VK+QGSCGS
Sbjct: 64 NNEFLARYTG--ASLPLNIERDPVVS--FDDVDISAVPQSIDWRDYGAVTSVKNQGSCGS 119
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CWAFS +A VEGI KI G LISLSEQE++DC ++ GC+GG ++ A+ FII N G+ S
Sbjct: 120 CWAFSAIATVEGIYKIKAGNLISLSEQEVLDC--ALSYGCDGGWVNKAYDFIISNNGVTS 177
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
+ PY G + C+ + K I GY V +E S+ AVA+QP++ I+AGG FQ
Sbjct: 178 FANLPYKGYKGPCNHNDLPNKAY-ITGYTYVQSNNERSMMIAVANQPIAALIDAGGD-FQ 235
Query: 285 HYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
+Y+SGVFTG CG++L+H + +GYG T +G YW+V+NSWG+ WGE GY+++ R+ + +
Sbjct: 236 YYKSGVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARD-VSSP 294
Query: 344 TGKCGIAMEASYPVKNSQNSAK 365
G CGIAM +P S +A+
Sbjct: 295 YGLCGIAMAPLFPTLQSGANAE 316
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/318 (45%), Positives = 198/318 (62%), Gaps = 18/318 (5%)
Query: 45 DDEVMTI---YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKF 100
+D+ +T+ Y+ W K+ EK QIFK N+ +ID N+ N++YK+ +N+F
Sbjct: 29 NDQSLTLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAYIDSFNAAGNKSYKLTINRF 88
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
ADL E + KR+L S + K ++P +VDWR++GAV PVK+Q
Sbjct: 89 ADLPTEPSDDGF------KKRKL--EPTTSSLFKYKNITDIPAAVDWRKRGAVTPVKNQR 140
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
CGSCWAFS V A+EGI +I +G L+SLSEQELVD R GCNGG + AF+F+++N
Sbjct: 141 ECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRSNWTNGCNGGYLIDAFEFVLEN 200
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+ +E YPY G K + S++ ++ V I YE V E SL K VA+QPVSV I+
Sbjct: 201 GGIATEASYPYRGV--KGNNSKKVSRQVQIKSYEQVPRNSEDSLLKVVANQPVSVGIDIS 258
Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRN 338
G + Y SG+FTGECG+ +H V+ VGYGT N G YWLV+NSWG WGE Y++++R+
Sbjct: 259 G-MIRFYSSGIFTGECGTKPNHAVIIVGYGTSNDGTKYWLVKNSWGIRWGEKRYIRMKRD 317
Query: 339 LLDTNTGKCGIAMEASYP 356
+D G CGI M+ASYP
Sbjct: 318 -IDAKEGLCGIPMDASYP 334
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 147/314 (46%), Positives = 191/314 (60%), Gaps = 17/314 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+ W +HGK R I++ NL + +HN + TY +G+N+F DL NEE
Sbjct: 28 WNEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQFTDLQNEE 87
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ AM G R + K S ELP++VDWR KG V PVKDQG CGSCWA
Sbjct: 88 FVAMMTGFRVSGTSKAAK---GSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWA 144
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
FST +VEG + TG+L+SLSEQ LVDC + +AGC+GG MD AFQ+II GG+D+E
Sbjct: 145 FSTTGSVEGQHFKATGKLVSLSEQNLVDCSGR-DAGCDGGFMDRAFQYIIDAGGIDTEAS 203
Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHY 286
YPY + KC + N ++ GY DV+ E +L+KAVA P+SVAI+A +FQHY
Sbjct: 204 YPYKAVDGKCHFKKANVG-ATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHMSFQHY 262
Query: 287 ESGVFT--GECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
+SGV+ G + LDHGV+AVGYGT +G DYW+V+NSW WG NGYV + RN
Sbjct: 263 KSGVYNEPGCDSTVLDHGVLAVGYGTSSDGTDYWIVKNSWAETWGMNGYVWMSRN----K 318
Query: 344 TGKCGIAMEASYPV 357
+CGIA ASYP+
Sbjct: 319 DNQCGIATNASYPL 332
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 156/355 (43%), Positives = 213/355 (60%), Gaps = 11/355 (3%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
+++STL+ L ++ SSA + D +++ ++ W+A+HG+T
Sbjct: 1 MSLSTLI-LALLAMSSAVAAPRALAARQLAGDEAITVDAAMVSRHEKWMAEHGRTYANEE 59
Query: 68 HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
+R ++F+ N + ID NS + T+++ N+FADLT+EE+RA G R
Sbjct: 60 EKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAARTGLRRPPAAAAGAG 119
Query: 127 KVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
A RY + + S+DWR GAV VKDQGSCG CWAFS VAAVEG+ KI TG L
Sbjct: 120 SGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLTKIRTGRL 179
Query: 186 ISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
+SLSEQ+LVDCD + GC GGLMD AF+++I GG+ +E YPY G + C RR+A
Sbjct: 180 VSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGLTTESSYPYRGTDGSC---RRSA 236
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSALDHGV 303
SI GYEDV +E +L AVA QPVSVAI G F+ Y+SGV G CG+ L+H +
Sbjct: 237 SAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDSVFRFYDSGVLGGSGCGTELNHAI 296
Query: 304 VAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
A GYGT +G YW+++NSWG WGE GYV+++R + G CG+A ASYPV
Sbjct: 297 TAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGV--RGEGVCGLAQLASYPV 349
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 142/350 (40%), Positives = 206/350 (58%), Gaps = 19/350 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVFLF A S S D D +M ++ W+A++G+ +R
Sbjct: 7 LVFLFLFLCVMWASPSAASRD---------EPSDPMMKRFEEWMAEYGRVYKDNDEKMRR 57
Query: 73 FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N+ I+ NS N +Y +G+N+F D+T E+ A Y G S + V+
Sbjct: 58 FQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPLNIEREPVVSFD 117
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
A +P+S+DWR+ GAVN VK+Q CGSCWAF+ +A VEGI KI TG L+SLSEQ
Sbjct: 118 DVNISA---VPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQ 174
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
E++DC ++ GC GG ++ A+ FII N G+ +E++YPY + C+ + I G
Sbjct: 175 EVLDC--AVSYGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCN-ANSFPNSAYITG 231
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
Y V DE S+ AV++QP++ I+A FQ+Y GVF+G CG++L+H + +GYG +
Sbjct: 232 YSYVRRNDERSMMYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQD 290
Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
+G YW+VRNSWGS WGE GYV++ R + +++G CGIAM +P S
Sbjct: 291 SSGTKYWIVRNSWGSSWGEGGYVRMARG-VSSSSGACGIAMSPLFPTLQS 339
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 197/315 (62%), Gaps = 16/315 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
+Q W+ + + + + RF +FK NL+FI++ N +RTYK+G+N+FAD T EE+ A
Sbjct: 47 HQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIA 106
Query: 111 MYLGTRS---DAKRRLMKSKVASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSC 165
+ G + + + S + AG E + DWR +GAV PVK QG CG C
Sbjct: 107 THTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRE---TKDWRYEGAVTPVKYQGQCGCC 163
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS+VAAVEG+ KIV L+SLSEQ+L+DCDR+ + GCNGG+M AF +II+N G+ SE
Sbjct: 164 WAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASE 223
Query: 226 QDYPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
YPY AE C R N K + I G++ V +E +L +AV+ QPVSV+I+A G F
Sbjct: 224 ASYPYQAAEGTC---RYNGKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFM 280
Query: 285 HYESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
HY GV+ CG+ ++H V VGYGT G+ YWL +NSWG WGENGY++++R++
Sbjct: 281 HYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWP 340
Query: 343 NTGKCGIAMEASYPV 357
G CG+A A YPV
Sbjct: 341 Q-GMCGVAQYAFYPV 354
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 203/329 (61%), Gaps = 20/329 (6%)
Query: 1 MATASMFLAISTLVFLFFISSS----SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
MAT S F S L+F+ S S SI+ Y + S+ ++++ ++ +W+
Sbjct: 1 MATISSF---SKLLFVAICLSVHMGLSYGAFSIVGYSPDDLTST-----EKLINLFDSWM 52
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
++ K + RF+IFKDNL++IDE N N TY +GL F DLTN+E++ Y+G
Sbjct: 53 VEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVG-- 110
Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
S + + + + +P S+DWR+KGAV PV++QGSCGSCW FS+VAAVEG
Sbjct: 111 SIPENWSTTEEPNDKEFIYDDVVNIPASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEG 170
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
INKIVTG+L+SLSEQEL+DC+R+ + GC GG YA Q+ + N G+ Q YPY G + +
Sbjct: 171 INKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VANSGIHLRQYYPYEGVQRQ 228
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C ++ V DG V +E +L + +A QPVS+ +EA GRAFQ+Y G+F G CG
Sbjct: 229 CRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGPCG 288
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGS 325
+++DH V AVGYG NG Y L++NSWG+
Sbjct: 289 TSIDHAVAAVGYG--NG--YILIKNSWGT 313
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 197/315 (62%), Gaps = 17/315 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+ W +HGK R I++ NL + +HN + TY +G+N+FADL NEE
Sbjct: 28 WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKNEE 87
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ AM G R + + K S ELP++VDWR KG V PVKDQG CGSCWA
Sbjct: 88 FVAMMTGFRVNGTSKAAK---GSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWA 144
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FST ++EG + TG+L+SLSEQ LVDC ++ N GC+GGLMD AFQ+II+ GG+D+E+
Sbjct: 145 FSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIKAGGIDTEE 204
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
YPY + +C + N ++ GY DV+ E +L+KAVA P+SVAI+A +FQ
Sbjct: 205 SYPYKAVDGECHFKKANIG-ATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQL 263
Query: 286 YESGVFT-GECGSA-LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Y+SGV+ +C S LDHGV+AVGYG T +G DYW+V+NSW WG NGY+ + RN
Sbjct: 264 YKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSRN---- 319
Query: 343 NTGKCGIAMEASYPV 357
+CGIA +ASYP+
Sbjct: 320 KDNQCGIATQASYPL 334
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 192/310 (61%), Gaps = 14/310 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ W + HGK + G ++ R +F N++ I HN+ T+K+ +N+F+DLT +E+
Sbjct: 25 WEAWKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHNA-KSTFKMAINEFSDLTRKEFVKT 83
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
Y G R K+ K + +P VDWR++G V P+K+QG CGSCWAFST
Sbjct: 84 YNGYRLSMKKSTNKPST----FMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTT 139
Query: 172 AAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
++EG + TG+L+SLSEQ L+DC + N GC GG MD AF++I N G+D+E YPY
Sbjct: 140 GSLEGQHFRKTGKLVSLSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGIDTEASYPY 199
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
G ++ C + N + GY D+ + E LK AVA P+SVAI+A ++F Y +G
Sbjct: 200 EGRDDICRYKKTNKGAIDT-GYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHMYHTG 258
Query: 290 VF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
V+ EC + LDHGV+ VGYGTENG DYWLV+NSWG+DWG NGY+K+ RN + C
Sbjct: 259 VYHEPECSQTVLDHGVLVVGYGTENGEDYWLVKNSWGTDWGMNGYIKMSRN----RSNNC 314
Query: 348 GIAMEASYPV 357
GIA ASYP+
Sbjct: 315 GIATNASYPL 324
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 146/351 (41%), Positives = 207/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N+ + S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-SGLCDIAKMSSYP 341
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 203/329 (61%), Gaps = 20/329 (6%)
Query: 1 MATASMFLAISTLVFLFFISSS----SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
MAT F S L+F+ S S SI+ Y + S+ ++++ ++ +W+
Sbjct: 1 MATIXSF---SKLLFVAICLSVHMGLSYGAFSIVGYSPDDLTST-----EKLINLFDSWM 52
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
++ K + RF+IFKDNL++IDE N N TY +GL F DLTN+E++ Y+G+
Sbjct: 53 VEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSI 112
Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
+ +S + + +P S+DWR+KGAV PV++QGSCGSCW FS+VAAVEG
Sbjct: 113 PENWSTTEESN--DKEFIYDDVVNIPASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEG 170
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
INKIVTG+L+SLSEQEL+DC+R+ + GC GG YA Q+ + N G+ Q YPY G + +
Sbjct: 171 INKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VANSGIHLRQYYPYEGVQRQ 228
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C ++ V DG V +E +L + +A QPVS+ +EA GRAFQ+Y G+F G CG
Sbjct: 229 CRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGPCG 288
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGS 325
+++DH V AVGYG NG Y L++NSWG+
Sbjct: 289 TSIDHAVAAVGYG--NG--YILIKNSWGT 313
>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 149/338 (44%), Positives = 193/338 (57%), Gaps = 39/338 (11%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+ WL +G E RF I++ N+ +I S +Y + NKFADLTNEE+ +
Sbjct: 5 FDRWLKXNGXNYEDKEEWEIRFVIYQANVEYIGCKKSQKNSYNLTDNKFADLTNEEFVST 64
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG-------- 163
YLG + RL + R+ LP S DWR++GAV +KDQG+CG
Sbjct: 65 YLGFAT----RL----IPHTRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWFSP 116
Query: 164 ---------------------SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKIN 201
S WAFS VAAVE INKI +G+L+SLSEQELVD D N
Sbjct: 117 EISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVANKN 176
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GC GGLMD F FI +NGG+ + +DYPY G + C+ + V+I GYE DE
Sbjct: 177 QGCEGGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERAPSKDEA 236
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
LK A A+QP+SVAI+AGG AFQ Y GVF+G CG L+HGV VGY Y V+N
Sbjct: 237 MLKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFDKYRTVKN 296
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
S G+DWGE+GY++++R+ D G CGIAM+ASYP+K+
Sbjct: 297 SXGADWGESGYIRMKRDAFD-KAGTCGIAMKASYPLKD 333
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 146/351 (41%), Positives = 208/351 (59%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N+ + S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T+EE+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK+QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C + ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCANRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT ENG YWL++NSWG+ WGE G++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNP-SGLCDIAKLSSYP 341
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 148/351 (42%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D +P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGGLM AF FII+NGG+ E DY YLG + C SR
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCR-SREKTAA 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGNCADQINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C IA +SYP
Sbjct: 292 GYGTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 197/315 (62%), Gaps = 16/315 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
+Q W+ + + + + RF +FK NL+FI++ N +RTYK+G+N+FAD T EE+ A
Sbjct: 23 HQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIA 82
Query: 111 MYLGTRS---DAKRRLMKSKVASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSC 165
+ G + + + S + AG E + DWR +GAV PVK QG CG C
Sbjct: 83 THTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRE---TKDWRYEGAVTPVKYQGQCGCC 139
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS+VAAVEG+ KIV L+SLSEQ+L+DCDR+ + GCNGG+M AF +II+N G+ SE
Sbjct: 140 WAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASE 199
Query: 226 QDYPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
YPY AE C R N K + I G++ V +E +L +AV+ QPVSV+I+A G F
Sbjct: 200 ASYPYQAAEGTC---RYNGKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFM 256
Query: 285 HYESGVFTGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
HY GV+ CG+ ++H V VGYGT G+ YWL +NSWG WGENGY++++R++
Sbjct: 257 HYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWP 316
Query: 343 NTGKCGIAMEASYPV 357
G CG+A A YPV
Sbjct: 317 Q-GMCGVAQYAFYPV 330
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 147/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N+ + S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T+EE+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
S + + D++P ++DWRE GAV VK+QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 PSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQGKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A Q Y G + G C + ++H V A+
Sbjct: 234 VQISNYQ-VVPEGETSLLQAVTKQPVSIGI-AASHDLQFYAGGTYDGSCANRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 146/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT ENG YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDENGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 146/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISIFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYSGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C IA +SYP
Sbjct: 292 GYGTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 146/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N + S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S V
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPV 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|167345242|gb|ABZ69061.1| cysteine protease [Pinus sylvestris]
Length = 214
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 125/206 (60%), Positives = 156/206 (75%), Gaps = 9/206 (4%)
Query: 21 SSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNL 80
S+S AD SIIS + R DD +M +Y+ WLA+H K NG+ +KRF +FKDN
Sbjct: 18 SASRADFSIIS-------NKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNF 70
Query: 81 RFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
+I EHN NR+YK+GLN+FADL++EE++A YLG + D K+RL++S S RY G++
Sbjct: 71 LYIHEHNQGNRSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLLRS--PSPRYQYSDGED 128
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LP+S+DWREKGAV PVKDQG CGSCWAFSTVAAVEGIN+IVTG+LISLSEQELVDCD
Sbjct: 129 LPKSIDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSY 188
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
N GCNGGL DYAF+FII NGG+ + +
Sbjct: 189 NQGCNGGLRDYAFEFIINNGGLTARR 214
>gi|357439999|ref|XP_003590277.1| Cysteine protease [Medicago truncatula]
gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula]
Length = 514
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 144/384 (37%), Positives = 219/384 (57%), Gaps = 62/384 (16%)
Query: 25 ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID 84
++ SI+++D N + ++++V+ ++Q W +H K R + FK NL++I
Sbjct: 30 SEYSILAFDLN-----KFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIV 84
Query: 85 EHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
E N++ + + +GLN+FAD++NEE++ ++ S K+ + K + ++ ++ D+
Sbjct: 85 ERNAMRNSPVGHHLGLNRFADMSNEEFKNKFI---SKVKKPISK-RASNLHVKVESCDDA 140
Query: 142 PESVDWREKGAVNPVKDQGSCG-------------------------------------- 163
P S+DWR+KG V VKDQG+CG
Sbjct: 141 PYSLDWRKKGVVTGVKDQGNCGKLLYFMHFKSFLVIYILELTTNFPLYSFESQFCILEKK 200
Query: 164 ------SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
SCW+FS+ A+EG+N IVTG+LISLSEQELVDCD N GC GG MDYAF+++I
Sbjct: 201 KLDFVGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTT-NDGCEGGYMDYAFEWVI 259
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
NGG+D+E DYPY+G C+ ++ KVV+IDGY DV+ D +L A QP+SV I+
Sbjct: 260 NNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSDS-ALFCATVKQPISVGID 318
Query: 278 AGGRAFQHYESGVFTGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
FQ Y G++ G+C S +DH V+ VGYG++ DYW+V+NSWG+ WG G++
Sbjct: 319 GSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGIEGFIY 378
Query: 335 LQRNLLDTNTGKCGIAMEASYPVK 358
++RN + G C I AS+P K
Sbjct: 379 IRRN-TNLKYGVCAINYMASFPTK 401
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 143/322 (44%), Positives = 200/322 (62%), Gaps = 18/322 (5%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNK 99
T+ E+ ++ + + G+ R IF+ NL+FI HN + + T+ V +N
Sbjct: 25 TEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNN 84
Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
F DL+NEE+RA + G RRL +A +A + LP +VDW KG V P+K+Q
Sbjct: 85 FTDLSNEEFRATFNG-----YRRLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQ 139
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQ 218
CGSCWAFS VA++EG + + TG+L+SLSEQ LVDC + + GC+GG MDYAF+++IQ
Sbjct: 140 QQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQ 199
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
N G+D+E YPY + C+ +RN+ +I + DV DE +L+ AVA P+SVAI+
Sbjct: 200 NRGIDTEASYPYKAIDESCE-FKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAID 258
Query: 278 AGGRAFQHYESGVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
A +FQ Y SGV+ +C + LDHGV AVGYGT NGV YW V+NSWG+ WG+ GY+ +
Sbjct: 259 ASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGVPYWKVKNSWGTSWGQKGYIFM 318
Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
RN +CGIA +ASYPV
Sbjct: 319 SRN----KQNQCGIATKASYPV 336
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 194/315 (61%), Gaps = 7/315 (2%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
T+ V+ +Q W+ K+ +T EKR +IFK+NL +I+ N++ N++YK+GLN+++D
Sbjct: 25 TESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLGLNRYSD 84
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
LT+EE+ A + G + +L SK+ S D++P + DWREKG V VK+Q C
Sbjct: 85 LTSEEFIASHTGFK--VSDQLSDSKMRSVAIPFNLNDDVPTNFDWREKGVVTDVKNQRQC 142
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
G CWAF+ VAAVEGI KI G LISLSEQ+LVDCDR+ ++GC GG AF II++ G+
Sbjct: 143 GCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQ-SSGCGGGDFVLAFDSIIKSRGI 201
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
E DYPY + + + I+GY V DE L +AV QPVSVAI
Sbjct: 202 VKEDDYPYKANDVQTCQLGQIPGAAQINGYFKVPANDEQQLLRAVLQQPVSVAIST-SYD 260
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
F HY GV+ G CG L+H V +GYG +E G YWL++NSWG WGE GY+K+ R
Sbjct: 261 FHHYMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLIKNSWGETWGEKGYMKVLRESSA 320
Query: 342 TNTGKCGIAMEASYP 356
T G+C IA+ A+YP
Sbjct: 321 TG-GQCSIAVHAAYP 334
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/309 (46%), Positives = 193/309 (62%), Gaps = 9/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A+HG+ +R ++F+ N ID N+ ++++ N+FADLT +E+RA
Sbjct: 38 HEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVQEFRA 97
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G R R + RY + + +SVDWR GAV VKDQG+ G CWAFS
Sbjct: 98 ARTGLR---PRPAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGASGCCWAFSA 154
Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAAVEG+NKI TG L+SLSEQELVDCD ++ GC+GGLMD AFQF+ + GG+ SE YP
Sbjct: 155 VAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYP 214
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + C S A SI G+EDV +E +L AVA QPVSVAI AF+ Y+SG
Sbjct: 215 YQCRDGPCR-SSAAAAAASIRGHEDVPRNNEAALAAAVAHQPVSVAINGEDMAFRFYDSG 273
Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
V G CG+ L+H + AVGYGT +G YWL++NSWG+ WGE GYV+++R + G CG
Sbjct: 274 VLGGACGTDLNHAITAVGYGTAADGTRYWLMKNSWGASWGEGGYVRIRRGV--RGEGVCG 331
Query: 349 IAMEASYPV 357
+A SYPV
Sbjct: 332 LAKLPSYPV 340
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 146/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
Length = 229
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 126/208 (60%), Positives = 156/208 (75%), Gaps = 8/208 (3%)
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
GSCWAFS +AAVEG+NKI+TG+L+SLSEQELVDCD N GC+GGLMDYAFQ+I +NGG+
Sbjct: 13 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQYIQRNGGV 72
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+E +YPYL + C+ ++ + V+IDGYEDV +E +L+KAVA QPV+VAIEA G+
Sbjct: 73 TTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVAIEASGQD 132
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
FQ Y GVFTG CG+ LDHGV AVGYGT +G YW V+NSWG DWGE GY+++QR + D
Sbjct: 133 FQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIRMQRGVPD 192
Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPKPH 369
+ G CGIAME SYP K KP H
Sbjct: 193 SR-GLCGIAMEPSYPTK------KPAGH 213
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 194/317 (61%), Gaps = 7/317 (2%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
+ + +Q W+ + + + R ++F +NL+FI+ N++ +++YK+G+NKF D
Sbjct: 31 EPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSYKLGVNKFTDW 90
Query: 104 TNEEYRAMYLGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
T EE+ A + G + + + D L + DWR +GAV PVK QG C
Sbjct: 91 TKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGAVTPVKYQGEC 150
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
G CWAFS +AAVEG+ KI G LISLSEQ+L+DC R+ N GC GG M AF +I++NGG+
Sbjct: 151 GGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEAFNYIVKNGGV 210
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
SE YPY E C + + + I G+E+V +E +L +AV+ QPV+V I+A
Sbjct: 211 SSENAYPYQVKEGPCRSN--DIPAIVIRGFENVPSNNERALLEAVSRQPVAVDIDASETG 268
Query: 283 FQHYESGVFTG-ECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
F HY GV+ +CG++++H V VGYGT + G+ YWL +NSWG WGENGY++++R+ +
Sbjct: 269 FIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWGENGYIRIRRD-V 327
Query: 341 DTNTGKCGIAMEASYPV 357
+ G CG+A ASYPV
Sbjct: 328 EWPQGMCGVAQYASYPV 344
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/321 (42%), Positives = 202/321 (62%), Gaps = 26/321 (8%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEK---RFQIFKDNLRFIDEHNSLNRTYKVGLNKF 100
+ D + ++ W+ +H K+ NE+ R+ ++++N +I+ HN N+++ + +NKF
Sbjct: 22 SHDPLTGVFADWMQEHQKSYA----NEEFVYRWNVWRENYLYIEAHNHQNKSFHLAMNKF 77
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
DLTN E+ ++ G A + +S +A LP DWR+KGAV VK+QG
Sbjct: 78 GDLTNAEFNKLFKGLSITADQAKQESDIA-------PAPGLPADFDWRQKGAVTHVKNQG 130
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
CGSCW+FST + EG N + G L SLSEQ LVDC N GCNGGLMDYAF++II+N
Sbjct: 131 QCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHGCNGGLMDYAFEYIIRN 190
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNA--KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
G+D+E+ YPY ++ C +++++ ++VS Y +V +E +L AVA QP SVAI+
Sbjct: 191 KGIDTEESYPYHASQGTCRYNKQHSGGELVS---YTNVPSGNEGALLNAVATQPTSVAID 247
Query: 278 AGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
A +FQ Y+ GV+ C S+ LDHGV+AVG+G +G DYWLV+NSWG+DWG +GY+++
Sbjct: 248 ASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWGVRDGKDYWLVKNSWGADWGLSGYIEM 307
Query: 336 QRNLLDTNTGKCGIAMEASYP 356
RN +CGIA AS+P
Sbjct: 308 SRN----KHNQCGIATAASHP 324
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N + S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 139/349 (39%), Positives = 204/349 (58%), Gaps = 13/349 (3%)
Query: 17 FFISSSSAADMSIISYDNNHDHSSS-----WRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
F I D ++ DH W+ + + ++ A + K+ ++
Sbjct: 77 FLIQWQGEDDRAVFPPSPVEDHQPPANIWEWK-EAHFQDAFSSFQAMYAKSYATEEEKQR 135
Query: 72 RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
R+ IFK+NL +I HN +Y + +N F DL+ +E+R YLG + + VA++
Sbjct: 136 RYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLGFKKSRNLKSHHLGVATE 195
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
ELP VDWR +G V PVKDQ CGSCWAFST A+EG + TG+L+SLSEQ
Sbjct: 196 LLNVLPS-ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQ 254
Query: 192 ELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
EL+DC R + N C+GG M+ AFQ+++ +GG+ SE YPYL + +C ++ KVV I
Sbjct: 255 ELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEECR-AQSCEKVVKIL 313
Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
G++DV E ++K A+A PVS+AIEA FQ Y GVF CG+ LDHGV+ VGYGT
Sbjct: 314 GFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGT 373
Query: 311 --ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
E+ D+W+++NSWG+ WG +GY+ + + G+CG+ ++AS+PV
Sbjct: 374 DKESKKDFWIMKNSWGTGWGRDGYMYMA--MHKGEEGQCGLLLDASFPV 420
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GC+GG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT ENG YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDENGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 148/321 (46%), Positives = 199/321 (61%), Gaps = 28/321 (8%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W K G++ +R QI+ +N + + HN L ++Y++G+ +FAD+ NEE
Sbjct: 27 FHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNEE 86
Query: 108 YRAMY-LGT----RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
Y+++ LG + A RR S + G LP +VDWR+KG V VKDQ C
Sbjct: 87 YKSLISLGCLRAFNTSAPRR------GSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQC 140
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
GSCWAFS ++EG N TG+L+SLSEQ+LVDC N GCNGGLMDYAF++I +NGG
Sbjct: 141 GSCWAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGG 200
Query: 222 MDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
+D+E+ YPY + +C P AK GY DV+ DE +LK+AVA PVSV I+A
Sbjct: 201 IDTEKSYPYEAEDGQCRFKPENVGAKCT---GYVDVTVGDEDALKEAVATIGPVSVGIDA 257
Query: 279 GGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
+FQ Y+SGV+ +C S LDHGV+AVGYGT+NG DYWLV+NSWG WG+ GY+ +
Sbjct: 258 SHSSFQLYDSGVYDEQDCSSQDLDHGVLAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMMS 317
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
RN +CGIA ASYP+
Sbjct: 318 RN----KDNQCGIATAASYPL 334
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG+L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAEGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|302831223|ref|XP_002947177.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
gi|300267584|gb|EFJ51767.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
Length = 514
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 161/355 (45%), Positives = 212/355 (59%), Gaps = 46/355 (12%)
Query: 52 YQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
+ W ++G+T +R IF DN+R I E + + + LN++ADLT EE+ +
Sbjct: 38 FTLWSRQYGRTYVEQSPEYTRRLSIFSDNVRAIQESHEKDPGVTLALNEYADLTWEEFSS 97
Query: 111 MYLGTRSDAKR-----RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
LG R D + R S+ + RYA A + P+++DWREKGAV VK+QG CGSC
Sbjct: 98 TRLGLRIDQDQLDRRSRRSASRRNAWRYA--AAVDNPKAIDWREKGAVAEVKNQGQCGSC 155
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCD---RKI---------------------- 200
WAFST A+EGIN IVTG+L SLSEQ+LVDCD R +
Sbjct: 156 WAFSTTGAIEGINAIVTGQLQSLSEQQLVDCDTGKRTVTRSKRSCTVILPSYSSNSCRNE 215
Query: 201 -NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY---LGAENKCDPSRRNAK-VVSIDGYEDV 255
N GC+GGLMD AF+++IQNGG+D+EQDY Y G C+ ++ + VSIDGYEDV
Sbjct: 216 SNMGCSGGLMDDAFKYVIQNGGLDTEQDYAYWSGYGLGFWCNKRKQTDRPAVSIDGYEDV 275
Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGV 314
P E +L KAVA QPV+VAI AG + Q Y GV + C L+HGV+ VGY +++G
Sbjct: 276 -PQGEDNLLKAVAHQPVAVAICAGA-SMQFYSRGVIS-TCCEGLNHGVLTVGYNVSQDGE 332
Query: 315 DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPH 369
YW+V+NSWG+ WGE GY +L+ + + TG CGIA ASYP K S N KP P
Sbjct: 333 KYWIVKNSWGAGWGEQGYFRLKMGVGE--TGLCGIASAASYPTKTSPN--KPVPE 383
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 204/310 (65%), Gaps = 21/310 (6%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEEYRAMYL 113
+HG+ E+RF+IFK NL++I+EHN SL ++Y +G+N+FAD+ NEE+R MY
Sbjct: 48 QHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR-MYN 106
Query: 114 GTRSDAK--RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
G R D R + S + Y P+ VDWR+KG V VK+QG CGSCW+FST
Sbjct: 107 GLRRDYNYSREVQCSNHLTPEYLVA-----PDEVDWRKKGYVTAVKNQGQCGSCWSFSTT 161
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
++EG + +G+L+SLSEQ+LVDC K N GCNGGLMD AF++II NGG+++E++YPY
Sbjct: 162 GSLEGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPY 221
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
+ +C +++ + G DV DE LK +VA+ PVS+AI+A ++FQ Y G
Sbjct: 222 DARQERCH-FKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGG 280
Query: 290 VF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
V+ +C S LDHGV+ VGYGT++G DYWLV+NSWG+ WG GYVK+ RN +C
Sbjct: 281 VYDEPKCSSTELDHGVLVVGYGTDDGQDYWLVKNSWGTTWGLEGYVKMSRN----QDNQC 336
Query: 348 GIAMEASYPV 357
G+A +ASYP+
Sbjct: 337 GVATQASYPL 346
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 22/351 (6%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N+ + S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNSQTTARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T+EE+ + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGI--NIPSYLSPSPM 114
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK+QG CG CWAFS V ++EG KI TG L+
Sbjct: 115 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 174
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ SE DY Y G + C + A
Sbjct: 175 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQEKTA-A 232
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 233 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 290
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 291 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPG-GHCDIAKMSSYP 340
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPELSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNI-PNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GC+GG M AF FI +NGG+ SE DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 146/369 (39%), Positives = 218/369 (59%), Gaps = 26/369 (7%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMT-IYQTWLAKH 59
+AT ++ + + +F+F + AA M+ + H DD +M + W A H
Sbjct: 15 LATTAVLM-LRGCLFVFLTALPPAAIMTPAA-----GHVV--ELDDMLMLDRFVRWQAAH 66
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYL----- 113
+T +RFQ+++ N+ +I+ N TY++G N+FADLT+EE+ +MY
Sbjct: 67 NRTYGDAEERLRRFQVYRANIEYIEATNRRGGLTYELGENQFADLTSEEFLSMYASSYDA 126
Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDE---LPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
G R+D + L+ + VA A GD P S DWR KGAV P K+QG +C SCWAF
Sbjct: 127 GDRADDEAALITTDVAGDG-AWSDGDLEALPPPSWDWRAKGAVTPPKNQGPTCSSCWAFV 185
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
TVA +EG+ I TG+LISLSEQ+LVDCD + GCN G F+++++NGG+ +E +YP
Sbjct: 186 TVATIEGLTFIKTGKLISLSEQQLVDCD-MYDGGCNTGSYSRGFRWVLENGGLTTEAEYP 244
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y A C+ ++ I G + P +E+ ++KAVA QPV VAIE G Q Y++G
Sbjct: 245 YTAARGPCNRAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPVGVAIEV-GSGMQFYKTG 303
Query: 290 VFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
V++G CG+ L H V VGYG + +G YW+V+NSWG WGE G+++++R++ G C
Sbjct: 304 VYSGPCGTNLAHAVTVVGYGVDPASGAKYWIVKNSWGQAWGERGFIRMRRDV--GGPGLC 361
Query: 348 GIAMEASYP 356
GIA++ +YP
Sbjct: 362 GIALDVAYP 370
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 147/352 (41%), Positives = 204/352 (57%), Gaps = 22/352 (6%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYAC---KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+S + + D +P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L
Sbjct: 116 SSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNL 175
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
+ SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 MEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA- 233
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A
Sbjct: 234 AVQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGNCADRINHAVTA 291
Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+GYGT E G YWL++NSWG+ WGENGY+K+ R+ D +G C IA +SYP
Sbjct: 292 IGYGTDEEGQKYWLLKNSWGTSWGENGYMKIIRDSGDP-SGLCDIAKMSSYP 342
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N + S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 142/322 (44%), Positives = 198/322 (61%), Gaps = 18/322 (5%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNK 99
T+ E+ ++ + + G+ R IF+ NL+FI HN + + T+ V +N
Sbjct: 25 TEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNN 84
Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
F DL+NEE+RA + G RRL +A +A + LP +VDW KG V P+K+Q
Sbjct: 85 FTDLSNEEFRATFNG-----YRRLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQ 139
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQ 218
CGSCWAFS VA++EG + + TG+L+SLSEQ LVDC + + GC+GG MDYAF+++IQ
Sbjct: 140 QQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQ 199
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
N G+D+E YPY + C+ +RN+ +I + DV DE +L+ AVA P+SVAI+
Sbjct: 200 NRGIDTEASYPYKAIDESCE-FKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAID 258
Query: 278 AGGRAFQHYESGVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
A +FQ Y SGV+ +C + LDHGV AVGYGT NG YW V+NSWG+ WG GY+ +
Sbjct: 259 AAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGAPYWKVKNSWGTSWGRKGYIFM 318
Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
RN +CGIA +ASYPV
Sbjct: 319 SRN----KQNQCGIATKASYPV 336
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C I +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDITKMSSYP 341
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N+ + S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 149/353 (42%), Positives = 204/353 (57%), Gaps = 25/353 (7%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M LA+ +V L +S + A ++ S + +++T + W+ KH K +
Sbjct: 1 MRLAVFLIVSLVILSINVCAATNLFS-------AQTYQTS------FLGWMKKHNKAYHH 47
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
N+K +Q FKDN+ FI NS +GLN+FADLTNEEY+ YLG + R +
Sbjct: 48 HEFNDK-YQTFKDNMDFIHNWNSKESDTVLGLNRFADLTNEEYKKTYLGMSINVNLRANQ 106
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+ + G P S+DWR+ GAV VKDQG CGSCWAF+T AVEG ++I TG +
Sbjct: 107 VPMNGLNFERFTG---PSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNM 163
Query: 186 ISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
++ SEQ LVDC R N GC+GGLM AF++II N G+ +E+ YPY +N+C
Sbjct: 164 VTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRC-VYNTTM 222
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT-GECGS-ALDHG 302
+I GY+DV E +L A++ QPV+VAI+A FQ Y+SGV+ C S L+HG
Sbjct: 223 LGTAISGYKDVPRGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHG 282
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
V+AVGYGT G DY++V+NSW WG GY+ + RN CGIA ASY
Sbjct: 283 VLAVGYGTLEGKDYYIVKNSWAETWGNQGYILMARNA----NNHCGIATMASY 331
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 199/318 (62%), Gaps = 11/318 (3%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
+D +M ++ W+A++G+ +RFQIFK+N++ I+ NS N +Y +G+N+F D+
Sbjct: 3 NDPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDM 62
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
T E+ A Y G + + V S + +P+S+DWR+ GAVN VK+Q CG
Sbjct: 63 TKSEFVAQYTGV--SLPLNIEREPVVS--FDDVNISAVPQSIDWRDYGAVNEVKNQNPCG 118
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
SCWAF+ +A VEGI KI TG L+SLSEQE++DC ++ GC GG ++ A+ FII N G+
Sbjct: 119 SCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDC--AVSYGCKGGWVNKAYDFIISNNGVT 176
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
+E++YPY + C+ + I GY V DE S+ AV++QP++ I+A F
Sbjct: 177 TEENYPYQAYQGTCN-ANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDA-SENF 234
Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Q+Y GVF+G CG++L+H + +GYG + +G YW+VRNSWGS WGE GYV++ R + +
Sbjct: 235 QYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARG-VSS 293
Query: 343 NTGKCGIAMEASYPVKNS 360
++G CGIAM +P S
Sbjct: 294 SSGACGIAMSPLFPTLQS 311
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 153/333 (45%), Positives = 204/333 (61%), Gaps = 21/333 (6%)
Query: 36 HDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR---- 91
H S + DE + + GK+ N+ + F N+ I+EHN +R
Sbjct: 31 HRQKSLRQKIDEAFNKWDDYKETFGKSYEPDEEND-YMEAFVKNVIHIEEHNKEHRLGRK 89
Query: 92 TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWRE 149
T+++GLN+ ADL +YR + G R RR + S ++ ++PESVDWRE
Sbjct: 90 TFEMGLNEIADLPFSQYRKLN-GYRM---RRQFGDSLQSNGTKFLVPFNVQIPESVDWRE 145
Query: 150 KGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGL 208
+G V PVK+QG CGSCWAFS+ A+EG + TG+L+SLSEQ LVDC K N GCNGGL
Sbjct: 146 EGLVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGL 205
Query: 209 MDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA 268
MD AF++I +N G+D+E YPY+G E KC +RNA G+ D+ DE +LKKAVA
Sbjct: 206 MDLAFEYIKENHGVDTEDSYPYVGRETKCH-FKRNAVGADDKGFVDLPEGDEEALKKAVA 264
Query: 269 DQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWG 324
Q P+S+AI+AG R+FQ Y+ GV F EC S LDHGV+ VGYGT+ DYWLV+NSWG
Sbjct: 265 TQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWG 324
Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WGE GY+++ RN CG+A +ASYP+
Sbjct: 325 PTWGEKGYIRIARN----RNNHCGVATKASYPL 353
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPELSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNI-PNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GC+GG M AF FI +NGG+ SE DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 139/327 (42%), Positives = 195/327 (59%), Gaps = 24/327 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNRTYKVGLNKFADLTNEEY 108
+Q W A+HG+ +R +++ N+R+I+ N + TY++G + DLT +E+
Sbjct: 53 FQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTDLTADEF 112
Query: 109 RAMY------LGTRSDAKRRLMKSKVASQRYACKAGDE----------LPESVDWREKGA 152
AMY L D M + ++ A AG + P SVDWR KGA
Sbjct: 113 TAMYTSPSPVLSAHDDEAAGAMM--ITTRAGAVDAGGQQVYFNVSTAGAPASVDWRAKGA 170
Query: 153 VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYA 212
V VK+QG CGSCWAFSTVA VEGI++I TG LISLSEQELVDCD ++ GC+GG+ +A
Sbjct: 171 VTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCD-TLDYGCDGGVSYHA 229
Query: 213 FQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPV 272
++I NGG+ +E DYPY G + C ++ +I G+ V+ E SL AVA QPV
Sbjct: 230 LEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEPSLANAVAAQPV 289
Query: 273 SVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV--GYGTENGVDYWLVRNSWGSDWGEN 330
+V+IEAGG FQHY GV+ G CG+ L+HGV V G +G YW+V+NSWG WG+
Sbjct: 290 AVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSWGKKWGDG 349
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
GY ++++++ G CGIA+ S+P+
Sbjct: 350 GYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 195/315 (61%), Gaps = 16/315 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W K ++ + R QI+ +N +F+ HN L ++Y++G+ FAD+ NEE
Sbjct: 26 FHAWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEE 85
Query: 108 Y-RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
Y R + G L + S + G +LP++VDWR+KG V VKDQ CGSCW
Sbjct: 86 YKRVISQGCLHSFNASL--PRRGSTFFRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSCW 143
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
AFS ++EG + TG L+SLSEQ+LVDC N GC GGLMDYAFQ+I NGG+D+E
Sbjct: 144 AFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGIDTE 203
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
+ YPY KC + N S GY +VS DE +LK+AVA P+SV I+A +FQ
Sbjct: 204 ESYPYEAENGKCRYNPDNIGATST-GYTEVSQGDEDALKEAVATIGPISVGIDASQMSFQ 262
Query: 285 HYESGVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
YESGV+ +C S LDHGV+AVGYGTE+G DYWLV+NSWG +WG+ GY+K+ RN
Sbjct: 263 FYESGVYNEPDCSSLELDHGVLAVGYGTEDGNDYWLVKNSWGLEWGDKGYIKMSRN---- 318
Query: 343 NTGKCGIAMEASYPV 357
+ +CGIA ASYP+
Sbjct: 319 KSNQCGIATAASYPL 333
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 145/352 (41%), Positives = 206/352 (58%), Gaps = 22/352 (6%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYAC---KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG+L
Sbjct: 116 SSTEFKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKL 175
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
+ SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 MEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA- 233
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A
Sbjct: 234 AVQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTA 291
Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 IGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 342
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 144/314 (45%), Positives = 192/314 (61%), Gaps = 17/314 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
++ W +HGK R I++ NL + HN + TY +G+N+FADL N+E
Sbjct: 28 WKEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFADLQNKE 87
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ AM G R + + K S +LP++VDWR KG V PVKDQG CGSCWA
Sbjct: 88 FVAMMTGFRVNGTSKAAK---GSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWA 144
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
FS ++EG + TG+L+SLSEQ LVDC K N GCNGGLMD AFQ+II GG+D+E+
Sbjct: 145 FSATGSLEGQHFKKTGKLVSLSEQNLVDCSDK-NYGCNGGLMDRAFQYIIDAGGIDTEES 203
Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHY 286
YPY+ + C N ++ GY DV+ E +L+KAVA P+SVAI+A +FQ Y
Sbjct: 204 YPYIAMDGNCHFKTANVG-ATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFSFQLY 262
Query: 287 ESGVFT--GECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
+SGV+ G + LDHGV+AVGYGT +G DYW+V+NSW WG NGY+ + RN
Sbjct: 263 QSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMSRN----K 318
Query: 344 TGKCGIAMEASYPV 357
+CGIA +ASYP+
Sbjct: 319 DNQCGIATQASYPL 332
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N+ + S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GC+GG M AF FI +NGG+ SE DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 147/320 (45%), Positives = 206/320 (64%), Gaps = 25/320 (7%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFA 101
DE+ T+++T H KT + +RF I++ +L I++HN L + T+ +G+N++
Sbjct: 21 DEMWTLFKT---THSKTYATEAEDMRRF-IWERHLNMINQHNIEADLGKHTFSLGMNEYG 76
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DLT EY AM + ++ KS V S + ++P++VDWREKG V PVK+QG
Sbjct: 77 DLTQHEYAAM-------SGYKMAKSSVGSS-FLEPENLQVPKTVDWREKGYVTPVKNQGQ 128
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFS+ ++EG TG L S+SEQ LVDC R + N GC+GGLMD AF +I +N
Sbjct: 129 CGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIKKNM 188
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
G+DSE+ YPY + +C +++ V + G+ D+ DE +L+ AVA PVSVAI+A
Sbjct: 189 GIDSEKSYPYEAVDGECR-YKKSDSVTTDSGFVDIPHGDETALRTAVASVGPVSVAIDAS 247
Query: 280 GRAFQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
+FQ Y++GV+T C S LDHGV+ VGYG ENG DYWLV+NSWG+ WGE GY+KL R
Sbjct: 248 HTSFQFYKTGVYTEANCSSTQLDHGVLVVGYGVENGQDYWLVKNSWGASWGEAGYIKLAR 307
Query: 338 NLLDTNTGKCGIAMEASYPV 357
N + +CGIA +ASYP+
Sbjct: 308 N----HGNQCGIASQASYPL 323
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 152/333 (45%), Positives = 203/333 (60%), Gaps = 21/333 (6%)
Query: 36 HDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR---- 91
H S + DE + + GK+ N+ + F N+ I+EHN +R
Sbjct: 32 HRQKSLRQKIDEAFNKWDDYKETFGKSYEPEEEND-YMEAFVKNVIHIEEHNKEHRLGRK 90
Query: 92 TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWRE 149
T+++GLN+ ADL +YR + G R RR + S ++ ++PESVDWRE
Sbjct: 91 TFEMGLNEIADLPFSQYRKLN-GYRM---RRQFGDSMQSNGTKFLVPFNVQIPESVDWRE 146
Query: 150 KGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGL 208
+G V PVK+QG CGSCWAFS+ A+EG + TG+L+SLSEQ LVDC K N GCNGGL
Sbjct: 147 EGLVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGL 206
Query: 209 MDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA 268
MD AF++I +N G+D+E YPY+G E KC +RN G+ D+ DE +LKKAVA
Sbjct: 207 MDLAFEYIKENHGVDTEDSYPYVGRETKCH-FKRNTVGADDKGFVDLPEGDEEALKKAVA 265
Query: 269 DQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWG 324
Q P+S+AI+AG R+FQ Y+ GV F EC S LDHGV+ VGYGT+ DYWLV+NSWG
Sbjct: 266 TQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWG 325
Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WGE GY+++ RN CG+A +ASYP+
Sbjct: 326 PTWGEKGYIRIARN----RNNHCGVATKASYPL 354
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 189/317 (59%), Gaps = 12/317 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
T D + ++ W+ ++ K++ ++ + F I++ N+ +EHN N++Y + +N+F DL
Sbjct: 22 THDPLTGVFAKWMRENTKSNYRFVYSNEEF-IYRWNVWRDEEHNRQNKSYFLAMNQFGDL 80
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TN E+ ++ G D + A + A +P DWR+KGAV VK+QG CG
Sbjct: 81 TNAEFNRLFKGLAFDYSKHAKIHTAAPEAPATG----IPSEFDWRQKGAVTHVKNQGQCG 136
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
SCW+FST + EG N + TG L+SLSEQ L+DC N GCNGGLMDYAF++II N G+
Sbjct: 137 SCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEYIINNRGI 196
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
D+E YPY A K S+ GY DV+ DE +L A +PVSVAI+A +
Sbjct: 197 DTEASYPYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKEPVSVAIDASHNS 256
Query: 283 FQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y GV + C S LDHGV+ VG+G+ENG D+W V+NSWG+ WG NGY+K+ RN
Sbjct: 257 FQFYSGGVYYESACSSTQLDHGVLVVGWGSENGQDFWWVKNSWGASWGLNGYIKMSRN-- 314
Query: 341 DTNTGKCGIAMEASYPV 357
CGIA ASYP
Sbjct: 315 --QNNNCGIATAASYPT 329
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GC+GG M AF FII+NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N + S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPELSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FII+NGG+ E DY Y G + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 143/351 (40%), Positives = 206/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N + S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + + S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYVSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK+QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C + ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCANRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGE+G++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNP-AGLCDIAKVSSYP 341
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C I +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDITKMSSYP 341
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 141/307 (45%), Positives = 183/307 (59%), Gaps = 17/307 (5%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
W H K + G R+ I+KDN R I EHN + + +N+F D+TN E++
Sbjct: 30 WKMAHNKAYSHDGEETVRYTIWKDNERRIREHNLQGGDFLLEMNQFGDMTNNEFK----- 84
Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
D L V+ + P+SVDWR +G V PVKDQG CGSCWAFST ++
Sbjct: 85 ---DFNGYLSHKHVSGSTFLTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSL 141
Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
EG N TG+L+SLSEQ LVDC N GCNGGLMD AF +I +N G+DSE YPY
Sbjct: 142 EGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAK 201
Query: 234 ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT 292
+ KC ++ N G+ D+ DE LK+AVA P+SVAI+A +FQ Y GV+
Sbjct: 202 DGKCAFTKPNVAATDT-GFVDIPSGDENKLKEAVASVGPISVAIDASHFSFQFYRKGVYN 260
Query: 293 G-ECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
+C S LDHGV+ VGYGTE+G DYWLV+NSW + WG+ GY+K+ RN + +CGIA
Sbjct: 261 ERKCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMSRNAKN----QCGIA 316
Query: 351 MEASYPV 357
ASYP+
Sbjct: 317 TNASYPL 323
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ D +G C I +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDITKMSSYP 341
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 151/329 (45%), Positives = 198/329 (60%), Gaps = 21/329 (6%)
Query: 52 YQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
+ W +H +T S G +R +F DN+R I E N N + LN++AD T EE+ A
Sbjct: 40 FGLWATQHARTYSEGSPEYTRRLGVFADNVRAIAEQNRRNTGITLALNEYADETWEEFAA 99
Query: 111 MYLGTRSDAKR------RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
LG + ++ R S +S RYA + P +VDWR K AV VK+QG CGS
Sbjct: 100 KRLGLKISQEQLKAREARSSSSSSSSWRYA---QVQTPAAVDWRAKNAVTQVKNQGQCGS 156
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CWAFS V ++EG N + TG+L++LSEQ+LVDCD N GC+GGLMD AF++++ NGG+D+
Sbjct: 157 CWAFSAVGSIEGANALATGQLVALSEQQLVDCDTASNMGCSGGLMDDAFKYVLDNGGIDT 216
Query: 225 EQDYPY---LGAENKCDPSRRNAK-VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
E+DY Y G C+ ++ + VSIDGYEDV P E +L KAVA QPV+VAI A
Sbjct: 217 EEDYSYWSGYGFGFWCNKRKQTDRPAVSIDGYEDV-PTSEPALLKAVAGQPVAVAICASA 275
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD-YWLVRNSWGSDWGENGYVKLQRNL 339
Q Y SGV C L+HGV+AVGY T + YW+V+NSWG WGE GY +L+
Sbjct: 276 N-MQFYSSGVIN-SCCEGLNHGVLAVGYDTSDKAQPYWIVKNSWGGSWGEQGYFRLKMG- 332
Query: 340 LDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
+ G CGIA ASY VK S + KP P
Sbjct: 333 -EGPKGLCGIASAASYAVKTSAVN-KPVP 359
>gi|313507179|pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin
At 1.7 Angstroms Resolution By Fast Fourier
Least-Squares Methods
Length = 220
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 122/219 (55%), Positives = 157/219 (71%), Gaps = 3/219 (1%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LP VDWR GAV +K QG CG WAFS +A VEGINKI +G LISLSEQEL+DC R
Sbjct: 1 LPSYVDWRSAGAVVDIKSQGECGGXWAFSAIATVEGINKITSGSLISLSEQELIDCGRTQ 60
Query: 201 NA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
N GC+GG + FQFII +GG+++E++YPY + CD + ++ K V+ID YE+V +
Sbjct: 61 NTRGCDGGYITDGFQFIINDGGINTEENYPYTAQDGDCDVALQDQKYVTIDTYENVPYNN 120
Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
E +L+ AV QPVSVA++A G AF+ Y SG+FTG CG+A+DH +V VGYGTE GVDYW+V
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYASGIFTGPCGTAVDHAIVIVGYGTEGGVDYWIV 180
Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 181 KNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 217
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 191/316 (60%), Gaps = 28/316 (8%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA- 110
+ W H K + R+ I+KDN+ I E+NS ++ + +N F D+TN E+RA
Sbjct: 27 WYVWKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEFRAK 86
Query: 111 ---MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ L + L+ S A+ P++VDWR +G V PVK+QG CGSCWA
Sbjct: 87 MNGLLLHKHQNGSTFLVPSHTAA-----------PDAVDWRSEGYVTPVKNQGQCGSCWA 135
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS+ A+EG + TG L+SLSEQ LVDC N GCNGGLMD AF +I NGG+D+E
Sbjct: 136 FSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTET 195
Query: 227 DYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAF 283
YPY G + C R + + D G+ D+ DE +LK+AVA PVSVAI+A +F
Sbjct: 196 GYPYEGQDGTC---RYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSF 252
Query: 284 QHYESGVF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Q Y SGV+ +C SALDHGV+ VGYGT+NG DYWLV+NSWG+ WG GY+ + RN
Sbjct: 253 QFYHSGVYDEPQCSPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRN--- 309
Query: 342 TNTGKCGIAMEASYPV 357
N +CGIA +ASYP+
Sbjct: 310 -NQNQCGIASKASYPL 324
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 199/324 (61%), Gaps = 21/324 (6%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADL 103
VM ++++ +H K R +IF +N + I HN L ++TYK+G+NK+ D+
Sbjct: 25 VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE---LPESVDWREKGAVNPVKDQG 160
+ E+ M G R++ K+ Q E +P+SVDWREKGAV VKDQG
Sbjct: 85 LHHEFVNMMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQG 144
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
SCGSCWAFS A+EG + TG+L+SLSEQ LVDC K N GCNGGLMD AFQ+I N
Sbjct: 145 SCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVN 204
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
GG+D+E+ YPY E + +P R N D G+ DV +E +LKKA+A PVSVAI
Sbjct: 205 GGIDTEKSYPY---EAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVAI 261
Query: 277 EAGGRAFQHYESGVFTGECGSA--LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYV 333
+A +FQ Y+ GV++ SA LDHGV+AVGYG TE+G DYWLV+NSW WG+ GY+
Sbjct: 262 DASQDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYI 321
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
K+ RN CGIA ASYP+
Sbjct: 322 KIARN----QNNMCGIASAASYPL 341
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N+ + S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFCAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 147/329 (44%), Positives = 203/329 (61%), Gaps = 20/329 (6%)
Query: 36 HDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRT 92
D SSS +E MT ++ W+ +HG+T +RFQ+FK N F+D N+ +
Sbjct: 35 RDLSSSTGYGEEAMTARHEKWMVEHGRTYKDEAEKARRFQVFKANAAFVDTSNAAAGGKK 94
Query: 93 YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA-CKAGDELPESVDWREKG 151
Y + +N+FAD+T++E+ A Y G + K+ +YA E ++VDWR+KG
Sbjct: 95 YHLAINRFADMTHDEFMARYTGFKP---LPATGKKMPGFKYANVTLSSEDQQAVDWRKKG 151
Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMD 210
AV VK+Q CG CWAFS VAA+EG+++I TGEL+SLSEQ+LVDC N GC GG M+
Sbjct: 152 AVTDVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSLSEQQLVDCSTNGNNNGCGGGTME 211
Query: 211 YAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ 270
AFQ++I N G+ +E YPY + C + V++ Y+ V DE +L AVA Q
Sbjct: 212 DAFQYVIGNNGIATEAAYPYTAMQGMCQNVQ---PAVAVRSYQQVPRDDEDALAAAVAGQ 268
Query: 271 PVSVAIEAGGRAFQHYESGVFTGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWG 328
PVSVA++A FQ Y+ GV T + CG+ L+H V AVGYGT E+G YWL++N WGS WG
Sbjct: 269 PVSVAVDANN--FQFYKGGVMTADSCGTNLNHAVTAVGYGTAEDGTPYWLLKNQWGSTWG 326
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
E GY++LQR + G CG+A +ASYPV
Sbjct: 327 EEGYLRLQRGV-----GACGVAKDASYPV 350
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 202/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGHVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNI-PNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GC+GG M AF FI +NGG+ SE DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 202/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 192/313 (61%), Gaps = 17/313 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKFADLTNEEYR 109
+++W +HGK N R I++ N +++DEHN+ + VG+N+FADL + E+
Sbjct: 22 WESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSEFG 81
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
+Y G + + +SKV ++ K GD LP SVDWR KG V +K+QG CGSCWAFS
Sbjct: 82 RLYNGYNNKPSMKKAQSKV----FSTKVGD-LPTSVDWRTKGFVTAIKNQGQCGSCWAFS 136
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
VA +EG + TG L+SLSEQ LVDC + N GCNGGLMD AFQ++I+NGG+D+E Y
Sbjct: 137 AVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEASY 196
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPF--DEMSLKKAVADQPVSVAIEAGGRAFQHY 286
PY + KC + N + G+ D+ P + P+SVAI+A +FQ Y
Sbjct: 197 PYKAVDQKCKFNAANVG-STCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLY 255
Query: 287 ESGVFTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
+SGV++ S +LDHGV AVGY + +GV YW+V+NSWG+ WG+ GY+ + RN
Sbjct: 256 KSGVYSESACSQTSLDHGVTAVGYDSSSGVAYWIVKNSWGTTWGQAGYIWMSRN----KN 311
Query: 345 GKCGIAMEASYPV 357
+CGIA ASYP+
Sbjct: 312 NQCGIATAASYPI 324
>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
Length = 340
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 141/322 (43%), Positives = 194/322 (60%), Gaps = 16/322 (4%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVG 96
S S T D+ + ++++ K E R Q +K N+ FI+ HNS N ++ +G
Sbjct: 29 SQSLYTADQDHIDFVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTLG 88
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
N AD T++EY+ M LG + ++K + Y+ ++PES+DWREKGAVN V
Sbjct: 89 PNHLADYTHDEYKKM-LGYKP-------RNKTGKEVYSTPNLKDIPESIDWREKGAVNAV 140
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
KDQG CGSCWAFST+A++E I TG+L SLSEQ+LVDC + N GCNGG M A +I
Sbjct: 141 KDQGQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKNGNEGCNGGDMGLAMDYI 200
Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVADQPVSVA 275
GG+++E+DYPY+G + C + +K V+ D G+ ++ P +L+ A+A+ PVSVA
Sbjct: 201 ASAGGVETEKDYPYVGKDQTC--AFEASKEVATDKGHINIVPGKFATLQAAIAEGPVSVA 258
Query: 276 IEAGGRAFQHYESGVFTGE-CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
IEA FQ Y SG+F CG+ LDHGV AVGYG +NG Y++VRNSW WG GY+
Sbjct: 259 IEADSLFFQFYRSGIFDSSWCGTNLDHGVAAVGYGVDNGKQYYIVRNSWSDSWGLKGYIN 318
Query: 335 LQRNLLDTNTGKCGIAMEASYP 356
+ N G CGI ME P
Sbjct: 319 IIAN--GDGNGMCGIQMEPVVP 338
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 202/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + D++P ++DWRE GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 199/324 (61%), Gaps = 19/324 (5%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
D + + T+ +H K R +IF +N I +HN L +YK+GLNK+A
Sbjct: 22 DLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYA 81
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSK--VASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
D+ + E++ G + R+LM+ + + Y A +P+SVDWRE GAV VKDQ
Sbjct: 82 DMLHHEFKETMNG-YNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQ 140
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS+ A+EG + G L+SLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 200
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVADQ-PVSVAI 276
NGG+D+E+ YPY G ++ C ++ A + + D G+ D+ DE +KKAVA PVSVAI
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNK--ATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAI 258
Query: 277 EAGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
+A +FQ Y GV+ EC LDHGV+ VGYGT E+G+DYWLV+NSWG+ WGE GY+
Sbjct: 259 DASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYI 318
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
K+ RN +CGIA +SYP
Sbjct: 319 KMARN----QNNQCGIATASSYPT 338
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 191/317 (60%), Gaps = 25/317 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
+ + A HGKT R +IF DN + I+ HN+ +YK+ +N F DL E
Sbjct: 27 WHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHE 86
Query: 108 YRAMYLGTR--SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
++A+ G + D KR + + LP++VDWR+KGAV PVKDQG CGSC
Sbjct: 87 FKALMNGFKMSPDTKR--------NGELYFPSNSNLPKTVDWRQKGAVTPVKDQGQCGSC 138
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
W+FS ++EG + TG+L+SLSEQ LVDC N GC GGLMD AFQ++ N G+D+
Sbjct: 139 WSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGIDT 198
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
E YPY EN C + KV D G+ D+ DE +L+ A+A P+SVAI+A +
Sbjct: 199 EASYPYEARENTC--RFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPISVAIDANHGS 256
Query: 283 FQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y GV+ C S LDHGV+AVGYGTENG DYWLV+NSWG WGENGY+K+ RN
Sbjct: 257 FQFYSKGVYNEPNCSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGENGYIKIARN-- 314
Query: 341 DTNTGKCGIAMEASYPV 357
++ CGIA ASYP+
Sbjct: 315 --HSNHCGIASMASYPL 329
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 188/308 (61%), Gaps = 8/308 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+++HG+ +RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A
Sbjct: 39 HELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLA 98
Query: 111 MYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
+ G +++ + + D++P ++DWRE GAV VK QG CG CWAFS
Sbjct: 99 KFTGLNIPNSYLSPSPLSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFS 158
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
V ++EG KI TG L+ SEQEL+DC N GCNGG M AF FII+NGG+ E DY
Sbjct: 159 AVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYE 217
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
YLG + C + A V I Y+ V P E SL +AV QPVS+ I A + Q Y G
Sbjct: 218 YLGQQYTCRSQEKTA-AVQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGG 274
Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
+ G C ++H V A+GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C
Sbjct: 275 TYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCD 333
Query: 349 IAMEASYP 356
IA +SYP
Sbjct: 334 IAKMSSYP 341
>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 374
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 150/336 (44%), Positives = 198/336 (58%), Gaps = 26/336 (7%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNG---MGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLN 98
+++ + ++YQ W +G S+ + RF++FK N R+I + N +YK+GLN
Sbjct: 34 ESEESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGLN 93
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
KFADLT EE+ A Y G L K+ S A AGD P + DWRE GAV VKD
Sbjct: 94 KFADLTLEEFTAKYTGANPGPITGL-KNGTGSPPLAAVAGDA-PPAWDWREHGAVTRVKD 151
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAG-CNGGLMDYAFQFII 217
QG CGSCWAFS V AVEGIN I+TG L++LSEQ+++DC AG C+GG YAF + +
Sbjct: 152 QGPCGSCWAFSVVEAVEGINAIMTGNLLTLSEQQVLDCS---GAGDCSGGYTSYAFDYAV 208
Query: 218 QNG---------GMDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVSPFDEMSLKK 265
NG E + Y E +P R A +V ID Y V P DE +LK+
Sbjct: 209 SNGITLDQCFSPPTTGENYFYYPAYEAVQEPCRFDPNKAPIVKIDSYSFVDPNDEEALKQ 268
Query: 266 AVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSW 323
AV Q PVSV IEA F Y+ GVF+G CG+ L+H V+ VGY TE+G YW+V+NSW
Sbjct: 269 AVYSQGPVSVLIEA-SYEFMIYQGGVFSGPCGTELNHAVLVVGYDETEDGTPYWIVKNSW 327
Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
G+ WGE+GY+++ RN + G CGIAM YP+K+
Sbjct: 328 GAGWGESGYIRMIRN-IPAPEGICGIAMYPIYPIKS 362
>gi|150261413|pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|150261414|pdb|2PNS|B Chain B, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|166007115|pdb|2PRE|A Chain A, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
gi|166007116|pdb|2PRE|B Chain B, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
Length = 208
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 126/218 (57%), Positives = 155/218 (71%), Gaps = 11/218 (5%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LPE +DWR+KGAV PVK+QG CGSCWAFSTV+ VE IN+I TG LISLSEQ+LVDC++K
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK- 59
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
N GC GG YA+Q+II NGG+D+E +YPY + C R KVV IDGY+ V +E
Sbjct: 60 NHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPC---RAAKKVVRIDGYKGVPHCNE 116
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
+LKKAVA QP VAI+A + FQHY+SG+F+G CG+ L+HGVV VGY DYW+VR
Sbjct: 117 NALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVGYWK----DYWIVR 172
Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NSWG WGE GY++++R G CGIA YP K
Sbjct: 173 NSWGRYWGEQGYIRMKR---VGGCGLCGIARLPYYPTK 207
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 142/349 (40%), Positives = 202/349 (57%), Gaps = 24/349 (6%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN------IPNSYL 110
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
+ + D++P ++DWRE GAV VK+QG CG CWAFS V ++EG KI TG L+
Sbjct: 111 SPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEF 170
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A V
Sbjct: 171 SEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTA-AVQ 228
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C + ++H V A+GY
Sbjct: 229 ISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCANRINHAVTAIGY 286
Query: 309 GT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GT E G YWL++NSWG+ WGE+G++K+ R+ + G C IA +SYP
Sbjct: 287 GTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNP-AGLCDIAKVSSYP 334
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 141/320 (44%), Positives = 197/320 (61%), Gaps = 26/320 (8%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+ W K G++ N +KR QI+ N + HN++ + TY++G+ +ADL +EE
Sbjct: 26 FHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEE 85
Query: 108 YR----AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
++ + LG+ + +K R S + R+ LP+++DWR+ G V PVK+QGSCG
Sbjct: 86 FKQTVFGVCLGSFNASKPRGGSSFLKMHRFY-----NLPQTIDWRQWGFVTPVKNQGSCG 140
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
SCW+FS+ A+EG N TG L+SLSEQELVDC N GCNGG MD AF++I+ GG+
Sbjct: 141 SCWSFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGI 200
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
+E YPY G +C R N + + GY D+ +E +LK+AVA PVSVAI A
Sbjct: 201 HTEDSYPYEGQVGQC---RANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVAIHAS 257
Query: 280 GRAFQHYESGVFTGE--CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
++FQ Y SGV+ G+ALDH V+ VGYGTE G DYWLV+NSWG WG+ GY+K+ R
Sbjct: 258 DQSFQLYHSGVYNNPYCSGTALDHAVLIVGYGTEYGQDYWLVKNSWGPAWGDQGYIKMSR 317
Query: 338 NLLDTNTGKCGIAMEASYPV 357
N + +CGIA AS+P+
Sbjct: 318 NRYN----QCGIASAASFPL 333
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 142/349 (40%), Positives = 202/349 (57%), Gaps = 24/349 (6%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MSILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN------IPNSYL 110
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
+ + D++P ++DWRE GAV VK+QG CG CWAFS V ++EG KI TG L+
Sbjct: 111 SPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEF 170
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A V
Sbjct: 171 SEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTA-AVQ 228
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C + ++H V A+GY
Sbjct: 229 ISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCANRINHAVTAIGY 286
Query: 309 GT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GT E G YWL++NSWG+ WGE+G++K+ R+ + G C IA +SYP
Sbjct: 287 GTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNP-AGLCDIAKVSSYP 334
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 189/312 (60%), Gaps = 14/312 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRA 110
+ W A H + R +I+ NL I+EHN+ R +Y +G+N+F DL + E+ A
Sbjct: 21 FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
YLG R + AS Y + LP+SVDWR G V PVK+QG CGSCW+FST
Sbjct: 81 KYLGVRFNGVN--ATKSFASSTYLPRM-VSLPDSVDWRTAGIVTPVKNQGQCGSCWSFST 137
Query: 171 VAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
+VEG + TG L+SLSEQ LVDC ++ N GCNGGLMD AF++II+NGG+D+E YP
Sbjct: 138 TGSVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYP 197
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
Y C + N ++ Y+D+ E L+ AVA PVSVAI+A FQ Y +
Sbjct: 198 YTATTGTCKFNAANIG-ATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFT 256
Query: 289 GVFT-GECGSA-LDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV+ +C + LDHGV+AVGYGT G DYWLV+NSWG+ WG+ GY+ + RN +
Sbjct: 257 GVYNEKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADN---- 312
Query: 346 KCGIAMEASYPV 357
+CGIA ASYP+
Sbjct: 313 QCGIATSASYPL 324
>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/313 (45%), Positives = 197/313 (62%), Gaps = 17/313 (5%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
KHGK+ + KR IF DNL +I+E N+ N +YK+G+N++ DLT EE+ A+ L + +
Sbjct: 33 KHGKSYDNKDEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEYTDLTLEEFAALKL-SST 91
Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
D + VA A LP SVDWR+KG +NPVKDQG CGSCWAFS + A+E
Sbjct: 92 DMSEGMGDGFVAG---AGPTTTTLPTSVDWRKKGVLNPVKDQGYCGSCWAFSAIGALEPR 148
Query: 178 NKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
I TG+L+SLSEQ+LVDC N GCNGGLMD AF++ I+ G+D E YPY+G++
Sbjct: 149 YAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEY-IKATGVDKESTYPYVGSDET 207
Query: 237 CDPSRRNA----KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
C + N V + G + + E +L + VA PVS+A+ A ++FQHY+SGV++
Sbjct: 208 CQATVENKTDGLPVGEVTGNQMLHQ-TEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYS 266
Query: 293 -GEC---GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
C G ++DHGVVAVGYGTENG DY+++RNSWG WG++GYV L+R + + G+C
Sbjct: 267 DPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYLKRGV--GSFGQCN 324
Query: 349 IAMEASYPVKNSQ 361
I P S+
Sbjct: 325 IYKYMCVPTLKSR 337
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 134/309 (43%), Positives = 190/309 (61%), Gaps = 10/309 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+++HG+ +RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A
Sbjct: 39 HELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLA 98
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
+ G + L S ++S + + D++P ++DWRE GAV VK QG CG CWAF
Sbjct: 99 KFTGL-NIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAF 157
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
S V ++EG KI TG L+ SEQEL+DC N GC+GG M AF FI +NGG+ E DY
Sbjct: 158 SAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISRESDY 216
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
YLG + C + A V I Y+ V P E SL +AV QPVS+ I A + Q Y
Sbjct: 217 EYLGEQYTCRSQEKTA-AVQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAG 273
Query: 289 GVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
G + G C ++H V A+GYGT E G YWL++NSWG+ WGENG++K+ R+ + +G C
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLC 332
Query: 348 GIAMEASYP 356
IA +SYP
Sbjct: 333 DIAKMSSYP 341
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 147/371 (39%), Positives = 216/371 (58%), Gaps = 31/371 (8%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMT-IYQTWLAKH 59
+A++S LA L+ + + + + + +++ DD M Y+ W A H
Sbjct: 3 IASSSFSLAAILLIIIMYCCPTGLVEAA------RKGPAAAGGGDDSAMRERYEKWAADH 56
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMY---LG 114
G+T +RF++F+ N FID N+ ++ ++ NKFADLTNEE+ Y
Sbjct: 57 GRTYKDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEFAEYYGRPFS 116
Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
T M V + ++P +++WR++GAV VK+Q C SCWAFS VAAV
Sbjct: 117 TPVIGGSGFMYGNVRTS--------DVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAV 168
Query: 175 EGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
EGI++I + L++LS Q+L+DC + N GCN G MD AF++I NGG+ +E DYPY
Sbjct: 169 EGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPY--E 226
Query: 234 ENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
+ R + K V SI G++ V P +E +L AVA QPVSVA++ G+ Q + SGVF
Sbjct: 227 DRALGTCRASGKPVAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVF 286
Query: 292 TG----ECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
C + L+H + AVGYGT E+G YWL++NSWG+DWGE GY+K+ R++ +NTG
Sbjct: 287 GAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARDVA-SNTGL 345
Query: 347 CGIAMEASYPV 357
CG+AM+ SYPV
Sbjct: 346 CGLAMQPSYPV 356
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 147/356 (41%), Positives = 209/356 (58%), Gaps = 29/356 (8%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+S L FL I + A + + N D S W W H K+ H
Sbjct: 1 MSNLTFLVAIGLVACATAAFVK-PTNPDLDSRWLE----------WKIAHTKSYTNDMHE 49
Query: 70 EKRFQIFKDNLRFIDEHN---SLNRT-YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
+R ++++N++ I+ HN SL++ +++G+N++ D+ E R+ G +S
Sbjct: 50 LERRLVWEENVKMINMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYKSSNV----- 104
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+KV + + ++P++VDWR KG V PVK+QG CGSCWAFST ++EG T +L
Sbjct: 105 TKVQGSTFLTPSNIQVPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKL 164
Query: 186 ISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
+SLSEQ LVDC R + N GC GGLMD FQ++I N G+DSE YPY + C + +
Sbjct: 165 VSLSEQNLVDCSRTEGNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETCH-YKASC 223
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVF-TGECGSA-LDH 301
+ G+ DV+ DE +L +AVA PVSVAI+A ++FQ YESGV+ EC S+ LDH
Sbjct: 224 DSAEVTGFTDVTSGDEQALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDH 283
Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GV+ VGYGT+ G DYWLV+NSWG WG +GY+K+ RN + +CGIA ASYP+
Sbjct: 284 GVLVVGYGTDGGKDYWLVKNSWGETWGLSGYIKMSRN----KSNQCGIATSASYPL 335
>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 341
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/317 (45%), Positives = 198/317 (62%), Gaps = 20/317 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W K K+ + +R QI+ +N + + HN L ++Y++G+ +FAD+ NEE
Sbjct: 33 FHAWKLKFEKSYDSESDEAQRKQIWLNNRKHVLVHNILADQGLKSYRLGMTQFADMENEE 92
Query: 108 Y-RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
Y R + G L + S + G LP++VDWR+KG V V++Q CGSCW
Sbjct: 93 YKRLVSQGCLHSFNSSL--PRRGSTFFRLPKGTVLPDTVDWRDKGYVTNVQNQMDCGSCW 150
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
AFS ++EG + TG+L+SLS+Q+LVDC + N GCNGGLMD AFQ+I NGG+D+E
Sbjct: 151 AFSATGSLEGQHFRKTGKLVSLSKQQLVDCSGEFGNEGCNGGLMDSAFQYIQANGGIDTE 210
Query: 226 QDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
+ YPY + KC R N K + GY DV P +E +LK+AVA P+SVAI+A +
Sbjct: 211 ESYPYEAEDGKC---RYNPKSTGATCTGYVDVQPANEETLKEAVATIGPISVAIDAFHPS 267
Query: 283 FQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ YESGV+ +C S LDH V+AVGYGTENG+DYWLV+NS G WGE GY+K+ RN
Sbjct: 268 FQFYESGVYDEPDCSSTMLDHAVLAVGYGTENGLDYWLVKNSAGVGWGEKGYIKMSRN-- 325
Query: 341 DTNTGKCGIAMEASYPV 357
+ +CGIA ASYP+
Sbjct: 326 --KSNQCGIATAASYPL 340
>gi|449469176|ref|XP_004152297.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 340
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 197/319 (61%), Gaps = 12/319 (3%)
Query: 42 WRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
+ ++ +M +Y+ W + H + S KRF+IF+DN + + + N + ++ K+ LN+FA
Sbjct: 31 FESEKSLMQLYKRW-SSHHRISRNAHEMHKRFKIFQDNAKRVFKVNHMGKSLKLRLNQFA 89
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DL+++E+ MY + K+ + + +P S+DWREKGAVN +K+QG
Sbjct: 90 DLSDDEFSMMYGSNITHYNNLHAKAGGRVGGFMYERAMNIPFSIDWREKGAVNAIKNQGL 149
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
C VAAVE I++I T EL+SLSEQE+VDCD K+ GC GG D AF+FI+QNGG
Sbjct: 150 C-------AVAAVESIHQIKTNELVSLSEQEVVDCDYKV-GGCRGGNYDSAFEFIMQNGG 201
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ E++YPY C N++ V+IDGYE V +E +L KAVA QPV+V++ + G
Sbjct: 202 ITIEENYPYFAGNGYCRRRGPNSERVTIDGYECVPQNNEYALMKAVAHQPVAVSVASSGS 261
Query: 282 AFQHYESGVFT--GECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
F+ Y G+ CG +DH VV VGYG++ DYW++RN +G+ WG NGY+K+QR
Sbjct: 262 DFRFYGEGMLREGSFCGYRIDHTVVVVGYGSDEEGDYWIIRNQYGTQWGMNGYMKMQRGT 321
Query: 340 LDTNTGKCGIAMEASYPVK 358
+ G CG+AM+ S+PVK
Sbjct: 322 RNPQ-GVCGMAMQPSFPVK 339
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 137/289 (47%), Positives = 192/289 (66%), Gaps = 22/289 (7%)
Query: 75 IFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY 133
+FK+N+ +I+ +N+ ++ YK +N+FA + K + S + +
Sbjct: 57 VFKENVNYIEACNNAADKPYKRDINQFA-------------PKKRFKGHMCSSIIRITTF 103
Query: 134 ACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS-EQE 192
+ P +VD R+K AV P+KDQG CG WA S VAA EGI+ + G+LI LS EQE
Sbjct: 104 KFENVTATPSTVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQE 163
Query: 193 LVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP--SRRNAKVVSI 249
LVDCD K ++ C GGLMD AF+FIIQN G+++E +YPY G + KC+ + +NA + I
Sbjct: 164 LVDCDTKGVDQDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATI-I 222
Query: 250 DGYEDVSPFDEMS-LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
GYEDV +E + L+KAVA+ PVSVAI+A G FQ Y+SGVFTG CG+ LDHGV AVGY
Sbjct: 223 TGYEDVPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGY 282
Query: 309 G-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
G +++G +YWLV+NS G++WGE GY+++QR +D+ CGIA++ASYP
Sbjct: 283 GVSDDGTEYWLVKNSRGTEWGEEGYIRMQRG-VDSEEALCGIAVQASYP 330
>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 143/313 (45%), Positives = 197/313 (62%), Gaps = 17/313 (5%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
KHGK+ + KR IF DNL +I+E N+ N +YK+G+N++ DLT EE+ A+ L + +
Sbjct: 33 KHGKSYDNKEEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEYTDLTLEEFAALKL-SST 91
Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
D + VA A LP SVDWR+KG +NPVKDQG CGSCWAFS + A+E
Sbjct: 92 DMSEGMGDGFVAG---AGPTTTTLPTSVDWRKKGVLNPVKDQGYCGSCWAFSAIGALEPR 148
Query: 178 NKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
I TG+L+SLSEQ+LVDC N GCNGGLMD AF++ I+ G+D E YPY+G++
Sbjct: 149 YAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEY-IKATGVDKESTYPYVGSDET 207
Query: 237 CDPSRRNA----KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
C + N V + G + + E +L + VA PVS+A+ A ++FQHY+SGV++
Sbjct: 208 CQATVENKTDGLPVGEVTGNQMLHQ-TEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYS 266
Query: 293 -GEC---GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
C G ++DHGVVAVGYGTENG DY+++RNSWG WG++GYV L+R + + G+C
Sbjct: 267 DPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYLKRGV--GSFGQCN 324
Query: 349 IAMEASYPVKNSQ 361
I P S+
Sbjct: 325 IYKYMCVPTLKSR 337
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/311 (45%), Positives = 191/311 (61%), Gaps = 16/311 (5%)
Query: 53 QTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT--YKVGLNKFADLTNEEYRA 110
+ W A+HGK+ R ++ N ++IDEHN Y + +N+F DL N E+++
Sbjct: 23 RAWKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKS 82
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+Y G R R K V + R +LP SVDW +KG V PVK+QG CGSCW+FS
Sbjct: 83 LYNGYRMSNAPRKGKPFVPAARV-----QDLPASVDWSKKGWVTPVKNQGQCGSCWSFSA 137
Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
++EG + TG L+SLSEQ LVDC + N GCNGGLMD AF+++I+N G+D+E YP
Sbjct: 138 TGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYP 197
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
Y ++ C + + +I GY DV+ E L+ AVA PVSVAI+A +FQ Y S
Sbjct: 198 YRAVDSTCKFNTADVG-ATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSS 256
Query: 289 GVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
GV+ C S LDHGV+AVGYGT+ DYWLV+NSWG+ WG +GY+++ RN + K
Sbjct: 257 GVYDPLICSSTNLDHGVLAVGYGTDGSKDYWLVKNSWGASWGMSGYIEMVRN----HNNK 312
Query: 347 CGIAMEASYPV 357
CGIA ASYPV
Sbjct: 313 CGIATSASYPV 323
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 196/319 (61%), Gaps = 25/319 (7%)
Query: 49 MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---LNRTYKVGLNKFADLTN 105
+T ++ + GKT G H ++ IF+ NL I++ N+ +R Y +G+ +FAD++
Sbjct: 163 LTNFEHFKEHFGKTYEGDEHALRQ-GIFQRNLAHIEKFNAEKAASRGYTLGITQFADMST 221
Query: 106 EEYRAMYLGTRSDAK-----RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
E+R YLG R +A R+L + VA R +LPE+VDWR+KGAV+PVKDQG
Sbjct: 222 AEFRQTYLGLRMNASTIAKLRKLQREVVADDR-------DLPEAVDWRDKGAVSPVKDQG 274
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFST A+EG + + GEL+SLSEQ++VDC ++ GCNGG A +++ NG
Sbjct: 275 QCGSCWAFSTSGAIEGQHFLKNGELLSLSEQQMVDCSW-LDFGCNGGQPMLAMEYVRFNG 333
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
G++ E YPY G C +++A I G+ + E +L+KAVA P+SV ++A
Sbjct: 334 GLELETAYPYKGVGGSCHSDKKSA-AAKITGFWMAGFYSESALQKAVAKVGPISVGMDAS 392
Query: 280 GRAFQHYESGVFTGE-CGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
G FQHY+SG++ E C S LDH V+AVGYGT + DYWLV+NSW + WGE GY KL R
Sbjct: 393 GEDFQHYKSGIYNPESCSSIGLDHAVLAVGYGTSDDGDYWLVKNSWNTSWGEKGYFKLPR 452
Query: 338 NLLDTNTGKCGIAMEASYP 356
N KCGIA YP
Sbjct: 453 N----KGNKCGIATTPIYP 467
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 137/307 (44%), Positives = 183/307 (59%), Gaps = 15/307 (4%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
W + HGK+ + + R I++ NL I HN+ + +YK+ +N DLT +E+R YLG
Sbjct: 30 WKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYLG 89
Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
R+ +K Y + ++P SVDW +KG V VK+QG CGSCWAFST +V
Sbjct: 90 VRAHHN----STKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSV 145
Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
EG + TG L+SLSEQ L+DC N GC GGLMD AF++I NGG+D+E YPYLG
Sbjct: 146 EGQHFRKTGSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQ 205
Query: 234 ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT 292
+ C S + + GY+D+ E +L+ AVA PVSVA++A +Q Y SGV+
Sbjct: 206 QGSCHFSSSHVG-ARVTGYQDIPQGSEQALQSAVATVGPVSVAVDA--SQWQFYSSGVYD 262
Query: 293 GE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
C S LDHGV+ +GYG NG DYWLV+NSWG WG GY+ + RN +CGIA
Sbjct: 263 NPYCSSTQLDHGVLVIGYGNYNGQDYWLVKNSWGYSWGVEGYIMMSRN----KNNQCGIA 318
Query: 351 MEASYPV 357
ASYP+
Sbjct: 319 SSASYPL 325
>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
Length = 388
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 191/311 (61%), Gaps = 19/311 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+ W HG++ KR +F +N + + E N+ N + LN+FADLT EE+ A
Sbjct: 46 FSQWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNARNSGLVLALNQFADLTLEEFAAT 105
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
+LG + + R + S +YA ++LP +VDWR+K AV PVK+Q CGSCWAFS
Sbjct: 106 HLG-YNPSLREGKEHTTTSFQYA--DANDLPSTVDWRKKNAVTPVKNQAMCGSCWAFSAT 162
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
AVEGIN I TG+L+SLSEQ+LVDCD + + GC GGLMD+AF +I +NGG+DSE DY Y
Sbjct: 163 GAVEGINAIRTGKLVSLSEQQLVDCDSEKDLGCGGGLMDFAFDYITKNGGIDSEDDYSYW 222
Query: 232 GAENKCDPSRR-NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
G C + + VV+IDG+EDV D +LKKA+A QPVS+ Y SGV
Sbjct: 223 GYGLICQRRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVSL-----------YHSGV 271
Query: 291 FTGE-CGSALDHGVVAVGY--GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
+ C L+HGV+AVGY G++ G +++++NSWG WGE G+ +L + +G C
Sbjct: 272 VGDDACCQDLNHGVLAVGYDDGSKGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEA-SGAC 330
Query: 348 GIAMEASYPVK 358
G+ ASYP+K
Sbjct: 331 GVYKAASYPLK 341
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRF--QIFKDNLRFIDEHNSL----NRTYKVGLNK 99
D + +QT+ +H K N + E+RF +IF +N I +HN L ++K+GLNK
Sbjct: 21 DVIKEEWQTFKMEHRK--NFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNK 78
Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
++D+ E++ G ++ L + Y A ++P+SVDWR+ GAV VKDQ
Sbjct: 79 YSDMLYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQ 138
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS+ AA+EG + G L+SLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 139 GHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 198
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
NGG+D+E+ YPY G ++ C ++ G+ D+ DE +L KAVA PVSVAI+
Sbjct: 199 NGGIDTEKSYPYEGIDDSCHFTKSGVGATDT-GFVDIPQGDEEALMKAVATMGPVSVAID 257
Query: 278 AGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVK 334
A +FQ Y GV+ EC + LDHGV+ VGYGT+ G+DYWLV+NSWG+ WG+ GY+K
Sbjct: 258 ASHESFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIK 317
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
+ RN +CGIA +SYP
Sbjct: 318 MARN----QDNQCGIATASSYPT 336
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 191/314 (60%), Gaps = 15/314 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+ + A H K + R +I+ +N + +HN L ++Y+V +NKF DL + E
Sbjct: 31 WHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHE 90
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R++ G + K++ ++ + A E+PESVDWREKGA+ PVKDQG CGSCWA
Sbjct: 91 FRSIMNGYQH--KKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQCGSCWA 148
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS+ A+EG TG+LISLSEQ L+DC K N GCNGGLMD AFQ+I N G+D+E
Sbjct: 149 FSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTEN 208
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
YPY ++ C + RN V G+ D+ +E LK AVA PVSVAI+A +FQ
Sbjct: 209 TYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQF 267
Query: 286 YESGV-FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y GV + C S LDHGV+ VGYG++NG DYWLV+NSW WG+ GY+K+ RN
Sbjct: 268 YSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARN----R 323
Query: 344 TGKCGIAMEASYPV 357
CG+A ASYP+
Sbjct: 324 KNHCGVATAASYPL 337
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 143/351 (40%), Positives = 202/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + + D++P ++DWRE GAV VK QG CG CWAFS V ++E KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 128/309 (41%), Positives = 191/309 (61%), Gaps = 7/309 (2%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A+HGK E+ QIF++N+ FI+ + ++++ + N+FADL +EE++A
Sbjct: 32 HEKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKA 91
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS- 169
+ L + L + RY ++P S+DWR++G V P+KDQG C SCWAFS
Sbjct: 92 L-LTNGHKKEHSLWTTTETLFRY--DNVTKIPASMDWRKRGVVTPIKDQGKCLSCWAFSL 148
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VA +EG+++I+T EL+ LSEQELVD + + GC G ++ AF+FI + G ++SE YP
Sbjct: 149 CVATIEGLHQIITSELVPLSEQELVDFVKGESEGCYGDYVEDAFKFITKKGRIESETHYP 208
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y G N C + V I GY+ V E +L KAVA+Q VSV++EA AFQ Y SG
Sbjct: 209 YKGVNNTCKVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYSSG 268
Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
+FTG+CG+ DH V YG + +G YWL +NSWG++WGE GY++++ + + G CG
Sbjct: 269 IFTGKCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXD-IPAKEGLCG 327
Query: 349 IAMEASYPV 357
IA YP+
Sbjct: 328 IAKYPYYPI 336
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 156/371 (42%), Positives = 217/371 (58%), Gaps = 33/371 (8%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMT----IYQTWL---AK 58
M+ I+ + L +S++ A +H+S+ + + T ++TW
Sbjct: 1 MYTLIAVICVLTVVSAAPQAVNWFEIQPAKVEHASNLKLQVKASTRLGPYHETWKEFKTL 60
Query: 59 HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLG 114
GK + + KRF IF+D L I+EHN ++Y +G+N+F+D++++EY
Sbjct: 61 FGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEY------ 114
Query: 115 TRSDAKRRLMKSKVASQRYAC----KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
R + RR ++ S+ C K+G +L + VDWR+KG V PVK+QG CGSCW+FST
Sbjct: 115 LRHNGLRR--GNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGSCWSFST 172
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
++EG + TG+LISLSEQ+LVDC N GCNGGLMD AF++I GG++ E DYP
Sbjct: 173 TGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIGGLEGEDDYP 232
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
Y + KC + K G DV DE +LK A+A P+SVAI+A +FQ Y+
Sbjct: 233 YTAKQGKCHLKKSLFKANDT-GCTDVESGDEDALKDALASVGPISVAIDASHASFQSYDG 291
Query: 289 GVF-TGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV+ EC S LDHGV+ VGYGT ENG DYWLV+NSWG WGE GY+K+ RN
Sbjct: 292 GVYDEEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSRN----KDN 347
Query: 346 KCGIAMEASYP 356
+CGIA +ASYP
Sbjct: 348 QCGIATQASYP 358
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 146/313 (46%), Positives = 195/313 (62%), Gaps = 20/313 (6%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYL 113
+H K + R +I+ N I +HN Y++ +NK+ADL +EE+
Sbjct: 33 QHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVN 92
Query: 114 G-TRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
G R+D+K+ L ++ + A E+P +VDWR+KGAV PVKDQG CGSCW+FS
Sbjct: 93 GFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSAT 152
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
A+EG + TG+L+SLSEQ LVDC K N GCNGG+MDYAFQ+I NGG+D+E+ YPY
Sbjct: 153 GALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPY 212
Query: 231 LGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
++ C N K V + GY D+ DE +LKKA+A PVS+AI+A +FQ Y
Sbjct: 213 EAIDDTC---HFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYS 269
Query: 288 SGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
GV + +C S LDHGV+AVGYGT E G DYWLV+NSWG+ WG+ GYVK+ RN +
Sbjct: 270 EGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARN----HD 325
Query: 345 GKCGIAMEASYPV 357
CG+A ASYP+
Sbjct: 326 NHCGVATCASYPL 338
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 135/268 (50%), Positives = 172/268 (64%), Gaps = 14/268 (5%)
Query: 72 RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
RF FK ++ I HN+L N +Y +GLN+FADL+ EE++ Y G + +R +S
Sbjct: 61 RFNQFKASVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYFGCK-HVEREFARSNNLH 119
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE--LISL 188
Q + P S+DWR AV P+KDQG CGSCWAFS ++EG ++ G+ L SL
Sbjct: 120 QEV-----EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGA-WVLQGKHTLTSL 173
Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
SEQ+LVDC NAGCNGGLMDYAF++II N G+ +E YPY G C S KVV
Sbjct: 174 SEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQKS--CTKVV 231
Query: 248 SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
+I G++DV+ DE S AV PVSVAIEA FQ Y SGVF+G CG LDHGV+AV
Sbjct: 232 TISGHKDVASGDEASSLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAV 291
Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVK 334
GYGT DYW+V+NSWG+ WGE+GY++
Sbjct: 292 GYGTTGSQDYWIVKNSWGTSWGESGYIR 319
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 190/320 (59%), Gaps = 20/320 (6%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
+ T ++ + A H K+ RF+IF +N + HN +YK+G+N+F DL
Sbjct: 23 LRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDL 82
Query: 104 TNEEYRAMYLGTRS--DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
E+ M+ G R A R A+ Y+ LP+S+DWREKGAV PVK+QG
Sbjct: 83 LPHEFARMFNGYRGARTAGRGSTFLPPANVNYS-----SLPQSMDWREKGAVTPVKNQGQ 137
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNG 220
CGSCWAFST ++EG + + TG L+SLSEQ LVDC N GC GGLMD AFQ+I NG
Sbjct: 138 CGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYIKANG 197
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
G+D+E+ YPY + +C ++N G+ D+ E LKKAVA PVSVAI+A
Sbjct: 198 GIDTEKSYPYEAEDGECRFKKQNVGATDT-GFVDIEQGSEDDLKKAVATVGPVSVAIDAS 256
Query: 280 GRAFQHYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
+FQ Y GV+ EC S LDHGV+ VGYG E+G YWLV+NSW WG+NGY+K+ R
Sbjct: 257 HSSFQLYSEGVYDETECSSEQLDHGVLVVGYGVEDGKKYWLVKNSWAESWGDNGYIKMSR 316
Query: 338 NLLDTNTGKCGIAMEASYPV 357
+ +CGIA ASYP+
Sbjct: 317 D----KDNQCGIASAASYPL 332
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 146/313 (46%), Positives = 194/313 (61%), Gaps = 20/313 (6%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYL 113
+H K + R +I+ N I +HN Y++ +NK+ADL +EE+
Sbjct: 33 QHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVN 92
Query: 114 G-TRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
G R+D+K+ L ++ + A E+P +VDWR+KGAV PVKDQG CGSCW+FS
Sbjct: 93 GFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSAT 152
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
A+EG + TG+L+SLSEQ LVDC K N GCNGG+MDYAFQ+I NGG+D+E+ YPY
Sbjct: 153 GALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPY 212
Query: 231 LGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYE 287
++ C N K V + GY D+ DE +LKKA+A PVS+AI+A +FQ Y
Sbjct: 213 EAIDDTC---HFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYS 269
Query: 288 SGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
GV + +C S LDHGV+AVGYGT E G DYWLV+NSWG+ WG+ GYVK+ RN
Sbjct: 270 EGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARN----RD 325
Query: 345 GKCGIAMEASYPV 357
CG+A ASYP+
Sbjct: 326 NHCGVATCASYPL 338
>gi|298705581|emb|CBJ28832.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
Length = 553
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 132/331 (39%), Positives = 195/331 (58%), Gaps = 19/331 (5%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFAD 102
D +V ++ W+A+HG T G ++R +IF +N ID HN+ N T+ + N+F+
Sbjct: 36 DAKVANRFRAWMAQHGVTFGTKGEFDRRLKIFAENSDLIDTHNTANDGSTFTLSHNEFSH 95
Query: 103 LTNEEYRAMYLG----------TRSDAKRRLMKSKVASQRYACK-AGDELPESVDWREKG 151
L+ +E++ + G R +RR M+ +R + G E+P+ VDW +G
Sbjct: 96 LSWDEFKETHFGYKRSSDKPKPARQTPERRPMEKVAGGRRRLVELTGSEIPDEVDWVREG 155
Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
AV PV++QG CGSCWAFST+ A+EG + T +LI SE++LVDCD K++ GC GG M+
Sbjct: 156 AVTPVQNQGMCGSCWAFSTIGAMEGAYYLATDDLIKFSEEQLVDCD-KVDKGCFGGDMEQ 214
Query: 212 AFQFIIQNGGMDSEQDYPYLG---AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA 268
AF +I +NGG+ E +YPY+G C + + + + V DE +
Sbjct: 215 AFDWIKENGGVCPEDEYPYVGLWPPFKTCATTCTPVEGSQVKEWAQVKATDEALMTALAT 274
Query: 269 DQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDW 327
P+++AIEA AFQ Y GV+T CG LDHGV+AVGYGT E+G DYW V+NSWG W
Sbjct: 275 VGPIAIAIEADQMAFQFYSDGVYTAPCGDKLDHGVLAVGYGTWEDGTDYWKVKNSWGDSW 334
Query: 328 GENGYVKLQR-NLLDTNTGKCGIAMEASYPV 357
G+ GY+ L+R + + G+CG+ +EA YP+
Sbjct: 335 GQGGYILLERADSEEDEGGQCGLLIEAIYPI 365
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 140/355 (39%), Positives = 207/355 (58%), Gaps = 20/355 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVFLF A S S D D +M ++ W+A++G+ R
Sbjct: 7 LVFLFLFLCVMWASPSAASCD---------EPSDPMMKQFEEWMAEYGRVYKDNDEKMLR 57
Query: 73 FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N+ I+ N+ N +Y +G+N+F D+TN E+ A Y G + + V S
Sbjct: 58 FQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGL--SLPLNIKREPVVS- 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ +P+S+DWR+ GAV VK+QG CGSCWAF+++A VE I KI G L+SLSEQ
Sbjct: 115 -FDDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQ 173
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
+++DC ++ GC GG ++ A+ FII N G+ S YPY A+ C + I
Sbjct: 174 QVLDC--AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCK-TNGVPNSAYITR 230
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
Y V +E ++ AV++QP++ A++A G FQHY+ GVFTG CG+ L+H +V +GYG +
Sbjct: 231 YTYVQRNNERNMMYAVSNQPIAAALDASGN-FQHYKRGVFTGPCGTRLNHAIVIIGYGQD 289
Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
+G +W+VRNSWG+ WGE GY++L R+ + ++ G CGIAM+ YP S S +
Sbjct: 290 SSGKKFWIVRNSWGAGWGEGGYIRLARD-VSSSFGLCGIAMDPLYPTLQSGPSVE 343
>gi|219362839|ref|NP_001136636.1| uncharacterized protein LOC100216764 precursor [Zea mays]
gi|194696462|gb|ACF82315.1| unknown [Zea mays]
gi|413934556|gb|AFW69107.1| hypothetical protein ZEAMMB73_554980 [Zea mays]
Length = 361
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 153/361 (42%), Positives = 212/361 (58%), Gaps = 26/361 (7%)
Query: 11 STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
+ LV + +S++ AA S I Y HD +S ++ + +Y+ W A H + +G
Sbjct: 13 AALVVVIALSTTPAA--SAIDY-TEHDLAS----EESLWALYERWCA-HYNMARDLGEKT 64
Query: 71 KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY------RAMYLGTR--SDAKRR 122
+RF +FK+N I EHN N TY +GLN+F+D+T+EE+ R ++ + SD +
Sbjct: 65 RRFNLFKENAHRIYEHNQGNATYTLGLNRFSDMTDEEFSRSPYGRCLFAPVQRISDGENE 124
Query: 123 LMKS----KVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFSTVAAVEGI 177
++ A LP SVDWR + +V VKDQG +CGSCWAF+ +AAVEGI
Sbjct: 125 ELQQHEDVSFNLTHGGATAALGLPPSVDWRGR-SVTRVKDQGLTCGSCWAFAAIAAVEGI 183
Query: 178 NKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
N I T L++LSEQ+LVDCD ++ GC GG + A FI++N G+ E YPY+G + +C
Sbjct: 184 NAIRTWSLVTLSEQQLVDCD-NVDHGCAGGWIPSALDFIVRNRGIVPEGTYPYIGTQGRC 242
Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
A V+IDGY V PFD +L AVA QPV+VA+E+ AF+HY+ GVF G CG
Sbjct: 243 --RHVMAPPVTIDGYRRVLPFDVNALMSAVAAQPVAVAMESSAWAFRHYQGGVFNGNCGG 300
Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
L H VGYG G +W+V+NSWG WGE GYV++ RN + G CGI + YPV
Sbjct: 301 RLGHAAAVVGYGDGAGGPFWIVKNSWGPKWGEGGYVRISRNAPN-RLGICGILTQPLYPV 359
Query: 358 K 358
K
Sbjct: 360 K 360
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 144/314 (45%), Positives = 193/314 (61%), Gaps = 21/314 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN--SLNRTYKVGLNKFADLTNEEYR 109
++ W +H K + R++I++ N + I+ HN S + +G+NKF DL + E+
Sbjct: 22 WEDWKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFA 81
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
M+ G A+ K VA Y + +VDWR KGAV VK+QG CGSCWAFS
Sbjct: 82 EMFNGYMMQARSNSTKVFVADPNY------KADPTVDWRTKGAVTGVKNQGQCGSCWAFS 135
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
T ++EG + + TG+L+SLSEQ LVDC ++ N GCNGGLMD AF++I +NGG+D+E Y
Sbjct: 136 TTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTEASY 195
Query: 229 PYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
PY + +C R A V + GY D+ DE +L +AV PVSVAI+A +FQ
Sbjct: 196 PYQAHDERC---RFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQL 252
Query: 286 YESGV-FTGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y SGV + EC +ALDHGV+A+GYGTE G DYWLV+NSWG+DWG GY+ + RN
Sbjct: 253 YRSGVYYERECSQTALDHGVLAIGYGTEGGSDYWLVKNSWGTDWGMEGYIMMSRN----R 308
Query: 344 TGKCGIAMEASYPV 357
CGIA EASYP
Sbjct: 309 NNNCGIATEASYPT 322
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 194/314 (61%), Gaps = 21/314 (6%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYL 113
+H K + R +I+ N I +HN +++ +NK+ DL +EE+
Sbjct: 33 QHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLHEEFVQTLN 92
Query: 114 G-TRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
G R++AK+ ++K + Y A E+P++VDWREKGAV PVKDQG CGSCW+FS
Sbjct: 93 GFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHCGSCWSFSA 152
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
A+EG + TG+L+SLSEQ LVDC K N GCNGG+MD+AFQ+I NGG+D+E+ YP
Sbjct: 153 TGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNGGIDTEKAYP 212
Query: 230 YLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHY 286
Y ++ C N K V + G+ D+ DE +L KA+A PVSVAI+A +FQ Y
Sbjct: 213 YEAIDDTC---HYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDASHESFQFY 269
Query: 287 ESGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
GV + +C S LDHGV+AVGYGT E G DYWLV+NSWG+ WG+ GYVK+ RN
Sbjct: 270 SEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARN----R 325
Query: 344 TGKCGIAMEASYPV 357
CGIA ASYP+
Sbjct: 326 DNHCGIATAASYPL 339
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 143/355 (40%), Positives = 204/355 (57%), Gaps = 22/355 (6%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M L+I+ + L +S S + ++ S+ D W + ++ ++
Sbjct: 1 MRLSITLIFTLIVLSISFISAGNVFSHKQYQDSFIDWMRSNNKAYTHKEFMP-------- 52
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
R++ FK N+ ++ NS +GLN+ ADL+NEEYR YLGTR+ K
Sbjct: 53 ------RYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYH 106
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+ R + + P +VDWREK AV PVKDQG CGSC++FST +VEG+ I TG+L
Sbjct: 107 KRNLGLRLN-RPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKL 165
Query: 186 ISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
+SLSEQ ++DC N GCNGGLM AF++II+N G++SE+ YPY N + +
Sbjct: 166 VSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGS 225
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA--LDHG 302
I Y+++ DE L+ A+ PVSVAI+A +FQ Y +GV+ S+ LDHG
Sbjct: 226 VAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHG 285
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V+AVG GT+NG DY++V+NSWG WG NGY+ + RN D N CGI+ ASYP+
Sbjct: 286 VLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARN-KDNN---CGISTMASYPI 336
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 143/351 (40%), Positives = 201/351 (57%), Gaps = 21/351 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
++ L+ LFF+ IS N S + V ++ W+++HG+
Sbjct: 8 MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+RF IFK+N++FI+ N N +YK+G+N+FAD+T++E+ A + G + L S +
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115
Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+S + D++P ++DW E GAV VK QG CG CWAFS V ++EG KI TG L+
Sbjct: 116 SSTELKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175
Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
SEQEL+DC N GCNGG M AF FI +NGG+ E DY YLG + C + A
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V I Y+ V P E SL +AV QPVS+ I A + Q Y G + G C ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291
Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
GYGT E G YWL++NSWG+ WGENG++K+ R+ + G C IA +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341
>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
Length = 215
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 124/211 (58%), Positives = 156/211 (73%), Gaps = 3/211 (1%)
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
G CGSCWAFSTV VEGINKI TG+L+SLSEQELVDC+ N GCNGGLM+ A++FI ++
Sbjct: 1 GKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETD-NEGCNGGLMENAYEFIKKS 59
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+ +E+ YPY + CD S+ NA V+IDG+E V DE +L KAVA+QPVSVAI+A
Sbjct: 60 GGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDAS 119
Query: 280 GRAFQHYESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQR 337
G Q Y GV+TG+ CG+ LDHGV VGYGT +G YW+V+NSWG+ WGE GY+++QR
Sbjct: 120 GSDMQFYSEGVYTGDSCGNELDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQR 179
Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
+ G CGIAMEASYP+K S ++ KP P
Sbjct: 180 GVDAAEGGVCGIAMEASYPLKLSSHNPKPSP 210
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 193/314 (61%), Gaps = 17/314 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+++W K+GK+ G G R ++++ NL+ + +HN L Y++G+N +ADL NEE
Sbjct: 19 WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ A+ + K K ++Q + G LP SVDWR +G V PVKDQG CGSCW
Sbjct: 79 FMAL----KGSGGLLQAKDKSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWT 134
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS ++EG + TG L+SLSEQ+LVDC R N GCNGGLM+ A+ +I GG++ E
Sbjct: 135 FSATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYIKGVGGVELES 194
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
YPY + +C R V + GY + DE +L +AV PV+V+I+A G +FQ
Sbjct: 195 AYPYTARDGRCKFDRSKV-VATCKGYVVIPVGDEQALMQAVGTIGPVAVSIDASGYSFQL 253
Query: 286 YESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
YESGV+ C S LDHGV+AVGYGTE G +YWLV+NSWG WG+ GY+K+ + D N
Sbjct: 254 YESGVYDFRRCSSTNLDHGVLAVGYGTEGGQNYWLVKNSWGPGWGDQGYIKMSK---DKN 310
Query: 344 TGKCGIAMEASYPV 357
+CGIA ++ YP+
Sbjct: 311 N-QCGIATDSCYPL 323
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 190/314 (60%), Gaps = 17/314 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+++W K+GK+ G G R ++++ NL+ + +HN L Y++G+N +ADL NEE
Sbjct: 19 WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ A+ + + K + ++Q + G LP SVDWR +G V PVKDQG CGSCW+
Sbjct: 79 FMAL----KGSSGILQAKDQSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWS 134
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS ++EG + TG L+SLSEQ+LVDC N GC+GGLM+ A+ +I GG+ E
Sbjct: 135 FSATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLES 194
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
YPY +C + A V + G+ + DE SL +AV PV+VAI+A G FQ
Sbjct: 195 AYPYTAQNGRCHFDQSKA-VATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQL 253
Query: 286 YESGVF-TGEC-GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
YESGV+ C S+LDHGV+A GYGTE G DYWLV+NSWG WG GY+K+ RN
Sbjct: 254 YESGVYDRSRCSSSSLDHGVLAAGYGTEGGNDYWLVKNSWGPGWGAQGYIKMSRN----K 309
Query: 344 TGKCGIAMEASYPV 357
+ +CGIA A YP+
Sbjct: 310 SNQCGIATMACYPL 323
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 151/331 (45%), Positives = 200/331 (60%), Gaps = 39/331 (11%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEK----RFQIFKDNLRFID----EHNSLNRTYKVGL 97
DE +++ W +K ++EK R +++ NL+ I+ EH+ TY +G+
Sbjct: 25 DEHWNLWKDWHSKK--------YHEKEEGWRRMVWEKNLKKIELHNLEHSMGKHTYSLGM 76
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
N F D+T+EE+R + G + ++R+L S + E P SVDWR+KG V PVK
Sbjct: 77 NHFGDMTHEEFRQIMNGYKLKSQRKLRGSLFMEPNFL-----EAPRSVDWRDKGYVTPVK 131
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFI 216
DQG CGSCWAFST A+EG + TG L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191
Query: 217 IQNGGMDSEQDYPYLGA-ENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
NGG+DSE+ YPYLG E C DPS +A G+ DV E +L KAVA PV
Sbjct: 192 KDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDT---GFVDVPSGSERALMKAVASVGPV 248
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
SVAI+AG +FQ Y SG+ + EC S LDHGV+ VGYG E +G YW+V+NSW +
Sbjct: 249 SVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKDVDGKKYWIVKNSWSEN 308
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG+ GY+ + ++ CGIA ASYP+
Sbjct: 309 WGDKGYIYMAKD----KKNHCGIATAASYPL 335
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/309 (46%), Positives = 202/309 (65%), Gaps = 22/309 (7%)
Query: 59 HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLG 114
H KT + + +RF+IF++N++ I+EHN L ++Y +G+N+F+DL +EE+ Y G
Sbjct: 63 HDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEF-VKYNG 121
Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
K+ +K S A E P+SVDWR+KG V VK+QG CGSCW+FST ++
Sbjct: 122 L----KKTSLKDGGCSSYLAANNLVE-PDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSL 176
Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
EG + +G+L+SLSE +LVDC + N GCNGGLMD AF++I GG++SE+DYPY
Sbjct: 177 EGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYKPK 236
Query: 234 ENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVF 291
+ C + KV + D G DV E +LKKAV++ PVSVAI+A +FQ Y GV+
Sbjct: 237 QGTC--KFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGGVY 294
Query: 292 -TGECGS-ALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
EC S LDHGV+ VGYGT++ G DYW+V+NSWG++WGE+GYVK+ RN +CG
Sbjct: 295 DEPECSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRN----KKNQCG 350
Query: 349 IAMEASYPV 357
IA +ASYP+
Sbjct: 351 IATQASYPL 359
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/320 (45%), Positives = 191/320 (59%), Gaps = 15/320 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS----LNRTYKVGLNKFADLTNEE 107
+ T+ +H K R +IF DN I +HNS +YK+ +NK+ D+ + E
Sbjct: 34 WMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHE 93
Query: 108 YRAMYLGTRSDAKRRLMKSKV-ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
+ + G +L ++ + A LP+ VDWR++GAV PVKDQG CGSCW
Sbjct: 94 FVNILNGFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRKEGAVTPVKDQGHCGSCW 153
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
+FS A+EG + TG L+SLSEQ L+DC K N GCNGGLMD AFQ+I N G+D+E
Sbjct: 154 SFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTE 213
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY +KC + N+ + + GY D+ DE LK AVA PVSVAI+A ++FQ
Sbjct: 214 ASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGDEKLLKAAVATIGPVSVAIDASHQSFQ 272
Query: 285 HYESGV-FTGECGS-ALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Y GV + EC S LDHGV+ +GYGT ENG DYWLV+NSWG WG NGY+K+ RN L+
Sbjct: 273 FYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNNGYIKMARNKLN 332
Query: 342 TNTGKCGIAMEASYPVKNSQ 361
CGIA ASYP+ S+
Sbjct: 333 ----HCGIASSASYPLVGSK 348
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 192/310 (61%), Gaps = 12/310 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ W + HGK R I+++NL+ I HN ++K+ +N D+T+ E
Sbjct: 29 WKAWKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKHSFKLAMNHLGDMTSLEISQT 88
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
LG + ++ +S+ + A ++ +S+DWR KG V PVK+QG CGSCWAFST
Sbjct: 89 LLGLKL---KKHAESQPKGATFLPPANVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTT 145
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
A+EG + TG+L+SLSEQ LVDC K N GC GGLMD AFQ+I +NGG+D+E+ YPY
Sbjct: 146 GALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPY 205
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
L + C ++A G+ D+ DE +L++A+A P+S+AI+A F Y G
Sbjct: 206 LAKDGVCH-YNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQG 264
Query: 290 VF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
V+ +C S LDHGV+AVGYGT++G DYWLV+NSWG WGE GY+K+ RN D KC
Sbjct: 265 VYDDPDCSSTRLDHGVLAVGYGTDDGKDYWLVKNSWGPSWGEEGYIKIARNDHD----KC 320
Query: 348 GIAMEASYPV 357
G+A +ASYP+
Sbjct: 321 GVASKASYPL 330
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 154/358 (43%), Positives = 202/358 (56%), Gaps = 28/358 (7%)
Query: 14 VFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRF 73
+FLF I + A +I ++ V + T+ +H K R
Sbjct: 3 LFLFLIVAVLATAQAISFFE-------------LVNQEWTTFKMEHNKVYKNDVEERFRM 49
Query: 74 QIFKDNLRFIDEHNS----LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
+IF DN I +HN +YK+ +NK+ D+ + E+ G +L ++
Sbjct: 50 KIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLP 109
Query: 130 -SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
+ + A LP++VDWRE GAV PVKDQG CGSCW+FS A+EG + TG LI L
Sbjct: 110 IAASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPL 169
Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
SEQ L+DC K N GCNGGLMD AFQ+I N G+D+E YPY +KC + N+
Sbjct: 170 SEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGAR 229
Query: 248 SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVV 304
+ GY D+ +E LK AVA PVSVAI+A ++FQ Y GV + EC S LDHGV+
Sbjct: 230 DV-GYVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVL 288
Query: 305 AVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
AVGYGT ENG DYWLV+NSWG WG+NGY+K+ RN L+ CGIA ASYP+ SQ
Sbjct: 289 AVGYGTDENGQDYWLVKNSWGETWGDNGYIKMARNKLN----HCGIASTASYPLVGSQ 342
>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
Length = 294
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 147/297 (49%), Positives = 187/297 (62%), Gaps = 20/297 (6%)
Query: 72 RFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEY-RAMYLGTRSDAKRRLMKS 126
R QI+ N + + HN L ++Y++G+ +FAD+ NEEY R + LG +
Sbjct: 6 RRQIWLSNRKLVLVHNILADQGIKSYRLGMTQFADMDNEEYKRLISLGCLGAFNASAPRK 65
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
A R A G LP +VDWR+KG V VKDQ CGSCWAFS ++EG N TG+L+
Sbjct: 66 GSAFFRLA--EGTPLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNYRKTGKLV 123
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRRN 243
SLSEQ+LVDC N GC GGLMD AF++I +NGG+D+E+ YPY + KC P
Sbjct: 124 SLSEQQLVDCSGDYGNMGCGGGLMDSAFKYIQENGGIDTEESYPYEAEDGKCRFKPQNIG 183
Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTG-ECGSA-LD 300
AK GY DV+ DE +LK+AVA PVSVAI+A +FQ YESGV+ EC S LD
Sbjct: 184 AKCT---GYVDVTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDELECSSEDLD 240
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
HGV+AVGYGT+NG DYWLV+NSWG WG+ GY+ + RN +CGIA ASYP+
Sbjct: 241 HGVLAVGYGTDNGQDYWLVKNSWGLGWGQKGYIMMSRN----KHNQCGIASMASYPL 293
>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 191/315 (60%), Gaps = 16/315 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W + G++ N +R +I+ N R + HN + ++Y++G+ FAD+ NEE
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 108 Y-RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
Y R + G L + A R G +LP SVDWREKG V VKDQ CGSCW
Sbjct: 86 YKRQISQGCLGSFNASLPRRGSAYLRLP--EGADLPNSVDWREKGYVTEVKDQKQCGSCW 143
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
AFST ++EG TG+L+SLSEQ+LVDC N GC GGLMD AF++I NGG+D+E
Sbjct: 144 AFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTE 203
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY + +C + N + GY DV DE +LK+AVA PVSVAI+A +FQ
Sbjct: 204 DSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEAVATIGPVSVAIDASHSSFQ 262
Query: 285 HYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
YESGV+ EC S+ LDHGV+AVGYG++NG DYWLV+NSWG WG GY+ + RN
Sbjct: 263 LYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRN---- 318
Query: 343 NTGKCGIAMEASYPV 357
+CGIA +SYP+
Sbjct: 319 KHNQCGIATASSYPL 333
>gi|403344237|gb|EJY71457.1| Cathepsin L [Oxytricha trifallax]
Length = 341
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/309 (46%), Positives = 187/309 (60%), Gaps = 16/309 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
+ +LAK+GK+ E R +K N+ I HNS N T+ + NKF D T ++YR
Sbjct: 43 FANFLAKYGKSYGTREEFEFRLNQYKTNMALISAHNSKNGETFTLAANKFTDYTPQQYRK 102
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ LG +S K++ +++YA ++P SVDWREK AV PVKDQG CGSCWAFST
Sbjct: 103 L-LGYKSK------KNQNDAKKYATFNLTDVPSSVDWREKNAVTPVKDQGQCGSCWAFST 155
Query: 171 VAAVEGINKIVTGELISLSEQELVDCD--RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
++EG + I +G L S SEQ+LVDCD + N GCNGG M A + +N +D E DY
Sbjct: 156 TGSLEGRDAIASGVLQSYSEQQLVDCDFSKDGNQGCNGGDMGLAMAYSAKN-PLDLESDY 214
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY G + C + K + G V P LK A+A+ PVSVAIEA FQ Y
Sbjct: 215 PYEGVDGTCRAKQGQGKSKN-SGSTYVKPNSPDDLKAAIAEGPVSVAIEADSLFFQFYSK 273
Query: 289 GVFTGE-CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
GVF+ + CG+ LDHGV+AVGYGTENG DY+LV+NSW S WG +GY+K+ + N G C
Sbjct: 274 GVFSSKYCGTNLDHGVLAVGYGTENGSDYYLVKNSWSSGWGLDGYIKIG---VAANEGIC 330
Query: 348 GIAMEASYP 356
GI ME +P
Sbjct: 331 GIQMEPVFP 339
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 187/316 (59%), Gaps = 23/316 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+Q + A+HG+ + R +F+ N +FID+HN+ T+ + +N+F D+T+EE
Sbjct: 23 WQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEE 82
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQGSCGSCW 166
A G RR KA DE LPE VDWR KGAV PVKDQ CGSCW
Sbjct: 83 IVATMNGFLGAPTRRPAA--------VLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCW 134
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
AFST ++EG + + G+L+SLSEQ LVDC K N GC GGLMD AF++I N G+D+E
Sbjct: 135 AFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTE 194
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY + KC N GY DV E +LKKAVA P+SV I+A F
Sbjct: 195 DSYPYEAQDGKCRFDASNVGATDT-GYVDVEHGSESALKKAVATIGPISVGIDASQSTFH 253
Query: 285 HYESGVFTGE-CGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Y +GV+ + C S LDHGV+AVGYG+ ENG D+WLV+NSW + WG+ GY+K+ RN
Sbjct: 254 FYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRN--- 310
Query: 342 TNTGKCGIAMEASYPV 357
CGIA +ASYP+
Sbjct: 311 -RNNNCGIASQASYPL 325
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 189/318 (59%), Gaps = 26/318 (8%)
Query: 53 QTWLA---KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTN 105
Q WLA HGK R ++F DN + IDEHN+ +YK+ +N DL
Sbjct: 11 QEWLAFKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMV 70
Query: 106 EEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
E++A+ G ++ R K V S + LP+SVDWR++GAV PVKDQG CGS
Sbjct: 71 HEFKALMNGFKKTPNAERNGKIYVPSN-------ENLPKSVDWRQRGAVTPVKDQGHCGS 123
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMD 223
CW+FS ++EG + TG L+SLSEQ LVDC + N+GC GGLM+ AFQ++ N G+D
Sbjct: 124 CWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGID 183
Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGR 281
+E YPY EN C + KV D GY D+ E L+ AVA P+SV I+A
Sbjct: 184 TEASYPYEARENNC--RFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHE 241
Query: 282 AFQHYESGVFTGE-CG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
+FQ Y GV+ + C S LDHGV+ VGYGTENG DYWLV+NSWG WGE+GY+K+ RN
Sbjct: 242 SFQFYSEGVYKEQYCSPSQLDHGVLTVGYGTENGQDYWLVKNSWGPSWGESGYIKIARN- 300
Query: 340 LDTNTGKCGIAMEASYPV 357
+ CGIA ASYPV
Sbjct: 301 ---HKNHCGIASMASYPV 315
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 192/315 (60%), Gaps = 20/315 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
++ W H K + +R +I++DNL+ + +HN+ + +Y +G+NK+ADL EE
Sbjct: 28 WEAWKQTHSKQYTKEEEDNRR-KIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYADLRGEE 86
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ M G + DA R K S A + P+SVDWR++G V PVKDQG CGSCWA
Sbjct: 87 FVQMMNGLKFDASRERQGIKFLSY-----AKFQAPDSVDWRDEGYVTPVKDQGQCGSCWA 141
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FST ++EG + TG L SLSEQ LVDC N GC GGLMDYAFQ+I N G+D+E
Sbjct: 142 FSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDTED 201
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKA-VADQPVSVAIEAGGRAFQH 285
YPY ++ C S N GY DV DE +LK+A A+ P+SVAI+A +FQ
Sbjct: 202 KYPYEAEDDTCRFSPDNVGATD-SGYVDVDSGDEDALKEACAANGPISVAIDASHESFQL 260
Query: 286 YESGVFTGECGSA--LDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
YESGV+ E S+ LDHGV+ VGYGT++ G DYW+V+NSWG WG+ GY+ + RN
Sbjct: 261 YESGVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRN---- 316
Query: 343 NTGKCGIAMEASYPV 357
+CGIA ASYP
Sbjct: 317 KDNQCGIATSASYPT 331
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 187/316 (59%), Gaps = 23/316 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+Q + A+HG+ + R +F+ N +FID+HN+ T+ + +N+F D+T+EE
Sbjct: 22 WQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEE 81
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQGSCGSCW 166
A G RR KA DE LPE VDWR KGAV PVKDQ CGSCW
Sbjct: 82 IVATMNGFLGAPTRRPAA--------VLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCW 133
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
AFST ++EG + + G+L+SLSEQ LVDC K N GC GGLMD AF++I N G+D+E
Sbjct: 134 AFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKGIDTE 193
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY + KC N GY DV E +LKKAVA P+SV I+A F
Sbjct: 194 DSYPYEAQDGKCRFDASNVGATDT-GYVDVEHGSESALKKAVATIGPISVGIDASQSTFH 252
Query: 285 HYESGVFTGE-CGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Y +GV+ + C S LDHGV+AVGYG+ ENG D+WLV+NSW + WG+ GY+K+ RN
Sbjct: 253 FYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRN--- 309
Query: 342 TNTGKCGIAMEASYPV 357
CGIA +ASYP+
Sbjct: 310 -RNNNCGIASQASYPL 324
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 185/308 (60%), Gaps = 19/308 (6%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
W H K + G R+ I+KDN R I EHN + + +N+F D+TN E++A
Sbjct: 30 WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEFKAF--- 86
Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
L V + P++VDWR +G V PVKDQG CGSCWAFST ++
Sbjct: 87 -----NGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSL 141
Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
EG + TG+L+SLSEQ LVDC N GCNGGLMD AF +I +N G+DSE YPY
Sbjct: 142 EGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGIDSEASYPYTAE 201
Query: 234 ENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVF 291
+ KC + V + D G+ D+ +E LK+AVA P+SVAI+A +FQ Y SGV+
Sbjct: 202 DGKC--VFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVY 259
Query: 292 T-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
C S LDHGV+ VGYGTE+G DYWLV+NSW + WG+ GY+K++RN + +CGI
Sbjct: 260 NEPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAKN----QCGI 315
Query: 350 AMEASYPV 357
A +ASYP+
Sbjct: 316 ATKASYPL 323
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 142/322 (44%), Positives = 195/322 (60%), Gaps = 15/322 (4%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT----YKVGLNKFA 101
D V + ++ +H K + R +IF +N + +HN L +K+GLNK+A
Sbjct: 21 DLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYA 80
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVA-SQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+ + E+ + G L S + + R+ A +LP++VDWR+KGAV VKDQG
Sbjct: 81 DMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQG 140
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQN 219
CGSCW+FS ++EG + TG+L+SLSEQ LVDC R N GCNGGLMD AF++I N
Sbjct: 141 HCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDN 200
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
GG+D+E+ YPYL + KC +N+ G+ D+ +E LK AVA PVS+AI+A
Sbjct: 201 GGIDTEKSYPYLAEDEKCHYKAQNSGATD-KGFVDIEEANEDDLKAAVATVGPVSIAIDA 259
Query: 279 GGRAFQHYESGVFTG-ECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKL 335
FQ Y GV++ EC S LDHGV+ VGYGT ++G DYWLV+NSWG WG NGY+K+
Sbjct: 260 SHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQDYWLVKNSWGPSWGLNGYIKM 319
Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
RN CG+A +ASYP+
Sbjct: 320 ARN----QDNMCGVASQASYPL 337
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 191/314 (60%), Gaps = 10/314 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+Q W A++ +T ++RF ++ +N++FI+ N +Y++G N+FADLT EE++
Sbjct: 37 FQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENQFADLTEEEFKDT 96
Query: 112 YLGTRSD--AKRRLMKSKVASQRYACKAG----DELPESVDWREKGAVNPVKDQGSCGSC 165
YL + + M V + A +G +E P SVDWR KGAV PVK Q CGSC
Sbjct: 97 YLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQQHCGSC 156
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLM-DYAFQFIIQNGGMDS 224
WAF+ VA++EG++KI TG L+SLSEQE+VDCDR N G A +++ +NGG+ +
Sbjct: 157 WAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNGGLTT 216
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E DYPY+G + +C + I G + V +E +L+ AVA +PV+V+I A RAFQ
Sbjct: 217 ESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINA-SRAFQ 275
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y+ G+F+G C + +H V VGYG +G YW+V+NSWG WGE GYV++QR +
Sbjct: 276 FYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQRG-VRAR 334
Query: 344 TGKCGIAMEASYPV 357
G CGIA+ Y V
Sbjct: 335 EGVCGIAIAPFYAV 348
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 194/320 (60%), Gaps = 23/320 (7%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
+ +Q + A++GK + R +++ N FI+ HN ++ + +N+F D+
Sbjct: 18 TLNEWQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDM 77
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
T EE A G S K KV DELP++VDWR+KGAV PVKDQ +CG
Sbjct: 78 TTEEINAAMNGFLSAGK------KVPRGTMYQPLVDELPDTVDWRDKGAVTPVKDQKACG 131
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNGGM 222
SCWAFS ++EG + + TG+L+SLSEQ LVDC D+ N GC GGLMD AF++I N G+
Sbjct: 132 SCWAFSATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGI 191
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAG 279
D+E+ YPY E K P R N+ V ++ Y D+ E L+KAVA++ PVSVAI+A
Sbjct: 192 DTEESYPY---EAKNGPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDAS 248
Query: 280 GRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
F Y G++ E C S+ LDHGV+AVGYGT++ DYWLV+NSW WG++GY+K+ R
Sbjct: 249 TSTFHFYSRGIYYDEKCSSSFLDHGVLAVGYGTDDSSDYWLVKNSWNETWGDSGYIKMSR 308
Query: 338 NLLDTNTGKCGIAMEASYPV 357
N CGIA +ASYPV
Sbjct: 309 N----RNNNCGIASQASYPV 324
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/354 (41%), Positives = 208/354 (58%), Gaps = 38/354 (10%)
Query: 14 VFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRF 73
+ LFF+ SA ++ + S W +++T K T+ + R
Sbjct: 7 ICLFFVCVYSAPTFNV-------ELDSHW-------ALFKTTFGKQYSTAEEI----TRR 48
Query: 74 QIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
++ N+ I +HN + TY +GLN +ADLTN E+ + G R +A ++K A
Sbjct: 49 LAWEANVAIIRQHNLEHDLGLHTYTLGLNNYADLTNAEFNQVMNGLRVNAS----QTKSA 104
Query: 130 SQR-YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
++R Y G ELP SVDWR KG V P+KDQG CGSCWAFS+ ++EG + TG+L+SL
Sbjct: 105 NRRTYVAPVGVELPTSVDWRTKGYVTPIKDQGQCGSCWAFSSTGSLEGQHFAKTGQLVSL 164
Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
SEQ L DC +K N GCNGGLMD AF +I +N G+D+E YPY + KC + A V
Sbjct: 165 SEQNLTDCSQKQGNMGCNGGLMDQAFTYIKENNGIDTESSYPYKAVDEKCH--FKAADVG 222
Query: 248 SID-GYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSA--LDHGV 303
+ D GY D++ DE +L+ A+A P+SVAI+A +FQ Y SG + SA LDHGV
Sbjct: 223 ATDTGYTDIAQQDENALQSAIATVGPISVAIDASHSSFQLYRSGAYNERACSATQLDHGV 282
Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+AVGY +E+G DY++V+NSWG+ WG+ GY+ + RN +CGIA ++YP
Sbjct: 283 LAVGYDSEDGKDYYIVKNSWGTSWGQKGYIWMTRN----KNNQCGIATMSTYPT 332
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 148/357 (41%), Positives = 203/357 (56%), Gaps = 35/357 (9%)
Query: 12 TLVFLF---FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
TL+FL F+ S+A ++ + D H + A H K
Sbjct: 1 TLIFLLGAVFVQLSAALSLTNLLADEWH-----------------LFKATHKKEYPSQLE 43
Query: 69 NEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
+ R +I+ +N + +HN L ++Y+V +NKF DL + E+R++ G + K++
Sbjct: 44 EKFRMKIYLENKHKVAKHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQH--KKQNS 101
Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
++ + A E+PESVDWREKGA+ PVKDQG CG CWAFS+ A+EG TG+
Sbjct: 102 SRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGK 161
Query: 185 LISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN 243
L+SL EQ L+DC K N GCNGGLMD AFQ+I N G+D+E YPY ++ C + RN
Sbjct: 162 LVSLREQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRN 221
Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGS-ALD 300
V G+ D+ +E LK AVA PVSVAI+A +FQ Y GV + C S LD
Sbjct: 222 RGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLD 280
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
HGV+ VGYG++NG DYWLV+NSW WG+ GY+K+ RN CG+A ASYP+
Sbjct: 281 HGVLVVGYGSDNGKDYWLVKNSWSEHWGDQGYIKIARN----RKNHCGVATAASYPL 333
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 140/314 (44%), Positives = 189/314 (60%), Gaps = 15/314 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+ + A H K + R +I+ +N + +HN L ++Y+V +NKF DL + E
Sbjct: 31 WHLFKATHKKEYPSQLEEKLRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHE 90
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R++ G + K++ ++ + A E+PESVDWREKGA+ PVKDQG CGSCWA
Sbjct: 91 FRSIMNGYQH--KKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQCGSCWA 148
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS+ A+EG TG+L+SLSEQ L+DC K N GCNGGLMD AFQ+I N G+D+E
Sbjct: 149 FSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTEN 208
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
YPY + C + RN V G+ D+ +E LK AVA PVSVAI+A +FQ
Sbjct: 209 TYPYEAEDGVCRYNPRNRGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQF 267
Query: 286 YESG-VFTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y G + C S LDHGV+ VGYG++NG DYWLV+NSW WG+ GY+K+ RN
Sbjct: 268 YSKGXYYEPSCDSDDLDHGVLVVGYGSDNGEDYWLVKNSWSEHWGDEGYIKIARN----R 323
Query: 344 TGKCGIAMEASYPV 357
CG+A ASYP+
Sbjct: 324 KNHCGVATAASYPL 337
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 139/299 (46%), Positives = 185/299 (61%), Gaps = 18/299 (6%)
Query: 72 RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRR----L 123
R +I+ ++ I +HN +YK+G+NK+ D+ + E+ G AK +
Sbjct: 47 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 106
Query: 124 MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
V ++ A +LPE VDWR+ GAV +KDQG CGSCW+FST A+EG + +G
Sbjct: 107 KGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSG 166
Query: 184 ELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
L+SLSEQ L+DC + N GCNGGLMD AF++I NGG+D+EQ YPY G ++KC + +
Sbjct: 167 YLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPK 226
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT-GECGSA-L 299
N + G+ D+ DE L +AVA PVSVAI+A +FQ Y SGV+ EC S L
Sbjct: 227 NTGAEDV-GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 285
Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
DHGV+ VGYGT E GVDYWLV+NSWG WGE GY+K+ RN +CGIA ASYP+
Sbjct: 286 DHGVLVVGYGTDEQGVDYWLVKNSWGRSWGELGYIKMIRN----KNNRCGIASSASYPL 340
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 137/307 (44%), Positives = 186/307 (60%), Gaps = 17/307 (5%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
W H K + G R+ I+KDN R I EHN + + +N+F D+TN E++A
Sbjct: 30 WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFKAF--- 86
Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
L V + P++VDWR +G V PVKDQG CGSCWAFST ++
Sbjct: 87 -----NGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSL 141
Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
EG + TG+L+SLSEQ LVDC N GC+GGLMD AF +I +N G+DSE YPY
Sbjct: 142 EGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTAE 201
Query: 234 ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT 292
+ KC ++++ + G+ D+ +E LK+AVA P+SVAI+A +FQ Y SGV+
Sbjct: 202 DGKC-VFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYN 260
Query: 293 -GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
C S LDHGV+ VGYGTE+G DYWLV+NSW + WG+ GY+K++RN + +CGIA
Sbjct: 261 EPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAKN----QCGIA 316
Query: 351 MEASYPV 357
+ASYP+
Sbjct: 317 TKASYPL 323
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 143/297 (48%), Positives = 184/297 (61%), Gaps = 18/297 (6%)
Query: 71 KRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
+R +IF++N + I+ HN+ TY +G N+FA +TN+E+ A +G R KS
Sbjct: 18 RRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGG-CLLDRNASKS 76
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
ELP++VDWR KG V PVK+Q CGSCWAFST ++EG TG+L+
Sbjct: 77 TADRVHQYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTGKLV 136
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRRN 243
SLSEQ LVDC + N GCNGGLMD AF++I NGG+D+E YPY + KC P+
Sbjct: 137 SLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRFKPADVG 196
Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LD 300
A V GY D+S DE +L +AVA P+SVAI+A FQ Y GV + +C S LD
Sbjct: 197 ATVT---GYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTELD 253
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
HGV+AVGYGTE G DYWLV+NSWG WG+NGY+ + RN +CGIA ASYP+
Sbjct: 254 HGVLAVGYGTEGGKDYWLVKNSWGEVWGQNGYIMMSRN----KNNQCGIATSASYPL 306
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 141/294 (47%), Positives = 184/294 (62%), Gaps = 19/294 (6%)
Query: 71 KRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
+RF IF+DNL I+E N +N + + +G+N+FAD+TN E+ M LG + ++
Sbjct: 48 RRF-IFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNMLLGL--GGRNKIAGDS 104
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
V + +LP VDW +KG V VK+QG CGSCWAFST ++EG TG+L+S
Sbjct: 105 VFESSHV----QDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEGQVFKKTGKLVS 160
Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ LVDC + N GCNGGLMD AF +I +NGG+D+E YPY G++ C N
Sbjct: 161 LSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDGTCR-FLENKVG 219
Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTG-ECGSA-LDHGV 303
++ G+ DV DE +LK+AVA P+SVAI+A FQ Y GV+ C S LDHGV
Sbjct: 220 ATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNPWFCSSTELDHGV 279
Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ VGYGTE G DYWLV+NSWGS WG GY+K+ RN +CGIA +ASYP
Sbjct: 280 LVVGYGTEGGKDYWLVKNSWGSSWGLKGYIKMVRN----KKNRCGIATQASYPT 329
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 191/320 (59%), Gaps = 15/320 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS----LNRTYKVGLNKFADLTNEE 107
+ T+ +H K R +IF DN I +HNS +YK+ +NK+ D+ + E
Sbjct: 28 WMTFKMEHKKAYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHE 87
Query: 108 YRAMYLGTRSDAKRRLMKSKVA-SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
+ + G +L ++ + A LP+ VDWR++GAV PVKDQG CGSCW
Sbjct: 88 FVNILNGFNKSINTQLRSERMPIGASFIEPANVALPKKVDWRKEGAVTPVKDQGHCGSCW 147
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
+FS A+EG + TG L+SLSEQ L+DC K N GCNGGLMD AFQ+I N G+D+E
Sbjct: 148 SFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTE 207
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY +KC + N+ + + GY D+ +E LK AVA PVSVAI+A ++FQ
Sbjct: 208 ASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQ 266
Query: 285 HYESGV-FTGECGS-ALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Y GV + EC S LDHGV+ +GYGT ENG DYWLV+NSWG WG NGY+K+ RN L+
Sbjct: 267 FYSEGVYYEPECSSEELDHGVLVIGYGTNENGEDYWLVKNSWGETWGNNGYIKMARNKLN 326
Query: 342 TNTGKCGIAMEASYPVKNSQ 361
CGIA ASYP+ S+
Sbjct: 327 ----HCGIASSASYPLVGSK 342
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 141/317 (44%), Positives = 193/317 (60%), Gaps = 25/317 (7%)
Query: 53 QTWLA---KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTN 105
+ WLA + GK+ R ++K+N R IDEHN +YK+ +N F DL
Sbjct: 24 EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
E++A+ + +R K + + + + G +LP VDWR+KGAV PVKD G CGSC
Sbjct: 84 HEFKAL------NKLKRSAKQQNSGEVFRATGG-KLPAKVDWRQKGAVTPVKDPGQCGSC 136
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
WAFS+ ++ G + +L+SLSEQ+LVDC N GC+GG+M AFQ+I NGG+D+
Sbjct: 137 WAFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDT 196
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
E YPY ++KC + V D GY D++ DE +LK+AVA+ P+SVAI+AG +
Sbjct: 197 EGSYPYEAEDDKC--RYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLS 254
Query: 283 FQHYESGVFTGE--CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y G++ + LDHGV+ VGYGTENG DYWLV+NSWG WGENGY+K+ RN
Sbjct: 255 FQFYSEGIYDEPFCSNTELDHGVLVVGYGTENGQDYWLVKNSWGPSWGENGYIKIARN-- 312
Query: 341 DTNTGKCGIAMEASYPV 357
+ CGIA ASYP+
Sbjct: 313 --HNNHCGIASMASYPI 327
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 147/320 (45%), Positives = 189/320 (59%), Gaps = 15/320 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS----LNRTYKVGLNKFADLTNEE 107
+ T+ +H K R +IF DN I +HN +YK+ +NK+ D+ + E
Sbjct: 28 WTTFKMEHNKVYKNDIEERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHE 87
Query: 108 YRAMYLGTRSDAKRRLMKSKVA-SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
+ G +L ++ + A LP++VDWRE GAV PVKDQG CGSCW
Sbjct: 88 FVNTLNGFNKSINTQLRSERLPIGASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCW 147
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
+FS A+EG + TG LI LSEQ L+DC K N GCNGGLMD AFQ+I N G+D+E
Sbjct: 148 SFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTE 207
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY +KC + N+ + GY D+ +E LK AVA PVSVAI+A ++FQ
Sbjct: 208 VTYPYEAENDKCRYNAANSGARDV-GYVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQ 266
Query: 285 HYESGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Y GV + EC S LDHGV+AVGYGT ENG DYWLV+NSWG WG+NGY+K+ RN L+
Sbjct: 267 FYSEGVYYEPECSSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDNGYIKMARNKLN 326
Query: 342 TNTGKCGIAMEASYPVKNSQ 361
CGIA ASYP+ SQ
Sbjct: 327 ----HCGIASTASYPLVGSQ 342
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 191/314 (60%), Gaps = 10/314 (3%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+Q W A++ +T ++RF ++ +N++FI+ N +Y++G N+FADLT EE++
Sbjct: 37 FQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENRFADLTEEEFKDT 96
Query: 112 YLGTRSD--AKRRLMKSKVASQRYACKAG----DELPESVDWREKGAVNPVKDQGSCGSC 165
YL + + M V + A +G +E P SVDWR KGAV PVK Q CGSC
Sbjct: 97 YLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQQHCGSC 156
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLM-DYAFQFIIQNGGMDS 224
WAF+ VA++EG++KI TG L+SLSEQE+VDCDR N G A +++ +NGG+ +
Sbjct: 157 WAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNGGLTT 216
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
E DYPY+G + +C + I G + V +E +L+ AVA +PV+V+I A RAFQ
Sbjct: 217 ESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINA-SRAFQ 275
Query: 285 HYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y+ G+F+G C + +H V VGYG +G YW+V+NSWG WGE GYV++QR +
Sbjct: 276 FYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQRG-VRAR 334
Query: 344 TGKCGIAMEASYPV 357
G CGIA+ Y V
Sbjct: 335 EGVCGIAIAPFYAV 348
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 189/314 (60%), Gaps = 15/314 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+ + A H K + R +I+ +N + +HN L ++Y+V +NKF DL + E
Sbjct: 31 WHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHE 90
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R++ G + K++ ++ + A E+PESVDWR KGA+ PVKDQG CGSCWA
Sbjct: 91 FRSIMNGYQH--KKQNSSRAESTFTFMEPANVEVPESVDWRVKGAITPVKDQGQCGSCWA 148
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS+ A+EG TG+LISLSEQ L+DC K N GCNGGLMD AFQ+I N G+D+E
Sbjct: 149 FSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTEN 208
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
YPY +N C + RN + G+ + +E LK AVA PVSVAI+A +FQ
Sbjct: 209 TYPYEAEDNVCRYNPRNRGAID-RGFVHIPSGEEDKLKAAVATVGPVSVAIDASHESFQF 267
Query: 286 YESGV-FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y GV + C S LDHGV+ VGYG++NG DYWLV+NSW WG+ GY+K+ RN
Sbjct: 268 YSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARN----R 323
Query: 344 TGKCGIAMEASYPV 357
CGIA ASYP+
Sbjct: 324 KNHCGIATAASYPL 337
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 201/327 (61%), Gaps = 26/327 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKF 100
D ++ ++ W HGK + +R +++ NL+ I+ EH+ TY++G+N+F
Sbjct: 22 DKQLDNHWEQWKNWHGKKYHEKEEGWRRM-VWEKNLQKIELHNLEHSMGTHTYRLGMNRF 80
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+T+EE+R + G + +RR S + E+P S+DWREKG V PVKDQG
Sbjct: 81 GDMTHEEFRQVMNGYKHKKERRFRGSLFMEPNFL-----EVPNSLDWREKGYVTPVKDQG 135
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
CGSCWAFST A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 136 ECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDQ 195
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
G+DSE+ YPY+G +++ P + K + + G+ D+ E +L KA+A PVSVAI
Sbjct: 196 NGLDSEESYPYVGTDDQ--PCHYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVSVAI 253
Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
+AG +FQ Y+SG+ + EC S LDHGV+AVGYG E +G YW+V+NSW +WG+
Sbjct: 254 DAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDK 313
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
GYV + ++ CGIA ASYP+
Sbjct: 314 GYVYMAKD----RHNHCGIATAASYPL 336
>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
Length = 334
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 195/317 (61%), Gaps = 20/317 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ +W K GK + +R + +N + + HN L ++Y++G+ FAD+ N+E
Sbjct: 26 FHSWKLKFGKIYKSVEEESQRKNTWLENRKLVLVHNMLADQGIKSYRLGMTYFADMDNQE 85
Query: 108 YR-AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
YR +++ G R K AS G LP++VDWR+KG V VKDQ +CGSCW
Sbjct: 86 YRQSVFKGCLGSFNR--TKGHRASTFLLQAGGAVLPDTVDWRDKGYVAEVKDQKNCGSCW 143
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
AFS ++EG TG+L+SLSEQ+LVDC K N GC GGLMD AF++I N G+D+E
Sbjct: 144 AFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGKYGNMGCGGGLMDLAFEYIEDNKGIDTE 203
Query: 226 QDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
+ YPY + C P+ A + GY D++ DE +L+KAVA+ P+SVAI+AG +
Sbjct: 204 ESYPYEATDGDCRFKPATVGA---TCTGYVDINSEDENALQKAVANIGPISVAIDAGHIS 260
Query: 283 FQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y SG++ C S LDHGV+AVGYGT+N DYWLV+NSWG DWG+ GY+K+ RN
Sbjct: 261 FQLYGSGIYNEPNCSSEDLDHGVLAVGYGTDNQQDYWLVKNSWGLDWGDQGYIKMTRN-- 318
Query: 341 DTNTGKCGIAMEASYPV 357
+CGIA ASYP+
Sbjct: 319 --KNNQCGIATAASYPL 333
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 134/295 (45%), Positives = 189/295 (64%), Gaps = 14/295 (4%)
Query: 71 KRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
+R ++F++NL+ I+ HN L+ +Y++G+N+FAD+ +E+ ++ G R + + ++ +
Sbjct: 62 QRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFASVVNGFRMNNRTKV-RD 120
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+ S + LP VDWR++G V P+KDQG CGSCW+FST A+EG + TG+L+
Sbjct: 121 HLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTGALEGQHFRKTGKLV 180
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQ L+DC N GCNGG+MDYAFQ+I N G D+E YPY A+ C +
Sbjct: 181 SLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDSYPYEAADGPCRFKKEYVG 240
Query: 246 VVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTG-EC-GSALDHG 302
GY D+ DE +K+AVA PVSVAI+A +FQ Y+SGV+ EC LDHG
Sbjct: 241 ATDT-GYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVYDEVECDPEGLDHG 299
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V+ VGYGTE G DYWLV+NSWG+ WG+ GY+K+ RN +CGI+ ASYP+
Sbjct: 300 VLVVGYGTELGQDYWLVKNSWGTKWGDEGYIKMSRN----KNNQCGISSMASYPL 350
>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 191/315 (60%), Gaps = 16/315 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W + G++ N +R +I+ N R + HN + ++Y++G+ FAD+ NEE
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 108 Y-RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
Y R + G L + A R G +LP SVDWREKG V VKDQ CGSCW
Sbjct: 86 YKRQISQGCLGSFNASLPRRGSAYLRLP--EGADLPNSVDWREKGYVTDVKDQKQCGSCW 143
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
AFST ++EG TG+L+SLSEQ+LVDC N GC GGLMD AF++I NGG+D+E
Sbjct: 144 AFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTE 203
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY + +C + N + GY DV DE +LK+A+A PVSVAI+A +FQ
Sbjct: 204 DSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEALATIGPVSVAIDASHSSFQ 262
Query: 285 HYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
YESGV+ EC S+ LDHGV+AVGYG++NG DYWLV+NSWG WG GY+ + RN
Sbjct: 263 LYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRN---- 318
Query: 343 NTGKCGIAMEASYPV 357
+CGIA +SYP+
Sbjct: 319 KHNQCGIATASSYPL 333
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 201/327 (61%), Gaps = 21/327 (6%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLN 98
R D ++ + +Q W + H K + + +R +++ NL+ I+ HN SL + +YK+G+N
Sbjct: 35 RVDPDLDSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMN 93
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
+F D+T EE+R + G + R K ++ + E P SVDWREKG V PVKD
Sbjct: 94 QFGDMTAEEFRQLMNGYKHKKSER----KYRGSQFLEPSFLEAPRSVDWREKGYVTPVKD 149
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
QG CGSCWAFST A+EG + TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++
Sbjct: 150 QGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQ 209
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAI 276
NGG+DSE+ YPY +++ + + G+ D+ E +L KAVA PVSVAI
Sbjct: 210 DNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAI 269
Query: 277 EAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
+AG +FQ Y+SG+ + +C S LDHGV+ VGYG E +G YW+V+NSWG WG+
Sbjct: 270 DAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDK 329
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
GY+ + ++ CGIA ASYP+
Sbjct: 330 GYIYMAKD----RKNHCGIATAASYPL 352
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 156/370 (42%), Positives = 216/370 (58%), Gaps = 35/370 (9%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHS-------SSWRTDDEVMTIYQTWLAK 58
MF +S LV L A+ + I + HDH+ S + DE ++ +
Sbjct: 1 MFRLLS-LVLL------CASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKLWDDYKES 53
Query: 59 HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLG 114
GK+ N N+ + F N+ IDEHN +R T+++GLN ADL +YR + G
Sbjct: 54 FGKSYNKDEEND-YMEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKLN-G 111
Query: 115 TRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
R RR + S ++ E+P+SVDWR+KG V VK+QG CGSCWAFS
Sbjct: 112 YR---HRRNFGDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATG 168
Query: 173 AVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
A+EG + +G+++SLSEQ LVDC K N GCNGGLMD AF++I N G+D+E+ YPY+
Sbjct: 169 ALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYV 228
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGV 290
G E KC +++ G+ D+ DE +LK AVA Q P+S+AI+AG R FQ Y+ GV
Sbjct: 229 GRETKCHFKKKDIGAED-KGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGV 287
Query: 291 FTG-ECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
+ EC S LDHGV+ VGYGT+ DYWL++NSWG WGE GY+++ RN + C
Sbjct: 288 YYDEECSSEELDHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARN----RSNHC 343
Query: 348 GIAMEASYPV 357
G+A +ASYP+
Sbjct: 344 GVATKASYPL 353
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 197/324 (60%), Gaps = 18/324 (5%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
D + +QT+ +H K R +IF +N I +HN L ++K+GLNK+A
Sbjct: 22 DVIKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYA 81
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKS--KVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
D+ + E+ G ++L S + +LP+SVDWR KGAV VKDQ
Sbjct: 82 DMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQ 141
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS+ A+EG + TG LISLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 142 GHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 201
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
NGG+D+E+ YPY G ++ C ++ + + D G+ D+ DE L +AVA PVSVAI
Sbjct: 202 NGGIDTEKSYPYEGIDDSCHFNK--GTIGATDRGFTDIPQGDEKKLAQAVATIGPVSVAI 259
Query: 277 EAGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
+A +FQ Y +GV+ +C LDHGV+ VGYGT ENG DYWLV+NSWG+ WG+ G++
Sbjct: 260 DASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDKGFI 319
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
K+ RN D N +CGIA +SYP+
Sbjct: 320 KMARN--DDN--QCGIATASSYPL 339
>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 148/320 (46%), Positives = 192/320 (60%), Gaps = 26/320 (8%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W K GK+ + R QI+ N + + HN L ++Y++G+ FAD+ NEE
Sbjct: 26 FHAWRLKFGKSYDSPSEESHRKQIWLTNRKHVLMHNILADQGFKSYRLGMTYFADMENEE 85
Query: 108 YRAMY----LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
Y+ + LG+ + + R S G +LP++VDWRE+G V VKDQ CG
Sbjct: 86 YKKLVSRGCLGSFNASLPRR-----GSTFLRLPEGIDLPDAVDWREQGYVTGVKDQKQCG 140
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
SCWAFS A+EG + TG L+SLSEQ+LVDC N GCNGG MD AF++I NGG+
Sbjct: 141 SCWAFSATGALEGQHFRKTGILVSLSEQQLVDCSGAYGNEGCNGGWMDSAFRYIEANGGI 200
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
D+E YPY + C R N V + GY DV+ +DE +LK+AVA PVSVAI+A
Sbjct: 201 DTEASYPYEAEDWLC---RYNPASVGATCSGYVDVNKYDEEALKEAVATIGPVSVAIDAS 257
Query: 280 GRAFQHYESGVF--TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
+FQ Y SGV+ G LDHGV+AVGYGTENG DYWLV+NSWG WGE GY+K+ R
Sbjct: 258 HASFQFYTSGVYDEPGCSSIELDHGVLAVGYGTENGHDYWLVKNSWGRGWGEMGYIKMSR 317
Query: 338 NLLDTNTGKCGIAMEASYPV 357
N +CGIA ASYP+
Sbjct: 318 N----KHNQCGIASAASYPL 333
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 143/296 (48%), Positives = 191/296 (64%), Gaps = 16/296 (5%)
Query: 72 RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKS 126
R +IF +N + +HN L ++K+G+NK+AD+ + E+ + G R+ + R +S
Sbjct: 47 RMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGES 106
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
S + A +LP +DWR+KGAV PVKDQG CGSCW+FS ++EG + +G+L+
Sbjct: 107 D-DSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLV 165
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQ LVDC K N GCNGGLMD AF++I NGG+D+EQ YPY + KC +N K
Sbjct: 166 SLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKN-K 224
Query: 246 VVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECG-SALDHG 302
+ GY D+ +E L+ AVA PVSVAI+A ++FQ Y GV + EC S LDHG
Sbjct: 225 GATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHG 284
Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V+ VGYGTE +G DYWLV+NSWG WG+ GY+K+ RN D N CGIA EASYP+
Sbjct: 285 VLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARN-RDNN---CGIATEASYPL 336
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 140/314 (44%), Positives = 189/314 (60%), Gaps = 15/314 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+ + A H K + R +I+ +N + +HN L ++Y V +NKF DL + E
Sbjct: 27 WHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAMNKFGDLLHHE 86
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R++ G + K++ ++ + A +PESVDWREKGA+ PVKDQG CGSCWA
Sbjct: 87 FRSIMNGYQH--KKQNSSRAESTFTFMEPANVTVPESVDWREKGAITPVKDQGQCGSCWA 144
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS+ A+EG TG+L+SLSEQ L+DC K N GCNGGLMD AFQ+I N G+D+E
Sbjct: 145 FSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTEN 204
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
YPY ++ C + RN V G+ D+ +E LK AVA PVSVAI+A +FQ
Sbjct: 205 TYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQF 263
Query: 286 YESGV-FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y GV + C S LDHGV+ VGYG++NG DYWLV+NSW WG+ GY+K+ RN
Sbjct: 264 YSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKMARN----R 319
Query: 344 TGKCGIAMEASYPV 357
CG+A ASYP+
Sbjct: 320 KNHCGVASAASYPL 333
>gi|3097321|dbj|BAA25899.1| Bd 30K [Glycine max]
gi|84371705|gb|ABC56139.1| 34 kDa maturing seed protein [Glycine max]
gi|195957142|gb|ACG59282.1| major allergen Gly m Bd 30K [Glycine max]
gi|223452512|gb|ACM89583.1| maturing seed protein [Glycine max]
gi|226432468|gb|ACO55749.1| Gly m Bd 30K allergen [Glycine max]
gi|320090153|gb|ADW08728.1| P34 allergen [Glycine max]
gi|320090155|gb|ADW08729.1| P34 allergen [Glycine max]
gi|320090157|gb|ADW08730.1| P34 allergen [Glycine max]
gi|320090159|gb|ADW08731.1| P34 allergen [Glycine max]
Length = 379
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 138/345 (40%), Positives = 210/345 (60%), Gaps = 28/345 (8%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNK 99
T +V +++Q W ++HG+ + KR +IFK+NL +I + N+ NR ++++GLNK
Sbjct: 36 TQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNLNYIRDMNA-NRKSPHSHRLGLNK 94
Query: 100 FADLTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
FAD+T +E+ YL D + ++ K+ ++Y+C D P S DWR+KG + VK
Sbjct: 95 FADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSC---DHPPASWDWRKKGVITQVK 151
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
QG CGS WAFS A+E + I TG+L+SLSEQELVDC + + GC G +F++++
Sbjct: 152 YQGGCGSGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGCYNGWHYQSFEWVL 210
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD-------EMSLKKAVADQ 270
++GG+ ++ DYPY E +C ++ K V+IDGYE + D E + A+ +Q
Sbjct: 211 EHGGIATDDDYPYRAKEGRCKANKIQDK-VTIDGYETLIMSDESTESETEQAFLSAILEQ 269
Query: 271 PVSVAIEAGGRAFQHYESGVFTGE-CGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
P+SV+I+A + F Y G++ GE C S ++H V+ VGYG+ +GVDYW+ +NSWG DW
Sbjct: 270 PISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAKNSWGEDW 327
Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN---SAKPKPH 369
GE+GY+ +QRN + G CG+ ASYP K SA+ K H
Sbjct: 328 GEDGYIWIQRNTGNL-LGVCGMNYFASYPTKEESETLVSARVKGH 371
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 139/330 (42%), Positives = 202/330 (61%), Gaps = 19/330 (5%)
Query: 40 SSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKV 95
S+ + + + ++++QTW K + E++ + +N I EHN SL ++Y++
Sbjct: 17 SAMQLNQQHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYSLKQKSYRL 76
Query: 96 GLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY----ACKAGDELPESVDWREKG 151
+N++ DLT+EE+ +M G R+D RL + Y + + +LP VDWR+ G
Sbjct: 77 EMNEYGDLTSEEFSSMMNGYRNDI--RLKRKSTGGSTYLNLLSFGSQIQLPTLVDWRKHG 134
Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMD 210
V PVK+QG CGSCW+FS ++EG +K TG+L+SLSEQ L+DC + N GCNGGLMD
Sbjct: 135 LVTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMD 194
Query: 211 YAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD- 269
AF++I GG+D+E YPY ++ C + ++ G+ D+ DE LK+A A
Sbjct: 195 QAFKYIKIQGGIDTEAYYPYEAKDDTCRFNITDSGATDT-GFVDIKSGDEEMLKEAAATV 253
Query: 270 QPVSVAIEAGGRAFQHYESGVF--TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
P+SVAI+A +FQ Y +GV+ T + LDHGV+ VGYGTENG DYWLV+NSWG W
Sbjct: 254 GPISVAIDASHTSFQFYSNGVYSETACSSTMLDHGVLVVGYGTENGKDYWLVKNSWGEGW 313
Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GE GY+K+ RN +CGIA +ASYP+
Sbjct: 314 GEAGYIKMSRNA----DNQCGIATQASYPL 339
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 199/311 (63%), Gaps = 18/311 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ W K+GKT + + R +I+ N +++EHNS++ ++++ +N+FADLT EE+ ++
Sbjct: 29 WRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSSFQLEVNEFADLTAEEFSSI 88
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
Y G + R RY G +P+SVDWR KG V PVK+Q CGSCWAFST
Sbjct: 89 YNG-YGKGRNRENHENTTIYRYT---GGAIPDSVDWRTKGLVTPVKNQKQCGSCWAFSTT 144
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
++EG + TG+L+SLSEQ LVDCD+K + GC GGLM AF++I +N G+D+E+ YPY
Sbjct: 145 GSLEGAHAKKTGKLVSLSEQNLVDCDKK-DHGCQGGLMTTAFKYIEENKGIDTEESYPYK 203
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV 290
+C+ +++ +++ + + D +LKKAVA+ P+SVA++A +FQ Y+SG+
Sbjct: 204 AKNGRCE-FKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAMDASHSSFQLYKSGI 262
Query: 291 FTGECGSA--LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL--QRNLLDTNTGK 346
+ + S+ LDHGV+ VGYG E+G +YWLV+NSWG +WG GY K+ ++NL
Sbjct: 263 YDPKICSSRKLDHGVLVVGYGKEDGEEYWLVKNSWGKNWGMEGYFKIASKKNL------- 315
Query: 347 CGIAMEASYPV 357
CGI A YPV
Sbjct: 316 CGICTSACYPV 326
>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 193/338 (57%), Gaps = 24/338 (7%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKFA 101
D +M + W+ ++ RF++++ N+R+I+ E + TY++G F
Sbjct: 54 DLMMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPFT 113
Query: 102 DLTNEEYRAMYLGTRSDAKRR---LMKSKVASQRYACKAGDE-----------LPESVDW 147
DLT+EE+ ++Y G D R + ++ + G E P +DW
Sbjct: 114 DLTDEEFISLYTGKIPDDDHREDGVHDEQIITTHAGSVNGAEGVTVYANFSAGAPIRMDW 173
Query: 148 REKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGG 207
R++GAV PVKDQG CGSCWAF TVA +EGI+KI G L+SLSEQ+LVDCD ++ GCNGG
Sbjct: 174 RKRGAVTPVKDQGKCGSCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCDF-LDGGCNGG 232
Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
AFQ+IIQNGG+ + Y Y AE +C +R+ A I GY V E+S+ V
Sbjct: 233 WPRNAFQWIIQNGGITTTSSYTYKAAEGQCKGNRKPA--AKITGYRKVKSNSEVSMVNIV 290
Query: 268 ADQPVSVAIEAGGRAFQHYESGVFTGECG-SALDHGVVAVGYGTEN-GVDYWLVRNSWGS 325
A+QP++ +I G FQHY+ G++ G C S L+H + VGYG + G YW+V+NSWG+
Sbjct: 291 ANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLNHVITIVGYGQQAYGAKYWIVKNSWGA 350
Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
WG GY+ ++R + G+CGIA+ +P+ N S
Sbjct: 351 AWGNKGYMLMKRGTKNP-LGQCGIAVRPIFPLMNGGRS 387
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 147/356 (41%), Positives = 211/356 (59%), Gaps = 19/356 (5%)
Query: 12 TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
TL+F+ + D+ S+ +N + D EV + + +H K G+
Sbjct: 3 TLIFVTLFCCVLSKDLHWESHRDNLYSNFQEVLDAEVA--WHKFKLEHNKVYVGIEEESL 60
Query: 72 RFQIFKDNLRFIDEHNSLNRT----YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R IF N +FI +HN+L+ T + VG+N+FAD+T E+ M G + D+ R
Sbjct: 61 RKTIFATNYKFIKDHNALHATGEKSFTVGVNEFADMTVHEFAQMMNGLKPDSTR------ 114
Query: 128 VASQRYACKAGD-ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
V+ Y D LP VDWR KG V+ VK+QGSCGSCWAFST ++EG + TG ++
Sbjct: 115 VSGSTYLSPNIDAPLPVEVDWRTKGLVSEVKNQGSCGSCWAFSTTGSLEGQHMRKTGTMV 174
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
LSEQ LVDC N GCNGGLM AF++I N G+D+E+ YPY G + C ++N
Sbjct: 175 DLSEQNLVDCSTSYGNDGCNGGLMTNAFKYIKDNKGIDTEEAYPYAGRDGDCK-FKKNKV 233
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVF-TGECGSA-LDHG 302
++ G+ ++ +E L++A+A PVSVAI+A ++F Y+SGV+ EC SA LDHG
Sbjct: 234 GATVTGFVEIPAGNEKKLQEALATVGPVSVAIDANHQSFMLYKSGVYDEPECDSAQLDHG 293
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL-DTNTGKCGIAMEASYPV 357
V+AVGYG+ +G DY++V+NSWG+ WGE GY++ + D G CGI ++ASYPV
Sbjct: 294 VLAVGYGSIHGKDYYIVKNSWGTTWGEQGYIRFSTTAVPDAIGGICGILLDASYPV 349
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 143/329 (43%), Positives = 201/329 (61%), Gaps = 23/329 (6%)
Query: 42 WRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGL 97
W+ D E+ +Q W + H K + +R +++ NL+ I+ HN +L + +YK+G+
Sbjct: 124 WQVDPELDGHWQLWKSWHRKDYHEREEGWRRV-VWEKNLKMIEIHNLDHALGKHSYKLGM 182
Query: 98 NKFADLTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
N+F D+T EE+R + G ++R+ S+ + E P SVDWREKG V PV
Sbjct: 183 NQFGDMTTEEFRQLMNGYVHKKSERKYRGSQFLEPNFL-----EAPRSVDWREKGYVTPV 237
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
KDQG CGSCWAFST A+EG + TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+
Sbjct: 238 KDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQY 297
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
+ NGG+DSE+ YPY +++ + + G+ D+ E +L KAVA PVSV
Sbjct: 298 VQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSV 357
Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
AI+AG +FQ Y+SG+ + +C S LDHGV+ VGYG E +G YW+V+NSWG WG
Sbjct: 358 AIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWG 417
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ GY+ + ++ CGIA ASYP+
Sbjct: 418 DKGYIYMAKD----RKNHCGIATAASYPL 442
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 141/321 (43%), Positives = 193/321 (60%), Gaps = 16/321 (4%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADL 103
V+ ++ + +H K + R +IF +N I HN + TYK+ +NK+ D+
Sbjct: 25 VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGS 161
+ E+ + G R + ++ + + D +LP++VDWR KGAV P+KDQG
Sbjct: 85 LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQ 144
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNG 220
CGSCWAFS A+EG TG+L+SLSEQ LVDC RK N GCNGGLMD AF+++ +NG
Sbjct: 145 CGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENG 204
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
G+D+E+ YPY + KC + R A G+ DV E +LKKAVA PVSVAI+A
Sbjct: 205 GIDTEESYPYDAEDEKCHYNPRAAGAED-KGFVDVREGSEHALKKAVATVGPVSVAIDAS 263
Query: 280 GRAFQHYESGVFT-GECG-SALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQ 336
+FQ Y GV+ EC LDHGV+ VGYG ++G DYWLV+NSWG+ WG+ GYVK+
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMA 323
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
RN +CGIA AS+P+
Sbjct: 324 RN----RDNQCGIASSASFPL 340
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 142/309 (45%), Positives = 192/309 (62%), Gaps = 20/309 (6%)
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMY 112
AKH KT +G +R+ I++ NL+ I+ HN L TY +G NK+AD+TNEE+R
Sbjct: 27 AKHNKTYSGDEDIIRRY-IWQTNLQKIEAHNELYAKGLSTYFLGENKYADMTNEEFRRTL 85
Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
G R D + L S + D LP +VDWR++G V VKDQG CGSCWAFST
Sbjct: 86 SGLRVD--KELTPGDFVSGMFK----DSLPTAVDWRKEGYVTEVKDQGQCGSCWAFSTTG 139
Query: 173 AVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
++EG + T +L+SLSE LVDC +K N GCNGGLMD AF++I N G+D+E+ YPY
Sbjct: 140 SLEGQHFKATKQLVSLSESNLVDCSKKWGNQGCNGGLMDNAFKYIADNKGIDTEKSYPYK 199
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV 290
+ KC+ + N Y+D++ E +L++AVA P+SVAI+A +FQ Y GV
Sbjct: 200 PEDRKCNFKKANVGATD-KLYKDITSGSEDALQEAVATIGPISVAIDASHDSFQLYSGGV 258
Query: 291 FTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
+ + S LDHGV+AVGY ++NG DYW+V+NSWG WG +GY+ + RN +CG
Sbjct: 259 YNEKACSTKTLDHGVLAVGYDSKNGDDYWIVKNSWGKSWGIDGYIWMSRN----KKNQCG 314
Query: 349 IAMEASYPV 357
IA ASYPV
Sbjct: 315 IATMASYPV 323
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 142/296 (47%), Positives = 191/296 (64%), Gaps = 16/296 (5%)
Query: 72 RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKS 126
R +IF +N + +HN L ++K+G+NK+AD+ + E+ + G R+ + R +S
Sbjct: 47 RMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGES 106
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
S + A +LP +DWR+KGAV PVKDQG CGSCW+FS ++EG + +G+L+
Sbjct: 107 D-DSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLV 165
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQ LVDC K N GCNGGLMD AF++I NGG+D+EQ YPY + KC +N K
Sbjct: 166 SLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKN-K 224
Query: 246 VVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGEC-GSALDHG 302
+ GY D+ +E L+ AVA PVSVAI+A ++FQ Y GV + +C S LDHG
Sbjct: 225 GATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHG 284
Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V+ VGYGTE +G DYWLV+NSWG WG+ GY+K+ RN D N CGIA EASYP+
Sbjct: 285 VLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARN-RDNN---CGIATEASYPL 336
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 198/323 (61%), Gaps = 21/323 (6%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFA 101
DE ++ + GK+ N N+ + F N+ IDEHN +R T+++GLN A
Sbjct: 41 DEAFKLWDDYKEAFGKSYNKDEEND-YMEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIA 99
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQ 159
DL +YR + G R RR + S ++ E+P+SVDWR+KG V VK+Q
Sbjct: 100 DLPFSQYRKLN-GYRH---RRNFGDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQ 155
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS A+EG + +G+++SLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 156 GMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKD 215
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
N G+D+E+ YPY+G E KC +++ G+ D+ DE +LK AVA Q P+S+AI+
Sbjct: 216 NHGIDTEESYPYVGRETKCHFKKKDIGAED-KGFVDLPEGDEEALKVAVATQGPISIAID 274
Query: 278 AGGRAFQHYESGVFTG-ECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVK 334
AG R FQ Y+ GV+ EC S LDHGV+ VGYGT+ DYWL++NSWG WGE GY++
Sbjct: 275 AGHRTFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIR 334
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
+ RN + CG+A +ASYP+
Sbjct: 335 IARN----RSNHCGVATKASYPL 353
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 149/353 (42%), Positives = 207/353 (58%), Gaps = 29/353 (8%)
Query: 15 FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
FL F++ A ++ +D + +++ MT H K R +
Sbjct: 3 FLIFLAICVAGSQAVSFFDLVQEQWGAFK-----MT--------HNKQYQSETEERFRMK 49
Query: 75 IFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKSKVA 129
IF +N + +HN L ++K+G+NK+AD+ + E+ + G R+ + R +S
Sbjct: 50 IFMENSHTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESD-D 108
Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
S + A +LP +DWR+KGAV PVKDQG CGSCW+FS ++EG + +G+L+SLS
Sbjct: 109 SVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLS 168
Query: 190 EQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
EQ LVDC K N GCNGGLMD AF++I NGG+D+EQ YPY + KC +N K +
Sbjct: 169 EQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKN-KGAT 227
Query: 249 IDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGEC-GSALDHGVVA 305
GY D+ +E L+ AVA PVSVAI+A ++FQ Y GV + +C S LDHGV+
Sbjct: 228 DRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLV 287
Query: 306 VGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
VGYGTE +G DYWLV+NSWG WG+ GY+K+ RN CGIA EASYP+
Sbjct: 288 VGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARN----RNNNCGIATEASYPL 336
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 148/328 (45%), Positives = 197/328 (60%), Gaps = 22/328 (6%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT----YKVGLNKFA 101
D VM +Q + A+H K N + R +IF DN + I +HN+ + YK+GLNK++
Sbjct: 21 DLVMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRGEVGYKLGLNKYS 80
Query: 102 DLTNEEYRAMYLG-TRSDAKRRLM----KSKVASQRYACKAGDELPESVDWREKGAVNPV 156
D+ + E+ + G +S L K+ + + A +LP+ VDW + GAV PV
Sbjct: 81 DMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPANVKLPKHVDWVKLGAVTPV 140
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQF 215
KDQG CGSCWAFS A+EG++ T L+SLSEQ L+DC + N GCNGGLMD AFQ+
Sbjct: 141 KDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQAFQY 200
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSV 274
+ NGG+D+E+ YPY G + C N+ + GY DV DE +LK AVA PVSV
Sbjct: 201 VRINGGIDTERSYPYEGNNDVCRYEPENSGAIDT-GYTDVPLGDEDALKSAVATVGPVSV 259
Query: 275 AIEAGGRAFQHYESGV-FTGECGS---ALDHGVVAVGYGT--ENGVDYWLVRNSWGSDWG 328
AI+A +FQ Y SGV F C + +LDHGV+ VGYGT E DYWLV+NSWG WG
Sbjct: 260 AIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDEETQQDYWLVKNSWGDSWG 319
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
ENGY+K+ RN +CGIA + S+P
Sbjct: 320 ENGYIKMARNA----DNQCGIATQPSFP 343
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 143/324 (44%), Positives = 192/324 (59%), Gaps = 25/324 (7%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
D V++ +++W H K + + R +IF +N I HN+ TY + +N +
Sbjct: 23 DVVLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRISRHNAEAIQGRHTYFMKMNHYG 82
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DL + E+ AM G + K L + + S+ LPE VDWRE+GAV PVK+QG
Sbjct: 83 DLLHHEFVAMVNGYIYNNKTTLGGTFIPSKNI------NLPEHVDWREEGAVTPVKNQGQ 136
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNG 220
CGSCW+FS ++EG + TG+LISLSEQ LVDC RK N GC GGLMDYAF++I N
Sbjct: 137 CGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYGNNGCEGGLMDYAFKYIQDNN 196
Query: 221 GMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
G+D+E YPY G + C DP + + G+ D+ E L+KA+A P+SVAI+
Sbjct: 197 GIDTEASYPYEGIDGHCHYDPKNKGGSDI---GFVDIKKGSEKDLQKALATVGPISVAID 253
Query: 278 AGGRAFQHYESGVFTGECGSA--LDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYV 333
A +FQ Y GV++ + S LDHGV+AVGYGT+ G DYWLV+NSW WGE+GY+
Sbjct: 254 ASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGTDEVTGEDYWLVKNSWSEKWGEDGYI 313
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
K+ RN CGIA ASYPV
Sbjct: 314 KMARN----KDNMCGIASSASYPV 333
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 139/352 (39%), Positives = 206/352 (58%), Gaps = 22/352 (6%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVFLF A S S D D +M ++ W+A++G+ +R
Sbjct: 7 LVFLFLFLCVMWASPSAASRD---------EPSDPMMKRFEEWMAEYGRVYKDNDEKMRR 57
Query: 73 FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N+ I+ NS N +Y +G+N+F D+TN E+ A Y G + + V S
Sbjct: 58 FQIFKNNVNHIETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGV--SLPLNIEREPVVS- 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ +P+S+DWR GAV VK+ CGSCWAF+ +A VE I KI G LISLSEQ
Sbjct: 115 -FDDVDISAVPQSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQ 173
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS--I 249
+++DC ++ GC+GG ++ A+ FII N G+ S YPY ++ + R N S I
Sbjct: 174 QVLDC--AVSYGCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQ-GTCRINGVPNSAYI 230
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
GY V +E S+ AV++QP++ +IEA G FQHY+ GVF+G CG++L+H + +GYG
Sbjct: 231 TGYTRVQSNNERSMMYAVSNQPIAASIEASGD-FQHYKRGVFSGPCGTSLNHAITIIGYG 289
Query: 310 TE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
+ +G +W+VRNSWG+ WGE GY+++ R+ + +++G CGIA+ YP S
Sbjct: 290 QDSSGKKFWIVRNSWGASWGERGYIRMARD-VSSSSGLCGIAIRPLYPTLQS 340
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 191/322 (59%), Gaps = 20/322 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--------TYKVGLNKFADL 103
+++W+A+HG+T +R +IF+ N ID NS ++++ N+FADL
Sbjct: 43 HESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADL 102
Query: 104 TNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
T+EE+RA G R A + + ++ +A + S+DWR GAV VKDQGSC
Sbjct: 103 TDEEFRAARTGLRRPAAVAGAVGGGFRYENFSLQA--DAAGSMDWRAMGAVTGVKDQGSC 160
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
G CWAFS VAA+EG+ KI TG L+SLSEQ+LVDCD + GC GGLMD AFQ+I + GG
Sbjct: 161 GCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQGG 220
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ SE YPY G + S R SI G+EDV +E +L AVA QPVSVAI G
Sbjct: 221 LASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVAINGGDY 280
Query: 282 AFQHYE----SGVFTGECGSA-LDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKL 335
F+ Y+ G C S LDH + AVGYG +G YWL++NSWGS WGE+GYV++
Sbjct: 281 VFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESGYVRI 340
Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
+R G CG+A ASYPV
Sbjct: 341 RRG--SRGEGVCGLAKLASYPV 360
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 200/322 (62%), Gaps = 22/322 (6%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHN----SLNRTYKVGLN 98
D + + + W A+H +T NE R ++ NL+ I+ HN + ++++G+N
Sbjct: 22 DQTLDSQWHQWKAQHRRT---YAANEDGWRRATWEKNLKMIEMHNLEYSAGKHSFQLGMN 78
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
KF D+T EE++ + G S+ ++ K + + +LP+SVDWREKG V PVK+
Sbjct: 79 KFGDMTTEEFKQVMNGYNSNGSQKRTKGSLYREPLLA----QLPKSVDWREKGYVTPVKN 134
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFII 217
QG CGSCWAFS ++EG T +L+SLSEQ LVDC + N GC+GGLMD AF+++
Sbjct: 135 QGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGNNGCSGGLMDNAFEYVK 194
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAI 276
NGG+D+EQ YPYLG +N+C R ++ G+ D+ +E +L KAVA+ P+SVAI
Sbjct: 195 NNGGIDTEQAYPYLGQDNECK-YRAECSGANVTGFVDIPSMNERALMKAVANVGPISVAI 253
Query: 277 EAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
+AG +FQ YESGV + +C S+ LDHGV+ VGYG+ +YW+V+NSWG +WG+ GYV
Sbjct: 254 DAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGKDEYWIVKNSWGEEWGKKGYVL 313
Query: 335 LQRNLLDTNTGKCGIAMEASYP 356
+ + CGIA ASYP
Sbjct: 314 MAK----FRNNHCGIATAASYP 331
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 143/328 (43%), Positives = 201/328 (61%), Gaps = 23/328 (7%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLN 98
R D E+ +Q W + H K + + +R +++ NL+ I+ HN +L + +YK+G+N
Sbjct: 1 RADPELDGHWQLWKSWHNKDYHEREESWRRV-VWEKNLKMIELHNLDHTLGKHSYKLGMN 59
Query: 99 KFADLTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
+F D+T EE+R + G ++R+ S+ + E P SVDWREKG V PVK
Sbjct: 60 QFGDMTTEEFRQLMNGYAHKKSERKYRGSQFLEPSFL-----EAPRSVDWREKGYVTPVK 114
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFI 216
DQG CGSCWAFST A+EG + TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++
Sbjct: 115 DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYV 174
Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVA 275
NGG+DSE+ YPY +++ + + G+ D+ E +L KAVA PVSVA
Sbjct: 175 QDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVA 234
Query: 276 IEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
I+AG +FQ Y+SG+ + +C S LDHGV+ VGYG E +G YW+V+NSWG WG+
Sbjct: 235 IDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGD 294
Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GY+ + ++ CGIA ASYP+
Sbjct: 295 KGYIYMAKD----RKNHCGIATAASYPL 318
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 194/322 (60%), Gaps = 16/322 (4%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFA 101
D V + + H K R +IF +N + +HN L ++K+G+NK++
Sbjct: 21 DLVQEQWGAFKVTHKKQYESETEERFRMKIFMENAHKVAKHNKLYAQGLVSFKLGVNKYS 80
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKV-ASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+ N E+ L + +K L ++ S + A ELP+ +DWR+ GAV PVKDQG
Sbjct: 81 DMLNHEF-VHTLNGYNRSKTPLRSGELDESITFIPPANVELPKQIDWRKLGAVTPVKDQG 139
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
CGSCW+FST ++EG + + +L+SLSEQ L+DC K N GCNGGLMD AF++I N
Sbjct: 140 QCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCSEKYGNNGCNGGLMDNAFRYIKDN 199
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
GG+D+EQ YPY + KC RN K + G+ D+ DE LK AVA P+SVAI+A
Sbjct: 200 GGIDTEQSYPYKAEDEKCHYKPRN-KGATDRGFVDIESGDEEKLKAAVATVGPISVAIDA 258
Query: 279 GGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKL 335
FQ Y GV + EC S LDHGV+ VGYGT E+G DYWLV+NSWG WG+ GY+K+
Sbjct: 259 SHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDEDGNDYWLVKNSWGDSWGDQGYIKM 318
Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
RN D N CGIA +ASYP+
Sbjct: 319 ARN-RDNN---CGIATQASYPL 336
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 193/321 (60%), Gaps = 29/321 (9%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKFADLTNE 106
+++ W +KH S R +++ NL+ I+ EH +Y++G+N F D+TNE
Sbjct: 32 LWKNWHSKHYHESE----EGWRRMVWEKNLKKIEIHNLEHTMGKHSYRLGMNHFGDMTNE 87
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
E+R G + +R+ S Y + P++VDWREKG V PVKDQGSCGSCW
Sbjct: 88 EFRQTMNGYKQTTERKFKGSLFMEPNYL-----QAPKAVDWREKGYVTPVKDQGSCGSCW 142
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSE 225
AFST A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I N G+D+E
Sbjct: 143 AFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTE 202
Query: 226 QDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
+ YPY+G + DP + + + G+ D+ E ++ KAVA PVSVAI+AG +
Sbjct: 203 ESYPYVGTDE--DPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHES 260
Query: 283 FQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQ 336
FQ YESG+ + EC S LDHGV+ VGYG E +G YW+V+NSW WG+ GY+ +
Sbjct: 261 FQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMA 320
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
++ CGIA +SYP+
Sbjct: 321 KD----RKNHCGIATASSYPL 337
>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 330
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 134/294 (45%), Positives = 185/294 (62%), Gaps = 16/294 (5%)
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
+ R+ FK+NL I + NS + +G+N ADL+NEEYR +YLG + DA R + + A
Sbjct: 46 QDRYNAFKNNLDLIHKWNSQGHSTVLGVNHLADLSNEEYRNLYLGVKVDASR--LPQQAA 103
Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
S + K + S+DWR GAV VKDQG CGSCW+FST ++EG N+I TG SLS
Sbjct: 104 SIKLN-KVFAPVAASLDWRSSGAVGRVKDQGQCGSCWSFSTTGSIEGANQIATGNFASLS 162
Query: 190 EQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN---KCDPSRRNAK 245
EQ+L+DC R N GCNGGLMD A +++I GG+D+E+ YPY +++ K +P+ AK
Sbjct: 163 EQQLMDCSRDYGNEGCNGGLMDAAMKYVIAQGGLDTEESYPYTMSDSYTCKFNPANIGAK 222
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA--LDHGV 303
+ S Y DV E L + PVSVAI+A +FQ Y+SGV+ S+ LDHGV
Sbjct: 223 ISS---YIDVQRGSETDLAAKLNKGPVSVAIDASHSSFQLYKSGVYYEPACSSYNLDHGV 279
Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+AVGYGTE +YW+V+NSWG +WG +GY+ + ++ + CGI+ AS PV
Sbjct: 280 LAVGYGTEGSSNYWIVKNSWGPNWGLSGYIWMAKD----KSNHCGISSMASIPV 329
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 193/323 (59%), Gaps = 16/323 (4%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
D + + T+ +H KT R +IF +N I +HN T+K+ +NK+A
Sbjct: 21 DVIKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYA 80
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKS--KVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
D+ + E+R G + L S + A +LP+SVDWREKGAV VKDQ
Sbjct: 81 DMLHHEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQ 140
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS+ A+EG + TG L+SLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKD 200
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
NGG+D+E+ YPY G ++ C ++ + G+ D+ +E + +AVA PVSVAI+
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNKDSVGATD-RGFADIPQGNEKKMAEAVATIGPVSVAID 259
Query: 278 AGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVK 334
A +FQ Y G++ EC S LDHGV+ VGYGT E+G DYWLV+NSWG+ WG+ G++K
Sbjct: 260 ASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGFIK 319
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
+ RN +CGIA +SYP+
Sbjct: 320 MARN----EDNQCGIASASSYPL 338
>gi|414591039|tpg|DAA41610.1| TPA: hypothetical protein ZEAMMB73_356414 [Zea mays]
Length = 376
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 150/335 (44%), Positives = 201/335 (60%), Gaps = 27/335 (8%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
+++ + ++Y+ W + H S + + RF+ FK N R I E N + YK+GLNKFAD
Sbjct: 37 SEESMWSLYERWRSVH-TVSRDLREKQSRFEAFKANARHIGEFNKRKDVPYKLGLNKFAD 95
Query: 103 LTNEEYRAMYLGTR---SDAKRRLMK------SKVASQRYACKAGDELPESVDWREKGAV 153
LT EE+ + Y G + S+A RL S + + A GD P++ DWR+ GAV
Sbjct: 96 LTQEEFVSKYTGAKVVDSEAAARLASGVRVSSSDESPPQLAASVGDA-PDAWDWRDHGAV 154
Query: 154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAG-CN-GGLMDY 211
VKDQG CGSCWAFS V AVE +N IVTG L++LSEQ+++DC AG C GG Y
Sbjct: 155 TAVKDQGQCGSCWAFSAVGAVESVNAIVTGNLLTLSEQQMLDCS---GAGDCTYGGYTYY 211
Query: 212 AFQFIIQNG-GMDSEQDYPYLGAENKCD--PSRRNAK---VVSIDGYEDVSPFDEMSLKK 265
A + I NG +D PY + P R +AK VV ID ++ DE +LK+
Sbjct: 212 AMLYAISNGLTLDQCGKTPYYQRYDAQQHLPCRFDAKKPPVVKIDSMYVMNNADEAALKR 271
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWG 324
AV QPVSV I+AGG +Y GVFTG CG++L+H V+ VGYG T +G YW+V+NSWG
Sbjct: 272 AVYKQPVSVLIDAGG--IGYYSEGVFTGPCGTSLNHAVLLVGYGATADGTKYWIVKNSWG 329
Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
+DWGE GY +L+R+ + T G CGI M YP+KN
Sbjct: 330 ADWGEKGYFRLKRD-VGTQGGLCGITMYPIYPIKN 363
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 195/329 (59%), Gaps = 26/329 (7%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLN 98
R D ++ + W H K+ + +R +++ NL+ I+ EH +Y++G+N
Sbjct: 21 RFDSQLEDHWHLWKNWHSKSYHESEEGWRRM-VWEKNLKKIEMHNLEHTMGKHSYRLGMN 79
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
F D+TNEE+R G + +R+ S Y + P++VDWREKG V PVKD
Sbjct: 80 HFGDMTNEEFRQTMNGYKQTTERKFKGSLFMEPNYL-----QAPKAVDWREKGYVTPVKD 134
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
QGSCGSCWAFST A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 135 QGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQ 194
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSV 274
N G+D+E+ YPY+G + DP + + G+ D+ E ++ KAVA PVSV
Sbjct: 195 DNAGLDTEESYPYVGTDE--DPCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVSV 252
Query: 275 AIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
AI+AG +FQ YESG+ + EC S LDHGV+ VGYG E +G YW+V+NSW WG
Sbjct: 253 AIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWG 312
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ GY+ + ++ CGIA +SYP+
Sbjct: 313 DKGYIYMAKD----RKNHCGIATASSYPL 337
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 126/291 (43%), Positives = 182/291 (62%), Gaps = 9/291 (3%)
Query: 71 KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
KR+ IFK+NL +I HN +Y + +NKF DLT EE+R YLG + R +
Sbjct: 108 KRYAIFKNNLTYIHNHNMQGYSYVLKMNKFGDLTLEEFRQRYLGYKKPDLR--TPPREVD 165
Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
+++P VDWR++G V VKDQG CGSCWAFS A+EG+ TG+L++LS+
Sbjct: 166 TTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQ 225
Query: 191 QELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
Q+LVDC R + N GC+GG M+ AF+++++NGG+ S ++YPY+ + C S+ + V +I
Sbjct: 226 QQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYMRKDGVCKSSQCTS-VATI 284
Query: 250 DGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
GY V E S+K A+A PVSVAI+A AFQ Y G+F CG+ LDHGV+ VGY
Sbjct: 285 TGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGIFDAPCGTNLDHGVLLVGY 344
Query: 309 GTENG--VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
E DYW+++NSWG+ WG+ GY+ + + G+CG+ ++ S+PV
Sbjct: 345 SAETAGQGDYWIMKNSWGAAWGKGGYMLMA--MHKGPAGQCGVLLDGSFPV 393
>gi|161172356|pdb|3BCN|A Chain A, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
gi|161172357|pdb|3BCN|B Chain B, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
Length = 209
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 125/218 (57%), Positives = 149/218 (68%), Gaps = 11/218 (5%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LPE VDWR KGAV P+K+QG CGSCWAFSTV VE IN+I TG LISLSEQ+LVDC +K
Sbjct: 1 LPEHVDWRAKGAVIPLKNQGKCGSCWAFSTVTTVESINQIRTGNLISLSEQQLVDCSKK- 59
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
N GC GG D A+Q+II NGG+D+E +YPY + P R KVV IDG + V +E
Sbjct: 60 NHGCKGGYFDRAYQYIIANGGIDTEANYPYKAFQG---PCRAAKKVVRIDGCKGVPQCNE 116
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
+LK AVA QP VAI+A + FQHY+ G+FTG CG+ L+HGVV VGYG DYW+VR
Sbjct: 117 NALKNAVASQPSVVAIDASSKQFQHYKGGIFTGPCGTKLNHGVVIVGYGK----DYWIVR 172
Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NSWG WGE GY +++R G CGIA YP K
Sbjct: 173 NSWGRHWGEQGYTRMKR---VGGCGLCGIARLPFYPTK 207
>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
Length = 352
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 140/296 (47%), Positives = 187/296 (63%), Gaps = 20/296 (6%)
Query: 73 FQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+ F N+ I+EHN +R T+++GLN ADL EYR + G R RRL +
Sbjct: 65 MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRKLN-GYRH---RRLFGDSM 120
Query: 129 ASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
++ ++P+SVDWRE V PVK+QG CGSCWAFS A+EG + TG+L+
Sbjct: 121 RKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLV 180
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQ LVDC K N GCNGGLMD AF++I N G+D+E+ YPY+G E +C +R+
Sbjct: 181 SLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIG 240
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHG 302
G+ D+ DE +LK AVA Q P+S+AI+AG R+FQ Y+ GV F EC S LDHG
Sbjct: 241 AED-RGFVDLPEGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHG 299
Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V+ VGYGT+ DYW+++NSWG+ WGE GYV++ RN CG+A +ASYP+
Sbjct: 300 VLLVGYGTDPEAGDYWIIKNSWGTKWGEKGYVRIARN----RNNHCGVATKASYPL 351
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 186/302 (61%), Gaps = 32/302 (10%)
Query: 72 RFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE-------YRAMYLGTRSDAK 120
R ++ NL+ + EHN TY +G+NK+AD+T E Y A G R+ +
Sbjct: 47 RRATWEGNLQKVQEHNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDR 106
Query: 121 RRL-MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
SK+A LP++VDWR+KG V VKDQG CGSCWAFST A+EG +
Sbjct: 107 HTFSFNSKIA-----------LPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHF 155
Query: 180 IVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
TG+L+SLSEQ LVDC ++ N GCNGGLMD AF++I +N G+D+E YPY +N+C
Sbjct: 156 KQTGKLVSLSEQNLVDCSGKQGNMGCNGGLMDQAFEYIKENNGIDTEDSYPYEAVDNQCR 215
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGS 297
N G+ D++ DE +L++AVA P+SVAI+AG +FQ Y+ GV+ S
Sbjct: 216 FKAANVGATDT-GFTDITSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCS 274
Query: 298 A--LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
LDHGV+AVGYGT++G DYWLV+NSWG WG+ GY+K+ RN +CGIA ASY
Sbjct: 275 QTRLDHGVLAVGYGTDSGKDYWLVKNSWGEGWGDKGYIKMTRN----KRNQCGIATAASY 330
Query: 356 PV 357
P+
Sbjct: 331 PL 332
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 136/308 (44%), Positives = 193/308 (62%), Gaps = 19/308 (6%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
++QT+ AK+GK E R ++ N+ +I++ NS ++ +G+ FAD+TN E+
Sbjct: 26 LFQTFEAKYGKNYLS-SEREYRKKVLAYNMDWIEKFNSDEHSFTLGMTPFADMTNTEF-- 82
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ +K K + + A + ES+DWREKGAV PVK+QGSCGSCWAFS
Sbjct: 83 ------ATSKLCGCMKKPLNHKQARVLNNMAVESIDWREKGAVTPVKNQGSCGSCWAFSA 136
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
A+EG N + TG+L+SLSEQ+LVDCD + +AGC GG MD AF+++++ G+ +E+DYPY
Sbjct: 137 TGALEGGNFVATGKLVSLSEQQLVDCDTE-DAGCGGGFMDTAFEYVMKK-GLCTEEDYPY 194
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+ C + + V+SI GYEDV D ++LK+A+ PVSVAI+A FQ Y GV
Sbjct: 195 HAKDEDCKDDQCTS-VISITGYEDVPANDGVALKQALTKAPVSVAIQADSFVFQMYTGGV 253
Query: 291 FTGE-CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
+ CG++L+HGV+AVGY E Y +V+NSWG+ WG+ GYVK+ D G CGI
Sbjct: 254 LDSDMCGTSLNHGVLAVGYAKE----YIIVKNSWGASWGDKGYVKIAHR--DQGEGICGI 307
Query: 350 AMEASYPV 357
M ASYP
Sbjct: 308 NMAASYPT 315
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 150/368 (40%), Positives = 209/368 (56%), Gaps = 33/368 (8%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M L + L F + S +S+ N + +S ++EV ++Q W +H +
Sbjct: 2 MSLQRTKLFPFFIVLVSFTCSLSLAMSSNQLEQFAS---EEEVFQLFQAWQKEHKREYGN 58
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRT----YKVGLNKFADLTNEEYRAMYLG------T 115
KRFQIF+ NLR+I+E N+ ++ +++GLNKFAD++ EE+ YL +
Sbjct: 59 QEEKAKRFQIFQSNLRYINEMNAKRKSPTTQHRLGLNKFADMSPEEFMKTYLKEIEMPYS 118
Query: 116 RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
+++++L K A C D LP SVDWR+KGAV V+DQG C S WAFS A+E
Sbjct: 119 NLESRKKLQKGDDAD----C---DNLPHSVDWRDKGAVTEVRDQGKCQSHWAFSVTGAIE 171
Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
GINKIVTG L+SLS Q++VDCD + GC GG AF ++I+NGG+D+E YPY
Sbjct: 172 GINKIVTGNLVSLSVQQVVDCD-PASHGCAGGFYFNAFGYVIENGGIDTEAHYPYTAQNG 230
Query: 236 KCDPSRRNA-KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
C + NA KVVSID V +E L + V+ QPVSV+I+A G Q Y GV+ GE
Sbjct: 231 TC---KANANKVVSIDNLLVVVGPEEALLCR-VSKQPVSVSIDATG--LQFYAGGVYGGE 284
Query: 295 -CGSALDHGVVA---VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT-NTGKCGI 349
C + VGYG+ G DYW+V+NSWG DWGE GY+ ++RN+ D G C I
Sbjct: 285 NCSKNSTKATLVCLIVGYGSVGGEDYWIVKNSWGKDWGEEGYLLIKRNVSDEWPYGVCAI 344
Query: 350 AMEASYPV 357
+P+
Sbjct: 345 NAAPGFPI 352
>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
Length = 347
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 140/296 (47%), Positives = 187/296 (63%), Gaps = 20/296 (6%)
Query: 73 FQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+ F N+ I+EHN +R T+++GLN ADL EYR + G R RRL +
Sbjct: 60 MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRKLN-GYRH---RRLFGDSM 115
Query: 129 ASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
++ ++P+SVDWRE V PVK+QG CGSCWAFS A+EG + TG+L+
Sbjct: 116 RKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLV 175
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQ LVDC K N GCNGGLMD AF++I N G+D+E+ YPY+G E +C +R+
Sbjct: 176 SLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIG 235
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHG 302
G+ D+ DE +LK AVA Q P+S+AI+AG R+FQ Y+ GV F EC S LDHG
Sbjct: 236 AED-RGFVDLPEGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHG 294
Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V+ VGYGT+ DYW+++NSWG+ WGE GYV++ RN CG+A +ASYP+
Sbjct: 295 VLLVGYGTDPEAGDYWIIKNSWGTKWGEKGYVRIARN----RNNHCGVATKASYPL 346
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 149/372 (40%), Positives = 215/372 (57%), Gaps = 31/372 (8%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MATAS LA+ L + + +A I+ ++ ++ W A++
Sbjct: 3 MATASASLALVMLFACSLLLAGTAFSDDTIAIP--------------LLERFKAWQAEYN 48
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSD 118
+T ++RF ++ +NLRFI N L+ +Y++G N+F DLT EE++ YL +
Sbjct: 49 RTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDE 108
Query: 119 --AKRRLMKSKVASQRYACKA-GD---ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
M V + A + GD E P SVDWR KGAV PVK+Q CGSCWAF+TVA
Sbjct: 109 QPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVA 168
Query: 173 AVEGINKIVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
++EG+++I TG L+SLSEQE+VDCDR N GC GG A +++ +NGG+ +E DYPY+
Sbjct: 169 SIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYV 228
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
G++ +C + I GY+ V +E L++AVA +PV+V I+A RAFQ Y+ GVF
Sbjct: 229 GSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDA-SRAFQFYKRGVF 287
Query: 292 TGECG-SALDHGVVAVGYGTENGV-----DYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
+G C + ++H V VGYG+ YW+V+NSWG WGENGYV++ R + G
Sbjct: 288 SGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRARE-G 346
Query: 346 KCGIAMEASYPV 357
C IA+E YPV
Sbjct: 347 MCAIAIEPYYPV 358
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/351 (39%), Positives = 203/351 (57%), Gaps = 19/351 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
+VFLF A S S D D +M ++ W+ ++G+ +R
Sbjct: 7 VVFLFLFLCVMWASPSAASAD---------EPSDPMMKRFEEWMVEYGRVYKDNDEKMRR 57
Query: 73 FQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N+ I+ NS N +Y +G+N+F D+TN E+ A Y G S R L +
Sbjct: 58 FQIFKNNVNHIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGIS---RPLNIEREPVV 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ +P+S+DWR+ GAV VK+Q CG+CWAF+ +A VE I KI G L LSEQ
Sbjct: 115 SFDDVDISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQ 174
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
+++DC + GC GG AF+FII N G+ S YPY A+ C + I G
Sbjct: 175 QVLDCAK--GYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCK-TNGVPNSAYITG 231
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
Y V +E S+ AV+ QP++VA++A FQ+Y+SGVF G CG++L+H V A+GYG +
Sbjct: 232 YARVPRNNESSMMYAVSKQPITVAVDANAN-FQYYKSGVFNGPCGTSLNHAVTAIGYGQD 290
Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
NG YW+V+NSWG+ WGE GY+++ R+ + +++G CGIA+++ YP S+
Sbjct: 291 SNGKKYWIVKNSWGARWGEAGYIRMARD-VSSSSGICGIAIDSLYPTLESR 340
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/312 (45%), Positives = 188/312 (60%), Gaps = 20/312 (6%)
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMY 112
A HGK + R +I+ +N I HN +YK+ +N+F DL + E+
Sbjct: 55 ALHGKEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEF---- 110
Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDE---LPESVDWREKGAVNPVKDQGSCGSCWAFS 169
+ TR+ KR + Y G E LP++VDWR+KGAV PVK+QG CGSCWAFS
Sbjct: 111 VSTRNGFKRNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFS 170
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
T ++EG + TG ++SLSEQ LVDC K N GC GGLMD AF++I NGG+D+E Y
Sbjct: 171 TTGSLEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSY 230
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
PY G + C + + G+ D+ +E LKKAVA PVSVAI+A +FQ Y
Sbjct: 231 PYNGTDGICHFEKSDVGATDT-GFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYS 289
Query: 288 SGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV+ EC S +LDHGV+ VGYGT++G DYWLV+NSWG+ WG++GY+ + RN
Sbjct: 290 QGVYDEPECSSESLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDDGYIYMTRN----KEN 345
Query: 346 KCGIAMEASYPV 357
+CGIA ASYP+
Sbjct: 346 QCGIASSASYPL 357
>gi|46576373|sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C
gi|46014979|pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
gi|46014980|pdb|1O0E|B Chain B, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
Length = 208
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 123/218 (56%), Positives = 153/218 (70%), Gaps = 11/218 (5%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LPE +DWR+KGAV PVK+QGSCGSCWAFSTV+ VE IN+I TG LISLSEQELVDCD+K
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
N GC GG +A+Q+II NGG+D++ +YPY + C + +KVVSIDGY V +E
Sbjct: 60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAA---SKVVSIDGYNGVPFCNE 116
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
+LK+AVA QP +VAI+A FQ Y SG+F+G CG+ L+HGV VGY +YW+VR
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA----NYWIVR 172
Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NSWG WGE GY+++ R G CGIA YP K
Sbjct: 173 NSWGRYWGEKGYIRMLR---VGGCGLCGIARLPYYPTK 207
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 192/319 (60%), Gaps = 18/319 (5%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
+ T ++ + ++H K + RF+IF +N + +HN+ +YK+ +NKF DL
Sbjct: 23 LRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLAMNKFGDL 82
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
E+ M G R + + + A LP +VDWR+KGAV PVK+QG CG
Sbjct: 83 LPHEFAKMVNGYRGKQNKEQRPTFIPP---ANLNDSSLPTTVDWRKKGAVTPVKNQGQCG 139
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNGGM 222
SCWAFST ++EG + TG+L+SLSEQ LVDC D N GCNGGLMD FQ+I NGG+
Sbjct: 140 SCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGGI 199
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
D+E+ +PY + C + A V + D G+ D+ E LKKAVA PVSVAI+A
Sbjct: 200 DTEESHPYTAQDGDC--KFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDASH 257
Query: 281 RAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+FQ Y GV+ +C S+ LDHGV+ VGYG +NG YWLV+NSWG DWG+NGY+ + R+
Sbjct: 258 GSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGVKNGKKYWLVKNSWGGDWGDNGYILMSRD 317
Query: 339 LLDTNTGKCGIAMEASYPV 357
+CGIA ASYP+
Sbjct: 318 ----KDNQCGIASSASYPL 332
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 193/319 (60%), Gaps = 24/319 (7%)
Query: 50 TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTN 105
+++Q +L K+ + + E+R IF +N I EHN L +Y +G+N F+D TN
Sbjct: 65 SMWQAFLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSDKTN 124
Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
E + G R +K S+ SQ A P VDWR KGAV PVK+QG CGSC
Sbjct: 125 SELDVLR-GFRHSSK----ASRSGSQYIPFDAAP--PAEVDWRTKGAVTPVKNQGDCGSC 177
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
WAFS +EG + + TG+L+SLSEQ+LVDC N GC+GGLMD AF+++ ++ G+D+E
Sbjct: 178 WAFSATGGIEGQHYLATGKLVSLSEQQLVDCSSS-NDGCDGGLMDLAFEYVKEHKGIDTE 236
Query: 226 QDYPYL----GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGG 280
YPY+ G +C + A V++ GY D+ E+ L++AV P+SV I AG
Sbjct: 237 VHYPYVSGNTGYARQCSFDPKYA-AVNVTGYVDIPEGQELLLQQAVGFHGPISVGINAGL 295
Query: 281 RAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+F YESG+++ C LDHGV+ VGYG +NGV YWL++NSWG DWGENGYV++ RN
Sbjct: 296 PSFMAYESGIYSDHRCNPHDLDHGVLVVGYGVDNGVPYWLIKNSWGEDWGENGYVRILRN 355
Query: 339 LLDTNTGKCGIAMEASYPV 357
+ CG+A ASYP+
Sbjct: 356 ----HNNLCGVATMASYPL 370
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/351 (39%), Positives = 203/351 (57%), Gaps = 19/351 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVFLF A S S D D +M ++ W+ ++G+ +R
Sbjct: 7 LVFLFLFLCVMWASPSAASAD---------EPSDPMMKRFEEWMVEYGRVYKDNDEKMRR 57
Query: 73 FQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N+ I+ NS N+ +Y +G+N+F D+TN E+ A Y G S R L +
Sbjct: 58 FQIFKNNVNHIETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGIS---RPLNIEREPVV 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ +P+S+DWR+ GAV VK+Q CG+CWAF+ +A VE I KI G L LSEQ
Sbjct: 115 SFDDVDISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQ 174
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
+++DC + GC GG AF+FII N G+ S YPY A+ C + I G
Sbjct: 175 QVLDCAK--GYGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCK-TNGVPNSAYITG 231
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
Y V +E S+ AV+ QP++VA++A + Q+Y SGVF G CG++L+H V A+GYG +
Sbjct: 232 YARVPRNNESSMMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYGQD 290
Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
NG YW+V+NSWG+ WGE GY+++ R+ + +++G CGIA+++ YP S+
Sbjct: 291 SNGKKYWIVKNSWGARWGEAGYIRMARD-VSSSSGICGIAIDSLYPTLESR 340
>gi|298709635|emb|CBJ31444.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
Length = 475
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 192/315 (60%), Gaps = 17/315 (5%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE---HNSLNRTYKVGLNKFADLTNEEYRAM 111
W K+G++ G + F ++ R D+ HN + Y + N ++ ++ +E+R
Sbjct: 164 WTYKYGQS---WGSVHEAFHALQNYARADDKIALHNHEDAGYTLAHNAYSHMSWQEFREH 220
Query: 112 Y-LGTRSDAKRRLMKSKVASQRYACKAGDEL------PESVDWREKGAVNPVKDQGSCGS 164
+ +G + ++ A + KA EL P+ VDW KGAV PVK+QGSCGS
Sbjct: 221 FSIGKDMVVPPDQLPAEFALRPRGEKAPKELLRGAPIPDEVDWVAKGAVTPVKNQGSCGS 280
Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
CW+FST ++EG + I G L LSEQELVDCD + GCNGGLMDY+F +I QNGG+ S
Sbjct: 281 CWSFSTTGSMEGAHFIKHGNLAVLSEQELVDCD-TYDMGCNGGLMDYSFHWIQQNGGICS 339
Query: 225 EQDYPYLGAENKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
E+DYPY A + C S + + +D + DV+ DE +L +AVA QPVS+AIEA +F
Sbjct: 340 EEDYPYTAAGDLCKKSTCDVVEGTMVDKWVDVASDDEQALMEAVAQQPVSIAIEADQMSF 399
Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Q Y GV T CG+ LDHGV+ VGYG +E+GV YW V+NSWG +WG GY+ L+R D
Sbjct: 400 QLYSGGVLTAACGTNLDHGVLLVGYGVSEDGVKYWKVKNSWGPEWGAEGYILLKRE-ADQ 458
Query: 343 NTGKCGIAMEASYPV 357
G+CGI +ASYPV
Sbjct: 459 EGGECGILEQASYPV 473
>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
Length = 334
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 139/320 (43%), Positives = 195/320 (60%), Gaps = 26/320 (8%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W K GK+ R + N + + HN + ++Y++G+ FAD++NEE
Sbjct: 26 FHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEE 85
Query: 108 YRAMY----LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
YR + LG+ ++ K R S + + +P++VDWR+KG V +KDQ CG
Sbjct: 86 YRQLVFRGCLGSMNNTKAR-----GGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCG 140
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
SCWAFS ++EG TG+L+SLSEQ+LVDC N GC+GGLMD AFQ+I N G+
Sbjct: 141 SCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGL 200
Query: 223 DSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
D+E YPY + +C +PS A S GY D++ DE +L++AVA P+SVAI+AG
Sbjct: 201 DTEDSYPYEAQDGECRFNPSTVGA---SCTGYVDIASGDESALQEAVATIGPISVAIDAG 257
Query: 280 GRAFQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
+FQ Y SGV+ +C S+ LDHGV+AVGYG+ NG DYW+V+NSWG DWG GY+ + R
Sbjct: 258 HSSFQLYSSGVYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSR 317
Query: 338 NLLDTNTGKCGIAMEASYPV 357
N + +CGIA ASYP+
Sbjct: 318 N----KSNQCGIATAASYPL 333
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 139/294 (47%), Positives = 188/294 (63%), Gaps = 15/294 (5%)
Query: 72 RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R +IF +N + I++HNS + ++K+ LN AD+ EY +YLG +K +K
Sbjct: 47 RKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNKSSKAN--NNK 104
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ S + A L + VDWR KGAV PVK+QG CGSCWAFST A+EG N TG+L+S
Sbjct: 105 LQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVS 164
Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ LVDC N GC GGLMD AFQ+I +N G+D+E+ YPY G + C R+ +
Sbjct: 165 LSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEGEDETC-RFRKTSIG 223
Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGV 303
+ G+ D++ DE +L +AVA P+SVAI+A ++FQ Y GV + EC S LDHGV
Sbjct: 224 ATDSGFVDITQGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEPECSSENLDHGV 283
Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ VGYG E+ YWLV+NSWG+ WG+ GY+K+ R+ D N CGIA +ASYP+
Sbjct: 284 LVVGYGVEDNQKYWLVKNSWGTQWGDGGYIKMARD-QDNN---CGIATQASYPL 333
>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
Length = 294
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 132/276 (47%), Positives = 174/276 (63%), Gaps = 14/276 (5%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
I LV L + SS A I+Y N D S ++ +++++ W HGKT
Sbjct: 7 ILKLVMLLLVFSSVTA----ITY-NPRDLS-----ENGLLSLFDRWCNHHGKTYTAK-QR 55
Query: 70 EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
RFQ+FK+NL +I EHNS N T+ +GLN F+DLT++E+R +G R +KS+
Sbjct: 56 PLRFQVFKENLFYISEHNSRGNHTFWLGLNAFSDLTSDEFRTQQMGLRGHPPS--LKSRR 113
Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
+ +P S+DWR+K AV VKDQG+CG CWAFS A+EGINKIVTG L+SL
Sbjct: 114 REPKSGLLELYNIPSSLDWRDKDAVTGVKDQGACGDCWAFSATGAIEGINKIVTGSLVSL 173
Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
SEQEL DCD N+GC+GGLMDYAFQ++I NGG+D+E DYPY G + C+ + N +VV+
Sbjct: 174 SEQELCDCDTSYNSGCDGGLMDYAFQWVIVNGGIDTEVDYPYKGVQKACNSKKVNRRVVT 233
Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
ID Y DV +E +L +AV QPVSV I G RAFQ
Sbjct: 234 IDDYIDVPANNERALLQAVVGQPVSVGISGGERAFQ 269
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 195/327 (59%), Gaps = 26/327 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKF 100
D ++ ++ W HGK + +R I++ NLR I HN + TY++G+N F
Sbjct: 22 DKQLDDHWEQWKTWHGKNYHEKEEGWRRM-IWEKNLRKIQFHNLEHSMGIHTYRLGMNHF 80
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+ +EE+R + G + +R+ S + E+P +DWREKG V PVKDQG
Sbjct: 81 GDMNHEEFRQVMNGYKHKTERKFKGSLFMEPNFL-----EVPSKLDWREKGYVTPVKDQG 135
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
CGSCWAFST A+EG G+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I N
Sbjct: 136 ECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDN 195
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
G+DSE+ YPYLG +++ P + K + + G+ D+ E +L KAVA PVSVAI
Sbjct: 196 NGLDSEEAYPYLGTDDQ--PCHYDPKYNAANDTGFVDIPSGKEHALMKAVASVGPVSVAI 253
Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
+AG +FQ Y+SG+ F EC S LDHGV+ VGYG E +G YW+V+NSW WG+
Sbjct: 254 DAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSESWGDK 313
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
GY+ + ++ CGIA ASYP+
Sbjct: 314 GYIYMAKD----RKNHCGIATAASYPL 336
>gi|198432215|ref|XP_002130162.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 331
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 196/317 (61%), Gaps = 21/317 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADLTNEE 107
++ W +GK KR I+ +NL+++ +HN TYKV N+FADL+N+E
Sbjct: 24 WEEWKTLYGKVYRAE-EELKRQYIWLENLKYVTQHNLEADEGKHTYKVDTNQFADLSNDE 82
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSC 165
+R + S R + + + GD + P++VDWR++G V PVKDQ CGSC
Sbjct: 83 WREL---MTSQVTRPTNQMSFCNMTFM-TVGDHVIAPKNVDWRKEGYVTPVKDQKQCGSC 138
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
WAFST ++EG + TG+L+SLSEQ LVDC K N GC GGLMD F++I NGG+D+
Sbjct: 139 WAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSMKEGNHGCQGGLMDLGFEYIFDNGGIDT 198
Query: 225 EQDYPYLGA-ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRA 282
E YPY+ E +C R N+ ++ G D+ E +L KAVAD P+SVAI+AG ++
Sbjct: 199 ESSYPYMAKNEPQCMYKRSNSG-ATLTGCVDIKRGSESALMKAVADVGPISVAIDAGHKS 257
Query: 283 FQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y+SGV + C S LDHGV+AVG+G +NG D+WLV+NSWG WG GY+ + RN
Sbjct: 258 FQMYKSGVYYEPSCSSVKLDHGVLAVGFGADNGEDFWLVKNSWGPIWGMEGYIMMSRN-R 316
Query: 341 DTNTGKCGIAMEASYPV 357
D N CGIA +ASYP+
Sbjct: 317 DNN---CGIATQASYPL 330
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 202/360 (56%), Gaps = 35/360 (9%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
+ STL+ + +S+A +S D V++ +++W HGKT +
Sbjct: 1 MKCSTLLLSVLVIASTANAVSFF---------------DVVLSDWESWKLMHGKTYSSSI 45
Query: 68 HNEKRFQIFKDNLRFIDEHNS--LN--RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRL 123
+ R +I+ +N I HNS LN Y + +N + DL + E+ AM G + K
Sbjct: 46 EEKLRLKIYMENSLKISRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQYANKTAS 105
Query: 124 MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
+ Y +LP VDWRE+GAV PVK+QG CGSCW+FS A+EG + TG
Sbjct: 106 LGG-----TYIPNKNIQLPTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTG 160
Query: 184 ELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
+LISLSEQ LVDC RK N GC GGLMD+AF +I N G+D+E YPY G + C + +
Sbjct: 161 KLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPK 220
Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT-GECGS-AL 299
N I G+ D+ E LKKAVA P+SVAI+A +FQ Y GV+ +C S L
Sbjct: 221 NKGGSDI-GFVDIKKGSEKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEEL 279
Query: 300 DHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
DHGV+ VG+GT+ +G DYWLV+NSW WG+ GY+K+ RN CGIA ASYPV
Sbjct: 280 DHGVLVVGFGTDSVSGEDYWLVKNSWSEKWGDQGYIKMARN----KENMCGIASSASYPV 335
>gi|169659203|dbj|BAG12786.1| putative cysteine protease [Sorogena stoianovitchae]
Length = 293
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 135/293 (46%), Positives = 188/293 (64%), Gaps = 19/293 (6%)
Query: 65 GMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
G ++ R +F +++R ++ N+ +Y +GLN+FADLT EE+ ++YLG ++
Sbjct: 18 GGAEDKHRLALFAESVRIVETENAKGHSYTLGLNQFADLTTEEFSSLYLGL-------VL 70
Query: 125 KSKV-ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
++KV AS+ + GD E+VDWR+KGAV PVKDQ SCGSCWAFS A+EG TG
Sbjct: 71 ENKVQASESVVLQDGDS-EENVDWRQKGAVTPVKDQKSCGSCWAFSATGAMEGALVKSTG 129
Query: 184 ELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN 243
+LI+LSEQ+LVDC K N GCNGGLM AF +++ G +E+DYPY G + +C + +
Sbjct: 130 KLINLSEQQLVDCVTKCN-GCNGGLMTAAFDYVLGR-GRATEKDYPYKGVDGRCKQTATD 187
Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
K I GY +V + +LK AVA P+SVA+ A G Q Y+SGV CG+ LDHGV
Sbjct: 188 NK---IKGYNNVPQNNYKALKAAVAS-PLSVAVNAAG-TIQRYKSGVIDANCGTRLDHGV 242
Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+AVGY G DYW+V+NSWG+ +GENGY +++ + G CGI M A+ P
Sbjct: 243 LAVGY---QGEDYWIVKNSWGNGYGENGYFRVKMGTQNGGAGVCGINMMAAQP 292
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 196/317 (61%), Gaps = 19/317 (5%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNE 106
++Q + H +T G +R ++F++NL+ I HN L+ Y++G+N+FAD+
Sbjct: 42 LWQDFKTVHERTY-GETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADMEAN 100
Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
E+ ++ G R + R ++ + + + +P VDWR++G V PVK+QG CGSCW
Sbjct: 101 EFASIMNGFRMN-NRTEVRDHLHANYISPAIPVSVPAEVDWRKEGYVTPVKNQGQCGSCW 159
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
AFST ++EG + TG+L+SLSEQ LVDC N GCNGG++DYAFQ+I N G D+E
Sbjct: 160 AFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDTE 219
Query: 226 QDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRA 282
YPY + C R + V + GY D+ DE +K+AVA PVSVAI+A +
Sbjct: 220 ACYPYEAVDGTC---RFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSS 276
Query: 283 FQHYESGVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y+SG++ EC LDH V+ VGYGTE G DYWLV+NSWG+ WG+ GY+K+ RN+
Sbjct: 277 FQMYQSGIYVEQECSPKQLDHAVLVVGYGTEQGQDYWLVKNSWGTTWGDEGYIKMARNM- 335
Query: 341 DTNTGKCGIAMEASYPV 357
+CGIA +ASYP+
Sbjct: 336 ---DNQCGIASQASYPL 349
>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
Length = 357
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 145/361 (40%), Positives = 204/361 (56%), Gaps = 22/361 (6%)
Query: 4 ASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS 63
+ +F+ + L SSSS + N D S DE + ++Q W +HG
Sbjct: 7 SKLFIFFFICITLICFSSSSNFPVQYSILGPNLDKLPS---QDETIQLFQLWRKEHGLVY 63
Query: 64 NGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTRSDAK 120
+ KRF+IF NL +I E N+ + Y +GLN FAD + E++ +YL +
Sbjct: 64 KDLKEMAKRFEIFLSNLNYIIEFNAKRSSPSGYLLGLNNFADWSPSEFQEIYLHSLDMPT 123
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
K+ +C A P S+DWR K AV +K+QGSCGSCWAFS A+EGI+ I
Sbjct: 124 DS--APKLNGPLLSCIA----PASLDWRNKVAVTAIKNQGSCGSCWAFSAAGAIEGIHAI 177
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE-NKCDP 239
TGELISLSEQELV+CDR ++ GCNGG ++ AF ++I NGG+ E +YPY G + C+
Sbjct: 178 TTGELISLSEQELVNCDR-VSKGCNGGWVNKAFDWVISNGGITLEAEYPYTGKDGGNCNS 236
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTG-ECGSA 298
++ +IDGYE V D L ++ QP+S+ + A FQ YESG+F G +C S+
Sbjct: 237 DKQVPIKATIDGYEQVEQSDN-GLLCSIVKQPISICLNA--TDFQLYESGIFDGQQCSSS 293
Query: 299 ---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
+H V+ VGY + NG DYW+V+NSWG+ WG NGY+ ++RN G CG+ A
Sbjct: 294 SKYTNHCVLIVGYDSSNGEDYWIVKNSWGTKWGINGYIWIKRN-TGLPYGVCGMNAWAYN 352
Query: 356 P 356
P
Sbjct: 353 P 353
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 197/321 (61%), Gaps = 18/321 (5%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
D +M ++ W A H ++ +RF++++ N+ +ID N TY++G N+FADL
Sbjct: 38 DMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADL 97
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD-----ELPESVDWREKGAVNPVKD 158
T EE+ A Y G + + + + A ++ D + P SVDWR KGAV PVK+
Sbjct: 98 TGEEFLARYAGGHTGSA--ITTAAEADGLWSSGGSDGSLEADPPASVDWRAKGAVTPVKN 155
Query: 159 QGS-CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
QGS C SCWAFS VA +E + I TG+L++LSEQ+LVDCD K + GCN G AFQ+I+
Sbjct: 156 QGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQLVDCD-KYDGGCNKGYYHRAFQWIM 214
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+NGG+ + YPY C ++ V+I G+ V+ +E++L+ AVA QP+ VAIE
Sbjct: 215 ENGGITTAAQYPYKAVRGACSAAK---PAVTITGHLAVAK-NELALQSAVARQPIGVAIE 270
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
+ Q Y+SGVF+ CG + H VV VGYG + +G+ YWLV+NSWG WGE GY++++
Sbjct: 271 V-PISMQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMR 329
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
R++ G CGIA++ +YP
Sbjct: 330 RDV--GGGGLCGIALDTAYPT 348
>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 137/290 (47%), Positives = 189/290 (65%), Gaps = 24/290 (8%)
Query: 76 FKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA 134
F N+ +I+ +N+ ++ YK G+N+F R+ K + S + +
Sbjct: 57 FXGNVNYIEACNNAADKPYKXGINQFP-------------PRNRFKGHMCSSIIRITTFK 103
Query: 135 CKAGDELPESVDWREKGAVNP--VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS-EQ 191
+ P +VD R+KGAV P VKDQG CG WA S VAA EGI+ + G+LI LS E
Sbjct: 104 FENVTATPSTVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEP 163
Query: 192 ELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR--RNAKVVS 248
ELVDCD K ++ GC GGL D AF+FIIQN G+++E +YPY G + KC+ + +NA +
Sbjct: 164 ELVDCDTKGVDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEADKNAATI- 222
Query: 249 IDGYEDVSPFDEMS-LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
I GY+DV +E + L+KAVA+ PVSVAI+A G FQ Y+SGVFTG CG+ LDHGV AVG
Sbjct: 223 ITGYDDVPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVG 282
Query: 308 YG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
YG +++G +YWLV+NS G +WGE GY+++QR +D+ CGIA++ASYP
Sbjct: 283 YGVSDDGTEYWLVKNSRGPEWGEEGYIRMQRG-VDSEEALCGIAVQASYP 331
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 204/329 (62%), Gaps = 27/329 (8%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
D + T ++ W + HGK+ +R +++++LR I+ HN SL + ++++G+N F
Sbjct: 22 DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEEHLRVIEIHNLEHSLGKHSFRLGMNHF 80
Query: 101 ADLTNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
D+ NEE+R + G + ++L S + E+P+ VDWR++G V PVKDQ
Sbjct: 81 GDMPNEEFRQLMNGYKYKQTHKKLQGSHFLEPNFL-----EVPKHVDWRDEGYVTPVKDQ 135
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQ 218
G CGSCWAFST A+EG + TG+L+SLSEQ LV+C + + N GCNGGLMD AFQ++
Sbjct: 136 GQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKD 195
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVA 275
NGG+DSE YPY+G ++ P N + + + G+ D+ E +L KA+A PVSVA
Sbjct: 196 NGGIDSEDSYPYVGTDDT--PCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVA 253
Query: 276 IEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
I+AG +FQ Y+SG+ F EC S LDHGV+ VGYG E +G YW+V+NSW WG+
Sbjct: 254 IDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQ 313
Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NGY+ + ++ CGIA ASYP++
Sbjct: 314 NGYILMAKD----KDNHCGIATAASYPLE 338
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 148/305 (48%), Positives = 193/305 (63%), Gaps = 29/305 (9%)
Query: 71 KRFQIFKDNLRFI----DEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
+ F++F+ NL I +E+N ++Y++GLN FA LT EE+ A YLG A+ K+
Sbjct: 50 RAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGY-GGAEVEQPKT 108
Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
+ A ++ K+ E+P SVDWREKGAV VK+QG+CGSCWAFS VAA+EG + + +GELI
Sbjct: 109 RRAG-KHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGELI 167
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM--DSEQDYPYLGAENKCDPSRRN 243
SLSEQ+LVDC +K N GC GG MD AF++ + N G DSE+DYPY G + KC S
Sbjct: 168 SLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSADG 227
Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVF---TGECGSAL 299
+ +I GY DV +E L AVA+ PVSVAI AG A Q Y GVF G C L
Sbjct: 228 VR-ATISGYNDVKQGNETDLLDAVANVGPVSVAIHAGA-ALQFYLRGVFNGVAGTCFGPL 285
Query: 300 DHGVVAVGYGTEN-----GVDYWLVRNSWGSDWGENGYVKLQR--NLLDTNTGKCGIAME 352
+HGV AVGYGT + +DYW+++NSWG WGE G+V+ R NL CG+A
Sbjct: 286 NHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNL-------CGVANG 338
Query: 353 ASYPV 357
ASYP+
Sbjct: 339 ASYPL 343
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 186/314 (59%), Gaps = 17/314 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+ W +HGK R I++ NL + +HN + TY +G+N+FADL NEE
Sbjct: 28 WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLQNEE 87
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ AM G R + + K S D+LP++VDWR KG V PVKDQG CGSCWA
Sbjct: 88 FVAMMTGFRVNGTSKAAK---GSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQCGSCWA 144
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
FS ++EG TG+L+SLSEQ LVDC + N GC+GG MD AFQ+II GG+D+E
Sbjct: 145 FSATGSLEGQQFKKTGKLVSLSEQNLVDCSYR-NYGCHGGFMDRAFQYIIDAGGIDTEAT 203
Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHY 286
Y Y + C + N ++ GY DV+ E +L+KAVA P+SVAI+A + F+ Y
Sbjct: 204 YSYRAVDGNCHFKKANVG-ATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHKFFKFY 262
Query: 287 ESGVFT--GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
+SGV+ G + L H V+ VGYG T +G DYW+V+NSW WG NGY+ + RN
Sbjct: 263 KSGVYNEPGCSTTRLGHAVLVVGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMSRN----K 318
Query: 344 TGKCGIAMEASYPV 357
+CGIA EASYP+
Sbjct: 319 DNQCGIASEASYPM 332
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 137/318 (43%), Positives = 188/318 (59%), Gaps = 20/318 (6%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADL 103
T + + ++G+ R ++ N+ FI+ HN TY + +N+F D+
Sbjct: 18 TFTSFHQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDM 77
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
TNEE A+ G ++ R + V R D LP VDWR KGAV PVKDQ +CG
Sbjct: 78 TNEEINAVMNGLLPASESRGVA--VLGGR-----DDTLPAEVDWRTKGAVTPVKDQKACG 130
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
SCWAFS ++EG + + G+L+SLSEQ LVDC K + GC GGLMD+AF +I NGG+
Sbjct: 131 SCWAFSATGSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGI 190
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGR 281
D+E YPY + KC + N+ ++ GY DV E +L+KAVA P+SVAI+A
Sbjct: 191 DTEASYPYEATDGKCQYNPANSG-ATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRS 249
Query: 282 AFQHYESGV-FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
F Y GV + EC S +LDHGV+AVGYGT++G DYWLV+NSW WG +G++++ RN
Sbjct: 250 TFHFYHKGVYYDKECSSTSLDHGVLAVGYGTQDGTDYWLVKNSWNITWGNHGFIEMSRN- 308
Query: 340 LDTNTGKCGIAMEASYPV 357
CGIA +ASYP+
Sbjct: 309 ---RNNNCGIATQASYPL 323
>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
Length = 347
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 140/296 (47%), Positives = 186/296 (62%), Gaps = 20/296 (6%)
Query: 73 FQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
+ F N+ I+EHN +R T+++GLN ADL EYR + G R RRL +
Sbjct: 60 MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRKLN-GYRH---RRLFGDSM 115
Query: 129 ASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
++ + P+SVDWRE V PVK+QG CGSCWAFS A+EG + TG+L+
Sbjct: 116 RKNGTKFLVPFNVKAPDSVDWREHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLV 175
Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
SLSEQ LVDC K N GCNGGLMD AF++I N G+D+E+ YPY+G E +C +R+
Sbjct: 176 SLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIG 235
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHG 302
G+ D+ DE +LK AVA Q P+S+AI+AG R+FQ Y+ GV F EC S LDHG
Sbjct: 236 AED-RGFVDLPEGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHG 294
Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V+ VGYGT+ DYW+++NSWG+ WGE GYV++ RN CG+A +ASYP+
Sbjct: 295 VLLVGYGTDPEAGDYWIIKNSWGTKWGEKGYVRIARN----RNNHCGVATKASYPL 346
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 141/326 (43%), Positives = 196/326 (60%), Gaps = 24/326 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKF 100
D ++ + W H K + +R I++ NL+ I+ HN + TY++G+N F
Sbjct: 22 DQQLNDHWDQWKKWHSKKYHATEEGWRRV-IWEKNLKKIEMHNLEHSMGIHTYRLGMNHF 80
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+T+EE+R + G + RR S + E+P +DWREKG V PVKDQG
Sbjct: 81 GDMTHEEFRQVMNGFKHKKDRRFRGSLFMEPNFI-----EVPNKLDWREKGYVTPVKDQG 135
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
CGSCWAFST A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++
Sbjct: 136 ECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQ 195
Query: 220 GGMDSEQDYPYLGAENK-CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
G+DSE+ YPYLG +++ C +N+ + G+ D+ E +L KA+A PVSVAI+
Sbjct: 196 NGLDSEESYPYLGTDDQPCHFDPKNS-AANDTGFVDIPSGKERALMKAIAAVGPVSVAID 254
Query: 278 AGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENG 331
AG +FQ Y+SG+ + EC S LDHGV+AVGYG E +G YW+V+NSW +WG+ G
Sbjct: 255 AGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKG 314
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
Y+ + ++ CGIA ASYP+
Sbjct: 315 YIYMAKD----RHNHCGIATAASYPL 336
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 146/328 (44%), Positives = 203/328 (61%), Gaps = 33/328 (10%)
Query: 46 DEVMTIYQTW-LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
DE ++++W K+ + G R +++ NL+ I+ HN S+ + TY++G+N F
Sbjct: 27 DEHWNLWKSWHTKKYHEKEEGW-----RRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHF 81
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+TNEE+R + G + A+R++ S + E P S+DWR+KG V PVKDQG
Sbjct: 82 GDMTNEEFRQLMNGYKHKAERKVKGSLFLEPNFL-----EAPRSLDWRDKGYVTPVKDQG 136
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
CGSCWAFS A+EG TG+++ LSEQ LV+C R + N GCNGGLMD AFQ++ N
Sbjct: 137 QCGSCWAFSATGALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGLMDQAFQYVKDN 196
Query: 220 GGMDSEQDYPYLGAEN-KC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAV-ADQPVSVA 275
G+DSE+ YPYLG ++ KC DP R NA V+ G+ D+ E +L KAV A P+SVA
Sbjct: 197 QGLDSEESYPYLGTDDQKCHYDP-RYNA--VNDTGFVDIKSGSEHALMKAVTAVGPISVA 253
Query: 276 IEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
I+AG +FQ Y+SG+ + EC S LDHGV+ VGYG E +G YW+V+NSW WG+
Sbjct: 254 IDAGHESFQFYQSGIYYEPECSSEELDHGVLLVGYGFEGEDVDGKKYWIVKNSWSEKWGD 313
Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GYV + ++ CGIA ASYP+
Sbjct: 314 KGYVYMAKD----RQNHCGIATAASYPL 337
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 144/330 (43%), Positives = 199/330 (60%), Gaps = 37/330 (11%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEK----RFQIFKDNLRFID----EHNSLNRTYKVGL 97
DE ++++W K ++EK R +++ NL+ I+ EH+ TY++G+
Sbjct: 25 DEHWDLWKSWHTKK--------YHEKEEGWRRMVWEKNLKKIELHNLEHSMGEHTYRLGM 76
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
N F D+T+EE+R + G + ++R+ S + E P SVDWR+ G V PVK
Sbjct: 77 NHFGDMTHEEFRQIMYGYKRKSERKFKGSLFMEPNFL-----EAPRSVDWRDNGYVTPVK 131
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFI 216
DQG CGSCWAFST A+EG + TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191
Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVS 273
N G+DSE YPYLG +++ P + K S + G+ D+ E +L KAVA PVS
Sbjct: 192 KDNQGLDSEDSYPYLGTDDQ--PCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVS 249
Query: 274 VAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDW 327
VAI+AG +FQ Y+SG+ + EC S LDHGV+ VGYG E +G YW+V+NSW W
Sbjct: 250 VAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKW 309
Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
G+ GY+ + ++ CGIA ASYP+
Sbjct: 310 GDKGYIYMAKD----RKNHCGIATAASYPL 335
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 195/329 (59%), Gaps = 26/329 (7%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLN 98
R D ++ + W H K + +R +++ NL+ I+ EH ++++G+N
Sbjct: 21 RFDSQLEDHWHLWKNWHSKNYHASEEGWRRM-VWEKNLKKIEIHNLEHTMGKHSHRLGMN 79
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
F D+TNEE+R G + +R+ S Y + P++VDWREKG V PVKD
Sbjct: 80 HFGDMTNEEFRQTMNGYKQTTERKFKGSLFMEPNYL-----QAPKAVDWREKGYVTPVKD 134
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
QGSCGSCWAFST A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 135 QGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQ 194
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSV 274
N G+D+E+ YPY+G + DP + + + G+ D+ E ++ KAVA PVSV
Sbjct: 195 DNAGLDTEESYPYVGTDE--DPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSV 252
Query: 275 AIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
AI+AG +FQ YESG+ + EC S LDHGV+ VGYG E +G YW+V+NSW WG
Sbjct: 253 AIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWG 312
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ GY+ + ++ CGIA +SYP+
Sbjct: 313 DKGYIYMAKD----RKNHCGIATASSYPL 337
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 247 bits (631), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 140/328 (42%), Positives = 202/328 (61%), Gaps = 25/328 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
D + T ++ W + HGK+ +R +++ +LR I+ HN SL + ++++G+N F
Sbjct: 22 DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRVIEIHNLEHSLGKHSFRLGMNHF 80
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+ NEE+R + G K + K+ + E+P+ VDWR++G V PVKDQG
Sbjct: 81 GDMPNEEFRQLMNGY----KYKQTHKKLQGSHFLEPNFQEVPKHVDWRDEGYVTPVKDQG 136
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
CGSCWAFST A+EG + TG+L+SLSEQ LV+C + + N GCNGGLMD AFQ++ N
Sbjct: 137 QCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDN 196
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
GG+DSE YPY+G ++ P N + + + G+ D+ E +L KA+A PVSVAI
Sbjct: 197 GGIDSEDSYPYVGTDDT--PCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAI 254
Query: 277 EAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
+AG +FQ Y+SG+ F EC S LDHGV+ VGYG E +G YW+V+NSW WG+N
Sbjct: 255 DAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQN 314
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GY+ + ++ CGIA ASYP++
Sbjct: 315 GYILMAKD----KDNHCGIATAASYPLE 338
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 198/327 (60%), Gaps = 26/327 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKF 100
D ++ +Q W H K + +R +++ NLR I+ EH+ +Y++G+N F
Sbjct: 21 DPQLDQHWQLWKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHF 79
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+T+EE+R + G + +R+ S + E P +VDWR+KG V PVKDQG
Sbjct: 80 GDMTHEEFRQIMNGYKRREQRKYSGSLFMEPNFL-----EAPRAVDWRDKGYVTPVKDQG 134
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
CGSCWAFST A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++ N
Sbjct: 135 QCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDN 194
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
G+DSE YPY G +++ P + NA+ +++ G+ D+ E +L KAVA PVSVAI
Sbjct: 195 QGLDSEDFYPYKGTDDQ--PCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAI 252
Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
+AG +FQ Y+SG+ F EC S LDHGV+ VGYG E +G YW+V+NSW WG+
Sbjct: 253 DAGHESFQFYQSGIYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDK 312
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
G++ + ++ CGIA ASYP+
Sbjct: 313 GFIYMAKD----RHNHCGIATAASYPL 335
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 189/310 (60%), Gaps = 28/310 (9%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
++ W+++ + + RF+IFK NL+F++ N + N TYK+ +NKF+DLT+EE++A
Sbjct: 18 HEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDLTDEEFQA 77
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
Y+G + + K S RY + E ES+DWR +GAV PVKDQG CG CWAF+
Sbjct: 78 RYMGLVPEGMTGDSQ-KTVSFRY--ENVSETGESMDWRLEGAVTPVKDQGQCGCCWAFAA 134
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
VAAVEG+ KI GEL+SLSEQ+LVDC N GC+GGL A+ +I +N G+ SE++YP
Sbjct: 135 VAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGITSEENYP 194
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
Y + C + A +S GYE V DE +L KAV+ QH G
Sbjct: 195 YQAVQQTCKSTDPAAATIS--GYEAVPKDDEEALLKAVS---------------QH---G 234
Query: 290 VFTGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
+F E CG+ H V VGYGT E G+ YWL++NSWG WGENGY++++R+ +D G C
Sbjct: 235 IFEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRD-VDEPQGMC 293
Query: 348 GIAMEASYPV 357
G+A A YPV
Sbjct: 294 GLAHRAYYPV 303
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 139/295 (47%), Positives = 190/295 (64%), Gaps = 19/295 (6%)
Query: 72 RFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R IF+ N++ I+ HN L +Y++GLN FAD+T +E+ Y GTR +A +++
Sbjct: 45 RRAIFEANIKKINAHNLLYDLGRSSYRLGLNGFADMTPDEFEK-YRGTRFEAN----EAR 99
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
V+ ++ +P++VDWR +G V PVK+QG CGSCWAFST A+EG + +G+L+S
Sbjct: 100 VSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVS 159
Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ LVDC NAGCNGGLMD AF+FI GG+++E+ YPY G + C R
Sbjct: 160 LSEQMLVDCSAVYGNAGCNGGLMDNAFRFIKDAGGLETEKSYPYTGKDGTCHFDARGIG- 218
Query: 247 VSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFTG-ECGS-ALDHGV 303
+ G+ DV DE +LK+A PVSVAI+A G+ FQ Y+ GV+ C S +LDHGV
Sbjct: 219 AKLTGFVDVPSRDEEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEITCSSTSLDHGV 278
Query: 304 VAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ VGYG T +G DYWLV+NSWGS WG++GY+++ RN +CGIA ASYP
Sbjct: 279 LVVGYGTTRDGKDYWLVKNSWGSSWGQSGYIQMSRN----KENQCGIATMASYPT 329
>gi|348531519|ref|XP_003453256.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 139/317 (43%), Positives = 188/317 (59%), Gaps = 20/317 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W K K+ + R Q++ +N +F+ HN L ++Y++G+ FAD+ NEE
Sbjct: 26 FHAWKLKFEKSYDSESDEAHRKQVWLNNRKFVLMHNILADQGLKSYRLGMTHFADMDNEE 85
Query: 108 YRAMY-LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
Y+ + G L + S G LP++VDWR+KG V VKDQ CGSCW
Sbjct: 86 YKQLVSQGCLHTFNASLPER--GSAFLGLPEGTALPDTVDWRDKGYVTEVKDQKQCGSCW 143
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
AFST +EG + TG+L+SLSEQ+L+DC N GCNGG + A Q+I NGG+D+E
Sbjct: 144 AFSTTGVLEGQHFRKTGKLVSLSEQQLMDCSHSFGNNGCNGGSVKRALQYIQANGGIDTE 203
Query: 226 QDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRA 282
YPY +C P AK GY V P +E +LKKAVA P+SV I+A +
Sbjct: 204 TSYPYKAKGQRCRYKPDGIGAKCT---GYVHVKPSNEETLKKAVATLGPISVGIDASRHS 260
Query: 283 FQHYESGVF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
FQ Y+SGV+ +C + LDHG +AVGYGTENG DYWL++NSWG WG+ GY+K+ RN
Sbjct: 261 FQFYQSGVYDDPDCSKTVLDHGALAVGYGTENGHDYWLIKNSWGLRWGDKGYIKMSRN-- 318
Query: 341 DTNTGKCGIAMEASYPV 357
+ +CGIA EASYP+
Sbjct: 319 --KSNQCGIASEASYPL 333
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 186/317 (58%), Gaps = 17/317 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---LNR-TYKVGLNKFADLTNEE 107
+ T+ H K + R +IF +N I HN LN +YK+G+NK+ D+ + E
Sbjct: 28 WNTFKVTHRKAYDSKIEESFRMKIFMENWHKIALHNQKYELNEVSYKLGMNKYGDMLHHE 87
Query: 108 YRAMYLGTRSD--AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
+ G A+ R + + S R+ A E+P SVDWR GAV P+KDQG CGSC
Sbjct: 88 FINTLNGFNKSVSAQLRAQRRPIGS-RFIEPANVEIPSSVDWRTHGAVTPIKDQGHCGSC 146
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDS 224
W+FS A+EG + +TG+L+SLSEQ L+DC R N GCNGGLMD AFQ+I N G+D+
Sbjct: 147 WSFSATGALEGQHYRITGKLVSLSEQNLIDCSGRYGNNGCNGGLMDQAFQYIKDNHGLDT 206
Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAF 283
E YPY +KC + RN GY D+ +E LK AVA PVSVAI+A +F
Sbjct: 207 EISYPYEAENDKCRYNPRNNGATD-SGYVDIPEGNEKKLKAAVATIGPVSVAIDASAESF 265
Query: 284 QHYESGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
Q Y GV + C S LDHGV+ VGYGT +N DYWLV+NSWG WG+ GY+K+ RN
Sbjct: 266 QFYREGVYYEPRCSSENLDHGVLVVGYGTDDNDQDYWLVKNSWGVTWGDEGYIKMARN-- 323
Query: 341 DTNTGKCGIAMEASYPV 357
CGIA ASYP+
Sbjct: 324 --KDNHCGIASSASYPL 338
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 133/315 (42%), Positives = 192/315 (60%), Gaps = 17/315 (5%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
D +M ++ W A H ++ +RF++++ N+ +ID N TY++G N+FADL
Sbjct: 38 DMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADL 97
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS-C 162
T EE+ A Y G + + S + P SVDWR KGAV PVK+QGS C
Sbjct: 98 TGEEFLARYAGGHTGSAITTAAEADGSLE------ADPPASVDWRAKGAVTPVKNQGSQC 151
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
SCWAFS VA +E + I TG+L++LSEQ+LVDCD K + GCN G AFQ+I++NGG+
Sbjct: 152 YSCWAFSAVATMESLYFIKTGKLVALSEQQLVDCD-KYDGGCNKGYYHRAFQWIMENGGI 210
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
+ YPY C ++ V+I G+ V+ +E++L+ AVA QP+ VAIE +
Sbjct: 211 TTAAQYPYKAVRGACSAAK---PAVTITGHLAVAK-NELALQSAVARQPIGVAIEV-PIS 265
Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Q Y+SGVF+ CG + H VV VGYG + +G+ YWLV+NSWG WGE GY++++R++
Sbjct: 266 MQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDV-- 323
Query: 342 TNTGKCGIAMEASYP 356
G CGIA++ +YP
Sbjct: 324 GGGGLCGIALDTAYP 338
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 190/320 (59%), Gaps = 20/320 (6%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN---RTYKVGLNKFADL 103
E + W+ H K+ + H RF+I+K N R+I N + ++ V +N+F DL
Sbjct: 90 EEQRAFTEWMRTHRKSYH-HDHFLPRFEIWKTNNRWITHWNKKHANASSFTVAINQFGDL 148
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
T++E+ +Y G + + + +++A AG +PES DWR+KG V+ VKDQG CG
Sbjct: 149 TSDEFNRLYNGLHVFSAPKASEKVERPRQWANTAG--IPESGDWRQKGVVSRVKDQGMCG 206
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI--NAGCNGGLMDYAFQFIIQNGG 221
SCWAFST + EGIN I T L+ LSEQ LVDC N GCNGG MD AF++II N G
Sbjct: 207 SCWAFSTTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRYIIDNKG 266
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVV---SIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
+DSE YPY+ A+ +C R N K V + + DE +L A A QP+SV I+A
Sbjct: 267 IDSEASYPYVAADGQC---RFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVGIDA 323
Query: 279 GGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
G +FQ Y GV+ EC S L+HGV+ VG+G E G YWLV+NSWG WG +GY+K+
Sbjct: 324 GRPSFQFYSKGVYNEPECSSTELNHGVLIVGWGVERGQAYWLVKNSWGQTWGMDGYIKMS 383
Query: 337 RNLLDTNTGKCGIAMEASYP 356
R D N +CGIA ASYP
Sbjct: 384 R---DKNN-QCGIATLASYP 399
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 137/315 (43%), Positives = 191/315 (60%), Gaps = 21/315 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
++T+ HGK R +IF +N + I+ HN+ +YK+ +N F DL + E
Sbjct: 27 WETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEAHNAKYEQGEVSYKMKMNHFGDLMSHE 86
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+A+ G + + + + + D+LP+SVDWR+KGAV PVKDQG CGSCW+
Sbjct: 87 IKALMNGFK------MTPNTKREGKIYFPSNDKLPKSVDWRQKGAVTPVKDQGQCGSCWS 140
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS ++EG + G+L+SLSEQ L+DC ++ N GC GGLMD AFQ++ N G+D+E
Sbjct: 141 FSATGSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNNGCEGGLMDKAFQYVSDNKGIDTES 200
Query: 227 DYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY + C + KV D GY D+ DE +L+ A+A P+SVAI+A +F
Sbjct: 201 SYPYEARDYAC--RFKKDKVGGTDKGYVDIPEGDEKALQNALATVGPISVAIDASHESFH 258
Query: 285 HYESGVFTGECGSA--LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
Y GV+ S+ LDHGV+AVGYGTENG DYWLV+NSWG WGE+GY+K+ RN
Sbjct: 259 FYSEGVYNEPYCSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGESGYIKIARN---- 314
Query: 343 NTGKCGIAMEASYPV 357
++ CGIA ASYP+
Sbjct: 315 HSNHCGIASMASYPI 329
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 203/329 (61%), Gaps = 27/329 (8%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
D + T ++ W + HGK+ +R +++ +LR I+ HN SL + ++++G+N F
Sbjct: 22 DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRVIEIHNLEHSLGKHSFRLGMNHF 80
Query: 101 ADLTNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
D+ NEE+R + G + ++L S + E+P+ VDWR++G V PVKDQ
Sbjct: 81 GDMPNEEFRQLMNGYKYKQTHKKLQGSHFLEPNFL-----EVPKHVDWRDEGYVTPVKDQ 135
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQ 218
G CGSCWAFST A+EG + TG+L+SLSEQ LV+C + + N GCNGGLMD AFQ++
Sbjct: 136 GQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKD 195
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVA 275
NGG+DSE YPY+G ++ P N + + + G+ D+ E +L KA+A PVSVA
Sbjct: 196 NGGIDSEDSYPYVGTDDT--PCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVA 253
Query: 276 IEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
I+AG +FQ Y+SG+ F EC S LDHGV+ VGYG E +G YW+V+NSW WG+
Sbjct: 254 IDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQ 313
Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NGY+ + ++ CGIA ASYP++
Sbjct: 314 NGYILMAKD----KDNHCGIATAASYPLE 338
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 190/318 (59%), Gaps = 16/318 (5%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
+ T ++ + + H KT RF+IF +N FI +HN +YK+G+N+FADL
Sbjct: 23 LRTEWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADL 82
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
E+ M G + +RL A LP++VDWR+KGAV PVKDQG CG
Sbjct: 83 LPHEFVKMMNGYQG---KRLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCG 139
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
SCWAFS+ ++EG + + TG+L+SLSEQ LVDC N GCNGGLMD +F +I NGG+
Sbjct: 140 SCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGI 199
Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGR 281
D+E YPY + C + + G+ D+ E L+KAVA PVSVAI+A +
Sbjct: 200 DTEDSYPYEAEDGDCRYKKEDVGATDT-GFVDIKEGSEKDLQKAVATVGPVSVAIDASQQ 258
Query: 282 AFQHYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
+FQ Y GV+ C S +LDHGV+AVGYG +NG YWLV+NSW WG++GY+ + R
Sbjct: 259 SFQLYSEGVYDEPNCSSESLDHGVLAVGYGVKNGKKYWLVKNSWAETWGQDGYILMSR-- 316
Query: 340 LDTNTGKCGIAMEASYPV 357
D N +CGIA ASYP+
Sbjct: 317 -DKNN-QCGIASSASYPL 332
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 194/329 (58%), Gaps = 26/329 (7%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLN 98
R D ++ + W H K+ + +R +++ NL+ I+ EH +Y++G+N
Sbjct: 21 RFDSQLEDHWHLWKNWHSKSYHESEEGWRRM-VWEKNLKKIEMHNLEHTMGKHSYRLGMN 79
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
F D+TNEE+R G + +R+ S Y + P++VDWREKG V PVKD
Sbjct: 80 HFGDMTNEEFRQTMNGYKQTTERKFKGSLFMEPNYL-----QAPKAVDWREKGYVTPVKD 134
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
QGSCGSCWAFST A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 135 QGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQ 194
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSV 274
N G+D+E+ YPY+G + DP + + G+ D+ E ++ KAVA PVSV
Sbjct: 195 DNAGLDTEESYPYVGTDE--DPCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVSV 252
Query: 275 AIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
AI+AG +FQ YE G+ + EC S LDHGV+ VGYG E +G YW+V+NSW WG
Sbjct: 253 AIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWG 312
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ GY+ + ++ CGIA +SYP+
Sbjct: 313 DKGYIYMAKD----RKNHCGIATASSYPL 337
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 149/372 (40%), Positives = 215/372 (57%), Gaps = 34/372 (9%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MATAS LA+ L ++ + D I ++ ++ W A++
Sbjct: 29 MATASASLALMFACSLLLAGTAFSDDTIAIP----------------LLERFKAWQAEYN 72
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSD 118
+T ++RF I+ +N+RFI N L+ +Y++G N+F DLT EE++ YL + D
Sbjct: 73 RTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYL-MKLD 131
Query: 119 AKRRLMKSKVASQRYACKAG-------DELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
+ ++ + AG E P SVDWR KGAV VKDQ CGSCWAF+TV
Sbjct: 132 EQPPAAEAMPPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATV 191
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
A++EG+++I TG L+SLSEQE+VDCDR N GC GG A +++ +NGG+ +E DYPY
Sbjct: 192 ASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPY 251
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+G++ +C + I GY+ V +E L++AVA QPV+V ++A RAFQ Y+SGV
Sbjct: 252 VGSQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAGQPVAVFVDA-SRAFQFYKSGV 310
Query: 291 FTGEC-GSALDHGVVAVGYGT----ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
F+G C + ++H V VGYG+ G YW+V+NSWG WGENGYV++ R + G
Sbjct: 311 FSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRARE-G 369
Query: 346 KCGIAMEASYPV 357
C IA+E YPV
Sbjct: 370 MCAIAIEPYYPV 381
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 189/321 (58%), Gaps = 23/321 (7%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
+ T ++ + H K+ RF+IF +N I +HN+ +YK+G+N+F DL
Sbjct: 23 LRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDL 82
Query: 104 TNEEYRAMYLGTRSDAKRR---LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
E+ ++ G R R M + LP +VDWR+KGAV PVKDQG
Sbjct: 83 LAHEFAKIFNGYRGQRTSRGSTFMPPANVNDS-------SLPSTVDWRKKGAVTPVKDQG 135
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
CGSCWAFS ++EG + + GEL+SLSEQ LVDC + N GC GGLMD AF++I N
Sbjct: 136 QCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKAN 195
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
G+D+E+ YPY ++KC + + G+ D+ E LKKAVA P+SVAI+A
Sbjct: 196 DGIDAEESYPYEAMDDKCRFKKEDVGATDT-GFVDIEGGSEDDLKKAVATVGPISVAIDA 254
Query: 279 GGRAFQHYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
G +FQ Y GV+ EC S LDHGV+AVGYG ++G YWLV+NSWG WG+NGY+ +
Sbjct: 255 GHSSFQLYSEGVYDEPECSSEELDHGVLAVGYGVKDGKKYWLVKNSWGGSWGDNGYILMS 314
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
R D N +CGIA ASYP+
Sbjct: 315 R---DKNN-QCGIASAASYPL 331
>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
Length = 344
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 144/331 (43%), Positives = 193/331 (58%), Gaps = 42/331 (12%)
Query: 52 YQTWLAKHGK--TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYR 109
+ W+ H K TS G R+ IFK N+ ++ + NS +GLN FAD+TNEEYR
Sbjct: 30 FTDWMITHQKSYTSEEFG---ARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYR 86
Query: 110 AMYLGTRSDAKRRL--MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
YLGT+ DA + + KV + A S DWR +GAV PVK+QG CG CW+
Sbjct: 87 NTYLGTKFDASSLIGTQEEKVFTTSSAA--------SKDWRSEGAVTPVKNQGQCGGCWS 138
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
FST + EG + GEL+SLSEQ L+DC + N+GC+GGLM YAF++II N G+D+E
Sbjct: 139 FSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESS 197
Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
YPY KC+ N+ ++ Y+ V+ E SL+ AV PVSVAI+A ++FQ Y
Sbjct: 198 YPYKAENGKCEYKSENSG-ATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYT 256
Query: 288 SGV-FTGECGSA-LDHGVVAVGYGTENGV-------------------DYWLVRNSWGSD 326
SG+ + EC S LDHGV+AVGYG+ +G +YW+V+NSWG+
Sbjct: 257 SGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTS 316
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG GY+ + RN D N CGIA AS+PV
Sbjct: 317 WGIEGYILMSRN-RDNN---CGIASSASFPV 343
>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
Length = 334
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/331 (43%), Positives = 197/331 (59%), Gaps = 25/331 (7%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
S++ + D + + W A H + GM R +++ N++ ID HN +
Sbjct: 16 SAAPKLDQSLTEQWYQWKATHRRLY-GMNEEGWRRAVWEKNMKMIDLHNREYSQGQHGFT 74
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ +N F D+TNEE+R + G R+ R K KV + E+P+SVDW KG V
Sbjct: 75 MAMNAFGDMTNEEFRQVMNGFRNQKPR---KGKVFQEPLFA----EIPKSVDWTLKGYVT 127
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
PVK+QG CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGNEGCNGGLMDNAF 187
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
Q++ +NGG+DSE+ YPYLG + + + G+ D+ P E +L KAVA P+
Sbjct: 188 QYVKENGGLDSEESYPYLGTDTDSCKYKPECSAANDTGFVDI-PQREKALMKAVATVGPI 246
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
SVAI+AG ++FQ Y+SG+ + +C S LDHGV+ VGYG E N +W+V+NSWG +
Sbjct: 247 SVAIDAGHQSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPE 306
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG NGYVK+ + D N CGIA ASYP
Sbjct: 307 WGTNGYVKMAK---DQNN-HCGIATAASYPT 333
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/298 (46%), Positives = 189/298 (63%), Gaps = 23/298 (7%)
Query: 72 RFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R +++ NLR I+ HN SL + +Y++G+N+F D+TNEE+R + G ++ ++++K
Sbjct: 48 RRVLWEKNLRTIEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNGYKN---QKMIKGS 104
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ E P++VDWREKG V PVKDQG CGSCWAFST A+EG + G+LIS
Sbjct: 105 T----FLAPNNFEAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLIS 160
Query: 188 LSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ LVDC R + N GCNGGLMD AFQ++ NGG+DSE YPY +++ N
Sbjct: 161 LSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNS 220
Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGV 303
+ G+ DV E L KAVA PVSVA++AG ++FQ Y+SG+ + EC S LDHGV
Sbjct: 221 ANDTGFVDVPSGSEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPECSSEDLDHGV 280
Query: 304 VAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ VGYG E +G YW+V+NSW WG NGY+K+ ++ CGIA ASYP+
Sbjct: 281 LVVGYGFEGEDVDGKRYWIVKNSWSEKWGNNGYIKIAKD----RHNHCGIATAASYPL 334
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/310 (45%), Positives = 186/310 (60%), Gaps = 28/310 (9%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
++GK + ++RF++F DNL+ I HN +YK+G+N+F DLT +E+R LG
Sbjct: 67 RYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQ 126
Query: 118 D----AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
+ K L + V LPE+ DWRE G V+PVK+QG CGSCW FST A
Sbjct: 127 NCSATTKGNLKVTNVV-----------LPETKDWREAGIVSPVKNQGKCGSCWTFSTTGA 175
Query: 174 VEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
+E G+ ISLSEQ+LVDC N GCNGGL AF++I NGG+D+E+ YPY G
Sbjct: 176 LEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235
Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVF 291
C S N V ID +++ E LK AVA +PVS+A E + F+ Y+SGV+
Sbjct: 236 KNGLCKFSSENVGVKVIDSV-NITLGAEDELKYAVALVRPVSIAFEV-IKGFKQYKSGVY 293
Query: 292 TG-ECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
T ECG+ ++H V+AVGYG ENGV YWL++NSWG+DWG+NGY K++ C
Sbjct: 294 TSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKME-----MGKNMC 348
Query: 348 GIAMEASYPV 357
GIA ASYPV
Sbjct: 349 GIATCASYPV 358
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/296 (47%), Positives = 180/296 (60%), Gaps = 18/296 (6%)
Query: 72 RFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R +I+ DN R I EHN LN TYK+G+NK+ D+ + E+ G +
Sbjct: 49 RMKIYMDNKRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVNTLNGFNKSVTAGIETEG 108
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
V + A +LP+ VDW ++GAV VKDQG CGSCWAFS+ A+EG + TG L+S
Sbjct: 109 VT---FISPANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALEGQHFRSTGYLVS 165
Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ L+DC K N GCNGGLMDYAFQ+I N G+D+E+ YPY ++C + RN+
Sbjct: 166 LSEQNLIDCSGKYGNNGCNGGLMDYAFQYIKDNKGLDTEKTYPYEAENDRCRYNPRNSGA 225
Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGSA--LDHGV 303
GY D+ DE LK AVA P+SVAI+A +FQ Y GV+ SA LDHGV
Sbjct: 226 TD-KGYVDIPQGDEEKLKAAVATIGPISVAIDASHESFQLYSEGVYYDPDCSAENLDHGV 284
Query: 304 VAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ VGYGT+ +G DYWLV+NSWG WG+ GY+K+ RN CGIA ASYP+
Sbjct: 285 LIVGYGTDETSGHDYWLVKNSWGKTWGQKGYIKMARN----KNNHCGIASSASYPL 336
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 150/365 (41%), Positives = 211/365 (57%), Gaps = 45/365 (12%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
MF + TL +IS+ AA I D DH +SW++ +HGK+ +
Sbjct: 2 MFALLVTL----YISAVFAAPSIDIQLD---DHWNSWKS-------------QHGKSYHE 41
Query: 66 MGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
+R I+++NLR I++HN SL N T+K+G+N+F D+TNEE+R G + D R
Sbjct: 42 DVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDPNR 100
Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
++ P+ VDWR++G V PVKDQ CGSCW+FS+ A+EG
Sbjct: 101 TSQGPLFMEPKFFAA-----PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRK 155
Query: 182 TGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG+LIS+SEQ LVDC R N GCNGGLMD AFQ++ +N G+DSEQ YPYL ++ P
Sbjct: 156 TGKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDL--PC 213
Query: 241 RRNAK--VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGE-CG 296
R + + V I G+ D+ +E++L AVA PVSVAI+A ++ Q Y+SG++ C
Sbjct: 214 RYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACT 273
Query: 297 SALDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
S LDH V+ VGYG + G YW+V+NSW WG+ GY+ + ++ CGIA
Sbjct: 274 SQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKD----KNNHCGIATM 329
Query: 353 ASYPV 357
ASYP+
Sbjct: 330 ASYPL 334
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 187/314 (59%), Gaps = 25/314 (7%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEEYRA 110
W H KT R +I++ NLR I HN SL TY +G+N D+T EE
Sbjct: 29 WKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDMTREEILQ 88
Query: 111 MYLGTR--SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
M+ GTR + RR S + AG +P+SVDWREKG V VK+QGSCGSCWAF
Sbjct: 89 MFAGTRVRPNLTRR-------SSPFVASAGISVPDSVDWREKGYVTEVKNQGSCGSCWAF 141
Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQD 227
S A+EG K TG++ SLS Q LVDC K N GCNGG M AFQ++I +GG+DS++
Sbjct: 142 SAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQAFQYVIDDGGIDSDEA 201
Query: 228 YPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY + +C D S+R A S Y VS DE +LK+AVA P+SVAI+A F
Sbjct: 202 YPYTAMDGQCRYDQSQRAANCSS---YNYVSEGDEEALKQAVATIGPISVAIDATRPMFI 258
Query: 285 HYESGVFTGE-CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y SGV++ C ++HGV+ VGYG+ NG DYWLV+NSWG+ +G+ GY+++ RN
Sbjct: 259 LYHSGVYSDPTCTQNVNHGVLVVGYGSLNGEDYWLVKNSWGTRFGDGGYIRIARN----K 314
Query: 344 TGKCGIAMEASYPV 357
CGIA A YP+
Sbjct: 315 GNMCGIANYACYPL 328
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 144/330 (43%), Positives = 199/330 (60%), Gaps = 37/330 (11%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEK----RFQIFKDNLRFID----EHNSLNRTYKVGL 97
DE ++++W K ++EK R +++ NL+ I+ EH+ TY++G+
Sbjct: 25 DEHWDLWKSWHTKK--------YHEKEEGWRRMVWEKNLKKIELHNLEHSMGEHTYRLGM 76
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
N F D+T+EE+R + G + ++R+ S + E P SVDWR+ G V PVK
Sbjct: 77 NHFGDMTHEEFRQIMNGYKRKSERKFKGSLFMEPNFL-----EAPRSVDWRDNGYVTPVK 131
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFI 216
DQG CGSCWAFST A+EG + TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191
Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVS 273
N G+DSE YPYLG +++ P + K S + G+ D+ E +L KAVA PVS
Sbjct: 192 KDNQGLDSEDSYPYLGTDDQ--PCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVS 249
Query: 274 VAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDW 327
VAI+AG +FQ Y+SG+ + EC S LDHGV+ VGYG E +G YW+V+NSW W
Sbjct: 250 VAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKW 309
Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
G+ GY+ + ++ CGIA ASYP+
Sbjct: 310 GDKGYIYMAKD----RKNHCGIATAASYPL 335
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 196/319 (61%), Gaps = 24/319 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W K G+T + +R Q + +N + + HN L ++Y++G+ FAD+ NEE
Sbjct: 26 FHAWRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENEE 85
Query: 108 YRAMY----LGT-RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
Y+ + LG+ + RR S + +LP +VDWR+KG V VKDQ C
Sbjct: 86 YKRLISQGCLGSFNASLPRR------GSTFFRLPENKDLPAAVDWRDKGYVTDVKDQKQC 139
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
GSCWAFS ++EG TG+L+SLSEQ+LVDC N GC GGLMD AF++I GG
Sbjct: 140 GSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGG 199
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
+D+E+ YPY + +C + +A + GY DVS DE +L++AVA P+SV I+A
Sbjct: 200 IDTEESYPYEAEDGECR-YKPDAVGATCTGYVDVSSGDEDALQEAVATIGPISVGIDASH 258
Query: 281 RAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+FQ YESG++ +C S+ LDHGV+AVGYG+ENG DYWLV+NSWG WG+ GY+K+ +N
Sbjct: 259 ISFQLYESGLYDEPQCSSSELDHGVLAVGYGSENGQDYWLVKNSWGLTWGDQGYIKMSKN 318
Query: 339 LLDTNTGKCGIAMEASYPV 357
+ +CGIA ASYP+
Sbjct: 319 ----KSNQCGIATAASYPL 333
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 187/320 (58%), Gaps = 23/320 (7%)
Query: 50 TIYQTWL---AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFAD 102
++ Q W A+HG+ + R +F+ N +FID+HN+ T+ + +N+F D
Sbjct: 19 SLRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGD 78
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
+T+EE+ A G + RR A + LP+ VDWR KGAV PVKDQ C
Sbjct: 79 MTSEEFTATMNGFLNVPSRRPTAILRAD------PDETLPKEVDWRTKGAVTPVKDQKQC 132
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
GSCWAFST ++EG + + G+L+SLSEQ LVDC K N GC GGLMD AF++I N G
Sbjct: 133 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKG 192
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
+D+E YPY + KC N GY DV E +LKKAVA P+SVAI+A
Sbjct: 193 IDTEDSYPYEAQDGKCRFDASNVGATDT-GYVDVEHGSESALKKAVATIGPISVAIDASQ 251
Query: 281 RAFQHYESGVFTGE-CGSA-LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
+FQ Y GV+ E C S LDHGV+AVGYG TE G YWLV+NSW + WG GY+++ R
Sbjct: 252 PSFQFYHDGVYYEEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMSR 311
Query: 338 NLLDTNTGKCGIAMEASYPV 357
+ CGIA +ASYP+
Sbjct: 312 D----KKNNCGIASQASYPL 327
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 190/314 (60%), Gaps = 24/314 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
+Q++ KHGKT KRF IF++NLR I+ HN+ + +Y G+NKFAD+T E
Sbjct: 26 FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
++AM L T+ K S VA++ + G +PES+DWR + V P+KDQ CGSCWA
Sbjct: 86 FKAM-LATQVKTK----PSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWA 140
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
F+ V + EG + TG+L SEQ+LVDC +N GC+GG +D F + IQ G++ E D
Sbjct: 141 FAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPY-IQTNGLELESD 199
Query: 228 YPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
YPY G + C S ++KVV+ + Y V P +E +L +AV PV++AI A Q
Sbjct: 200 YPYTGYDGYC--SYESSKVVTKVSSYVSV-PANEQALLEAVGTAGPVAIAINADD--LQF 254
Query: 286 YESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y SG+ + C LDHGV+AVGY +ENG DYWL++NSWG+DWGE+GY + R
Sbjct: 255 YFSGIIDDKYCDPEYLDHGVLAVGYDSENGRDYWLIKNSWGADWGESGYFRFLR-----G 309
Query: 344 TGKCGIAMEASYPV 357
CG+ +A YP+
Sbjct: 310 QNICGVKEDAVYPL 323
>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
Length = 363
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 192/321 (59%), Gaps = 20/321 (6%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
RT D + + + ++GK+ +KRF+IF ++L+ + N +Y++G+N+F+D
Sbjct: 55 RTRDALR--FARFAVRYGKSYESAAEVQKRFRIFSESLQLVRSTNRKGLSYRLGINRFSD 112
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
++ EE+RA LG + L A A LP++ DWRE G V+PVK+QG C
Sbjct: 113 MSWEEFRATRLGAAQNCSATL-----AGNHRMRAAAVALPKTKDWREDGIVSPVKNQGHC 167
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGG 221
GSCW FST A+E TG+ ISLSEQ+LVDC + N GCNGGL AF++I NGG
Sbjct: 168 GSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPFNNFGCNGGLPSQAFEYIKYNGG 227
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGG 280
+D+E+ YPY G CD N V +D +++ E LK AVA +PVSVA +
Sbjct: 228 LDTEESYPYKGVNGICDFKAENVGVKVLDSV-NITLGAEDELKDAVALVRPVSVAFQV-V 285
Query: 281 RAFQHYESGVFTGE-CGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
F+ Y+SGV+T + CG+ ++H V+AVGYG ENGV YWL++NSWG+DWG+ GY K++
Sbjct: 286 NGFRQYKSGVYTSDSCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDKGYFKME 345
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
CG+A ASYP+
Sbjct: 346 -----MGKNMCGVATCASYPI 361
>gi|23452059|gb|AAN32912.1| cathepsin [Danio rerio]
Length = 310
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 137/299 (45%), Positives = 186/299 (62%), Gaps = 23/299 (7%)
Query: 72 RFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R +K NL+ I+ HN + TY++G+N F D+T+EE+R + G + RR S
Sbjct: 21 RRIFWKKNLKXIEMHNLXHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHKKDRRFRGSL 80
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ E+P +DWREKG V PVKDQG CGSCWAFST A+EG TG+L+S
Sbjct: 81 FMEPXFI-----EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVS 135
Query: 188 LSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK-CDPSRRNAK 245
LSEQ LVDC R + N GCNGGLMD AFQ++ G+DSE+ YPYLG +++ C +N+
Sbjct: 136 LSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNS- 194
Query: 246 VVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHG 302
+ G+ D+ E +L KA+A PVSVAI+AG +FQ Y+SG+ + EC S LDHG
Sbjct: 195 AANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHG 254
Query: 303 VVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V+AVGYG E +G YW+V+NSW +WG+ GY+ + ++ CGIA ASYP+
Sbjct: 255 VLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKGYIYMAKD----RHNHCGIATAASYPL 309
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/318 (43%), Positives = 199/318 (62%), Gaps = 19/318 (5%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNE 106
+M + +W A + ++ ++RFQ+++ N+ I+ N N TY +G N+FADLT E
Sbjct: 45 MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEE 104
Query: 107 EYRAMY----LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-S 161
E+ +Y + R DA ++ ++ V+S A A P SVDWR KGAV P+K+QG S
Sbjct: 105 EFLDLYTMKGMPVRRDAGKK--RANVSSSAAAVDA----PTSVDWRSKGAVTPIKNQGPS 158
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
C SCWAF T A +E I KI TG+L+SLSEQEL+DCD + GCN G ++++IQNGG
Sbjct: 159 CSSCWAFVTAATIESITKITTGKLVSLSEQELIDCD-PYDGGCNLGYFVNGYRWVIQNGG 217
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+ +E +YPY C SR +I Y + P E L++AVA QPV+ AIE GG
Sbjct: 218 LTTEANYPYQARRYACSRSRAAQHAATISDYVQL-PAGEGQLQQAVAQQPVAAAIEMGG- 275
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNL 339
+ Q Y GVF+G+CG+ ++H + VGYG + +G+ YWLV+NSWG WGE GY++++R++
Sbjct: 276 SLQFYSGGVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDV 335
Query: 340 LDTNTGKCGIAMEASYPV 357
G CGIA++ +YPV
Sbjct: 336 --GRGGLCGIALDLAYPV 351
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 192/315 (60%), Gaps = 26/315 (8%)
Query: 55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRA 110
W AKH K GM R +++ N++ I+ HN + + +N F D+TNEE+R
Sbjct: 32 WKAKHRKLY-GMREEGWRRAVWEKNMKMIEVHNQEYSQGKHGFTMAMNAFGDMTNEEFRQ 90
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ G R+ + K KV + + E+P+SVDWREKG V PVK+QG CGSCWAFS
Sbjct: 91 VMNGFRNQKHK---KGKVFQE----PSFLEVPKSVDWREKGYVTPVKNQGQCGSCWAFSA 143
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
A+EG TG+LISLSEQ LVDC R + N GC+GGLMDYAFQ+I +NGG+DSE+ YP
Sbjct: 144 TGALEGQMFRKTGKLISLSEQNLVDCSRPQGNEGCDGGLMDYAFQYIKENGGLDSEESYP 203
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
Y + C R V + G+ D+ P +E +L KAVA P+SVAI+AG +FQ Y+
Sbjct: 204 YDAMDESCK-YRPEYSVANDTGFVDI-PKEEKALMKAVATVGPISVAIDAGHESFQFYKE 261
Query: 289 GV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
GV F EC S +DHGV+ VGYG E + +WLV+NSWG +WG GY+K+ ++
Sbjct: 262 GVYFEPECSSDNVDHGVLVVGYGYEETESDNNKFWLVKNSWGEEWGLGGYIKMTKD---- 317
Query: 343 NTGKCGIAMEASYPV 357
CGIA ASYP
Sbjct: 318 QKNHCGIATAASYPT 332
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 191/314 (60%), Gaps = 24/314 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
+Q++ KHGKT KRF IF++NLR I+ HN+ + +Y G+NKFAD+T E
Sbjct: 26 FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
++AM L T+ K S VA++ + G +PES+DWR + V P+KDQ CGSCW+
Sbjct: 86 FKAM-LATQVKTK----PSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWS 140
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
F+ V + EG + TG+L SEQ+LVDC +N GC+GG +D F + IQ G++ E D
Sbjct: 141 FAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPY-IQTNGLELESD 199
Query: 228 YPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
YPY G + C S ++KVV+ + Y V P +E +L +AV PV++AI A Q
Sbjct: 200 YPYTGYDGSC--SYDSSKVVTKVSSYVSV-PANEQALLEAVGTAGPVAIAINADD--LQF 254
Query: 286 YESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
Y SG+ + C LDHGV+AVGY +ENG+DYWL++NSWG+DWGE+GY + R
Sbjct: 255 YFSGIIDDKYCDPEWLDHGVLAVGYNSENGLDYWLIKNSWGADWGESGYFRFLR-----G 309
Query: 344 TGKCGIAMEASYPV 357
CG+ +A YP+
Sbjct: 310 QNICGVKEDAVYPL 323
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 149/372 (40%), Positives = 216/372 (58%), Gaps = 34/372 (9%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MATAS LA+ L ++ + D I ++ ++ W A++
Sbjct: 3 MATASASLALMFACSLLLAGTAFSDDTIAIP----------------LLERFKAWQAEYN 46
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSD 118
+T ++RF I+ +N+RFI N L+ +Y++G N+F DLT EE++ YL + D
Sbjct: 47 RTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYL-MKLD 105
Query: 119 AKRRLMKSKVASQRYACKAG-------DELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
+ ++ + AG E P SVDWR KGAV VKDQ CGSCWAF+TV
Sbjct: 106 EQPPAAEAMGPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATV 165
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
A++EG+++I TG L+SLSEQE+VDCDR N GC GG A +++ +NGG+ +E DYPY
Sbjct: 166 ASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPY 225
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+G++ +C + I GY+ V +E L++AVA++PV+V I+A RAFQ Y+SGV
Sbjct: 226 VGSQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAERPVAVFIDA-SRAFQFYKSGV 284
Query: 291 FTGEC-GSALDHGVVAVGYGT----ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
F+G C + ++H V VGYG+ G YW+V+NSWG WGENGYV++ R + G
Sbjct: 285 FSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRARE-G 343
Query: 346 KCGIAMEASYPV 357
C IA+E YPV
Sbjct: 344 MCAIAIEPYYPV 355
>gi|5901663|gb|AAD55363.1| cysteine protease [Hordeum vulgare subsp. vulgare]
Length = 163
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 117/163 (71%), Positives = 137/163 (84%), Gaps = 1/163 (0%)
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
GSCWAFS V+ VE IN++VTGE+I+LSEQELV+C N+GCNGGLMD AF FII+NGG
Sbjct: 1 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 60
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+D+E+DYPY + KCD +R NAKVVSIDG+EDV DE SL+KAVA QPVSVAIEAGGR
Sbjct: 61 IDTEEDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGR 120
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWG 324
FQ Y SGVF+G CG++LDHGVVAVGYGT+NG DYW+VRNSWG
Sbjct: 121 EFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWG 163
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 130/325 (40%), Positives = 186/325 (57%), Gaps = 23/325 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
D +M ++ W H ++ +RF +++ N FID N + TY++ N+FADL
Sbjct: 44 DMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADL 103
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD---------ELPESVDWREKGAVN 154
T EE+ A Y G + V AGD ++P SVDWR +GAV
Sbjct: 104 TEEEFLATYTGYYAG------DGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVV 157
Query: 155 PVKDQGS-CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
P K Q S C SCWAF T A +E +N I TG+L+SLSEQ+LVDCD + GCN G A+
Sbjct: 158 PPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAY 216
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
+++++NGG+ +E DYPY C+ ++ I G+ V P +E +L+ AVA QPV+
Sbjct: 217 KWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVA 276
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENG 331
VAIE G Q Y+ GV+TG CG+ L H V VGYGT+ +G YW ++NSWG WGE G
Sbjct: 277 VAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERG 335
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYP 356
Y+++ R++ G CG+ ++ +YP
Sbjct: 336 YIRILRDV--GGPGLCGVTLDIAYP 358
>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
Length = 210
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 120/209 (57%), Positives = 148/209 (70%), Gaps = 1/209 (0%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
+HGK + RF+IFK+NL+ IDE N + Y +GLN+F+DL+++E++ MYLG +
Sbjct: 3 QHGKIYESIEEKLHRFEIFKENLKHIDERNKIVSNYWLGLNEFSDLSHDEFKKMYLGLKV 62
Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
D L K + Q + + +LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGI
Sbjct: 63 DHDL-LNNKKQSQQDFEYRDFVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGI 121
Query: 178 NKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
N+I TG L SLSEQEL+DCD N GCNGGLMDYAFQFII NGG+ E DYPYL E C
Sbjct: 122 NQIKTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFQFIISNGGLHKEDDYPYLMEEGTC 181
Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKA 266
D R ++VV+IDGY DV DE SL KA
Sbjct: 182 DEKRDESEVVTIDGYRDVPANDEQSLLKA 210
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/323 (42%), Positives = 197/323 (60%), Gaps = 21/323 (6%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
V + ++ +H K R +IF DN + +HN L YK+ +NK+ DL
Sbjct: 23 VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDL 82
Query: 104 TNEEYRAMYLG---TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
+ E+ + G T++ KR ++ + + A ++P++VDWR++GAV PVKDQG
Sbjct: 83 LHHEFVGLLNGFNRTKTYLKRGELQDSIT---FIEPAHVDIPDTVDWRQEGAVTPVKDQG 139
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
CGSCW+FS A+EG + T +L+SLSEQ LVDC + N GCNGGLMD AF++I N
Sbjct: 140 HCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYIKNN 199
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
GG+D+E YPY+G + K S +N + + G+ D+ DE LK AVA P+S+AI+A
Sbjct: 200 GGIDTEAAYPYMGEDEKFRYSAKN-RGATDKGFVDIPSGDEDKLKAAVATVGPISIAIDA 258
Query: 279 GGRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVK 334
+FQ Y +GV++ C S LDHGV+ VGYGT+ G+DYWLV+NSWG WG +GY+K
Sbjct: 259 SHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGLDGYIK 318
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
+ RN +CG+A +ASYP+
Sbjct: 319 MARN----QDNQCGVATQASYPL 337
>gi|355681660|gb|AER96816.1| cathepsin L2 [Mustela putorius furo]
Length = 334
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 146/331 (44%), Positives = 198/331 (59%), Gaps = 25/331 (7%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
S++ D + + + W A H + GM R +++ N++ I+ HN +
Sbjct: 16 SAAPELDQSLDSQWYQWKATHRRLY-GMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFT 74
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ +N F D+TNEE+R + G R+ R K KV + E+P+SVDW +KG V
Sbjct: 75 MAMNAFGDMTNEEFRQVMNGFRNQKHR---KGKVFQEPLFA----EIPKSVDWTQKGYVT 127
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
PVK+QG CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD+AF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGNQGCNGGLMDFAF 187
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
Q+I NGG+DSE+ YPYL + + V + G+ D+ P E +L KAVA P+
Sbjct: 188 QYIKDNGGLDSEESYPYLARDTDSCNYKPEYSVANDTGFVDI-PQRERALMKAVATVGPI 246
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
SVAI+AG ++FQ Y+SG+ F +C S LDHGV+ VGYG E N +W+V+NSWG +
Sbjct: 247 SVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPE 306
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG NGYVK+ + D N CGIA ASYP
Sbjct: 307 WGCNGYVKMAK---DQNN-HCGIATAASYPT 333
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/296 (46%), Positives = 182/296 (61%), Gaps = 17/296 (5%)
Query: 72 RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R +IF DN R I EHN YK+G+NK+ D+ + E G + + +
Sbjct: 83 RMKIFLDNKRKIVEHNRKYEMKEVNYKLGMNKYGDMLHHELINTLNGFNKSV--TVSEEQ 140
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ + A ELP+SVDWR+KGAV +KDQG CGSCWAFS+ A+EG + +G L+S
Sbjct: 141 LIGATFIEPANVELPKSVDWRKKGAVTAIKDQGQCGSCWAFSSTGALEGQHFRQSGVLVS 200
Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ L+DC K N GCNGGLMDYAF++I +N G+D+E+ YPY ++C + +N+
Sbjct: 201 LSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYEAENDQCRYNPKNSGA 260
Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGV 303
+ G+ D+ DE LK AVA P+SVAI+A +F Y GV + EC A LDHGV
Sbjct: 261 SDV-GFVDIPEGDEDKLKAAVATIGPISVAIDASHESFHFYSEGVYYEPECSPANLDHGV 319
Query: 304 VAVGYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ VGYGT++G DYWLV+NSWG WGE GY+K+ RN CGIA ASYP+
Sbjct: 320 LIVGYGTDSGTGEDYWLVKNSWGETWGEKGYIKMARN----KENHCGIASSASYPL 371
>gi|294883322|ref|XP_002770704.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873993|gb|EER02713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 333
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/298 (46%), Positives = 188/298 (63%), Gaps = 16/298 (5%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
K GK KR IF+ NL I++ N+ + +YK+G+N+ ADLT+EE+ A+ LGT
Sbjct: 34 KFGKNYESKEEEVKRNAIFQANLHHIEQVNAKDLSYKLGVNEHADLTHEEFAALKLGTLK 93
Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
+ RR K + + +LP SVDWR K + PVKDQGSCGSCWAFST A+E
Sbjct: 94 MSTRRDDKFVIEADT------TQLPTSVDWRNKNVLTPVKDQGSCGSCWAFSTTGALEAQ 147
Query: 178 NKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
I TG+L+SLSEQ+LVDC N GC GGLMD A+++ I++ G+D E Y Y G ++
Sbjct: 148 YAIATGKLLSLSEQQLVDCSSGYGNNGCEGGLMDDAYEY-IKSAGLDQESTYSYNGTDDV 206
Query: 237 CDPS--RRNAKVVS--IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF- 291
C S +R+ + + + G+ + E SL KA+AD PVSVA+ A F+ Y+SGV+
Sbjct: 207 CQGSLAKRSDGIPAGEVTGFHMLDK-TEQSLMKALADAPVSVAMYAADPDFRFYKSGVYS 265
Query: 292 TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
+ C LDHGVVAVGYGTENG DY+++RNSWGS WG+ GY L+R + + G+C I
Sbjct: 266 SATCNGKLDHGVVAVGYGTENGSDYFIIRNSWGSSWGQAGYFYLKRGV--SGYGECNI 321
>gi|348542774|ref|XP_003458859.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 330
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 194/316 (61%), Gaps = 22/316 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W K K+ + +R QI+ +N + + +HN+L +++++G+ FAD+ NEE
Sbjct: 26 FHAWKLKFEKSYDSPSEETQRKQIWLNNRKLVLKHNALADLGLKSFRLGMTYFADMENEE 85
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
Y+ LG L + +R G LP++VDWR++G V VKDQ CGSCWA
Sbjct: 86 YKK--LGCLGSFNASLPRHGSTFRRLP--KGTVLPDTVDWRKQGYVTHVKDQKECGSCWA 141
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS A+EG TG+L+SLSEQ+LVDC RK N GC GG +AFQ+I NGG+D+E+
Sbjct: 142 FSATGALEGQYFKKTGKLVSLSEQQLVDCSRKFRNNGCEGGEPHWAFQYIRYNGGLDTEE 201
Query: 227 DYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAF 283
Y Y + +C +P AK GY +VSPF++ +LK+AVA P+SVAI+ +F
Sbjct: 202 SYHYEAKDGQCHYNPDSVGAKC---SGYVNVSPFED-ALKEAVATIGPISVAIDISRVSF 257
Query: 284 QHYESGVFTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
Q Y SGV+ S L+H V+AVGYGTENG DYWLV+NSWGS+WG GY+K+ RN
Sbjct: 258 QLYHSGVYDEPWCSNINLNHAVLAVGYGTENGHDYWLVKNSWGSEWGNKGYIKMTRN--- 314
Query: 342 TNTGKCGIAMEASYPV 357
+CGIA EASYP+
Sbjct: 315 -KDNQCGIATEASYPL 329
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 130/325 (40%), Positives = 186/325 (57%), Gaps = 23/325 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
D +M ++ W H ++ +RF +++ N FID N + TY++ N+FADL
Sbjct: 44 DMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADL 103
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD---------ELPESVDWREKGAVN 154
T EE+ A Y G + V AGD ++P SVDWR +GAV
Sbjct: 104 TEEEFLATYTGYYAG------DGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVV 157
Query: 155 PVKDQGS-CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
P K Q S C SCWAF T A +E +N I TG+L+SLSEQ+LVDCD + GCN G A+
Sbjct: 158 PPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAY 216
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
+++++NGG+ +E DYPY C+ ++ I G+ V P +E +L+ AVA QPV+
Sbjct: 217 KWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVA 276
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENG 331
VAIE G Q Y+ GV+TG CG+ L H V VGYGT+ +G YW ++NSWG WGE G
Sbjct: 277 VAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERG 335
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYP 356
Y+++ R++ G CG+ ++ +YP
Sbjct: 336 YIRILRDV--GGPGLCGVTLDIAYP 358
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 138/327 (42%), Positives = 199/327 (60%), Gaps = 26/327 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKF 100
D ++ + W + H K + +R +++ NL+ I+ EH+ +Y++G+N F
Sbjct: 22 DPQLDQHWNLWKSWHSKNYHQREEGWRRL-VWEKNLKKIELHNLEHSMGKHSYRLGMNHF 80
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+T+EE++ + G + A+R+ S + E P SVDWREKG V PVKDQG
Sbjct: 81 GDMTHEEFKQIMNGYKHKAERKFKGSLFLEPNFL-----EAPRSVDWREKGYVTPVKDQG 135
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
CGSCWAFST A+EG TG+L+SLS Q LV+C R + N GCNGGLMD AFQ++ N
Sbjct: 136 ECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSRPEGNEGCNGGLMDQAFQYVKDN 195
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
G+DSE YPYLG +++ P + K + + G+ D+ +E +L KAVA PVSVAI
Sbjct: 196 QGLDSEDSYPYLGTDDQ--PCHYDPKFSAANDTGFVDIPSGNERALMKAVASVGPVSVAI 253
Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
+AG +FQ Y+SG+ + EC S LDHGV+AVGYG + +G +W+V+NSW +WG+
Sbjct: 254 DAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFQGEDVDGKKFWIVKNSWSENWGDK 313
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
GY+ + ++ CGIA ASYP+
Sbjct: 314 GYIYMAKD----RKNHCGIATAASYPL 336
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 187/321 (58%), Gaps = 23/321 (7%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFAD 102
E + + W A HGK N RF+IF++N I +HN R TY +G+N F D
Sbjct: 18 EFSSEWLKWKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFGD 77
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
L + E+ G + + +P +W KGAV PVKDQG C
Sbjct: 78 LLHSEFLERSNGFQGGVS--------GGDVFTFDTNAPVPSYANWTAKGAVTPVKDQGKC 129
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
GSCWAFS +VEG + +L+SLSEQ+LVDC + N GC GGLMD AF++ I N G
Sbjct: 130 GSCWAFSATGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKG 189
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
+ +E+ YPY +N C +++ V +I ++DV DE LK AVA+ PVSVAI+A
Sbjct: 190 IANEKSYPYTAKDNDC-KYKKSMSVATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASS 248
Query: 281 RAFQHYESGVFTGE-CGS-ALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQ 336
FQ YESGV+ E C S LDHGV+AVGYGT+ +G+D+WLV+NSW + WG NGY+K+
Sbjct: 249 SKFQFYESGVYYDENCSSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAASWGLNGYIKMA 308
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
RN D N CGIA ASYP+
Sbjct: 309 RN-KDNN---CGIATMASYPI 325
>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
Length = 344
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 143/331 (43%), Positives = 192/331 (58%), Gaps = 42/331 (12%)
Query: 52 YQTWLAKHGK--TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYR 109
+ W+ H K TS G R+ IF N+ ++ + NS +GLN FAD+TNEEYR
Sbjct: 30 FTDWMITHQKSYTSEEFG---ARYNIFTANMDYVQQWNSKGSETVLGLNNFADITNEEYR 86
Query: 110 AMYLGTRSDAKRRL--MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
YLGT+ DA + + KV + A S DWR +GAV PVK+QG CG CW+
Sbjct: 87 NTYLGTKFDASSLIGTQEEKVHTNSSAA--------SKDWRSEGAVTPVKNQGQCGGCWS 138
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
FST + EG + GEL+SLSEQ L+DC + N+GC+GGLM YAF++II N G+D+E
Sbjct: 139 FSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESS 197
Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
YPY KC+ N+ ++ Y+ V+ E SL+ AV PVSVAI+A ++FQ Y
Sbjct: 198 YPYKAENGKCEYKSENSG-ATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYT 256
Query: 288 SGV-FTGECGSA-LDHGVVAVGYGTENGV-------------------DYWLVRNSWGSD 326
SG+ + EC S LDHGV+AVGYG+ +G +YW+V+NSWG+
Sbjct: 257 SGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTS 316
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG GY+ + RN D N CGIA AS+PV
Sbjct: 317 WGIEGYILMSRN-RDNN---CGIASSASFPV 343
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 130/325 (40%), Positives = 186/325 (57%), Gaps = 23/325 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
D +M ++ W H ++ +RF +++ N FID N + TY++ N+FADL
Sbjct: 40 DMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADL 99
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD---------ELPESVDWREKGAVN 154
T EE+ A Y G + V AGD ++P SVDWR +GAV
Sbjct: 100 TEEEFLATYTGYYAG------DGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVV 153
Query: 155 PVKDQGS-CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
P K Q S C SCWAF T A +E +N I TG+L+SLSEQ+LVDCD + GCN G A+
Sbjct: 154 PPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAY 212
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
+++++NGG+ +E DYPY C+ ++ I G+ V P +E +L+ AVA QPV+
Sbjct: 213 KWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVA 272
Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENG 331
VAIE G Q Y+ GV+TG CG+ L H V VGYGT+ +G YW ++NSWG WGE G
Sbjct: 273 VAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERG 331
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYP 356
Y+++ R++ G CG+ ++ +YP
Sbjct: 332 YIRILRDV--GGPGLCGVTLDIAYP 354
>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
Length = 327
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 193/312 (61%), Gaps = 25/312 (8%)
Query: 59 HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLG 114
HGK N + R IF +N + + +HN T+ + +NKF DLTNEE+R + +G
Sbjct: 27 HGKQYNEY-EDTARHAIFLENCKIVKQHNEEAAMGKHTFFMRMNKFGDLTNEEFRMLVIG 85
Query: 115 TRSDAKRRLMKSKVASQR----YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
+ LM+S Q + G ++ ++VDWR+KGAV VK+Q CGSCWAFST
Sbjct: 86 SG------LMQSNRTQQAEGGVFESIPGLKVNDTVDWRQKGAVTKVKNQEQCGSCWAFST 139
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
++EG + + +G L+SLSEQ LVDC RK N GC GGLMD AF++I NGG+D+E+ YP
Sbjct: 140 TGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCKGGLMDQAFKYIKTNGGIDTEECYP 199
Query: 230 YLGA-ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
Y G E KC+ + + ++ + DV DE +LK+A A P+SV I+A +FQ Y+
Sbjct: 200 YKGRDERKCE-YKASCSGATLSSFVDVKTGDEDALKQASATIGPISVGIDASHPSFQLYD 258
Query: 288 SGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV+ C S LDHGV+ VGYGT++ DYWLV+NSWG+DWG GY+ + RN
Sbjct: 259 HGVYHEKRCSSKKLDHGVLVVGYGTQSTKDYWLVKNSWGADWGMEGYIMMSRN----KDN 314
Query: 346 KCGIAMEASYPV 357
+CGIA +ASYPV
Sbjct: 315 QCGIATQASYPV 326
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 200/341 (58%), Gaps = 33/341 (9%)
Query: 29 IISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS 88
+I D N SS W+ MT Y+ + +E+RF+IF +N I +HN
Sbjct: 53 VIGVDWNFTLSSIWK---HFMTTYK-------RNYIDPSEHERRFKIFANNFVRISKHNV 102
Query: 89 L----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES 144
+Y +G+N+F+D T+EE + R R + + +Y A P
Sbjct: 103 RFIQGQVSYTMGINEFSDKTDEELK------RLRCFRGSLNASRDGSKYITIAAPP-PSE 155
Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAG 203
+DWR KGAV PVK+QG+CGSCWAFS A+EG N + TG L+SLSEQ+LVDC + N
Sbjct: 156 IDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNA 215
Query: 204 CNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN-KCDPS-RRNAK--VVSIDGYEDVSPFD 259
CNGGLMD AF+++ + G+D+E YPY+ E +P+ R N K VV + GY D+
Sbjct: 216 CNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQ 275
Query: 260 EMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT-GECGS-ALDHGVVAVGYGTENGVDY 316
LK+AV P+SVAI AG +F Y+SGV++ +C S LDHGV+ VGYG ENG+ Y
Sbjct: 276 VSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPY 335
Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WL++NSWG WGENGYVK+ R+ + CG+A ASYP+
Sbjct: 336 WLIKNSWGPHWGENGYVKILRD----HNNLCGVASMASYPL 372
>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 329
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/292 (44%), Positives = 180/292 (61%), Gaps = 18/292 (6%)
Query: 72 RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
R+ FKDNL FI N++N+ ++G FADLTNEEYRA+YLG DA + Q
Sbjct: 48 RYSAFKDNLDFIHRWNAVNKETELGATVFADLTNEEYRAVYLGMNVDASNFAAQPATLDQ 107
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
Y + ++DWR GAV VKDQG CGSCWAFST AVEG ++I TG +SLSEQ
Sbjct: 108 VY-----QPVRSTLDWRNNGAVGRVKDQGQCGSCWAFSTTGAVEGAHQIATGNFVSLSEQ 162
Query: 192 ELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN---KCDPSRRNAKVV 247
+L+DC R N GC GGLMD A +I++ GG+++E+ YPY ++ K +P+ AK
Sbjct: 163 QLMDCSRSYGNHGCQGGLMDSAMSYIVKQGGINTEESYPYEMRDSYTCKYNPANNGAK-- 220
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGS-ALDHGVVA 305
+ GY ++ E L + PV++A++A +FQ Y+SGVF C S +L HGV+A
Sbjct: 221 -LSGYSNIKRGSEADLAAKLNIGPVAIALDASHSSFQLYKSGVFYDPACSSTSLSHGVLA 279
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
VGYGTE YW+V+NSWG+ WG+ GY+ + + D N CG+A +S P+
Sbjct: 280 VGYGTEGSSAYWIVKNSWGTRWGDAGYIWIAK---DRNN-HCGVATMSSIPI 327
>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
Length = 329
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 195/312 (62%), Gaps = 21/312 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
++ W K+ + S G+ E R +I+ +N+ ++ E N+ +YK+ N+FADLTN EYR +
Sbjct: 30 WEGWKLKYNR-SYGL-DEELRKKIWANNMLYVKEFNAEGHSYKLAANQFADLTNLEYRQI 87
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQGSCGSCWAFST 170
YLG ++A+ + QR K DE LP +VDWR KG V PVK+QG CGSCW+FS
Sbjct: 88 YLGYDNEARLSRKREGKVFQR---KMKDEDLPTTVDWRSKGVVTPVKNQGQCGSCWSFSA 144
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
++EG I +G+L+S SEQELVDC + N GC GGLMDYAF++ N + E DY
Sbjct: 145 TGSLEGQYAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDYAFKYWETNLA-EKESDYT 203
Query: 230 YLGAENKCDPSRRNAK--VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHY 286
Y KC + NA+ V + D+ + +LK+AVA++ P++VA++A +FQ Y
Sbjct: 204 YTAKNGKC---KYNAQLGVTKDSSFTDIPSENCDALKEAVANKGPIAVAMDASHTSFQMY 260
Query: 287 ESGVFTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
SG++T S LDHGV+ VGYGT+NGVDYWL++NSWG WG +GY K++ +
Sbjct: 261 HSGIYTPFLCSKTKLDHGVLVVGYGTDNGVDYWLIKNSWGMAWGMDGYFKIE-----MKS 315
Query: 345 GKCGIAMEASYP 356
KCGI +ASYP
Sbjct: 316 DKCGICTQASYP 327
>gi|348531517|ref|XP_003453255.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 330
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 189/321 (58%), Gaps = 32/321 (9%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
+ W K K+ + +R QI+ N + + +HN+L +++++G+ FAD+ NEE
Sbjct: 26 FHAWKLKFEKSYDSSSEETQRKQIWLTNRKLVLKHNALADQGLKSFRLGMTYFADMENEE 85
Query: 108 YRAM--------YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
Y+ + L R+ RL K V LP++VDWRE+G V VK Q
Sbjct: 86 YKKLGCLGSFNASLPCRASTLNRLPKVTV------------LPKTVDWREQGYVTDVKHQ 133
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
CGSCWAFS A+EG + TG L+ LSEQ+LVDC RK N GC+GG ++AFQ+I
Sbjct: 134 QQCGSCWAFSATGALEGQHFKKTGTLVPLSEQQLVDCSRKYRNNGCDGGEPNWAFQYIRD 193
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
NGG+D+E+ Y Y + +C R N+ +GY DVSPF+E ++ P+SV+I+
Sbjct: 194 NGGVDTEKSYRYEAKDGQCR-YRSNSIGAKCNGYVDVSPFEEALMEAVATIGPISVSIDD 252
Query: 279 GGRAFQHYESGVFTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
+FQ Y+SGV+ S L+H V+AVGYGTENG DYWLV+NSWGS WG GY+K+
Sbjct: 253 SRVSFQLYQSGVYDEPWCSNINLNHAVLAVGYGTENGHDYWLVKNSWGSGWGNKGYIKMT 312
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
RN +CGIA EASYP+
Sbjct: 313 RN----KGNQCGIATEASYPL 329
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 136/300 (45%), Positives = 186/300 (62%), Gaps = 25/300 (8%)
Query: 72 RFQIFKDNLRFID----EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R I++ NL I+ EH+ +Y++G+N F D+T+EE+R + G + +R+ + S
Sbjct: 47 RRMIWEKNLNKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQRKTERKAIGSL 106
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ P +VDWREKG V PVKDQG CGSCWAFST A+ZG N G+L+S
Sbjct: 107 FMEPNFMVA-----PSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVS 161
Query: 188 LSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ LVDC R + N GC GGLMD AFQ++ N G+DSE YPYLG +++ P + K
Sbjct: 162 LSEQNLVDCSRPEGNEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQ--PCHYDPKY 219
Query: 247 VSID--GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGS-ALDH 301
S++ G+ D+ E +L KAVA PVSVAI+AG +FQ Y+SG+ + EC S LDH
Sbjct: 220 NSVNDTGFVDIPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDH 279
Query: 302 GVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GV+AVGYG E +G YW+V+NSW WG+ GY+ + ++ CGIA ASYP+
Sbjct: 280 GVLAVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKD----RKNHCGIATAASYPL 335
>gi|323451555|gb|EGB07432.1| hypothetical protein AURANDRAFT_2413 [Aureococcus anophagefferens]
Length = 263
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 131/272 (48%), Positives = 173/272 (63%), Gaps = 10/272 (3%)
Query: 86 HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
HN+ N TYK+G N+F+ + +E+ A Y+G + AK + + + A K D + V
Sbjct: 1 HNAKNSTYKLGHNEFSGMFWDEFVAQYVGDATGAKAYMERERNYDYTLA-KQVDAVASDV 59
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DW GAV VK+QG CGSCW+FST A+EG +I L SLSEQ LVDCD ++GCN
Sbjct: 60 DWVASGAVTGVKNQGQCGSCWSFSTTGALEGAFEIAGNTLTSLSEQNLVDCDT-TDSGCN 118
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GGLMD AF++I NGG+ SE DY Y A+ C + KV ++ G+ DV DE +LK
Sbjct: 119 GGLMDNAFKWIQSNGGICSEADYAYTAAKGTCKTTCD--KVATLSGHTDVPSGDEDALKT 176
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVF-TGECGSALDHGVVAVGYGTENGVDYWLVRNSWG 324
AVA PVS+AIEA FQ Y SG+ + CG+ LDHGV+ VGYGT++G +YW V+NSWG
Sbjct: 177 AVAIGPVSIAIEADKSVFQSYSSGILDSSACGTNLDHGVLVVGYGTDDGSEYWKVKNSWG 236
Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+ WGE+GYV++ R + CGIA E SYP
Sbjct: 237 TTWGESGYVRIAR-----GSNICGIASEPSYP 263
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 139/317 (43%), Positives = 189/317 (59%), Gaps = 28/317 (8%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
++ + ++GK + ++RF++F DNL+ I HN +YK+G+N+F D+T +E+R
Sbjct: 60 LFARFAHRYGKRYETVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDITWDEFRR 119
Query: 111 MYLGTRSD----AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
LG + K L + V LPE+ DWRE G V+PVK+QG CGSCW
Sbjct: 120 DRLGAAQNCSATTKGNLKLTNVV-----------LPETKDWREAGIVSPVKNQGKCGSCW 168
Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
FST A+E G+ ISLSEQ+LVDC N GCNGGL AF++I NGG+D+E
Sbjct: 169 TFSTTGALEAAYGQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTE 228
Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQ 284
+ YPY G C S N V ID +++ E LK AVA +PVS+A E + F+
Sbjct: 229 EAYPYTGKNGLCKFSSENVGVKVIDSV-NITLGAEDELKYAVALVRPVSIAFEV-IKGFK 286
Query: 285 HYESGVFTG-ECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
Y+SGV+T ECG+ ++H V+AVGYG ENGV YWL++NSWG+DWG+NGY K++
Sbjct: 287 QYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKME---- 342
Query: 341 DTNTGKCGIAMEASYPV 357
CGIA ASYPV
Sbjct: 343 -MGKNMCGIATCASYPV 358
>gi|426362423|ref|XP_004048364.1| PREDICTED: cathepsin L2 isoform 1 [Gorilla gorilla gorilla]
gi|426362425|ref|XP_004048365.1| PREDICTED: cathepsin L2 isoform 2 [Gorilla gorilla gorilla]
Length = 334
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 198/328 (60%), Gaps = 29/328 (8%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
+ D + T + W A H + G NE+ R +++ N++ I+ HN + +
Sbjct: 20 KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N F D+TNEE+R M +G + K R K KV + +LP+SVDWR+KG V PV
Sbjct: 77 MNAFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K+Q CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGG M AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
+ +NGG+DSE+ YPY+ + C N+ V + G+ V+P E +L KAVA P+SV
Sbjct: 190 VKENGGLDSEESYPYVAMDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISV 248
Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
A++AG +FQ Y+SG+ F +C S LDHGV+ VGYG E N YWLV+NSWG +WG
Sbjct: 249 AVDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
NGYVK+ + D N CGIA ASYP
Sbjct: 309 SNGYVKIAK---DKNN-HCGIATAASYP 332
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 142/328 (43%), Positives = 200/328 (60%), Gaps = 18/328 (5%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
D +M ++ + A + +T +RF++++ N+ +I+ N + TY++G N+FADL
Sbjct: 33 DMLMMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADL 92
Query: 104 TNEEYRAMY-----LGTRSDA-KRRLMKSKVASQ------RYACKAGDEL-PESVDWREK 150
T +E+RAMY + +R DA +RR M + +A Y A +E P SVDWR K
Sbjct: 93 TVQEFRAMYTMPARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSK 152
Query: 151 GAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMD 210
GAV PVKDQG CG CWAF+TVA +EG++KI TG+L+SLSEQELVD + GC GGL +
Sbjct: 153 GAVTPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVD-CDDADDGCGGGLPE 211
Query: 211 YAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ 270
A +++ NGG+ +E +YPY G KCD + + I + V E L++AVA Q
Sbjct: 212 IAMEWVAHNGGLTTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAVARQ 271
Query: 271 PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGE 329
PV+VAI A + Y+SGV++G C + DH V VGYG +N G YW+++NSW WGE
Sbjct: 272 PVAVAINAPD-SLMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKNSWAETWGE 330
Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GY ++QR + G CGIA ASYPV
Sbjct: 331 KGYGRMQRGVA-AKEGLCGIATHASYPV 357
>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 189/310 (60%), Gaps = 21/310 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEY-R 109
+ ++AK+GK+ + R ++FK NL + +N+ N TY++GLNKFAD T EY R
Sbjct: 43 FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARNDVTYRLGLNKFADYTEAEYKR 102
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
+ G + + R +K A + + V+W E+GAV PVKDQG CGSCW+FS
Sbjct: 103 LLGFGGQKNKNPRNIKVLGAPKN----------DGVNWVEQGAVTPVKDQGQCGSCWSFS 152
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
A+EG KI G L SLSEQ+LVDC + + N GC GG MD AFQ++ Q +++E Y
Sbjct: 153 ATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQT-ALETEDQY 211
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY ++ C S +A VV +D + DV+P + LK A+ PVSVAIEA FQ Y
Sbjct: 212 PYEAVDDTCRAS--SAGVVKVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQFYSG 269
Query: 289 GVFT-GECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
GV CG+ LDHGV+AVGYG E+G DY+LV+NSWG+ WGE GYVK+ + + C
Sbjct: 270 GVINDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAASPDNI----C 325
Query: 348 GIAMEASYPV 357
GI +ASYP+
Sbjct: 326 GILSQASYPI 335
>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
Length = 308
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 187/310 (60%), Gaps = 22/310 (7%)
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGT 115
GK N + R IF++N + + +HN T+ + +NKF DLT EE+R + +G+
Sbjct: 8 GKQYNSLSEENARHSIFEENSKIVKQHNEEAAMGKHTFFMKMNKFGDLTTEEFRMIVIGS 67
Query: 116 RSDAKRRLMKSKVASQR----YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
M+S Q + G ++ ++VDWR+KGAV VK+Q CGSCWAFS
Sbjct: 68 G------FMQSNKTQQAEGGVFESLPGLKVDDTVDWRQKGAVTKVKNQEQCGSCWAFSAT 121
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
++EG + + T L+SLSEQ LVDC R+ N GC GG MD AF++I NGG+D+E+ Y Y
Sbjct: 122 GSLEGQHFLKTNNLVSLSEQNLVDCSRREGNKGCKGGSMDQAFKYIKMNGGIDTEECYSY 181
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
G + + + ++ Y D+ DEM+L +AV+ P+SVAI+AG ++FQ Y G
Sbjct: 182 RGRDESMCRYKSSCSGATLSSYTDIKTGDEMALMQAVSTVGPISVAIDAGHKSFQLYHHG 241
Query: 290 VF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
V+ +C S LDHGV+AVGYG+ NG DYWLV+NSWG++WG GY+ + RN +C
Sbjct: 242 VYDEPKCSSTHLDHGVLAVGYGSSNGSDYWLVKNSWGTEWGMEGYIMMSRN----KHNQC 297
Query: 348 GIAMEASYPV 357
GIA A YPV
Sbjct: 298 GIATRAIYPV 307
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 141/325 (43%), Positives = 201/325 (61%), Gaps = 21/325 (6%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRF--QIFKDNLRFIDEHNSL----NRTYKVGLNK 99
D + +QT+ +H K N + E+RF +IF +N I +HN L ++K+GLNK
Sbjct: 21 DVIKEEWQTFKMEHRK--NYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNK 78
Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSK-VASQRYACKAGDELPESVDWREKGAVNPVKD 158
+AD+ + E++ G ++ L + Y A ++P++VDWR+ GAV VKD
Sbjct: 79 YADMLHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKD 138
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFII 217
QG CGSCW+FS+ ++EG + G L+SLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 139 QGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 198
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVADQ-PVSVA 275
NGG+D+E+ YPY G ++ C ++ A V + D G+ D+ DE ++ KAVA PV+VA
Sbjct: 199 DNGGVDTEKSYPYEGIDDSCHFNK--ATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVA 256
Query: 276 IEAGGRAFQHYESGVFTG-ECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGY 332
I+A +FQ Y GV+ C S LDHGV+ VGYGT+ +G DYWLV+NSWG+ WG+ GY
Sbjct: 257 IDASNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGY 316
Query: 333 VKLQRNLLDTNTGKCGIAMEASYPV 357
+K+ RN +CGIA +S+P
Sbjct: 317 IKMARN----QDNQCGIATASSFPT 337
>gi|23110960|ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]
gi|320118898|ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]
gi|12644075|sp|O60911.2|CATL2_HUMAN RecName: Full=Cathepsin L2; AltName: Full=Cathepsin U; AltName:
Full=Cathepsin V; Flags: Precursor
gi|3107915|dbj|BAA25909.1| cathepsin V [Homo sapiens]
gi|3228672|gb|AAC23598.1| cathepsin U [Homo sapiens]
gi|3869129|dbj|BAA34365.1| cathepsin L2 [Homo sapiens]
gi|23958123|gb|AAH23504.1| CTSL2 protein [Homo sapiens]
gi|37182404|gb|AAQ89004.1| cathepsin L2 [Homo sapiens]
gi|83405150|gb|AAI10513.1| Cathepsin L2 [Homo sapiens]
gi|119579235|gb|EAW58831.1| cathepsin L2, isoform CRA_a [Homo sapiens]
gi|119579236|gb|EAW58832.1| cathepsin L2, isoform CRA_a [Homo sapiens]
Length = 334
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 198/328 (60%), Gaps = 29/328 (8%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
+ D + T + W A H + G NE+ R +++ N++ I+ HN + +
Sbjct: 20 KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N F D+TNEE+R M +G + K R K KV + +LP+SVDWR+KG V PV
Sbjct: 77 MNAFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K+Q CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGG M AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
+ +NGG+DSE+ YPY+ + C N+ V + G+ V+P E +L KAVA P+SV
Sbjct: 190 VKENGGLDSEESYPYVAVDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISV 248
Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
A++AG +FQ Y+SG+ F +C S LDHGV+ VGYG E N YWLV+NSWG +WG
Sbjct: 249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
NGYVK+ + D N CGIA ASYP
Sbjct: 309 SNGYVKIAK---DKNN-HCGIATAASYP 332
>gi|3087790|emb|CAA75029.1| cathepsin L2 [Homo sapiens]
Length = 334
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 198/328 (60%), Gaps = 29/328 (8%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
+ D + T + W A H + G NE+ R +++ N++ I+ HN + +
Sbjct: 20 KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N F D+TNEE+R M +G + K R K KV + +LP+SVDWR+KG V PV
Sbjct: 77 MNAFPDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K+Q CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGG M AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
+ +NGG+DSE+ YPY+ + C N+ V + G+ V+P E +L KAVA P+SV
Sbjct: 190 VKENGGLDSEESYPYVAVDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISV 248
Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
A++AG +FQ Y+SG+ F +C S LDHGV+ VGYG E N YWLV+NSWG +WG
Sbjct: 249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
NGYVK+ + D N CGIA ASYP
Sbjct: 309 SNGYVKIAK---DKNN-HCGIATAASYP 332
>gi|67605684|ref|XP_666697.1| cryptopain precursor [Cryptosporidium hominis TU502]
gi|54657738|gb|EAL36466.1| cryptopain precursor [Cryptosporidium hominis]
Length = 401
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 145/369 (39%), Positives = 205/369 (55%), Gaps = 25/369 (6%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDD-------EVMTIYQTWLAKHG 60
+ I+TL+ +F + +S+ +N D + D E ++ + K+
Sbjct: 38 IIIATLIAIFIVL---VVTVSLYITNNTSDKIDDFVPGDYVDPATREYRKSFEEFKKKYN 94
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
KT + M +RF+I+K N+ FI NS +Y + +N+F DL+ EE+ A + G D+K
Sbjct: 95 KTYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARFTGYIKDSK 154
Query: 121 --RRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
R+ KS S A + +E P S++W E G VNP+++Q +CGSCWAFS VAA+EG
Sbjct: 155 DDERVFKSSRVS---ASELEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEG 211
Query: 177 INKIVTGE-LISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
T L SLSEQ+ VDC ++ N GC+GG M AFQ+ I+N + + DYPY E
Sbjct: 212 ATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDDYPYFAEE 271
Query: 235 NKC-DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT 292
C D N + + Y+ V P + +LK A+A P+SVAI+A FQ Y+SGVF
Sbjct: 272 KTCMDSFCENYIEIPVKAYKYVFPRNINTLKTALAKYGPISVAIQADQTPFQFYKSGVFD 331
Query: 293 GECGSALDHGVVAVGYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
CG+ ++HGVV VGY + +YWLVRNSWG WGE GY+KL L G CGI
Sbjct: 332 APCGTKVNHGVVLVGYDMDEDTNKEYWLVRNSWGEAWGEKGYIKLA--LHSGKKGTCGIL 389
Query: 351 MEASYPVKN 359
+E YPV N
Sbjct: 390 VEPVYPVIN 398
>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
Length = 360
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 185/310 (59%), Gaps = 28/310 (9%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
++GK + ++RF++F DNL+ I HN +YK+G+N+F DLT +E+R LG
Sbjct: 67 RYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQ 126
Query: 118 D----AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
+ K L + V LPE+ WRE G V+PVK+QG CGSCW FST A
Sbjct: 127 NCSATTKGNLKVTNVV-----------LPETKGWREAGIVSPVKNQGKCGSCWTFSTTGA 175
Query: 174 VEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
+E G+ ISLSEQ+LVDC N GCNGGL AF++I NGG+D+E+ YPY G
Sbjct: 176 LEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235
Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVF 291
C S N V ID +++ E LK AVA +PVS+A E + F+ Y+SGV+
Sbjct: 236 KNGLCKFSSENVGVKVIDSV-NITLGAEDELKYAVALVRPVSIAFEV-IKGFKQYKSGVY 293
Query: 292 TG-ECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
T ECG+ ++H V+AVGYG ENGV YWL++NSWG+DWG+NGY K++ C
Sbjct: 294 TSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKME-----MGKNMC 348
Query: 348 GIAMEASYPV 357
GIA ASYPV
Sbjct: 349 GIATCASYPV 358
>gi|114625736|ref|XP_001153919.1| PREDICTED: cathepsin L2 isoform 2 [Pan troglodytes]
gi|114625742|ref|XP_520130.2| PREDICTED: cathepsin L2 isoform 5 [Pan troglodytes]
Length = 334
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 198/328 (60%), Gaps = 29/328 (8%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
+ D + T + W A H + G NE+ R +++ N++ I+ HN + +
Sbjct: 20 KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N F D+TNEE+R M +G + K R K KV + +LP+SVDWR+KG V PV
Sbjct: 77 MNAFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K+Q CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGG M AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
+ +NGG+DSE+ YPY+ + C N+ V + G+ V+P E +L KAVA P+SV
Sbjct: 190 VKENGGLDSEESYPYVAMDEICKYRPENS-VANDTGFTVVTPGKEKALMKAVATVGPISV 248
Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
A++AG +FQ Y+SG+ F +C S LDHGV+ VGYG E N YWLV+NSWG +WG
Sbjct: 249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
NGYVK+ + D N CGIA ASYP
Sbjct: 309 SNGYVKIAK---DKNN-HCGIATAASYP 332
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 194/323 (60%), Gaps = 27/323 (8%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFAD 102
E+ + +Q +L HGK G +R I++ NL +I++HN + ++ +G+N++ D
Sbjct: 22 ELDSEWQLYLKAHGKQY-GAEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
+TNEE+R+ T + K R S+ + GD LP++VDWR KG V P+K+QG C
Sbjct: 81 MTNEEFRS----TMNGYKMRNGTSRGSLYLPPSNIGD-LPDTVDWRPKGYVTPIKNQGQC 135
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
GSCW+FS ++EG TG+L SLSEQ LVDC +K N GC GGLMD AFQ+I N G
Sbjct: 136 GSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSG 195
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSI--DGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
+D+E YPY KC R NA V G+ D+ E L+ AVA P+SVAI+A
Sbjct: 196 IDTESSYPYEAKNGKC---RFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDA 252
Query: 279 GGRAFQHYESGV----FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
+FQ Y SGV F E + LDHGV+AVGYGTE+G DYWLV+NSWG WG+ GY+
Sbjct: 253 SHMSFQLYRSGVYHEFFCSE--TRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIM 310
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
+ RN + CGIA ASYP
Sbjct: 311 MSRNKRNN----CGIATSASYPT 329
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 141/324 (43%), Positives = 189/324 (58%), Gaps = 18/324 (5%)
Query: 51 IYQTWL---AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADL 103
+ Q W+ +H K R +I+ N I +HN L + TY++ +NK+ D+
Sbjct: 24 VNQEWINFKMEHKKCYKHEAEERLRMKIYMKNKLQIAQHNCDYELKKVTYRLKINKYGDM 83
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKV-ASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
N E++ M G L ++ + ELP+ VDWR+ GAV VKDQG C
Sbjct: 84 LNHEFKNMLNGYNRTINHTLRNERLPVGAAFIEPCNVELPKMVDWRKCGAVTEVKDQGHC 143
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
GSCWAFS ++EG + TG L+SLSEQ L+DC N GCNGGLMD AF +I N G
Sbjct: 144 GSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGNNGCNGGLMDQAFSYIKDNKG 203
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGG 280
+D+E+ YPY G ++KC +R++ + G+ D+ DE LK AVA PVSVAI+A
Sbjct: 204 LDTEKTYPYEGEDDKCRYDKRSSGASDV-GFVDIPVGDEQKLKAAVATVGPVSVAIDASH 262
Query: 281 RAFQHYESGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
++FQ Y G+ F EC S LDHGV+ VGYGT E G DYW+V+NSWG WGE GY+K+ R
Sbjct: 263 QSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDEEGRDYWIVKNSWGESWGEKGYIKMAR 322
Query: 338 NLLDTNTGKCGIAMEASYPVKNSQ 361
N+ CGIA ASYP+ S+
Sbjct: 323 NI----DNHCGIASSASYPIVGSR 342
>gi|397499865|ref|XP_003820654.1| PREDICTED: cathepsin L2 isoform 1 [Pan paniscus]
gi|397499867|ref|XP_003820655.1| PREDICTED: cathepsin L2 isoform 2 [Pan paniscus]
Length = 334
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 145/328 (44%), Positives = 197/328 (60%), Gaps = 29/328 (8%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
+ D + T + W A H + G NE+ R +++ N++ I+ HN + +
Sbjct: 20 KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N F D+TNEE+R M +G + K R K KV + +LP+SVDWR+KG V PV
Sbjct: 77 MNAFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K+Q CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGG M AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
+ +NGG+DSE+ YPY+ + C N+ V + G+ V+P E +L KAVA P+SV
Sbjct: 190 VKENGGLDSEESYPYVAMDEICKYRPENS-VANDTGFTVVTPGKEKALMKAVATVGPISV 248
Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
A++AG +FQ Y+SG+ F +C S LDHGV+ VGYG E N YWLV+NSWG +WG
Sbjct: 249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
NGYVK+ ++ CGIA ASYP
Sbjct: 309 SNGYVKIAKD----KKNHCGIATAASYP 332
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/298 (46%), Positives = 184/298 (61%), Gaps = 20/298 (6%)
Query: 72 RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R ++F DN I HN L + +Y++ +N F DL + E+ G R + RR+ +
Sbjct: 51 RMKVFMDNKHKIARHNKLFQNGEVSYELEMNHFGDLLHHEFVKTVNGYRH-SLRRVTGDE 109
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ S + +P+SVDWR +GAV VK+QG CGSCWAFST ++EG + T +L S
Sbjct: 110 IDSVTFIPAYNVTVPDSVDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTS 169
Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRRNA 244
LSEQ L+DC K N GC+GGLMD AF +I N G+D+EQ YPY G ++KC P A
Sbjct: 170 LSEQNLIDCSGKYGNNGCSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKCRYKPQESGA 229
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGE-CGSA---L 299
+ G+ D+ DE LK AVA P+SVAI+A ++FQ Y+ GV+ + CG+ L
Sbjct: 230 ---TDKGFVDIPQGDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDL 286
Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
DHGV+AVGYGTENG DYWLV+NSWG WG +GY+K+ RN CGIA ASYP+
Sbjct: 287 DHGVLAVGYGTENGKDYWLVKNSWGKRWGLDGYIKMARN----KHNHCGIATSASYPL 340
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 244 bits (622), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 147/371 (39%), Positives = 213/371 (57%), Gaps = 31/371 (8%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
MATAS LA+ L + + +A I+ ++ ++ W A++
Sbjct: 3 MATASASLALVMLFACSLLLAGTAFSDDTIAIP--------------LLERFKAWQAEYN 48
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSD 118
+T ++RF ++ +NLRFI N L+ +Y++G N+F DLT EE++ YL +
Sbjct: 49 RTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDE 108
Query: 119 --AKRRLMKSKVASQRYACKA-GD---ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
M V + A + GD E P SVDWR KGAV PVK+Q CGSCWAF+TVA
Sbjct: 109 QPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVA 168
Query: 173 AVEGINKIVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
++EG+++I TG L+SLSEQE+VDCDR N GC GG A +++ +NGG+ +E DYPY+
Sbjct: 169 SIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYV 228
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
G++ +C + I GY+ V +E L++AVA +PV+V I+A RAFQ Y+ GVF
Sbjct: 229 GSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDA-SRAFQFYKRGVF 287
Query: 292 TGECG-SALDHGVVAVGYGTENGV-----DYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
+G C + ++H V VGYG+ YW+V+NSWG WGENGYV++ R + G
Sbjct: 288 SGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRARE-G 346
Query: 346 KCGIAMEASYP 356
C IA+E P
Sbjct: 347 MCAIAIEPLLP 357
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 143/322 (44%), Positives = 188/322 (58%), Gaps = 25/322 (7%)
Query: 48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADL 103
V++ +++W HGK+ + R +I +N I HN+ +Y + +N + DL
Sbjct: 23 VLSDWESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDL 82
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
+ E+ AM G K L S + S+ +LP VDWRE GAV PVK+QG CG
Sbjct: 83 LHHEFVAMVNGYEYVNKTSLGGSFIPSKNV------KLPTHVDWREDGAVTPVKNQGQCG 136
Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
SCWAFS+ ++EG TG+LI LSEQ LVDC RK N GC GGLMD+AF +I N G+
Sbjct: 137 SCWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGI 196
Query: 223 DSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
D+E YPY G +C DPS++ + + G+ DV E L KAVA PVSVAI+A
Sbjct: 197 DTEGSYPYEGVGGRCHYDPSKKGSSDI---GFVDVKKGSEEELLKAVASVGPVSVAIDAS 253
Query: 280 GRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKL 335
+FQ Y GV F +C LDHGV+ VGYGT+ +G DYWLV+NSW +WG+ GY+K+
Sbjct: 254 HMSFQFYSHGVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKM 313
Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
RN CGIA ASYPV
Sbjct: 314 ARN----KKNMCGIASSASYPV 331
>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
Length = 335
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 146/329 (44%), Positives = 199/329 (60%), Gaps = 30/329 (9%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNK 99
TD + + +W H KT +R +++ NL+ I+ HN SL + +Y++G+N+
Sbjct: 21 TDPALDNHWYSWKDWHKKTYAPKEEGWRRV-LWEKNLKMIEFHNLDHSLGKHSYRLGMNQ 79
Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
F D+TNEE++ + G ++ + + + E P+SVDWR+KG V PVKDQ
Sbjct: 80 FGDMTNEEFKQLMNGYKN-------QKMIRGSTFLAPNNFEAPKSVDWRKKGYVTPVKDQ 132
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQ 218
G CGSCWAFST A+EG + T +LISLSEQ LVDC R + N GCNGGLMD AFQ++
Sbjct: 133 GQCGSCWAFSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQGNEGCNGGLMDQAFQYVKD 192
Query: 219 NGGMDSEQDYPYLGAENK-C--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
NGG+DSE YPY +++ C DP+ +A G+ DV E L KAVA PVSV
Sbjct: 193 NGGIDSEDSYPYTAKDDQECHYDPNNNSANDT---GFVDVQSGCEKDLMKAVASVGPVSV 249
Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
AI+AG ++FQ Y+SG+ + EC S LDHGV+ VGYG E +G YW+V+NSW WG
Sbjct: 250 AIDAGHQSFQFYQSGIYYEPECSSEDLDHGVLVVGYGFESEDVDGKKYWIVKNSWSEKWG 309
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+NGY+ N+ CGIA ASYP+
Sbjct: 310 DNGYI----NIAKDRHNHCGIATAASYPL 334
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 194/319 (60%), Gaps = 27/319 (8%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEY----- 108
+H K + R +I+ N I +HN +++ +NK+ADL +EE+
Sbjct: 33 QHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLN 92
Query: 109 ---RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
R+ G++ + +LM + + A ++P ++DWREKGAV PVKDQG CGSC
Sbjct: 93 GFNRSAAAGSKLLGREQLMTIE-EPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSC 151
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
W+FS A+EG + TG+L+SLSEQ LVDC K N GCNGGLMD AFQ++ N G+D+
Sbjct: 152 WSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYVKDNKGIDT 211
Query: 225 EQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGR 281
E+ YPY +++C N K + + G+ D+ DE +LKKA+A PVSVAI+A
Sbjct: 212 EKAYPYEAIDDEC---HYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVSVAIDASHE 268
Query: 282 AFQHYESGV-FTGECGS-ALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+FQ Y GV + +C S LDHGV+AVGYG TE+G DYWLV+NSWG+ WG+ GYVK+ RN
Sbjct: 269 SFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARN 328
Query: 339 LLDTNTGKCGIAMEASYPV 357
CGIA ASYP+
Sbjct: 329 ----RENHCGIATTASYPL 343
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 141/324 (43%), Positives = 196/324 (60%), Gaps = 25/324 (7%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFA 101
DE ++++W +K+ + G R +++ NL+ I+ HN SL + +Y +G+N F
Sbjct: 26 DEHWDLWKSWHSKNYQHEKEEGW---RRMVWEKNLKKIEMHNLEHSLGKHSYSLGMNHFG 82
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
D+TNEE+R + G + L + K + E P+ VDWRE+G V PVKDQG
Sbjct: 83 DMTNEEFRQVMNGYK------LQQRKFKGSLFLEPNNMEAPKQVDWREEGYVTPVKDQGQ 136
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFST A+EG T +L+SLSEQ LVDC R + N GCNGGLMD AFQ+I N
Sbjct: 137 CGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNS 196
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
G+DSE+ YPYLG +++ + + G+ D+ E +L KA+A PVSVAI+AG
Sbjct: 197 GLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSGKEHALMKAIASVGPVSVAIDAG 256
Query: 280 GRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYV 333
+FQ Y+SG+ + EC S LDHGV+AVGYG E +G YW+V+NSW WG+ GY+
Sbjct: 257 HESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYI 316
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
+ ++ CGIA ASYP+
Sbjct: 317 LMAKD----RKNHCGIATAASYPL 336
>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
Length = 358
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 199/349 (57%), Gaps = 38/349 (10%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLN 98
S+S ++ + + W+ KH ++ N R+ ++K N+ +++E NS +GLN
Sbjct: 17 SASSYSEQQYRDSFTNWMQKHSRSYASHEFN-TRYSVYKKNMDYVNEWNSKGSETVLGLN 75
Query: 99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
AD+TN+EY+A+YLGT++DA RL + ++ K LP S+DW +GAV VK+
Sbjct: 76 SLADMTNQEYQAIYLGTKTDATARLAAASASASF--GKVQGALPASIDWVAQGAVTQVKN 133
Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFII 217
QG CGSCW+FS + EG ++I T L++LSEQ L+DC N GCNGGLMD AF++II
Sbjct: 134 QGQCGSCWSFSATGSTEGAHQISTSNLVALSEQNLIDCSSSYGNDGCNGGLMDNAFKYII 193
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
NGG+D+E YPY+ KC + N+ ++ Y DV+ E +L+ PVSVAI+
Sbjct: 194 ANGGIDTEASYPYVAKVQKCKYNPANSG-ATLSSYVDVTSGSESALQSQTVKGPVSVAID 252
Query: 278 AGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGT------------------------- 310
A ++FQ Y+SGV + C S LDHGV+ VGYGT
Sbjct: 253 ASHQSFQLYDSGVYYEPACSSTNLDHGVLVVGYGTASANGSSDSDSSAASQSSSSESSDD 312
Query: 311 --ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
G +W V+NSWG +WG +GY+++ RN D N CGIA AS P+
Sbjct: 313 QATQGAQFWKVKNSWGPEWGLSGYIQMARN-RDNN---CGIATTASQPI 357
>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 358
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 135/319 (42%), Positives = 194/319 (60%), Gaps = 19/319 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
++ W+A+ G++ G +R ++F N R +D N NRTY +GLN+F+DLT+ E+
Sbjct: 42 HERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTDHEFLQ 101
Query: 111 MYLG-TRSDAKRRLM--KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+LG R +R L+ + +V + A G ++P SVDWR KGAV +K+Q SCGSCWA
Sbjct: 102 QHLGYGRHHGQRGLLLPEEEVMPKATALGYGQDMPYSVDWRAKGAVTEIKNQRSCGSCWA 161
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDC--DRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
F+ VAA EG+ KI TG LIS+SEQ+++DC DR + C+ G + A ++++ +GG+ E
Sbjct: 162 FAAVAATEGLVKIATGNLISMSEQQVLDCTGDR---SSCDSGYISDALRYVVTSGGLQRE 218
Query: 226 QDYPYLGAENKCDPSRRNAK---VVSIDGYEDVS-PFDEMSLKKAVADQPVSVAIEAGGR 281
Y Y G + C SRR A+ S+ G + DE +L+ A QPV+V +EA
Sbjct: 219 AAYAYTGQKGACG-SRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAVIVEASEP 277
Query: 282 AFQHYESGVFTG--ECGSALDHGVVAVGYGTENGV-DYWLVRNSWGSDWGENGYVKLQRN 338
F+HY SGV+ G CG L+H + VGYGTENG +YWLV+N WG+ WGENGY+++ R
Sbjct: 278 DFRHYSSGVYAGSASCGRELNHALTVVGYGTENGAGEYWLVKNQWGTWWGENGYMRVARR 337
Query: 339 LLDTNTGKCGIAMEASYPV 357
+ CGIA A YP
Sbjct: 338 --NGAGANCGIASVAFYPT 354
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/296 (45%), Positives = 184/296 (62%), Gaps = 19/296 (6%)
Query: 72 RFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R IF+DNL+ I+ HN + +Y +G+N+FAD+T+ EY +G
Sbjct: 43 RRLIFQDNLKTIESHNQEADTGKHSYWLGVNQFADMTHAEYLNQVIGGCLITSNLTKTGS 102
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
A+ RY ++ ++VDWR+KG V +KDQG CGSCWAFST ++EG + TG L+S
Sbjct: 103 RATYRYM--PNMQVNDTVDWRDKGLVTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVS 160
Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRRNA 244
LSEQ LVDC R+ N GC GG MD FQ+IIQN G+D+EQ YPY ++C D S A
Sbjct: 161 LSEQNLVDCSRQEGNKGCEGGDMDQGFQYIIQNKGIDTEQCYPYKAKNHRCKFDNSCIGA 220
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTG-ECGSA-LDH 301
+ S + DV+ DE +LK+A A+ P+SV I+A ++FQ Y SGV+ EC S LDH
Sbjct: 221 TMSS---FTDVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSGVYNEFECSSTKLDH 277
Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GV+ VGYGT DYWLV+NSWG+ WG GY+ + RN +CG+A +AS+PV
Sbjct: 278 GVLVVGYGTYGSKDYWLVKNSWGTVWGNEGYIMMSRN----KDNQCGVATDASFPV 329
>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 188/310 (60%), Gaps = 21/310 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEY-R 109
+ ++AK+GK+ + R ++FK NL + +N N TY++GLNKFAD T EY R
Sbjct: 43 FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRLGLNKFADYTEAEYKR 102
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
+ G + + R +K A + + V+W E+GAV PVKDQG CGSCW+FS
Sbjct: 103 LLGFGGQKNKNPRNIKVLGAPKN----------DGVNWVEQGAVTPVKDQGQCGSCWSFS 152
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
A+EG KI G L SLSEQ+LVDC + + N GC GG MD AFQ++ Q +++E Y
Sbjct: 153 ATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQT-ALETEDQY 211
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
PY ++ C S +A VV +D + DV+P + LK A+ PVSVAIEA FQ Y
Sbjct: 212 PYEAVDDTCRAS--SAGVVKVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQFYSG 269
Query: 289 GVFT-GECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
GV CG+ LDHGV+AVGYG E+G DY+LV+NSWG+ WGE GYVK+ + + C
Sbjct: 270 GVINDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAASPDNI----C 325
Query: 348 GIAMEASYPV 357
GI +ASYP+
Sbjct: 326 GILSQASYPI 335
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 195/323 (60%), Gaps = 27/323 (8%)
Query: 47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFAD 102
E+ + +Q +L HGK G +R I++ NL +I++HN + ++ +G+N++ D
Sbjct: 22 ELDSEWQLYLKAHGKQY-GAEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
+TNEE+R+ T + K R S+ + GD LP++VDWR KG V P+K+QG C
Sbjct: 81 MTNEEFRS----TMNGYKMRNGTSRGSLYLPPSNIGD-LPDTVDWRPKGYVTPIKNQGQC 135
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
GSCW+FS ++EG TG+L SLSEQ LVDC +K N GC GGLMD AFQ+I N G
Sbjct: 136 GSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNG 195
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSI--DGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
+D+E YPY KC R NA V G+ D+ E L+ AVA P++VAI+A
Sbjct: 196 IDTESSYPYEAKNGKC---RFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDA 252
Query: 279 GGRAFQHYESGV----FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
+FQ Y+SGV F E + LDHGV+AVGYGTE+G DYWLV+NSWG WG+ GY+
Sbjct: 253 SHMSFQLYKSGVYHEFFCSE--TRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIM 310
Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
+ RN + CGIA ASYP
Sbjct: 311 MSRNKRNN----CGIATSASYPT 329
>gi|47522698|ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa]
gi|2499874|sp|Q28944.1|CATL1_PIG RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|1468964|dbj|BAA07140.1| porcine cathepsin L [Sus scrofa]
gi|15027272|emb|CAC44793.1| cathepsin L [Sus scrofa]
Length = 334
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/331 (43%), Positives = 197/331 (59%), Gaps = 25/331 (7%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
S++ + D + + W A HG+ GM R +++ N++ I+ HN +
Sbjct: 16 SAAPKLDQNLDADWYKWKATHGRLY-GMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFS 74
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ +N F D+TNEE+R + G ++ + K KV + E+P+SVDWREKG V
Sbjct: 75 MAMNAFGDMTNEEFRQVMNGFQNQKHK---KGKVFHESLVL----EVPKSVDWREKGYVT 127
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
VK+QG CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct: 128 AVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAF 187
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
Q++ NGG+D+E+ YPYLG E + + G+ D+ P E +L KAVA P+
Sbjct: 188 QYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPI 246
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
SVAI+AG +FQ Y+SG+ + +C S LDHGV+ VGYG E N +W+V+NSWG +
Sbjct: 247 SVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPE 306
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG NGYVK+ + D N CGI+ ASYP
Sbjct: 307 WGWNGYVKMAK---DQNN-HCGISTAASYPT 333
>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 398
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/381 (36%), Positives = 207/381 (54%), Gaps = 37/381 (9%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDE----VMTIYQTWLAKHGKTS 63
+ + T+ + + SSS ++ + D+ HS D +M +Q W+A G++
Sbjct: 16 VVLVTICQMLAVGSSS--ELMPPTTDDEMIHSDYSGRDKHNDLLMMGRFQGWMAAQGRSY 73
Query: 64 NGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
+RF+++K N+R+I+ N + T+++G F DLT+EE+ A+Y G+
Sbjct: 74 WTAEETARRFEVYKSNVRYIEAVNAEAATTGLTFELGEGPFTDLTHEEFSALYNGSMPPP 133
Query: 120 KRRL------------------MKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQ 159
+ + VA G P S DWR+ GAV P+KDQ
Sbjct: 134 EEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNLSAGGPRPWPPRSRDWRKHGAVTPIKDQ 193
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
G CGSCWAF TVA +EG +KIV G L+SLSEQ+L+DCD N+GC GG + A+++I +
Sbjct: 194 GRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQLIDCD-YTNSGCKGGFVIRAYRWIRKI 252
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
GG+ + YPY GA KC +R I G+ V E++L AVA QPV+V I A
Sbjct: 253 GGLTTSSAYPYKGARGKC--MKRRRAAARIAGWRSVRSRSEVALVNAVAGQPVAVYISAS 310
Query: 280 GRAFQHYESGVFTGECGSA-LDHGVVAVGYG--TENGVDYWLVRNSWGSDWGENGYVKLQ 336
G+ FQHY+ G+ G C +A L+H V VGYG + G YW+V+NSWG+ WG+ GY+ ++
Sbjct: 311 GKNFQHYKKGILNGPCDTARLNHAVTVVGYGRQADTGAKYWIVKNSWGTTWGQEGYILMK 370
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
R + G+CGIA +P+
Sbjct: 371 RGTRNPR-GQCGIATSPVFPL 390
>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
Length = 333
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 200/331 (60%), Gaps = 26/331 (7%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYK 94
S++ D + + W A H + GM R +++ N++ I++HN R ++
Sbjct: 16 SATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIEQHNQEYREGKHSFT 74
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ +N F D+T+EE+R + G ++ R+ K KV + +A P SVDWREKG V
Sbjct: 75 MAMNAFGDMTSEEFRQVMNGFQN---RKPRKGKVFQEPLFYEA----PRSVDWREKGYVT 127
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAF 213
PVK+QG CGSCWAFS A+EG TG+L+SLSEQ LVDC + N GCNGGLMDYAF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEGCNGGLMDYAF 187
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
Q++ NGG+DSE+ YPY E C + + + V + G+ D+ P E +L KAVA P+
Sbjct: 188 QYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFVDI-PKQEKALMKAVATVGPI 245
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
SVA++AG ++FQ Y+ G+ F +C S +DHGV+ VGYG E + YWLV+NSWG +
Sbjct: 246 SVAVDAGHQSFQFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEE 305
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG GY+K+ ++ + CGIA ASYP
Sbjct: 306 WGMGGYIKMAKDRRN----HCGIASAASYPT 332
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 133/327 (40%), Positives = 194/327 (59%), Gaps = 19/327 (5%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
D +M + W A H ++ +RFQ+++DN+ +I+ N + TY++G N+FADL
Sbjct: 35 DMLMMDRFLMWQATHNQSYRSAEERLRRFQVYRDNVEYIETTNRRGDLTYQLGENQFADL 94
Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYA--------CKAGDEL---PESVDWREKGA 152
T EE+ A + D R V + GD++ P SVDWR KGA
Sbjct: 95 TREEFIARFTSYNGDDDRTGDDDSVITTAAVGGGDPDLWSSGGDDVSLDPPSVDWRAKGA 154
Query: 153 VNPVKDQGSCGSC-WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
V P K Q S S WAF VA +E ++ I TG+L++LSEQ+LVDCD + + GCN G
Sbjct: 155 VVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLVALSEQQLVDCD-QYDGGCNRGTFRR 213
Query: 212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
AF ++IQNGG+ +E +YPY A+ C+ ++ + V +I G+ V +E+++K AVA QP
Sbjct: 214 AFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHVAAISGHASVPGSNELAMKHAVATQP 273
Query: 272 VSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGE 329
V+ AIE G Q Y+SGV++G CG+ L+H V VGYG + G YW+V+NSWG WGE
Sbjct: 274 VAAAIELGSD-MQFYKSGVYSGPCGARLEHAVTVVGYGADESTGDKYWIVKNSWGQTWGE 332
Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYP 356
GY+++QR +L G CGI ++ +YP
Sbjct: 333 RGYIRMQRKIL--GPGLCGIMLDVAYP 357
>gi|197258082|gb|ACH56225.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 282
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 132/275 (48%), Positives = 179/275 (65%), Gaps = 21/275 (7%)
Query: 92 TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ---RYACKAGDELPESVDWR 148
++K+G+N ADL EYR R + RR +AS+ ++ E+P++VDWR
Sbjct: 19 SFKIGINHIADLPFAEYR------RLNGFRRTFGDNIASRNATKWRAPLNFEVPDAVDWR 72
Query: 149 EKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGG 207
++G V PVK+QG CGSCWAFS ++EG +K TG+L+SLSEQ LVDC N GCNGG
Sbjct: 73 DEGYVTPVKNQGMCGSCWAFSATGSLEGQHKRATGKLVSLSEQNLVDCSADFGNNGCNGG 132
Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
LMD+AF+++ QN G+D+E+ YPY + KC + N G+ D+ DE LK AV
Sbjct: 133 LMDFAFEYVKQNHGIDTEESYPYKAKQKKCHFQKANVGADDT-GFVDLPEADEEQLKAAV 191
Query: 268 ADQ-PVSVAIEAGGRAFQHYESGVFTGECGSA--LDHGVVAVGYGT--ENGVDYWLVRNS 322
A Q PVSVAI+AG R+F+ Y++GV+ + S LDHGV+ VGYGT E+G DYW+V+NS
Sbjct: 192 ASQGPVSVAIDAGHRSFRLYKTGVYYEKHCSPEQLDHGVLVVGYGTDPEHG-DYWIVKNS 250
Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG +WGE GYV++ RN CGIA +ASYP+
Sbjct: 251 WGEEWGEKGYVRIARN----RNNHCGIASKASYPL 281
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 132/294 (44%), Positives = 182/294 (61%), Gaps = 16/294 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
++++ AK+GKT + R I+ + EHN+ +YK+GLN FAD+ N E
Sbjct: 27 WESYKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKLGLNSFADMHNGE 86
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R M G R R + V S LP SVDWR KGAV P+K+QG CGSCWA
Sbjct: 87 FRKMMNGYRRGTPRNSVVVHVESNI-------TLPASVDWRTKGAVTPIKNQGQCGSCWA 139
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FST ++EG + + G+L+SLSEQELVDC + N GC+GGLMD AF +I +N G+D+EQ
Sbjct: 140 FSTTGSLEGQHALKKGKLVSLSEQELVDCSAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQ 199
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
YPY G + C + + ++ G+ DV+ E L+ A A P+SVAI+A FQ
Sbjct: 200 SYPYTGEDGTCSFKKSDV-AATVTGFVDVTSGSESGLQDASATIGPISVAIDASSWDFQL 258
Query: 286 YESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
YESGV+ +C + LDHGV+ VGYGT++G YWLV+NSWG+DWG +GY+++ R
Sbjct: 259 YESGVYDVSDCSTTELDHGVLVVGYGTDDGTAYWLVKNSWGTDWGHHGYIQMSR 312
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/319 (42%), Positives = 196/319 (61%), Gaps = 25/319 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEE 107
+ +W ++HGK+ + +R I+++NLR I++HN SL N T+K+G+N+F D+TNEE
Sbjct: 28 WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEE 86
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R G + D R + + P+ VDWR++G V PVKDQ CGSCW+
Sbjct: 87 FRQAMNGYKQDPNRTSKGALFMEPSFFAA-----PQQVDWRQRGYVTPVKDQKQCGSCWS 141
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS+ A+EG TG+LIS+SEQ LVDC R + N GCNGG+MD AFQ++ +N G+DSEQ
Sbjct: 142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQ 201
Query: 227 DYPYLGAENKCDPSRRNAK--VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF 283
YPYL ++ P R + + V I G+ D+ +E++L AVA PVSVAI+A ++
Sbjct: 202 SYPYLARDDL--PCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSL 259
Query: 284 QHYESGVFTGE-CGSALDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQRN 338
Q Y+SG++ C S LDH V+ VGYG + G YW+V+NSW WG+ GY+ + ++
Sbjct: 260 QFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKD 319
Query: 339 LLDTNTGKCGIAMEASYPV 357
CGIA ASYP+
Sbjct: 320 ----KNNHCGIATMASYPL 334
>gi|351721011|ref|NP_001238219.1| P34 probable thiol protease precursor [Glycine max]
gi|1199563|gb|AAB09252.1| 34 kDa maturing seed vacuolar thiol protease precursor [Glycine
max]
Length = 379
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 135/345 (39%), Positives = 207/345 (60%), Gaps = 28/345 (8%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNK 99
T +V +++Q W ++HG+ + KR +IFK+N +I + N+ NR ++++GLNK
Sbjct: 36 TQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNA-NRKSPHSHRLGLNK 94
Query: 100 FADLTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
FAD+T +E+ YL D + ++ K+ ++Y+C D P S DWR+KG + VK
Sbjct: 95 FADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSC---DHPPASWDWRKKGVITQVK 151
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
QG CG WAFS A+E + I TG+L+SLSEQELVDC + + G G +F++++
Sbjct: 152 YQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNGWQYQSFEWVL 210
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD-------EMSLKKAVADQ 270
++GG+ ++ DYPY E +C ++ K V+IDGYE + D E + A+ +Q
Sbjct: 211 EHGGIATDDDYPYRAKEGRCKANKIQDK-VTIDGYETLIMSDESTESETEQAFLSAILEQ 269
Query: 271 PVSVAIEAGGRAFQHYESGVFTGE-CGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
P+SV+I+A + F Y G++ GE C S ++H V+ VGYG+ +GVDYW+ +NSWG DW
Sbjct: 270 PISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAKNSWGEDW 327
Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN---SAKPKPH 369
GE+GY+ +QRN + G CG+ ASYP K SA+ K H
Sbjct: 328 GEDGYIWIQRNTGNL-LGVCGMNYFASYPTKEESETLVSARVKGH 371
>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 152/371 (40%), Positives = 208/371 (56%), Gaps = 30/371 (8%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMS----IISYDNNHDHSSSWRT---DDEVMTIYQ 53
MA + + S L L +++ S+ D S ++S D HD SS+ +
Sbjct: 1 MARVAGLVVSSILFLLCCVAAGSSFDESNPIKLVS-DRLHDFESSFVKVLGQSRRALSFA 59
Query: 54 TWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYL 113
+ +HGK G + RF IF ++L I N Y +GLN+FAD T +E++ L
Sbjct: 60 RFAHRHGKRYETEGEMKLRFAIFSESLDLIRSTNKKGLPYTLGLNQFADWTWQEFQKYRL 119
Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDEL-PESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
G + A+ R K + L PE+ DWRE+G V+PVK+QG CGSCW FST
Sbjct: 120 GAAQNCS--------ATTRGNHKLTNALLPETKDWREEGIVSPVKNQGHCGSCWTFSTTG 171
Query: 173 AVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
A+E G+ ISLSEQ+LVDC R N GCNGGL AF++I NGG+D+E+ YPY
Sbjct: 172 ALEAAYHQAFGKGISLSEQQLVDCARAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYT 231
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGV 290
G ++ C S N V ++ +++ E LK AVA +PVSVA E G +F+ Y+ GV
Sbjct: 232 GKDDACKFSSENVGVRVVESV-NITLGAEDELKHAVAFVRPVSVAFEVVG-SFRLYKEGV 289
Query: 291 F-TGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
+ T CGS ++H V+AVGYG ENG+ YWL++NSWG DWG+NGY K++
Sbjct: 290 YTTSTCGSTPMDVNHAVLAVGYGVENGIPYWLIKNSWGEDWGDNGYFKME-----MGKNM 344
Query: 347 CGIAMEASYPV 357
CGIA ASYPV
Sbjct: 345 CGIATCASYPV 355
>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
Length = 313
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 134/307 (43%), Positives = 184/307 (59%), Gaps = 18/307 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+ ++ A++GK R ++F N+ + + NS + Y VG FAD+TN E+
Sbjct: 23 FNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTVGATPFADMTNTEFAV- 81
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
S ++K K+ + A + E+VDWREKGAV PVK+Q SCGSCWAFS
Sbjct: 82 -----SKLCGCMLKPKMT--KPATPIMEPAAEAVDWREKGAVTPVKNQASCGSCWAFSAT 134
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
A+EG N + GELISLSEQ+LVDCD + ++GC GGLM YAF++ + GM E+DYPY
Sbjct: 135 GAMEGRNFVANGELISLSEQQLVDCDHQ-SSGCGGGLMTYAFEY-AKKKGMCKEEDYPYH 192
Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
+ C + VV GYE+V FD +LK+AV+ PVSVA+EA FQ Y GV
Sbjct: 193 AVDEDCK-DDKCTPVVFPKGYEEVPRFDGAALKQAVSQGPVSVAVEADSIVFQMYTGGVI 251
Query: 292 -TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
+ CG++L+HGV+AVGY G DYW+V+NSWG WG+ GY+K++ ++ G CGI
Sbjct: 252 DSSACGTSLNHGVLAVGY----GADYWIVKNSWGESWGDKGYLKIKYT--ESGAGICGIN 305
Query: 351 MEASYPV 357
SYP
Sbjct: 306 QMNSYPT 312
>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
purpuratus]
gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 334
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 196/316 (62%), Gaps = 20/316 (6%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADLTNEE 107
++ W+ HGK + MG +R I++DNLR I +HN TY++G+N+F D+TN E
Sbjct: 28 WKEWVDYHGKEYSAMGEEMERRMIWEDNLRIITKHNLEHSQGKTTYRLGMNEFGDMTNAE 87
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+ A TR+ K + + +LP+SVDWR +G V PVKDQG CGSCWA
Sbjct: 88 FVA----TRTMKKMSGVPKVGQGSTFLPSEFLQLPDSVDWRTEGYVTPVKDQGQCGSCWA 143
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FSTV A+EG + + TG L+SLSEQ LVDC + + N GCNGG +A ++I NGG+D+E
Sbjct: 144 FSTVGALEGQHFVKTGTLVSLSEQNLVDCSQAEGNDGCNGGWPAWADEYIKSNGGIDTEV 203
Query: 227 DYPYLGAENKCDPSRRNAKV-VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
YPY G ++ C R + V +I G+ +V E +L+KA+A P+SV I+A +FQ
Sbjct: 204 GYPYEGVDDSC--HYRTSDVGATITGFAEVEADSEKALEKALAQVGPISVCIDATQPSFQ 261
Query: 285 HYESGVF-TGECGS-ALDHGVVAVGY-GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
YESGV+ +C S ALDH V AVGY T +G Y++V+NSWG+ WG+ GY+ + R+
Sbjct: 262 LYESGVYDEPDCSSTALDHCVTAVGYDSTADGDKYYIVKNSWGTTWGQEGYIWMSRD--- 318
Query: 342 TNTGKCGIAMEASYPV 357
+CGIA A+YP+
Sbjct: 319 -KQKQCGIATNATYPL 333
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 146/364 (40%), Positives = 206/364 (56%), Gaps = 30/364 (8%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDE-VMTI---------YQTWLAKH 59
+ ++V + I++S+AAD+ + S R +E V+ I + + ++
Sbjct: 7 LPSVVLVILIAASAAADIGFDESNPIRMVSDGLREIEESVVQILGQSRHVLSFARFTHRY 66
Query: 60 GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
GK + RF IFK+NL I N +YK+G+N+FADLT +E++ LG +
Sbjct: 67 GKKYQNAEEIKLRFSIFKENLDLIRSTNKKRLSYKLGVNQFADLTWQEFQRNKLGAAQNC 126
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
L S ++ LPE+ DWRE G V+PVKDQG CGSCW FST A+E
Sbjct: 127 SATLKGSHKLTEA-------ALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYH 179
Query: 180 IVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
G+ ISLSEQ+LVDC N GCNGGL AF++I NGG+D+E+ YPY G + C
Sbjct: 180 QAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKDGTCK 239
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFT-GECG 296
S N V +D +++ E LK AV +PVS+A E ++F+ Y+SGV+T CG
Sbjct: 240 YSAENVGVQVLDSV-NITLGAEDELKHAVGLVRPVSIAFEV-VKSFRLYKSGVYTDSHCG 297
Query: 297 SA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEA 353
+ ++H V+AVGYG E+GV YWL++NSWG+DWG+ GY K++ CGIA A
Sbjct: 298 NTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKGYFKME-----MGKNMCGIATCA 352
Query: 354 SYPV 357
SYPV
Sbjct: 353 SYPV 356
>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 137/307 (44%), Positives = 187/307 (60%), Gaps = 22/307 (7%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
++GK + ++RF++F DNL+ I HN +YK+G+N+F DLT +E+R LG
Sbjct: 67 RYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQ 126
Query: 118 DAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
+ A+ + K + LPE+ DWRE G V+PVK+QG CGSCW FST A+E
Sbjct: 127 NCS--------ATTKGNVKLTNAVLPETKDWREDGIVSPVKNQGKCGSCWTFSTTGALEA 178
Query: 177 INKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
G+ ISLSEQ+LVDC N GCNGGL AF++I NGG+D+E+ YPY G
Sbjct: 179 AYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNG 238
Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFTG- 293
C S N V ID +++ E LK AVA +PVS+A E + F+ Y+SGV++
Sbjct: 239 LCKFSSENVGVKVIDSV-NITLGAEDELKYAVALVRPVSIAFEV-IKGFKQYKSGVYSST 296
Query: 294 ECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
ECG+ ++H V+AVGYG ENGV YWL++NSWG+DWG++GY K++ CGIA
Sbjct: 297 ECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDDGYFKME-----MGKNMCGIA 351
Query: 351 MEASYPV 357
ASYPV
Sbjct: 352 TCASYPV 358
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 30/319 (9%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
++ + K G+ + R +F DNL++I+E N TY + +N+F+D+TNE+
Sbjct: 20 WEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEK 79
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES--VDWREKGAVNPVKDQGSCGSC 165
+ A+ G + + + + D PES VDWR KGAV PVKDQG CGSC
Sbjct: 80 FNAVMKGYKKGPRPAAVFTST----------DAAPESTEVDWRTKGAVTPVKDQGQCGSC 129
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDC--DRKINAGCNGGLMDYAFQFIIQNGGMD 223
WAFST +EG + + TG L+SLSEQ+LVDC N GCNGG ++ A ++ NGG+D
Sbjct: 130 WAFSTTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVD 189
Query: 224 SEQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
+E YPY +N C R N+ + + GY ++ E +LK A D P+SVAI+A
Sbjct: 190 TESSYPYEARDNTC---RFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASH 246
Query: 281 RAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
R+FQ Y +GV + C S+ LDH V+AVGYG+E G D+WLV+NSW + WGE+GY+K+ RN
Sbjct: 247 RSFQSYYTGVYYEPSCSSSQLDHAVLAVGYGSEGGQDFWLVKNSWATSWGESGYIKMARN 306
Query: 339 LLDTNTGKCGIAMEASYPV 357
CGIA +A YP
Sbjct: 307 ----RNNNCGIATDACYPT 321
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 194/319 (60%), Gaps = 25/319 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
+ +W ++HGK+ + +R I+++NLR I++HN N T+K+G+N+F D+TNEE
Sbjct: 28 WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEE 86
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
+R G + D R + + P+ VDWR++G V PVKDQ CGSCW+
Sbjct: 87 FRQAMNGYKQDPNRTSKGALFMEPSFFAA-----PQQVDWRQRGYVTPVKDQKQCGSCWS 141
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS+ A+EG TG+LIS+SEQ LVDC R + N GCNGG+MD AFQ++ +N G+DSEQ
Sbjct: 142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQ 201
Query: 227 DYPYLGAENKCDPSRRNAK--VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF 283
YPYL ++ P R + + V I G+ D+ +E++L AVA PVSVAI+A ++
Sbjct: 202 SYPYLARDDL--PCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSL 259
Query: 284 QHYESGVFTGE-CGSALDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQRN 338
Q Y+SG++ C S LDH V+ VGYG + G YW+V+NSW WG+ GY+ + ++
Sbjct: 260 QFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKD 319
Query: 339 LLDTNTGKCGIAMEASYPV 357
CGIA ASYP+
Sbjct: 320 ----KNNHCGIATMASYPL 334
>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
Length = 342
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 147/361 (40%), Positives = 212/361 (58%), Gaps = 34/361 (9%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
S L L +IS A+ + + +D SS E+ ++ + +GK+ + M +
Sbjct: 4 FSLLCILTWISVE-ASSLKFQPLRHQNDVMSS-----ELNELWTEYKETYGKSYD-MKED 56
Query: 70 EKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
R +++ NLR I HN + ++ +G+N+ +DLT EYR LG R R K
Sbjct: 57 VVRRSLWEGNLRHISMHNVKHDLGKHSFSMGINELSDLTPSEYRQR-LGLRPALGERTGK 115
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
V + G+++PE VDWR+KG V PVK+QG+CGSCWAFS+ ++EG + +TG+L
Sbjct: 116 KFVYN-------GEKVPEHVDWRDKGYVTPVKNQGACGSCWAFSSTGSLEGQHFRLTGQL 168
Query: 186 ISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC----DPS 240
+SLSEQ LVDC +K NAGCNGG MD AF ++ N G+D+E YPY G ++ C P
Sbjct: 169 VSLSEQNLVDCTKKYGNAGCNGGWMDNAFNYVKANNGIDTEAFYPYEGHDDWCGYDGSPG 228
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVF--TGECGS 297
+ A G+ DV DE++LK+AVA PVSV I+A R+FQ Y+SG++ S
Sbjct: 229 HKGANCT---GHVDVQQGDELALKQAVATVGPVSVGIDATHRSFQLYKSGIYDEVACSNS 285
Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ DH V+ VGYG++ G DYWLV+NSWG+ WG +GY+ + RN +C IA ASYP
Sbjct: 286 STDHAVLVVGYGSQGGHDYWLVKNSWGTSWGMDGYIMMSRN----KGNQCAIASYASYPT 341
Query: 358 K 358
+
Sbjct: 342 E 342
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 196/324 (60%), Gaps = 18/324 (5%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
D VM + T+ +H K R +IF +N I +HN ++K+ +NK+A
Sbjct: 57 DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 116
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQ 159
DL + E+R + G ++L + + + + A LP+SVDWR KGAV VKDQ
Sbjct: 117 DLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQ 176
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS+ A+EG + +G L+SLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 177 GHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 236
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
NGG+D+E+ YPY ++ C ++ V + D G+ D+ DE + +AVA PVSVAI
Sbjct: 237 NGGIDTEKSYPYEAIDDSCHFNK--GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 294
Query: 277 EAGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
+A +FQ Y GV+ +C + LDHGV+ VG+GT E+G DYWLV+NSWG+ WG+ G++
Sbjct: 295 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 354
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
K+ RN +CGIA +SYP+
Sbjct: 355 KMLRN----KENQCGIASASSYPL 374
>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
Length = 333
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/331 (43%), Positives = 197/331 (59%), Gaps = 26/331 (7%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYK 94
SS+ D + + W A H + GM E R +++ N++ I+ E+N ++
Sbjct: 16 SSALTFDRSLEAQWIKWKAMHNRLY-GMNEEEWRRAVWEKNMKMIELHNHEYNQGKHSFT 74
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ +N F D+TNEE+R + G ++ R+ KV + E P SVDWREKG V
Sbjct: 75 MAMNAFGDMTNEEFRQVMNGFQN---RKPRNGKVFQEPLF----HEAPRSVDWREKGYVT 127
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAF 213
PVK+QG CGSCWAFS A+EG TG+L+SLSEQ LVDC + N GC+GGLMDYAF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNQGCDGGLMDYAF 187
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
Q++ +NGG+DSE+ YPY E C + + V + G+ D+ P E +L KAVA P+
Sbjct: 188 QYVQENGGLDSEESYPYEATEESCKYNPEYS-VANDTGFVDI-PKLEKALMKAVATVGPI 245
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE-NGVD---YWLVRNSWGSD 326
SVAI+AG +FQ Y+ G+ F EC S +DHGV+ VGYG E G D YWLV+NSWG
Sbjct: 246 SVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERTGSDNSKYWLVKNSWGEK 305
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG +GY+K+ ++ CGIA ASYP
Sbjct: 306 WGMDGYIKMAKD----RKNHCGIASAASYPT 332
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 196/324 (60%), Gaps = 18/324 (5%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
D VM + T+ +H K R +IF +N I +HN ++K+ +NK+A
Sbjct: 53 DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 112
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQ 159
DL + E+R + G ++L + + + + A LP+SVDWR KGAV VKDQ
Sbjct: 113 DLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQ 172
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS+ A+EG + +G L+SLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 173 GHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 232
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
NGG+D+E+ YPY ++ C ++ V + D G+ D+ DE + +AVA PVSVAI
Sbjct: 233 NGGIDTEKSYPYEAIDDSCHFNK--GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290
Query: 277 EAGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
+A +FQ Y GV+ +C + LDHGV+ VG+GT E+G DYWLV+NSWG+ WG+ G++
Sbjct: 291 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 350
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
K+ RN +CGIA +SYP+
Sbjct: 351 KMLRN----KENQCGIASASSYPL 370
>gi|312100382|gb|ADQ27799.1| mitogenic proteinase [Vasconcellea cundinamarcensis]
Length = 214
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 115/220 (52%), Positives = 152/220 (69%), Gaps = 7/220 (3%)
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PES+DWR+KGAV PVKDQ CGSCWAFSTVA VEGINKIVTG+LISLSEQEL+DCDR+ +
Sbjct: 2 PESIDWRQKGAVTPVKDQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-S 60
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGG + Q+++ N G+ +E +YPY + C + V I GY+ V P DE+
Sbjct: 61 HGCNGGYQTTSLQYVVDN-GVHTEYEYPYEKKQGNCRAKDKKGLKVQITGYKRVPPNDEI 119
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL K +A+QPVSV IE+ R+F Y G++ G CG+ LDH V A+GYG DY L++N
Sbjct: 120 SLIKVIANQPVSVLIESKDRSFHFYRGGIYKGPCGTRLDHAVTAIGYGK----DYILIKN 175
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
SWG +WGE GY++++R + G CG+ + +P+K Q
Sbjct: 176 SWGPNWGEKGYIRIKR-ASGKSEGICGVYKSSYFPIKGYQ 214
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/327 (42%), Positives = 198/327 (60%), Gaps = 26/327 (7%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKF 100
D ++ ++ W + H K + +R +++ NL+ I+ EH+ +Y++G+N F
Sbjct: 21 DPQLDDHWELWKSWHSKKYHEKEEGWRRM-VWEKNLKKIELHNLEHSMGTHSYRLGMNHF 79
Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
D+T+EE+R + G + A+ + S + E P+SVDWR+ G V PVKDQG
Sbjct: 80 GDMTHEEFRQLMNGYKRKAETKARGSLFLEPNFL-----EAPKSVDWRDNGYVTPVKDQG 134
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
CGSCWAFST A+EG + TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++ N
Sbjct: 135 QCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDN 194
Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
G+DSE YPYLG +++ P + S++ G+ D+ E +L KAVA PVSVAI
Sbjct: 195 QGLDSEDSYPYLGTDDQ--PCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVSVAI 252
Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
+AG +FQ Y+SG+ + EC S LDHGV+ VGYG + +G YW+V+NSW WG+
Sbjct: 253 DAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVDGKKYWIVKNSWSEKWGDK 312
Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
GY+ + ++ CGIA ASYP+
Sbjct: 313 GYIYMAKD----RKNHCGIATAASYPL 335
>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
Length = 344
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/302 (45%), Positives = 183/302 (60%), Gaps = 21/302 (6%)
Query: 72 RFQIFKDNLRFIDEHNSLNR----TYKVGLNKF---ADLTNEEYRAMYLGTRSDAKRR-- 122
R +I+ ++ I +HN +YK+G+N + D+ + E+ G AK
Sbjct: 47 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVKTMNGFNKTAKHNKN 106
Query: 123 --LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+ V ++ A +LPE VDWR+ GAV +KDQG CGSCW+FST A+EG +
Sbjct: 107 LYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFR 166
Query: 181 VTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
+G L+SLSEQ L+DC + N GCNGGLMD AF++I NGG+D+EQ YPY G ++KC
Sbjct: 167 QSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKCRY 226
Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT-GECGS 297
+ +N + G+ D+ DE L +AVA PVSVAI+A FQ Y SGV+ EC S
Sbjct: 227 NPKNTGAEDV-GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSS 285
Query: 298 A-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
LDHGV+ VGYGT E GVDYWLV+NSWG WGE GY+K+ RN +CGIA ASY
Sbjct: 286 TDLDHGVLVVGYGTDEQGVDYWLVKNSWGRSWGELGYIKMIRN----KNNRCGIASSASY 341
Query: 356 PV 357
P+
Sbjct: 342 PL 343
>gi|129353|sp|P22895.1|P34_SOYBN RecName: Full=P34 probable thiol protease; Flags: Precursor
Length = 379
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 135/345 (39%), Positives = 207/345 (60%), Gaps = 28/345 (8%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNK 99
T +V +++Q W ++HG+ + KR +IFK+N +I + N+ NR ++++GLNK
Sbjct: 36 TQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNA-NRKSPHSHRLGLNK 94
Query: 100 FADLTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
FAD+T +E+ YL D + ++ K+ ++Y+C D P S DWR+KG + VK
Sbjct: 95 FADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSC---DHPPASWDWRKKGVITQVK 151
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
QG CG WAFS A+E + I TG+L+SLSEQELVDC + + G G +F++++
Sbjct: 152 YQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNGWQYQSFEWVL 210
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD-------EMSLKKAVADQ 270
++GG+ ++ DYPY E +C ++ K V+IDGYE + D E + A+ +Q
Sbjct: 211 EHGGIATDDDYPYRAKEGRCKANKIQDK-VTIDGYETLIMSDESTESETEQAFLSAILEQ 269
Query: 271 PVSVAIEAGGRAFQHYESGVFTGE-CGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
P+SV+I+A + F Y G++ GE C S ++H V+ VGYG+ +GVDYW+ +NSWG DW
Sbjct: 270 PISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAKNSWGFDW 327
Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN---SAKPKPH 369
GE+GY+ +QRN + G CG+ ASYP K SA+ K H
Sbjct: 328 GEDGYIWIQRNTGNL-LGVCGMNYFASYPTKEESETLVSARVKGH 371
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 141/312 (45%), Positives = 184/312 (58%), Gaps = 20/312 (6%)
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMY 112
A HGK R +I+ +N I HN +YK+ +N+F DL + E+
Sbjct: 32 ALHGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDLLHHEF---- 87
Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDE---LPESVDWREKGAVNPVKDQGSCGSCWAFS 169
+ TR+ KR S + G E LP++VDWR+KGAV PVK+QG CGSCWAFS
Sbjct: 88 VSTRNGFKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVKNQGQCGSCWAFS 147
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
T ++EG + T +L+SLSEQ LVDC R N GC GGLMD AF++I N G+D+E Y
Sbjct: 148 TTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYIKSNKGIDTEWSY 207
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
PY + C +R + G+ D+ DE LKKAVA PVSVAI+A +FQ Y
Sbjct: 208 PYNATDGVCHFNRSDVGATDT-GFVDIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYS 266
Query: 288 SGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV+ EC S LDHGV+ VGYGT++G DYWLV+NSWG+ WG+ GY+ + RN
Sbjct: 267 EGVYDEPECSSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEGYIYMTRN----KDN 322
Query: 346 KCGIAMEASYPV 357
+CGIA ASYP+
Sbjct: 323 QCGIASSASYPL 334
>gi|66475996|ref|XP_627814.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|32399065|emb|CAD98305.1| cryptopain precursor [Cryptosporidium parvum]
gi|46229218|gb|EAK90067.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|76160841|gb|ABA40395.1| cryptopain-1 [Cryptosporidium parvum]
Length = 401
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 144/369 (39%), Positives = 205/369 (55%), Gaps = 25/369 (6%)
Query: 8 LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDD-------EVMTIYQTWLAKHG 60
+ I+TL+ +F + +S+ +N D + D E ++ + K+
Sbjct: 38 IIIATLIAIFIVL---VVTVSLYITNNTSDKIDDFVPGDYVDPATREYRKSFEEFKKKYH 94
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + M +RF+I+K N+ FI NS +Y + +N+F DL+ EE+ A + G D+K
Sbjct: 95 KVYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARFTGYIKDSK 154
Query: 121 --RRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
R+ KS S A ++ +E P S++W E G VNP+++Q +CGSCWAFS VAA+EG
Sbjct: 155 DDERVFKSSRVS---ASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEG 211
Query: 177 INKIVTGE-LISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
T L SLSEQ+ VDC ++ N GC+GG M AFQ+ I+N + + DYPY E
Sbjct: 212 ATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDDYPYFAEE 271
Query: 235 NKC-DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT 292
C D N + + Y+ V P + +LK A+A P+SVAI+A FQ Y+SGVF
Sbjct: 272 KTCMDSFCENYIEIPVKAYKYVFPRNINALKTALAKYGPISVAIQADQTPFQFYKSGVFD 331
Query: 293 GECGSALDHGVVAVGYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
CG+ ++HGVV VGY + +YWLVRNSWG WGE GY+KL L G CGI
Sbjct: 332 APCGTKVNHGVVLVGYDMDEDTNKEYWLVRNSWGEAWGEKGYIKLA--LHSGKKGTCGIL 389
Query: 351 MEASYPVKN 359
+E YPV N
Sbjct: 390 VEPVYPVIN 398
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 191/312 (61%), Gaps = 14/312 (4%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRA 110
++ + A+H K R IF++N +FI++HNS + +G+N F DLTN+EYR
Sbjct: 81 WENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTNKEYRE 140
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
YLG R S + S+ + +++P+ +DWR++G V PVK+QG CGSCWAFS
Sbjct: 141 RYLGYRRPENTPSKASYIFSR---AEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSA 197
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
V ++EG + TG+L+SLSEQ LVDC + N+GCNGG MD AF+++ N G+D+E YP
Sbjct: 198 VGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGIDTEDSYP 257
Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV-ADQPVSVAIEAGGRAFQHYES 288
Y+G + C ++ ++ G+ DV DE +L++AV PVSVAI+A FQ Y
Sbjct: 258 YVGTDGSCHFKNKSIG-ATLKGFMDVKEGDEEALRQAVGVAGPVSVAIDASSMLFQFYRG 316
Query: 289 GVFTGE-CG-SALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
GV+ C S LDHGV+ VGYG + G D+W+V+NSWG WG GY+++ RN
Sbjct: 317 GVYNVPWCSTSELDHGVLVVGYGKQFQGKDFWMVKNSWGVGWGIYGYIEMSRN----KGN 372
Query: 346 KCGIAMEASYPV 357
+CGIA +AS P
Sbjct: 373 QCGIASKASIPT 384
>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
Length = 301
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 135/300 (45%), Positives = 186/300 (62%), Gaps = 25/300 (8%)
Query: 72 RFQIFKDNLRFID----EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R +++ NL+ I+ EH+ +Y++G+N F D+T+EE+R + G + +R+ S
Sbjct: 12 RRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGYKRKPQRKFTGSL 71
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+ E P +VDWR+ G V PVKDQG CGSCWAFST A+EG + TG+L+S
Sbjct: 72 FMEPNFL-----EAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVS 126
Query: 188 LSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ LVDC R + N GCNGGLMD AFQ+I N G+DSE YPYLG +++ P + K
Sbjct: 127 LSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQ--PCHYDPKY 184
Query: 247 VSID--GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGS-ALDH 301
S + G+ D+ E +L KAVA PVSVAI+AG +FQ Y+SG+ + +C S LDH
Sbjct: 185 NSANDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDH 244
Query: 302 GVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
GV+ VGYG E +G YW+V+NSW WG+ GY+ + ++ CGIA ASYP+
Sbjct: 245 GVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKD----RKNHCGIATAASYPL 300
>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
Length = 359
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 187/314 (59%), Gaps = 23/314 (7%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
+ + +HGK+ ++RF+IF ++L + N +YK+G+N+F+D+T EE++A
Sbjct: 58 FARFAVRHGKSYGSAAEVQRRFRIFSESLDEVRSTNRKGLSYKLGINRFSDMTWEEFQAT 117
Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
LG L + + + + LPE+ DWRE G V+PVKDQ SCGSCW FST
Sbjct: 118 KLGAAQTCSATLAGNHLM------RDANALPETKDWRETGIVSPVKDQASCGSCWTFSTT 171
Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
A+E TG+ ISLSEQ+LVDC N GCNGGL AF++I NGG+D+E+ YPY
Sbjct: 172 GALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGIDTEESYPY 231
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESG 289
G C NA V D +++ E LK AV +PVSVA E F+ Y+SG
Sbjct: 232 KGVNGVCKYRPENAAVQVADSV-NITLNAEDELKNAVGLVRPVSVAFEV-IDGFKQYKSG 289
Query: 290 VFTGE-CGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ--RNLLDTN 343
V+T + CG+ D H V+AVGYG ENGV YWL++NSWG+DWGE+GY K++ +N+
Sbjct: 290 VYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGEDGYFKMEMGKNM---- 345
Query: 344 TGKCGIAMEASYPV 357
C +A ASYP+
Sbjct: 346 ---CAVATCASYPI 356
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 202/329 (61%), Gaps = 27/329 (8%)
Query: 45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
D + T ++ W + HGK+ +R +++ +LR I+ HN SL + ++++G+N F
Sbjct: 22 DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRVIEIHNLEHSLGKHSFRLGMNHF 80
Query: 101 ADLTNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
D+ NEE+R + G + ++L S + E+P+ VDWR++G V PVKDQ
Sbjct: 81 GDMPNEEFRQLMNGYKYKQTHKKLQGSHFLEPNFL-----EVPKHVDWRDEGYVTPVKDQ 135
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQ 218
G CGSCWAFST A+EG + TG+L+SLSEQ LV+C + + N GCNGGLMD AFQ++
Sbjct: 136 GQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKD 195
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVA 275
NGG+DSE YPY+G ++ P N + + + G+ D+ E +L KA+A PVSVA
Sbjct: 196 NGGIDSEDSYPYVGTDDT--PCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVA 253
Query: 276 IEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
I+AG +FQ Y+SG+ F EC S LDHGV+ VGYG E +G YW+V+NSW G+
Sbjct: 254 IDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKLGQ 313
Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NGY+ + ++ CGIA ASYP++
Sbjct: 314 NGYILMAKD----KDNHCGIATAASYPLE 338
>gi|75994626|gb|ABA33834.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 124/252 (49%), Positives = 172/252 (68%), Gaps = 21/252 (8%)
Query: 51 IYQTWLAKHGK--TSN-----GMGHNEK----RFQIFKDNLRFIDEHNSLN----RTYKV 95
+Y+ W +KHG+ +SN G E+ R ++F+DNLR+ID+HN+ T+++
Sbjct: 1 MYEAWKSKHGRGGSSNDDCDIAPGEEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRL 60
Query: 96 GLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP 155
GL FADLT +EYR LG R+ +R + R GD LP+++DWR+ GAV
Sbjct: 61 GLTPFADLTLDEYRGRVLGFRARGRRSGHGYRARRPR----GGDLLPDAIDWRQLGAVTE 116
Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
VKDQ CG CWAFS VAA+EG+N I TG L+SLSEQE++DCD + ++GC+GG M+ AF+F
Sbjct: 117 VKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRF 175
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSV 274
+I NGG+DSE DYP++G + CD S+ +N KV +IDG +V +E +L++AVA QPVSV
Sbjct: 176 VIGNGGIDSEADYPFIGTDGTCDASKEKNEKVATIDGLVEVVSNNETALQEAVAIQPVSV 235
Query: 275 AIEAGGRAFQHY 286
AI+A GRAFQHY
Sbjct: 236 AIDASGRAFQHY 247
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 190/312 (60%), Gaps = 17/312 (5%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN--SLNRTYKVGLNKFADLTNEEYR 109
+Q W K+ K +R I++ N +F++ HN S + V +N+FADL E+
Sbjct: 24 FQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGEFG 83
Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
++ G S S +G ++P++VDW+EKGAV P+K+QG CGSCW+FS
Sbjct: 84 RIFNGLLP------RPSSYNSTNIYKPSGVKVPDTVDWKEKGAVTPIKNQGQCGSCWSFS 137
Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
+ ++EG + I TG L+SLSEQ+L+DC K N GCNGGLMD +F+++ G ++E +Y
Sbjct: 138 STGSLEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNY 197
Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
PY AEN + VV+ Y D+ DE SLK AVA+ P+SVAI+A +FQ Y
Sbjct: 198 PYT-AENGVCRYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQLYN 256
Query: 288 SGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
SGV + C S LDHGV+A+GYGTE+G DYWLV+NSWG+ WG GY+K+ RN
Sbjct: 257 SGVYYASTCSSTQLDHGVLAIGYGTEDGKDYWLVKNSWGTSWGMEGYIKMSRN----RNN 312
Query: 346 KCGIAMEASYPV 357
CGIA +ASYP
Sbjct: 313 NCGIATQASYPT 324
>gi|52546918|gb|AAU81592.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 196
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 114/190 (60%), Positives = 141/190 (74%), Gaps = 2/190 (1%)
Query: 184 ELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN 243
+L+SLSEQELVDCD N GCNGGLMD AF FI + GG+ +E++YPY+ A+ KCD +RN
Sbjct: 4 KLVSLSEQELVDCDNGENQGCNGGLMDLAFDFIKKKGGITTEENYPYMAADGKCDLKKRN 63
Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
VVSIDG+EDV P DE SL KAVA+QPVSVAIEA G FQ Y GVFTG+CG+ LDHGV
Sbjct: 64 TPVVSIDGHEDVPPNDEESLLKAVANQPVSVAIEASGSDFQFYSEGVFTGDCGTELDHGV 123
Query: 304 VAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
VGYGT +G YW VRNSWG +WGE GY+++QR+ +D G CGIAM+ SYP+K S +
Sbjct: 124 AIVGYGTTLDGTKYWTVRNSWGPEWGEKGYIRMQRD-IDAEEGLCGIAMQPSYPIKTSSD 182
Query: 363 SAKPKPHSSA 372
+ P ++
Sbjct: 183 NPTGTPAATP 192
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 149/358 (41%), Positives = 205/358 (57%), Gaps = 41/358 (11%)
Query: 12 TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
TL+ F ++A ++ NH + W W A H + G E
Sbjct: 4 TLILAAFCLGLASAALTF-----NHSLEAQWIK----------WKAMHNRLY-GKNEEEW 47
Query: 72 RFQIFKDNLRFID----EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
R +++ N++ I+ E+N ++ + +N F D+TNEE+R + G ++ R+ K
Sbjct: 48 RRAVWEKNMKTIELHNHEYNQGKHSFTMAMNTFGDMTNEEFRQVMNGFQN---RKPRNGK 104
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
V + E P SVDWREKG V PVK+QG CGSCWAFS A+EG TG+L+S
Sbjct: 105 VFQEPLL----HEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVS 160
Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
LSEQ LVDC + N GCNGGLMDYAFQ++ +NGG+DSE+ YPY E C + + + V
Sbjct: 161 LSEQNLVDCSGPQGNQGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPKYS-V 219
Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGV 303
+ G+ D+ P E +L KAVA P+SVAI+AG +FQ Y+ G+ F EC S +DHGV
Sbjct: 220 ANDTGFVDI-PKLEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGV 278
Query: 304 VAVGYGTE-NGVD---YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
+ VGYG E G D YWLV+NSWG +WG +GY+K+ ++ CGIA ASYP
Sbjct: 279 LVVGYGFERTGSDNSKYWLVKNSWGEEWGMDGYIKMAKD----RKNHCGIASAASYPT 332
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.131 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,956,069,429
Number of Sequences: 23463169
Number of extensions: 256869948
Number of successful extensions: 643337
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6724
Number of HSP's successfully gapped in prelim test: 887
Number of HSP's that attempted gapping in prelim test: 612730
Number of HSP's gapped (non-prelim): 9208
length of query: 372
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 228
effective length of database: 8,980,499,031
effective search space: 2047553779068
effective search space used: 2047553779068
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)