BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017548
(369 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 616 bits (1589), Expect = e-174, Method: Compositional matrix adjust.
Identities = 288/354 (81%), Positives = 321/354 (90%), Gaps = 5/354 (1%)
Query: 16 SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
S +AS V+ ND DD +IRQVV SDGE D LLNAEHHF+ FKSKF KTYATQEEHDYRF
Sbjct: 17 SAVASTVSSNDLDDPLIRQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRF 72
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
VFKANLRRAK+ Q++DPTA HG+TKFSDLTP EFRRQFLGL R LRLP DA KAPILPT
Sbjct: 73 GVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKEFRRQFLGLKRWLRLPTDANKAPILPT 132
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
DLPTD+DWRDHGAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 133 TDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDH 192
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDPEE G+CDSGC+GGLMN+AFEY LKAGG+ERE+DYPYTGTDGG+CKFDKSK+ A+VS
Sbjct: 193 ECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVS 252
Query: 255 NFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSS 314
NFSV+S DEDQ+AANLVKHGPL+V INA +MQTY+GGVSCPYIC K DHGVL+VGYGS+
Sbjct: 253 NFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQDHGVLLVGYGSA 312
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
G+APIRFKEKP+WIIKNSWG+NWGENGYYKIC GRN+CGVDSMVS+VAAIHTT+
Sbjct: 313 GYAPIRFKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAAIHTTA 366
>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
Length = 367
Score = 616 bits (1588), Expect = e-174, Method: Compositional matrix adjust.
Identities = 288/354 (81%), Positives = 320/354 (90%), Gaps = 5/354 (1%)
Query: 16 SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
S +AS V+ ND DD +IRQVV SDGE D LLNAEHHF+ FKSKF KTYATQEEHDYRF
Sbjct: 17 SAVASTVSSNDLDDPLIRQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRF 72
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
VFKANLRRAK+ Q++DPTA HG+TKFSDLTP EFRRQFLGL R LRLP DA KAPILPT
Sbjct: 73 GVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKEFRRQFLGLKRWLRLPTDANKAPILPT 132
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
DLPTD+DWRDHGAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 133 TDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDH 192
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDPEE G+CDSGC+GGLMN+AFEY LKAGG+ERE DYPYTGTDGG+CKFDKSK+ A+VS
Sbjct: 193 ECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREADYPYTGTDGGTCKFDKSKVVASVS 252
Query: 255 NFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSS 314
NFSV+S DEDQ+AANLVKHGPL+V INA +MQTY+GGVSCPYIC K DHGVL+VGYGS+
Sbjct: 253 NFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQDHGVLLVGYGSA 312
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
G+APIRFKEKP+WIIKNSWG+NWGENGYYKIC GRN+CGVDSMVS+VAAIHTT+
Sbjct: 313 GYAPIRFKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAAIHTTA 366
>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
Length = 368
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 287/368 (77%), Positives = 329/368 (89%), Gaps = 5/368 (1%)
Query: 2 ERLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
R ++S L+ LLS +AS + ++ DD +IRQVVP DG+Q DHLLNAEHHF+ FK+KF
Sbjct: 3 RRCLISFLVYALLSFTIASTTSPDELDDPLIRQVVP-DGDQ--DHLLNAEHHFTTFKAKF 59
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
KTYATQEEHDYRF++FKANLRRA++ Q++DPTAVHGVT FSDLTP EFRRQ+LGL RRL
Sbjct: 60 GKTYATQEEHDYRFKLFKANLRRARKHQMMDPTAVHGVTMFSDLTPREFRRQYLGL-RRL 118
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
RLPADA +APILPTNDLPTDFDWRDHGAVT VK+QG+CGSCWSFSA GALEGAHFL+TGE
Sbjct: 119 RLPADAHEAPILPTNDLPTDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLATGE 178
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPEE G+CDSGCNGGLM +AFEY LKAGG+ERE+DYPYTG D G
Sbjct: 179 LVSLSEQQLVDCDHECDPEEYGACDSGCNGGLMTTAFEYTLKAGGLEREEDYPYTGNDRG 238
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
CKFD++KI A+VSNFSV+S DEDQ+AANLVKHGPLAVGINAV+MQTY+GGVSCPYIC K
Sbjct: 239 PCKFDRNKIVASVSNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQTYMGGVSCPYICSK 298
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
DHGVL+VGYGS+G+APIR K+KP+WIIKNSWGE+WGENGYY+IC GRN+CGVD+MVSS
Sbjct: 299 RQDHGVLLVGYGSAGYAPIRLKDKPFWIIKNSWGESWGENGYYRICRGRNICGVDAMVSS 358
Query: 361 VAAIHTTS 368
VAAIH S
Sbjct: 359 VAAIHPNS 366
>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 367
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 287/354 (81%), Positives = 319/354 (90%), Gaps = 5/354 (1%)
Query: 16 SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
S +AS V+ D DD +I QVV SDGE D LLNAEHHF+ FKSKF KTYATQEEHDYRF
Sbjct: 17 SAVASTVSSTDLDDPLIIQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRF 72
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
VFKANLRRAK+ Q++DPTA HGVTKFSDLTP EFRRQFLGL RRLRLP DA KAPILPT
Sbjct: 73 GVFKANLRRAKKHQMIDPTAAHGVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKAPILPT 132
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
DLPTD+DWRDHGAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 133 TDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDH 192
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDPEE G+CDSGC+GGLMN+AFEY LKAGG+ERE+DYPYTGTDGG+CKFDKSK+ A+VS
Sbjct: 193 ECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVS 252
Query: 255 NFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSS 314
NFSV+S DEDQ+AANLVKHGPL+V INA +MQTY+GGVSCPYIC K DHGVL+VGYGS+
Sbjct: 253 NFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQDHGVLLVGYGSA 312
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
G+APIRFKEKP+WIIKNSWG+NWGENGYYKIC GRN+CGVDSMVS+VAAIHT +
Sbjct: 313 GYAPIRFKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAAIHTAA 366
>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
Length = 367
Score = 610 bits (1573), Expect = e-172, Method: Compositional matrix adjust.
Identities = 283/342 (82%), Positives = 313/342 (91%), Gaps = 4/342 (1%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +I QVV SDGE D LLNAEHHF+ FKSKF KTYATQEEHDYRF VFKANLRRAK+
Sbjct: 29 DDPLIIQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK 84
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q++DPTA HGVTKFSDLTP EFRRQFLGL RRLRLP DA KAPILPT DLPTD+DWRDH
Sbjct: 85 HQMIDPTAAHGVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKAPILPTTDLPTDYDWRDH 144
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDHECDPEE G+CDS
Sbjct: 145 GAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDS 204
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GC+GGLMN+AFEY LKAGG+ERE+DYPYTGTDGG+CKFDKSK+ A+VSNFSV+S DEDQ+
Sbjct: 205 GCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQI 264
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
AANLVKHGPL+V INA +MQTY+GGVSCPYIC K DHGVL+VGYGS+G+APIRFKEKP+
Sbjct: 265 AANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQDHGVLLVGYGSAGYAPIRFKEKPF 324
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
WIIKNSWG+NWGENGYYKIC GRN+CGVDSMVS+VAAIHTT+
Sbjct: 325 WIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAAIHTTA 366
>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 366
Score = 601 bits (1550), Expect = e-169, Method: Compositional matrix adjust.
Identities = 286/369 (77%), Positives = 325/369 (88%), Gaps = 5/369 (1%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDD-AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSK 59
M L + LLL S+ +A+ ++D+D +IRQVVP + + HLLNAEHHFS FK+K
Sbjct: 1 MANLSILFFGLLLFSAAVATVERIDDEDNLLIRQVVP---DAEDHHLLNAEHHFSAFKTK 57
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
F+KTYATQEEHD+RFR+FK NL RAK Q LDP+AVHGVT+FSDLTPSEFR QFLGL +
Sbjct: 58 FAKTYATQEEHDHRFRIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPSEFRGQFLGL-KP 116
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
LRLP+DAQKAPILPT+DLPTDFDWRDHGAVTGVK+QG+CGSCWSFSA GALEGAHFLSTG
Sbjct: 117 LRLPSDAQKAPILPTSDLPTDFDWRDHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLSTG 176
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
LVSLSEQQLVDCDHECDPEE G+CDSGCNGGLM +AFEY LKAGG+ RE+DYPYTG D
Sbjct: 177 GLVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLKAGGLMREEDYPYTGRDR 236
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG 299
G CKFDKSKIAA+V+NFSV+S DE+Q+AANLVK+GPLAVGINAV+MQTYIGGVSCPYICG
Sbjct: 237 GPCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVGINAVFMQTYIGGVSCPYICG 296
Query: 300 KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
K+LDHGVL+VGYGS +APIRFKEKPYWIIKNSWGE+WGE GYYKIC GRNVCGVDSMVS
Sbjct: 297 KHLDHGVLLVGYGSGAYAPIRFKEKPYWIIKNSWGESWGEEGYYKICRGRNVCGVDSMVS 356
Query: 360 SVAAIHTTS 368
+VAAIH ++
Sbjct: 357 TVAAIHVSN 365
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 600 bits (1548), Expect = e-169, Method: Compositional matrix adjust.
Identities = 281/355 (79%), Positives = 321/355 (90%), Gaps = 5/355 (1%)
Query: 16 SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
+ +A+A ++D DD +IRQVVP + + HLLNAEHHFS FK+KF KTYATQEEHD+RF
Sbjct: 16 ATVAAAERIDDEDDLLIRQVVP---DAEDHHLLNAEHHFSAFKTKFGKTYATQEEHDHRF 72
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
R+FK NL RAK Q LDP+AVHGVT+FSDLTP+EFRRQFLGL + LRLP+DAQKAPILPT
Sbjct: 73 RIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPAEFRRQFLGL-KPLRLPSDAQKAPILPT 131
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
NDLPTDFDWR+HGAVTGVK+QG+CGSCWSFSA GALEGAHFLSTGELVSLSEQQLVDCDH
Sbjct: 132 NDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDH 191
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDPEE G+CDSGCNGGLM +AFEY L+AGG+ REKDYPYTG D G CKFDKSK+AA+V+
Sbjct: 192 ECDPEERGACDSGCNGGLMTTAFEYTLQAGGLMREKDYPYTGRDRGPCKFDKSKVAASVA 251
Query: 255 NFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSS 314
NFSV+S DE+Q+AANLV++GPLAVGINAV+MQTYIGGVSCPYICGK+LDHGVL+VGYGS
Sbjct: 252 NFSVVSLDEEQIAANLVQNGPLAVGINAVFMQTYIGGVSCPYICGKHLDHGVLLVGYGSG 311
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTSS 369
+APIRFKEKPYWIIKNSWGE+WGE GYYKIC GRNVCGVDSMVS+VAAIH +++
Sbjct: 312 AYAPIRFKEKPYWIIKNSWGESWGEEGYYKICRGRNVCGVDSMVSTVAAIHVSNN 366
>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 597 bits (1540), Expect = e-168, Method: Compositional matrix adjust.
Identities = 278/370 (75%), Positives = 328/370 (88%), Gaps = 7/370 (1%)
Query: 1 MERLILSSLLLL--LLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
MER SL++ L SS+L +A + DD +IRQVVP ED+LL+A+HHF+ FK+
Sbjct: 1 MERSCFLSLIVFAFLSSSILFTATSDELDDPLIRQVVPD----VEDYLLSAQHHFTAFKA 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YATQEEHDYRF+VFKANLRRA++ QL+DP+AVHGVTKFSDLTP EFRRQ+LGL +
Sbjct: 57 KFGKNYATQEEHDYRFKVFKANLRRAQKHQLMDPSAVHGVTKFSDLTPREFRRQYLGL-K 115
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LRLPADA +APILPT+ +P DFDWRDHGAVT VK+QG+CGSCWSFSA GALEGAHFL+T
Sbjct: 116 KLRLPADAHEAPILPTDGIPEDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLAT 175
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GELVSLSEQQLVDCDHECDP E G+CDSGCNGGLM +AFEYILKAGG+ERE+DYPYTG+D
Sbjct: 176 GELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLMTNAFEYILKAGGLEREEDYPYTGSD 235
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
G CKF+++KIAA+V+NFSV+S DEDQ+AANLV++GPLAVGINAV+MQTYIGGVSCPYIC
Sbjct: 236 RGPCKFERAKIAASVNNFSVVSVDEDQIAANLVQNGPLAVGINAVFMQTYIGGVSCPYIC 295
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
K DHGV++VGYGS+G+AP+R K+KP+WIIKNSWGENWGENGYYKIC GRNVCGVD+MV
Sbjct: 296 SKRQDHGVVLVGYGSAGYAPVRLKDKPFWIIKNSWGENWGENGYYKICRGRNVCGVDAMV 355
Query: 359 SSVAAIHTTS 368
S+VAAIHTT+
Sbjct: 356 STVAAIHTTA 365
>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
Length = 377
Score = 588 bits (1516), Expect = e-165, Method: Compositional matrix adjust.
Identities = 275/349 (78%), Positives = 315/349 (90%), Gaps = 5/349 (1%)
Query: 25 NDDDAMIRQVVPSDGE---QSEDHLLNAEHH-FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
+DDD +IRQVVP G+ E++LL A+HH FS+FK +F K+YA+QEEHDYRF+VFKAN
Sbjct: 30 SDDDIIIRQVVPELGDVEGSEEENLLTADHHHFSIFKRRFGKSYASQEEHDYRFKVFKAN 89
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
LRRA+R Q LDP+A HGVT+FSDLTP+EFR +LGL R L+LP DAQKAPILPTNDLP D
Sbjct: 90 LRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGL-RPLKLPHDAQKAPILPTNDLPED 148
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
FDWRDHGAVT VK+QG+CGSCWSFS TGALEGA+FL+TG LVSLSEQQLV+CDHECDPEE
Sbjct: 149 FDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEE 208
Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
GSCDSGCNGGLMN+AFEY LKAGG+ +E+DYPYTGTD GSCKFDK+KIAA+VSNFSVIS
Sbjct: 209 MGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRGSCKFDKTKIAASVSNFSVIS 268
Query: 261 SDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIR 320
DEDQ+AANLVK+GPLAV INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGS+G+APIR
Sbjct: 269 LDEDQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKRLDHGVLLVGYGSAGYAPIR 328
Query: 321 FKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTSS 369
K+KPYWIIKNSWGENWGENG+YKIC GRNVCGVDSMVS+VAA+HTTS+
Sbjct: 329 MKDKPYWIIKNSWGENWGENGFYKICRGRNVCGVDSMVSTVAAVHTTSN 377
>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
Length = 377
Score = 586 bits (1510), Expect = e-165, Method: Compositional matrix adjust.
Identities = 275/349 (78%), Positives = 314/349 (89%), Gaps = 5/349 (1%)
Query: 25 NDDDAMIRQVVPSDGE---QSEDHLLNAEHH-FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
+DDD +IRQVVP G+ E++LL A+HH FS+FK +F K+YA+QEEHDYRF+VFKAN
Sbjct: 30 SDDDIIIRQVVPELGDVEGGEEENLLTADHHHFSIFKRRFGKSYASQEEHDYRFKVFKAN 89
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
LRRA+R Q LDP+A HGVT+FSDLTP+EFR +LGL R L+LP DAQKAPILPTNDLP D
Sbjct: 90 LRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGL-RPLKLPHDAQKAPILPTNDLPED 148
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
FDWRDHGAVT VK+QG+CGSCWSFS TGALEGA+FL+TG LVSLSEQQLV+CDHECDPEE
Sbjct: 149 FDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEE 208
Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
GSCDSGCNGGLMN+AFEY LKAGG+ +E+DYPYTGTD GSCKFDK+KIAA+VSNFSVIS
Sbjct: 209 MGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRGSCKFDKTKIAASVSNFSVIS 268
Query: 261 SDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIR 320
DEDQ+AANLVK GPLAV INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGS+G+APIR
Sbjct: 269 LDEDQIAANLVKIGPLAVAINAVFMQTYVGGVSCPYICSKRLDHGVLLVGYGSAGYAPIR 328
Query: 321 FKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTSS 369
K+KPYWIIKNSWGENWGENG+YKIC GRNVCGVDSMVS+VAA+HTTS+
Sbjct: 329 MKDKPYWIIKNSWGENWGENGFYKICRGRNVCGVDSMVSTVAAVHTTSN 377
>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 585 bits (1508), Expect = e-164, Method: Compositional matrix adjust.
Identities = 278/368 (75%), Positives = 318/368 (86%), Gaps = 5/368 (1%)
Query: 3 RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
R L L LL ++ L A +D DD +IRQVV DG+ LLNA+HHF++FK +F
Sbjct: 4 RFSLLFLCTLLATTSLVFAAEDDDGDDVLIRQVV-GDGDGD---LLNADHHFTVFKRRFG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K YA+ EEHDYR VFKAN+RRAKR Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 60 KAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLK 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSCWSFS TGALEGA+FL+TG+L
Sbjct: 120 FPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKL 179
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D
Sbjct: 180 VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQV 239
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY 301
C+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLAV INAV+MQTYIGGVSCPYIC K
Sbjct: 240 CRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKR 299
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
LDHGVL+VGYGS+G+APIR KEKPYWIIKNSWGE+WGENGYYKIC GRNVCGVDSMVS+V
Sbjct: 300 LDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTV 359
Query: 362 AAIHTTSS 369
AA+ TT+S
Sbjct: 360 AAVSTTTS 367
>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
Length = 367
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 275/346 (79%), Positives = 308/346 (89%), Gaps = 6/346 (1%)
Query: 27 DDAMIRQVVP----SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR 82
DD +IRQVVP E+ EDHLLNAEHHF+ FK+KF K YAT+EEHD RF VFK+NLR
Sbjct: 23 DDILIRQVVPDAVGEAAEKEEDHLLNAEHHFASFKAKFGKKYATKEEHDRRFGVFKSNLR 82
Query: 83 RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFD 142
RA+ LDP+AVHGVTKFSDLTP+EFRRQFLG + LRLPA+AQKAPILPT DLP DFD
Sbjct: 83 RARLHAKLDPSAVHGVTKFSDLTPAEFRRQFLGF-KPLRLPANAQKAPILPTKDLPKDFD 141
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WRD GAVT VKDQGACGSCWSFS TGALEGAH+L+TGELVSLSEQQLVDCDH CDPEE G
Sbjct: 142 WRDKGAVTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYG 201
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
+CDSGCNGGLMN+AFEYIL++GGV++EKDYPYTG DG +CKFDK+K+AA VSN+SV+S D
Sbjct: 202 ACDSGCNGGLMNNAFEYILQSGGVQKEKDYPYTGRDG-TCKFDKTKVAATVSNYSVVSLD 260
Query: 263 EDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFK 322
EDQ+AANLVK+GPLAVGINAV+MQTYIGGVSCPYICGK+LDHGVLIVGYG +APIRFK
Sbjct: 261 EDQIAANLVKNGPLAVGINAVFMQTYIGGVSCPYICGKHLDHGVLIVGYGEGAYAPIRFK 320
Query: 323 EKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
KPYWIIKNSWGE+WGENGYYKIC GRNVCGVDSMVS+VAAI+ +S
Sbjct: 321 NKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAIYPSS 366
>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 366
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 274/367 (74%), Positives = 316/367 (86%), Gaps = 5/367 (1%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R L L LL ++ L A + DD +IRQVV G+ LLNA+HHF++FK +F K
Sbjct: 4 RFSLLFLCTLLATTYLVFAAEDDGDDILIRQVVGDGGD-----LLNADHHFTVFKRRFGK 58
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
YA+ EEHDYR VFKAN+RRAK+ Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 59 VYASDEEHDYRLSVFKANMRRAKQHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKF 118
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSCWSFS TGALEGA+FL+TG+LV
Sbjct: 119 PADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLV 178
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D C
Sbjct: 179 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQVC 238
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLAV INAV++QTYIGGVSCPYIC K L
Sbjct: 239 RFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFVQTYIGGVSCPYICSKRL 298
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
DHGVL+VGYGS+G+APIR KEKPYWIIKNSWGE+WGENGYYKIC GRNVCGVDSMVS+VA
Sbjct: 299 DHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVA 358
Query: 363 AIHTTSS 369
A+ TT+S
Sbjct: 359 AVSTTTS 365
>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 278/368 (75%), Positives = 318/368 (86%), Gaps = 5/368 (1%)
Query: 3 RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
R L L LL ++ L A +D DD +IRQVV DG+ LLNA+HHF++FK +F
Sbjct: 4 RFSLLFLCTLLATTSLVFAAEDDDGDDILIRQVV-GDGDGD---LLNADHHFTVFKRRFG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K YA+ EEHDYR VFKAN+RRAKR Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 60 KAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLK 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSCWSFS TGALEGA+FL+TG+L
Sbjct: 120 FPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKL 179
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D
Sbjct: 180 VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQV 239
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY 301
C+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLAV INAV+MQTYIGGVSCPYIC K
Sbjct: 240 CRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKR 299
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
LDHGVL+VGYGS+G+APIR KEKPYWIIKNSWGE+WGENGYYKIC GRNVCGVDSMVS+V
Sbjct: 300 LDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTV 359
Query: 362 AAIHTTSS 369
AA+ TT+S
Sbjct: 360 AAVSTTTS 367
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 583 bits (1503), Expect = e-164, Method: Compositional matrix adjust.
Identities = 276/346 (79%), Positives = 311/346 (89%), Gaps = 4/346 (1%)
Query: 23 AVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR 82
+ + DD +IRQVVP +GE EDHLLNAEHHFS FKSKF KTYAT+EEHD+RF VFK+N+R
Sbjct: 23 STDADDILIRQVVP-EGE-VEDHLLNAEHHFSTFKSKFGKTYATKEEHDHRFGVFKSNMR 80
Query: 83 RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFD 142
RA+ LDP+AVHGVTKFSDLTP+EF R+FLGL + LRLPA AQKAPILPTN+LP DFD
Sbjct: 81 RARLHAQLDPSAVHGVTKFSDLTPAEFHRKFLGL-KPLRLPAHAQKAPILPTNNLPKDFD 139
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WRD GAVT VKDQG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH CDPEE G
Sbjct: 140 WRDKGAVTNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYG 199
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
SCDSGCNGGLMN+AFEY++ +GGV+REKDYPYTG DG +CKFDKSKIAA+VSN+SVIS D
Sbjct: 200 SCDSGCNGGLMNNAFEYLIGSGGVQREKDYPYTGRDG-TCKFDKSKIAASVSNYSVISLD 258
Query: 263 EDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFK 322
E+Q+AANLVK+GPLAV INAV+MQTY+GGVSCPYICGK+LDHGVL+VGYG +APIRFK
Sbjct: 259 EEQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYICGKHLDHGVLLVGYGEGAYAPIRFK 318
Query: 323 EKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
EKPYWIIKNSWGENWG NGYYKIC GRNVCGVDSMVS+V AIH ++
Sbjct: 319 EKPYWIIKNSWGENWGGNGYYKICRGRNVCGVDSMVSTVGAIHAST 364
>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 377
Score = 582 bits (1501), Expect = e-164, Method: Compositional matrix adjust.
Identities = 275/371 (74%), Positives = 324/371 (87%), Gaps = 11/371 (2%)
Query: 10 LLLLLSSVLASAVAV------NDDDAMIRQVVP----SDGEQSEDHLLNAEHHFSLFKSK 59
L+++LS + ASA+ +D D +IRQVV ++G +D LL A+HHFS+FK K
Sbjct: 7 LIVVLSLLAASAIGSEVISGESDGDFIIRQVVDDGGVNEGSNGDDLLLGADHHFSVFKQK 66
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-NR 118
F K+YA++EEHD+RFRVFKANL+RA+R Q LDP+A HGVT+FSDLTPSEFRR FLGL +R
Sbjct: 67 FGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRSR 126
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
RL LPADA KAPILPT+ LPTDFDWRD GAV+ VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 127 RLGLPADANKAPILPTDGLPTDFDWRDKGAVSEVKNQGSCGSCWSFSATGALEGANFLAT 186
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK+GG+ +E+DYPYTGTD
Sbjct: 187 GKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQDYPYTGTD 246
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
G+CKFDKSKIAA+V+NFSV+S DE+Q+AANLVK+GPLAV INAV+MQTYI GVSCPYIC
Sbjct: 247 RGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYIKGVSCPYIC 306
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
K+LDHGVL+VGYGS G+APIR K+KPYWIIKNSWG NWGENGYYKIC GRN+CGVDSMV
Sbjct: 307 SKHLDHGVLLVGYGSDGYAPIRLKDKPYWIIKNSWGANWGENGYYKICRGRNICGVDSMV 366
Query: 359 SSVAAIHTTSS 369
S+VAA+HT ++
Sbjct: 367 STVAAVHTAAN 377
>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 582 bits (1501), Expect = e-164, Method: Compositional matrix adjust.
Identities = 280/342 (81%), Positives = 310/342 (90%), Gaps = 4/342 (1%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +IRQVV + EDHLLNAEHHF+ FKSKF K YATQEEHDYRF VFKANL RAK+
Sbjct: 29 DDPLIRQVV----SEGEDHLLNAEHHFTTFKSKFGKNYATQEEHDYRFSVFKANLLRAKK 84
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q++DPTA HGVTKFSDLTP EFRRQ LGL RRLRLP DA KAPILPT DLPTDFDWRDH
Sbjct: 85 HQIMDPTAAHGVTKFSDLTPKEFRRQLLGLKRRLRLPTDANKAPILPTGDLPTDFDWRDH 144
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG+CGSCWSFSATGALEGAH+L+TGELVSLSEQQLVDCDHECDPEE G+CDS
Sbjct: 145 GAVTSVKDQGSCGSCWSFSATGALEGAHYLATGELVSLSEQQLVDCDHECDPEEYGACDS 204
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GC+GGLMN+AFEY LKAGG+EREKDYPYTG D G+CKF+KSK+AA+VSNFSV+S DEDQ+
Sbjct: 205 GCSGGLMNNAFEYALKAGGLEREKDYPYTGNDRGACKFEKSKVAASVSNFSVVSLDEDQI 264
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
AANLVKHGPL+V INAV+MQTYIGGVSCPYIC K+ DHGVL+VGYG++G+APIRFKEKP+
Sbjct: 265 AANLVKHGPLSVAINAVFMQTYIGGVSCPYICSKHQDHGVLLVGYGAAGYAPIRFKEKPF 324
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
WIIKNSWGENWGENGYYKIC RN+CGVDSMVS+VAAIH T+
Sbjct: 325 WIIKNSWGENWGENGYYKICRARNICGVDSMVSTVAAIHATA 366
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 581 bits (1498), Expect = e-163, Method: Compositional matrix adjust.
Identities = 274/339 (80%), Positives = 308/339 (90%), Gaps = 4/339 (1%)
Query: 30 MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
+IRQVVP +GE EDHLLNAEHHFS FK+KF KTYAT+EEHD+RF VFK+N+RRA+
Sbjct: 30 LIRQVVP-EGE-VEDHLLNAEHHFSTFKAKFGKTYATKEEHDHRFGVFKSNMRRARLHAQ 87
Query: 90 LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
LDP+AVHGVTKFSDLTP+EF R+FLGL + LRLPA AQKAPILPTN+LP DFDWRD GAV
Sbjct: 88 LDPSAVHGVTKFSDLTPAEFHRKFLGL-KPLRLPAHAQKAPILPTNNLPKDFDWRDKGAV 146
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T VKDQG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH CDPEE GSCDSGCN
Sbjct: 147 TNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCN 206
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMN+AFEY++ +GGV+REKDYPYTG DG +CKFDKSKIAA+VSN+SVIS DE+Q+AAN
Sbjct: 207 GGLMNNAFEYLIGSGGVQREKDYPYTGRDG-TCKFDKSKIAASVSNYSVISLDEEQIAAN 265
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLAV INAV+MQTY+GGVSCPYICGK+LDHGVL+VGYG +APIRFKEKPYWII
Sbjct: 266 LVKNGPLAVAINAVYMQTYVGGVSCPYICGKHLDHGVLLVGYGEGAYAPIRFKEKPYWII 325
Query: 330 KNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
KNSWGENWGENGYYKIC GRNVCGVDSMVS+V AIH ++
Sbjct: 326 KNSWGENWGENGYYKICRGRNVCGVDSMVSTVGAIHAST 364
>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 580 bits (1495), Expect = e-163, Method: Compositional matrix adjust.
Identities = 275/368 (74%), Positives = 315/368 (85%), Gaps = 8/368 (2%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
MERL L SLL +L +SA+A +D+D +IRQVV E + HLLNAEHHFSLFKSKF
Sbjct: 1 MERLFLLSLLAFVL---FSSAIAFSDEDPLIRQVV---SETDDSHLLNAEHHFSLFKSKF 54
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA++EEHD+RF+VFKANLRRA+R QLLDP+A HG+TKFSDLTPSEFRR +LGL++
Sbjct: 55 GKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 113
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ +A+KAPILPT+DLP DFDWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 114 KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 173
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPE+ +CD+GC GGLM +AFEY LKAGG++ EKDYPYTG DG
Sbjct: 174 LVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKDG- 232
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
C FDKSKIAAAV+NFSVI DEDQ+AANLVKHGPLAVGINA WMQTY+GGVSCP IC K
Sbjct: 233 KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLICFK 292
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
DHGVL+VGYGS GFAPIR KEK YWIIKNSWGENWGE+GYYKIC G N+CGVD+MVS+
Sbjct: 293 RQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352
Query: 361 VAAIHTTS 368
V A HTT+
Sbjct: 353 VTAAHTTN 360
>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 580 bits (1494), Expect = e-163, Method: Compositional matrix adjust.
Identities = 275/366 (75%), Positives = 314/366 (85%), Gaps = 11/366 (3%)
Query: 8 SLLLLLLSSVLASAVAVNDDDA-----MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
S LL L+ ++L SA + +IRQVVP D LL+AEH F LFK+KF K
Sbjct: 7 SALLFLIPTLLFSAAVSDISSDESDDLLIRQVVPEG-----DDLLSAEHQFGLFKAKFGK 61
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
TY+T EEHDYRF VF+ANLRRA+R QLLDP+AVHGVT+FSDLTP EFRR +LGL + LRL
Sbjct: 62 TYSTVEEHDYRFSVFEANLRRARRHQLLDPSAVHGVTRFSDLTPDEFRRDYLGL-KPLRL 120
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
PADAQKAPILPTNDLPTDFDWRDHGAVT VKDQG+CGSCWSFSA GALEGAHFL+TG L+
Sbjct: 121 PADAQKAPILPTNDLPTDFDWRDHGAVTPVKDQGSCGSCWSFSAIGALEGAHFLTTGNLI 180
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
S+SEQQLVDCDHECDPEE G+CD GCNGGLM SAFEYILKAGGVERE+ YPY G+D GSC
Sbjct: 181 SMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVEREETYPYIGSDRGSC 240
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
KF+KS+I A+VSNFSV+S DEDQ+AAN+VK+GPLAVGINAV+MQTY+ GVSCPYIC + L
Sbjct: 241 KFNKSQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQTYMKGVSCPYICSRNL 300
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
DHGV++VGYGS+G+APIRFKEKPYWIIKNSWGE+WGE+GYYKIC G N CGVDSMVS+VA
Sbjct: 301 DHGVVLVGYGSAGYAPIRFKEKPYWIIKNSWGESWGEDGYYKICRGHNACGVDSMVSTVA 360
Query: 363 AIHTTS 368
AI TT+
Sbjct: 361 AIQTTT 366
>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
Length = 366
Score = 578 bits (1490), Expect = e-162, Method: Compositional matrix adjust.
Identities = 269/343 (78%), Positives = 306/343 (89%), Gaps = 4/343 (1%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +IRQVV DG+ LLNA+HHF++FK +F K YA+ EEHDYR VFKAN+RRAKR
Sbjct: 27 DDILIRQVV-GDGDGD---LLNADHHFAVFKRRFGKAYASDEEHDYRLSVFKANMRRAKR 82
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+ PADA+ APILPT++LP+DFDWRD
Sbjct: 83 HQQLDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDR 142
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VK+QG CGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDHECDPEE+GSCDS
Sbjct: 143 GAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDS 202
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMNSAFEY LKAGG+ RE+DYPYTG D C+FDK+KIAA V+NFSV+S DEDQ+
Sbjct: 203 GCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQI 262
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
AANLVK+GPLAV INAV+MQTYIGGVSCPYIC K LDHGVL+VGYGS+G+APIR KEKPY
Sbjct: 263 AANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPY 322
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTSS 369
WIIKNSWGE+WGENGYYKIC GRNVCGVDSMVS+VAA+ TT+S
Sbjct: 323 WIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAVSTTTS 365
>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
Length = 373
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 275/366 (75%), Positives = 319/366 (87%), Gaps = 4/366 (1%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSED-HLLNAEHHFSLFKSKFSK 62
++SS+L + S+V A + + +D +IRQV E S + +LL AEHHFSLFK KF K
Sbjct: 10 FVISSILFV--SAVTAETLTTDGEDPLIRQVTDGQDESSANPNLLGAEHHFSLFKKKFKK 67
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
TYA+QEEHDYRF++FK+NLRRA+R Q LDPTA HGVT+FSDLT SEFRRQFLGL RRLRL
Sbjct: 68 TYASQEEHDYRFKIFKSNLRRAERHQKLDPTATHGVTQFSDLTHSEFRRQFLGL-RRLRL 126
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P DA +AP+LPTNDLP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGA++L+TG+LV
Sbjct: 127 PKDANEAPMLPTNDLPADFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGKLV 186
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDP E G+CDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD G+C
Sbjct: 187 SLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGAC 246
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLAV INAV+MQTYIGGVSCPYIC K L
Sbjct: 247 QFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRL 306
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
DHGVL+VGYGS+G+APIR KEKPYWIIKNSWGENWGE+GYYKIC GRN+CGVDSMVS+VA
Sbjct: 307 DHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGENWGESGYYKICRGRNICGVDSMVSTVA 366
Query: 363 AIHTTS 368
A+ T S
Sbjct: 367 AVQTAS 372
>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 274/368 (74%), Positives = 314/368 (85%), Gaps = 8/368 (2%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
MERL L SLL +L +SA+A +D+D +IRQVV E + HLLNAEHHFSLFKSKF
Sbjct: 1 MERLFLLSLLAFVL---FSSAIAFSDEDPLIRQVV---SETDDSHLLNAEHHFSLFKSKF 54
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA++EEHD+RF+VFKAN RRA+R QLLDP+A HG+TKFSDLTPSEFRR +LGL++
Sbjct: 55 GKIYASEEEHDHRFKVFKANRRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 113
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ +A+KAPILPT+DLP DFDWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 114 KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 173
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPE+ +CD+GC GGLM +AFEY LKAGG++ EKDYPYTG DG
Sbjct: 174 LVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKDG- 232
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
C FDKSKIAAAV+NFSVI DEDQ+AANLVKHGPLAVGINA WMQTY+GGVSCP IC K
Sbjct: 233 KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLICFK 292
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
DHGVL+VGYGS GFAPIR KEK YWIIKNSWGENWGE+GYYKIC G N+CGVD+MVS+
Sbjct: 293 RQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352
Query: 361 VAAIHTTS 368
V A HTT+
Sbjct: 353 VTAAHTTN 360
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/340 (79%), Positives = 308/340 (90%), Gaps = 3/340 (0%)
Query: 30 MIRQVVPSDGE-QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ 88
+IRQVVP GE + ED+LLNAEHHF+ FK+KF+KTYAT+EEHD+RF VFK+NLRRA+
Sbjct: 32 LIRQVVPDVGEAEEEDNLLNAEHHFASFKAKFAKTYATKEEHDHRFGVFKSNLRRARLHA 91
Query: 89 LLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGA 148
LDP+AVHGVTKFSDLTP+EFRRQFLGL + LR PA AQKAPILPT DLP DFDWRD GA
Sbjct: 92 KLDPSAVHGVTKFSDLTPAEFRRQFLGL-KPLRFPAHAQKAPILPTKDLPKDFDWRDKGA 150
Query: 149 VTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGC 208
VT VKDQGACGSCWSFS TGALEGAH+L+TGELVSLSEQQLVDCDH CDPEE G+CDSGC
Sbjct: 151 VTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGC 210
Query: 209 NGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAA 268
NGGLMN+AFEYIL++GGV++EKDYPYTG D G+CKFDK+K+AA VSN+SV+S DE+Q+AA
Sbjct: 211 NGGLMNNAFEYILQSGGVQKEKDYPYTGRD-GTCKFDKTKVAATVSNYSVVSLDEEQIAA 269
Query: 269 NLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWI 328
NLVK+GPLAV INAV+MQTY+GGVSCPYICGK+LDHGVL+VGYG +APIRFK KPYWI
Sbjct: 270 NLVKNGPLAVAINAVFMQTYVGGVSCPYICGKHLDHGVLLVGYGEGAYAPIRFKNKPYWI 329
Query: 329 IKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
IKNSWGE+WGENGYYKIC GRNVCGVDSMVS+VAAI+ +S
Sbjct: 330 IKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAIYPSS 369
>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
Length = 366
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/367 (74%), Positives = 314/367 (85%), Gaps = 5/367 (1%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R L L LL ++ L A + DD +IRQVV G+ LLNA+HHF++FK +F K
Sbjct: 4 RFSLLFLCTLLATTYLVFAAEDDGDDILIRQVVGDGGD-----LLNADHHFTVFKRRFGK 58
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
YA+ EEHDYR FKAN+RRAK+ Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 59 VYASDEEHDYRLSEFKANMRRAKQHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKF 118
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSC SFS TGALEGA+FL+TG+LV
Sbjct: 119 PADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCCSFSTTGALEGANFLATGKLV 178
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+D+PYTG D C
Sbjct: 179 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDHPYTGNDLQVC 238
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLAV INAV+MQTYIGGVSCPYIC K L
Sbjct: 239 RFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRL 298
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
DHGVL+VGYGS+G+APIR KEKPYWIIKNSWGE+WGENGYYKIC GRNVCGVDSMVS+VA
Sbjct: 299 DHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVA 358
Query: 363 AIHTTSS 369
A+ TT+S
Sbjct: 359 AVSTTTS 365
>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
Length = 369
Score = 576 bits (1484), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/352 (77%), Positives = 313/352 (88%), Gaps = 11/352 (3%)
Query: 22 VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
V D+D +IRQVV SDGE +D LLNA+HHF+LFKSK+ K+YATQEEHDYR VFKANL
Sbjct: 19 VVRADEDPLIRQVV-SDGE--DDALLNADHHFTLFKSKYGKSYATQEEHDYRLSVFKANL 75
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTN 135
RRAKR QLLDP+AVHGVTKFSDLTP EFRR FLG+ R+L+LPADA A ILPT+
Sbjct: 76 RRAKRHQLLDPSAVHGVTKFSDLTPKEFRRTFLGIRKSSSGKRKLKLPADAHAAEILPTS 135
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
DLP+DFDWRD+GAVTGVKDQG+CGSCWSFS TGALEGA+FL+TGELVSLSEQQLVDCDH
Sbjct: 136 DLPSDFDWRDYGAVTGVKDQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHL 195
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
CDPEE+G+CDSGCNGGLM +A+EY+L++GG+E+EKDYPYTG D G+CKFDKSKIAAAV+N
Sbjct: 196 CDPEEAGACDSGCNGGLMTTAYEYVLQSGGLEKEKDYPYTGKD-GTCKFDKSKIAAAVAN 254
Query: 256 FSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSS 314
FSV+S DEDQ+AANLVKHGPL+VGINAV+MQTYIGGVSCPYIC K LDHGVL+VGYG++
Sbjct: 255 FSVVSLDEDQIAANLVKHGPLSVGINAVFMQTYIGGVSCPYICSKRNLDHGVLLVGYGAA 314
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHT 366
G+APIRFK+KPYWI+KNSWGENWGE GYYKIC G N+CG+DSMVS+V A T
Sbjct: 315 GYAPIRFKDKPYWIVKNSWGENWGEEGYYKICRGNNICGIDSMVSTVTAAST 366
>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 363
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 270/368 (73%), Positives = 312/368 (84%), Gaps = 6/368 (1%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M L L++ S A++ DD+ +I QVV G + L AEHHF FK +F
Sbjct: 1 MNNPTLIIFFLVIFSVFFAASADGGDDEPLIMQVVEGSGVR-----LGAEHHFLDFKRRF 55
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA+QEEH+YRF VFKAN+RRA+R Q LDP+A HGVT+FSDLT SEFR + LGL R +
Sbjct: 56 GKAYASQEEHNYRFEVFKANMRRARRHQSLDPSAAHGVTRFSDLTASEFRNKVLGL-RGV 114
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
RLP++A KAPILPT++LP+DFDWRDHGAVT VK+QG+CGSCWSFS TGALEGAHFLSTGE
Sbjct: 115 RLPSNANKAPILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEGAHFLSTGE 174
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEYILK+GGV RE+DYPY+GTD G
Sbjct: 175 LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYILKSGGVMREEDYPYSGTDRG 234
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
+CKFDK+KIAA+V+NFSVIS DEDQ+AANLVK+GPLAV INA +MQTYIGGVSCPYIC +
Sbjct: 235 NCKFDKAKIAASVANFSVISLDEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYICSR 294
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
LDHGVL+VGYGS +APIR KEKP+WIIKNSWGENWGENGYYKIC GRN+CGVDSMVS+
Sbjct: 295 RLDHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWGENGYYKICRGRNICGVDSMVST 354
Query: 361 VAAIHTTS 368
VAA+HTT+
Sbjct: 355 VAAVHTTT 362
>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 368
Score = 575 bits (1481), Expect = e-161, Method: Compositional matrix adjust.
Identities = 274/368 (74%), Positives = 314/368 (85%), Gaps = 5/368 (1%)
Query: 3 RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
R L L LL ++ L A +D DD +IRQVV DG+ LLNA+HHF++FK +F
Sbjct: 4 RFSLLFLCTLLATTSLVFAAEDDDGDDILIRQVV-GDGDGD---LLNADHHFTVFKRRFG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K YA+ EEHDYR VFKAN+RRAKR Q LDP AVHGVT+FSD TP+EFRR+FLGLNRRL+
Sbjct: 60 KAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDSTPTEFRRKFLGLNRRLK 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
PADA+ APILPT++LP+DFDWRD GAVT VK+QG CG CWSFS TGALEGA+FL+TG+L
Sbjct: 120 FPADAKTAPILPTDELPSDFDWRDRGAVTPVKNQGTCGLCWSFSTTGALEGANFLATGKL 179
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE+GSCD GCNGGLMNSAFEY LKAGG+ RE+DYPYTG D
Sbjct: 180 VSLSEQQLVDCDHECDPEEAGSCDFGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQV 239
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY 301
C+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLAV INAV+MQTYIGGVSCPYIC K
Sbjct: 240 CRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKR 299
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
LDHGVL+VGYGS+G+APIR KEKPYWIIKNSWGE+WGENGYYKIC GRNVCGVDSMVS+V
Sbjct: 300 LDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTV 359
Query: 362 AAIHTTSS 369
AA+ TT+S
Sbjct: 360 AAVSTTTS 367
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 272/350 (77%), Positives = 310/350 (88%), Gaps = 5/350 (1%)
Query: 21 AVAVNDDDAMIRQVVPSDGEQ-SEDHLLNAE-HHFSLFKSKFSKTYATQEEHDYRFRVFK 78
A +N DD +IR+VV DG+ S +LL+AE HHFSLFKSKF K+Y +QEEHDYRF VFK
Sbjct: 21 AETLNGDDPLIREVV--DGQDASSSNLLSAEQHHFSLFKSKFKKSYGSQEEHDYRFSVFK 78
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
ANLRRA R Q LDPTA HGVT+FSDLTP+EFR+Q LGL RRLRLP DA +APILPT+DLP
Sbjct: 79 ANLRRAARHQELDPTASHGVTQFSDLTPAEFRKQVLGL-RRLRLPKDANEAPILPTSDLP 137
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DFDWRD GAV +K+QG+CGSCWSFSATGALEGAHFL+TGELVSLSEQQLVDCDHECDP
Sbjct: 138 EDFDWRDKGAVGPIKNQGSCGSCWSFSATGALEGAHFLATGELVSLSEQQLVDCDHECDP 197
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
EE GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD +CKFDK+K+AA V+NFSV
Sbjct: 198 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRDACKFDKNKVAARVANFSV 257
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAP 318
+S DEDQ+AANLVK+GPLAV INAV+MQTYIGGVSCPYIC + LDHGVL+VGYGS+G++P
Sbjct: 258 VSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYSP 317
Query: 319 IRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
+R KEKP+WIIKNSWGE WGENG+YKIC GRNVCGVDSMVS+VAA+ T+S
Sbjct: 318 VRMKEKPFWIIKNSWGEKWGENGFYKICRGRNVCGVDSMVSTVAAVQTSS 367
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 573 bits (1478), Expect = e-161, Method: Compositional matrix adjust.
Identities = 266/349 (76%), Positives = 301/349 (86%), Gaps = 1/349 (0%)
Query: 20 SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
SA N DD++IRQVV E S + L +HHFSLFK KF K+Y +QEEHDYRF VFK+
Sbjct: 20 SAETFNGDDSLIRQVVEGQDESSSNLLTAEQHHFSLFKRKFKKSYLSQEEHDYRFSVFKS 79
Query: 80 NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
NLRRA R Q LDPTA HGVT+FSDLT +EFR+Q LGL R+LRLP DA APILPTNDLP
Sbjct: 80 NLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGL-RKLRLPKDANTAPILPTNDLPE 138
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWR+ GAV VK+QG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDPE
Sbjct: 139 DFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPE 198
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D G+CKFDK+K+AA V+NFSV+
Sbjct: 199 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRGACKFDKNKVAAGVANFSVV 258
Query: 260 SSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPI 319
S DEDQ+AANLVK+GPLAV INAV+MQTYIGGVSCPYIC + LDHGVL+VGYGS+ +AP+
Sbjct: 259 SLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSRRLDHGVLLVGYGSAAYAPV 318
Query: 320 RFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
R KEKPYWIIKNSWGE+WGENG+YKIC GRN+CGVDSMVS+VAA+ T S
Sbjct: 319 RMKEKPYWIIKNSWGESWGENGFYKICRGRNICGVDSMVSTVAAVQTNS 367
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 573 bits (1478), Expect = e-161, Method: Compositional matrix adjust.
Identities = 266/349 (76%), Positives = 301/349 (86%), Gaps = 1/349 (0%)
Query: 20 SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
SA N DD++IRQVV E S + L +HH SLFK KF K+Y +QEEHDYRF VFK+
Sbjct: 26 SAETFNGDDSLIRQVVEGQDESSPNLLTAEQHHLSLFKRKFKKSYLSQEEHDYRFSVFKS 85
Query: 80 NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
NLRRA R Q LDPTA HGVT+FSDLT +EFR+Q LGL R+LRLP DA KAPILPTNDLP
Sbjct: 86 NLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGL-RKLRLPKDANKAPILPTNDLPE 144
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWR+ GAV VK+QG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDPE
Sbjct: 145 DFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPE 204
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D G+CKFDK K+AA V+NFSV+
Sbjct: 205 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRGACKFDKDKVAAGVANFSVV 264
Query: 260 SSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPI 319
S DEDQ+AANLVK+GPLAV NAV+MQTYIGGVSCPYIC + LDHGVL+VGYGS+G+AP+
Sbjct: 265 SLDEDQIAANLVKNGPLAVATNAVFMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPV 324
Query: 320 RFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
R KEKPYWIIKNSWGE+WGENG+YKIC GRN+CGVDSMVS+VAA+ T+S
Sbjct: 325 RMKEKPYWIIKNSWGESWGENGFYKICRGRNICGVDSMVSTVAAVQTSS 373
>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
Length = 364
Score = 573 bits (1477), Expect = e-161, Method: Compositional matrix adjust.
Identities = 275/338 (81%), Positives = 304/338 (89%), Gaps = 4/338 (1%)
Query: 30 MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
+IRQVVP +GE EDHLLNAEHHFS FK+KF KTYAT+EEHD+RF VFK+NLRRA+
Sbjct: 29 LIRQVVP-EGE-VEDHLLNAEHHFSNFKAKFGKTYATKEEHDHRFGVFKSNLRRARLHAQ 86
Query: 90 LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
LDP+AVHGVTKFSDLT +EF+RQFLGL + L LPA+AQKAPILPTN+LP DFDWRD GAV
Sbjct: 87 LDPSAVHGVTKFSDLTAAEFQRQFLGL-KPLGLPANAQKAPILPTNNLPKDFDWRDKGAV 145
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T VKDQGACGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH CDPEE G+CDSGCN
Sbjct: 146 TNVKDQGACGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCN 205
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMN+AFEYIL AGGV+RE+DYPY G D SCKFDKSKIAA+V+N+SVIS DEDQ+AAN
Sbjct: 206 GGLMNNAFEYILGAGGVQREEDYPYAGRDS-SCKFDKSKIAASVANYSVISLDEDQIAAN 264
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLAVGINAV+MQTYIGGVSCPYIC K LDHGV IVGYG SG+APIRFKEKPYWII
Sbjct: 265 LVKNGPLAVGINAVYMQTYIGGVSCPYICAKRLDHGVQIVGYGESGYAPIRFKEKPYWII 324
Query: 330 KNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTT 367
KNSWGE+WGENGYYKIC G+N CGVDSMVS+V AIH +
Sbjct: 325 KNSWGESWGENGYYKICRGQNACGVDSMVSTVGAIHAS 362
>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
Length = 365
Score = 572 bits (1475), Expect = e-161, Method: Compositional matrix adjust.
Identities = 267/351 (76%), Positives = 310/351 (88%), Gaps = 7/351 (1%)
Query: 19 ASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFK 78
AS + + +D +I+Q+V DG DH L+A+HHF LFK +F K+YATQE+HDYRF VFK
Sbjct: 22 ASGKSSDGEDLVIQQIV--DG----DHPLSADHHFRLFKRRFGKSYATQEDHDYRFSVFK 75
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
NLRRA+ Q LDP+AVHGVT+FSDLTP+EFRR LGL +RLR PADA KAPILPT DLP
Sbjct: 76 TNLRRARHHQRLDPSAVHGVTQFSDLTPAEFRRNHLGL-KRLRFPADANKAPILPTEDLP 134
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DFDWRDHGAV VK+QG+CGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDHECDP
Sbjct: 135 ADFDWRDHGAVASVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 194
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
EE GSCDSGCNGGLMNSA EY LKAGG+ RE+DYPY+GTD G+CKFD++KIAA+V+NFSV
Sbjct: 195 EEPGSCDSGCNGGLMNSALEYTLKAGGLMREEDYPYSGTDRGTCKFDETKIAASVANFSV 254
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAP 318
+S DE+Q+AANLVK+GPLAV INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGS+G+AP
Sbjct: 255 VSLDENQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKRLDHGVLLVGYGSAGYAP 314
Query: 319 IRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTSS 369
IR KEKPYWIIKNSWGE+WGENG+YKIC GRNVCGVDSMVS+VAA+HTTS+
Sbjct: 315 IRMKEKPYWIIKNSWGESWGENGFYKICQGRNVCGVDSMVSTVAAVHTTSN 365
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 265/349 (75%), Positives = 300/349 (85%), Gaps = 1/349 (0%)
Query: 20 SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
SA N DD++IRQVV E S + L +HHFSLFK KF K+Y +QEEHDYRF VFK+
Sbjct: 20 SAETFNGDDSLIRQVVEGQDESSSNLLTAEQHHFSLFKRKFKKSYLSQEEHDYRFSVFKS 79
Query: 80 NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
NLRRA R Q LDPTA HGVT+FSDLT +EFR+Q LGL R+LRLP DA APILPTNDLP
Sbjct: 80 NLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGL-RKLRLPKDANTAPILPTNDLPE 138
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWR+ GAV VK+QG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDPE
Sbjct: 139 DFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPE 198
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D G+CKFDK+K+AA V+NFS +
Sbjct: 199 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRGACKFDKNKVAAGVANFSAV 258
Query: 260 SSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPI 319
S DEDQ+AANLVK+GPLAV INAV+MQTYIGGVSCPYIC + LDHGVL+VGYGS+ +AP+
Sbjct: 259 SLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSRRLDHGVLLVGYGSAAYAPV 318
Query: 320 RFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
R KEKPYWIIKNSWGE+WGENG+YKIC GRN+CGVDSMVS+VAA+ T S
Sbjct: 319 RMKEKPYWIIKNSWGESWGENGFYKICRGRNICGVDSMVSTVAAVQTNS 367
>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 365
Score = 570 bits (1469), Expect = e-160, Method: Compositional matrix adjust.
Identities = 272/368 (73%), Positives = 312/368 (84%), Gaps = 6/368 (1%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+RL L SL L +SA+A D+D +IRQVV S+ E + HLLNAEHHFSLFKSKF
Sbjct: 1 MDRLFLLSLPRFAL---FSSAIAFPDEDPLIRQVV-SETETDDSHLLNAEHHFSLFKSKF 56
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA++EEHD+RF+VFKANLRRA+ QLLDP+A HG+TKFSDLTPSEFRR +LGL++
Sbjct: 57 GKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 115
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ +A+KAPILPT+DLP D+DWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 116 KPKVNAEKAPILPTSDLPADYDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 175
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECD E+ SCD+GC GGLM +AFEY LKAGG++ EKDYPYTG DG
Sbjct: 176 LVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKDG- 234
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
C FDKSKIAAAV+NFSVI DEDQ+AANLVKHGPLAVGINA WMQTY+GGVSCP IC K
Sbjct: 235 KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLICFK 294
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
DHGVL+VGYGS GFAPIR KEK YWIIKNSWGENWGE+GYYKIC G N+CGVD+MVS+
Sbjct: 295 RQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 354
Query: 361 VAAIHTTS 368
V A HTT+
Sbjct: 355 VTAAHTTN 362
>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 271/368 (73%), Positives = 311/368 (84%), Gaps = 8/368 (2%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
MERL L SLL +L +SA+A +D+D +IRQVV E + HLLNAEHHFSLFKSKF
Sbjct: 1 MERLFLLSLLAFVL---FSSAIAFSDEDPLIRQVV---SETDDSHLLNAEHHFSLFKSKF 54
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA++EEHD+RF+VFKANLRRA+ QLLDP+A HG+TKFSDLTPSEFRR +LGL++
Sbjct: 55 GKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 113
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ +A+KAPILPT+DLP DFDWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 114 KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 173
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPE+ +CD+GC GG +AFEY LKAGG++ EKDYPYTG DG
Sbjct: 174 LVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYTLKAGGLQLEKDYPYTGKDG- 232
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
C FDKSKI AAV+NFSVI DEDQ+AANLVKHGPLAVGINA WMQTY+GGVSCP IC K
Sbjct: 233 KCHFDKSKICAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLICFK 292
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
DHGVL+VGYGS GFAPIR KEK YWIIKNSWGENWGE+GYYKIC G N+CGVD+MVS+
Sbjct: 293 RQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352
Query: 361 VAAIHTTS 368
V A HTT+
Sbjct: 353 VTAAHTTN 360
>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
Length = 368
Score = 568 bits (1464), Expect = e-159, Method: Compositional matrix adjust.
Identities = 268/367 (73%), Positives = 315/367 (85%), Gaps = 13/367 (3%)
Query: 10 LLLLLSSVLASA--VAVN------DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
L+ +LS +L ++ +AVN DDD +IRQVV + + H+LNAEHHF+LFK +F
Sbjct: 7 LVFVLSILLTTSFLLAVNGEIKGGDDDILIRQVVGDE----DHHMLNAEHHFTLFKKRFG 62
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
KTYA+ EEH YRF VFKANLRRA R Q LDP+AVHGVT+FSD+TP EF ++FLG+NRRLR
Sbjct: 63 KTYASDEEHHYRFSVFKANLRRAMRHQKLDPSAVHGVTQFSDMTPDEFSQKFLGVNRRLR 122
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
P+DA KAPILPT DLP+DFDWR+HGAVT VK+QG+CGSCWSFS TGALEGA+FL+TG+L
Sbjct: 123 FPSDANKAPILPTEDLPSDFDWREHGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKL 182
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE SCDSGC+GGLMNSAFEY LKAGG+ RE+DYPYTGTD +
Sbjct: 183 VSLSEQQLVDCDHECDPEEKDSCDSGCSGGLMNSAFEYTLKAGGLMREEDYPYTGTDKAT 242
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY 301
CKFD +K+AA V+NFSV+S DE+Q+AANLVK+GPLAV INAV+MQTY+GGVSCPYIC K
Sbjct: 243 CKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKQ 302
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
LDHGVL+VGYG +GF+PIR KEKPYWIIKNSWGE WGE+GYYKI GRNVCGVDSMVS+V
Sbjct: 303 LDHGVLLVGYG-TGFSPIRMKEKPYWIIKNSWGEKWGESGYYKIRRGRNVCGVDSMVSTV 361
Query: 362 AAIHTTS 368
AA+ T+S
Sbjct: 362 AAVSTSS 368
>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
Length = 365
Score = 567 bits (1461), Expect = e-159, Method: Compositional matrix adjust.
Identities = 265/350 (75%), Positives = 307/350 (87%), Gaps = 8/350 (2%)
Query: 20 SAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFK 78
S V+ N+ DD +IRQVV + D LL+AEHHF+ FK++F KTYAT EEHDYRF +FK
Sbjct: 23 SDVSSNELDDLLIRQVV-----SNSDDLLSAEHHFAAFKARFRKTYATAEEHDYRFSIFK 77
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
ANLRRAKR QLLDP+AVHGVT+FSDLTP+EFR+ +LGL + LR P D Q+APILPTNDLP
Sbjct: 78 ANLRRAKRNQLLDPSAVHGVTRFSDLTPAEFRQNYLGL-KPLRFPIDTQQAPILPTNDLP 136
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
TDFDWRDHGAVT VKDQG CGSCWSFS TGALEGAHFL+TG LVSLSEQQLVDCDHECDP
Sbjct: 137 TDFDWRDHGAVTAVKDQGECGSCWSFSTTGALEGAHFLATGNLVSLSEQQLVDCDHECDP 196
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
EE G+CD GCNGGLMN+AFEYILKAGGV R +DYPYTGTD G CKFDK+KIAA+VSNFS
Sbjct: 197 EEYGACDRGCNGGLMNTAFEYILKAGGVVRGEDYPYTGTD-GHCKFDKTKIAASVSNFST 255
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAP 318
+S DEDQ+AANLVK+GPLAVGINA++MQ+Y GGVSCP+IC L+HGVL+VGYGS+G++P
Sbjct: 256 VSIDEDQIAANLVKNGPLAVGINAIFMQSYAGGVSCPFICSTSLNHGVLLVGYGSAGYSP 315
Query: 319 IRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
IRFKEKPYW++KNSWG+NWGE+GYYKIC G N+CGVDSMVS+VAAI + +
Sbjct: 316 IRFKEKPYWLLKNSWGQNWGEHGYYKICRGHNICGVDSMVSTVAAIQSAT 365
>gi|357473427|ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula]
gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 567 bits (1460), Expect = e-159, Method: Compositional matrix adjust.
Identities = 263/368 (71%), Positives = 312/368 (84%), Gaps = 6/368 (1%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+ L ++L + SV A + +D +IRQVV +G + L AEHHF+LFK KF
Sbjct: 1 MDHRTLLLFVVLFIFSVSAFSTPDEGEDPIIRQVVDEEGVR-----LGAEHHFNLFKHKF 55
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y++++EHDYRF++FK+NL RAKR QL+DP+AVHGVT+FSDLTP EFR+ LGL R +
Sbjct: 56 GKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKSVLGL-RGV 114
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
LP DA APILPT++LP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGAHFLSTG+
Sbjct: 115 GLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGAHFLSTGK 174
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPE+ GSCD+GCNGGLMNSAFEYILK+GGV RE+DYPY+GTD G
Sbjct: 175 LVSLSEQQLVDCDHECDPEQPGSCDAGCNGGLMNSAFEYILKSGGVMREEDYPYSGTDRG 234
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
SCKFDK KIAA+V+NFSV+S DEDQ+AANLVK+GPLA+ +NAV+MQTY+GGVSCPYIC K
Sbjct: 235 SCKFDKKKIAASVANFSVVSLDEDQIAANLVKNGPLAIALNAVYMQTYVGGVSCPYICSK 294
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
LDHGVL+VGYGS ++PIR KEKPYWIIKNSWGE WGENGYYKIC GRN+CGVDSMVS+
Sbjct: 295 RLDHGVLLVGYGSGAYSPIRLKEKPYWIIKNSWGETWGENGYYKICRGRNICGVDSMVST 354
Query: 361 VAAIHTTS 368
VAA+HTT+
Sbjct: 355 VAAVHTTT 362
>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 365
Score = 565 bits (1457), Expect = e-159, Method: Compositional matrix adjust.
Identities = 267/363 (73%), Positives = 312/363 (85%), Gaps = 9/363 (2%)
Query: 9 LLLLLLSSVLASAVAVND---DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
LLL+ S V A+ A +D ++ +I QVV DG D L AEHHF FK +F K Y
Sbjct: 8 LLLVAFSLVFAAVSASSDGGNEEPLIMQVV--DGG---DVRLGAEHHFLEFKRRFGKAYD 62
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+++EHDYR++VFKAN+RRA+R Q LDP+A HGVT+FSDLTPSEFR + LGL R +RLP D
Sbjct: 63 SEDEHDYRYKVFKANMRRARRHQSLDPSAAHGVTRFSDLTPSEFRNKVLGL-RGVRLPLD 121
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A KAPILPT++LP+DFDWRDHGAVT VK+QG+CGSCWSFS TGALEGAHFLSTGELVSLS
Sbjct: 122 ANKAPILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLS 181
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYILK+GGV RE+DYPY+G D G+CKFD
Sbjct: 182 EQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYILKSGGVMREEDYPYSGADSGTCKFD 241
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHG 305
K+KIAA+V+NFSV+S DEDQ+AANLVK+GPLAV INA +MQTYIGGVSCPY+C + L+HG
Sbjct: 242 KTKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYVCSRRLNHG 301
Query: 306 VLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIH 365
VL+VGYGS +APIR KEKP+WIIKNSWGENWGENGYYKIC GRN+CGVDSMVS+VA++H
Sbjct: 302 VLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWGENGYYKICRGRNICGVDSMVSTVASVH 361
Query: 366 TTS 368
TT+
Sbjct: 362 TTT 364
>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
Length = 363
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 264/347 (76%), Positives = 302/347 (87%), Gaps = 5/347 (1%)
Query: 22 VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
+A +DDD +IRQVV E ++H+LNAEHHFSLFKSK+ K YA+QEEHD+R +VFKANL
Sbjct: 19 IAFSDDDPLIRQVV---SETDDNHMLNAEHHFSLFKSKYGKIYASQEEHDHRLKVFKANL 75
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
RRA+R QLLDPTA HG+T+FSDLTPSEFRR +LGL++ R +AQKAPILPT+DLP DF
Sbjct: 76 RRARRHQLLDPTAEHGITQFSDLTPSEFRRTYLGLHKP-RPKLNAQKAPILPTSDLPEDF 134
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+ GAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQQLVDCDHECD EE
Sbjct: 135 DWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEK 194
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
CD+GCNGGLM +AFEY LKAGG++REKDYPYTG DG C FDKSKIAA+V+NFSVI
Sbjct: 195 SECDAGCNGGLMTTAFEYTLKAGGLQREKDYPYTGRDG-KCHFDKSKIAASVANFSVIGL 253
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRF 321
DEDQ+AANLVKHGPLAVGINA WMQTY+ GVSCP IC K DHGVL+VGYGS+GFAPIR
Sbjct: 254 DEDQIAANLVKHGPLAVGINAAWMQTYMRGVSCPLICFKRQDHGVLLVGYGSAGFAPIRL 313
Query: 322 KEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
KEKPYWIIKNSWGENWGE+GYYKIC G N+CGVD+MVS+V A HTT+
Sbjct: 314 KEKPYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVSTVTATHTTN 360
>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
Length = 364
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 268/344 (77%), Positives = 303/344 (88%), Gaps = 5/344 (1%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
+DD+ +IRQVV E ++HLLNAEHHFS FK+KFSKTYAT+EEHDYRF VFK+NL RA
Sbjct: 25 DDDNILIRQVV----EDGDEHLLNAEHHFSAFKTKFSKTYATKEEHDYRFGVFKSNLLRA 80
Query: 85 KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWR 144
K Q LDP+A+HGVTKFSDLTPSEFR QFLGL + L LP+DA APILPT++LP DFDWR
Sbjct: 81 KSHQELDPSAIHGVTKFSDLTPSEFRSQFLGL-KPLSLPSDAHNAPILPTDNLPKDFDWR 139
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
DHGAVT VK+QG GSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDP+ + +C
Sbjct: 140 DHGAVTNVKNQGTGGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPDLNDAC 199
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDED 264
DSGCNGGLM +AF Y KAGG+ RE+DY YTG D G CKFDKSKIAA+VSNFSV+S DED
Sbjct: 200 DSGCNGGLMTTAFGYTKKAGGLVREEDYLYTGRDRGPCKFDKSKIAASVSNFSVVSLDED 259
Query: 265 QMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
Q+AANLVK+GPL+VGINAV+MQTYIGGVSCP+ICGK+LDHGVL+VGYG+ G+APIRFKEK
Sbjct: 260 QIAANLVKNGPLSVGINAVYMQTYIGGVSCPFICGKHLDHGVLLVGYGAGGYAPIRFKEK 319
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
PYWIIKNSWGENWGENGYYKIC G N+CGVDSMVS+V AIHT S
Sbjct: 320 PYWIIKNSWGENWGENGYYKICRGPNMCGVDSMVSTVIAIHTFS 363
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 265/369 (71%), Positives = 312/369 (84%), Gaps = 7/369 (1%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+R + +++L + +S N DD +IRQVV + EDHLLNAEHHF+ FKSKF
Sbjct: 1 MDRRFIFAIVLFA-AVATSSTDNTNTDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKF 55
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
SK+Y+T+EEHDYRF VFK+NL +AK Q LDPTA HG+TKFSDLT SEFRRQFLGL +RL
Sbjct: 56 SKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKKRL 115
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
RLPA AQKAPILPT +LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+
Sbjct: 116 RLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGK 175
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDH CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D G
Sbjct: 176 LVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD-G 234
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
SCKFDKSK+ A+VSNFSV+S DE+Q+AANLVK+GPLAVGINA WMQTY+ GVSCPY+C K
Sbjct: 235 SCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCPYVCAK 294
Query: 301 -YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
LDHGVL+VG+G +APIR KEKPYWI+KNSWG+NWGE GYYKIC GRNVCGVDSMVS
Sbjct: 295 SRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRGRNVCGVDSMVS 354
Query: 360 SVAAIHTTS 368
+VAA + +
Sbjct: 355 TVAAAQSNN 363
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 563 bits (1452), Expect = e-158, Method: Compositional matrix adjust.
Identities = 262/341 (76%), Positives = 296/341 (86%), Gaps = 4/341 (1%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +IRQVV D +Q LL+AE HFS F S++ K+YA + EH YRF VFK+NLRRA+R
Sbjct: 23 EDPVIRQVVSDDQQQ----LLSAEAHFSSFLSRYGKSYADEAEHAYRFSVFKSNLRRARR 78
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q LDPTAVHGVT+F+DLTPSEFRR +LGL RR R APILPTN+LP DFDWRDH
Sbjct: 79 HQRLDPTAVHGVTRFADLTPSEFRRTYLGLRRRPRTAGSTHDAPILPTNELPADFDWRDH 138
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VK+QG+CGSCWSFSA GALEGA++LSTG LVSLSEQQLVDCDHECD E SCD
Sbjct: 139 GAVTPVKNQGSCGSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQ 198
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM +AFEYILK+GG+ERE DYPYTGTD G+CKF+K+KI+A SNFSV+S DEDQ+
Sbjct: 199 GCNGGLMTTAFEYILKSGGLEREADYPYTGTDRGTCKFNKAKISAVASNFSVVSIDEDQI 258
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
AANLVKHGPLAVGINAV+MQTY+GGVSCPYICGK+LDHGVL+VGYGS+GFAPIRFKEKPY
Sbjct: 259 AANLVKHGPLAVGINAVFMQTYVGGVSCPYICGKHLDHGVLLVGYGSAGFAPIRFKEKPY 318
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTT 367
WIIKNSWGENWGENGYYKIC GRNVCGVDSMVSSV+A HT+
Sbjct: 319 WIIKNSWGENWGENGYYKICRGRNVCGVDSMVSSVSAFHTS 359
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 563 bits (1452), Expect = e-158, Method: Compositional matrix adjust.
Identities = 265/369 (71%), Positives = 312/369 (84%), Gaps = 7/369 (1%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+R + +++L + +S N DD +IRQVV + EDHLLNAEHHF+ FKSKF
Sbjct: 1 MDRRFIFAIVLFA-AVATSSTDDTNTDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKF 55
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
SK+Y+T+EEHDYRF VFK+NL +AK Q LDPTA HG+TKFSDLT SEFRRQFLGL +RL
Sbjct: 56 SKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKKRL 115
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
RLPA AQKAPILPT +LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+
Sbjct: 116 RLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGK 175
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDH CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D G
Sbjct: 176 LVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD-G 234
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
SCKFDKSK+ A+VSNFSV+S DE+Q+AANLVK+GPLAVGINA WMQTY+ GVSCPY+C K
Sbjct: 235 SCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCPYVCAK 294
Query: 301 -YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
LDHGVL+VG+G +APIR KEKPYWI+KNSWG+NWGE GYYKIC GRNVCGVDSMVS
Sbjct: 295 SRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRGRNVCGVDSMVS 354
Query: 360 SVAAIHTTS 368
+VAA + +
Sbjct: 355 TVAAAQSNN 363
>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
Length = 371
Score = 562 bits (1449), Expect = e-158, Method: Compositional matrix adjust.
Identities = 266/342 (77%), Positives = 301/342 (88%), Gaps = 6/342 (1%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +I QVV SDG D LLNAE+ F+ FK+KF KTYAT EEHD+RF VFKANLRRAKR
Sbjct: 35 EDLLIHQVV-SDG----DDLLNAEYQFAEFKTKFGKTYATAEEHDHRFNVFKANLRRAKR 89
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
QLLDP+A HGVT+FSDLTP EFR+ +LGL +RL+LPADAQKAPILPT DLPTDFDWRDH
Sbjct: 90 HQLLDPSAEHGVTQFSDLTPREFRQNYLGL-KRLQLPADAQKAPILPTKDLPTDFDWRDH 148
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG CGSCWSFS GALEGAHFL+TG LVSLS QQL+DCD ECDPEE +CD
Sbjct: 149 GAVTAVKDQGYCGSCWSFSTIGALEGAHFLATGNLVSLSTQQLLDCDTECDPEEYDACDD 208
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMN+AFEYILKAGGV +E+DYPYTGTD G C+F+K+KIAA+V+NFSV+S DEDQ+
Sbjct: 209 GCNGGLMNNAFEYILKAGGVAQEEDYPYTGTDRGLCRFNKTKIAASVANFSVVSLDEDQI 268
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
AANLVK+GPLAVGINAV+MQTY GVSCPYIC LDHGVL+VGYGS+G++PIRFKEKPY
Sbjct: 269 AANLVKNGPLAVGINAVFMQTYKSGVSCPYICSSTLDHGVLLVGYGSAGYSPIRFKEKPY 328
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
WIIKNSWGE+WGE GYYKIC G N+CGVDSMVS+VAAIHTT+
Sbjct: 329 WIIKNSWGESWGEQGYYKICRGHNICGVDSMVSTVAAIHTTA 370
>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
Length = 367
Score = 562 bits (1449), Expect = e-158, Method: Compositional matrix adjust.
Identities = 266/369 (72%), Positives = 314/369 (85%), Gaps = 6/369 (1%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+RL L SLL+ + S +SA A +D+D +IRQV S+ + + +HLLNAEHHFSLFKSKF
Sbjct: 1 MDRLFLLSLLVFTIFS--SSAFAFSDEDPLIRQVT-SESDDNNNHLLNAEHHFSLFKSKF 57
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YATQEEHD+R +VFKANLRRA+R QLLDPTA HG+TKFSDLTPSEFRR +LGL++
Sbjct: 58 GKIYATQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITKFSDLTPSEFRRTYLGLHKP- 116
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ KAPILPT+DLP DFDWR+ GAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 117 KPKLSTTKAPILPTSDLPEDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 176
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECD E+ CD+GC GGLM +AFEY LKAGG++REKDYPYTG + G
Sbjct: 177 LVSLSEQQLVDCDHECDAEQKSECDAGCGGGLMTTAFEYTLKAGGLQREKDYPYTGRN-G 235
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
C FDKSKIAA+V+N+SV+ DEDQ+AANLVKHGPLAVGIN+ WMQTYIGGVSCP +C K
Sbjct: 236 QCHFDKSKIAASVTNYSVVGLDEDQIAANLVKHGPLAVGINSAWMQTYIGGVSCPLVCFK 295
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
+ DHGVL+VGYGS+GFAPIR K KPYWIIKNSWGE+WGE+GYYKIC G+ N+CGVD+MVS
Sbjct: 296 HQDHGVLLVGYGSAGFAPIRLKAKPYWIIKNSWGEHWGEHGYYKICRGQHNICGVDAMVS 355
Query: 360 SVAAIHTTS 368
+V A HTT+
Sbjct: 356 TVTAAHTTN 364
>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 562 bits (1448), Expect = e-158, Method: Compositional matrix adjust.
Identities = 264/344 (76%), Positives = 299/344 (86%), Gaps = 6/344 (1%)
Query: 24 VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
N DD +IRQVV + +EDH+LNAEHHF+ FKSKFSK YAT+EEHDYRF VFK+NL +
Sbjct: 26 TNSDDLLIRQVV----DTAEDHILNAEHHFTSFKSKFSKNYATKEEHDYRFGVFKSNLIK 81
Query: 84 AKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDW 143
AK Q LDP+A HG+TKFSDLT SEFRRQFLGLN+RLRLPA AQKAPILPTN+LP DFDW
Sbjct: 82 AKLHQKLDPSAQHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTNNLPEDFDW 141
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
R+ GAVT VKDQG+CGSCW+FS TGALEGA++L+TG+L SLSEQQLVDCDH CDPEE GS
Sbjct: 142 REKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGS 201
Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDE 263
CDSGCNGGLMN+AFEYIL++GGV EKDY YTG D GSCKFDKSK+ A+VSNFSV+S DE
Sbjct: 202 CDSGCNGGLMNNAFEYILQSGGVVSEKDYAYTGRD-GSCKFDKSKVVASVSNFSVVSLDE 260
Query: 264 DQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFK 322
DQ+AANLVK+GPLAV INA WMQTY+ GVSCPYIC K LDHGVL++G+G G+APIR K
Sbjct: 261 DQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYICAKARLDHGVLLLGFGQGGYAPIRLK 320
Query: 323 EKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHT 366
EKPYWIIKNSWG+NWGE GYYKIC GRNVCGVDSMVS+VAA +
Sbjct: 321 EKPYWIIKNSWGQNWGEEGYYKICRGRNVCGVDSMVSTVAAAQS 364
>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
Length = 373
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 273/366 (74%), Positives = 316/366 (86%), Gaps = 5/366 (1%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
LI ++LL + L S + S IRQVVP E++++HLLNAEHHFSLFKSK+ KT
Sbjct: 9 LIAATLLAVSLGSAVISGEVNYGFVNPIRQVVP---EENDEHLLNAEHHFSLFKSKYEKT 65
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR-LRL 122
YATQEEHD+RFRVFKANLRRA+R QLLDP+AVHGVT+FSDLTP EFRR+FLGL RR RL
Sbjct: 66 YATQEEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRL 125
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P D Q APILPT+DLPT+FDWR+ GAVT VK+QG CGSCWSFSA GALEGAHFL+T ELV
Sbjct: 126 PTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELV 185
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDP ++ SCDSGC+GGLMN+AFEY LKAGG+ +E+DYPYTG D +C
Sbjct: 186 SLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDNTAC 245
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
KFDKSKIAA+VSNFSV+SSDEDQ+AANLVKHGPLA+ INA+WMQTYIGGVSCPY+C K
Sbjct: 246 KFDKSKIAASVSNFSVVSSDEDQIAANLVKHGPLAIAINAMWMQTYIGGVSCPYVCSKSQ 305
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG-RNVCGVDSMVSSV 361
DHGVL+VG+GSSG+APIR KEKPYWIIKNSWG WGE+GYYKIC G N+CG+D+MVS+V
Sbjct: 306 DHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMDTMVSTV 365
Query: 362 AAIHTT 367
AA+HT+
Sbjct: 366 AAVHTS 371
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 263/343 (76%), Positives = 299/343 (87%), Gaps = 6/343 (1%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +IRQVV + EDHLLNAEHHF+ FKSKFSK+YAT+EEHDYRF VFKANL +AK
Sbjct: 21 DDFLIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKL 76
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q LDPTA HG+TKFSDLT SEFRRQFLGLN+RLRLPA AQKAPILPT +LP DFDWR+
Sbjct: 77 HQKLDPTAEHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTTNLPEDFDWREK 136
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLSEQQLVDCDH CDPEE+GSCDS
Sbjct: 137 GAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDS 196
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFDKSK+ A+VSNFSV+S DE+Q+
Sbjct: 197 GCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD-GSCKFDKSKVVASVSNFSVVSLDEEQI 255
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKP 325
AANLVK+GPLAV INA WMQ Y+ GVSCPY+C K LDHGVL+VG+G +APIR KEKP
Sbjct: 256 AANLVKNGPLAVAINAAWMQAYMSGVSCPYVCAKARLDHGVLLVGFGKGAYAPIRLKEKP 315
Query: 326 YWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
YWIIKNSWG+NWGE GYYKIC GRNVCGVDSMVS+VAA + +
Sbjct: 316 YWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVSTVAAAQSNN 358
>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
Length = 355
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 262/340 (77%), Positives = 300/340 (88%), Gaps = 3/340 (0%)
Query: 22 VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
VA +D+D +IRQVV S+ E + HLLNAEHHFSLFKSKF K YA++EEHD+RF+VFKANL
Sbjct: 19 VAFSDEDPLIRQVV-SETETDDSHLLNAEHHFSLFKSKFGKIYASEEEHDHRFKVFKANL 77
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
RRA+R QLLDP+A HG+TKFSDLTPSEFRR +LGL++ + +A+KAPILPT+DLP D+
Sbjct: 78 RRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP-KPKLNAEKAPILPTSDLPADY 136
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQQLVDCDHECDPE+
Sbjct: 137 DWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQ 196
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCD+GC+GGLM +AFEY LKAGG++REKDYPYTG G C FDKSKIAAAV+NFSVI
Sbjct: 197 DSCDAGCSGGLMTTAFEYTLKAGGLQREKDYPYTGKXG-KCHFDKSKIAAAVTNFSVIGL 255
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRF 321
DEDQ+AANLVKHGPLAVGINA WMQTY+GGVSCP IC K DHGVL+VGYGS GFAPIR
Sbjct: 256 DEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLICFKRQDHGVLLVGYGSHGFAPIRL 315
Query: 322 KEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
KEK YWIIKNSWGENWGE+GYYKIC G N+CGVD+MVS+V
Sbjct: 316 KEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVSTV 355
>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
Length = 363
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 258/348 (74%), Positives = 304/348 (87%), Gaps = 3/348 (0%)
Query: 21 AVAVNDDDAMIRQVVPSDGEQSE-DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
AV + D +IRQVV +D + E D LL+ EHHF LFK+KF +TY T+EEH+YR VFK+
Sbjct: 17 AVTADSSDPLIRQVVQNDETEIESDPLLDPEHHFKLFKNKFGRTYDTEEEHEYRLTVFKS 76
Query: 80 NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
NLRRAKR Q+LDPTA HGVTKFSDLTPSEFR+++LGL +L+LPADA KAPILPT++LP
Sbjct: 77 NLRRAKRHQVLDPTAKHGVTKFSDLTPSEFRKKYLGLKSKLKLPADANKAPILPTSNLPQ 136
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWRD GAVT VK+QG+CGSCWSFS TGALEG+HFL TGELVSLSEQQLVDCDHECDP
Sbjct: 137 DFDWRDKGAVTPVKNQGSCGSCWSFSTTGALEGSHFLQTGELVSLSEQQLVDCDHECDPA 196
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E SCDSGCNGGLMN+AFEYILKAGG+++E DYPYTG D G+CKFDKSKIAA+V+NFSV+
Sbjct: 197 EYNSCDSGCNGGLMNNAFEYILKAGGLQKEADYPYTGRD-GTCKFDKSKIAASVANFSVV 255
Query: 260 SSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAP 318
S+DEDQ+AANLV +GPLA+GINA WMQTYIG VSCPYIC K +DHGVL+VGYGS+G+AP
Sbjct: 256 STDEDQIAANLVTNGPLAIGINAAWMQTYIGQVSCPYICSKTKMDHGVLLVGYGSAGYAP 315
Query: 319 IRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHT 366
+RFKEKPYWIIKNSWGE+WGE+GYYK+C G N CG+D+MVS+V + +T
Sbjct: 316 LRFKEKPYWIIKNSWGEDWGEDGYYKLCSGYNACGMDTMVSAVVSTNT 363
>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
Length = 335
Score = 560 bits (1443), Expect = e-157, Method: Compositional matrix adjust.
Identities = 266/340 (78%), Positives = 305/340 (89%), Gaps = 7/340 (2%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D +IRQVV + +EDH+LNAEHHFS FKSKFSKTYAT+EEHDYRF VFK+N+RRAK
Sbjct: 1 DLLIRQVV----DDNEDHVLNAEHHFSTFKSKFSKTYATKEEHDYRFGVFKSNVRRAKLH 56
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
LDP+AVHGVTKFSDLTPSEFRRQFLGL + LRLP AQKAPILPT+DLP DFDWRD G
Sbjct: 57 AKLDPSAVHGVTKFSDLTPSEFRRQFLGL-KPLRLPEHAQKAPILPTHDLPEDFDWRDKG 115
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CGSCW+FS TGALEG+HFL+TGELVSLS+QQLVDCDH CDPE+ G+CDSG
Sbjct: 116 AVTHVKNQGSCGSCWAFSTTGALEGSHFLATGELVSLSDQQLVDCDHVCDPEQYGACDSG 175
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLMN+AFEYIL++GGV+RE+DYPYTG D G D++ AA+VSNFSV+S DEDQ++
Sbjct: 176 CNGGLMNNAFEYILESGGVQREEDYPYTGRDRGPA-IDEAN-AASVSNFSVVSLDEDQIS 233
Query: 268 ANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
ANLVK+GPLA+GINAV+MQTYIGGVSCPYICGK LDHGVL+VGYG +G+APIR KEKPYW
Sbjct: 234 ANLVKNGPLAIGINAVFMQTYIGGVSCPYICGKNLDHGVLLVGYGKAGYAPIRLKEKPYW 293
Query: 328 IIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTT 367
IIKNSWGE+WGENGYYKIC GRNVCGVDSMVS+VAA+HT+
Sbjct: 294 IIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAVHTS 333
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 559 bits (1441), Expect = e-157, Method: Compositional matrix adjust.
Identities = 264/353 (74%), Positives = 304/353 (86%), Gaps = 8/353 (2%)
Query: 18 LASAVA--VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
+A+AV N+DD +IRQVV + EDHLLNAEHHF+ FKSKFSK+YAT+EEHDYRF
Sbjct: 15 VATAVTDDTNNDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFG 70
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
VFK+NL +AK Q DPTA HG+TKFSDLT SEFRRQFLGL +RLRLPA AQKAPILPT
Sbjct: 71 VFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPTT 130
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
+LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLSEQQLVDCDH
Sbjct: 131 NLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHV 190
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFDKSK+ A+VSN
Sbjct: 191 CDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGRD-GSCKFDKSKVVASVSN 249
Query: 256 FSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSS 314
FSV++ DEDQ+AANLVK+GPLAV INA WMQTY+ GVSCPY+C K LDHGVL+VG+G
Sbjct: 250 FSVVTLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYVCAKSRLDHGVLLVGFGKG 309
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTT 367
+APIR KEKPYWIIKNSWG+NWGE GYYKIC GRNVCGVDSMVS+VAA +
Sbjct: 310 AYAPIRLKEKPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVSTVAAAQSN 362
>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
Length = 373
Score = 559 bits (1441), Expect = e-157, Method: Compositional matrix adjust.
Identities = 257/342 (75%), Positives = 303/342 (88%), Gaps = 4/342 (1%)
Query: 28 DAMIRQVVPSDG-EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
D +I QV +DG E +E LL AEHH+SLFK +F K+Y +Q+EHDYRF++F+ NLRRA R
Sbjct: 34 DPLIEQV--TDGHEGAEPQLLTAEHHYSLFKKRFKKSYGSQKEHDYRFKIFQVNLRRAAR 91
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q LDP+A HGVT+FSDLTP EFR+ +LGL RRLRLP DA +APILPT++LP DFDWR+
Sbjct: 92 HQNLDPSATHGVTQFSDLTPGEFRKAYLGL-RRLRLPKDATEAPILPTDNLPQDFDWREK 150
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VK+QG+CGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDHECDPEE+GSCDS
Sbjct: 151 GAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDS 210
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD G+CKFD +K+AA V+NFSV+S DEDQ+
Sbjct: 211 GCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTCKFDNTKVAAKVANFSVVSLDEDQI 270
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
AANL K+GPLAV INAV+MQTYIGGVSCPYIC K LDHGVL+VGYGS+G+AP+R K+KPY
Sbjct: 271 AANLFKNGPLAVAINAVFMQTYIGGVSCPYICSKRLDHGVLLVGYGSAGYAPVRMKDKPY 330
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
WIIKNSWGENWGENG+Y+IC GRN+CGVDSMVS+VAA++T S
Sbjct: 331 WIIKNSWGENWGENGFYRICRGRNICGVDSMVSTVAAVNTNS 372
>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
Length = 371
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 268/375 (71%), Positives = 317/375 (84%), Gaps = 16/375 (4%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+R L SLL+ L+ A+ V D+D +IRQVV SDGE +D LLNA+HHF+LFKSK+
Sbjct: 1 MDRFSLPSLLIHALT---AACVVRADEDPLIRQVV-SDGE--DDALLNADHHFTLFKSKY 54
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K+YATQEEHDYR VFKANLRRAKR Q+LDP+AVHGVTKFSDLTP EFRR +LG+ +
Sbjct: 55 GKSYATQEEHDYRLSVFKANLRRAKRHQMLDPSAVHGVTKFSDLTPKEFRRTYLGIRKSS 114
Query: 121 RL--------PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
PADA A ILPT+DLP DF+WRD+GAVTGVKDQG CGSCWSFS TG LEG
Sbjct: 115 SSKQKLKLKLPADAHAAEILPTSDLPFDFEWRDYGAVTGVKDQGLCGSCWSFSTTGTLEG 174
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+FL+TGEL+SL+EQ+LVDCDH CDP+++G+CD+GCNGGLM +A+EY+L++GG+E+EKDY
Sbjct: 175 TNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGGLMTTAYEYVLQSGGLEKEKDY 234
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
PYTG D G+CKFDKSKIAAAV+NFSV+S DEDQ+AANLVKHGPL+VGIN+++MQTYIGGV
Sbjct: 235 PYTGRD-GTCKFDKSKIAAAVANFSVVSLDEDQIAANLVKHGPLSVGINSIFMQTYIGGV 293
Query: 293 SCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
SCPYIC K LDHGVLIVGYG++G+APIRFK+KPYWIIKNSWGENWGE GYYKIC G N+
Sbjct: 294 SCPYICSKKNLDHGVLIVGYGAAGYAPIRFKDKPYWIIKNSWGENWGEEGYYKICRGNNI 353
Query: 352 CGVDSMVSSVAAIHT 366
CGVDSMVSSV A T
Sbjct: 354 CGVDSMVSSVTAAST 368
>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
Length = 360
Score = 558 bits (1437), Expect = e-156, Method: Compositional matrix adjust.
Identities = 260/342 (76%), Positives = 298/342 (87%), Gaps = 8/342 (2%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D MI QVV +G L AEHHF FK +F K YAT+EEH YRF VFK+N+ RA+R
Sbjct: 27 DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EFR LGL R + LP+DA APILPT++LP DFDWR+HG
Sbjct: 80 QLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWREHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH+CDPEE+GSCDSG
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCDSG 198
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLMNSAFEYIL GGV RE+DYPY+GT+GG+CKFDK+KIAA+V+NFSV+S DEDQ+A
Sbjct: 199 CNGGLMNSAFEYILNNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIA 258
Query: 268 ANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
ANLVK+GPLAV INAV+MQTY+GGVSCPY+C K L+HGVL+VGYGS +APIR K+KPYW
Sbjct: 259 ANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYW 318
Query: 328 IIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTSS 369
IIKNSWGENWGENGYYKIC GRN+CGVDSMVS+VAA+HTT +
Sbjct: 319 IIKNSWGENWGENGYYKICRGRNICGVDSMVSTVAALHTTGN 360
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 557 bits (1435), Expect = e-156, Method: Compositional matrix adjust.
Identities = 262/342 (76%), Positives = 304/342 (88%), Gaps = 7/342 (2%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D +IRQV +DG+ H+LNAEHHF+ FK+KF K+YATQEEHDYRF VF+ANLRRAK
Sbjct: 24 DPLIRQV--TDGDH---HMLNAEHHFTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKLH 78
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
LDP+A HGVTKFSDLTP EF+RQ+LGL + LRLP+ A KAPILPT+DLP +FDWRD G
Sbjct: 79 AKLDPSAEHGVTKFSDLTPEEFKRQYLGL-KPLRLPSTANKAPILPTSDLPENFDWRDKG 137
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CGSCW+FS TGALEGAH+LSTGELVSLSEQQLVDCDH CDPEE G+CD+G
Sbjct: 138 AVTPVKNQGSCGSCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAG 197
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLMN+AF+YIL+AGGV+ EKDYPY+G D +CKFDKSK+AA V+NFSV+S DEDQ+A
Sbjct: 198 CNGGLMNNAFDYILQAGGVQTEKDYPYSGRDE-TCKFDKSKVAATVANFSVVSLDEDQIA 256
Query: 268 ANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
ANLVKHGPLAVGINA++MQTYIGGVSCPYICGK LDHGVL+VGYG++G+APIRFK+KP+W
Sbjct: 257 ANLVKHGPLAVGINAIFMQTYIGGVSCPYICGKNLDHGVLLVGYGAAGYAPIRFKDKPFW 316
Query: 328 IIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTSS 369
IIKNSWGE+WGE+GYYKIC G+NVCGVDSMVSSV A TSS
Sbjct: 317 IIKNSWGESWGEDGYYKICRGKNVCGVDSMVSSVVATTFTSS 358
>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 268/371 (72%), Positives = 311/371 (83%), Gaps = 7/371 (1%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
M+RL L S+ +L V S+ VND DD +IRQVV +E +L +E HFSLFKS
Sbjct: 1 MDRLKLCFSVFVLFFLIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKS 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSEFRKKHLGVRA 116
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LP DA KAPILPT +LP DFDWRD GAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD 236
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLAV INA +MQTYIGGVSCPYIC
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC 296
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
+ L+HGVL+VGYGS+G+AP RFKEKPYWIIKNSWGE WGENG+YKIC GRN+CGVDS+V
Sbjct: 297 TRRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSLV 356
Query: 359 SSV-AAIHTTS 368
S+V AA+ TT+
Sbjct: 357 STVTAAVSTTA 367
>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 371
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 271/374 (72%), Positives = 311/374 (83%), Gaps = 15/374 (4%)
Query: 1 MERLILSSLLL-LLLSSVLASAV-------AVNDD-DAMIRQVVPSDGEQSEDHLLNAEH 51
MER L +LLS+ +A V AV+D+ D +IRQVV ++D L AE
Sbjct: 1 MERFNAIPLFFAILLSATVAYGVSSDQINSAVSDEEDILIRQVVSG----ADDRPLTAEQ 56
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
HF FK KF KTY T EEHDYRFRVFKANLR+AKR Q LDP AVHGVT+FSDLT SEFR
Sbjct: 57 HFQDFKLKFGKTYTTDEEHDYRFRVFKANLRKAKRHQKLDPDAVHGVTRFSDLTESEFRE 116
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
F+GLNR LRLPADA +APILPT++L +DFDWRD GAVT VKDQG+CGSCWSFSA GALE
Sbjct: 117 NFVGLNR-LRLPADAHQAPILPTDNLASDFDWRDQGAVTPVKDQGSCGSCWSFSAVGALE 175
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
GA+FLSTG+L+SLSEQQLVDCDHECDPEE+G+CD+GCNGGLM SAFEYI+KAGG+ERE+D
Sbjct: 176 GANFLSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLMTSAFEYIVKAGGLEREED 235
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGG 291
YPYTGTD GSCKF KIAA+ +NFSVIS+D DQ+AANLVK+GPLA+GINAV+MQTY+ G
Sbjct: 236 YPYTGTDRGSCKFQNGKIAASAANFSVISNDADQIAANLVKNGPLAIGINAVFMQTYMKG 295
Query: 292 VSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN 350
+SCPYIC K LDHGVL+VGYG++GFAPIR KEKPYWIIKNSWGENWGENGYY IC G+N
Sbjct: 296 ISCPYICSKRNLDHGVLLVGYGAAGFAPIRLKEKPYWIIKNSWGENWGENGYYFICKGKN 355
Query: 351 VCGVDSMVSSVAAI 364
+CG +SMVSSVAAI
Sbjct: 356 ICGSESMVSSVAAI 369
>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
Length = 373
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 264/372 (70%), Positives = 312/372 (83%), Gaps = 9/372 (2%)
Query: 3 RLILSSLLLLLL-----SSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFK 57
+L S +LL+L S ++A + + DD +IRQVV DG +E +L++E HFSLFK
Sbjct: 5 KLSFSVFVLLILFVSVSSGIVAETSSSDGDDLVIRQVV--DG--AEPKVLSSEDHFSLFK 60
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
KF K YA+ EEHDYR VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 61 RKFGKVYASSEEHDYRLSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSEFRKKHLGVR 120
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+LP DA KAPILPT +LP DFDWRD GAVT VK+QG+CGSCWSFSATGALEGA+FL+
Sbjct: 121 GGFKLPKDANKAPILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLA 180
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TG+LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ RE+DYPYTG
Sbjct: 181 TGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGK 240
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYI 297
DG +CK DKSKI A+VSNFSVIS DEDQ+AANLVK+GPLAV INA +MQTYIGGVSCPYI
Sbjct: 241 DGPTCKLDKSKIVASVSNFSVISIDEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYI 300
Query: 298 CGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
C + L+HGVL+VGYGS+G+AP RFKEKPYWIIKNSWGE+WGENG+YKIC GRN+CGVDS+
Sbjct: 301 CARRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSL 360
Query: 358 VSSVAAIHTTSS 369
VS+V+A +T++
Sbjct: 361 VSTVSATVSTTA 372
>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
Precursor
gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
Length = 368
Score = 553 bits (1424), Expect = e-155, Method: Compositional matrix adjust.
Identities = 266/371 (71%), Positives = 310/371 (83%), Gaps = 6/371 (1%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
M+RL L S+ +L V S+ VND DD +IRQVV +E +L +E HFSLFK
Sbjct: 1 MDRLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDHECDPEE+ SCDSGCNGGLMNSAFEY LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD 236
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLAV INA +MQTYIGGVSCPYIC
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC 296
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
+ L+HGVL+VGYG++G+AP RFKEKPYWIIKNSWGE WGENG+YKIC GRN+CGVDSMV
Sbjct: 297 TRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMV 356
Query: 359 SSVAAIHTTSS 369
S+VAA +T++
Sbjct: 357 STVAATVSTTA 367
>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 361
Score = 552 bits (1423), Expect = e-155, Method: Compositional matrix adjust.
Identities = 261/366 (71%), Positives = 305/366 (83%), Gaps = 8/366 (2%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL L S L L +SA+A +DDD +IRQVV + ++H+LNAEHHFSLFK+KF K
Sbjct: 1 RLFLLSFLAFAL---FSSAIAFSDDDPLIRQVVSGN---DDNHMLNAEHHFSLFKAKFGK 54
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
YA+QEEHD+R +VFKANL RAKR QLLDP+A HG+T+FSDLTPSEFRR +LGLN+ R
Sbjct: 55 IYASQEEHDHRLKVFKANLHRAKRHQLLDPSAEHGITQFSDLTPSEFRRTYLGLNKP-RP 113
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
+A+KAPILPT DLP+DFDWR+ GAVT VK+QG+CGSCWSFS TGA+EGAHFL+TGELV
Sbjct: 114 NLNAEKAPILPTKDLPSDFDWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHFLATGELV 173
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDP E CD+GCNGGLM +AFEY LKAGG++ EKDYPYTG +G C
Sbjct: 174 SLSEQQLVDCDHECDPVEKNDCDAGCNGGLMTTAFEYTLKAGGLQLEKDYPYTGRNG-KC 232
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
FDKS+IAA+VSNFSV+ DEDQ+AANL+KHGPLAVGINA WMQTY+ GVSCP IC K
Sbjct: 233 HFDKSRIAASVSNFSVVGLDEDQIAANLLKHGPLAVGINAAWMQTYVRGVSCPLICFKRQ 292
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
DHGVL+VGYGS GFAPIR K KPYWIIKNSWG+ WGE+GYYKIC G ++CGVD+MVS+V
Sbjct: 293 DHGVLLVGYGSEGFAPIRLKNKPYWIIKNSWGKTWGEHGYYKICRGHHICGVDAMVSTVT 352
Query: 363 AIHTTS 368
A HTT+
Sbjct: 353 ATHTTN 358
>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
Length = 359
Score = 551 bits (1421), Expect = e-154, Method: Compositional matrix adjust.
Identities = 259/341 (75%), Positives = 297/341 (87%), Gaps = 9/341 (2%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D MI QVV +G L AEHHF FK +F K YAT+EEH YRF VFK+N+ RA+R
Sbjct: 27 DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EF+ LGL R + LP+DA APILPT++LP DFDWR+HG
Sbjct: 80 QLLDPSAVHGVTQFSDLTPMEFQHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWREHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE-CDPEESGSCDS 206
AVT VK+QG+CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH+ CDPEE+GSCDS
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQQCDPEEAGSCDS 198
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMNSAFEYIL GGV RE+DYPY+GT+GG+CKFDK+KIAA+V+NFSV+S DEDQ+
Sbjct: 199 GCNGGLMNSAFEYILNNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQI 258
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
AANLVK+GPLAV INAV+MQTY+GGVSCPY+C K L+HGVL+VGYGS +APIR K+KPY
Sbjct: 259 AANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPY 318
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTT 367
WIIKNSWGENWGENGYYKIC GRN+CGVDSMVS+VAA+HTT
Sbjct: 319 WIIKNSWGENWGENGYYKICRGRNICGVDSMVSTVAALHTT 359
>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
Length = 374
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 272/367 (74%), Positives = 314/367 (85%), Gaps = 9/367 (2%)
Query: 5 ILSSLLLLLLSSVL-----ASAVAVND-DDAMIRQVVP-SDGEQSEDHLLNAEHHFSLFK 57
+LS +LLL SS L AS V+ ++ DD +IRQVV +D ++D LLNAEHHFS FK
Sbjct: 3 LLSRFVLLLFSSSLVFAATASTVSSDESDDLLIRQVVAGADDHDNDDLLLNAEHHFSSFK 62
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
+F K Y + +EHD RF VFKANLRRAKR Q+LDP+AVHGVT+F DLTP+EFRR +LGL
Sbjct: 63 KRFGKAYTSCDEHDRRFGVFKANLRRAKRNQILDPSAVHGVTQFFDLTPAEFRRTYLGL- 121
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+RLRLPAD +APILPTNDLP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+
Sbjct: 122 KRLRLPADTHEAPILPTNDLPADFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLA 181
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TG+LVSLSEQQLVDCDH CD E+ SCDSGCNGGLM SAFEY LKAGG+ERE+DYPYTGT
Sbjct: 182 TGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLMTSAFEYTLKAGGLEREEDYPYTGT 241
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYI 297
D CKFDK+KIA + SNFSV+S DE+Q+AANLV +GPLA+GINA++MQTYIGGVSCPYI
Sbjct: 242 DHSKCKFDKTKIAVSASNFSVVSLDENQIAANLVTNGPLAIGINAMFMQTYIGGVSCPYI 301
Query: 298 CGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDS 356
C K LDHGVL+VGYGS+GFAPIRFKEKPYWIIKNSWGE+WGE GYYKIC GRN+CG+DS
Sbjct: 302 CSKRLLDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGESWGEKGYYKICRGRNICGMDS 361
Query: 357 MVSSVAA 363
MVS+VAA
Sbjct: 362 MVSAVAA 368
>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
Length = 368
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 265/371 (71%), Positives = 310/371 (83%), Gaps = 6/371 (1%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
M+RL L S+ +L V S+ VND DD +IRQVV +E +L +E HFSLFK
Sbjct: 1 MDRLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDHECDPEE+ SCDSGCNGGLMNSAFE+ LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEHTLKTGGLMKEEDYPYTGKD 236
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLAV INA +MQTYIGGVSCPYIC
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC 296
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
+ L+HGVL+VGYG++G+AP RFKEKPYWIIKNSWGE WGENG+YKIC GRN+CGVDSMV
Sbjct: 297 TRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMV 356
Query: 359 SSVAAIHTTSS 369
S+VAA +T++
Sbjct: 357 STVAATVSTTA 367
>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 387
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 258/365 (70%), Positives = 307/365 (84%), Gaps = 4/365 (1%)
Query: 4 LILSSLLLLLLSSVLASAVAV-NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
+I + L S L S +V +D D +IRQVV +DG+ + H L AEHHFSLFK +F K
Sbjct: 10 VITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNH-HALGAEHHFSLFKRRFGK 68
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN-RRLR 121
+YAT+EEHD RF++FKAN+RRA+R Q DP+A+HGVT+FSDLTP EFR+ FLGL RLR
Sbjct: 69 SYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVTQFSDLTPFEFRKAFLGLRGHRLR 128
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
LP D APILPT +LP DFDWR HG VT VK+QG+CGSCWSFS TGALEGA+FL+TGEL
Sbjct: 129 LPVDTNAAPILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGANFLATGEL 188
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE +CDSGCNGGLMNSAFEY LKAGG+ +E+DYPY G D +
Sbjct: 189 VSLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYPYAGIDRNT 248
Query: 242 CKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
C FDKSKIAA+++NFSV++S DEDQ+AANLVK+GPLA+ INAV+MQTYIGGVSCP+IC K
Sbjct: 249 CNFDKSKIAASIANFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSK 308
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
LDHGVL+VGYGS+G+APIR ++K YWIIKNSWGE+WGENGYYKIC GRN+CGVDS+VS+
Sbjct: 309 RLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKICRGRNICGVDSLVST 368
Query: 361 VAAIH 365
VAA+H
Sbjct: 369 VAAVH 373
>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 368
Score = 550 bits (1416), Expect = e-154, Method: Compositional matrix adjust.
Identities = 263/364 (72%), Positives = 309/364 (84%), Gaps = 4/364 (1%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSK 59
M+RL LS S+ LL V AS+ DD +I+QVV DG +E ++L++E HFSLFK K
Sbjct: 1 MDRLKLSLSVFALLFIVVSASSDGNEGDDLVIKQVV--DG-GAEPNVLSSEDHFSLFKKK 57
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
F K YA++EEHDYRF VFK+NLRRA+R Q LDP+A HGVT+FSDLT SEF+R+ LG+
Sbjct: 58 FGKVYASREEHDYRFSVFKSNLRRARRHQKLDPSARHGVTQFSDLTRSEFKRKHLGVKGG 117
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+LP DA KAPILPT +LP +FDWR+ GAVT VK+QG+CGSCWSFSATGALEGA+FL+TG
Sbjct: 118 FKLPKDANKAPILPTENLPEEFDWRERGAVTPVKNQGSCGSCWSFSATGALEGANFLATG 177
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ RE+DYPYTG DG
Sbjct: 178 KLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGKDG 237
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG 299
+CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLAV INA +MQTYIGGVSCPYIC
Sbjct: 238 ATCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYICM 297
Query: 300 KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
+ L+HGVL+VGYGS+G+AP RFKEKPYWIIKNSWGE WGE+G+YKIC GRNVCGVDS+VS
Sbjct: 298 RRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGETWGEDGFYKICRGRNVCGVDSLVS 357
Query: 360 SVAA 363
+V A
Sbjct: 358 TVTA 361
>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 373
Score = 549 bits (1415), Expect = e-154, Method: Compositional matrix adjust.
Identities = 268/366 (73%), Positives = 313/366 (85%), Gaps = 5/366 (1%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
LI ++LL L S + S + IRQVVP E++++ LLNAEHHF+LFKSK+ KT
Sbjct: 9 LIAATLLAGSLGSTVISGEVTDGFVNPIRQVVP---EENDEQLLNAEHHFTLFKSKYEKT 65
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR-LRL 122
YATQ EHD+RFRVFKANLRRA+R QLLDP+AVHGVT+FSDLTP EFRR+FLGL RR RL
Sbjct: 66 YATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRL 125
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P D Q APILPT+DLPT+FDWR+ GAVT VK+QG CGSCWSFSA GALEGAHFL+T ELV
Sbjct: 126 PTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELV 185
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDP ++ SCDSGC+GGLMN+AFEY LKAGG+ +E+DYPYTG D +C
Sbjct: 186 SLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDHTAC 245
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
KFDKSKI A+VSNFSV+SSDEDQ+AANLV+HGPLA+ INA+WMQTYIGGVSCPY+C K
Sbjct: 246 KFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGVSCPYVCSKSQ 305
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG-RNVCGVDSMVSSV 361
DHGVL+VG+GSSG+APIR KEKPYWIIKNSWG WGE+GYYKIC G N+CG+D+MVS+V
Sbjct: 306 DHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMDTMVSTV 365
Query: 362 AAIHTT 367
AA+HT+
Sbjct: 366 AAVHTS 371
>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
Length = 361
Score = 549 bits (1415), Expect = e-154, Method: Compositional matrix adjust.
Identities = 259/341 (75%), Positives = 295/341 (86%), Gaps = 9/341 (2%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D MI QVV +G L AEHHF FK +F K YAT+EEH YRF VFK+N+ RA+R
Sbjct: 27 DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EFR LGL R + LP+DA APILPT++LP DFDWR+HG
Sbjct: 80 QLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWREHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE-CDPEESGSCDS 206
AVT VK+QG+CGSCWSFSATGALEGAHFLSTG+LVSLSEQQLVDCDHE CDPEE+GSCDS
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHEQCDPEEAGSCDS 198
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GC GGLMNSAFEYIL GGV RE+DYPY+GT GG+CKFD++KIAA+V+NFSV+S DEDQ+
Sbjct: 199 GCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQI 258
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
AANLVK+GPLAV INAV+MQTY+GGVSCPY+C K L+HGVL+VGYGS +APIR K+KPY
Sbjct: 259 AANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPY 318
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTT 367
WIIKNSWGENWGENGYYKIC GRNVCGVDSMVS+VAA+HTT
Sbjct: 319 WIIKNSWGENWGENGYYKICRGRNVCGVDSMVSTVAALHTT 359
>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
Length = 360
Score = 547 bits (1410), Expect = e-153, Method: Compositional matrix adjust.
Identities = 255/342 (74%), Positives = 295/342 (86%), Gaps = 8/342 (2%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D +IRQVV +G L AEHHF FK +F K Y ++EEH YRF VFK+N+ RA+R
Sbjct: 27 DPLIRQVVDGEG-------LGAEHHFLEFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EFR LGL R + LP+DA APIL T++LP DFDWR+HG
Sbjct: 80 QLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPSDADSAPILRTDNLPKDFDWREHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CG+CWSFSATGALEGAHFLSTG+LVSLSEQQLVDCDHECDPEE+GSCDSG
Sbjct: 139 AVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSG 198
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
C GGLMNSAFEYIL GGV RE+DYPY+GT GG+CKFD++KIAA+V+NFSV+S DEDQ+A
Sbjct: 199 CKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIA 258
Query: 268 ANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
ANLVK+GPLAV INAV+MQTY+GGVSCPY+C K L+HGVL+VGYGS +APIR K+KPYW
Sbjct: 259 ANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYW 318
Query: 328 IIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTSS 369
IIKNSWGENWGENGYYKIC GRNVCGVDSMVS+VAA+HTT +
Sbjct: 319 IIKNSWGENWGENGYYKICRGRNVCGVDSMVSTVAALHTTGN 360
>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
Length = 368
Score = 546 bits (1408), Expect = e-153, Method: Compositional matrix adjust.
Identities = 259/343 (75%), Positives = 294/343 (85%), Gaps = 7/343 (2%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
D+ MIRQV E D LNAE HF FK++F KTYAT EEHDYRF VFKANLRRAKR
Sbjct: 31 DNLMIRQV-----ESHVDDFLNAERHFEKFKARFQKTYATPEEHDYRFNVFKANLRRAKR 85
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
QLLDP+AVHGVT+FSDLTP+EFRR +LGLN LR PADAQ+APILPT++LPTDFDWR++
Sbjct: 86 HQLLDPSAVHGVTQFSDLTPAEFRRDYLGLNP-LRFPADAQQAPILPTDNLPTDFDWREN 144
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VK+QG CGSCWSFS GALEGAHFL+TG L SLSEQQLVDCD ECDPEE +CD
Sbjct: 145 GAVTPVKNQGNCGSCWSFSTIGALEGAHFLATGNLESLSEQQLVDCDRECDPEEYDACDD 204
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMN+AFEYILK GGVEREKDYPYTG D CKF++SKI A+VSNFSV+S DEDQ+
Sbjct: 205 GCNGGLMNNAFEYILKTGGVEREKDYPYTGRDRSPCKFNESKIVASVSNFSVVSIDEDQI 264
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
AANLVK+GPLAVGINAV+MQTY GVSCP++C LDHGVL+VGYGS+G++PIRFKEKPY
Sbjct: 265 AANLVKNGPLAVGINAVFMQTYTAGVSCPFLCSGELDHGVLLVGYGSAGYSPIRFKEKPY 324
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS-VAAIHTTS 368
WI+KNSW + WGE+GYY+IC G+N+CGVDSMVSS VAAI TTS
Sbjct: 325 WILKNSWSKYWGEHGYYRICRGQNMCGVDSMVSSVVAAIQTTS 367
>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
Length = 362
Score = 546 bits (1406), Expect = e-153, Method: Compositional matrix adjust.
Identities = 268/356 (75%), Positives = 311/356 (87%), Gaps = 8/356 (2%)
Query: 16 SVLASAVAV-NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
SV+A+A N+DD +IRQV + +D LLNAEHHF+ FKSKFSK+YAT+EEHDYRF
Sbjct: 13 SVVATATKDDNNDDFLIRQVT----DHEDDQLLNAEHHFTTFKSKFSKSYATKEEHDYRF 68
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
VFK+NL++AK Q LDP+A HGVTKFSDLT SEFRRQFLGL +RLRLPA AQKAPILPT
Sbjct: 69 GVFKSNLKKAKLHQKLDPSAEHGVTKFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPT 128
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
N+LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGA++L+TG+LVSLSEQQLVDCDH
Sbjct: 129 NNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLVSLSEQQLVDCDH 188
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
CDP+E SCDSGCNGGLMN+AFEY+L++GGV RE+DY YTG D GSCKFDKSKIAA+VS
Sbjct: 189 VCDPDEYNSCDSGCNGGLMNNAFEYLLQSGGVVREQDYSYTGRD-GSCKFDKSKIAASVS 247
Query: 255 NFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGS 313
NFSV+S DEDQ+AANLVK+GPLAV INA WMQTY+ GVSCPYIC K LDHGVL+VG+G
Sbjct: 248 NFSVVSVDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYICAKSRLDHGVLLVGFG- 306
Query: 314 SGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTSS 369
+GFAPIR KEKPYWIIKNSWG+NWGE GYYKIC GRN+CGVDSMVS+VAA+H +++
Sbjct: 307 NGFAPIRLKEKPYWIIKNSWGQNWGEEGYYKICRGRNICGVDSMVSTVAAVHASNN 362
>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
Length = 361
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 256/363 (70%), Positives = 299/363 (82%), Gaps = 6/363 (1%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L L + L V S D+D +IRQVV +++E +L++E HF+LFK KF K Y
Sbjct: 5 LRVLFSVSLIFVFVSVSVCGDEDVLIRQVV----DETEPKVLSSEDHFTLFKKKFGKVYG 60
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+ LG+ +LP D
Sbjct: 61 SIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKGGFKLPKD 120
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LVSLS
Sbjct: 121 ANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLS 180
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK GG+ REKDYPYTGTDGGSCK D
Sbjct: 181 EQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLD 240
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHG 305
+SKI A+VSNFSV+S +EDQ+AANL+K+GPLAV INA +MQTYIGGVSCPYIC + L+HG
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCPYICSRRLNHG 300
Query: 306 VLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIH 365
VL+VGYGS+GF+ R KEKPYWIIKNSWGE+WGENG+YKIC GRN+CGVDS+VS+VAA
Sbjct: 301 VLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTVAA-- 358
Query: 366 TTS 368
TTS
Sbjct: 359 TTS 361
>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 540 bits (1392), Expect = e-151, Method: Compositional matrix adjust.
Identities = 254/361 (70%), Positives = 301/361 (83%), Gaps = 8/361 (2%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R++ S LL V S D+D +IRQVV +++E +L++E HF+LFK KF K
Sbjct: 5 RVLFSVSLLF----VFVSVSICGDEDLLIRQVV----DEAEPKVLSSEDHFTLFKKKFGK 56
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
Y + EEH YRF VFKANLRRA R Q +DP+A HGVT+FSDLT SEFRR+ LG+ +L
Sbjct: 57 DYGSIEEHYYRFSVFKANLRRAMRHQKMDPSARHGVTQFSDLTGSEFRRKHLGVTGGFKL 116
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P DA +APILPT++LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LV
Sbjct: 117 PKDANQAPILPTHNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLV 176
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ RE+DYPYTGTDGGSC
Sbjct: 177 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGTDGGSC 236
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
K D+SKI A+VSNFSV+S +EDQ+AANLVK+GPLAV INA +MQTYIGGVSCPYIC + L
Sbjct: 237 KLDRSKIVASVSNFSVVSINEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYICSRRL 296
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
+HGVL++GYGSSG++ R KEKPYWIIKNSWGE+WGENG+YKIC GRN+CGVDS+VS+VA
Sbjct: 297 NHGVLLMGYGSSGYSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTVA 356
Query: 363 A 363
A
Sbjct: 357 A 357
>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 255/363 (70%), Positives = 298/363 (82%), Gaps = 6/363 (1%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L L + L V S D+D +IRQVV +++E +L++E HF+LFK KF K Y
Sbjct: 5 LRVLFSVSLIFVFVSVSVCGDEDVLIRQVV----DETEPKVLSSEDHFTLFKKKFGKVYG 60
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+ LG+ +LP D
Sbjct: 61 SIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKGGFKLPKD 120
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LVSLS
Sbjct: 121 ANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLS 180
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDHECDPEE GSCDSGCNG LMNSAFEY LK GG+ REKDYPYTGTDGGSCK D
Sbjct: 181 EQQLVDCDHECDPEEEGSCDSGCNGRLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLD 240
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHG 305
+SKI A+VSNFSV+S +EDQ+AANL+K+GPLAV INA +MQTYIGGVSCPYIC + L+HG
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCPYICSRRLNHG 300
Query: 306 VLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIH 365
VL+VGYGS+GF+ R KEKPYWIIKNSWGE+WGENG+YKIC GRN+CGVDS+VS+VAA
Sbjct: 301 VLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTVAA-- 358
Query: 366 TTS 368
TTS
Sbjct: 359 TTS 361
>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
[Cucumis sativus]
Length = 381
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 251/365 (68%), Positives = 300/365 (82%), Gaps = 10/365 (2%)
Query: 4 LILSSLLLLLLSSVLASAVAV-NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
+I + L S L S +V +D D +IRQVV +DG+ + H L AEHHFSLFK +F K
Sbjct: 10 VITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNH-HALGAEHHFSLFKRRFGK 68
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN-RRLR 121
+YAT+EEHD RF++FKAN+RRA+R Q DP+A+HGVT+FSDLTP EFR+ FLGL RLR
Sbjct: 69 SYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVTQFSDLTPFEFRKAFLGLRGHRLR 128
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
LP D APILPT +LP DFDWR HG VT VK+QG+CGSCWSFS TGALEGA+FL
Sbjct: 129 LPVDTNAAPILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGANFL----- 183
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
LSEQQLVDCDHECDPEE +CDSGCNGGLMNSAFEY LKAGG+ +E+DYPY G D +
Sbjct: 184 -XLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYPYAGIDRNT 242
Query: 242 CKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK 300
C FDKSKIAA++++FSV++S DEDQ+AANLVK+GPLA+ INAV+MQTYIGGVSCP+IC K
Sbjct: 243 CNFDKSKIAASIASFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSK 302
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
LDHGVL+VGYGS+G+APIR ++K YWIIKNSWGE+WGENGYYKIC GRN+CGVDS+VS+
Sbjct: 303 RLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKICRGRNICGVDSLVST 362
Query: 361 VAAIH 365
VAA+H
Sbjct: 363 VAAVH 367
>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
Length = 358
Score = 528 bits (1361), Expect = e-147, Method: Compositional matrix adjust.
Identities = 258/340 (75%), Positives = 295/340 (86%), Gaps = 8/340 (2%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D MI QVV +G L AEHHF FK +F K YAT+EEH YRF VFK+N+ RA+R
Sbjct: 27 DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EF+ LGL R + LP+DA APILPT++LP DFDWR HG
Sbjct: 80 QLLDPSAVHGVTQFSDLTPMEFQHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWRGHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH+CDPEE+GSC SG
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCGSG 198
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLMNSAFEYIL GGV RE+DYPY+GT+GG+CKFDK+KIAA+V+NFSV+S DEDQ+A
Sbjct: 199 CNGGLMNSAFEYILNNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIA 258
Query: 268 ANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
ANLVK+GPLAV INAV+MQTY+GGVSCPY+C K L+HGVL+VGYGS +APIR K+KPYW
Sbjct: 259 ANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYW 318
Query: 328 IIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTT 367
IIKNSWGENWGENGYYKIC GRN+CGVDSMVS+VAA+HTT
Sbjct: 319 IIKNSWGENWGENGYYKICRGRNICGVDSMVSTVAALHTT 358
>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 376
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 249/376 (66%), Positives = 300/376 (79%), Gaps = 13/376 (3%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
+ RL + +LLLS V A + V +D +I QVV G++ + LNAE HF+ F +F
Sbjct: 4 LRRLPIVVAAVLLLSGVAALSSPV--EDPLIEQVV--GGDEKNELELNAEAHFASFVQRF 59
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
+K+Y +EH +R VF ANLRRA+R Q LDP+AVHGVTKFSDLTP EFR +FLGL +
Sbjct: 60 NKSYRDADEHAHRLSVFTANLRRARRHQRLDPSAVHGVTKFSDLTPDEFRDRFLGLRKYR 119
Query: 121 R-----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
R L A AP LPT+ LPT+FDWR+HGAV VKDQG+CGSCWSFS +GALEGAH+
Sbjct: 120 RSFLKGLSGSAHDAPALPTDGLPTEFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHY 179
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
L+TG+L LSEQQ+VDCDHECDP E +CD+GCNGGLM +AF Y+ KAGG+E EKDYPYT
Sbjct: 180 LATGKLEVLSEQQMVDCDHECDPSEPRACDAGCNGGLMTTAFSYLAKAGGLETEKDYPYT 239
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP 295
G GG+CKFDKSKIAA V NFS ++ DEDQ+AANLVKHGPLA+GINAV+MQTYIGGVSCP
Sbjct: 240 GR-GGACKFDKSKIAAQVKNFSTVAVDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCP 298
Query: 296 YICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG---RNVC 352
+ICG++LDHGVL+VGYGS+G+AP+RFKEKPYWIIKNSWGENWGE+GYYKIC G +N C
Sbjct: 299 FICGRHLDHGVLLVGYGSAGYAPLRFKEKPYWIIKNSWGENWGESGYYKICRGAHVKNKC 358
Query: 353 GVDSMVSSVAAIHTTS 368
GVDSMVS+V AIHT++
Sbjct: 359 GVDSMVSTVTAIHTSN 374
>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
Length = 371
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 245/349 (70%), Positives = 284/349 (81%), Gaps = 11/349 (3%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +IRQVVP G D LNAE HF F +F K+Y +EH YR VFKANLRRA+R
Sbjct: 24 EDPLIRQVVP--GGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKANLRRARR 81
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
QLLDP+A HGVTKFSDLTP+EFRR +LGL + R L A +AP+LPT+ LP DF
Sbjct: 82 HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDF 141
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDHGAV VK+QG+CGSCWSFSA+GALEGAH+L+TG+L LSEQQ VDCDHECD E
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEP 201
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+DG CKFDKSKI A+V NFSV+S
Sbjct: 202 DSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSDG-KCKFDKSKIVASVQNFSVVSV 260
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRF 321
DE Q++ANL+KHGPLA+GINA +MQTYIGGVSCPYICG++LDHGVL+VGYG+SGFAPIR
Sbjct: 261 DEAQISANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFAPIRL 320
Query: 322 KEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
K+KPYWIIKNSWGENWGENGYYKIC G RN CGVDSMVS+V+A+H +
Sbjct: 321 KDKPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVSTVSAVHAS 369
>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 377
Score = 516 bits (1330), Expect = e-144, Method: Compositional matrix adjust.
Identities = 246/360 (68%), Positives = 286/360 (79%), Gaps = 11/360 (3%)
Query: 16 SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
S A+ D+D +IRQVV G +D+ L HF+ F +F KTY EEH +R
Sbjct: 18 SPAAATATAGDEDPLIRQVV--GGADGDDNDLELSSHFTSFVQRFGKTYKDAEEHAHRLS 75
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAP 130
VFKANLRRA+R QLLDP+A HG+TKFSDLTP+EFRR FLGL R + A AP
Sbjct: 76 VFKANLRRARRHQLLDPSAEHGITKFSDLTPAEFRRTFLGLKTSRRSFLREIGGSAHDAP 135
Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
+LPT+ LP DFDWRDHGAV VK+QG+CGSCWSFSA+GALEGA++L+TG++ LSEQQ V
Sbjct: 136 VLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMEVLSEQQFV 195
Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
DCDHECDPEE SCD+GCNGGLM SAF Y+LK+GG+EREKDYPYTG DG +CKFDKSKI
Sbjct: 196 DCDHECDPEEPDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGRDG-TCKFDKSKIV 254
Query: 251 AAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVG 310
A+V NFSV+S DE+Q+AANLVKHGPLA+GINA +MQTYIGGVSCPYICG+ LDHGVL+VG
Sbjct: 255 ASVQNFSVVSVDEEQIAANLVKHGPLAIGINAAYMQTYIGGVSCPYICGRSLDHGVLLVG 314
Query: 311 YGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
YG+SGFAP R K KPYW+IKNSWGENWGE GYYKIC G RN CGVDSMVS+VAA HT+
Sbjct: 315 YGASGFAPSRLKNKPYWVIKNSWGENWGEKGYYKICRGSNVRNKCGVDSMVSTVAAAHTS 374
>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
Length = 313
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 239/315 (75%), Positives = 273/315 (86%), Gaps = 2/315 (0%)
Query: 54 SLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
+LFK KF K Y + EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+
Sbjct: 1 ALFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKH 60
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
LG+ +LP DA +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGA
Sbjct: 61 LGVKGGFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGA 120
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
HFL+TG+LVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK GG+ REKDYP
Sbjct: 121 HFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYP 180
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVS 293
YTGTDGGSCK D+SKI A+VSNFSV+S +EDQ+AANL+K+GPLAV INA +MQTYIGGVS
Sbjct: 181 YTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVS 240
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCG 353
CPYIC + L+HGVL+VGYGS+GF+ R KEKPYWIIKNSWGE+WGENG+YKIC GRN+CG
Sbjct: 241 CPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICG 300
Query: 354 VDSMVSSVAAIHTTS 368
VDS+VS+VAA TTS
Sbjct: 301 VDSLVSTVAA--TTS 313
>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
Length = 371
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 244/349 (69%), Positives = 283/349 (81%), Gaps = 11/349 (3%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +IRQVVP G D LNAE HF F +F K+Y +EH YR VFK NLRRA+R
Sbjct: 24 EDPLIRQVVP--GGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARR 81
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
QLLDP+A HGVTKFSDLTP+EFRR +LGL + R L A +AP+LPT+ LP DF
Sbjct: 82 HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDF 141
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDHGAV VK+QG+CGSCWSFSA+GALEGAH+L+TG+L LSEQQ VDCDHECD E
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEP 201
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+DG CKFDKSKI A+V NFSV+S
Sbjct: 202 DSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSDG-KCKFDKSKIVASVQNFSVVSV 260
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRF 321
DE Q++ANL+KHGPLA+GINA +MQTYIGGVSCPYICG++LDHGVL+VGYG+SGFAPIR
Sbjct: 261 DEAQISANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFAPIRL 320
Query: 322 KEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
K+KPYWIIKNSWGENWGENGYYKIC G RN CGVDSMVS+V+A+H +
Sbjct: 321 KDKPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVSTVSAVHAS 369
>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
Group]
gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
Length = 373
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 244/361 (67%), Positives = 291/361 (80%), Gaps = 11/361 (3%)
Query: 15 SSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
S +A+A +++ +IRQVV G + LNAE HF+ F +F K+Y +EH YR
Sbjct: 14 SPAVAAASVPGEEEPLIRQVV--GGGDDNELELNAERHFASFVQRFGKSYRDADEHAYRL 71
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKA 129
VFKANLRRA+R QLLDP+A HGVTKFSDLTP+EFRR +LGL R L A +A
Sbjct: 72 SVFKANLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRAYLGLRTSRRAFLRGLGGSAHEA 131
Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
P+LPT+ LP DFDWRDHGAV VK+QG+CGSCWSFSA+GALEGA++L+TG++ LSEQQ+
Sbjct: 132 PVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMDVLSEQQM 191
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCDHECD E SCD+GCNGGLM +AF Y+LK+GG+E EKDYPYTG DG +CKFDKSKI
Sbjct: 192 VDCDHECDSSEPDSCDAGCNGGLMTNAFSYLLKSGGLESEKDYPYTGRDG-TCKFDKSKI 250
Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIV 309
+V NFSV+S DEDQ+AANLVKHGPLA+GINA +MQTYIGGVSCPYICG++LDHGVL+V
Sbjct: 251 VTSVQNFSVVSVDEDQIAANLVKHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLV 310
Query: 310 GYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHT 366
GYG+SGFAPIR K+K YWIIKNSWGENWGE+GYYKIC G RN CGVDSMVS+V+AIHT
Sbjct: 311 GYGASGFAPIRLKDKAYWIIKNSWGENWGEHGYYKICRGSNVRNKCGVDSMVSTVSAIHT 370
Query: 367 T 367
+
Sbjct: 371 S 371
>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
Length = 371
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 242/349 (69%), Positives = 282/349 (80%), Gaps = 11/349 (3%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +IRQVVP G + LNAE HF F +F K+Y EEH YR +FKANLRRA+R
Sbjct: 24 EDPLIRQVVP--GGDDNELELNAESHFLSFVQRFGKSYKDAEEHAYRLSIFKANLRRARR 81
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
QLLDP+A HGVTKFSDLTP+EFRR +LGL + R L A +AP+LPT+ LP DF
Sbjct: 82 HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGKSANEAPVLPTDGLPDDF 141
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDHGAVT VK+QG+CGSCWSFS +GALEGAH+L+TG+L LSEQQ+VDCDH CD E
Sbjct: 142 DWRDHGAVTPVKNQGSCGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHVCDTSEP 201
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+D CKFDKSKI A+V NFSV+S
Sbjct: 202 DSCDSGCNGGLMTNAFSYLQKAGGLESEKDYPYTGSDD-KCKFDKSKIVASVQNFSVVSV 260
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRF 321
DE Q+AANL+KHGPLA+GINA +MQTYIGGVSCPYICG+ LDHGVL+VGYG++GFAPIR
Sbjct: 261 DEGQIAANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRTLDHGVLLVGYGAAGFAPIRL 320
Query: 322 KEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
K+KPYWIIKNSWGENWGENGYYKIC G RN CGVDSMVS+V+A+ T+
Sbjct: 321 KDKPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVSTVSAVRTS 369
>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
vulgare]
gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 377
Score = 507 bits (1305), Expect = e-141, Method: Compositional matrix adjust.
Identities = 241/360 (66%), Positives = 288/360 (80%), Gaps = 11/360 (3%)
Query: 16 SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
S + A D++ +IRQVV G D+ L + F F +F KTY EEH +R
Sbjct: 18 SPAPATAAAGDEEPLIRQVV--GGADPLDNDLELDSQFVGFVQRFGKTYRDAEEHAHRLS 75
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAP 130
VFKANLRRA+R QLLDP+A HGVTKFSDLTP+EFRR +LGL R + A AP
Sbjct: 76 VFKANLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLKTTRRSFLREMAGSAHDAP 135
Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
+LPT+ LP DFDWRDHGAV VK+QG+CGSCWSFSA+GALEGA++L++G++ LSEQQLV
Sbjct: 136 VLPTDGLPEDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLASGKMEVLSEQQLV 195
Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
DCDHECDP E SCD+GCNGGLM SAF Y+LK+GG+EREKDYPYTG DG +CKFDKSKIA
Sbjct: 196 DCDHECDPSEPDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGKDG-TCKFDKSKIA 254
Query: 251 AAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVG 310
A+V N+SV++ DE+Q+AANLVK+GPLA+GINA +MQTYIGGVSCPYICG++LDHGVL+VG
Sbjct: 255 ASVQNYSVVAVDEEQIAANLVKYGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVG 314
Query: 311 YGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
YG+SGFAP RFKEKPYWIIKNSWGENWG+ GYYKIC G RN CGVDSMVS+V+A H++
Sbjct: 315 YGASGFAPSRFKEKPYWIIKNSWGENWGDKGYYKICRGSNVRNKCGVDSMVSTVSATHSS 374
>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
Length = 319
Score = 503 bits (1296), Expect = e-140, Method: Compositional matrix adjust.
Identities = 240/315 (76%), Positives = 275/315 (87%), Gaps = 4/315 (1%)
Query: 55 LFKSKF-SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
L + KF + YAT+EEHD+RF VFK+NLRRA P VHGVTKFSDLTP+EFRRQF
Sbjct: 7 LSRPKFRPRPYATKEEHDHRFGVFKSNLRRASCTPSSTPR-VHGVTKFSDLTPAEFRRQF 65
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
LGL + +R PA AQKAPILPT DLP DFDWRD GAVT VKDQG CGSCWSFS TGALEGA
Sbjct: 66 LGL-KAVRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDQGGCGSCWSFSTTGALEGA 124
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
++L+TGELVSLSEQQLVDCDH CDPEE G+CDSGCNGGLMN+AFEYIL++GGV++EKDYP
Sbjct: 125 YYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKEKDYP 184
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVS 293
YTG DG +CKFDK+K+AA VSN+SV+ DE+Q+AANLVK+GPLAV INAV+MQTY+GGVS
Sbjct: 185 YTGRDG-TCKFDKTKVAATVSNYSVVCLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVS 243
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCG 353
CPYICGK+LDHGVL+VGYG +APIRFK KPYWIIKNSWGE+WGENGY +IC GRNVCG
Sbjct: 244 CPYICGKHLDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWGENGYDEICRGRNVCG 303
Query: 354 VDSMVSSVAAIHTTS 368
VDSMVS+VAAI+ +S
Sbjct: 304 VDSMVSTVAAIYPSS 318
>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
Length = 381
Score = 500 bits (1287), Expect = e-139, Method: Compositional matrix adjust.
Identities = 238/348 (68%), Positives = 280/348 (80%), Gaps = 9/348 (2%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
++D +I QVV G + ED L+AE HF+ F+ +F +TY E YR VF ANLRRA+
Sbjct: 33 EEDPLIEQVV--GGGEEEDAQLDAEAHFASFERRFGRTYRDAGERAYRMSVFAANLRRAR 90
Query: 86 RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPTDFD 142
R Q LDPTA HGVTKFSDLTP EFR +FLGL R + + +APILPT+ LP DFD
Sbjct: 91 RHQRLDPTATHGVTKFSDLTPGEFRDRFLGLRRPSLEGLVGGEPHEAPILPTDGLPDDFD 150
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR+HGAV VKDQG+CGSCWSFS +GALEGAHFL+TG+L LSEQQ+VDCDHECD ES
Sbjct: 151 WREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESR 210
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
+CDSGCNGGLM +AF Y++K+GG++ EKDYPY G + +CKFDKSKI A V NFSVIS +
Sbjct: 211 ACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGREN-TCKFDKSKIVAQVKNFSVISVN 269
Query: 263 EDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFK 322
EDQ+AANLVKHGPLA+ INA +MQTYIGGVSCP+ICG++LDHGVL+VGYGS+G+APIRFK
Sbjct: 270 EDQIAANLVKHGPLAIAINAAYMQTYIGGVSCPFICGRHLDHGVLLVGYGSAGYAPIRFK 329
Query: 323 EKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
EKPYWIIKNSWGENWGE GYYKIC G +N CGVDSMVSSV AIHT+
Sbjct: 330 EKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSSVTAIHTS 377
>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 381
Score = 499 bits (1286), Expect = e-139, Method: Compositional matrix adjust.
Identities = 236/349 (67%), Positives = 283/349 (81%), Gaps = 12/349 (3%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +I QVV D E + LNAE HF+ F +F K+Y +EH++R VF+ANLRRA+R
Sbjct: 34 EDPLIEQVVGGDAENELE--LNAEAHFASFVRRFGKSYRDADEHEHRLSVFRANLRRARR 91
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
Q LDP+AVHG+TKFSDLTP EFR +FLGL + R + A AP LPT+ LPT+F
Sbjct: 92 HQRLDPSAVHGITKFSDLTPDEFRERFLGLRKSRRSFLKGISGSAHDAPALPTDGLPTEF 151
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+HGAV VKDQG+CGSCWSFS +GALEGA++L+TG+L LSEQQLVDCDHECDP E
Sbjct: 152 DWREHGAVGPVKDQGSCGSCWSFSTSGALEGANYLATGKLEVLSEQQLVDCDHECDPSEP 211
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
+CD+GCNGGLM +AF Y+ KAGG+E EKDYPYTG + +CKFDKSKIAA V NFS ++
Sbjct: 212 RACDAGCNGGLMTTAFSYLAKAGGLETEKDYPYTGRN-SACKFDKSKIAAQVKNFSTVAI 270
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRF 321
DEDQ+AANLVKHGPLA+GINAV+MQTYIGGVSCPYICG++LDH V +VGYGS+G+AP+RF
Sbjct: 271 DEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCPYICGRHLDH-VFLVGYGSAGYAPLRF 329
Query: 322 KEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
KEKPYWIIKNSWGENWGE+GYYKIC G +N CGVDSMVS+V AIHT+
Sbjct: 330 KEKPYWIIKNSWGENWGESGYYKICRGPHVKNKCGVDSMVSTVTAIHTS 378
>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
Length = 377
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 237/350 (67%), Positives = 282/350 (80%), Gaps = 11/350 (3%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
D++ +IRQVV G D+ L + F +F KTY EEH +R VFKANLRRA+
Sbjct: 28 DEEPLIRQVV--GGADPLDNDLELDSQLLGFVQRFGKTYRDAEEHAHRLSVFKANLRRAR 85
Query: 86 RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTD 140
R Q+LDP+A HGVTKFSDLTP+EFRR FLGL R + A AP+LPT+ LP D
Sbjct: 86 RHQMLDPSAEHGVTKFSDLTPAEFRRTFLGLKTTRRSFLREMAGSAHDAPVLPTDGLPED 145
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
FDWRDHGAV VK+QG+C SCWSFSA+GALEGA++L+TG++ LSEQQLVDCDHECDP E
Sbjct: 146 FDWRDHGAVGPVKNQGSCWSCWSFSASGALEGANYLATGKMEVLSEQQLVDCDHECDPAE 205
Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
SCD+GCNGGLM SAF Y+LK+GG+EREKDYPYTG DG +CKF+KSKIAA+V NFSV++
Sbjct: 206 PDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGKDG-TCKFEKSKIAASVQNFSVVA 264
Query: 261 SDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIR 320
DE+Q+AANLV++GPLA+GINA +MQTYIGGVSCPYICG++LDHGVL+VGYG+SGFAP R
Sbjct: 265 VDEEQIAANLVEYGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFAPSR 324
Query: 321 FKEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
FKEKPYWIIKNSWGENWG+ GYYKIC G RN CGVDSMVS+V+A H +
Sbjct: 325 FKEKPYWIIKNSWGENWGDKGYYKICRGSNVRNKCGVDSMVSTVSATHAS 374
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 486 bits (1252), Expect = e-135, Method: Compositional matrix adjust.
Identities = 241/368 (65%), Positives = 288/368 (78%), Gaps = 15/368 (4%)
Query: 7 SSLLL--LLLSSVLASAVAVNDDDAMIRQV---VPSDGE--QSEDHLLNAEHHFSLFKSK 59
S+LL + SV+ + A DD +IRQV V SD + + L NAE HF F +
Sbjct: 4 STLLFSAFCIFSVIFLSSATKPDDDLIRQVTDEVVSDPQILDARSALFNAEVHFRHFIRR 63
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ K Y+ EEH++RF VFK+NL RA Q LDP A HGVTKFSDLT EFR Q+LGL
Sbjct: 64 YGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGL--- 120
Query: 120 LRLPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
R P DA APILPTNDLP DFDWR+ GAVT VK+QG+CGSCW+FS TGALEGA+FL
Sbjct: 121 -RAPPLRDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGELVSLSEQQLVDCDHECDP ++ SCDSGCNGGLM SA++Y LK+GG+E+E+DYPYTG
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYI 297
D G+C F+K+KI A VSNFSV+S DE Q+AANLVK+GPL+VGINA +MQTY+GGVSCPY+
Sbjct: 240 D-GTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGGVSCPYV 298
Query: 298 CGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDS 356
C K LDHGVL+VGYG++ FAPIR K+KPYW+IKNSWG NWGENGYYK+C G NVCG+++
Sbjct: 299 CSKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLCRGHNVCGINN 358
Query: 357 MVSSVAAI 364
MVS+VAAI
Sbjct: 359 MVSTVAAI 366
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 486 bits (1251), Expect = e-135, Method: Compositional matrix adjust.
Identities = 241/368 (65%), Positives = 288/368 (78%), Gaps = 15/368 (4%)
Query: 7 SSLLL--LLLSSVLASAVAVNDDDAMIRQV---VPSDGE--QSEDHLLNAEHHFSLFKSK 59
S+LL + SV+ + A DD +IRQV V SD + + L NAE HF F +
Sbjct: 4 STLLFSAFCIFSVIFLSSATRPDDDLIRQVTDEVVSDPQILDARSALFNAEVHFRHFIRR 63
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ K Y+ EEH++RF VFK+NL RA Q LDP A HGVTKFSDLT EFR Q+LGL
Sbjct: 64 YGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGL--- 120
Query: 120 LRLPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
R P DA APILPTNDLP DFDWR+ GAVT VK+QG+CGSCW+FS TGALEGA+FL
Sbjct: 121 -RAPPLRDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGELVSLSEQQLVDCDHECDP ++ SCDSGCNGGLM SA++Y LK+GG+E+E+DYPYTG
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYI 297
D G+C F+K+KI A VSNFSV+S DE Q+AANLVK+GPL+VGINA +MQTY+GGVSCPY+
Sbjct: 240 D-GTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGGVSCPYV 298
Query: 298 CGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDS 356
C K LDHGVL+VGYG++ FAPIR K+KPYW+IKNSWG NWGENGYYK+C G NVCG+++
Sbjct: 299 CSKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLCRGHNVCGINN 358
Query: 357 MVSSVAAI 364
MVS+VAAI
Sbjct: 359 MVSTVAAI 366
>gi|40806502|gb|AAR92156.1| putative cysteine protease 3 [Iris x hollandica]
Length = 292
Score = 486 bits (1251), Expect = e-135, Method: Compositional matrix adjust.
Identities = 221/289 (76%), Positives = 256/289 (88%), Gaps = 1/289 (0%)
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR-RLRLPADAQKAPILPTNDLPT 139
+RRA+R Q LDPTAVHGVT+FSDLTP EF+R +LGL + + L A +AP+LPTNDLP
Sbjct: 1 MRRARRHQQLDPTAVHGVTQFSDLTPGEFKRTYLGLRKGKKHLVGSAHEAPLLPTNDLPE 60
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWRD GAVTGVK+QG+CGSCWSFS +GALEGA+FL+TG+L +LSEQQ+VDCDHECD E
Sbjct: 61 DFDWRDKGAVTGVKNQGSCGSCWSFSTSGALEGANFLATGKLETLSEQQMVDCDHECDAE 120
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E CD GCNGGLMN+AF+Y+ K GG+E EKDYPYTGTD G+CKFD+SKI A+V NFSV+
Sbjct: 121 EPDDCDQGCNGGLMNTAFQYLQKVGGLESEKDYPYTGTDRGTCKFDESKIKASVHNFSVV 180
Query: 260 SSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPI 319
S DE+Q+AANLVKHGPLA+ INAV+MQTYIGGVSCPYICGK+LDHGVL+VGYGS+G+API
Sbjct: 181 SIDEEQIAANLVKHGPLAIAINAVFMQTYIGGVSCPYICGKHLDHGVLLVGYGSAGYAPI 240
Query: 320 RFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
R KEKPYWIIKNSWGE WGENGYYKIC GRNVCGVDSMVS+V AIHTT+
Sbjct: 241 RLKEKPYWIIKNSWGETWGENGYYKICRGRNVCGVDSMVSTVTAIHTTA 289
>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
Length = 366
Score = 483 bits (1244), Expect = e-134, Method: Compositional matrix adjust.
Identities = 240/368 (65%), Positives = 287/368 (77%), Gaps = 15/368 (4%)
Query: 7 SSLLL--LLLSSVLASAVAVNDDDAMIRQV---VPSDGE--QSEDHLLNAEHHFSLFKSK 59
S+LL + SV+ + A DD +IRQV V SD + + L NAE HF F +
Sbjct: 4 STLLFSAFCIFSVIFLSSATRPDDDLIRQVTDEVVSDPQILDARSALFNAEVHFRHFIRR 63
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ K Y+ EEH++RF VFK+NL RA Q LDP A HGVTKFSDLT FR Q+LGL
Sbjct: 64 YGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEGFRHQYLGL--- 120
Query: 120 LRLPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
R P DA APILPTNDLP DFDWR+ GAVT VK+QG+CGSCW+FS TGALEGA+FL
Sbjct: 121 -RAPPLRDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGELVSLSEQQLVDCDHECDP ++ SCDSGCNGGLM SA++Y LK+GG+E+E+DYPYTG
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYI 297
D G+C F+K+KI A VSNFSV+S DE Q+AANLVK+GPL+VGINA +MQTY+GGVSCPY+
Sbjct: 240 D-GTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGGVSCPYV 298
Query: 298 CGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDS 356
C K LDHGVL+VGYG++ FAPIR K+KPYW+IKNSWG NWGENGYYK+C G NVCG+++
Sbjct: 299 CSKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLCRGHNVCGINN 358
Query: 357 MVSSVAAI 364
MVS+VAAI
Sbjct: 359 MVSTVAAI 366
>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
Length = 367
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 220/342 (64%), Positives = 273/342 (79%), Gaps = 8/342 (2%)
Query: 28 DAMIRQVVPSDGEQSEDHL------LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
D+ IR+V + ++S L L+ E HF F ++F K YAT E + +R +VF+ANL
Sbjct: 27 DSGIREVTDTARDESNGRLDAAKALLDVETHFKSFIARFGKAYATAEAYAHRLKVFEANL 86
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
RA Q LDP+AVHG+T+FSDLT EF++QFLGL RL +A KAP+LPTNDLP DF
Sbjct: 87 VRAVSHQALDPSAVHGITQFSDLTEEEFKQQFLGLRVPSRL-REANKAPVLPTNDLPEDF 145
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+HGAVT VK+QGACGSCW+FS TGA+EGAHFL TG+L+SLSEQQLVDCDH CDP +
Sbjct: 146 DWREHGAVTEVKNQGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDK 205
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCD+GCNGGLM +A++Y++K+GG+E E DYPYTG G C+F+ +KI A+V+NFS +S
Sbjct: 206 VSCDAGCNGGLMTNAYDYVMKSGGLETETDYPYTGNSNGKCQFNANKIVASVANFSTVSL 265
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIR 320
DEDQ+AANLVKHGPLA+GINAV+MQTYIGGVSCP IC K ++DHGVL+VGYG+ G+APIR
Sbjct: 266 DEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCPIICSKHHIDHGVLLVGYGAKGYAPIR 325
Query: 321 FKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
F EKPYWIIKNSWG WGE GYYKIC G +CG+++MVS+VA
Sbjct: 326 FTEKPYWIIKNSWGATWGEQGYYKICRGHGMCGMNTMVSTVA 367
>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
Length = 330
Score = 476 bits (1226), Expect = e-132, Method: Compositional matrix adjust.
Identities = 218/329 (66%), Positives = 269/329 (81%), Gaps = 4/329 (1%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
V +G++S LL+ E HF F ++F K YAT E + +R +VF+ANL RA Q LDP+A
Sbjct: 5 VVDNGDRSA--LLDVETHFKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDPSA 62
Query: 95 VHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
VHG+T+FSDLT EF++QFLGL RL +A KAP+LPTNDLP DFDWR+HGAVT VK+
Sbjct: 63 VHGITQFSDLTEEEFKQQFLGLRVPSRL-REANKAPVLPTNDLPEDFDWREHGAVTEVKN 121
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QGACGSCW+FS TGA+EGAHFL TG+L+SLSEQQLVDCDH CDP + SCD+GCNGGLM
Sbjct: 122 QGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMT 181
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
+A++Y++K+GG+E E DYPYTG G C+F+ +KI A+V+NFS +S DEDQ+AANLVKHG
Sbjct: 182 NAYDYVMKSGGLETETDYPYTGNSNGKCQFNANKIVASVANFSTVSLDEDQIAANLVKHG 241
Query: 275 PLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
PLA+GINAV+MQTYIGGVSCP IC K ++DHGVL+VGYG+ G+APIRF EKPYWIIKNSW
Sbjct: 242 PLAIGINAVFMQTYIGGVSCPIICSKHHIDHGVLLVGYGAKGYAPIRFTEKPYWIIKNSW 301
Query: 334 GENWGENGYYKICMGRNVCGVDSMVSSVA 362
G WGE GYYKIC G +CG+++MVS+VA
Sbjct: 302 GATWGEQGYYKICRGHGMCGMNTMVSTVA 330
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 476 bits (1225), Expect = e-132, Method: Compositional matrix adjust.
Identities = 233/370 (62%), Positives = 284/370 (76%), Gaps = 12/370 (3%)
Query: 5 ILSSLLLLLLSSVLASA-VAVNDDDAM----IRQVVPSDGEQSEDHL----LNAEHHFSL 55
ILS LL L+ ++ A A +D +A+ IR+V DGE D L LNAE HF+
Sbjct: 18 ILSLALLFLVPTITAHVHEASSDLNAVLPNPIREVTDMDGEGVIDDLRRGLLNAEAHFAH 77
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F KF+K Y+ EEH RF +FK NL +A R Q LD A+HG+ KFSDLT EF Q+LG
Sbjct: 78 FVKKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDRDAIHGINKFSDLTEEEFHEQYLG 137
Query: 116 LNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
L R L Q APILPT+DLP DFDWR+ GAVT VK+QGACGSCW+FS TGA+EGA+
Sbjct: 138 LTTPPRSLSQRTQPAPILPTDDLPPDFDWRELGAVTPVKNQGACGSCWTFSTTGAMEGAN 197
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F+ TG+L+SLSEQQLVDCDHECD E CDSGCNGGLM +A++Y LKAGG++RE+DYPY
Sbjct: 198 FMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQYALKAGGLQREEDYPY 257
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSC 294
TG D GSCKFD +K+AA V+NFS +S DEDQ+AANLVK+GPLAVGINA +MQTY+GGVSC
Sbjct: 258 TGID-GSCKFDNTKVAAMVANFSTVSIDEDQIAANLVKNGPLAVGINAAFMQTYVGGVSC 316
Query: 295 PYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCG 353
PY+C K LDHGVL+VGYG++G+AP R K KP+WIIKNSWG +WGE+GYYK+C G NVCG
Sbjct: 317 PYVCNKQNLDHGVLLVGYGAAGYAPGRLKNKPFWIIKNSWGPDWGEDGYYKLCRGHNVCG 376
Query: 354 VDSMVSSVAA 363
+++MVS+VAA
Sbjct: 377 INTMVSTVAA 386
>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
Length = 364
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 229/348 (65%), Positives = 271/348 (77%), Gaps = 26/348 (7%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
++D +I QVV G + ED L+AE HF+ F+ +F +TY RRA+
Sbjct: 33 EEDPLIDQVV--GGGEEEDAQLDAEAHFASFERRFGRTYP--------------GPRRAR 76
Query: 86 RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPTDFD 142
R LDPTA HGVTKFSDLTP EFR +FLGL R + + +APILPT+ LP DFD
Sbjct: 77 R---LDPTATHGVTKFSDLTPGEFRDRFLGLRRPSLEGLVGGEPHEAPILPTDGLPDDFD 133
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR+HGAV VKDQG+CGSCWSFS +GALEGAHFL+TG+L LSEQQ+VDCDHECD ES
Sbjct: 134 WREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESR 193
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
+CDSGCNGGLM +AF Y++K+GG++ EKDYPY G + +CKFDKSKI A V NFSVIS +
Sbjct: 194 ACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGREN-TCKFDKSKIVAQVKNFSVISVN 252
Query: 263 EDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFK 322
EDQ+AANLVKHGPLA+ INA +MQTYIGGVSCP+ICG++LDHGVL+VGYGS+G+APIRFK
Sbjct: 253 EDQIAANLVKHGPLAIAINAAYMQTYIGGVSCPFICGRHLDHGVLLVGYGSAGYAPIRFK 312
Query: 323 EKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
EKPYWIIKNSWGENWGE GYYKIC G +N CGVDSMVSSV AIHT+
Sbjct: 313 EKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSSVTAIHTS 360
>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
Length = 348
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 209/291 (71%), Positives = 241/291 (82%), Gaps = 7/291 (2%)
Query: 83 RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPT 139
R R LDPTA HGVTKFSDLTP EFR + LGL R + + +APILPT+ LP
Sbjct: 55 RELRAARLDPTATHGVTKFSDLTPGEFRDRLLGLRRPSLEGLVGGEPHEAPILPTDGLPD 114
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWR+HGAV VKDQG+CGSCWSFS +GALEGAHFL+TG+L LSEQQ+VDCDHECD
Sbjct: 115 DFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDAS 174
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
ES +CDSGCNGGLM +AF Y++K+GG++ EKDYPY G + +CKFDKSKI A V NFSVI
Sbjct: 175 ESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGREN-TCKFDKSKIVAQVKNFSVI 233
Query: 260 SSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPI 319
S +EDQ+AANLVKHGPLA+ INA +MQTYIGGVSCP+ICG++LDHGVL+VGYGS+G+API
Sbjct: 234 SVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCPFICGRHLDHGVLLVGYGSAGYAPI 293
Query: 320 RFKEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
RFKEKPYWIIKNSWGENWGE GYYKIC G +N CGVDSMVSSV AIHT+
Sbjct: 294 RFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSSVTAIHTS 344
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 211/366 (57%), Positives = 260/366 (71%), Gaps = 12/366 (3%)
Query: 4 LILSSLLLLLLSSVLAS-----AVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
L+L +++L + AS + DDA+ V EQ L+ AE F F
Sbjct: 6 LLLVGIVVLGFAGFAASLPTGDTIREVTDDALSNGSV----EQFAHALIGAEKRFESFMK 61
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
F K Y + EE+++RF VFK+NL +A + Q LDPTA HGVT FSDLT EF ++LGL R
Sbjct: 62 DFGKVYHSVEEYEHRFGVFKSNLLKALKHQALDPTASHGVTMFSDLTEEEFTSKYLGLKR 121
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
L + A +AP LPT DLP +FDWR+ GAV VKDQG CGSCW+FS TGA+EGAHFL++
Sbjct: 122 PSVL-SSAPQAPPLPTEDLPPNFDWREKGAVGPVKDQGGCGSCWAFSTTGAVEGAHFLNS 180
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDH+CD EE+ +CD+GCNGG M +A++Y+ AGG+E E DYPY G D
Sbjct: 181 GKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFMTNAYQYVEAAGGLELESDYPYEGRD 240
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
G CKFD +K+A VSNF+ I DEDQ+AA L+K GPLA+GINA +MQTYI GVSCP C
Sbjct: 241 -GKCKFDSNKVAVKVSNFTNIPVDEDQVAAYLIKSGPLAIGINAEFMQTYIAGVSCPIFC 299
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
K LDHGVL+VGY GFAP R KPYWIIKNSWG NWG+NGYYKIC G CG+++M
Sbjct: 300 NKRNLDHGVLLVGYAERGFAPARLAYKPYWIIKNSWGPNWGDNGYYKICRGHGECGLNTM 359
Query: 358 VSSVAA 363
VS+V+A
Sbjct: 360 VSAVSA 365
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 207/349 (59%), Positives = 255/349 (73%), Gaps = 5/349 (1%)
Query: 18 LASAVAVNDDDAMIRQVVPSDG--EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
L +++ + D + V DG EQ LL AE F F +F K Y T EE+++RF+
Sbjct: 19 LVASLPLRDVIQQVTDGVRVDGSVEQFAHALLGAEKQFESFIKEFGKVYHTVEEYEHRFK 78
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
VFK+NL RA + Q LDPTA HGVT FSDLT EF Q+LGL R L + A A LPT
Sbjct: 79 VFKSNLLRALKHQALDPTASHGVTMFSDLTEEEFATQYLGLKRPSAL-STAPTAEPLPTG 137
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
DLP FDWR+ GAV VK+QG+CGSCW+FS TGA+EGAHFL+TG+L+SLSEQQLVDCDH+
Sbjct: 138 DLPPSFDWREKGAVGPVKNQGSCGSCWAFSTTGAVEGAHFLATGKLLSLSEQQLVDCDHQ 197
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
CDPEE+ +CD+GC GGLM +A++Y+ +AGG+E E DYPY G D G C+F+ +K+AA VSN
Sbjct: 198 CDPEEAQACDAGCGGGLMTNAYKYVEEAGGLELESDYPYKGRD-GKCQFNPNKVAAKVSN 256
Query: 256 FSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSS 314
F+ I DEDQ+AA L+K GPLA+GINA +MQTY+ GVSCP C K LDHGVL+VGY
Sbjct: 257 FTNIPIDEDQVAAYLIKSGPLAIGINAEFMQTYVAGVSCPIFCNKRNLDHGVLLVGYAEH 316
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAA 363
GFAP R KPYWIIKNSWG WG+ GYYKIC G CG+++MVS+VAA
Sbjct: 317 GFAPARLAYKPYWIIKNSWGPMWGDKGYYKICRGHGECGLNTMVSAVAA 365
>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 367
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 200/364 (54%), Positives = 266/364 (73%), Gaps = 7/364 (1%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL--NAEHHFSLFKSKFS 61
++ +L L+ +L V + +D IRQV +D + +LL + E F LF S +
Sbjct: 1 MVAKALAQLITCIILFCHVVASVEDLTIRQVT-ADNRRIRPNLLGTHTESKFRLFMSDYG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR-- 119
K Y+T+EE+ +R +F N+ +A Q++DP+AVHGVT+FSDLT EF+R + G+
Sbjct: 60 KNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGG 119
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
R +AP++ + LP DFDWR+ G VT VK+QGACGSCW+FS TGA EGAHF+STG
Sbjct: 120 SRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTG 179
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+L+SLSEQQLVDCD CDP++ +CD+GC GGLM +A+EY+++AGG+E E+ YPYTG
Sbjct: 180 KLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKR- 238
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG 299
G CKFD K+A V NF+ I DE+Q+AANLV+HGPLAVG+NAV+MQTYIGGVSCP IC
Sbjct: 239 GHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICS 298
Query: 300 KY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
K ++HGVL+VGYGS GF+ +R KPYWIIKNSWG+ WGENGYYK+C G ++CG++SMV
Sbjct: 299 KRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMV 358
Query: 359 SSVA 362
S+VA
Sbjct: 359 SAVA 362
>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
Length = 257
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 192/241 (79%), Positives = 215/241 (89%), Gaps = 1/241 (0%)
Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
KAPILPT+DLP DFDWR+ GAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQ
Sbjct: 15 KAPILPTSDLPDDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQ 74
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
QLVDCDHECD E+ CD+GC GGLM +AFEY LKAGG++REKDYPYTG DG C FDKS
Sbjct: 75 QLVDCDHECDAEQQNECDAGCGGGLMTTAFEYTLKAGGLQREKDYPYTGRDG-KCHFDKS 133
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVL 307
KIAA+V+NFSV+ DEDQ+AANLVKHGPLAVGINA WMQTY+GGVSCP IC K DHGVL
Sbjct: 134 KIAASVANFSVVGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLICFKRQDHGVL 193
Query: 308 IVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTT 367
+VGYGS+GFAPIR KEKPYWIIKNSWGE+WGE GYYKIC GRN+CGVD+MVS+V A HTT
Sbjct: 194 LVGYGSAGFAPIRLKEKPYWIIKNSWGESWGEQGYYKICRGRNICGVDAMVSTVTAAHTT 253
Query: 368 S 368
+
Sbjct: 254 N 254
>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 200/365 (54%), Positives = 265/365 (72%), Gaps = 8/365 (2%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL--NAEHHFSLFKSKFS 61
++ +L L+ + V + +D IRQV +D + +LL + E F +F S +
Sbjct: 1 MVAKALAQLITCIIFFCHVVASVEDLTIRQVT-ADERRVRPNLLGTHTESKFRVFMSDYG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR-- 119
K Y+T+EE+ +R +F N+ +A Q++DPTAVHGVT+FSDLT EF+R + G+
Sbjct: 60 KNYSTREEYIHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGG 119
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
R A +AP++ + LP DFDWR+ G VT VK+QGACGSCW+FS TGA EGAHF+STG
Sbjct: 120 SRGHAVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTG 179
Query: 180 ELVSLSEQQLVDCDHE-CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
+L+SLSEQQLVDCD CDP++ +CD+GC GGLM +A+EY+++AGG+E E+ YPYTG
Sbjct: 180 KLLSLSEQQLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKR 239
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
G CKFD K+A V NF+ I DEDQ+AANLV+ GPLAVG+NAV+MQTYIGGVSCP IC
Sbjct: 240 -GHCKFDPEKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPLIC 298
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
K ++HGVL+VGYGS GF+ +R KPYWIIKNSWG+ WGENGYYK+C G ++CG++SM
Sbjct: 299 SKRKVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSM 358
Query: 358 VSSVA 362
VS+VA
Sbjct: 359 VSAVA 363
>gi|53748485|emb|CAH59428.1| cysteine protease 2 [Plantago major]
Length = 245
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 194/247 (78%), Positives = 223/247 (90%), Gaps = 3/247 (1%)
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
AD KAP LPT++LP +FDWR+ GAVT VK+QG+CGSCWSFS TGALEGA++L+TGEL+S
Sbjct: 1 ADENKAPKLPTSNLPEEFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGELIS 60
Query: 184 LSEQQLVDCDHECDPEESG-SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
LSEQQLVDCDHECDPEE SCD+GCNGGLMN+AFEY LKAGG+++EKDYPYTG DG +C
Sbjct: 61 LSEQQLVDCDHECDPEEGADSCDAGCNGGLMNNAFEYALKAGGLQKEKDYPYTGKDG-TC 119
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
KFDK+KIAA+V NFSV+S DEDQ+AANLVK+GPLAVGINA WMQTYIGGVSCPYICGK L
Sbjct: 120 KFDKTKIAASVHNFSVVSIDEDQIAANLVKYGPLAVGINAAWMQTYIGGVSCPYICGKSL 179
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
DHGVLIVGYG +G+AP+R K KPYWIIKNSWGE+WGE+GYYKIC GRNVCGV+SMVSSV
Sbjct: 180 DHGVLIVGYG-TGYAPVRLKNKPYWIIKNSWGESWGESGYYKICRGRNVCGVESMVSSVT 238
Query: 363 AIHTTSS 369
A H T++
Sbjct: 239 AAHFTTT 245
>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 195/365 (53%), Positives = 260/365 (71%), Gaps = 9/365 (2%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L+ ++ L LL S + SA A+ D +RQV +DGE + +E F +F K+ K+
Sbjct: 42 LLACAISLALLISAIPSATALRRDPEFLRQV--TDGEIFNNLPAGSERKFVMFMEKYGKS 99
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-- 121
Y T++E+ +RF +F NL RA Q LDPTAVHGVT+FSDL+ EF R F+G+
Sbjct: 100 YPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFMGVRGGAGGE 159
Query: 122 -LPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
LP Q + LP FDWRD GAVT VK QG CGSCW+FS GA+EGA+F++T
Sbjct: 160 GLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAFSTCGAVEGANFIAT 219
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCDH CDP + +C++GCNGGLM +A++Y++++GG+E E YPYTG
Sbjct: 220 GNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYTGRS 279
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
G C F KIA VSNF+ I DE+Q+AA+LV+ GPLAVG+NAV+MQTYIGGVSCP IC
Sbjct: 280 -GQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIGGVSCPLIC 338
Query: 299 GK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
GK +++HGVL+VGYG GF+ +RF++ PYW+IKNSWGE WGE+GYY++C G +CG+++M
Sbjct: 339 GKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWGEHGYYRLCRGHGMCGINTM 398
Query: 358 VSSVA 362
VS+V
Sbjct: 399 VSAVV 403
>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 195/365 (53%), Positives = 260/365 (71%), Gaps = 9/365 (2%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L+ ++ L LL S + SA A+ D +RQV +DGE + +E F +F K+ K+
Sbjct: 42 LLACAISLALLISAIPSATALRRDPEFLRQV--TDGEIFNNLPAGSERKFVMFMEKYGKS 99
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-- 121
Y T++E+ +RF +F NL RA Q LDPTAVHGVT+FSDL+ EF R F+G+
Sbjct: 100 YPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFMGVRGGAGGE 159
Query: 122 -LPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
LP Q + LP FDWRD GAVT VK QG CGSCW+FS GA+EGA+F++T
Sbjct: 160 GLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAFSTCGAVEGANFIAT 219
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCDH CDP + +C++GCNGGLM +A++Y++++GG+E E YPYTG
Sbjct: 220 GNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYTGRS 279
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
G C F KIA VSNF+ I DE+Q+AA+LV+ GPLAVG+NAV+MQTYIGGVSCP IC
Sbjct: 280 -GQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIGGVSCPLIC 338
Query: 299 GK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
GK +++HGVL+VGYG GF+ +RF++ PYW+IKNSWGE WGE+GYY++C G +CG+++M
Sbjct: 339 GKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWGEHGYYRLCRGHGMCGINTM 398
Query: 358 VSSVA 362
VS+V
Sbjct: 399 VSAVV 403
>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
Length = 363
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 197/364 (54%), Positives = 262/364 (71%), Gaps = 11/364 (3%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL--NAEHHFSLFKSKFS 61
++ +L L+ +L V + +D IRQV +D + +LL + E F LF S +
Sbjct: 1 MVAKALAQLITCIILFCHVVASVEDLTIRQVT-ADNRRIRPNLLGTHTESKFRLFMSDYG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR-- 119
K Y+T+EE+ +R +F N+ +A Q++DP+AVHGVT+FSDLT EF+R + G+
Sbjct: 60 KNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGG 119
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
R +AP++ + LP DFDWR+ G VT VK+QGACGSCW+FS TGA EGAHF+STG
Sbjct: 120 SRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTG 179
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+L+SLSEQQLVDCD + +CD+GC GGLM +A+EY+++AGG+E E+ YPYTG
Sbjct: 180 KLLSLSEQQLVDCDQ----ADKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKR- 234
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG 299
G CKFD K+A V NF+ I DE+Q+AANLV+HGPLAVG+NAV+MQTYIGGVSCP IC
Sbjct: 235 GHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICS 294
Query: 300 KY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
K ++HGVL+VGYGS GF+ +R KPYWIIKNSWG+ WGENGYYK+C G ++CG++SMV
Sbjct: 295 KRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMV 354
Query: 359 SSVA 362
S+VA
Sbjct: 355 SAVA 358
>gi|357473651|ref|XP_003607110.1| Cysteine proteinase [Medicago truncatula]
gi|355508165|gb|AES89307.1| Cysteine proteinase [Medicago truncatula]
Length = 331
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/367 (55%), Positives = 251/367 (68%), Gaps = 42/367 (11%)
Query: 2 ERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
+ +L S+L L S LA ++ + +D +I+QVV G AE+ F+ FK +F
Sbjct: 6 QTFMLFSVLFLFFSVDLAFSMPKDREDPIIQQVVDKGG---------AEYQFNEFKQRFG 56
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K Y++++EHDYRF VFK+NL RAKR ++DP+A HGVT+FSDLTP EFR LGL + +
Sbjct: 57 KVYSSKDEHDYRFNVFKSNLHRAKRHGIMDPSATHGVTRFSDLTPREFRNSILGL-KGVG 115
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
LP A+ APIL T +LP DFDWR+ GAVT V++QG CGS WSFS GALEGAHFLS+GEL
Sbjct: 116 LPRHAKAAPILSTENLPRDFDWREKGAVTPVRNQGFCGSSWSFSTIGALEGAHFLSSGEL 175
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQ VDCDHE YI K GG+ R +DY Y T+
Sbjct: 176 VSLSEQHHVDCDHE-----------------------YIQKYGGLMRVEDYTYYKTNTAR 212
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY 301
+ +NFS IS D++Q+ ANLVKHGPLA INAV+MQTY+GG+SCPYIC +
Sbjct: 213 ---------SVAANFSSISVDDNQITANLVKHGPLAAAINAVYMQTYVGGISCPYICTRR 263
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
LD GVL+VGYGS A ++ KEKPYWI+KNSWGE WGENGYYKIC GRN+CGVDSMVS+V
Sbjct: 264 LDLGVLLVGYGSGAGADMKEKEKPYWIVKNSWGETWGENGYYKICRGRNICGVDSMVSTV 323
Query: 362 AAIHTTS 368
AA HTT+
Sbjct: 324 AAAHTTT 330
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 191/360 (53%), Positives = 257/360 (71%), Gaps = 9/360 (2%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ L+ + L L + L++A + R++ D E LL E F +F + ++
Sbjct: 10 MCLARVSLFLCALTLSAAHGSTTVQDIARKLKLGDNE-----LLRTEKKFKVFMENYGRS 64
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
Y+T+EE+ R +F N+ RA Q LDPTAVHGVT+FSDLT EF + + G+N
Sbjct: 65 YSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSDLTEDEFEKLYTGVNGGFPSS 124
Query: 124 ADAQK--APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
+A AP L + LP +FDWR+ GAVT VK QG CGSCW+FS TG++EGA+FL+TG+L
Sbjct: 125 NNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKL 184
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQL+DCD++CD E SCD+GCNGGLM +A+ Y+L++GG+E E YPYTG + G
Sbjct: 185 VSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-ERGE 243
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG-K 300
CKFD KIA ++NF+ I +DE+Q+AA LVK+GPLA+G+NA++MQTYIGGVSCP IC K
Sbjct: 244 CKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYIGGVSCPLICSKK 303
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
L+HGVL+VGYG+ GF+ +R KPYWIIKNSWGE WGE+GYYK+C G +CG+++MVS+
Sbjct: 304 RLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGEDGYYKLCRGHGMCGINTMVSA 363
>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
Length = 403
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 192/367 (52%), Positives = 263/367 (71%), Gaps = 13/367 (3%)
Query: 4 LILSS-LLLLLLSSVLASAVAVND----DDAMIRQVVPSDGEQSEDHLLN--AEHHFSLF 56
L+L+ + LL++S+ ++ ++ +++ + I QV + + +HLLN ++ F F
Sbjct: 37 LVLAGCMFLLVISTQISFSLGLDNGRVSEGGFIAQVTE---KFNREHLLNLRSKTLFDKF 93
Query: 57 KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
+ K Y+T EE+ R R+F+ NL +A Q LDPTAVHG+T FSDLT EF ++ GL
Sbjct: 94 IVEHGKVYSTIEEYVRRLRIFEKNLLKAAENQALDPTAVHGITPFSDLTEYEFESRYTGL 153
Query: 117 -NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
R L + Q A ILP +DLP +FDWR+ GAVT VK QG CGSCW+FS TG +EGA+F
Sbjct: 154 LGVRQGLVNEKQTAEILPVDDLPANFDWREKGAVTEVKTQGNCGSCWAFSTTGVVEGANF 213
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
L+TG+L++LSEQQL+DCDH+CDP + +CD+GC+GGLM +A+ Y+++AGG+E K+YPYT
Sbjct: 214 LATGKLLNLSEQQLIDCDHKCDPLNTKACDNGCHGGLMTNAYNYLMEAGGIEEAKNYPYT 273
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP 295
G G CKF+ A NF+ ++ DE Q+AANLVKHGPLAVG+NA +MQTYIGGVSCP
Sbjct: 274 GVQ-GDCKFNPDLAAVKAINFTTVNLDEKQIAANLVKHGPLAVGLNAAFMQTYIGGVSCP 332
Query: 296 YICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGV 354
IC K +++HGVL+VGYG GFA +R +PYWIIKNSWG+ WGE+GYYK+C G CG+
Sbjct: 333 LICSKRFINHGVLLVGYGHKGFALLRLGYRPYWIIKNSWGKRWGEHGYYKLCRGHGECGM 392
Query: 355 DSMVSSV 361
+ MVS+V
Sbjct: 393 NKMVSAV 399
>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 377
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 187/364 (51%), Positives = 254/364 (69%), Gaps = 11/364 (3%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
SL+L L+ A V+D + + ++ LL E F++F + K Y+T+
Sbjct: 16 SLVLFALTLSSARQTTVHD--------IAKKLKLQDNQLLRTEKKFNVFMENYGKKYSTR 67
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
EE+ R +F N+ RA Q LDPTA+HGVT+FSDLT EF+R + G+N +
Sbjct: 68 EEYLQRLEIFAGNMLRAPENQALDPTAIHGVTQFSDLTEDEFQRHYTGVNGGFPWNNGVR 127
Query: 128 K-APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
AP L + LP DFDWR+ GAVT VK QG CGSCW+FS TG++EGA+F++TG+L++LSE
Sbjct: 128 DVAPPLKVDGLPEDFDWREKGAVTEVKMQGKCGSCWAFSTTGSIEGANFIATGKLLNLSE 187
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
QQLVDCD +CD ES +CD+GC GGLM +A++Y+L++GG+E E YPYTG G CKFD
Sbjct: 188 QQLVDCDSQCDITESTTCDNGCMGGLMTNAYKYLLQSGGLEEESSYPYTGAK-GECKFDP 246
Query: 247 SKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG-KYLDHG 305
K+A ++NF+ I DE+Q+AA LVKHGPLAVG+NA++MQTYIGGVSCP IC K+L+HG
Sbjct: 247 GKVAVRITNFTNIPVDENQIAAYLVKHGPLAVGLNAIFMQTYIGGVSCPLICSKKWLNHG 306
Query: 306 VLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIH 365
VL+VGY + GF+ +R KPYWIIKNSWG+ WG +GYYK+C G +CG+++MVS+
Sbjct: 307 VLLVGYRAKGFSILRLGNKPYWIIKNSWGKRWGVDGYYKLCRGHGMCGMNTMVSTAMVTQ 366
Query: 366 TTSS 369
T ++
Sbjct: 367 TQTA 370
>gi|357473731|ref|XP_003607150.1| Cysteine proteinase [Medicago truncatula]
gi|355508205|gb|AES89347.1| Cysteine proteinase [Medicago truncatula]
Length = 326
Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust.
Identities = 203/366 (55%), Positives = 252/366 (68%), Gaps = 48/366 (13%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L+L S+L L S LA + + +D +I+QVV G AEH F+ FK +F K
Sbjct: 7 LMLFSVLFLFFSVDLAFSTPNDREDPIIQQVVDKGG---------AEHQFNEFKQRFGKV 57
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
Y++++EHDYRF VFK+NL RAKR ++DP+A HGVT+FSDLTP EFR LGL + + LP
Sbjct: 58 YSSKDEHDYRFNVFKSNLHRAKRHVIMDPSATHGVTRFSDLTPREFRNSILGL-KGVGLP 116
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
A+ APIL + +LP DFDWR+ GAVT V++QG CGS WSFS GALEGA+FLSTGELVS
Sbjct: 117 RHAKAAPILSSENLPRDFDWREKGAVTPVRNQGFCGSSWSFSTIGALEGANFLSTGELVS 176
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LS+QQ VDCDH EYI K+GG+ R +DY Y
Sbjct: 177 LSDQQHVDCDH-----------------------EYIKKSGGLMRVEDYTYY-------- 205
Query: 244 FDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL 302
K+ IA +V +NFS + D+DQ+AANL+K+GPLAV INA +MQTY+GGVSCPY C + L
Sbjct: 206 --KTNIARSVAANFSSVLVDDDQIAANLLKYGPLAVAINAAYMQTYVGGVSCPYTCTRRL 263
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
DHGVL+VGYGS + KEKPYWI+K+SWGE WGENGYYKIC GRN+CGVDSMVS+VA
Sbjct: 264 DHGVLLVGYGSGAYT----KEKPYWIVKSSWGETWGENGYYKICRGRNICGVDSMVSTVA 319
Query: 363 AIHTTS 368
A TT+
Sbjct: 320 AAQTTT 325
>gi|1619905|gb|AAB16997.1| thiol protease isoform A, partial [Glycine max]
Length = 318
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 188/250 (75%), Positives = 216/250 (86%), Gaps = 4/250 (1%)
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+R PA AQKAPILPT DLP DFDWRD GAVT VKD G CGSCWSFS TGALE + +L+TG
Sbjct: 71 VRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDLGGCGSCWSFSTTGALEVSFYLATG 130
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
ELVSLSEQQLVDCDH CDPEE G+CDSGCNGGLMN+AFE IL++GGV++EKD PYTG D
Sbjct: 131 ELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE-ILQSGGVQKEKDIPYTGRD- 188
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG 299
G+CKFDK+K+ AA +S DE+Q+AANLVK+GPLAV INAV+MQTY+GGVSCPYICG
Sbjct: 189 GTCKFDKTKV-AATDLIKRVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICG 247
Query: 300 KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN-GYYKICMGRNVCGVDSMV 358
K+LDHGVL+VGYG +APIRFK KPYWIIKNSWGE+WGEN GY +IC GRNVCGVD+MV
Sbjct: 248 KHLDHGVLLVGYGEGRYAPIRFKNKPYWIIKNSWGESWGENDGYDEICRGRNVCGVDAMV 307
Query: 359 SSVAAIHTTS 368
S+VAAI+ +S
Sbjct: 308 STVAAIYASS 317
>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
Length = 327
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 179/319 (56%), Positives = 236/319 (73%), Gaps = 2/319 (0%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
+LL E F +F + +K YAT+EE+ +RF +F NL RA Q LDPTA+HGVT F DL
Sbjct: 6 NLLGTEEKFKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVEHQALDPTAIHGVTPFMDL 65
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
T EF R + G+ +P + + + LP FDWR+ GAVT VK QG+CGSCW+F
Sbjct: 66 TEEEFERMYAGVLGGGTVPVEKGSVSFMDASGLPDSFDWREKGAVTDVKIQGSCGSCWAF 125
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG++EGA+F++TG+L++LSEQQLVDCD CD + SCD GC GGLM +A+ Y+++AG
Sbjct: 126 STTGSVEGANFIATGKLLNLSEQQLVDCDRVCDKTDKASCDDGCGGGLMTNAYRYLIEAG 185
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G++ E YPYTG G CKFD KIA V+NF+ I+ DE+Q+AANLV HGPLA+G+NA++
Sbjct: 186 GLQEESSYPYTGKS-GECKFDPEKIAVKVANFTSIAVDENQIAANLVHHGPLAIGLNAIF 244
Query: 285 MQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
MQTYIGGVSCP ICG K+L+HGVL+VGYG+ G++ +RF KPYWIIKNSWG +WGE GYY
Sbjct: 245 MQTYIGGVSCPLICGKKWLNHGVLLVGYGARGYSILRFGYKPYWIIKNSWGNHWGEKGYY 304
Query: 344 KICMGRNVCGVDSMVSSVA 362
++C G +CG++ MVS+V
Sbjct: 305 RLCRGHGMCGMNKMVSAVV 323
>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
Length = 379
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/354 (53%), Positives = 246/354 (69%), Gaps = 12/354 (3%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L L LSS L + D V E ++ LL E F LF +SK Y+T E
Sbjct: 19 LCALTLSSSLHHETLIQD--------VARKLELKDNDLLTTEKKFKLFMKDYSKKYSTTE 70
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL-RLPADAQ 127
E+ R +F N+ +A Q LDPTA+HGVT+FSDL+ EF R + G A
Sbjct: 71 EYLLRLGIFAKNMVKAAEHQALDPTAIHGVTQFSDLSEEEFERFYTGFKGGFPSSNAAGG 130
Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
AP L P +FDWR+ GAVTG+K QG CGSCW+F+ TG++EGA+FL+TG+LVSLSEQ
Sbjct: 131 VAPPLDVKGFPENFDWREKGAVTGIKTQGKCGSCWAFTTTGSIEGANFLATGKLVSLSEQ 190
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
QLVDCD++CD ++ SCD+GCNGGLM +A++Y+++AGG+E E YPYTG G CKFD +
Sbjct: 191 QLVDCDNKCDITKT-SCDNGCNGGLMTTAYDYLMEAGGLEEETSYPYTGAQ-GECKFDPN 248
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGV 306
K+A VSNF+ I +DE+Q+AA LV HGPLA+ +NAV+MQTY+GGVSCP IC K L+HGV
Sbjct: 249 KVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAVFMQTYVGGVSCPLICSKRRLNHGV 308
Query: 307 LIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
L+VGY + GF+ +R ++KPYW IKNSWGE WGE GYYK+C G +CG+++MVS+
Sbjct: 309 LLVGYNAEGFSILRLRKKPYWTIKNSWGEQWGEKGYYKLCRGHGMCGMNTMVSA 362
>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
Length = 384
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 179/237 (75%), Positives = 207/237 (87%), Gaps = 4/237 (1%)
Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
T+ LP DFDWR+HGAV VKDQG+CGSCWSFS +GALEGAHFL+TG+L LSEQQ+VDCD
Sbjct: 145 TDGLPDDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCD 204
Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
HECD ES +CDSGCNGGLM +AF Y++K+GG++ EKDYPY G + +CKFDKSKI A V
Sbjct: 205 HECDASESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGREN-TCKFDKSKIVAQV 263
Query: 254 SNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGS 313
NFSVIS +EDQ+AANLVKHGPLA+ INA +MQTYIGGVSCP+ICG++LDHGVL+VGYGS
Sbjct: 264 KNFSVISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCPFICGRHLDHGVLLVGYGS 323
Query: 314 SGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVAAIHTT 367
+G+APIRFKEKPYWIIKNSWGENWGE GYYKIC G +N CGVDSMVSSV AIHT+
Sbjct: 324 AGYAPIRFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSSVTAIHTS 380
>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
Length = 397
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/387 (50%), Positives = 255/387 (65%), Gaps = 28/387 (7%)
Query: 4 LILSSLLLLLLSSVLASAVAVN-------DDDAMIRQVVPSD------GEQSEDHLL--- 47
++ +L + LLS L S+ D MIRQV + G S +H L
Sbjct: 9 MLTCTLAITLLSCALISSTTFQHEIQYRVQDPLMIRQVTDNHHHRHHPGRSSANHRLLGT 68
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPS 107
E HF F ++ KTY+T EE+ +R +F NL +A Q +DP+A+HGVT+FSDLT
Sbjct: 69 TTEVHFKSFVEEYEKTYSTHEEYVHRLGIFAKNLIKAAEHQAMDPSAIHGVTQFSDLTEE 128
Query: 108 EFRRQFLGLNRRLRLPADAQKAP----------ILPTNDLPTDFDWRDHGAVTGVKDQGA 157
EF ++GL + Q ++ +DLP FDWR+ GAVT VK QG
Sbjct: 129 EFEATYMGLKGGAGVGGTTQLGKDDGDESAAEVMMDVSDLPESFDWREKGAVTEVKTQGR 188
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TGA+EGA+F++TG+L+SLSEQQLVDCDH CD +E CD GC+GGLM +AF
Sbjct: 189 CGSCWAFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDLKEKDDCDDGCSGGLMTTAF 248
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y+++AGG+E E YPYTG G CKF+ K+A V NF+ I DE Q+AAN+V +GPLA
Sbjct: 249 NYLIEAGGIEEEVTYPYTGKR-GECKFNPEKVAVKVRNFAKIPEDESQIAANVVHNGPLA 307
Query: 278 VGINAVWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
+G+NAV+MQTYIGGVSCP IC K ++HGVL+VGYGS GF+ +R KPYWIIKNSWG+
Sbjct: 308 IGLNAVFMQTYIGGVSCPLICDKKRINHGVLLVGYGSRGFSILRLGYKPYWIIKNSWGKR 367
Query: 337 WGENGYYKICMGRNVCGVDSMVSSVAA 363
WGE+GYY++C G N+CG+ +MVS+V
Sbjct: 368 WGEHGYYRLCRGHNMCGMSTMVSAVVT 394
>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
[Glycine max]
Length = 374
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/356 (52%), Positives = 249/356 (69%), Gaps = 10/356 (2%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L+ + L L + L+SA + R++ D E LL E F +F + ++Y+
Sbjct: 12 LARVSLFLFALTLSSAHESTTVHDIARKLKVGDNE-----LLRTEKKFKVFMENYGRSYS 66
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T+EE+ R +F N+ RA Q LDPTAVHGVT+FSDLT EF + + G
Sbjct: 67 TREEYLRRLGIFSQNMLRAAEHQALDPTAVHGVTQFSDLTEVEFEKLYTGXPST---NTA 123
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
AP L LP +FDWR+ GAVT VK QG CGSCW+FS TG++EGA+FL+TG+LVSLS
Sbjct: 124 GGVAPPLEVEGLPENFDWREKGAVTEVKIQGRCGSCWAFSTTGSIEGANFLATGKLVSLS 183
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQL+DCD++C+ E SCD+GCNGGLM +A+ Y+L++GG+E E YPYTG + G CKFD
Sbjct: 184 EQQLLDCDNKCEITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-ERGECKFD 242
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG-KYLDH 304
KI ++NF+ I DE+Q+AA LVK+GPLA+G+NA++MQTYIGGVSCP IC K L+H
Sbjct: 243 PEKITVRITNFTNIPVDENQIAAYLVKNGPLAMGVNAIFMQTYIGGVSCPLICSKKRLNH 302
Query: 305 GVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
GVL+VGYG+ GF+ +R KPYWIIKNSWG+ WGE+GYYK+C G +CG+++MVS+
Sbjct: 303 GVLLVGYGAKGFSILRLGNKPYWIIKNSWGKKWGEDGYYKLCRGHGMCGINTMVSA 358
>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
Length = 375
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 178/321 (55%), Positives = 237/321 (73%), Gaps = 3/321 (0%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSD 103
D +L E F +F K+ K Y+++EE+ +R +F N+ RA Q LDPTA+HGVT FSD
Sbjct: 52 DGVLGTEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPTALHGVTPFSD 111
Query: 104 LTPSEFRRQFLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
L+ EF R F G+ R + A+ A L + LP FDWR+ GAVT VK QG CGSCW
Sbjct: 112 LSEEEFERMFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCW 171
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TGA+EGAHF+ST +L++LSEQQLVDCDH CD + +CDSGC GGLM +A++Y+++
Sbjct: 172 AFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIE 231
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
AGG+E E YPYTG G CKF ++A V NF+ + +E+Q+AANLV HGPLAVG+NA
Sbjct: 232 AGGLEEESSYPYTGKH-GECKFKPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNA 290
Query: 283 VWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
++MQTYIGGVSCP IC K +++HGVL+VGYG+ G++ +RF KPYWIIKNSWG+ WGE+G
Sbjct: 291 IFMQTYIGGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGKRWGEHG 350
Query: 342 YYKICMGRNVCGVDSMVSSVA 362
YY++C G +CG+++MVS+V
Sbjct: 351 YYRLCRGHGMCGMNTMVSAVV 371
>gi|255585361|ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
gi|223526784|gb|EEF29008.1| cysteine protease, putative [Ricinus communis]
Length = 381
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/318 (58%), Positives = 234/318 (73%), Gaps = 3/318 (0%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPS 107
N E +F +F K+ K Y T+EE+ +R VF NL RA Q+LDPTAVHG+T F DLT
Sbjct: 62 NTEENFKMFMIKYDKEYDTREEYMHRLGVFAKNLIRAAEHQVLDPTAVHGITPFMDLTEE 121
Query: 108 EFRRQFLGLNRRLRLPADAQKAP-ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF R + G+ + A+ A L T LP+ FDWR GAVT VK QGACGSCW+FS
Sbjct: 122 EFERMYTGVVGGGAVGAEGVTATSFLETAGLPSSFDWRKKGAVTDVKMQGACGSCWAFST 181
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGA+EGA+F++TG+L++LSEQQLVDCD CD +E +CD GC GGLM +A+ Y+++AGG+
Sbjct: 182 TGAIEGANFIATGKLLNLSEQQLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIEAGGL 241
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E E YPYTG G CKFD+ KIA V NF+ I DE+Q+AA+LV HGPLA+G+NAV+MQ
Sbjct: 242 EDEISYPYTGKP-GKCKFDEKKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQ 300
Query: 287 TYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
TYIGGVSCP ICG K+++HGVL+VGYG+ GF+ +R KPYWIIKNSWG+ WGE GYY+I
Sbjct: 301 TYIGGVSCPLICGKKWINHGVLLVGYGAKGFSILRLGYKPYWIIKNSWGKRWGEEGYYRI 360
Query: 346 CMGRNVCGVDSMVSSVAA 363
C G +CG+D MVS+V
Sbjct: 361 CKGYGMCGMDRMVSAVVT 378
>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
Length = 321
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 175/317 (55%), Positives = 232/317 (73%), Gaps = 3/317 (0%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
+ E F +F K+ K Y+++EE+ +R +F N+ RA Q LDP A+HGVT FSDL+
Sbjct: 1 MGGEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSE 60
Query: 107 SEFRRQFLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G+ R + A+ A L + LP FDWR+ GAVT VK QG CGSCW+FS
Sbjct: 61 EEFERMFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFS 120
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGA+EGAHF+ST +L++LSEQQLVDCDH CD + +CDSGC GGLM +A++Y+++AGG
Sbjct: 121 TTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGG 180
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWM 285
+E E YPYTG G CKF ++A V NF+ + BE+Q+AANLV HGPLAVG+NA +M
Sbjct: 181 LEEESSYPYTGKH-GECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXFM 239
Query: 286 QTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
QTYIGGVSCP IC K +++HGVL+VGYG+ G++ +RF KPYWIIKNSWG WGE+GYY+
Sbjct: 240 QTYIGGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGXRWGEHGYYR 299
Query: 345 ICMGRNVCGVDSMVSSV 361
+C G +CG+++MVS+V
Sbjct: 300 LCRGHGMCGMNTMVSAV 316
>gi|5679322|gb|AAD46920.1|AF167986_1 putative cysteine proteinase GmPM33 [Glycine max]
Length = 363
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 183/358 (51%), Positives = 247/358 (68%), Gaps = 22/358 (6%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ L+ + L L + L++A + R++ D E LL E F +F + ++
Sbjct: 10 MCLARVSLFLCALTLSAAHGSTTVQDIARKLKLGDNE-----LLRTEKKFKVFMENYGRS 64
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
Y+T+EE+ R +F N+ RA Q LDPTAVHGVT+FS +
Sbjct: 65 YSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSLPVSNN--------------- 109
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
A AP L + LP +FDWR+ GAVT VK QG CGSCW+FS TG++EGA+FL+TG+LVS
Sbjct: 110 AAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVS 169
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LS+QQL+DCD++CD E SCD+GCNGGLM +A+ Y+L++GG+E E YPYTG + G CK
Sbjct: 170 LSDQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-ERGECK 228
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG-KYL 302
FD KIA ++NF+ I +DE+Q+AA LVK+GPLA+G+NA++MQTYIGGVSCP IC K L
Sbjct: 229 FDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYIGGVSCPLICSKKRL 288
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
+HGVL+VGYG+ GF+ +R KPYWIIKNSWGE WGE+GYYK+C G +CG+++MVS+
Sbjct: 289 NHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGEDGYYKLCRGHGMCGINTMVSA 346
>gi|242045644|ref|XP_002460693.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
gi|241924070|gb|EER97214.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
Length = 373
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 190/355 (53%), Positives = 241/355 (67%), Gaps = 22/355 (6%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHL----LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
+ DD IRQV +DG +S L E F+ F + + Y+ EE+ R RVF AN
Sbjct: 19 STDDGFIRQV--TDGRRSRAGAGALGLLPEAQFAAFVRRHGRRYSGPEEYARRLRVFAAN 76
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK-----APILP-- 133
L RA Q LDPTA HGVT FSDLT EF + G+ R D Q+ AP P
Sbjct: 77 LARAAAHQALDPTARHGVTPFSDLTREEFEARLTGV--RAGAGGDVQRLVMSGAPAAPPA 134
Query: 134 ----TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
+ LP FDWRD GAVTGVK QGACGSCW+FS TGA+EGA+FL+TG+L+ LSEQQL
Sbjct: 135 SQEEVSRLPASFDWRDKGAVTGVKMQGACGSCWAFSTTGAVEGANFLATGKLLELSEQQL 194
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCDH C C++GC GGLM +A+ Y++K+GG+ ++ YPYTG G C+FD +K
Sbjct: 195 VDCDHTCSAVAQNECNNGCAGGLMTNAYAYLMKSGGLMEQRAYPYTGAP-GPCRFDPAKA 253
Query: 250 AAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVL 307
A V+NF+ + + DE Q+ A LV+ GPLAVG+NA +MQTY+GGVSCP +C + +++HGVL
Sbjct: 254 AVRVANFTAVPAGDEAQIRAALVRRGPLAVGLNAAFMQTYVGGVSCPLLCPRAWVNHGVL 313
Query: 308 IVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
+VGYG+ GFA +R +PYWIIKNSWGE WGE GYY++C G NVCGVDSMVS+VA
Sbjct: 314 LVGYGARGFAALRLGYRPYWIIKNSWGERWGEQGYYRLCRGSNVCGVDSMVSAVA 368
>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
Length = 381
Score = 369 bits (946), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 186/348 (53%), Positives = 233/348 (66%), Gaps = 17/348 (4%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD IRQV L E F+ F + + Y+ +E+ R RVF ANL RA
Sbjct: 34 DDKFIRQVTTQGTRAGAGPGLLPEAQFAAFVRRHGRRYSGPKEYARRLRVFAANLARAAA 93
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK----APILPTND------ 136
Q LDPTA HGVT FSDLT EF + GL R D Q+ P P
Sbjct: 94 HQALDPTARHGVTPFSDLTREEFEARLTGL----RAGGDVQRLMSGVPAAPPASKEEVAR 149
Query: 137 LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC 196
LP FDWRD GAVTGVK QGACGSCW+FS TGA+EGA+FL+TGELV LSEQQLVDCDH C
Sbjct: 150 LPASFDWRDKGAVTGVKTQGACGSCWAFSTTGAVEGANFLATGELVDLSEQQLVDCDHTC 209
Query: 197 DPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF 256
C++GC GGLM +A+ Y++++GG+ + YPYTG G C+FD +++A V+NF
Sbjct: 210 SAVAQNECNNGCAGGLMTNAYSYLMESGGLMEQSAYPYTGA-AGPCRFDPTQVAVRVANF 268
Query: 257 SVI-SSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSS 314
+ + + DE Q+ A LV+ GPLAVG+NA +MQTY+GGVSCP IC + +++HGVL+VGYG+
Sbjct: 269 TAVPAGDEAQIRAALVRRGPLAVGLNAAFMQTYVGGVSCPLICPRAWVNHGVLLVGYGAR 328
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
GFA +R +PYWIIKNSWG+ WGE GYY++C G NVCGVDSMVS+VA
Sbjct: 329 GFAALRLGYRPYWIIKNSWGKQWGEQGYYRLCRGSNVCGVDSMVSAVA 376
>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 376
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 191/347 (55%), Positives = 241/347 (69%), Gaps = 20/347 (5%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
IRQV +DG LL E F+ F + + Y+ EE+ R RVF ANL RA Q L
Sbjct: 29 IRQV--TDGGYWPPGLL-PEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQAL 85
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLGLN-------RRLRLPADAQKAPILPTNDLPTDFDW 143
DPTA HGVT FSDLT EF + GL RR +P+ A A + LP FDW
Sbjct: 86 DPTARHGVTPFSDLTREEFEARLTGLAADVGDDVRRRPMPS-AAPATEEEVSGLPASFDW 144
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
RD GAVT VK QGACGSCW+FS TGA+EGA+FL+TG L+ LSEQQLVDCDH CD E+
Sbjct: 145 RDRGAVTDVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTE 204
Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS--- 260
CDSGC GGLM +A+ Y++ +GG+ + YPYTG G+C+FD +++A V+NF+V++
Sbjct: 205 CDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ-GTCRFDANRVAVRVANFTVVAPPG 263
Query: 261 -SDED---QMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSG 315
+D D QM A LV+HGPLAVG+NA +MQTY+GGVSCP +C + +++HGVL+VGYG G
Sbjct: 264 GNDGDGDAQMRAALVRHGPLAVGLNAAYMQTYVGGVSCPLVCPRAWVNHGVLLVGYGERG 323
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
FA +R +PYWIIKNSWG+ WGE GYY++C GRNVCGVD+MVS+VA
Sbjct: 324 FAALRLGHRPYWIIKNSWGKAWGEQGYYRLCRGRNVCGVDTMVSAVA 370
>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
Length = 709
Score = 368 bits (944), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 190/346 (54%), Positives = 239/346 (69%), Gaps = 22/346 (6%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
IRQV +DG LL E F+ F + + Y+ EE+ R RVF ANL RA Q L
Sbjct: 29 IRQV--TDGGYWPPGLL-PEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQAL 85
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLGLN--------RRLRLP-ADAQKAPILPTNDLPTDF 141
DPTA HGVT FSDLT EF + GL RR RLP A A + LP+ F
Sbjct: 86 DPTARHGVTPFSDLTREEFEARLTGLATDVGDDDVRRRRLPMPSAAPATEEEVSGLPSSF 145
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRD GAVTGVK QGACGSCW+FS TGA+EGA+FL+TG L+ LSEQQLVDCDH CD E+
Sbjct: 146 DWRDRGAVTGVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKK 205
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS- 260
CDSGC GGLM +A+ Y++ +GG+ + YPYTG G+C+FD +++A V+NF+V++
Sbjct: 206 TECDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ-GACRFDANRVAVRVANFTVVAP 264
Query: 261 ------SDED-QMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYG 312
+D D QM A LV+HGPLAVG+NA +MQTY+GGVSCP +C + +++HGVL+VGYG
Sbjct: 265 AAGPGGNDGDAQMRAALVRHGPLAVGLNAAYMQTYVGGVSCPLVCPRAWVNHGVLLVGYG 324
Query: 313 SSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
GFA +R +PYWIIKNSWG+ WGE GYY++C GRNVCGVD+M+
Sbjct: 325 ERGFAALRLGHRPYWIIKNSWGKAWGEQGYYRLCRGRNVCGVDTML 370
>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
distachyon]
Length = 373
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 193/357 (54%), Positives = 246/357 (68%), Gaps = 18/357 (5%)
Query: 19 ASAVAVNDDDAMIRQVV----PSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT-QEEHDYR 73
A+A A DD +IRQV P+ LL E F+ F + K Y+ EE+ R
Sbjct: 19 AAAGASGDD--VIRQVTDNGAPAARRPPSPGLL-PEAKFAAFVRRHGKEYSGGAEEYARR 75
Query: 74 FRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR---LRLPADAQKAP 130
RVF ANL RA Q LDP A HGVT FSDLTP EF+ + GL ++ +PA A +A
Sbjct: 76 LRVFAANLARAAAHQALDPGARHGVTPFSDLTPEEFQARLTGLQQQGTNNNMPA-AARAT 134
Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
LP FDWR GAVT VK QG CGSCW+FS TGA+EGAHF++TG+L++LSEQQLV
Sbjct: 135 AEELATLPASFDWRAKGAVTEVKMQGMCGSCWAFSTTGAVEGAHFVATGKLLNLSEQQLV 194
Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
DCDH CD CDSGC+GGLM +A+ Y+++AGG+ + YPYTG G +C+FD +K+A
Sbjct: 195 DCDHTCDAVAKNECDSGCSGGLMTNAYTYLIRAGGLMEQAAYPYTGAQG-TCRFDANKVA 253
Query: 251 AAVSNFSVISSD-EDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG-KYLDHGVLI 308
V++F+ + D EDQ+ A+LV+ GPLAVG+NA +MQTY+GGVSCP +C K ++HGVL+
Sbjct: 254 VRVTSFTAVPPDDEDQIRASLVRAGPLAVGLNAAFMQTYLGGVSCPLLCPRKLINHGVLL 313
Query: 309 VGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVSSVA 362
VGYG+ G AP+R +PYWIIKNSWG+ WGE GYY++C G RNVCGVDSMVS+VA
Sbjct: 314 VGYGARGLAPLRLGYRPYWIIKNSWGKEWGEGGYYRLCRGARNRNVCGVDSMVSAVA 370
>gi|194352748|emb|CAQ00102.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 360 bits (923), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 185/340 (54%), Positives = 229/340 (67%), Gaps = 9/340 (2%)
Query: 30 MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
+IRQV S LL E F+ F + K Y+ EE+ R RVF AN+ RA Q
Sbjct: 28 VIRQVTDSGHGAGHPGLL-PEAQFAAFVRRHGKEYSGPEEYARRLRVFAANVARAAAHQA 86
Query: 90 LDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLPADAQKAPILPTNDLPTDFDW 143
LDP A HGVT FSDLT EF + GL R R A A LP FDW
Sbjct: 87 LDPGARHGVTPFSDLTREEFEARLTGLVGAGDVLRSARRMPAAAPATEEEVAALPASFDW 146
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
RD GAVT VK QG CGSCW+FS TGA+EGA+F++TG+L+ LSEQQLVDCDH CD
Sbjct: 147 RDKGAVTDVKMQGVCGSCWAFSTTGAVEGANFVATGKLLDLSEQQLVDCDHTCDAVAKTE 206
Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDE 263
C+SGC+GGLM +A+ Y++ +GG+ + YPYTG G C+FD+ K+A V+NF+ + DE
Sbjct: 207 CNSGCSGGLMTNAYRYLMSSGGLMEQAAYPYTGAQ-GPCRFDRGKVAVRVANFTAVPLDE 265
Query: 264 DQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYL-DHGVLIVGYGSSGFAPIRFK 322
DQM A LV+ GPLAVG+NA +MQTY+GGVSCP IC + + +HGVL+VGYG+ GF+ +R
Sbjct: 266 DQMRAALVRGGPLAVGLNAAFMQTYVGGVSCPLICPRAMVNHGVLLVGYGARGFSALRLG 325
Query: 323 EKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
+PYW+IKNSWG WGE GYYK+C GRNVCGVDSMVS+VA
Sbjct: 326 YRPYWLIKNSWGAQWGEGGYYKLCRGRNVCGVDSMVSAVA 365
>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
Length = 293
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 171/285 (60%), Positives = 215/285 (75%), Gaps = 8/285 (2%)
Query: 81 LRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLG---LNRRLRLPADAQKAPI--LPT 134
L RA +Q D +A HGVT+FSDLTP EF ++LG L+ R A+ I LPT
Sbjct: 3 LIRAATQQANDRGSAKHGVTRFSDLTPEEFAERYLGHVKLSSEHREKVRARGGVIEDLPT 62
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
LP +FDWR GAV+ VKDQG CGSCW+FS TGA+EGAHF+STG+LV LSEQQL+DCD
Sbjct: 63 KHLPAEFDWRFKGAVSRVKDQGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLLDCDV 122
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
CDP+ +CDSGCNGGL ++A EYI++ GG++ EK YPY G + G CK D+ + A +
Sbjct: 123 GCDPDVPNACDSGCNGGLPSNAMEYIVEHGGIDTEKSYPYVG-EKGECKADEGTLGATLK 181
Query: 255 NFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGS 313
NFS +SSDE QMAA LVKHGPL++GINA WMQTYIGGV+CP++C + LDHGVLIVGYGS
Sbjct: 182 NFSYVSSDEKQMAAALVKHGPLSIGINAAWMQTYIGGVACPWLCDSEALDHGVLIVGYGS 241
Query: 314 SGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
SGFAP+R++++PYWI+KNSW WGE GYY+IC + CG+++MV
Sbjct: 242 SGFAPVRWQQEPYWIVKNSWSPAWGEGGYYRICKDKGSCGINNMV 286
>gi|5777611|emb|CAB53397.1| cysteine protease [Medicago sativa]
Length = 209
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 166/210 (79%), Positives = 186/210 (88%), Gaps = 2/210 (0%)
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGS W+FS TGALEGA++L+TG+LVSLSEQQLVDCDH CDPEE SCDSGCNGGLMN+AF
Sbjct: 1 CGSGWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPEERNSCDSGCNGGLMNNAF 60
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
EYIL++GGV EKDY YTG D GSCKFDKSKI A+VSNFSV+S DEDQ+AANLVK+GPLA
Sbjct: 61 EYILQSGGVVSEKDYAYTGRD-GSCKFDKSKIVASVSNFSVVSLDEDQIAANLVKNGPLA 119
Query: 278 VGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
V INA WMQTY+ GVSCP+IC K LDHGVL+VG+GS G+APIR KEKPYWIIKNSWG+N
Sbjct: 120 VAINAAWMQTYMSGVSCPHICAKARLDHGVLLVGFGSGGYAPIRLKEKPYWIIKNSWGQN 179
Query: 337 WGENGYYKICMGRNVCGVDSMVSSVAAIHT 366
WGE GYYKIC GRNVCGVDSMVS+VAA +
Sbjct: 180 WGEEGYYKICRGRNVCGVDSMVSTVAAAQS 209
>gi|412992445|emb|CCO18425.1| unknown [Bathycoccus prasinos]
Length = 500
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 162/318 (50%), Positives = 220/318 (69%), Gaps = 24/318 (7%)
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDP----TAVHGVTKFSDLTPSEFRRQFLGL----- 116
T+EE++ R +F+ N +RA R++ D +A HGVTKF DL+ EFR Q+LGL
Sbjct: 188 TEEEYEKRMEIFQENWKRAIEREIDDRKGGGSAKHGVTKFFDLSEEEFREQYLGLLSTST 247
Query: 117 --------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R+ ++ A +++ LP +DWR GAVT VKDQG CGSCW+FS TG
Sbjct: 248 SSSASKDAFRKHQMEAPSEE----DLEKLPQYYDWRARGAVTPVKDQGQCGSCWTFSTTG 303
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EGA+F+ TG+LVSLSEQQL+DCD C P+ +CDSGCNGGL ++A EYI++ GG++
Sbjct: 304 AIEGANFIKTGKLVSLSEQQLLDCDVGCAPDIPNACDSGCNGGLPSNAMEYIVEHGGLDT 363
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTY 288
EK YPY +C+ + K+ A +SN++ + +E MA LVK+GPL++GINA WMQ+Y
Sbjct: 364 EKSYPYKAYKEDTCRAKEGKLGATISNYTFVGKNETHMAHALVKYGPLSIGINAAWMQSY 423
Query: 289 IGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
+GGV+CP++C K LDHGVLIVGYG GFAP R ++PYW+IKNSWG WGE GYY+IC
Sbjct: 424 VGGVACPWLCNKDALDHGVLIVGYGEEGFAPARLHKEPYWVIKNSWGMGWGEEGYYRICK 483
Query: 348 GRNVCGVDSMVSSVAAIH 365
+ CGV++MV VAA++
Sbjct: 484 DKGNCGVNNMV--VAALN 499
>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 329
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 172/331 (51%), Positives = 218/331 (65%), Gaps = 23/331 (6%)
Query: 50 EHHFSLFKSKFSKTYATQ-EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSE 108
E F F + KTYA+ +E+ R +F N+ RAK D A +G T F+DLT E
Sbjct: 5 ERDFDAFVLEHGKTYASDAKEYAKRLEIFAENMARAKEMSARD-GAEYGATPFADLTEDE 63
Query: 109 FRRQFLGLNRRLRLPADAQKA------------PILPTNDLPTDFDWRDHGAVTGVKDQG 156
F L +R P DA + P LPT ++P +FDWR GAVT VK+QG
Sbjct: 64 FASSLL-----MREPIDAARVERLKRHESSRVLPHLPTENIPLNFDWRALGAVTPVKNQG 118
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCWSFSATGA+EGAHF+ +G LVSLSEQQLVDCDH CDP+ +CDSGC+GGL +A
Sbjct: 119 MCGSCWSFSATGAVEGAHFVKSGALVSLSEQQLVDCDHTCDPDSGTACDSGCDGGLPANA 178
Query: 217 FEYILKAGGVEREKDYPYTGTDG-GSCKF-DKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
Y++K GG++ E YPY G G G CK + AA ++N+S +S+DE Q+AA LVKHG
Sbjct: 179 MAYVVKRGGLDAEAAYPYLGARGDGRCKSKEDGPPAATITNYSFVSADESQIAAALVKHG 238
Query: 275 PLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIR-FKEKPYWIIKNS 332
PL+VGI+A WMQ Y GV+CP+ C K LDHGVLIVG+G+ G AP R F+ +P+W+IKNS
Sbjct: 239 PLSVGIDARWMQLYRRGVACPWACDKTRLDHGVLIVGFGAEGRAPARGFRREPFWLIKNS 298
Query: 333 WGENWGENGYYKICMGRNVCGVDSMVSSVAA 363
WG WGE GYYKIC + CGV++MV + A
Sbjct: 299 WGARWGEEGYYKICKDKGSCGVNTMVLAAQA 329
>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 272
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 157/271 (57%), Positives = 196/271 (72%), Gaps = 8/271 (2%)
Query: 101 FSDLTPSEFRRQFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
FSDLT EF ++LG R R + LP LP +FDWR GAVT VKD
Sbjct: 2 FSDLTAEEFAARYLGHVRLSSEEREKRKARGGETLETLPVEHLPEEFDWRFKGAVTRVKD 61
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG CGSCW+FS TGA+EGAHF+STG+LV LSEQQLVDCD CDP+ +CDSGCNGGL +
Sbjct: 62 QGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLVDCDVGCDPDVPNACDSGCNGGLPS 121
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
+A EYI++ GG++ EK YPY G + G CK K K+ A + NFS +S DE QMAA LVK+G
Sbjct: 122 NAMEYIVEHGGIDTEKSYPYVG-EKGECKAKKGKLGATLKNFSFVSDDEKQMAAALVKYG 180
Query: 275 PLAVGINAVWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
PL++GINA WMQ+YIGGV+CP++C + LDHGVLIVGYGSSGFAP+R+ +PYWI+KNSW
Sbjct: 181 PLSIGINAAWMQSYIGGVACPWLCDAESLDHGVLIVGYGSSGFAPVRWAPEPYWIVKNSW 240
Query: 334 GENWGENGYYKICMGRNVCGVDSMVSSVAAI 364
WGE GYY+IC + CG+++MV + +
Sbjct: 241 SPAWGEGGYYRICKDKGSCGINNMVVAAHGV 271
>gi|388519111|gb|AFK47617.1| unknown [Medicago truncatula]
Length = 241
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 155/202 (76%), Positives = 177/202 (87%), Gaps = 4/202 (1%)
Query: 24 VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
N DD +IRQVV + +EDH+LNAEHHF+ FKSKFSK YAT+EEHDYRF VFK+NL +
Sbjct: 26 TNSDDLLIRQVV----DTAEDHILNAEHHFTSFKSKFSKNYATKEEHDYRFGVFKSNLIK 81
Query: 84 AKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDW 143
AK Q LDP+A HG+TKFSDLT SEFRRQFLGLN+RLRLPA AQKAPILPTN+LP DFDW
Sbjct: 82 AKLHQKLDPSAQHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTNNLPEDFDW 141
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
R+ GAVT VKDQG+CGSCW+FS TGALEGA++L+TG+L SLSEQQLVDCDH CDPEE GS
Sbjct: 142 REKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGS 201
Query: 204 CDSGCNGGLMNSAFEYILKAGG 225
CDSGCNGGLMN+AFEYIL++GG
Sbjct: 202 CDSGCNGGLMNNAFEYILQSGG 223
>gi|52546916|gb|AAU81591.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 190
Score = 330 bits (845), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 154/189 (81%), Positives = 170/189 (89%), Gaps = 1/189 (0%)
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
ELVSLSEQQLVDCDHECDPEE SCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD
Sbjct: 3 ELVSLSEQQLVDCDHECDPEEKDSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 62
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG 299
CKFD +K+AA V+NFSV+S DE+Q+AANLVK+GPLAV INAV+MQTY+GGVSCPYIC
Sbjct: 63 AKCKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICS 122
Query: 300 KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
K DHGVL+VGYG SGFAPIR KEKPYWIIKNSWGE WGE+GYYKIC GRNVCGVDSMVS
Sbjct: 123 KRQDHGVLLVGYG-SGFAPIRMKEKPYWIIKNSWGEKWGESGYYKICRGRNVCGVDSMVS 181
Query: 360 SVAAIHTTS 368
+VAA+ T+S
Sbjct: 182 TVAAVSTSS 190
>gi|353441136|gb|AEQ94152.1| drought-inducible cysteine proteinase [Elaeis guineensis]
Length = 252
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 167/241 (69%), Positives = 195/241 (80%), Gaps = 7/241 (2%)
Query: 11 LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHL-LNAEHHFSLFKSKFSKTYATQEE 69
+ L +SV +S + +DD +I QVVP E ED L LNAE HFS F +F K+YA ++E
Sbjct: 15 VALSASVASSWPSYAEDDPLIVQVVP---ESDEDELRLNAEAHFSSFLRRFGKSYADEKE 71
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP---ADA 126
H YRF VFKANLRRA+R Q +DPTAVHG+TKFSDLTP+EFRR +LGL RL A +
Sbjct: 72 HAYRFSVFKANLRRARRHQKMDPTAVHGITKFSDLTPAEFRRTYLGLRGGRRLRRALASS 131
Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
+APILPTN+LPTDFDWRDHGAVTGVKDQG+CGSCWSFSA+GALEGA+FL+TG+L SLSE
Sbjct: 132 HEAPILPTNNLPTDFDWRDHGAVTGVKDQGSCGSCWSFSASGALEGANFLATGQLESLSE 191
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
QQLVDCDHECD E SCDSGCNGGLM +AFEY+LK+GG+E EKDYPYTGTD G CKFD+
Sbjct: 192 QQLVDCDHECDSSEPDSCDSGCNGGLMTTAFEYLLKSGGLELEKDYPYTGTDRGRCKFDE 251
Query: 247 S 247
S
Sbjct: 252 S 252
>gi|255088003|ref|XP_002505924.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226521195|gb|ACO67182.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 291
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 164/289 (56%), Positives = 205/289 (70%), Gaps = 10/289 (3%)
Query: 84 AKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRR----LRLPADAQKAPILPTNDLP 138
A RQ D +AVHGVT+FSDLTP+EF FLG + + P P +DLP
Sbjct: 4 AAERQAQDRGSAVHGVTQFSDLTPTEFASTFLGTKLANEDVAAIRSGMTTLPDYPAHDLP 63
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
+FDWR+ GAVT VK+QGACGSCW+FSATGA+EGA+FL TGELVSLSEQQLVDCDH CDP
Sbjct: 64 LEFDWRERGAVTPVKNQGACGSCWTFSATGAVEGANFLKTGELVSLSEQQLVDCDHTCDP 123
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
+CD GCNGGL +A Y+ K G++ E +YPY G DG AA+VS+F++
Sbjct: 124 SAPRNCDYGCNGGLPLNAMRYVQKH-GLDTESNYPYKGVDGKCASARHGPAAASVSSFNL 182
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFA 317
+S++E Q+AA L+KHGPL++GI+A WMQTY+GGV+CP+IC K LDHGVLIVGYG +G A
Sbjct: 183 VSTNETQIAAALLKHGPLSIGIDAAWMQTYVGGVACPWICNKAGLDHGVLIVGYGVNGTA 242
Query: 318 PIR--FKEKPYWIIKNSWGENWG-ENGYYKICMGRNVCGVDSMVSSVAA 363
P R + + YWI+KNSWG NWG E GYY IC R CG+++MV + A
Sbjct: 243 PARPWHRRQDYWIVKNSWGPNWGVEGGYYHICKDRAACGLNTMVVAADA 291
>gi|296085959|emb|CBI31400.3| unnamed protein product [Vitis vinifera]
Length = 257
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 143/237 (60%), Positives = 188/237 (79%), Gaps = 2/237 (0%)
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A+ A L + LP FDWR+ GAVT VK QG CGSCW+FS TGA+EGAHF+ST +L++LS
Sbjct: 6 AETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLLTLS 65
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDH CD + +CDSGC GGLM +A++Y+++AGG+E E YPYTG G CKF
Sbjct: 66 EQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEEESSYPYTGKH-GECKFK 124
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDH 304
++A V NF+ + +E+Q+AANLV HGPLAVG+NA++MQTYIGGVSCP IC K +++H
Sbjct: 125 PDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTYIGGVSCPLICPKRWINH 184
Query: 305 GVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
GVL+VGYG+ G++ +RF KPYWIIKNSWG+ WGE+GYY++C G +CG+++MVS+V
Sbjct: 185 GVLLVGYGAKGYSILRFGYKPYWIIKNSWGKRWGEHGYYRLCRGHGMCGMNTMVSAV 241
>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
Length = 343
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 168/365 (46%), Positives = 233/365 (63%), Gaps = 33/365 (9%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L+ +L+ LL V+ + + D IRQV +D + +D E HF F KF K Y
Sbjct: 5 LAIILVGLLILVVCCSSSNRLDIGKIRQV--TDNLEVKD----VEGHFKHFMQKFGKVYG 58
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T EE+ +R +VF+ANL + DPTA+HG+T F+DLTP E R FLG R+
Sbjct: 59 TTEEYVHRLKVFQANLAHVMSLKKQDPTAIHGITSFADLTPEELSR-FLGF-RKAYSNRV 116
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+AP+LPT++LP FDWR+HGAVT VK QG CGSCW+FS TG +EGA+FL TG+L+SLS
Sbjct: 117 VNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANFLKTGKLISLS 176
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD------G 239
E+QL+DCD++ D+GC GG M SA+EY+ KA G+E E+DYPY
Sbjct: 177 EEQLIDCDYK---------DNGCEGGDMLSAYEYV-KARGLEAEEDYPYEELGYRHKPVR 226
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG 299
G C++ SK+ A ++N+S +S DEDQ+AANLVK+GPL++ + + TY GGV+CP IC
Sbjct: 227 GPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYEGGVACPRICP 286
Query: 300 KYLDHGVLIVGYG-SSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
++HGVL+VGYG +G YW KN+W + +GENGY+++C G VC ++S V
Sbjct: 287 GEINHGVLLVGYGVENGLR--------YWTFKNTWTDEFGENGYFRLCRGVGVCDMNSEV 338
Query: 359 SSVAA 363
+V+
Sbjct: 339 GTVST 343
>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
Length = 343
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 168/365 (46%), Positives = 232/365 (63%), Gaps = 33/365 (9%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L+ +L+ LL V+ + + D IRQV +D + +D E HF F KF K Y
Sbjct: 5 LAIILVGLLILVICCSSSNRLDIGKIRQV--TDNLEVDD----VEGHFKHFMQKFGKVYG 58
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T EE+ +R +VF+ANL + DPTA+HG+T F+DLTP E R FLG R+
Sbjct: 59 TTEEYVHRLKVFQANLVHVMSLKKQDPTAIHGITSFADLTPEELSR-FLGF-RKAYSNRV 116
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+AP+LPT++LP FDWR+HGAVT VK QG CGSCW+FS TG +EGA+FL TG+L+SLS
Sbjct: 117 VNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANFLKTGKLISLS 176
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD------G 239
E+QL+DCD++ D+GC GG M SA+EY+ KA G+E ++DYPY
Sbjct: 177 EEQLIDCDYK---------DNGCEGGDMLSAYEYV-KARGLEADEDYPYEELGYRHKPVR 226
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG 299
G C++ SK+ A ++N+S +S DEDQ+AANLVK+GPL++ + + TY GGV+CP IC
Sbjct: 227 GPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYEGGVACPRICP 286
Query: 300 KYLDHGVLIVGYG-SSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
++HGVL+VGYG +G YW KNSW + +GENGY+++C G VC + S V
Sbjct: 287 GEINHGVLLVGYGVENGLR--------YWTFKNSWTDEFGENGYFRLCRGVGVCDMTSEV 338
Query: 359 SSVAA 363
+V+
Sbjct: 339 GTVST 343
>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
Length = 347
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 159/319 (49%), Positives = 216/319 (67%), Gaps = 18/319 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F F K++K Y T EEH+ R+++FKAN+ +++ + G+TKFSDLTP EF+R
Sbjct: 33 FIKFSRKYAKVYGT-EEHNNRYQIFKANVEKSRYYNHVGKRENFGITKFSDLTPEEFKRM 91
Query: 113 FLGLNRRLRLPADAQKAPILPTNDL---------PTDFDWRDHGAVTGVKDQGACGSCWS 163
FL + P +A+K P + + PT FDWR HGAVT VK+QGACGSCW+
Sbjct: 92 FL---MKTYTPEEAKKILAAPQHAVLSEKEVQTAPTSFDWRQHGAVTRVKNQGACGSCWT 148
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYILK 222
FS TG +EG + G+LVSLSEQQLVDCDH C + +CDSGCNGGLM SAF+Y++K
Sbjct: 149 FSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQYVIK 208
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
GG++ E YPY G D +C+F+KS +AA +S+++ ISSDE+QMAA L +GP+++ INA
Sbjct: 209 NGGLDTEDSYPYEGVD-DTCRFNKSNVAATISSWTSISSDENQMAAWLAANGPISIAINA 267
Query: 283 VWMQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
W+Q Y G+S P+ C + LDHGVLIVGYG E+ YWI+KNSWG +WGE+G
Sbjct: 268 EWLQYYTSGISDPWFCNPQDLDHGVLIVGYGVG--KSWLGSEENYWIVKNSWGSDWGEDG 325
Query: 342 YYKICMGRNVCGVDSMVSS 360
Y++I G+ CG++S+ SS
Sbjct: 326 YFRIIRGKGKCGLNSVPSS 344
>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
Length = 350
Score = 305 bits (781), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 155/319 (48%), Positives = 212/319 (66%), Gaps = 18/319 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F F K +K Y E+H R+++FK+N+ +A+ + GV+KF DLTP EF+R
Sbjct: 36 FVKFSKKHAKLYGA-EDHGKRYQIFKSNVEKARYYNHVGKRETFGVSKFMDLTPEEFKRM 94
Query: 113 FLGLNRRLRLPADAQKAPILP---------TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
FL + P +A+K P D PT +DWR GAVT VK+QGACGSCW+
Sbjct: 95 FL---MKTYTPEEARKILAAPKEAVVTAQQVKDTPTSWDWRQKGAVTPVKNQGACGSCWT 151
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE-SGSCDSGCNGGLMNSAFEYILK 222
FS TG +EG H + TG+LVSLSEQQLVDCDH C + +CD+GCNGGLM SAF+Y++K
Sbjct: 152 FSTTGNVEGIHQIKTGKLVSLSEQQLVDCDHNCVTYQGQQACDAGCNGGLMWSAFQYVIK 211
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
GG+ E YPY G D +C+F+KS +A +++++ I SDE +MAA L +GP+++ INA
Sbjct: 212 TGGLVTEDSYPYEGVD-DTCRFNKSNVAVTINSWTSIPSDEGKMAAWLAANGPISIAINA 270
Query: 283 VWMQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
W+QTY G+S P+ C + LDHGVLIVG+G +G + KE YWIIKNSWG +WGE+G
Sbjct: 271 EWLQTYTSGISNPWFCNPQDLDHGVLIVGFG-TGSNWLGEKED-YWIIKNSWGADWGESG 328
Query: 342 YYKICMGRNVCGVDSMVSS 360
Y++I G+ CG++S+ SS
Sbjct: 329 YFRIVRGKGKCGLNSVPSS 347
>gi|2253415|gb|AAB62937.1| stress-induced cysteine proteinase [Lavatera thuringiaca]
Length = 175
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 137/174 (78%), Positives = 160/174 (91%)
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDP++ G+C++GC+GGLM SAFEY LKAGG+ERE++YPYTG D G CKFDK+KIAA+VS
Sbjct: 1 ECDPQQYGACNAGCSGGLMTSAFEYTLKAGGLEREEEYPYTGIDRGGCKFDKTKIAASVS 60
Query: 255 NFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSS 314
NFSVIS DEDQ+AAN+VKHGPLAVGINA +MQTYIGGVSCPYIC + LDHGVL+VGYG++
Sbjct: 61 NFSVISVDEDQIAANMVKHGPLAVGINAAFMQTYIGGVSCPYICFRSLDHGVLLVGYGAA 120
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
G+AP+RFKEKP+WIIKNSWG NWGE+GYYKIC GRNVCGVDSMVSSVAA+ T S
Sbjct: 121 GYAPVRFKEKPFWIIKNSWGANWGEDGYYKICRGRNVCGVDSMVSSVAALQTKS 174
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 156/322 (48%), Positives = 203/322 (63%), Gaps = 17/322 (5%)
Query: 45 HLLNAEHHFSLFKSKFSKTYAT-QEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
H L AEH F F S + Y + + RF +FK N+R+ + TA +GVT+F+
Sbjct: 2363 HHLQAEHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERGTATYGVTRFA 2422
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
DLT EF + +G+ LR P Q + ++P P FDWRDHGAVTGVKDQG+CGSC
Sbjct: 2423 DLTYEEFSTKHMGMKASLRDPNQVQFRKAVIPNVTAPDSFDWRDHGAVTGVKDQGSCGSC 2482
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG +EG + TG+LVSLSEQ+LVDCD D GCNGGL ++A+ I
Sbjct: 2483 WAFSVTGNIEGQWKMKTGDLVSLSEQELVDCD---------KLDQGCNGGLPDNAYRAIE 2533
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ GG+E E DYPY G+D C F+K+ +S I+S+E MA LVKHGP+++GIN
Sbjct: 2534 QLGGLESEDDYPYEGSD-DKCSFNKTLARVQISGAVNITSNETDMAKWLVKHGPISIGIN 2592
Query: 282 AVWMQTYIGGVSCPY--ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y+GG+S P+ +C LDHGVLIVGYG+ + P+ K PYWIIKNSWG +WG
Sbjct: 2593 ANAMQFYMGGISHPWRMLCNPSNLDHGVLIVGYGAKDY-PLFHKHLPYWIIKNSWGTSWG 2651
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GYY++ G CGV+ M SS
Sbjct: 2652 EQGYYRVYRGDGTCGVNQMASS 2673
>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
Length = 356
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 163/373 (43%), Positives = 228/373 (61%), Gaps = 33/373 (8%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M +LIL ++LL+ S +LA A +A+ +SE L F+ F+ K
Sbjct: 1 MNKLIL--VVLLVASFILAIEAAKGPFNAL---------PESEMQQL-----FTQFRRKH 44
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG----- 115
K Y T++ D R+++FK N+ RA+ L GVT+FSDLTP EF+ FL
Sbjct: 45 VKLYGTKQVQDRRYQIFKQNVERARFENYLTERDNMGVTRFSDLTPDEFKSMFLMKSYTP 104
Query: 116 ------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
L+ + PA+A K + +D P +FDWR+H AVT VKDQG CGSCW+FS TG
Sbjct: 105 KQARELLSGMRQYPANA-KLTMKQVSDAPKEFDWREHNAVTPVKDQGNCGSCWTFSTTGN 163
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDP-EESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+EG + TG+L+SLSEQQLVDCDH C E +C++GCNGGLM S+FE+I+K GG+
Sbjct: 164 VEGMYAAKTGKLISLSEQQLVDCDHNCVVWEGEKTCNAGCNGGLMWSSFEHIIKTGGLVT 223
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTY 288
E+ YPY D C+F+ S +SN++ +SS+ED+MAA L +GP+A+ INA ++Q Y
Sbjct: 224 EESYPYEAVD-NRCRFNVSNAVVKISNWTFVSSNEDEMAAWLANNGPIAIAINADYLQYY 282
Query: 289 IGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G+ P C + L+HGVLIVGYG A K + YWI+KNSW +WGE GY ++
Sbjct: 283 RKGILNPSRCDPEELNHGVLIVGYGEEKAA--NGKVEKYWIVKNSWSASWGEKGYVRVLR 340
Query: 348 GRNVCGVDSMVSS 360
G+ VCG++++ SS
Sbjct: 341 GKGVCGLNAVPSS 353
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 203/322 (63%), Gaps = 16/322 (4%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLT 105
L E F F K+ K Y +EE + RF++FK NL + Q + T +GVT+F+DLT
Sbjct: 725 LKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVTQFTDLT 784
Query: 106 PSEFRRQFLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+EF+ + LGL L+ D +P +LP+D+DWR H VT VKDQG+CGSCW+F
Sbjct: 785 KAEFKARHLGLKPTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG + + GEL+SLSEQ+LVDCD DSGCNGGL ++A+ I + G
Sbjct: 845 SVTGNIEGQYAIKHGELLSLSEQELVDCD---------KLDSGCNGGLPDTAYRAIEELG 895
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E E DYPY D C F+K+K+ + + I+S+E QMA LVK+GP+++GINA
Sbjct: 896 GLELESDYPYDAED-EKCHFNKNKVKVNIVSGLNITSNETQMAQWLVKNGPMSIGINANA 954
Query: 285 MQTYIGGVSCP--YICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
MQ Y+GGVS P ++C LDHGVLIVGYG F PI K PYWIIKNSWG WGE G
Sbjct: 955 MQFYMGGVSHPFKFLCSPDSLDHGVLIVGYGVK-FYPIFKKTMPYWIIKNSWGPRWGEQG 1013
Query: 342 YYKICMGRNVCGVDSMVSSVAA 363
YY++ G CGV+ MV+S
Sbjct: 1014 YYRVYRGDGTCGVNKMVTSAVV 1035
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 157/322 (48%), Positives = 202/322 (62%), Gaps = 22/322 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSE 108
E F F KF KTY + +E RF++FK NL+ + Q + TA +GVT F+DLTP E
Sbjct: 576 ETLFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFADLTPKE 635
Query: 109 FRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
F+ ++LGL L+ + P+ +P LP FDWRDH VT VKDQG CGSCW+F
Sbjct: 636 FKARYLGLRPELK---HENEIPLPEAEIPDVSLPLKFDWRDHSVVTPVKDQGQCGSCWAF 692
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG + + +L+SLSEQ+LVDCD S D GCNGG M +A++ I + G
Sbjct: 693 SVTGNVEGQYAIKHNQLLSLSEQELVDCD---------SLDEGCNGGDMENAYKAIERLG 743
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E E DYPY D C F ++K V + I+SDE +MA LVK+GP++VGINA
Sbjct: 744 GLELESDYPYDAKD-EKCHFLQNKAKVQVVSAVNITSDEKRMAQWLVKNGPISVGINANA 802
Query: 285 MQTYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
MQ Y GGVS P ++C K LDHGVLIVGYG S + P+ KE PYWIIKNSWG WGE G
Sbjct: 803 MQFYFGGVSHPLNFLCNPKNLDHGVLIVGYGISKY-PLFHKELPYWIIKNSWGPRWGERG 861
Query: 342 YYKICMGRNVCGVDSMVSSVAA 363
YY++ G CGV++M +S
Sbjct: 862 YYRVYRGDGTCGVNTMATSAVV 883
>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
Length = 715
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 153/315 (48%), Positives = 206/315 (65%), Gaps = 22/315 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F++ F + Y +++E RF++F N+R+AK+ Q ++ TAV+GVTKF+D++ SEF+
Sbjct: 418 FQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFK- 476
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
Q++G +KA I N LP FDWR+HGAVT VK+QG+CGSCW+FS TG +E
Sbjct: 477 QYVGKVWDQNANKGMKKAKIPEMNSLPNSFDWREHGAVTEVKNQGSCGSCWAFSTTGNIE 536
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G +S +LVSLSEQ+LVDCD D GCNGGL + A++ I++ GG+E E D
Sbjct: 537 GQWAISKKKLVSLSEQELVDCD---------KVDEGCNGGLPSQAYKEIIRLGGLETETD 587
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGG 291
Y Y G + C DKSKI ++ ISS+E +MAA LVK+GP+++GINA MQ Y+GG
Sbjct: 588 YKYRGHN-EKCSMDKSKIRVKINGSVSISSNETEMAAWLVKNGPISIGINAFAMQFYMGG 646
Query: 292 VSCPY--ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
+S P+ C K LDHGVLIVGYG G KPYWIIKNSWG +WGE GYY + G
Sbjct: 647 ISHPWKIFCNPKELDHGVLIVGYGVKG-------SKPYWIIKNSWGPDWGEKGYYLVYRG 699
Query: 349 RNVCGVDSMVSSVAA 363
VCG+++M +S
Sbjct: 700 AGVCGLNTMCTSAVV 714
>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
rotundata]
Length = 884
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 153/331 (46%), Positives = 209/331 (63%), Gaps = 21/331 (6%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHG 97
E +D LL F F ++KTY + +E R++VF+ NL+ ++ R+ TAV+G
Sbjct: 570 AEDYKDELL-----FEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQGTAVYG 624
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
VT F+DLTP EF+ ++LGL L D + ++P DLP FDWR++ AVT VKDQG
Sbjct: 625 VTMFADLTPEEFKTKYLGLKTNLNQENDIPLQEAVIPDIDLPPKFDWREYNAVTPVKDQG 684
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FSA G +EG + + +L+SLSEQ+LVDCD+ D GC GG M +A
Sbjct: 685 QCGSCWAFSAIGNIEGQYAIKHKKLLSLSEQELVDCDN---------LDDGCGGGYMINA 735
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
++ + K GG+E E DYPY + C F K+K V++ I++DE +MA LVK+GP+
Sbjct: 736 YKTVEKLGGLELETDYPYDARN-EKCHFLKNKAKVQVASALNITNDEKKMAQWLVKNGPI 794
Query: 277 AVGINAVWMQTYIGGVSCP--YICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
+VGINA MQ Y GGVS P ++C LDHGVLIVGY +S + P+ K+ PYWIIKNSW
Sbjct: 795 SVGINANAMQFYFGGVSHPFKFLCDPANLDHGVLIVGYATSTY-PLFKKKLPYWIIKNSW 853
Query: 334 GENWGENGYYKICMGRNVCGVDSMVSSVAAI 364
G WGE GYY++ G CGV++M SS +
Sbjct: 854 GPKWGEQGYYRVYRGDGTCGVNAMASSAIVV 884
>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
Length = 343
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 195/322 (60%), Gaps = 12/322 (3%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFS 102
L + F F+ KF+K Y + EE+ RF +FK+NL + + L+ GV KF+
Sbjct: 23 LEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACG 159
DL+ EF+ +L N+ D A L N +PT FDWR GAVT VK+QG CG
Sbjct: 82 DLSSDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG 140
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFE 218
SCWSFS TG +EG HF+S +LVSLSEQ LVDCDHEC + E +CD GCNGGL +A+
Sbjct: 141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYN 200
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
YI+K GG++ E YPYT G C F+ + I A +SNF++I +E MA +V GPLA+
Sbjct: 201 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAI 260
Query: 279 GINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
+AV Q YIGGV LDHG+LIVGY + I K PYWI+KNSWG +WG
Sbjct: 261 AADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWG 318
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GY + G+N CGV + VS+
Sbjct: 319 EQGYIYLRRGKNTCGVSNFVST 340
>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
Length = 343
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 150/319 (47%), Positives = 194/319 (60%), Gaps = 12/319 (3%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLT 105
+ F F+ KF+K Y + EE+ RF +FK+NL + + L+ GV KF+DL+
Sbjct: 26 QSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
EF+ +L N+ D A L N +PT FDWR GAVT VK+QG CGSCW
Sbjct: 85 SDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCW 143
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYIL 221
SFS TG +EG HF+S +LVSLSEQ LVDCDHEC + E +CD GCNGGL +A+ YI+
Sbjct: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
K GG++ E YPYT G C F+ + I A +SNF++I +E MA +V GPLA+ +
Sbjct: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAAD 263
Query: 282 AVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
AV Q YIGGV LDHG+LIVGY + I K PYWI+KNSWG +WGE G
Sbjct: 264 AVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWGEQG 321
Query: 342 YYKICMGRNVCGVDSMVSS 360
Y + G+N CGV + VS+
Sbjct: 322 YIYLRRGKNTCGVSNFVST 340
>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
Length = 353
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 208/320 (65%), Gaps = 17/320 (5%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
HF F KF + Y EE++YR +VF+ N+ ++R + + +G+TKFSDLT EFR+
Sbjct: 36 HFLDFTRKFQRFYKGPEEYEYRLKVFRENIETSRRMNIREGNNNYGITKFSDLTSDEFRK 95
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL---------PTDFDWRDHGAVTGVKDQGACGSCW 162
+L + P + QK + +N + P +DWR+HGA+TGVKDQG CGSCW
Sbjct: 96 FYL---MEKKTPKEIQKMMRMDSNKMVSNSYAKPAPDHYDWRNHGAITGVKDQGQCGSCW 152
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP-EESGSCDSGCNGGLMNSAFEYIL 221
+FSA G++EG++ + +LVS SEQQLVDCD+ C E SCD GCNGGL SA++Y++
Sbjct: 153 AFSAIGSIEGSYAIKHKQLVSFSEQQLVDCDNNCVTFENQQSCDDGCNGGLQWSAYQYLM 212
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
KAGGV EKDYPY + C+ + A +SN++++S++E +MA L ++GP+AV +N
Sbjct: 213 KAGGVVTEKDYPYYA-ERYKCEVKPANFVAKLSNWTMLSTNETEMANWLAENGPIAVALN 271
Query: 282 AVWMQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
A ++Q Y G++ P C LDHGVLIVGYG F K +PYWI+KNSWG ++GE+
Sbjct: 272 ADFLQNYNNGIADPAWCDPTQLDHGVLIVGYGLETF--WFGKPQPYWIVKNSWGYDFGED 329
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY++I G CG++++ S+
Sbjct: 330 GYFRIVKGVGRCGINTVPSA 349
>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
Length = 465
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 154/309 (49%), Positives = 193/309 (62%), Gaps = 13/309 (4%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR---RAKRRQLLDPTAVH-GVTKFSDLT 105
E F F+ K++K Y T E+ RF FK+NL+ R ++V GV +F+DL+
Sbjct: 25 ETQFRQFQIKYNKQY-TSSEYAERFATFKSNLKVIDEKNRDAASRKSSVRFGVNEFADLS 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
SEFR +L + +R P +A A LP DLPT FDWR GAVTGVK+QG CGSCWSFS
Sbjct: 84 QSEFRATYLNSVQAVRDP-NAAVAADLPVEDLPTAFDWRTKGAVTGVKNQGQCGSCWSFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYILKAG 224
TG +EG FL+ L LSEQ LVDCDHEC + CD GCNGGL +A+ YI+K G
Sbjct: 143 TTGNVEGQWFLAGNTLTGLSEQNLVDCDHECMEYLGDNVCDQGCNGGLQPNAYTYIIKNG 202
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G++ E YPY G D G+C F + I A +SN++ +SS+E QMAA LV +GPLA+ +AV
Sbjct: 203 GIDTEASYPYQGVD-GTCSFKAANIGAKISNWTYVSSNETQMAAYLVANGPLAIAADAVE 261
Query: 285 MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Q Y+GGV P CG LDHG+LIVGY + I K+K YWI+KNSWG WGE GY
Sbjct: 262 WQFYLGGVFDVP--CGNTLDHGILIVGYSAEN--TIFHKDKAYWIVKNSWGATWGEQGYI 317
Query: 344 KICMGRNVC 352
I G C
Sbjct: 318 YISRGNGEC 326
>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
Length = 586
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 199/322 (61%), Gaps = 15/322 (4%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
D L + F F +K Y + EE RFR+F AN+++ K Q + +A++G T+F+
Sbjct: 271 DDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFA 330
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DLT +EF++++LGL+ + A I + +P +FDWR+H VT VK+QGACGSCW
Sbjct: 331 DLTKNEFKKKYLGLDSSMTSKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCW 390
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA +EG + L + EL+SLSEQ+L+DCD+ D+GC GGLM AFE +
Sbjct: 391 AFSAIANIEGQYALKSKELLSLSEQELIDCDN---------LDNGCGGGLMTQAFEAVEN 441
Query: 223 AGGVEREKDYPYTG-TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
GG+E E DYPY G D C+ KS + ++S +S+DE+ +A LVKHGPL+VG+N
Sbjct: 442 LGGLETESDYPYEGHADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVN 501
Query: 282 AVWMQTYIGGVSCPY--ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y+GGVS P +C K LDHGV IVGYG K PYW+IKNSWG WG
Sbjct: 502 ANAMQFYMGGVSHPIHALCSPKSLDHGVAIVGYGVHR-TKYTHKNLPYWLIKNSWGPGWG 560
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GYY + G CGV+ MVSS
Sbjct: 561 EKGYYLLYRGDGSCGVNQMVSS 582
>gi|118483347|gb|ABK93575.1| unknown [Populus trichocarpa]
Length = 157
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 127/156 (81%), Positives = 146/156 (93%)
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
MN+AFEY LKAGG+EREKDYPYTG D G+CKF+KSK+AA+VSNFSV+S DEDQ+AANLVK
Sbjct: 1 MNNAFEYALKAGGLEREKDYPYTGNDRGACKFEKSKVAASVSNFSVVSLDEDQIAANLVK 60
Query: 273 HGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNS 332
HGPL+V INAV+MQTYIGGVSCPYIC K+ DHGVL+VGYG++G+APIRFKEKP+WIIKNS
Sbjct: 61 HGPLSVAINAVFMQTYIGGVSCPYICSKHQDHGVLLVGYGAAGYAPIRFKEKPFWIIKNS 120
Query: 333 WGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
WGENWGENGYYKIC RN+CGVDSMVS+VAAIH T+
Sbjct: 121 WGENWGENGYYKICRARNICGVDSMVSTVAAIHATA 156
>gi|330792958|ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
Length = 346
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 202/321 (62%), Gaps = 17/321 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLT 105
+ F F+ K++K Y++ E + +F FKANL + ++ +L GV +F+DL+
Sbjct: 26 QTQFVAFQQKYNKVYSSNE-YSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNEFADLS 84
Query: 106 PSEFRRQFLGLNRRLRLP-ADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSC 161
+EFR+ +L N ++ P A AP+L L PT FDWR GAVTGVK+QG CGSC
Sbjct: 85 AAEFRKYYL--NAQVAKPDASLPMAPLLTEEVLETIPTAFDWRTKGAVTGVKNQGQCGSC 142
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYI 220
WSFS TG +EG +L+ LV LSEQ LVDCDH+C + + SCD+GC+GGL +A+ Y+
Sbjct: 143 WSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQPNAYRYV 202
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
++ GG++ E YPY G SCKF +AA +SNF++I +E QMA L HGPLA+
Sbjct: 203 IENGGLDSENSYPYLAVTGDSCKFKSGNVAAKISNFTMIPQNETQMAGYLATHGPLAIAA 262
Query: 281 NAVWMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
+A Q YIGGV P CG+ LDHG+LIVG+ S I KPYWI+KNSWG +WGE
Sbjct: 263 DAAEWQFYIGGVFDLP--CGQSLDHGILIVGF--SAEKNIFGHLKPYWIVKNSWGASWGE 318
Query: 340 NGYYKICMGRNVCGVDSMVSS 360
GY + G+N+CGV VS+
Sbjct: 319 QGYLYLGKGKNLCGVSDFVST 339
>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
Length = 586
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 199/322 (61%), Gaps = 15/322 (4%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
D L + F F +K Y + EE RFR+F AN+++ K Q + +A++G T+F+
Sbjct: 271 DDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFA 330
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DLT +EF++++LGL+ + A I + +P +FDWR+H VT VK+QGACGSCW
Sbjct: 331 DLTKNEFKKKYLGLDSSMTSKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCW 390
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA +EG + L + EL+SLSEQ+L+DCD+ D+GC GGLM AFE +
Sbjct: 391 AFSAIANIEGQYALKSKELLSLSEQELIDCDN---------LDNGCGGGLMTQAFEAVEN 441
Query: 223 AGGVEREKDYPYTG-TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
GG+E E DYPY G D C+ KS + ++S +S+DE+ +A LVKHGPL+VG+N
Sbjct: 442 LGGLETESDYPYEGHADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVN 501
Query: 282 AVWMQTYIGGVSCPY--ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y+GGVS P +C K LDHGV IVGYG + P P+W IKNSWG+ WG
Sbjct: 502 ANAMQFYMGGVSHPIHALCSPKSLDHGVAIVGYGVHKY-PYLNATLPFWTIKNSWGDKWG 560
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
GYY + G CGV+ MVSS
Sbjct: 561 MQGYYLLYRGDGSCGVNQMVSS 582
>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
Length = 887
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 145/328 (44%), Positives = 205/328 (62%), Gaps = 26/328 (7%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL------RRAKRRQLLDPTAVHGVTK 100
+ +E F+ F +++TY+T EE + R R+F+ NL R+ +R TA + V
Sbjct: 576 VRSEQLFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERG-----TAHYDVNM 630
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F+D++P EFR ++LGL LR D + +P +LP FDWR+ VT VKDQG CG
Sbjct: 631 FADMSPEEFRSRYLGLRPDLRSENDIPLREAEIPDVELPPKFDWREKSVVTPVKDQGMCG 690
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG +EG + + G L+SLSEQ+LVDCD D GCNGGL ++A+
Sbjct: 691 SCWAFSVTGNIEGQYAIKHGRLLSLSEQELVDCD---------DLDEGCNGGLPDNAYRA 741
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I K GG+E E DYPY + C F K+ +++ I+S+E QMA LV++GP+++G
Sbjct: 742 IEKLGGLELESDYPYEA-ENEKCHFKKNLAKVQLASAVNITSNETQMAQWLVQNGPISIG 800
Query: 280 INAVWMQTYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
INA MQ Y+GGVS P ++C K LDHGVLIVGYG+S + P+ K+ PYW IKNSWG+
Sbjct: 801 INANAMQFYVGGVSHPFKFLCNPKNLDHGVLIVGYGTSDY-PLFHKKLPYWTIKNSWGKR 859
Query: 337 WGENGYYKICMGRNVCGVDSMVSSVAAI 364
WGE GYY++ G CG++++ +S +
Sbjct: 860 WGEQGYYRVYRGDGTCGLNTLATSAVVV 887
>gi|144228217|gb|ABO93617.1| papain-like cysteine proteinase [Vitis vinifera]
Length = 161
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 128/161 (79%), Positives = 147/161 (91%)
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
QLVDCDHECDPEE G+CD GCNGGLM SAFEYILKAGGVERE+ YPY G+D GSCKF+KS
Sbjct: 1 QLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVEREETYPYIGSDRGSCKFNKS 60
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVL 307
+I A+VSNFSV+S DEDQ+AAN+VK+GPLAVGINAV+MQTY+ GVSCPYIC + LDHGV+
Sbjct: 61 QIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQTYMKGVSCPYICSRNLDHGVV 120
Query: 308 IVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
+VGYGS+G+APIRFKEKPYWIIKNSWGE+WGE+GY K C G
Sbjct: 121 LVGYGSAGYAPIRFKEKPYWIIKNSWGESWGEDGYDKNCRG 161
>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
pulchellus]
Length = 475
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 152/317 (47%), Positives = 208/317 (65%), Gaps = 17/317 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FS+F ++KTY +EEH+ RF +FK NL+R A +L + TA +G+T+FSDL+PSEF R
Sbjct: 166 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 225
Query: 112 QFLGLNRRL-RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+LGL + L A+ + + P N+ LP FDWR GAVT VK+QG CGSCW+FS TG
Sbjct: 226 HYLGLKKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSVTGN 285
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG FLS +L+SLSEQ+LVDCDH D GC GG M A + +++ GG+E E
Sbjct: 286 VEGQWFLSRSKLLSLSEQELVDCDH---------GDHGCKGGYMGQAMKAVIEMGGLETE 336
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
+YPY G D G+C+F+K++ A V +F + +E ++A L+KHGP+++GINA MQ Y
Sbjct: 337 SEYPYKGVD-GTCEFNKTESKARVQSFVGLPQNETELAYWLMKHGPVSIGINANAMQFYF 395
Query: 290 GGVSCP--YICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
GG+S P ++C LDHGVL+VG+G + R K PYWI+KNSWG+ WGE GYY++
Sbjct: 396 GGISHPWKFLCSPTDLDHGVLLVGFGVDKRS-FRRKPVPYWIVKNSWGKYWGEKGYYRVY 454
Query: 347 MGRNVCGVDSMVSSVAA 363
G CGV+ M S
Sbjct: 455 RGDGTCGVNQMALSAVV 471
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/331 (43%), Positives = 212/331 (64%), Gaps = 25/331 (7%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTK 100
SED + AE F+ F + +++TY++ E + RF++F+ NL + R+ T ++GV
Sbjct: 461 SED--MKAERLFNNFMTTYNRTYSSLE-RNLRFKIFRENLNFIEELRETEQGTGIYGVNM 517
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQG 156
F+D++ EFR ++LGL L+ + P+ +P DLP+ FDWR G VT VK+QG
Sbjct: 518 FADMSQKEFRTRYLGLRPDLQ---SENEIPLPKAEIPDIDLPSSFDWRQKGVVTPVKNQG 574
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FS TG +EG + + G+L+SLSEQ+LVDCDH D GCNGGL ++A
Sbjct: 575 QCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDCDH---------LDEGCNGGLPDNA 625
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
+ I + GG+E E DYPY + C F ++ + +++ I+S+E Q+A LV++GP+
Sbjct: 626 YRAIEQLGGLELESDYPYEA-ENEKCHFKQNLVKVELASAVNITSNETQIAQWLVQNGPI 684
Query: 277 AVGINAVWMQTYIGGVSCPY--ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
A+GINA MQ Y+GGVS P +C L+HGVLIVGYG+S + P+ K PYWIIKNSW
Sbjct: 685 AIGINANAMQFYMGGVSHPLKILCNPNNLNHGVLIVGYGTSRY-PLFHKNLPYWIIKNSW 743
Query: 334 GENWGENGYYKICMGRNVCGVDSMVSSVAAI 364
G++WGE GYY++ G CG+++M SS +
Sbjct: 744 GKSWGEQGYYRVYRGDGTCGLNTMASSAVVV 774
>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
Length = 537
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 149/326 (45%), Positives = 200/326 (61%), Gaps = 18/326 (5%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQE-EHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
H + AE F F + + Y E RF +FK N+++ + T V+ VT+F+
Sbjct: 223 HHVQAEQLFFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERGTGVYAVTRFT 282
Query: 103 DLTPSEFRRQFLGLNRRLRLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
DLT EF+ ++LGLN L+ P ++A I + LP FDWR GAVT VKDQGACGS
Sbjct: 283 DLTYEEFKSKYLGLNPNLKKPNQIPMRQAEIPKVHQLPASFDWRPLGAVTEVKDQGACGS 342
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG L TG+L+SLSEQ+LVDCD D GC+GG M++A+ I
Sbjct: 343 CWAFSVTGNIEGQWKLKTGKLLSLSEQELVDCD---------KMDDGCDGGYMDNAYRAI 393
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
+ GG+E E++YPY D C F+KS +S ISS+E MA LV +GP+++GI
Sbjct: 394 EQLGGLETEEEYPYEAED-DKCSFNKSLSKVQISGAVNISSNETNMAKWLVHNGPISIGI 452
Query: 281 NAVWMQTYIGGVSCPY--ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y+GGVS P+ +C K +DHGVLIVGYG + P+ K+ PYW++KNSWG W
Sbjct: 453 NANAMQFYVGGVSHPWKALCNPKNIDHGVLIVGYGIKEY-PLFNKQLPYWVVKNSWGPGW 511
Query: 338 GENGYYKICMGRNVCGVDSMVSSVAA 363
GE GYY++ G CGV++M SS
Sbjct: 512 GEQGYYRVFRGDGTCGVNTMASSAVV 537
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 143/323 (44%), Positives = 202/323 (62%), Gaps = 16/323 (4%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLT 105
+ +E F F + +++TYAT+EE + R +F+ NL + R+ T +GV +F+D++
Sbjct: 721 MRSERLFENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQYGVNQFADVS 780
Query: 106 PSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF +LGL LR + + +P +LP FDWR GAVT VK+QG CGSCW+F
Sbjct: 781 TEEFHAFYLGLRPDLRTENNIPLRQAEIPDIELPNSFDWRQKGAVTPVKNQGMCGSCWAF 840
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG + + +L+SLSEQ+LVDCD D GCNGGL ++A+ I K G
Sbjct: 841 SVTGNVEGQYAIKHNKLLSLSEQELVDCD---------DLDEGCNGGLPDNAYRAIEKLG 891
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E E DYPY + C F K+ V + I+S+E Q+A LV +GP+++GINA
Sbjct: 892 GLELESDYPYEA-ENERCHFKKNMAKVQVGSAVNITSNETQIAQWLVANGPISIGINANA 950
Query: 285 MQTYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
MQ Y+GGVS P ++C K LDHGVLIVGYG+S + P+ K+ PYWI+KNSWG+ WGE G
Sbjct: 951 MQFYMGGVSHPFKFLCNPKNLDHGVLIVGYGTSNY-PLFHKKLPYWIVKNSWGDRWGEQG 1009
Query: 342 YYKICMGRNVCGVDSMVSSVAAI 364
YY++ G CG+++M SS +
Sbjct: 1010 YYRVYRGDGTCGLNTMASSAVVV 1032
>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
Length = 459
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 149/328 (45%), Positives = 200/328 (60%), Gaps = 33/328 (10%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPS 107
A + F F + K Y ++ + RFRVFK NL+ + Q + TAV+G+T+FSDLTP
Sbjct: 153 AWNQFVDFMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPE 212
Query: 108 EFRRQFLGL--------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
EF++ +L NR + L A+ + LP FDWRDHGAVT VK+QG CG
Sbjct: 213 EFKKIYLPYIWDEPIVPNRMVDLTAEG----VHLNETLPESFDWRDHGAVTDVKNQGFCG 268
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG +EG FL+ +LVSLSEQ+LVDCD D GC GGL + A++
Sbjct: 269 SCWAFSTTGNIEGQWFLAKKKLVSLSEQELVDCD---------KVDDGCEGGLPSQAYKE 319
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I++ GG+E E YPY G G C ++++ A +++ + DE+ M A LVK GP+++G
Sbjct: 320 IMRMGGLETESAYPYDGR-GEECHINRTEFAVYINDSVELPHDEESMKAWLVKKGPISIG 378
Query: 280 INAVWMQTYIGGVSCP--YICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
INA +Q Y G+S P + C Y L+HGVL+VGYGS K KPYWIIKNSWG
Sbjct: 379 INANPLQFYRHGISHPWKFFCEPYMLNHGVLLVGYGSE-------KNKPYWIIKNSWGPK 431
Query: 337 WGENGYYKICMGRNVCGVDSMVSSVAAI 364
WGENGYY++ G+NVCGV M +S +
Sbjct: 432 WGENGYYRLYRGKNVCGVHEMPTSAVVL 459
>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
Length = 881
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 155/319 (48%), Positives = 204/319 (63%), Gaps = 16/319 (5%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLT 105
+ E F F KF+KT+++ E RF++FK NL+ K Q + TA +GVT F+DLT
Sbjct: 570 IKYETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVTMFADLT 629
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
P EF+ ++LG L+ + A I ++ LP FDWRD+ AVT VKDQG CGSCW+F
Sbjct: 630 PKEFKTRYLGFRPELKQENEIPLAKIEVSDIFLPPKFDWRDYNAVTPVKDQGLCGSCWAF 689
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG + + +L+SLSEQ+L+DCD + D GCNGG M +A++ I K G
Sbjct: 690 SVTGNVEGQYAIKYKKLLSLSEQELLDCD---------TLDEGCNGGYMENAYKAIEKLG 740
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E E DYPY G + C F K V I+S+E +MA L+K+GP+++GINA
Sbjct: 741 GLELESDYPYDGRN-EKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANA 799
Query: 285 MQTYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
MQ YIGGVS P ++C K LDHGVLIVGYG S + P+ KE PYWIIKNSWG WGENG
Sbjct: 800 MQFYIGGVSHPFHFLCNPKDLDHGVLIVGYGISKY-PLFHKELPYWIIKNSWGSRWGENG 858
Query: 342 YYKICMGRNVCGVDSMVSS 360
YY++ G CGV++M SS
Sbjct: 859 YYRVYRGDGTCGVNAMASS 877
>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
Length = 352
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 148/331 (44%), Positives = 205/331 (61%), Gaps = 29/331 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLT 105
E F F++K++K Y+ EE+ +F FK+NL K+ + GV KF+DL+
Sbjct: 24 ESQFIAFQNKYNKIYSA-EEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLS 82
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILP--TNDL----PTDFDWRDHGA---------VT 150
EF++ +L ++ RL D P+LP ++D+ P FDWR+ G VT
Sbjct: 83 KEEFKKYYLS-SKEARLTDDL---PMLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVT 138
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCN 209
VK+QG CGSCWSFS TG +EG H+LSTG LV LSEQ LVDCDH C E C++GC+
Sbjct: 139 AVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCD 198
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGL +A+ YI+K GG++ E YPYT D G CKF+ +++ A +S+F+++ +E Q+A+
Sbjct: 199 GGLQPNAYNYIIKNGGIQTEATYPYTAVD-GECKFNSAQVGAKISSFTMVPQNETQIASY 257
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
L +GPLA+ +A Q Y+GGV + CG+ LDHG+LIVGYG+ I K PYWII
Sbjct: 258 LFNNGPLAIAADAEEWQFYMGGV-FDFPCGQTLDHGILIVGYGAQD--TIVGKNTPYWII 314
Query: 330 KNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
KNSWG +WGE GY K+ + CGV + VSS
Sbjct: 315 KNSWGADWGEAGYLKVERNTDKCGVANFVSS 345
>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 166/382 (43%), Positives = 225/382 (58%), Gaps = 39/382 (10%)
Query: 9 LLLLLLSSVLASAVAVNDDD------AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
L +L S LA D + + P D + SED +A F F + K
Sbjct: 3 LFSILAGSALAGVAEFLQDSYDHSKLSEFFKTTPEDFDVSED---DARKQFENFLLEHPK 59
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
Y+ QE H RF+ F NL+R K ++ +A +GVT+F+DL+ EFRR +LGL L+
Sbjct: 60 MYSEQESHS-RFQTFWENLKRIKFHNHIEQGSAKYGVTEFADLSDFEFRRHYLGLKPELK 118
Query: 122 LPA----------DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+P ++K T D FDW + GAVT VK+QG CGSCW+FS TG +E
Sbjct: 119 IPNRKKYERKSRNSSKKLKFAKTVD--ETFDWVEKGAVTEVKNQGMCGSCWAFSTTGNIE 176
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
GA F +TG+LVSLSEQ+LVDCD + DSGCNGGLM+ AFE +++ GG+E E+
Sbjct: 177 GAWFKATGDLVSLSEQELVDCDQK---------DSGCNGGLMDQAFEEVIRIGGLETEQQ 227
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGG 291
YPY G +C F+KS + +F I DE+++A L +HGPL++ INA MQ Y GG
Sbjct: 228 YPYDGVQ-ETCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGG 286
Query: 292 VSCP--YICGKY-LDHGVLIVGYGSSGFAPIRFKE-KPYWIIKNSWGENWGENGYYKICM 347
+S P ++C + LDHGVL+VGYG R + +PYW IKNSWG WGE+GYY++
Sbjct: 287 ISHPLSFLCSQDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWGPRWGEDGYYRVAR 346
Query: 348 GRNVCGVDSMVSS--VAAIHTT 367
G+ VCGV+ MVS+ V A +TT
Sbjct: 347 GKGVCGVNKMVSTSIVNAQNTT 368
>gi|118488886|gb|ABK96252.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 156
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 123/156 (78%), Positives = 144/156 (92%)
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
MNSAFEY LKAGG+ RE+DYPYTGTD G+CKFDK+K+AA V+NFSV+S DEDQ+AANLVK
Sbjct: 1 MNSAFEYTLKAGGLMREEDYPYTGTDRGACKFDKNKVAARVANFSVVSLDEDQIAANLVK 60
Query: 273 HGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNS 332
+GPLAV INAV+MQTYIGGVSCPYIC + LDHGVL+VGYGS+G++P+R KEKP+WIIKNS
Sbjct: 61 NGPLAVAINAVFMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYSPVRMKEKPFWIIKNS 120
Query: 333 WGENWGENGYYKICMGRNVCGVDSMVSSVAAIHTTS 368
WGE WGENG+YKIC GRNVCGVDSMVS+VAA+ T+S
Sbjct: 121 WGEKWGENGFYKICRGRNVCGVDSMVSTVAAVQTSS 156
>gi|222637029|gb|EEE67161.1| hypothetical protein OsJ_24244 [Oryza sativa Japonica Group]
Length = 309
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 149/282 (52%), Positives = 187/282 (66%), Gaps = 19/282 (6%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
IRQV +DG LL E F+ F + + Y+ EE+ R RVF ANL RA Q L
Sbjct: 29 IRQV--TDGGYWPPGLL-PEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQAL 85
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLGLN-------RRLRLPADAQKAPILPTNDLPTDFDW 143
DPTA HGVT FSDLT EF + GL RR +P+ A A + LP FDW
Sbjct: 86 DPTARHGVTPFSDLTREEFEARLTGLAADVGDDVRRRPMPS-AAPATEEEVSGLPASFDW 144
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
RD GAVT VK QGACGSCW+FS TGA+EGA+FL+TG L+ LSEQQLVDCDH CD E+
Sbjct: 145 RDRGAVTDVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTE 204
Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS--- 260
CDSGC GGLM +A+ Y++ +GG+ + YPYTG G+C+FD +++A V+NF+V++
Sbjct: 205 CDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ-GTCRFDANRVAVRVANFTVVAPPG 263
Query: 261 -SDED---QMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
+D D QM A LV+HGPLAVG+NA +MQTY+GGVSCP +C
Sbjct: 264 GNDGDGDAQMRAALVRHGPLAVGLNAAYMQTYVGGVSCPLVC 305
>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
Length = 471
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 148/326 (45%), Positives = 200/326 (61%), Gaps = 17/326 (5%)
Query: 40 EQSEDHLLN-AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
++S H LN EH F+ F+ KF + Y T E RFR+FK NL+ + + +A +G
Sbjct: 152 KKSNHHNLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYG 211
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
+T+F+D+T E++ Q GL +R A + +P DLP +FDWR+ GA++ VK+QG
Sbjct: 212 ITEFADMTSPEYK-QRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGN 270
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TG +EG H + TG L SEQ+L+DCD + DS CNGGL ++A+
Sbjct: 271 CGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCD---------TSDSACNGGLPDNAY 321
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E I K GG+E E DYPY C F+ +KI V + +E +A L+ +GP++
Sbjct: 322 EAIEKIGGLELESDYPYHARK-DQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPIS 380
Query: 278 VGINAVWMQTYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
+GINA MQ Y GGVS P +C K LDHGVLIVGYG S + P+ K PYWI+KNSWG
Sbjct: 381 IGINANAMQFYRGGVSHPPHILCSRKNLDHGVLIVGYGVSDY-PMFKKTLPYWIVKNSWG 439
Query: 335 ENWGENGYYKICMGRNVCGVDSMVSS 360
+ WGE GYY++ G N CGV M SS
Sbjct: 440 KKWGEQGYYRVYRGDNTCGVSEMSSS 465
>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
Length = 475
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 146/325 (44%), Positives = 203/325 (62%), Gaps = 22/325 (6%)
Query: 40 EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGV 98
E++ED + F F ++++TY++QE+ D R R+F NL+ A++ Q LD TA +GV
Sbjct: 165 EETED-FVELLGQFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGTAEYGV 223
Query: 99 TKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGAC 158
TKFSDLT EFR +L + + K +P P +DWR+HGAV+ VK+QG C
Sbjct: 224 TKFSDLTEEEFRTLYLNPLLSQQKLQRSMKPAAMPHGPAPPSWDWREHGAVSPVKNQGMC 283
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+FS TG +EG F+ TG+LVSLSEQ+LVDCD + D C GGL ++A+E
Sbjct: 284 GSCWAFSVTGNIEGQWFVKTGKLVSLSEQELVDCD---------TADQACGGGLPSNAYE 334
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
I K GGVE E DY YTG SC F K+ A +++ +S DE+++AA L ++GP++V
Sbjct: 335 AIEKLGGVETETDYSYTGKK-QSCDFTTDKVTAYINSSVELSKDENEIAAWLAENGPVSV 393
Query: 279 GINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
+NA MQ Y GVS P C ++ DH VL+VGYG + KP+W IKNSWGE
Sbjct: 394 ALNAFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGYGER-------QGKPFWAIKNSWGE 446
Query: 336 NWGENGYYKICMGRNVCGVDSMVSS 360
++GE GYY + G +CG+++M SS
Sbjct: 447 DYGEQGYYYLYRGSRLCGINTMCSS 471
>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
Length = 474
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 196/313 (62%), Gaps = 21/313 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F ++++TY++QEE D R RVF NL+ A++ Q LD TA +GVTKFSDLT EFR
Sbjct: 175 QFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVTKFSDLTEEEFR 234
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L + + K +P P +DWR+HGAV+ VK+QG CGSCW+FS TG +
Sbjct: 235 TLYLNPLLSQQNLQQSMKPAAMPRGPAPPSWDWREHGAVSPVKNQGMCGSCWAFSVTGNI 294
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG F TG+LVSLSEQ+LVDCD + D C GGL ++A+E I K GG+E E
Sbjct: 295 EGQWFAKTGKLVSLSEQELVDCD---------TVDQACGGGLPSNAYEAIEKLGGLETET 345
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY YTG SC F K+ A +++ +S+DE+++AA L ++GP++V +NA MQ Y
Sbjct: 346 DYSYTGKK-QSCDFTTDKVIAYINSSVELSTDENEIAAWLAENGPVSVALNAFAMQFYRK 404
Query: 291 GVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GVS P C ++ DH VL+VGYG + KP+W IKNSWGE++GE GYY +
Sbjct: 405 GVSHPLKIFCNPWMIDHAVLLVGYGER-------QGKPFWAIKNSWGEDYGEQGYYYLYR 457
Query: 348 GRNVCGVDSMVSS 360
G +CG++ M SS
Sbjct: 458 GSRLCGINKMCSS 470
>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
Length = 803
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 140/303 (46%), Positives = 193/303 (63%), Gaps = 15/303 (4%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
++Y T EE RFR+F+AN+++A Q + TA +GVT FSD++ EF++ +LGL +R
Sbjct: 509 RSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYGVTIFSDISSKEFKKHYLGLKKRT 568
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
Q+ +P LP ++DWR++ AVT VK+QG CGSCW+FS TG +EG + + TG
Sbjct: 569 PDIKFKQEMAQIPNITLPEEYDWRNYNAVTPVKNQGMCGSCWAFSVTGNIEGQYAIKTGN 628
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQ+LVDCD D GC GGL +A+ I + GG+E E DYPY+G D
Sbjct: 629 LVSLSEQELVDCD---------KYDDGCEGGLFETAYHAIEELGGLELESDYPYSGRD-N 678
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP--YIC 298
+C F+ S++ ++++ IS+DE MA LV +GP+++GINA MQ Y+GGVS P ++C
Sbjct: 679 TCHFNSSEVRVSITSSVNISNDETDMAKWLVANGPISIGINANAMQFYLGGVSHPLKFLC 738
Query: 299 G-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
K LDHGVLIVGYG + + PYW+IKNSW WG GYY + G CGV+
Sbjct: 739 DPKTLDHGVLIVGYGIHR-TWLLHRHLPYWLIKNSWSSYWGAKGYYMLYRGDGSCGVNQW 797
Query: 358 VSS 360
SS
Sbjct: 798 PSS 800
>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
Length = 471
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 199/328 (60%), Gaps = 17/328 (5%)
Query: 40 EQSEDHLLN-AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
++S H LN EH F+ F+ KF + Y T E RFR+FK NL+ + + +A +G
Sbjct: 152 KKSNHHNLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYG 211
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
+T+F+D+T E++ Q GL +R A + +P DLP +FDWR+ GA++ VK+QG
Sbjct: 212 ITEFADMTSPEYK-QRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGN 270
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TG +EG H + TG L SEQ+L+DCD + DS CNGGL ++A+
Sbjct: 271 CGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCD---------TSDSACNGGLPDNAY 321
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E I K GG+E E DYPY C F+ +KI V + +E +A L+ +GP++
Sbjct: 322 EAIEKIGGLELESDYPYHARK-DQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPIS 380
Query: 278 VGINAVWMQTYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
+GINA MQ Y GGVS P +C K LDHGVLIVGY S + P+ K PYWI+KNSWG
Sbjct: 381 IGINANAMQFYRGGVSHPPHILCSRKNLDHGVLIVGYRVSDY-PMFKKTLPYWIVKNSWG 439
Query: 335 ENWGENGYYKICMGRNVCGVDSMVSSVA 362
+ WGE GYY++ G N CGV M SS
Sbjct: 440 KKWGEQGYYRVYRGDNTCGVSEMSSSAV 467
>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis mellifera]
Length = 881
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 152/316 (48%), Positives = 201/316 (63%), Gaps = 16/316 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSE 108
E F F KF+KT+++ E RF++FK NL+ Q + TA +GVT F+DLTP E
Sbjct: 573 EMLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVTMFADLTPKE 632
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
F+ ++LG L+ + A I ++ LP FDWRD+ VT VKDQG CGSCW+FS T
Sbjct: 633 FKTRYLGFRPELKQENEIPLAKIEVSDIFLPLKFDWRDYNVVTPVKDQGLCGSCWAFSVT 692
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G +EG + + +L+SLSEQ+L+DCD + D GCNGG M +A++ I K GG+E
Sbjct: 693 GNVEGQYAIKYKKLLSLSEQELLDCD---------TLDEGCNGGYMENAYKAIEKLGGLE 743
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQT 287
E DYPY G + C F K V I+S+E +MA L+K+GP+++GINA MQ
Sbjct: 744 LESDYPYDGRN-EKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQF 802
Query: 288 YIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
YIGGVS P ++C K LDHGVLIVGYG S + P+ K+ PYWIIKNSWG WGENGYY+
Sbjct: 803 YIGGVSHPFHFLCNPKDLDHGVLIVGYGISKY-PLFHKKLPYWIIKNSWGSRWGENGYYR 861
Query: 345 ICMGRNVCGVDSMVSS 360
+ G CGV++M SS
Sbjct: 862 VYRGDGTCGVNAMASS 877
>gi|302794759|ref|XP_002979143.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
gi|300152911|gb|EFJ19551.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
Length = 227
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 130/244 (53%), Positives = 177/244 (72%), Gaps = 26/244 (10%)
Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
AP+LPT++LP FDWR+HGA+T VK+QG+CGSCW+FS+TGA+EGAHFL + EL+SL E+Q
Sbjct: 1 APLLPTDNLPKSFDWREHGAMTPVKNQGSCGSCWTFSSTGAVEGAHFLKSRELISLREEQ 60
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS------- 241
LVDCD D GC GG M +A+EYI KA G+E E+DYPY +
Sbjct: 61 LVDCDR---------MDGGCKGGDMLNAYEYI-KAKGLEAEEDYPYQEENYKEYMFPHHR 110
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC--G 299
C F SK+AA ++N+S +S DEDQ+AANLVK+GPL++ +NA ++ Y+GGV+CP IC G
Sbjct: 111 CHFRPSKVAATIANYSTVSEDEDQIAANLVKNGPLSIALNANYIMDYMGGVACPRICPGG 170
Query: 300 KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
++H VL+VGYG G +KPYWI+KNSW EN+GE+GY+++C G VCG+++ VS
Sbjct: 171 DNMNHAVLLVGYGMDG-------DKPYWILKNSWSENYGEDGYFRLCRGFGVCGMNTRVS 223
Query: 360 SVAA 363
+V+A
Sbjct: 224 TVSA 227
>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 161/373 (43%), Positives = 217/373 (58%), Gaps = 37/373 (9%)
Query: 9 LLLLLLSSVLASAVAVNDDD------AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
L +L S LA D + + P D + SED +A F F + K
Sbjct: 3 LFSILAGSALAGVAEFLQDSYDHSKLSEFFKTTPEDFDVSED---DARKQFENFLLEHPK 59
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
Y+ QE H RF+ F NL+R K ++ +A +GVT+F+DL+ EFRR +LGL L+
Sbjct: 60 MYSEQESHS-RFQTFWENLKRIKFHNHIEQGSAKYGVTEFTDLSDFEFRRHYLGLKPELK 118
Query: 122 ----------LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
++K T D FDW + GAVT VK+QG CGSCW+FS TG +E
Sbjct: 119 NLNRKKYERKSRNSSKKLKFAKTAD--ETFDWVEKGAVTEVKNQGMCGSCWAFSTTGNIE 176
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
GA F +TG+L+SLSEQ+LVDCD + DSGCNGGLM+ AFE +++ GG+E E+
Sbjct: 177 GAWFKATGDLISLSEQELVDCDQK---------DSGCNGGLMDQAFEEVIRIGGLETEQQ 227
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGG 291
YPY G +C F+KS + +F I DE+++A L +HGPL++ INA MQ Y GG
Sbjct: 228 YPYDGVQ-ETCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGG 286
Query: 292 VSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKE-KPYWIIKNSWGENWGENGYYKICM 347
VS P ++C LDHGVL+VGYG R + +PYW IKNSWG WGE+GYY++
Sbjct: 287 VSHPLSFLCSPDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWGPRWGEDGYYRVAR 346
Query: 348 GRNVCGVDSMVSS 360
G+ VCGV+ MVS+
Sbjct: 347 GKGVCGVNKMVST 359
>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
Length = 463
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 147/345 (42%), Positives = 206/345 (59%), Gaps = 28/345 (8%)
Query: 22 VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
V + D + +Q VPS + ED +L F F + ++K Y+ QEE R ++F NL
Sbjct: 137 VELTDTETSQKQNVPSS--ELEDEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNL 194
Query: 82 RRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLPADAQKAPILPTNDLP 138
++A+ Q +D TA +GVTK+SDLT EFR +L L+ + P K I+P P
Sbjct: 195 KKAQMIQEMDQGTAEYGVTKYSDLTEDEFRSLYLNPLLSSK---PLYQMKKAIVPNMSAP 251
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
+DWRDHGAVT VK+QG CGSCW+FS G +EG FL G LVSLSEQ+LVDCD
Sbjct: 252 DQWDWRDHGAVTEVKNQGMCGSCWAFSVIGNIEGQWFLKKGSLVSLSEQELVDCD----- 306
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
D C GGL ++A+E I K GG+E E++Y Y G +C F SK++A +++
Sbjct: 307 ----GVDHACAGGLPSNAYEAIEKLGGIETEQEYSYEG-HKNTCSFSTSKVSAYINSSVE 361
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSG 315
I DE+++AA L ++GP+++ +NA MQ Y G+S P+ +C ++ DH VL+VGYG
Sbjct: 362 IPKDENEIAAWLAQNGPISIALNAFAMQFYRKGISHPFRILCNPWMIDHAVLLVGYGERN 421
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
P+W IKNSWG +WGE GYY + G CG+++M SS
Sbjct: 422 GT-------PFWAIKNSWGTDWGEQGYYYLYRGTGACGMNTMCSS 459
>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
Length = 308
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 143/314 (45%), Positives = 199/314 (63%), Gaps = 28/314 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFL 114
F ++++TY+ ++E RFR++K NLR AK Q + TA++G T+FSDLT +EFR+ +
Sbjct: 10 FIGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLTQAEFRK--I 67
Query: 115 GLNRRLRLPADAQKAPI-----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
L + P K + ND+P FDWR+ AVT VK+QG+CGSCW+FS TG
Sbjct: 68 MLPYKWETPKVPNKMANFKEFGIAQNDIPESFDWREKNAVTEVKNQGSCGSCWAFSVTGN 127
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EGA + T +LVSLSEQ+LVDCD D GCNGGL ++A+ I++ GG+E E
Sbjct: 128 IEGAWAIKTSKLVSLSEQELVDCD---------IIDQGCNGGLPSNAYREIIRMGGLEAE 178
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
DYPY G G C K IA +++ + DE++MAA LV GP+++G+NA +Q Y
Sbjct: 179 SDYPYDGR-GEKCHLMKKDIAVYINDSLQLPHDEEKMAAWLVAKGPISIGLNANPLQFYR 237
Query: 290 GGVSCPY--ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
G++ P+ C K+LDHGVLIVGYGS +KPYWIIKNSWG WGE GY+++
Sbjct: 238 HGIAHPWRVFCSPKHLDHGVLIVGYGSE-------TDKPYWIIKNSWGTKWGEEGYFRLF 290
Query: 347 MGRNVCGVDSMVSS 360
G+NVCG+ M ++
Sbjct: 291 RGKNVCGIQEMATT 304
>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
Length = 361
Score = 267 bits (682), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 208/335 (62%), Gaps = 35/335 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FS+F ++KTY +EEH+ RF +FK NL+R A +L + TA +G+T+FSDL+PSEF R
Sbjct: 34 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 93
Query: 112 QFLGLNRRL-RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWS------ 163
+LGL + L A+ + + P N+ LP FDWR GAVT VK+QG CGSCW+
Sbjct: 94 HYLGLKKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSXXTE 153
Query: 164 ------------FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
FS TG +EG FLS +L+SLSEQ+LVDCDH D GC GG
Sbjct: 154 VKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELVDCDH---------GDHGCKGG 204
Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLV 271
M A + +++ GG+E E +YPY G D G+C+F+K++ A V +F + +E ++A L+
Sbjct: 205 YMGQAMKAVIEMGGLETESEYPYKGVD-GTCEFNKTESKARVQSFVGLPQNETELAYWLM 263
Query: 272 KHGPLAVGINAVWMQTYIGGVSCP--YICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWI 328
KHGP+++GINA MQ Y GG+S P ++C LDHGVL+VG+G + R K PYWI
Sbjct: 264 KHGPVSIGINANAMQFYFGGISHPWKFLCSPTDLDHGVLLVGFGVDKRS-FRRKPVPYWI 322
Query: 329 IKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAA 363
+KNSWG+ WGE GYY++ G CGV+ M S
Sbjct: 323 VKNSWGKYWGEKGYYRVYRGDGTCGVNQMALSAVV 357
>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/320 (46%), Positives = 194/320 (60%), Gaps = 35/320 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F K++K Y++Q+E D R +F NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 176 QFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFR 235
Query: 111 RQFLG-------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+L L+R ++ PA K P P +DWRDHGAV+ VK+QG CGSCW+
Sbjct: 236 STYLNPLLSQWTLHRPMK-PASPAKGPA------PASWDWRDHGAVSSVKNQGMCGSCWA 288
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +EG FL G LVSLSEQ+LVDCD D CNGGL ++A+E I K
Sbjct: 289 FSVTGNIEGQWFLKNGTLVSLSEQELVDCD---------GLDQACNGGLPSNAYEAIEKL 339
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+E E DY Y G SC F K+AA +++ +S DE ++AA L ++GP++V +NA
Sbjct: 340 GGLETETDYSYIGKK-QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAF 398
Query: 284 WMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
MQ Y GVS P C ++ DH VL+VGYG K P+W IKNSWGE++GE
Sbjct: 399 AMQFYRKGVSHPLKIFCNPWMIDHAVLMVGYGER-------KGIPFWAIKNSWGEDYGEQ 451
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GYY + G N CG++ M SS
Sbjct: 452 GYYNLYRGSNACGINKMCSS 471
>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
Length = 1165
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 199/339 (58%), Gaps = 33/339 (9%)
Query: 37 SDGE----QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP 92
SDGE + EDH A H F FK K S+ Y + EH+ RFR+FK NL + ++ +
Sbjct: 841 SDGEGHYSKGEDH---ARHLFEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQ 897
Query: 93 -TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ-------KAPILPTNDLPTDFDWR 144
TA +G+T F+D+T +E+R Q GL +P D KA I +LP FDWR
Sbjct: 898 GTAKYGITHFADMTSAEYR-QRTGLV----IPRDEDRNHVGNPKAEIDENMELPESFDWR 952
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
+ GAV+ VK+QG CGSCW+FS G +EG H + T L SEQ+L+DCD +
Sbjct: 953 ELGAVSPVKNQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCD---------AV 1003
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDED 264
DS C GG M+ A++ I K GG+E E +YPY +C F+ +++ V + +E
Sbjct: 1004 DSACQGGYMDDAYKAIEKIGGLELESEYPYLAKKQKTCHFNSTEVHVRVKGAVDLPKNET 1063
Query: 265 QMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRF 321
MA LV +GP+++G+NA MQ Y GG+S P+ +C K LDHGVLIVGYG + P+
Sbjct: 1064 AMAQYLVANGPISIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGYGVKEY-PMFN 1122
Query: 322 KEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
K PYWI+KNSWG WGE GYY+I G N CGV M SS
Sbjct: 1123 KTMPYWIVKNSWGPKWGEQGYYRIFRGDNTCGVSEMASS 1161
>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
Length = 1834
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 148/345 (42%), Positives = 199/345 (57%), Gaps = 39/345 (11%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
DDDA +R++ F F+ + YA+ EH+ RF +F+ NL + +
Sbjct: 1515 DDDAHVRRM------------------FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIE 1556
Query: 86 RRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD------AQKAPILPTNDLP 138
+ + TA +GVTKF+D+T +E+R GL A+ A + + DLP
Sbjct: 1557 QLNKFERGTAKYGVTKFADMTVAEYR-AHTGLVVPKHDRANHVGNRVASEEDVAGVGDLP 1615
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWRDHGAVT VK+QG+CGSCW+FSA G +EG H + T +L S SEQ+L+DCD
Sbjct: 1616 RSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCD----- 1670
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
D+GC GG M+ AF+ I + GG+E E DYPY SC F++S V
Sbjct: 1671 ----KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVD 1726
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICG-KYLDHGVLIVGYGSSG 315
+ +E +A L+K+GP+A+G+NA MQ Y GG+S P+ +C K +DHGVLIVGYG
Sbjct: 1727 MPKNETYIAKYLIKNGPIAIGLNANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGYGIKE 1786
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
+ P+ K PYWIIKNSWG WGE GYY+I G N CGV M SS
Sbjct: 1787 Y-PMFNKTLPYWIIKNSWGPRWGEQGYYRIYRGDNSCGVSEMASS 1830
>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
Length = 1810
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 148/345 (42%), Positives = 199/345 (57%), Gaps = 39/345 (11%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
DDDA +R++ F F+ + YA+ EH+ RF +F+ NL + +
Sbjct: 1491 DDDAHVRRM------------------FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIE 1532
Query: 86 RRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD------AQKAPILPTNDLP 138
+ + TA +GVTKF+D+T +E+R GL A+ A + + DLP
Sbjct: 1533 QLNKFERGTAKYGVTKFADMTVAEYR-AHTGLVVPKHDRANHVGNRVASEEDVAGVGDLP 1591
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWRDHGAVT VK+QG+CGSCW+FSA G +EG H + T +L S SEQ+L+DCD
Sbjct: 1592 RSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCD----- 1646
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
D+GC GG M+ AF+ I + GG+E E DYPY SC F++S V
Sbjct: 1647 ----KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVD 1702
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICG-KYLDHGVLIVGYGSSG 315
+ +E +A L+K+GP+A+G+NA MQ Y GG+S P+ +C K +DHGVLIVGYG
Sbjct: 1703 MPKNETYIAKYLIKNGPIAIGLNANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGYGIKE 1762
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
+ P+ K PYWIIKNSWG WGE GYY+I G N CGV M SS
Sbjct: 1763 Y-PMFNKTLPYWIIKNSWGPRWGEQGYYRIYRGDNSCGVSEMASS 1806
>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 148/320 (46%), Positives = 194/320 (60%), Gaps = 35/320 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F K++K Y++Q+E D R +F NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 176 QFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFR 235
Query: 111 RQFLG-------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+L L+R ++ PA K P P +DWRDHGAV+ VK+QG CGSCW+
Sbjct: 236 STYLNPLLSQWTLHRPMK-PASPAKGPA------PASWDWRDHGAVSSVKNQGMCGSCWA 288
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +EG FL G LVSLSEQ+LVDCD D CNGGL ++A+E I K
Sbjct: 289 FSVTGNIEGQWFLKNGTLVSLSEQELVDCD---------GLDQACNGGLPSNAYEAIEKL 339
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+E E DY Y G SC F K+AA +++ +S DE ++AA L ++GP++V +NA
Sbjct: 340 GGLETETDYSYIGKK-QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAF 398
Query: 284 WMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
MQ Y GVS P C ++ DH VL+VGYG K P+W IKNSWGE++GE
Sbjct: 399 AMQFYRKGVSHPLKIFCNPWMIDHAVLMVGYGER-------KGIPFWAIKNSWGEDYGEQ 451
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GYY + G N CG++ M SS
Sbjct: 452 GYYYLHRGSNACGINKMCSS 471
>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
Length = 953
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 148/345 (42%), Positives = 199/345 (57%), Gaps = 39/345 (11%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
DDDA +R++ F F+ + YA+ EH+ RF +F+ NL + +
Sbjct: 634 DDDAHVRRM------------------FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIE 675
Query: 86 RRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD------AQKAPILPTNDLP 138
+ + TA +GVTKF+D+T +E+R GL A+ A + + DLP
Sbjct: 676 QLNKFERGTAKYGVTKFADMTVAEYRAH-TGLVVPKHDRANHVGNRVASEEDVAGVGDLP 734
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWRDHGAVT VK+QG+CGSCW+FSA G +EG H + T +L S SEQ+L+DCD
Sbjct: 735 RSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCD----- 789
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
D+GC GG M+ AF+ I + GG+E E DYPY SC F++S V
Sbjct: 790 ----KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVD 845
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICG-KYLDHGVLIVGYGSSG 315
+ +E +A L+K+GP+A+G+NA MQ Y GG+S P+ +C K +DHGVLIVGYG
Sbjct: 846 MPKNETYIAKYLIKNGPIAIGLNANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGYGIKE 905
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
+ P+ K PYWIIKNSWG WGE GYY+I G N CGV M SS
Sbjct: 906 Y-PMFNKTLPYWIIKNSWGPRWGEQGYYRIYRGDNSCGVSEMASS 949
>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
Length = 427
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 193/315 (61%), Gaps = 23/315 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N F F+ KF K+Y++ R+ +FK NL + + Q L+ TA +G+TKFSDL+
Sbjct: 122 NTSRLFEEFQRKFRKSYSSDTAK--RYALFKYNLLKMQLIQRLEKGTANYGITKFSDLSA 179
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EFR + RR + + I PT LP FDWR +GAVT VKDQG CGSCW+F
Sbjct: 180 EEFRHSLANMKRR-KSKGSQMETAIFPTTIQSLPPSFDWRANGAVTEVKDQGMCGSCWAF 238
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
+ TG +EG F T +L+SLSEQQL+DCD + D CNGGL A++ I+K G
Sbjct: 239 ATTGNIEGQWFRKTNKLISLSEQQLLDCDTK---------DEACNGGLPEWAYDEIVKMG 289
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+ EKDYPY SC + I+A ++ + + SDE ++AA LV++GP++VG+NA +
Sbjct: 290 GLMSEKDYPYEAMKEQSCHLRRPNISAYINGSATLPSDEAKLAAWLVQNGPISVGVNANF 349
Query: 285 MQTYIGGVSCP--YICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+Q Y+GG+S P +C + LDH VL+VGYG S F +PYWI+KNSWG WGE G
Sbjct: 350 LQFYLGGISHPPHMLCSEAGLDHAVLLVGYGVSTFL-----RRPYWIVKNSWGGGWGEKG 404
Query: 342 YYKICMGRNVCGVDS 356
Y+++ G CG+++
Sbjct: 405 YFRMYRGDGTCGINA 419
>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
Length = 478
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 146/345 (42%), Positives = 203/345 (58%), Gaps = 27/345 (7%)
Query: 24 VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
+DD ++++ + + D+++ + F F + K Y + E RFRVFK N +
Sbjct: 149 THDDSVTVQELRKAKIIKPRDYVI--WNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKV 206
Query: 84 AKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA----QKAPILPTNDLP 138
+ Q + TAV+G TKFSD+T EF+ L +P D ++ + DLP
Sbjct: 207 IRELQKNEQGTAVYGFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKEGVTISEEDLP 266
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWR+HGAVT VK+QG+CGSCW+FS TG +EGA FL+ +LVSLSEQ+LVDCD
Sbjct: 267 DSFDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD----- 321
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
S D GCNGGL ++A++ I++ GG+E E YPY G G +C + IA ++
Sbjct: 322 ----SVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGR-GETCHLVRKDIAVYINGSVE 376
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSG 315
+ DE +M LV GP+++G+NA +Q Y GV P+ C + L+HGVLIVGYG G
Sbjct: 377 LPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDG 436
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
KPYWI+KNSWG WGE GY+K+ G+NVCGV M +S
Sbjct: 437 -------RKPYWIVKNSWGPTWGEAGYFKLYRGKNVCGVQEMATS 474
>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
Length = 478
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 146/345 (42%), Positives = 203/345 (58%), Gaps = 27/345 (7%)
Query: 24 VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
+DD ++++ + + D+++ + F F + K Y + E RFRVFK N +
Sbjct: 149 THDDSVTVQELRKAKIIKPRDYVV--WNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKV 206
Query: 84 AKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA----QKAPILPTNDLP 138
+ Q + TAV+G TKFSD+T EF+ L +P D ++ + DLP
Sbjct: 207 IRELQKNEQGTAVYGFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKEGVTISEEDLP 266
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWR+HGAVT VK+QG+CGSCW+FS TG +EGA FL+ +LVSLSEQ+LVDCD
Sbjct: 267 DSFDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD----- 321
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
S D GCNGGL ++A++ I++ GG+E E YPY G G +C + IA ++
Sbjct: 322 ----SVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGR-GETCHLVRKDIAVYINGSVE 376
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSG 315
+ DE +M LV GP+++G+NA +Q Y GV P+ C + L+HGVLIVGYG G
Sbjct: 377 LPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDG 436
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
KPYWI+KNSWG WGE GY+K+ G+NVCGV M +S
Sbjct: 437 -------RKPYWIVKNSWGPTWGEAGYFKLYRGKNVCGVQEMATS 474
>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
Length = 434
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/321 (42%), Positives = 201/321 (62%), Gaps = 22/321 (6%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
++LL + F L KF+K Y ++EE RFR+F+AN+++ + TA +G+T+FS
Sbjct: 128 EYLLQSFKDFVL---KFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQYGITEFS 184
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DL+ +EF+ +LGL ++ P +P LP +FDWR + AVT VK+QG+CGSCW
Sbjct: 185 DLSVTEFK-NYLGLKKK---PESKLPTAEIPDVKLPDNFDWRHYNAVTPVKNQGSCGSCW 240
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + EL+SLSEQ+L+DCD D+GCNGG M +E I+K
Sbjct: 241 AFSVTGNIEGLWAIKKHELLSLSEQELIDCD---------KIDNGCNGGYMPETYEAIMK 291
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
GG+E E DYPY + C +K++I ++ ++ E +A L K+GP++ G+NA
Sbjct: 292 LGGLETETDYPYEA-ENEKCNLNKTEIKVKINGAVNLTKSELDIAKWLYKNGPVSAGLNA 350
Query: 283 VWMQTYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
MQ Y+GG+S P +C + DHG+LIVGYG + ++ + PYWIIKNSWG++WGE
Sbjct: 351 NAMQFYLGGISHPPKILCNPEEQDHGILIVGYGIHKSSILK-RTIPYWIIKNSWGKHWGE 409
Query: 340 NGYYKICMGRNVCGVDSMVSS 360
GYY++ G VCG++ MVSS
Sbjct: 410 KGYYRLYRGSGVCGINQMVSS 430
>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
Length = 1785
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 140/320 (43%), Positives = 193/320 (60%), Gaps = 26/320 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F FK + YA+ EH+ R+ +F+ NL + + + T +GVTKF+D+T +E+R
Sbjct: 1477 QFEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERGTGKYGVTKFADMTTAEYR 1536
Query: 111 RQFLGLNRRLRLP---ADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+ L +P ++ + PI + LPT FDWRDHGAVTGVK+QG CGSCW+
Sbjct: 1537 -----AHTGLIVPKQHSNHIRNPIATVSTERTSLPTSFDWRDHGAVTGVKNQGNCGSCWA 1591
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSA G +EG H + T +L + SEQ+L+DCD + D+GCNGG M+ AF+ I K
Sbjct: 1592 FSAIGNIEGLHQIKTKKLEAYSEQELIDCD---------TVDNGCNGGYMDDAFKAIEKL 1642
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+E E +YPY +C F+K+ V + +E +A L+++GP+A+G+NA
Sbjct: 1643 GGLELEDEYPYQAKAQKTCHFNKTLSHVRVKGAVDMPKNETFIAQYLIENGPIAIGLNAN 1702
Query: 284 WMQTYIGGVSCPY--ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
MQ Y GG+S P+ +C K +DHGVLIVGYG + P+ K PYW IKNSWG WGE
Sbjct: 1703 AMQFYRGGISHPWHLLCSHKQIDHGVLIVGYGVKEY-PLFNKTLPYWTIKNSWGPKWGEQ 1761
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GYY+I G N CGV M SS
Sbjct: 1762 GYYRIYRGDNSCGVSEMASS 1781
>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1454
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 199/331 (60%), Gaps = 28/331 (8%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVT 99
+SEDH + H F FK++ ++TY + EH+ RFR+FK NL + ++ + TA +G+T
Sbjct: 1137 KSEDH---SRHLFDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGIT 1193
Query: 100 KFSDLTPSEFR-RQFLGLNRR------LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
F+D+T +E+R R L + R +R P A I +LP FDWR+ GAV+ V
Sbjct: 1194 HFADMTSAEYRARTGLVVPREGDEVNHIRNPM----AEIDEHMELPDAFDWRELGAVSEV 1249
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
K+QG CGSCW+FS G +EG H + T +L SEQ+L+DCD + DS CNGG
Sbjct: 1250 KNQGNCGSCWAFSVVGNIEGLHQVKTKKLEEYSEQELLDCD---------TVDSACNGGF 1300
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
M+ A++ I K GG+E E +YPY +C F+K+ V + +E +A LV
Sbjct: 1301 MDDAYKAIEKIGGLELESEYPYLAKKQKTCHFNKTMAHVRVKGAVDLPKNETAIAQFLVA 1360
Query: 273 HGPLAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWII 329
+GP+++G+NA MQ Y GG+S P+ +C K LDHGVLIVGYG + P+ K PYWI+
Sbjct: 1361 NGPVSIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGYGVKEY-PMFNKTLPYWIV 1419
Query: 330 KNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
KNSWG WGE GYY++ G N CGV M +S
Sbjct: 1420 KNSWGPKWGEQGYYRVFRGDNTCGVSEMATS 1450
>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
Length = 605
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 144/324 (44%), Positives = 198/324 (61%), Gaps = 23/324 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP---TAVHGVTKFS 102
L +H F +F+ K+ + YA EH R R+F+ NLR + +L D +A +G+T+F+
Sbjct: 292 LNKVDHLFHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQ--ELNDNEQGSAKYGITEFA 349
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGS 160
D+T SE+ Q GL +R K ++P +LP +FDWR+ AVT VK+QG+CGS
Sbjct: 350 DMTSSEYT-QRAGLWQRSANKPTGGKPAVVPAYKGELPKEFDWREKNAVTQVKNQGSCGS 408
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG + + TGEL SEQ+L+DCD S DS CNGGLM++A++ I
Sbjct: 409 CWAFSVTGNIEGLYAIKTGELREFSEQELLDCD---------STDSACNGGLMDNAYKAI 459
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVG 279
GG+E E +YPY C F+K+ V++F + +E M L+ +GP+++G
Sbjct: 460 KDIGGLEYESEYPYLAKK-KQCHFNKTLSHVQVADFVDLPKGNETAMQEWLLANGPISIG 518
Query: 280 INAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
+NA MQ Y GGVS P+ +C K LDHGVLIVGYG S + P K PYWI+KNSWG
Sbjct: 519 LNANAMQFYRGGVSHPWGPLCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPR 577
Query: 337 WGENGYYKICMGRNVCGVDSMVSS 360
WGE GYY+I G N CGV M +S
Sbjct: 578 WGEQGYYRIYRGDNTCGVSEMATS 601
>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
Length = 276
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 138/293 (47%), Positives = 191/293 (65%), Gaps = 27/293 (9%)
Query: 74 FRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFL--GLNRRLRLPADAQKAP 130
++F++N+R+A + Q +D TA +G T FSDL+ EFR+Q + G + L DA+
Sbjct: 1 MKIFESNMRKAAKMQKMDSGTAQYGPTIFSDLSEEEFRKQKMMPGWGKPLYEMKDAE--- 57
Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
+P D+P DWRD G VT VK+QG+CGSCW+FS TG +EG + + TG+LVSLSEQ+LV
Sbjct: 58 -IPLGDIPESVDWRDKGVVTPVKNQGSCGSCWAFSTTGNIEGQYAIKTGKLVSLSEQELV 116
Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
DCD + D GC GGL ++A++ I K GG+E E DYPY G D CKF+K+++
Sbjct: 117 DCD---------TIDKGCEGGLPSNAYKQIEKLGGLESESDYPYKGAD-SKCKFNKAEVK 166
Query: 251 AAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICG-KYLDHGVL 307
+++ VIS DE ++AA L K+GP+++GINA MQ Y+GG++ P+ C L+HGVL
Sbjct: 167 VTINSSVVISKDEKEIAAWLAKNGPISIGINANAMQFYMGGIAHPWKIFCNPSSLNHGVL 226
Query: 308 IVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
IVGYG PYWIIKNSWG +WGE GYY I G CG+++M +S
Sbjct: 227 IVGYGVKNGT-------PYWIIKNSWGPSWGEKGYYLIYRGGGCCGLNTMCTS 272
>gi|24417396|gb|AAN60308.1| unknown [Arabidopsis thaliana]
Length = 193
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 132/197 (67%), Positives = 155/197 (78%), Gaps = 6/197 (3%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
M+RL L S+ +L VL S+ VND DD +IRQVV +E +L +E HFSLFK
Sbjct: 1 MDRLKLYFSVFVLSFFIVLVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 179 GELVSLSEQQLVDCDHE 195
G+LVSLSEQQLVDCDH+
Sbjct: 177 GKLVSLSEQQLVDCDHQ 193
>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
Length = 475
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 193/323 (59%), Gaps = 35/323 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F +K++K Y++QEE D R R+F NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 176 QFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEYGVTKFSDLTEEEFR 235
Query: 111 RQFLG-------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+L L++ ++ PA K P P +DWRDHGAV+ VK+QG CGSCW+
Sbjct: 236 STYLNPLLSQWTLHQPMK-PATPAKGPS------PDSWDWRDHGAVSPVKNQGMCGSCWA 288
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS G +EG FL G L+SLSEQ+LVDCD D C GGL ++A+E I K
Sbjct: 289 FSVIGNIEGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAYEAIEKL 339
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+E E DY YTG C F K+AA +++ + DE ++AA L ++GP++V +NA
Sbjct: 340 GGLETESDYSYTGHK-QRCDFTTGKVAAYINSSVELPKDEKEIAAWLAENGPVSVALNAF 398
Query: 284 WMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
MQ Y G+S P C ++ DH VL+VGYG K P+W IKNSWGE++GE
Sbjct: 399 AMQFYRKGISHPLKIFCNPWMIDHAVLLVGYGER-------KGIPFWAIKNSWGEDYGEQ 451
Query: 341 GYYKICMGRNVCGVDSMVSSVAA 363
GYY + G N CG++ M SS
Sbjct: 452 GYYYLYRGSNACGINKMCSSAVV 474
>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
Length = 610
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 197/331 (59%), Gaps = 20/331 (6%)
Query: 39 GEQSEDH--LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAV 95
G + +H L EH F F+ KF + Y E R R+F+ NLR ++ + +A
Sbjct: 287 GHKKHNHHSLDKVEHLFHKFQIKFERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAK 346
Query: 96 HGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA--PILPTNDLPTDFDWRDHGAVTGVK 153
+G+T+F+D+T +E++ + R P QKA P P +LP +FDWR GAV+ VK
Sbjct: 347 YGITEFADMTSTEYKERTGLWQRTEGQPTGGQKAVVPSYPGGELPKEFDWRQKGAVSSVK 406
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+QG+CGSCW+FS G +EG + + TG+L SEQ+L+DCD + DS CNGGL
Sbjct: 407 NQGSCGSCWAFSTIGNIEGLNAVKTGQLKEFSEQELLDCD---------TKDSACNGGLP 457
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
++A++ I + GG+E E +YPY C F+K+ V+ F + ++E M L+
Sbjct: 458 DNAYKAIQEIGGLEYESEYPYKARK-EQCHFNKTLAHVQVTGFVDLPKNNETAMQEWLIA 516
Query: 273 HGPLAVGINAVWMQTYIGGVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
+GP+++GINA MQ Y GGVS P+ +C K LDHGVLIVGYG S + P K PYWI+
Sbjct: 517 NGPISIGINANAMQFYRGGVSHPWKILCEKSNLDHGVLIVGYGVSDY-PNFHKTLPYWIV 575
Query: 330 KNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
KNSWG WGE GYY++ G N CGV M SS
Sbjct: 576 KNSWGPRWGEQGYYRVYRGDNTCGVSEMASS 606
>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
Length = 620
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 193/322 (59%), Gaps = 19/322 (5%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDL 104
L EH F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+D+
Sbjct: 307 LDKVEHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADM 366
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E++ + GL +R A ++P + +LP +FDWR AVTGVK+QG CGSCW
Sbjct: 367 TSTEYKER-TGLWQRDEAKATGGSPAVVPAYSGELPKEFDWRSKNAVTGVKNQGQCGSCW 425
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + L GEL SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 426 AFSVTGNIEGLYALKYGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 476
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGIN 281
GG+E E +YPY C F+K+ V +F + +E M LV +GP+++GIN
Sbjct: 477 IGGLEYEAEYPYEAKK-KQCHFNKTMSHVQVKDFVDLPKGNETAMQEWLVSNGPISIGIN 535
Query: 282 AVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y GGVS P+ +C K LDHGVL+VGYG S + P K PYWI+KNSWG WG
Sbjct: 536 ANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNYHKTLPYWIVKNSWGPRWG 594
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GYY++ G N CGV M +S
Sbjct: 595 EQGYYRVYRGDNTCGVSEMATS 616
>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
Length = 617
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 141/324 (43%), Positives = 194/324 (59%), Gaps = 20/324 (6%)
Query: 45 HLLNA-EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
H LN EH F F+ K+ + YA EH R R+F+ NLR + + +A +G+T+F+
Sbjct: 302 HTLNKIEHLFHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERGSAKYGITQFA 361
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGS 160
D+T +E++ GL +R A ++P ++P +FDWR AVT VK+QG CGS
Sbjct: 362 DMTSTEYKLH-AGLWQRSEDKPTGGAAAVVPPYAGEMPKEFDWRQKKAVTHVKNQGQCGS 420
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG + + TGEL SEQ+L+DCD S DS CNGGLM++A++ I
Sbjct: 421 CWAFSVTGNIEGLYAIKTGELEEFSEQELLDCD---------STDSACNGGLMDNAYKAI 471
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVG 279
GG+E E +YPY C F+++ +S F + +E M L+ +GP+++G
Sbjct: 472 KDIGGLEYESEYPYAAKK-MQCHFNRTMSHVQLSGFVDLPKGNETAMQEWLLSNGPISIG 530
Query: 280 INAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
+NA MQ Y GGVS P+ +C K LDHGVLIVGYG S + P K PYWI+KNSWG
Sbjct: 531 LNANAMQFYRGGVSHPWAPLCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPR 589
Query: 337 WGENGYYKICMGRNVCGVDSMVSS 360
WGE GYY+I G N CGV M +S
Sbjct: 590 WGEQGYYRIYRGDNTCGVSEMATS 613
>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
Length = 615
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/328 (41%), Positives = 198/328 (60%), Gaps = 19/328 (5%)
Query: 40 EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGV 98
+ S L A+H F F+ +F + Y + E R R+F+ NL+ ++ + + +A +G+
Sbjct: 296 KHSHRALDKADHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGI 355
Query: 99 TKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQG 156
T+F+D+T SE++ + GL +R A ++P +LP +FDWR AVT VK+QG
Sbjct: 356 TEFADMTSSEYKER-TGLWQRNEAKATGGSVAVVPAYHGELPKEFDWRQKNAVTQVKNQG 414
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
+CGSCW+FS TG +EG H + TG+L SEQ+L+DCD + DS CNGGLM++A
Sbjct: 415 SCGSCWAFSVTGNIEGLHAVKTGDLKEFSEQELLDCD---------TTDSACNGGLMDNA 465
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGP 275
++ I GG+E E +YPY C F+++ V+ F + +E M L+ +GP
Sbjct: 466 YKAIKDIGGLEYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGP 524
Query: 276 LAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNS 332
+++GINA MQ Y GGVS P+ +C K LDHGVL+VGYG S + P K PYWI+KNS
Sbjct: 525 ISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSEY-PNFHKTLPYWIVKNS 583
Query: 333 WGENWGENGYYKICMGRNVCGVDSMVSS 360
WG WGE GYY++ G N CGV M +S
Sbjct: 584 WGPRWGEQGYYRVYRGDNTCGVSEMATS 611
>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
Length = 599
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 194/322 (60%), Gaps = 19/322 (5%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDL 104
L +H F F+ K+ + YA EH R R+F+ +L+ + + +A +G+T+F+D+
Sbjct: 286 LNKVDHLFHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQELNANEQGSAKYGITEFADM 345
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E+ Q GL +R A ++P +LP +FDWR AVT VK+QG CGSCW
Sbjct: 346 TSTEYA-QRAGLWQRSEGKPTGGAAAVVPAYAGELPKEFDWRQKNAVTHVKNQGQCGSCW 404
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EGA+ + TG+L SEQ+L+DCD S DS CNGGLM++A++ I
Sbjct: 405 AFSVTGNIEGAYAIKTGDLQEFSEQELLDCD---------SKDSACNGGLMDNAYKAIKD 455
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGIN 281
GG+E E +YPY G C F+++ VS F + +E M L+ +GP+++GIN
Sbjct: 456 IGGLEYESEYPYEGKK-KQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTNGPISIGIN 514
Query: 282 AVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y GGVS P+ +C K LDHGVLIVGYG S + P K PYWI+KNSWG WG
Sbjct: 515 ANAMQFYRGGVSHPWSPLCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPRWG 573
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GYY++ G N CGV M +S
Sbjct: 574 EQGYYRVYRGDNTCGVSEMATS 595
>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
kowalevskii]
Length = 352
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 153/367 (41%), Positives = 206/367 (56%), Gaps = 29/367 (7%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD----GEQSEDHLLNAEHHFSLFKSK 59
+ + +L+ + LS+V + A+ I V D + + F F
Sbjct: 1 MAILTLIAVFLSTVALGSQAIGPRTITINNVPMIDEIERNTNESGSVDKTQDLFQDFMKT 60
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
+ K Y T+EEH R+++F+ NL +A+R +Q T +GVTKF DL+ EFR+ +L
Sbjct: 61 YDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSEEEFRKYYLTPVW 120
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRD--HGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R P +KA I P P FDWRD AVT VK+QG CGSCW+FS TG +EG +
Sbjct: 121 RGSDP-HMKKAEI-PKGTPPAAFDWRDADKNAVTKVKNQGTCGSCWAFSTTGNIEGQWKI 178
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G LVSLSEQ+LVDCD D GCNGGL ++A++ I++ GG+ E DYPYTG
Sbjct: 179 KKGTLVSLSEQELVDCD---------KLDQGCNGGLPSNAYQEIMRFGGIMSEDDYPYTG 229
Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY 296
D CK + + ++ IS DE MA+ L +GP+++GINA MQ Y GGVS P+
Sbjct: 230 RD-QDCKLNATLNKVYINGSMNISKDEGDMASWLAANGPISIGINANAMQFYFGGVSHPW 288
Query: 297 --ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCG 353
C + LDHGVLIVGYG+ PYWIIKNSWG +WG GYY + G VCG
Sbjct: 289 KIFCNPENLDHGVLIVGYGTK-------DGTPYWIIKNSWGRSWGVEGYYLVYRGGGVCG 341
Query: 354 VDSMVSS 360
++ M +S
Sbjct: 342 LNEMCTS 348
>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
Length = 475
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 207/349 (59%), Gaps = 29/349 (8%)
Query: 22 VAVNDDDAM-IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
+ + DDD++ ++++ + + D+++ + F F + K Y+ + E RFR FK N
Sbjct: 142 IQLTDDDSITVQELRKAKIIRPRDYVI--WNSFLDFIDRHEKRYSNKREVLKRFRTFKKN 199
Query: 81 LRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI-LPT 134
+ + Q + TAV+G TKFSD+T EF++ L + AD +K I +
Sbjct: 200 AKAIRELQKNEQGTAVYGFTKFSDMTTMEFKQTMLPYQWEQPVYPMDQADFEKEGITISE 259
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
DLP FDWRD GAVT VK+QG CGSCW+FS TG +EGA FL+ +LVSLSEQ+LVDCD
Sbjct: 260 EDLPESFDWRDKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFLAKNKLVSLSEQELVDCD- 318
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
D GCNGGL ++A++ I++ GG+E E YPY G G +C + IA ++
Sbjct: 319 --------GVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGK-GETCHLVRKDIAVYIN 369
Query: 255 NFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGY 311
+ DE +M LV GP+++G+NA +Q Y GV P+ C + L+HGVLIVGY
Sbjct: 370 GSIELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGY 429
Query: 312 GSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
G G KPYWI+KNSWG WGE+GY+K+ G+NVCGV M +S
Sbjct: 430 GKDG-------RKPYWIVKNSWGPTWGESGYFKLYRGKNVCGVQEMATS 471
>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
Length = 353
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 201/341 (58%), Gaps = 32/341 (9%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKS------KFSKTYATQEEHDYRFRVFKANLRRA 84
+ Q+ P+ S+D A HH +FK+ +++K+Y +E +YR++VF N+ RA
Sbjct: 30 MMQLQPATRRFSQD---TATHHDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARA 86
Query: 85 KRRQLLD-PTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDW 143
Q D T +G TK SDLT E + F + + + +KA I N LP FDW
Sbjct: 87 MLFQKHDNATGRYGFTKLSDLTDQEVK-SFYAMKKWPQQLYPTKKANIPQLNSLPQSFDW 145
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
R GAVT VKDQ CG+CW+F+ TG +EG +L+ G+L SLSEQ+LVDCD
Sbjct: 146 RSKGAVTAVKDQKRCGACWAFATTGNIEGQWYLNKGKLYSLSEQELVDCD---------K 196
Query: 204 CDSGCNGGLMNSAFEYIL-KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
D GC GGL +A+ I+ + GG+E EKDYPY + G CK +KS+ +++ +S++
Sbjct: 197 IDEGCKGGLPLNAYHSIMNRLGGLETEKDYPYVAKN-GKCKLNKSEEVVYINSSVKVSTN 255
Query: 263 EDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYI--CG-KYLDHGVLIVGYGSSGFAPI 319
E +AA LV HGP+A+GIN+V M Y GG++ P C K LDHGVLIVGYG
Sbjct: 256 ETDLAAWLVAHGPVAIGINSVNMLHYKGGIAHPTNKDCNPKLLDHGVLIVGYGEE----- 310
Query: 320 RFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
K PYWIIKNSWG +WGE GYY++ G CG++ +S
Sbjct: 311 --KSTPYWIIKNSWGTDWGEKGYYRVVRGIGACGLNKSATS 349
>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
Length = 615
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 193/322 (59%), Gaps = 19/322 (5%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDL 104
L +H F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+DL
Sbjct: 302 LDKVDHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADL 361
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T SE++ + GL +R A A ++P +LP +FDWR AVT VK+QG+CGSCW
Sbjct: 362 TSSEYKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKNAVTPVKNQGSCGSCW 420
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 421 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 471
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGIN 281
GG+E E +YPY C F+++ V+ F + +E M L+ GP+++GIN
Sbjct: 472 IGGLEYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTKGPISIGIN 530
Query: 282 AVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y GGVS P+ +C K LDHGVL+VGYG S + P K PYWI+KNSWG WG
Sbjct: 531 ANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNFHKTLPYWIVKNSWGPRWG 589
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GYY++ G N CGV M +S
Sbjct: 590 EQGYYRVYRGDNTCGVSEMATS 611
>gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum]
Length = 347
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/323 (44%), Positives = 189/323 (58%), Gaps = 21/323 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV-------HGVTKFS 102
E F F+ K++K Y + E +F FK NL R L+ A GV +F+
Sbjct: 24 EIQFRDFQVKYNKVYGSHE-FSQKFVTFKDNLNRIDT---LNANAAASGSDTKFGVNEFA 79
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACG 159
DL+ EFR+ ++ +P+DAQ A L P+ FDWR GAVT VK+QG CG
Sbjct: 80 DLSVQEFRKFYMNA-VPASVPSDAQVAGDYSDETLASIPSSFDWRTKGAVTPVKNQGQCG 138
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE-SGSCDSGCNGGLMNSAFE 218
SCWSFS TG +EG FL+ L LSEQ LVDCDH C + SCD GCNGGL +AF+
Sbjct: 139 SCWSFSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHHCMTYDGQQSCDDGCNGGLQPNAFQ 198
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
YI+ GG++ E YPY C+F S I A +SN+ ++S++E Q+AA L +GP+++
Sbjct: 199 YIIGNGGIDTETSYPYLAVAQDKCQFKASNIGAKISNWQMLSTNETQIAAYLALNGPVSI 258
Query: 279 GINAVWMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q YIGGV P CGK LDHG+LIVGY + I KPYW +KNSWG +W
Sbjct: 259 AADAAEWQFYIGGVFDLP--CGKALDHGILIVGYDTE--TNIFGHAKPYWWVKNSWGASW 314
Query: 338 GENGYYKICMGRNVCGVDSMVSS 360
GE GY K+ G CG+++ VS+
Sbjct: 315 GEQGYLKVLRGAGECGLNTFVST 337
>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
Length = 477
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 204/345 (59%), Gaps = 28/345 (8%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
+DD ++++ + + D+++ + F F + K Y + E RFRVFK N +
Sbjct: 148 HDDSITVQELRKAKIIRPRDYVI--WNSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVI 205
Query: 85 KRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI-LPTNDLP 138
+ Q + TAV+G TKFSD+T EF++ L + A+ +K + + DLP
Sbjct: 206 RELQKNEQGTAVYGFTKFSDMTTMEFKKIMLPYQWEQPVYPMEQANFEKHDVTINEEDLP 265
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWR+ GAVT VK+QG CGSCW+FS TG +EGA F++ +LVSLSEQ+LVDCD
Sbjct: 266 ESFDWREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDCD----- 320
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
S D GCNGGL ++A++ I++ GG+E E YPY G G +C + IA ++
Sbjct: 321 ----SMDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGR-GETCHLVRKDIAVYINGSVE 375
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSG 315
+ DE +M LV GP+++G+NA +Q Y GV P+ C + L+HGVLIVGYG G
Sbjct: 376 LPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDG 435
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
KPYWI+KNSWG NWGE GY+K+ G+NVCGV M +S
Sbjct: 436 -------RKPYWIVKNSWGPNWGEAGYFKLYRGKNVCGVQEMATS 473
>gi|357473429|ref|XP_003606999.1| Cysteine proteinase [Medicago truncatula]
gi|355508054|gb|AES89196.1| Cysteine proteinase [Medicago truncatula]
Length = 210
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 124/196 (63%), Positives = 152/196 (77%), Gaps = 6/196 (3%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+ L ++L + SV A + +D +IRQVV +G + L AEHHF+LFK KF
Sbjct: 1 MDHRTLLLFVVLFIFSVSAFSTPDEGEDPIIRQVVDEEGVR-----LGAEHHFNLFKHKF 55
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y++++EHDYRF++FK+NL RAKR QL+DP+AVHGVT+FSDLTP EFR+ LGL R +
Sbjct: 56 GKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKSVLGL-RGV 114
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
LP DA APILPT++LP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGAHFLSTG+
Sbjct: 115 GLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGAHFLSTGK 174
Query: 181 LVSLSEQQLVDCDHEC 196
LVSLSEQQLVDCDHE
Sbjct: 175 LVSLSEQQLVDCDHEV 190
>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
Precursor
gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
Length = 614
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 193/318 (60%), Gaps = 19/318 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSE 108
+H F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+D+T SE
Sbjct: 305 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 364
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
++ + GL +R A A ++P +LP +FDWR AVT VK+QG+CGSCW+FS
Sbjct: 365 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 423
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I GG+
Sbjct: 424 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 474
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVWM 285
E E +YPY C F+++ V+ F + +E M L+ +GP+++GINA M
Sbjct: 475 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAM 533
Query: 286 QTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GGVS P+ +C K LDHGVL+VGYG S + P K PYWI+KNSWG WGE GY
Sbjct: 534 QFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNFHKTLPYWIVKNSWGPRWGEQGY 592
Query: 343 YKICMGRNVCGVDSMVSS 360
Y++ G N CGV M +S
Sbjct: 593 YRVYRGDNTCGVSEMATS 610
>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
Length = 615
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 193/318 (60%), Gaps = 19/318 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSE 108
+H F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+D+T SE
Sbjct: 306 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 365
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
++ + GL +R A A ++P +LP +FDWR AVT VK+QG+CGSCW+FS
Sbjct: 366 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 424
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I GG+
Sbjct: 425 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 475
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVWM 285
E E +YPY C F+++ V+ F + +E M L+ +GP+++GINA M
Sbjct: 476 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPISIGINANAM 534
Query: 286 QTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GGVS P+ +C K LDHGVL+VGYG S + P K PYWI+KNSWG WGE GY
Sbjct: 535 QFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNFHKTLPYWIVKNSWGPRWGEQGY 593
Query: 343 YKICMGRNVCGVDSMVSS 360
Y++ G N CGV M +S
Sbjct: 594 YRVYRGDNTCGVSEMATS 611
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 193/317 (60%), Gaps = 26/317 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K+YA ++ RF +FK NL RA+ QL + TA +GVT+FSDLTP
Sbjct: 27 SARELYEQFKRDYGKSYANDDDEK-RFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF +FL R ++ + P DWR+ GAV V+DQG+CGSCW+FS
Sbjct: 86 EEFAAKFLSS----RFDDQVERVQLNDLKAAPESVDWRELGAVAPVEDQGSCGSCWAFSV 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG FL TG+LVSLS+QQLVDCD + DSGC+GG + + I++ GG+
Sbjct: 142 AGNVEGQWFLKTGQLVSLSKQQLVDCDVQ---------DSGCDGGYPPTTYGEIIRMGGL 192
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E ++DYPY G + CK D+SK+ A +++ V+ ++E + AA + +HGP++ GINAV +Q
Sbjct: 193 EAQRDYPYVGRE-QPCKLDESKLLAKINSSIVLEANEKKQAAYIAEHGPMSSGINAVTLQ 251
Query: 287 TYIGGVSCPYICG---KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+S P +L+HGVL VGYG+ PYWIIKNSWG WGE GY+
Sbjct: 252 FYQSGISHPSKSQCQPDWLNHGVLSVGYGTEDGV-------PYWIIKNSWGTGWGEKGYF 304
Query: 344 KICMGRNVCGVDSMVSS 360
++ G CG++ +VSS
Sbjct: 305 RLYRGDGTCGIEKVVSS 321
>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
Length = 477
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 143/346 (41%), Positives = 206/346 (59%), Gaps = 28/346 (8%)
Query: 24 VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
+DD ++++ + + D+++ + F F + K Y+ + E RFR FK N +
Sbjct: 147 THDDSITVQELRKAKIIRPRDYVI--WNSFLDFIDRHEKRYSNKREVLKRFRTFKKNAKV 204
Query: 84 AKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI-LPTNDL 137
+ Q + +AV+G TKFSD+T EF++ L + AD +K + + +DL
Sbjct: 205 IRELQKNEQGSAVYGFTKFSDMTTMEFKQTMLPYQWEQPVYPMAEADFEKEGVTISEDDL 264
Query: 138 PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECD 197
P FDWRDHGAVT VK+QG CGSCW+FS TG +EGA +L+ +LVSLSEQ+LVDCD
Sbjct: 265 PDSFDWRDHGAVTQVKNQGNCGSCWAFSTTGNVEGAWYLAKKKLVSLSEQELVDCD---- 320
Query: 198 PEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS 257
S D GCNGGL ++A++ I++ GG+E E YPY G G +C + IA ++
Sbjct: 321 -----SVDQGCNGGLPSNAYKEIMRMGGLEPEDAYPYDGK-GETCHIVRKDIAVYINGSV 374
Query: 258 VISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSS 314
+ DE ++ LV GP+++G+NA +Q Y GV P+ C + L+HGVLIVGYG
Sbjct: 375 ELPHDEVKIQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKD 434
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
G KPYWI+KNSWG WGE+GY+++ G+NVCGV M +S
Sbjct: 435 G-------RKPYWIVKNSWGPTWGESGYFRLYRGKNVCGVQEMATS 473
>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
Length = 475
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 193/318 (60%), Gaps = 19/318 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSE 108
+H F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+D+T SE
Sbjct: 166 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 225
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
++ + GL +R A A ++P +LP +FDWR AVT VK+QG+CGSCW+FS
Sbjct: 226 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 284
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I GG+
Sbjct: 285 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 335
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVWM 285
E E +YPY C F+++ V+ F + +E M L+ +GP+++GINA M
Sbjct: 336 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAM 394
Query: 286 QTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GGVS P+ +C K LDHGVL+VGYG S + P K PYWI+KNSWG WGE GY
Sbjct: 395 QFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNFHKTLPYWIVKNSWGPRWGEQGY 453
Query: 343 YKICMGRNVCGVDSMVSS 360
Y++ G N CGV M +S
Sbjct: 454 YRVYRGDNTCGVSEMATS 471
>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 150/365 (41%), Positives = 204/365 (55%), Gaps = 49/365 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCDH D GCNGG + I K GG+E DYPYTG D
Sbjct: 157 GDLLALSEQQLVDCDH---------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD 207
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV--SCPY 296
G C ++SK A V++ +V+ E A L + GPL+ +NAV +Q Y+GG+ P+
Sbjct: 208 -GICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIPF 266
Query: 297 ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVD 355
+C + L+H VL VGYG+ F PYWI+KNSWG +GE GY++I G CG++
Sbjct: 267 LCNPHGLNHAVLTVGYGTE-FG------IPYWIVKNSWGVGFGEKGYFRIFRGAGTCGIN 319
Query: 356 SMVSS 360
+VS+
Sbjct: 320 LVVST 324
>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
Length = 326
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 147/363 (40%), Positives = 200/363 (55%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSN-DDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GDLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A ++ +++ E A L GPL+ +NA +Q Y GG+ P +C
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPRLC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|323713320|gb|ADY04414.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 114/144 (79%), Positives = 133/144 (92%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSHDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713078|gb|ADY04293.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713086|gb|ADY04297.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 114/144 (79%), Positives = 133/144 (92%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRLKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713016|gb|ADY04262.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713018|gb|ADY04263.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713020|gb|ADY04264.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713022|gb|ADY04265.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713024|gb|ADY04266.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713026|gb|ADY04267.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713030|gb|ADY04269.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713032|gb|ADY04270.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713034|gb|ADY04271.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713036|gb|ADY04272.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713038|gb|ADY04273.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713040|gb|ADY04274.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713042|gb|ADY04275.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713044|gb|ADY04276.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713046|gb|ADY04277.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713048|gb|ADY04278.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713050|gb|ADY04279.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713052|gb|ADY04280.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713054|gb|ADY04281.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713056|gb|ADY04282.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713058|gb|ADY04283.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713060|gb|ADY04284.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713062|gb|ADY04285.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713064|gb|ADY04286.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713066|gb|ADY04287.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713068|gb|ADY04288.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713070|gb|ADY04289.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713072|gb|ADY04290.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713074|gb|ADY04291.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713076|gb|ADY04292.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713080|gb|ADY04294.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713084|gb|ADY04296.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713088|gb|ADY04298.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713090|gb|ADY04299.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713092|gb|ADY04300.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713094|gb|ADY04301.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713096|gb|ADY04302.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713098|gb|ADY04303.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713100|gb|ADY04304.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713102|gb|ADY04305.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713104|gb|ADY04306.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713106|gb|ADY04307.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713108|gb|ADY04308.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713110|gb|ADY04309.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713112|gb|ADY04310.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713114|gb|ADY04311.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713116|gb|ADY04312.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713118|gb|ADY04313.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713120|gb|ADY04314.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713122|gb|ADY04315.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713124|gb|ADY04316.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713126|gb|ADY04317.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713128|gb|ADY04318.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713130|gb|ADY04319.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713132|gb|ADY04320.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713134|gb|ADY04321.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713136|gb|ADY04322.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713138|gb|ADY04323.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713140|gb|ADY04324.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713142|gb|ADY04325.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713144|gb|ADY04326.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713146|gb|ADY04327.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713148|gb|ADY04328.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713150|gb|ADY04329.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713152|gb|ADY04330.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713154|gb|ADY04331.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713156|gb|ADY04332.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713158|gb|ADY04333.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713160|gb|ADY04334.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713162|gb|ADY04335.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713166|gb|ADY04337.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713168|gb|ADY04338.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713170|gb|ADY04339.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713172|gb|ADY04340.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713174|gb|ADY04341.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713180|gb|ADY04344.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713182|gb|ADY04345.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713184|gb|ADY04346.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713186|gb|ADY04347.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713188|gb|ADY04348.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713190|gb|ADY04349.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713192|gb|ADY04350.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713194|gb|ADY04351.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713196|gb|ADY04352.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713198|gb|ADY04353.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713200|gb|ADY04354.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713202|gb|ADY04355.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713204|gb|ADY04356.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713206|gb|ADY04357.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713212|gb|ADY04360.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713216|gb|ADY04362.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713218|gb|ADY04363.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713220|gb|ADY04364.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713222|gb|ADY04365.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713224|gb|ADY04366.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713226|gb|ADY04367.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713230|gb|ADY04369.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713232|gb|ADY04370.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713234|gb|ADY04371.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713236|gb|ADY04372.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713238|gb|ADY04373.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713240|gb|ADY04374.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713246|gb|ADY04377.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713248|gb|ADY04378.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713250|gb|ADY04379.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713252|gb|ADY04380.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713254|gb|ADY04381.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713256|gb|ADY04382.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713258|gb|ADY04383.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713260|gb|ADY04384.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713262|gb|ADY04385.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713264|gb|ADY04386.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713266|gb|ADY04387.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713268|gb|ADY04388.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713270|gb|ADY04389.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713274|gb|ADY04391.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713276|gb|ADY04392.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713278|gb|ADY04393.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713280|gb|ADY04394.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713282|gb|ADY04395.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713284|gb|ADY04396.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713286|gb|ADY04397.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713288|gb|ADY04398.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713290|gb|ADY04399.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713292|gb|ADY04400.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713294|gb|ADY04401.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713296|gb|ADY04402.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713298|gb|ADY04403.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713300|gb|ADY04404.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713302|gb|ADY04405.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713304|gb|ADY04406.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713306|gb|ADY04407.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713308|gb|ADY04408.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713310|gb|ADY04409.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713312|gb|ADY04410.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713314|gb|ADY04411.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713316|gb|ADY04412.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713318|gb|ADY04413.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713322|gb|ADY04415.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713324|gb|ADY04416.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713326|gb|ADY04417.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713328|gb|ADY04418.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713330|gb|ADY04419.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713332|gb|ADY04420.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713334|gb|ADY04421.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713336|gb|ADY04422.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713338|gb|ADY04423.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713340|gb|ADY04424.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713342|gb|ADY04425.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713344|gb|ADY04426.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713346|gb|ADY04427.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713348|gb|ADY04428.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713350|gb|ADY04429.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713352|gb|ADY04430.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713354|gb|ADY04431.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713356|gb|ADY04432.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713358|gb|ADY04433.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713360|gb|ADY04434.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713362|gb|ADY04435.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713364|gb|ADY04436.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713366|gb|ADY04437.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713368|gb|ADY04438.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713370|gb|ADY04439.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713372|gb|ADY04440.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713374|gb|ADY04441.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713376|gb|ADY04442.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713378|gb|ADY04443.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713380|gb|ADY04444.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713382|gb|ADY04445.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713384|gb|ADY04446.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713386|gb|ADY04447.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713388|gb|ADY04448.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713390|gb|ADY04449.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713392|gb|ADY04450.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713394|gb|ADY04451.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713396|gb|ADY04452.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713398|gb|ADY04453.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713400|gb|ADY04454.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713402|gb|ADY04455.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713404|gb|ADY04456.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713408|gb|ADY04458.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713410|gb|ADY04459.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713412|gb|ADY04460.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713414|gb|ADY04461.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713416|gb|ADY04462.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713418|gb|ADY04463.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713420|gb|ADY04464.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713422|gb|ADY04465.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713424|gb|ADY04466.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713426|gb|ADY04467.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713428|gb|ADY04468.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713430|gb|ADY04469.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713432|gb|ADY04470.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713434|gb|ADY04471.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713436|gb|ADY04472.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713438|gb|ADY04473.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713440|gb|ADY04474.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713442|gb|ADY04475.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713444|gb|ADY04476.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713448|gb|ADY04478.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713454|gb|ADY04481.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713458|gb|ADY04483.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713460|gb|ADY04484.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713462|gb|ADY04485.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713464|gb|ADY04486.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713466|gb|ADY04487.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713468|gb|ADY04488.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713470|gb|ADY04489.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713474|gb|ADY04491.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713478|gb|ADY04493.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713494|gb|ADY04501.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713496|gb|ADY04502.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713498|gb|ADY04503.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713500|gb|ADY04504.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713502|gb|ADY04505.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713504|gb|ADY04506.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713506|gb|ADY04507.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713508|gb|ADY04508.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713510|gb|ADY04509.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713512|gb|ADY04510.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713514|gb|ADY04511.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713516|gb|ADY04512.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713518|gb|ADY04513.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713520|gb|ADY04514.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713522|gb|ADY04515.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713524|gb|ADY04516.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713526|gb|ADY04517.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713528|gb|ADY04518.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 114/144 (79%), Positives = 133/144 (92%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713210|gb|ADY04359.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 114/144 (79%), Positives = 133/144 (92%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDRWGEEGFYKICRGRNICG 144
>gi|323713176|gb|ADY04342.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 114/144 (79%), Positives = 132/144 (91%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LK GG+ +E+DYPYTGTD GSCKF+KSKIAAAV+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKTGGLMKEEDYPYTGTDKGSCKFEKSKIAAAVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713456|gb|ADY04482.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 253 bits (647), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 114/144 (79%), Positives = 133/144 (92%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRVKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713228|gb|ADY04368.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713242|gb|ADY04375.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713244|gb|ADY04376.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713272|gb|ADY04390.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713446|gb|ADY04477.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713450|gb|ADY04479.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 114/144 (79%), Positives = 132/144 (91%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGNKWGEEGFYKICRGRNICG 144
>gi|323713208|gb|ADY04358.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 113/144 (78%), Positives = 133/144 (92%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYG+SG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGTSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713452|gb|ADY04480.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 113/144 (78%), Positives = 133/144 (92%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P++ KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVKMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713164|gb|ADY04336.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713178|gb|ADY04343.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 113/144 (78%), Positives = 132/144 (91%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LK GG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKTGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
Length = 326
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/363 (40%), Positives = 199/363 (54%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSN-DDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GDLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A ++ +++ E A L GPL+ +NA +Q Y GG+ P C
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPKWC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|371781479|emb|CCA95098.1| putative responsive to dehydration 19, partial [Liriodendron
tulipifera]
Length = 150
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 115/147 (78%), Positives = 135/147 (91%), Gaps = 1/147 (0%)
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
SCD+GCNGGLM SAF+Y LK+GG+E+E+DYPYTG DG +CKF+KSKIAA+ N++V+S D
Sbjct: 4 SCDAGCNGGLMTSAFKYTLKSGGLEKEEDYPYTGKDGATCKFEKSKIAASALNYTVVSID 63
Query: 263 EDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRF 321
EDQ+AANLVK GPLAVGINAV+MQTYIGGVSCPYIC K LDHGVL+VGYG++G+APIRF
Sbjct: 64 EDQIAANLVKFGPLAVGINAVFMQTYIGGVSCPYICSKRLLDHGVLLVGYGAAGYAPIRF 123
Query: 322 KEKPYWIIKNSWGENWGENGYYKICMG 348
K+KPYWIIKNSWGE+WGENGYYKIC G
Sbjct: 124 KDKPYWIIKNSWGESWGENGYYKICRG 150
>gi|323713214|gb|ADY04361.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 113/144 (78%), Positives = 132/144 (91%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAG + +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGALMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713406|gb|ADY04457.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 113/144 (78%), Positives = 132/144 (91%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKI A+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIVASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713028|gb|ADY04268.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 113/144 (78%), Positives = 133/144 (92%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKP+WII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPHWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
Length = 328
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 150/365 (41%), Positives = 202/365 (55%), Gaps = 49/365 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCDH D GCNGG + I K GG+E DYPYTG D
Sbjct: 157 GDLLALSEQQLVDCDH---------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD 207
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV--SCPY 296
G C ++SK A V+ +V+ E A L + GPL+ +NAV +Q Y+GG+ P+
Sbjct: 208 -GICYMNQSKFVAYVNESTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIPF 266
Query: 297 ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVD 355
+C + L+H VL VGYG+ F PYWI+KNSWG +GE GY++I G CG++
Sbjct: 267 LCNPHGLNHAVLTVGYGTE-FG------IPYWIVKNSWGVGFGEKGYFRIFRGAGTCGIN 319
Query: 356 SMVSS 360
+VS+
Sbjct: 320 LVVST 324
>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
Length = 326
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 148/363 (40%), Positives = 198/363 (54%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A V+ +++ E A L GPL+ +NA +Q Y GG+ P C
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPKWC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++HGVL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHGVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|323713082|gb|ADY04295.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 113/144 (78%), Positives = 132/144 (91%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+ KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVSMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKICMGRNVCG 353
KNSWG+ WGE G+YKIC GRN+CG
Sbjct: 121 KNSWGDKWGEEGFYKICRGRNICG 144
>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 629
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 190/322 (59%), Gaps = 19/322 (5%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDL 104
L +H F F+ +F + Y E R R+F+ NL+ + + +A +G+T+F+D+
Sbjct: 316 LDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADM 375
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E++ + GL +R ++P + P +FDWR AVT VK+QG+CGSCW
Sbjct: 376 TSTEYKER-TGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCW 434
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 435 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 485
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGIN 281
GG+E E +YPY C F+++ VS F + +E M L+ HGP+++G+N
Sbjct: 486 IGGLEYEAEYPYEAKK-QQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLN 544
Query: 282 AVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y GGVS P+ +C K LDHGVLIVGYG S + P K PYWI+KNSWG WG
Sbjct: 545 ANAMQFYRGGVSHPWKALCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPRWG 603
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GYY++ G N CGV M +S
Sbjct: 604 EQGYYRVYRGDNTCGVSEMATS 625
>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
Length = 627
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 190/322 (59%), Gaps = 19/322 (5%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDL 104
L +H F F+ +F + Y E R R+F+ NL+ + + +A +G+T+F+D+
Sbjct: 314 LDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADM 373
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E++ + GL +R ++P + P +FDWR AVT VK+QG+CGSCW
Sbjct: 374 TSTEYKER-TGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCW 432
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 433 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 483
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGIN 281
GG+E E +YPY C F+++ VS F + +E M L+ HGP+++G+N
Sbjct: 484 IGGLEYEAEYPYEAKK-QQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLN 542
Query: 282 AVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y GGVS P+ +C K LDHGVLIVGYG S + P K PYWI+KNSWG WG
Sbjct: 543 ANAMQFYRGGVSHPWKALCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPRWG 601
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GYY++ G N CGV M +S
Sbjct: 602 EQGYYRVYRGDNTCGVSEMATS 623
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 192/322 (59%), Gaps = 31/322 (9%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ-LLD---PTAVHGVTK 100
H L+ + + FK + +K+Y E RF +F+ +LR+ + D T GVTK
Sbjct: 15 HALSDKEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTK 74
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
F+DLT EF LG++R + + P DLP+ FDWR+ GAVT VKDQG+CGS
Sbjct: 75 FADLTEKEFS-DMLGISRSTKSSRPRVIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCGS 133
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CWSFS TG +EGA+FL TG+LVSLSEQ LVDC E C GC+GG M+ A EYI
Sbjct: 134 CWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKE-------DC-YGCSGGYMDKALEYI 185
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVG 279
AGG+ E DYPY G D C+FD SK+AA +SNF+ I +DED + ++ GP++V
Sbjct: 186 ETAGGIMSENDYPYEGID-DKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPISVA 244
Query: 280 INAVW-MQTYIGGV---SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
I+A + Q Y G+ S Y L+HGVL+VGYG+ KE+ YWI+KNSWG
Sbjct: 245 IDASFNFQLYDSGILDDSSCYSDFNSLNHGVLVVGYGTE-------KEQDYWIVKNSWGA 297
Query: 336 NWGENGYYKICMGR---NVCGV 354
+WG +GY I M R N CG+
Sbjct: 298 DWGMDGY--IWMSRNKNNQCGI 317
>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 146/363 (40%), Positives = 198/363 (54%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A ++ +++ E A L GPL+ +NA +Q Y GG+ P +C
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPRLC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 144/318 (45%), Positives = 187/318 (58%), Gaps = 28/318 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFS 102
LN + + FK K +K+Y + E RFR+F+ NLR+ + + T GVTKF+
Sbjct: 17 LNDKEEWVQFKVKNNKSYKSYVEEQTRFRIFQENLRKIENHNEKYNNGESTFKFGVTKFT 76
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DLT EF L L++ R + P DLP+ FDWRD GAVT VKDQG CGSCW
Sbjct: 77 DLTEKEFL-DLLVLSKNARPNRTHATHLLAPLRDLPSAFDWRDKGAVTEVKDQGMCGSCW 135
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG++E AHFL TG LVSLSEQ LVDC + +C GC GG M+ A EYI K
Sbjct: 136 TFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKD-------TC-YGCGGGWMDKALEYIEK 187
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGIN 281
GG+ EKDYPY G D +C+FD SK+AA +SNF+ I +DE+ + + GP++V I+
Sbjct: 188 -GGIMSEKDYPYEGVD-DNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPISVAID 245
Query: 282 A-VWMQTYIGGVSCPYICGKYLD---HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
A Q Y+ G+ C D HGVL+VGYG+ K YWIIKNSWG NW
Sbjct: 246 ASATFQLYVSGILDDTECSNEFDSLNHGVLVVGYGTEN-------GKDYWIIKNSWGVNW 298
Query: 338 GENGYYKICMGR-NVCGV 354
G +GY ++ + N CG+
Sbjct: 299 GMDGYIRMSRNKNNQCGI 316
>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
castaneum]
Length = 1726
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 145/327 (44%), Positives = 195/327 (59%), Gaps = 22/327 (6%)
Query: 44 DHLL---NAEHHFSLFKS--KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
D+LL + E+H SLF K ++E+ YRF VF NL + + + TA +G
Sbjct: 1408 DNLLGCDDREYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYG 1467
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQG 156
+T+F+D+T EF R LGL LR + A +P +LP +FDWR VT VK+Q
Sbjct: 1468 ITRFADMTQKEFSRS-LGLRTDLRNENETPFAQAKIPNIELPKEFDWRKKNVVTEVKNQE 1526
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FS TG +EG + L G+L+ SEQ+LVDCD + D GCNGGLM++A
Sbjct: 1527 QCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD---------DQGCNGGLMDTA 1577
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
+ I K GG+E E+DYPY D C F+++ V+ IS +E MA LV +GP+
Sbjct: 1578 YRSIEKIGGLETEQDYPYDAED-EKCHFNRTLARVQVTGALNISHNETDMAKWLVANGPI 1636
Query: 277 AVGINAVWMQTYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
++ INA MQ Y+GGVS P ++C K LDHGVLIVGYG + P+ K PYWI+KNSW
Sbjct: 1637 SIAINANAMQFYMGGVSHPFKFLCSPKNLDHGVLIVGYGVHNY-PLFKKSLPYWIVKNSW 1695
Query: 334 GENWGENGYYKICMGRNVCGVDSMVSS 360
G WGE GYY++ G CG++ SS
Sbjct: 1696 GTGWGEQGYYRVYRGDGTCGLNQTPSS 1722
>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
Length = 1761
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 145/327 (44%), Positives = 195/327 (59%), Gaps = 22/327 (6%)
Query: 44 DHLL---NAEHHFSLFKS--KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
D+LL + E+H SLF K ++E+ YRF VF NL + + + TA +G
Sbjct: 1443 DNLLGCDDREYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYG 1502
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQG 156
+T+F+D+T EF R LGL LR + A +P +LP +FDWR VT VK+Q
Sbjct: 1503 ITRFADMTQKEFSRS-LGLRTDLRNENETPFAQAKIPNIELPKEFDWRKKNVVTEVKNQE 1561
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FS TG +EG + L G+L+ SEQ+LVDCD + D GCNGGLM++A
Sbjct: 1562 QCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD---------DQGCNGGLMDTA 1612
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
+ I K GG+E E+DYPY D C F+++ V+ IS +E MA LV +GP+
Sbjct: 1613 YRSIEKIGGLETEQDYPYDAED-EKCHFNRTLARVQVTGALNISHNETDMAKWLVANGPI 1671
Query: 277 AVGINAVWMQTYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
++ INA MQ Y+GGVS P ++C K LDHGVLIVGYG + P+ K PYWI+KNSW
Sbjct: 1672 SIAINANAMQFYMGGVSHPFKFLCSPKNLDHGVLIVGYGVHNY-PLFKKSLPYWIVKNSW 1730
Query: 334 GENWGENGYYKICMGRNVCGVDSMVSS 360
G WGE GYY++ G CG++ SS
Sbjct: 1731 GTGWGEQGYYRVYRGDGTCGLNQTPSS 1757
>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 477
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 190/322 (59%), Gaps = 19/322 (5%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
L +H F F+ +F + Y E R R+F+ NL+ + + +A +G+T+F+D+
Sbjct: 164 LDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADM 223
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E++ + GL +R ++P + P +FDWR AVT VK+QG+CGSCW
Sbjct: 224 TSTEYKER-TGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCW 282
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 283 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 333
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGIN 281
GG+E E +YPY C F+++ VS F + +E M L+ HGP+++G+N
Sbjct: 334 IGGLEYEAEYPYEAKK-QQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLN 392
Query: 282 AVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y GGVS P+ +C K LDHGVLIVGYG S + P K PYWI+KNSWG WG
Sbjct: 393 ANAMQFYRGGVSHPWKALCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPRWG 451
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GYY++ G N CGV M +S
Sbjct: 452 EQGYYRVYRGDNTCGVSEMATS 473
>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
Length = 451
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 187/319 (58%), Gaps = 34/319 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + ++K+YA E R +F NL A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 154 FKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRGSAEYGVTKFSDLTEEEFRT 213
Query: 112 QFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L L R P A + P P +DWRDHGAVTGVK+QGACGSCW+FS
Sbjct: 214 SYLNPLLSSLPGRALRPGPATRGPA------PASWDWRDHGAVTGVKNQGACGSCWAFSV 267
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG FL G L++LSEQ+LVDCD + D C GGL ++A+ I K GG+
Sbjct: 268 TGNVEGQWFLRRGALLALSEQELVDCD---------TLDQACGGGLPSNAYTAIEKLGGL 318
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E EKDY Y G C F K +++ +S DE+++A L ++GP+++ +NA MQ
Sbjct: 319 ETEKDYSYEGRK-ERCSFSPDKARVYINSSVDLSRDEEELATWLAENGPVSIALNAFAMQ 377
Query: 287 TYIGGVSCPY--ICGK-YLDHGVLIVGYG-SSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Y GVS P+ +C ++DH VL+VGYG SG P+W IKNSWG +WGE GY
Sbjct: 378 FYRRGVSHPFRPLCSPWFIDHAVLLVGYGHRSGI--------PFWAIKNSWGPDWGEEGY 429
Query: 343 YKICMGRNVCGVDSMVSSV 361
Y + G CGV++M SS
Sbjct: 430 YYLYRGARACGVNAMASSA 448
>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
Length = 326
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 147/363 (40%), Positives = 197/363 (54%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A V+ +++ E A L GPL+ +NA +Q Y GG+ P C
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPKWC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
Length = 325
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 133/317 (41%), Positives = 192/317 (60%), Gaps = 26/317 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK + K YA +++ RF +FK NL RA++ Q+ + TA +GVT+FSDLTP
Sbjct: 27 NARELYEQFKRDYGKAYANEDDQK-RFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF ++LGL R+ + + P DWR+ GAV +++QG+CGSCW+FS
Sbjct: 86 EEFEAKYLGL----RIDEQVDRVQLNDLQTAPASVDWREKGAVGPIENQGSCGSCWAFSV 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG FL TG LVSLS+QQLVDCD + D+GC GG ++ I + GG+
Sbjct: 142 VGNIEGQWFLKTGYLVSLSKQQLVDCD---------TVDNGCYGGYPPYTYKEIKRMGGL 192
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E + DYPYTG G C+ D+SK+ A + + V+ +DE++ AA L +HGP++ +NA ++Q
Sbjct: 193 ELQSDYPYTGW-GHGCRLDRSKLFAKIDDSIVLEADEEKQAAWLAEHGPMSTCLNAKYLQ 251
Query: 287 TYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+ P +C + L+H VL VGY + PYWIIKNSWG +WGE+GY+
Sbjct: 252 FYQSGILHPSKAMCSPEGLNHAVLTVGYDTK-------HGIPYWIIKNSWGTSWGEDGYF 304
Query: 344 KICMGRNVCGVDSMVSS 360
+I G CG+D + +S
Sbjct: 305 RIYRGDGTCGIDRLTTS 321
>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
Length = 459
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 190/319 (59%), Gaps = 36/319 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + +++TY T+EE +R +F N+ RA+ Q LD TA +GVTKFSDLT EFR
Sbjct: 162 FKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEIQALDRGTAQYGVTKFSDLTEEEFRT 221
Query: 112 QFL------GLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+L GL +++RL P +D P ++DWR+ GAVT VK+QG CGSCW+F
Sbjct: 222 FYLNPLLKEGLGKKMRLAK--------PVDDPAPPEWDWRNKGAVTKVKNQGMCGSCWAF 273
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG FL G+L+SLSEQ+LVDCD + D C GGL ++A+ I G
Sbjct: 274 SVTGNVEGQWFLKQGDLLSLSEQELVDCD---------TLDKACMGGLPSNAYSAIKTLG 324
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E E DY Y G +C F K+ +++ +S DE ++AA L K GP+++ INA
Sbjct: 325 GLETEDDYSYHG-HLQTCSFTAEKVKVYINDSVELSKDEQKLAAWLAKKGPISIAINAFG 383
Query: 285 MQTYIGGVSCP--YICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
MQ Y G+S P +C ++DH VL+VGYG+ + P+W IKNSWG +WGE G
Sbjct: 384 MQFYRRGISRPLRLLCSPWFIDHAVLLVGYGNRS-------DVPFWAIKNSWGTDWGEEG 436
Query: 342 YYKICMGRNVCGVDSMVSS 360
YY + G CGV+ M SS
Sbjct: 437 YYYLHRGSRACGVNVMASS 455
>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 147/363 (40%), Positives = 196/363 (53%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFETRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A V+ +++ E A L GPL+ +NA +Q Y GG+ P C
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPKWC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 146/363 (40%), Positives = 197/363 (54%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A ++ +++ E A L GPL+ +NA +Q Y GG+ P +C
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPRLC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
Length = 283
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 140/295 (47%), Positives = 183/295 (62%), Gaps = 24/295 (8%)
Query: 73 RFRVFKANLRRAKR---RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
RF++F+ N+++ +L D A +GVT+FSDL EFRR +L L D +A
Sbjct: 2 RFKIFRENMKKINTLNDNELGD--AEYGVTQFSDLAEEEFRRYYLTPKWDLSHRPDLVRA 59
Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
I P D P FDWRDH AVT VK+QG CGSCW+FS T +EG + +LVSLSEQ+L
Sbjct: 60 KI-PDVDPPASFDWRDHNAVTPVKNQGMCGSCWAFSTTENIEGQWAIHRNKLVSLSEQEL 118
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCD D GC GGL +A+E I++ GG+E EK YPY D CKF +
Sbjct: 119 VDCD---------KLDDGCEGGLPVNAYEEIIRLGGLESEKKYPYDAED-EKCKFTVGDV 168
Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP--YICGK-YLDHGV 306
A +++ ISS+E MAA L K+GP+++GINA MQ Y+GGVS P ++C LDHGV
Sbjct: 169 AVYINSSVNISSNEADMAAWLYKNGPISIGINAFAMQFYMGGVSHPFSFLCSPDELDHGV 228
Query: 307 LIVGYGS-SGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
LIVGYG+ G+ F + PYWI+KNSWG +WG GYY + G VCG++ M +S
Sbjct: 229 LIVGYGTKKGW----FSDSPYWIVKNSWGASWGVQGYYLVYRGDGVCGLNKMPTS 279
>gi|56755191|gb|AAW25775.1| SJCHGC00511 protein [Schistosoma japonicum]
Length = 454
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 197/320 (61%), Gaps = 28/320 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N ++ FK + K Y + +++ RF +FK+NL +A+ Q+L+ +AV+GVT +SDLT
Sbjct: 152 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 210
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L R A +++ I P D+P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 211 DEFSRTHLTAPWR----ASSKRNTISPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWA 266
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD S D GCNGGL ++A+E I++
Sbjct: 267 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRM 317
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+ E +YPY + C + +AA +++ ++ DE ++A L H ++VG+NA+
Sbjct: 318 GGLMLEDNYPYDAKN-EKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNAL 376
Query: 284 WMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+Q Y G+S P+ C KY LDH VL+VGYG S K +P+WI+KNSWG WGE
Sbjct: 377 LLQFYRHGISHPWWIFCSKYLLDHAVLLVGYGVSE------KNEPFWIVKNSWGVEWGEK 430
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY+++ G CG+++ +S
Sbjct: 431 GYFRMYRGDGTCGINTDATS 450
>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
Length = 458
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 141/325 (43%), Positives = 193/325 (59%), Gaps = 28/325 (8%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
ED ++ F F +++TY T+EE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 151 EDFVMQVASIFKEFVITYNRTYETKEEAQWRMSVFINNMMRAQKIQALDRGTARYGVTKF 210
Query: 102 SDLTPSEFRRQFLG-LNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGAC 158
SDLT EFR +L L + LR +++ P+ + P ++DWR+ GAVT VKDQG C
Sbjct: 211 SDLTEEEFRTIYLNPLLKELR----SKRMPLAMSVSGPAPPEWDWRNKGAVTKVKDQGMC 266
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+FS TG +EG FL G+L+SLSEQ+LVDCD D C GGL ++A+
Sbjct: 267 GSCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCDK---------LDKACLGGLPSNAYS 317
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
I GG+E E DY Y G +C F K +++ +S +E ++AA L K+GP+++
Sbjct: 318 AIKTLGGLETEDDYGYNG-HLQTCNFSAEKAKVYINDSVELSQNEQKLAAWLAKNGPISI 376
Query: 279 GINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
INA MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG
Sbjct: 377 AINAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-------DIPFWAIKNSWGT 429
Query: 336 NWGENGYYKICMGRNVCGVDSMVSS 360
+WGE GYY + G CGV+ M SS
Sbjct: 430 DWGEEGYYYLHRGSGACGVNIMASS 454
>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
Length = 473
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 185/313 (59%), Gaps = 21/313 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F K+ K Y++QEE + R ++F+ NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 174 QFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEYGVTKFSDLTEEEFR 233
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L K P +DWRDHGAV+ VK+QG CGSCW+FS TG +
Sbjct: 234 STYLNPLLSQWTLHRGMKPAPPAKTPAPDSWDWRDHGAVSPVKNQGMCGSCWAFSVTGNI 293
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL G L+SLSEQ+LVDCD D C GGL ++A+E I K GG+E E
Sbjct: 294 EGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAYEAIEKLGGLESET 344
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY YTG C F K+AA +++ + DE ++AA L ++GP++V +NA MQ Y
Sbjct: 345 DYSYTGHK-QKCDFTNRKVAAYINSSVELPKDEREIAAWLAENGPISVALNAFAMQFYKK 403
Query: 291 GVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GVS P+ C ++ DH VL+VGYG P+W IKNSWGE++GE GYY +
Sbjct: 404 GVSHPWKIFCNPWMIDHAVLLVGYGERNGI-------PFWAIKNSWGEDYGEQGYYYLQR 456
Query: 348 GRNVCGVDSMVSS 360
G N CG++ M SS
Sbjct: 457 GSNACGINRMGSS 469
>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
Length = 476
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 196/318 (61%), Gaps = 33/318 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +K++K Y++QEE D R ++FK NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 178 FKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSDLTEEEFRL 237
Query: 112 QFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
+L RR PA ++P P +DWRDHGAV+ VK+QG CGSCW+FS
Sbjct: 238 TYLNPLLSQWTLRRPMKPASPARSPA------PASWDWRDHGAVSPVKNQGLCGSCWAFS 291
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG +EG FL G+L+SLSEQ+LVDCD D C GGL ++A+E I GG
Sbjct: 292 VTGNIEGQWFLKHGKLLSLSEQELVDCD---------GLDHACRGGLPSNAYEAIEGLGG 342
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWM 285
+E E DY Y+G C F K+AA +++ + SDE++MAA L ++GP++V +NA M
Sbjct: 343 LEAENDYTYSGHK-QKCSFATEKVAAYINSSVELPSDENEMAAWLAENGPVSVALNAFAM 401
Query: 286 QTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GVS P+ +C ++ DH VL+VGYG P+W IKNSWGE++GE GY
Sbjct: 402 QFYKKGVSHPWMILCNPWMIDHAVLLVGYGERNGI-------PFWAIKNSWGEDYGEEGY 454
Query: 343 YKICMGRNVCGVDSMVSS 360
Y + G N CG++ M SS
Sbjct: 455 YYLYKGSNACGINKMGSS 472
>gi|256077197|ref|XP_002574894.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230780|emb|CCD77197.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 419
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 194/320 (60%), Gaps = 26/320 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N + + FK K+ K Y E+ + RF +FK+N+ +A+ Q+ + +A++GVT +SDLT
Sbjct: 115 NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTT 173
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L +P+ P N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 174 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 231
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD D GCNGGL ++A+E I+K
Sbjct: 232 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 282
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+ E +YPY + C +A +++ ++ DE ++AA L + ++VG+NA+
Sbjct: 283 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 341
Query: 284 WMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+Q Y G+S P+ C KY LDH VL+VGYG S K +P+WI+KNSWG WGEN
Sbjct: 342 LLQFYQHGISHPWWIFCSKYLLDHAVLLVGYGVSE------KNEPFWIVKNSWGVEWGEN 395
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY+++ G CG++++ +S
Sbjct: 396 GYFRMYRGDGTCGINTVATS 415
>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
Length = 326
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 146/363 (40%), Positives = 196/363 (53%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + F K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFTLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A V+ +++ E A L GPL+ +NA +Q Y GG+ P C
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPKWC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEKGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
Length = 366
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 134/317 (42%), Positives = 190/317 (59%), Gaps = 21/317 (6%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEF 109
+F F +F+K Y T++ ++ +FK+N+ AKR Q + TA++G T F+D+TP EF
Sbjct: 64 ENFKQFMVEFNKWYETEKLTAEKYNIFKSNMVIAKRLQEEEQGTAIYGPTIFADMTPEEF 123
Query: 110 RRQFLGLN-RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R+ L N ++ P ++ +P +++ DWR AVT VKDQG CGSCW+F
Sbjct: 124 RKTHLNFNPNNVKKP---KRMANIPKSNISERMDWRKFNAVTSVKDQGNCGSCWAFCTVA 180
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+EGA + T +L+SLSEQQLVDCD D GC GGL +A+ I++ GG+E+
Sbjct: 181 NIEGAWAVKTAQLISLSEQQLVDCDR---------LDDGCEGGLPVNAYLEIIRLGGLEK 231
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTY 288
E+DY YT G CKF+ +K A +++ V+ DED +A + ++GP+AVG+NA M Y
Sbjct: 232 EEDYKYTAR-SGKCKFNHTKSAVYINDTVVLPEDEDAIARYVSENGPVAVGLNADAMMFY 290
Query: 289 IGGVSCP--YICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
G++ P +C ++HGV IVGY F PYWIIKNSWG NWGE GYY +
Sbjct: 291 RSGIAHPSRLMCSPDGINHGVTIVGY---DVKESLFWSTPYWIIKNSWGPNWGEKGYYYL 347
Query: 346 CMGRNVCGVDSMVSSVA 362
G+ VCG+D M SSV
Sbjct: 348 YRGKGVCGIDQMASSVV 364
>gi|256077193|ref|XP_002574892.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230781|emb|CCD77198.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 457
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 194/320 (60%), Gaps = 26/320 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N + + FK K+ K Y E+ + RF +FK+N+ +A+ Q+ + +A++GVT +SDLT
Sbjct: 153 NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTT 211
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L +P+ P N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 212 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 269
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD D GCNGGL ++A+E I+K
Sbjct: 270 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 320
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+ E +YPY + C +A +++ ++ DE ++AA L + ++VG+NA+
Sbjct: 321 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 379
Query: 284 WMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+Q Y G+S P+ C KY LDH VL+VGYG S K +P+WI+KNSWG WGEN
Sbjct: 380 LLQFYQHGISHPWWIFCSKYLLDHAVLLVGYGVSE------KNEPFWIVKNSWGVEWGEN 433
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY+++ G CG++++ +S
Sbjct: 434 GYFRMYRGDGTCGINTVATS 453
>gi|21218381|gb|AAM44058.1|AF510740_1 cathepsin L1 [Schistosoma japonicum]
Length = 317
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 196/320 (61%), Gaps = 28/320 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N ++ FK + K Y + +++ RF +FK+NL +A+ Q+L+ +AV+GVT +SDLT
Sbjct: 15 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 73
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L R A +++ I P D+P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 74 DEFSRTHLTAPWR----ASSKRNTIPPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWA 129
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD S D GCNGGL ++A+E I++
Sbjct: 130 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRM 180
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+ E +YPY + C +AA +++ ++ DE ++A L H ++VG+NA+
Sbjct: 181 GGLMLEDNYPYDAKN-EKCHLKVGNVAAYINSSVNLTQDESELAIWLYHHSAISVGMNAL 239
Query: 284 WMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+Q Y G+S P+ C KY LDH VL+VGYG S K +P+WI+KNSWG WGE
Sbjct: 240 LLQFYRHGISHPWWIFCSKYLLDHAVLLVGYGVSE------KNEPFWIVKNSWGVEWGEK 293
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY+++ G CG+++ +S
Sbjct: 294 GYFRMYRGDGTCGINTGATS 313
>gi|321460289|gb|EFX71333.1| hypothetical protein DAPPUDRAFT_189155 [Daphnia pulex]
Length = 266
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 129/273 (47%), Positives = 173/273 (63%), Gaps = 14/273 (5%)
Query: 93 TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
TAV+G T FSD + +E++ G N LR + +P DLP +FDWR+H VT V
Sbjct: 3 TAVYGDTPFSDWSAAEYKAHLAGFNPSLRQSNARLRQAAIPEIDLPDEFDWRNHSVVTPV 62
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
KDQG+CGSCW+FS TG +EG + + G+L+SLSEQ+LVDCD DSGCNGGL
Sbjct: 63 KDQGSCGSCWAFSVTGNVEGIYAVRNGDLLSLSEQELVDCD---------KLDSGCNGGL 113
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
+A++ I GG+E E DYPY G + CKF+ + V+ IS++E +MA L++
Sbjct: 114 PENAYKAIHDIGGLETESDYPYNGHE-NKCKFNSNITRVQVTGGVEISTNETEMAQWLIQ 172
Query: 273 HGPLAVGINAVWMQTYIGGVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
+GP+++GINA MQ Y GGVS P+ +C +DHGVLIVGYG S + P K PYWI+
Sbjct: 173 NGPISIGINANAMQYYRGGVSHPWKVLCRPGGIDHGVLIVGYGVSQY-PKFNKTLPYWIV 231
Query: 330 KNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
KNSWG WGE GYY++ G CG++ M +S
Sbjct: 232 KNSWGTRWGEQGYYRVFRGDGTCGLNQMCTSAT 264
>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
[Strongylocentrotus purpuratus]
Length = 453
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 183/310 (59%), Gaps = 28/310 (9%)
Query: 53 FSLFKSKFSKTYATQE---EHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSE 108
F F F + Y + E++YR+ VF N+ + Q TA +G TKF+D+T +E
Sbjct: 156 FDKFLMTFKREYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQGTAKYGPTKFADMTEAE 215
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
FR+ G ++ + +K +P +P ++DWR HGAVT VK+QG CGSCW+FSA G
Sbjct: 216 FRKLQSGPLKKTGI----KKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIG 271
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+EG + GEL+SLSEQ+LVDCD D GC GG M+ A+E I+K GG
Sbjct: 272 NMEGQWQIKKGELISLSEQELVDCD---------KVDGGCEGGEMSDAYEAIIKLGGAMS 322
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTY 288
E+ YPY G + CKF+ + + ++ + IS +E +MA L HGP+++GINA+ MQ Y
Sbjct: 323 EEKYPYRG-ENEKCKFNMTDVRVKINGYVNISKNETEMAGWLAAHGPISIGINALMMQFY 381
Query: 289 IGGVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
GG++ P+ C LDHGVLIVGY +PYWI+KNSWG++WGE GYY +
Sbjct: 382 FGGIAHPWKIFCSPDSLDHGVLIVGYSVK-------DGEPYWIVKNSWGKDWGEEGYYLV 434
Query: 346 CMGRNVCGVD 355
G CG++
Sbjct: 435 YRGDGTCGLN 444
>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 456
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 193/320 (60%), Gaps = 27/320 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N + + FK K+ K Y E + RF +FK+N+ +A+ Q+ + +A++GVT +SDLT
Sbjct: 153 NVDEKYVQFKLKYRKQY--HETDEIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTT 210
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L +P+ P N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 211 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 268
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD D GCNGGL ++A+E I+K
Sbjct: 269 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 319
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+ E +YPY + C +A +++ ++ DE ++AA L + ++VG+NA+
Sbjct: 320 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 378
Query: 284 WMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+Q Y G+S P+ C KY LDH VL+VGYG S K +P+WI+KNSWG WGEN
Sbjct: 379 LLQFYQHGISHPWWIFCSKYLLDHAVLLVGYGVSE------KNEPFWIVKNSWGVEWGEN 432
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY+++ G CG++++ +S
Sbjct: 433 GYFRMYRGDGTCGINTVATS 452
>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
Length = 322
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/322 (43%), Positives = 190/322 (59%), Gaps = 35/322 (10%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 22 SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTP 80
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L R + D Q + PT P DWR+ GAVT V++QG+CGSCW+F
Sbjct: 81 EEFAAKYL----RAAVNND-QVERVRPTGLKAAPERMDWREKGAVTAVENQGSCGSCWAF 135
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SA G +EG F+ TG+LVSLS+QQLVDCD + GCNGG S++ I G
Sbjct: 136 SAAGNVEGQWFIKTGQLVSLSKQQLVDCDRVAE---------GCNGGWPVSSYLEIKHMG 186
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E E DYPY G + +C +K K+ A + + V+ + E++ AA L +HGPL+ +NAV
Sbjct: 187 GLESESDYPYVGAE-QTCALNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAVA 245
Query: 285 MQTYIGGV------SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
+Q Y GV CP L+H VL VGY G + PYWIIKNSWG +WG
Sbjct: 246 LQHYQSGVLNPTYEECP---DTELNHAVLTVGYDKEG-------DMPYWIIKNSWGTDWG 295
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GY+++ G CG++ M +S
Sbjct: 296 EKGYFRLFRGDYTCGINRMATS 317
>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 146/363 (40%), Positives = 196/363 (53%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRET 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LS QQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSGQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A V+ +++ E A L GPL+ +NA +Q Y GG+ P C
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPKWC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 148/365 (40%), Positives = 202/365 (55%), Gaps = 49/365 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCDH + GCNGG + I K GG+E DYPYTG D
Sbjct: 157 GDLLALSEQQLVDCDH---------LEKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD 207
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV--SCPY 296
G C ++SK A V++ +V+ E A L + GPL+ +NAV +Q Y+GG+ P+
Sbjct: 208 -GICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIPF 266
Query: 297 ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVD 355
+C + L+H VL VGYG+ F PYWI+KNS G +GE GY++I G CG++
Sbjct: 267 LCNPHGLNHAVLTVGYGTE-FG------IPYWIVKNSLGVGFGEKGYFRIFRGAGTCGIN 319
Query: 356 SMVSS 360
+VS+
Sbjct: 320 LVVST 324
>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
Length = 325
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 189/324 (58%), Gaps = 31/324 (9%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK + K YA ++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 27 NARELYEQFKRDYGKVYANDDDQK-RFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L P + Q + PT P DWR+ GAV V++QG+CGSCW+F
Sbjct: 86 EEFAAKYLSR------PMNDQVERVRPTGLKAAPERMDWREWGAVGPVENQGSCGSCWAF 139
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S G +EG FL TG+LVSLS+QQLVDCD D GC GG +A+ I++ G
Sbjct: 140 SVAGNVEGQWFLKTGQLVSLSKQQLVDCD---------VMDYGCGGGWPTNAYMEIMRMG 190
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E + DYPY G C +K K+ A + + V+ + E++ AA L +HGPL+ +NA +
Sbjct: 191 GLELQSDYPYVGVQ-QQCYLNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSSALNAGY 249
Query: 285 MQTYIGGVSCPYI--CGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+Q Y G+S P C L+H VL VGY + PYWIIKNSWG WGENG
Sbjct: 250 LQFYQSGISHPSYEECSPASLNHAVLTVGYDTENGV-------PYWIIKNSWGTGWGENG 302
Query: 342 YYKICMGRNVCGVDSMVSSVAAIH 365
Y+++ G CG++ M++S A IH
Sbjct: 303 YFRLYRGDGTCGINRMITS-AIIH 325
>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
Length = 326
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 184/321 (57%), Gaps = 27/321 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK K+ KTY+ ++ + RFR+FK NL RAKR Q ++ TA +GVT+FSDLT
Sbjct: 27 DARALYEEFKLKYKKTYSNDDD-ELRFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTS 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWS 163
EF+ ++L R+R P D+ D FDWRDHGAV V DQG CGSCW+
Sbjct: 86 EEFKTRYL----RMRFDEPIVNEDPTPQEDVTMDNSNFDWRDHGAVGPVLDQGDCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS G +EG F TG+L+ LSEQQL+DCDH D GC+GG + I +
Sbjct: 142 FSVIGNVEGQWFRKTGDLLGLSEQQLIDCDHS---------DQGCDGGYPPQTYSAIEEM 192
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+E DYPYTG D G C D+SK A V+ + + E A +L + GPL+ G+NAV
Sbjct: 193 GGLELRSDYPYTGKD-GICYMDQSKFVAYVNGSTRLPWCEKTQAKSLKEIGPLSSGLNAV 251
Query: 284 WMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Q Y G+ P C L+H VL VGYG PYWI+KNSWG+ +GE GY
Sbjct: 252 LLQLYKRGIMRPRWCNPAELNHAVLTVGYGME-------HRMPYWIVKNSWGKRFGEKGY 304
Query: 343 YKICMGRNVCGVDSMVSSVAA 363
++I G CG++ V++
Sbjct: 305 FRIYRGDGTCGINRAVTTAVV 325
>gi|226468424|emb|CAX69889.1| Temporarily Assigned Gene name [Schistosoma japonicum]
Length = 454
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 196/320 (61%), Gaps = 28/320 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N ++ FK + K Y + +++ RF +FK+NL +A+ Q+L+ +AV+GVT +SDLT
Sbjct: 152 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 210
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L R A +++ I P D+P +FDWR GAVT VK+QG CGSCW+
Sbjct: 211 DEFSRTHLTAPWR----ASSKRNTISPRREVGDIPNNFDWRKKGAVTEVKNQGMCGSCWA 266
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD + D GCNGGL ++A+E I++
Sbjct: 267 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------NLDDGCNGGLPSNAYESIIRM 317
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+ E +YPY + C + +AA +++ ++ DE ++A L H ++VG+NA+
Sbjct: 318 GGLMLEDNYPYDAKN-EKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNAL 376
Query: 284 WMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+Q Y G+S P+ C KY LDH VL+VGYG S K +P+WI+KNSWG WGE
Sbjct: 377 LLQFYRHGISHPWWIFCSKYLLDHAVLLVGYGVSE------KNEPFWIVKNSWGVEWGEK 430
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY+++ G CG+++ +S
Sbjct: 431 GYFRMYRGDGTCGINTDATS 450
>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
Length = 410
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 191/331 (57%), Gaps = 34/331 (10%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D L F F + +++TY T+EE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 103 QDFYLRMASLFKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQALDRGTAQYGVTKF 162
Query: 102 SDLTPSEFRRQFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
SDLT EFR +L L +++RL + P ++DWR GAVT VK+Q
Sbjct: 163 SDLTEEEFRTMYLNPLLKEELGKKMRLVK-------FVGDPAPPEWDWRKKGAVTKVKNQ 215
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FS TG +EG FL G+L+SLSEQ+LVDCD D C GGL ++
Sbjct: 216 GMCGSCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCD---------KVDKACMGGLPSN 266
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
A+ I GG+E E DY Y+G +C F K +++ +S +E ++AA L K+GP
Sbjct: 267 AYSAIKTLGGLETEDDYSYSG-HLQTCSFSAQKAKVYINDSVELSHNEQELAAWLAKNGP 325
Query: 276 LAVGINAVWMQTYIGGVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNS 332
+++ INA MQ Y G+S P +C + ++DH VL+VGYG+ + P+W IKNS
Sbjct: 326 ISIAINAFGMQFYRHGISRPLRPLCSRWFIDHAVLLVGYGNRS-------DVPFWAIKNS 378
Query: 333 WGENWGENGYYKICMGRNVCGVDSMVSSVAA 363
WG +WGE GYY + G CGV+ M SS
Sbjct: 379 WGTDWGEEGYYYLHRGSGACGVNVMASSAVV 409
>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 145/363 (39%), Positives = 196/363 (53%), Gaps = 47/363 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQ LVDCD+ D GC+GG I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQPLVDCDY---------LDGGCDGGYPPQTNTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYIC 298
GG C DKSK A ++ +++ E A L GPL+ +NA +Q Y GG+ P +C
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPRLC 266
Query: 299 GKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VL VGYG KPYWI+KNSWGE++GE GY++I G CG++S+
Sbjct: 267 DPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSI 319
Query: 358 VSS 360
V++
Sbjct: 320 VTT 322
>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
Length = 337
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 132/354 (37%), Positives = 201/354 (56%), Gaps = 39/354 (11%)
Query: 30 MIRQVVPSDGEQSEDHLL----NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
MI ++ Q E HL +A+H+F F ++K YA + +YRF++F NL
Sbjct: 5 MIFTILLVASSQIEGHLKFDIHDAQHYFETFIVNYNKQYADTKTKNYRFKIFVQNLEYIN 64
Query: 86 RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK------------APILP 133
+ L+ +A++ + KFSDL+ +E ++ GL R P++ K AP
Sbjct: 65 EKNKLNDSAIYNINKFSDLSKNELLTKYTGLTSRK--PSNMVKSTSNFCNVIHLDAPPDA 122
Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
++LP +FDWR + +T VKDQGACGSCW+ +A G LE + + L++LSEQQL+DCD
Sbjct: 123 RDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD 182
Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
S + C+GGLM++AFE ++ AGG+ E DYPY GT G CK D K A +V
Sbjct: 183 ---------SANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTK-GICKIDNKKFALSV 232
Query: 254 SNFS-VISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGY 311
S+ I +E+ + L+ GP+A+ I+A + TY G+ + C L+H VL+VGY
Sbjct: 233 SSCKRYIFQNEENLKKELITTGPIAMAIDAASISTYSKGI--IHFCENLGLNHAVLLVGY 290
Query: 312 GSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIH 365
G+ G YW +KNSWG +WGE+GY+++ N CG+++ +++ A IH
Sbjct: 291 GTEGGV-------SYWTLKNSWGSDWGEDGYFRVKRNINACGLNNQLAASATIH 337
>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
gi|1094710|prf||2106314A cathepsin L
Length = 319
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 194/320 (60%), Gaps = 26/320 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTP 106
N + + FK K+ K Y + E + RF +FK+N+ +A+ Q+ + +A++GVT +SDLT
Sbjct: 15 NVDEKYVQFKLKYRKQYH-ETEDEIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTT 73
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L +P+ P N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 74 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 131
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD D GCNGGL ++A+E I+K
Sbjct: 132 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 182
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG+ E +YPY + C +A +++ ++ DE ++AA L + ++VG+NA+
Sbjct: 183 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 241
Query: 284 WMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+Q Y G+S P+ C KY LDH VL+VGYG S K +P+WI+KNSWG WGEN
Sbjct: 242 LLQFYQHGISHPWWIFCSKYLLDHAVLLVGYGVSE------KNEPFWIVKNSWGVEWGEN 295
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY+++ G CG++++ +S
Sbjct: 296 GYFRMYRGDGSCGINTVATS 315
>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 596
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 131/296 (44%), Positives = 182/296 (61%), Gaps = 22/296 (7%)
Query: 53 FSLFKSKFSKTYATQ-EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFR 110
F +F K+ +TY++ +E++ RF +FK N + + ++ TAV+G+TKF D++ E+
Sbjct: 169 FDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEYH 228
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
R R +P + L T ++P DWR HGAVT VK+QG+CGSCW+FS TG +
Sbjct: 229 RTLAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTTGNV 288
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL +L+SLSEQ+LVDCD + DSGC GGL ++A++ I K GG+E EK
Sbjct: 289 EGQWFLKHKKLISLSEQELVDCD---------TLDSGCGGGLPSNAYKSIEKLGGLEPEK 339
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DYPY G +G C +S V+N + DE ++AA L ++GP+++GINA MQ Y G
Sbjct: 340 DYPYVG-EGEKCAIKQSDFKVFVNNSVALPKDEVKLAAWLAQNGPISIGINANLMQFYWG 398
Query: 291 GVSCPY--ICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
G+S P+ C K LDHGVLIVGYG+ P+WIIKNSWG +WGE Y
Sbjct: 399 GISHPWKIFCNPKSLDHGVLIVGYGTE-------NGTPFWIIKNSWGPDWGEEEEY 447
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 70/115 (60%), Gaps = 9/115 (7%)
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
E+ R R +P + L T ++P DWR HGAVT VK+QG+CGSCW+FS T
Sbjct: 446 EYHRTLAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTT 505
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
G +EG FL +L+SLSEQ+LVDCD + DSGC GGL ++A++ I K
Sbjct: 506 GNVEGQWFLKHKKLISLSEQELVDCD---------TLDSGCGGGLPSNAYKSIEK 551
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 28/36 (77%)
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
P+WIIKNSWG +WGE GYY+I G CG+++M +S
Sbjct: 557 PFWIIKNSWGPDWGEEGYYRIYRGDGSCGLNNMATS 592
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 132/319 (41%), Positives = 185/319 (57%), Gaps = 31/319 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F KF + Y++ EE RFR++ N+ AK+ Q + TA++G TKFSD+T EF++
Sbjct: 159 FMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQK 218
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
L R+ ++ + L +LP+ FDWR G VT VKDQG+CGSCW+FS T
Sbjct: 219 IMLPSIWWDRVESNGITFNLNDFNLSIYNLPSKFDWRTEGVVTPVKDQGSCGSCWAFSVT 278
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G +E + TG+L+SLSEQ+L+DCD D GCNGGL +AF I + GG+E
Sbjct: 279 GNIESLWAIKTGKLISLSEQELIDCD---------VIDKGCNGGLPINAFREIKRMGGLE 329
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQT 287
E YPY + G+C +++IA ++ + I +E M A + + GPL+VGI+A +
Sbjct: 330 PEDQYPYEAKN-GTCHLVRAQIAVSIDDAVEIPRNETVMKAWIAQRGPLSVGIDAELLSY 388
Query: 288 YIGGV------SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Y G+ CP ++HGVLI GYG PYW IKNSWGE WGENG
Sbjct: 389 YKSGILHPSKSRCP---PSKINHGVLITGYGIEN-------NLPYWTIKNSWGEQWGENG 438
Query: 342 YYKICMGRNVCGVDSMVSS 360
Y+++ G+N+CGV +VSS
Sbjct: 439 YFQLMRGKNICGVSDLVSS 457
>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
Length = 322
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 136/322 (42%), Positives = 185/322 (57%), Gaps = 35/322 (10%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 22 SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTP 80
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L + Q + PT P DWR GAVT V++QG+CGSCW+F
Sbjct: 81 EEFAAKYLSAPVN-----NDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 135
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S G +EG F+ TG+LVSLS+QQLVDCD GCNGG S++ I+ G
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQQLVDCDRAA---------QGCNGGWPASSYLEIMYMG 186
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E E DYPY G + +C +K K+ A + + V+ +E+ AA L +HGPL+ +NAV
Sbjct: 187 GLESESDYPYVGVE-QTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVA 245
Query: 285 MQTYIGGV------SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
+Q Y GV CP L+H VL VGY G + PYWIIKNSWG +WG
Sbjct: 246 LQYYQSGVLKPTFEECP---DTELNHAVLTVGYDKEG-------DMPYWIIKNSWGTDWG 295
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GY+++ G CG++ M +S
Sbjct: 296 EKGYFRLFRGDCTCGINRMATS 317
>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
Length = 327
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 185/322 (57%), Gaps = 35/322 (10%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 27 SARELYEQFKRGYGKVYANEDDQK-RFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L D Q + PT P DWR GAVT V++QG+CGSCW+F
Sbjct: 86 EEFAAKYLSAPVN-----DDQVKRMRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 140
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S G +EG F+ TG+LVSLS+QQLVDCD GCNGG S++ I+ G
Sbjct: 141 STAGNVEGQWFIKTGQLVSLSKQQLVDCDRAA---------QGCNGGWPASSYLEIMYMG 191
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E E DYPY G + +C +K K+ A + + V+ +E+ AA L +HGPL+ +NAV
Sbjct: 192 GLESESDYPYVGVE-QTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVA 250
Query: 285 MQTYIGGV------SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
+Q Y GV CP L+H VL VGY G + PYWIIKNSWG +WG
Sbjct: 251 LQHYQSGVLKPTFDECP---DTELNHAVLTVGYDKEG-------DMPYWIIKNSWGTDWG 300
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GY+++ G CG++ M +S
Sbjct: 301 EKGYFRLFRGDCTCGINRMATS 322
>gi|18419649|gb|AAL69389.1|AF462226_1 putative cysteine proteinase [Narcissus pseudonarcissus]
Length = 136
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 112/132 (84%), Positives = 123/132 (93%)
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP 295
G DG CK DKSKIAA+VSNFSV+S DE+Q+AANLV+HGPLA+GINA +MQTYIGGVSCP
Sbjct: 2 GMDGAVCKLDKSKIAASVSNFSVVSIDEEQIAANLVQHGPLAIGINAAFMQTYIGGVSCP 61
Query: 296 YICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVD 355
YICGK+LDHGVL+VGYGSSG+APIRFKEKPYWIIKNSWGENWGE GYYKIC GRNVCGVD
Sbjct: 62 YICGKHLDHGVLLVGYGSSGWAPIRFKEKPYWIIKNSWGENWGEKGYYKICKGRNVCGVD 121
Query: 356 SMVSSVAAIHTT 367
SMVS+V AIHTT
Sbjct: 122 SMVSTVTAIHTT 133
>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
Length = 496
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 190/314 (60%), Gaps = 21/314 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F F K Y +++E R+ +FK N++ + Q + TAV+GVT F+DLTP EFR
Sbjct: 195 QFKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQGTAVYGVTFFADLTPEEFR 254
Query: 111 RQFLGLN-RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+ +L +R +LP Q+ +P + +DWR+H AVT VK+QG CGSCW+F+
Sbjct: 255 KFYLSPQWKRDQLP---QRKASIPKGKIEDRWDWREHNAVTEVKNQGMCGSCWAFATIAN 311
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + GELVSLSEQ+LVDCD + D GC+GG ++A++ I++ GG+ E
Sbjct: 312 VEGVWAVKKGELVSLSEQELVDCD---------TLDQGCSGGYPSNAYKEIIRLGGLTTE 362
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
+Y Y G + G+C+F +++ + DE ++AA + ++GP+AVGINA M Y
Sbjct: 363 TNYSYDG-NQGTCRFKTQNAKVYINDSVSLPEDETEIAAYIRENGPVAVGINAFAMMFYR 421
Query: 290 GGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
G++ P ++C LDHGV IVGY + K KPYWIIKNSWG +WGE GYY +
Sbjct: 422 HGIAHPWRFLCSPDALDHGVAIVGYDVEKQSK---KPKPYWIIKNSWGTHWGEGGYYMLY 478
Query: 347 MGRNVCGVDSMVSS 360
G VCGV+ MV+S
Sbjct: 479 RGAGVCGVNKMVTS 492
>gi|323713472|gb|ADY04490.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713476|gb|ADY04492.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713480|gb|ADY04494.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713482|gb|ADY04495.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713484|gb|ADY04496.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713486|gb|ADY04497.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713488|gb|ADY04498.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713490|gb|ADY04499.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713492|gb|ADY04500.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 138
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 109/137 (79%), Positives = 127/137 (92%)
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMNSAFEY LKAGG+ +E+DYPYTGTD GSCKF+KSKIAA+V+NFSV+S DEDQ+AAN
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIAAN 60
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
LVK+GPLA+ INAV+MQTY+GGVSCPYIC K LDHGVL+VGYGSSG++P+R KEKPYWII
Sbjct: 61 LVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWII 120
Query: 330 KNSWGENWGENGYYKIC 346
KNSWG+ WGE G+YKIC
Sbjct: 121 KNSWGDKWGEEGFYKIC 137
>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 441
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 137/318 (43%), Positives = 179/318 (56%), Gaps = 26/318 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DWR GAVT VKDQG+CGSCWSFSA G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD + D+GC GGLM++AFE+I+K +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
EK YPY G + CK K+ A ++ I DED +A L +GP+AV ++A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Y GGV SC + L+HGVL+VGY S + PYWIIKNSW +WGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 343 YKICMGRNVCGVDSMVSS 360
+I G N C V + SS
Sbjct: 312 IRIEKGTNQCLVAQLASS 329
>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
Length = 337
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 130/354 (36%), Positives = 201/354 (56%), Gaps = 39/354 (11%)
Query: 30 MIRQVVPSDGEQSEDHLL----NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
MI ++ Q E HL +A+H+F F ++K Y + +YRF++FK NL
Sbjct: 5 MIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNLEDIN 64
Query: 86 RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK------------APILP 133
+ L+ +A++ + KFSDL+ +E ++ GL + P++ + AP
Sbjct: 65 EKNKLNDSAIYNINKFSDLSKNELLTKYTGLTSKK--PSNMVRSTSNFCNVIHLDAPPDV 122
Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
++LP +FDWR + +T VKDQGACGSCW+ +A G LE + + L++LSEQQL+DCD
Sbjct: 123 HDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD 182
Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
S + C+GGLM++AFE ++ AGG+ E DYPY GT G CK D K A +V
Sbjct: 183 ---------SANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTK-GVCKIDNKKFALSV 232
Query: 254 SNFS-VISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGY 311
S+ I +E+ + L+ GP+A+ I+A + TY G+ + C L+H VL+VGY
Sbjct: 233 SSCKRYIFQNEENLKKELITMGPIAMAIDAASISTYSKGI--IHFCENLGLNHAVLLVGY 290
Query: 312 GSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIH 365
G+ G YW +KNSWG +WGE+GY+++ N CG+++ +++ A IH
Sbjct: 291 GTEGGV-------SYWTLKNSWGSDWGEDGYFRVKRNINACGLNNQLAASATIH 337
>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
Length = 367
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 137/318 (43%), Positives = 178/318 (55%), Gaps = 26/318 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DWR GAVT VKDQG CGSCWSFSA G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD + D+GC GGLM++AFE+I+K +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
EK YPY G + CK K+ A ++ I DED +A L +GP+AV ++A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Y GGV SC + L+HGVL+VGY S + PYWIIKNSW +WGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 343 YKICMGRNVCGVDSMVSS 360
+I G N C V + SS
Sbjct: 312 IRIEKGTNQCLVAQLASS 329
>gi|16076439|emb|CAC94444.1| cysteine proteinase [Betula pendula]
Length = 133
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 110/133 (82%), Positives = 125/133 (93%)
Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
DHECDPEE G+CDSGC+GGLM +AFEY LKAGG+EREKDYPYTGTD GSCKFDKSKIAA+
Sbjct: 1 DHECDPEEYGACDSGCSGGLMTTAFEYTLKAGGLEREKDYPYTGTDRGSCKFDKSKIAAS 60
Query: 253 VSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYG 312
VSNFSV+S DEDQ+AANLVK+GPLA+GINA +MQTY+ GVSCPYICG+ LDHGVL+VGYG
Sbjct: 61 VSNFSVVSIDEDQIAANLVKNGPLAIGINAAFMQTYMKGVSCPYICGRRLDHGVLLVGYG 120
Query: 313 SSGFAPIRFKEKP 325
S+GF+PIRFKEKP
Sbjct: 121 SAGFSPIRFKEKP 133
>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
Length = 473
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 185/312 (59%), Gaps = 21/312 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY++QEE + R R+F+ N++ A+ Q L+ +A +G+TKFSDLT EFR
Sbjct: 175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDEFRM 234
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+L K I + P +DWRDHGAV+ VK+QG CGSCW+FS TG +E
Sbjct: 235 MYLNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNIE 294
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G F TG+L+SLSEQ+LVDCD D C GGL ++A+E I GG+E E D
Sbjct: 295 GQWFKKTGQLLSLSEQELVDCD---------KLDQACGGGLPSNAYEAIENLGGLETETD 345
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGG 291
Y YTG SC F K+AA +++ + DE ++AA L ++GP++ +NA MQ Y G
Sbjct: 346 YSYTGHK-QSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKG 404
Query: 292 VSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
VS P C ++ DH VL+VG+G P+W IKNSWGE++GE GYY + G
Sbjct: 405 VSHPLKIFCNPWMIDHAVLLVGFGQRNGV-------PFWAIKNSWGEDYGEQGYYYLYRG 457
Query: 349 RNVCGVDSMVSS 360
+CG+ M SS
Sbjct: 458 SGLCGIHKMCSS 469
>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
Length = 364
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 133/328 (40%), Positives = 182/328 (55%), Gaps = 31/328 (9%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLT 105
A +HF+ F + K Y + E RF +FK NL + Q D TA++G+ +F+DL+
Sbjct: 58 FGAWNHFTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLS 117
Query: 106 PSEFRRQFLG--------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
P EF++ L NR + L A+ + P LP FDWR+HGAVT VK +G
Sbjct: 118 PEEFKKTHLPHTWKQPDHPNRIVDLAAEG----VDPKEPLPESFDWREHGAVTKVKTEGH 173
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
C +CW+FS TG +EG FL+ +LVSLS QQL+DCD D GCNGG A+
Sbjct: 174 CAACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDCD---------VVDEGCNGGFPLDAY 224
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ I++ GG+E E YPY C+ S IA ++ + DE++M A LVK GP++
Sbjct: 225 KEIVRMGGLEPEDKYPYEAK-AEQCRLVPSDIAVYINGSVELPHDEEKMRAWLVKKGPIS 283
Query: 278 VGINAVWMQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
+GI +Q Y GGVS P C + HG L+VGYG K PYWIIKNSWG N
Sbjct: 284 IGITVDDIQFYKGGVSRPTTCRLSSMIHGALLVGYGVE-------KNIPYWIIKNSWGPN 336
Query: 337 WGENGYYKICMGRNVCGVDSMVSSVAAI 364
WGE+GYY++ G N C ++ +S +
Sbjct: 337 WGEDGYYRMVRGENACRINRFPTSAVVL 364
>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
Length = 473
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 185/312 (59%), Gaps = 21/312 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY++QEE + R R+F+ N++ A+ Q L+ +A +G+TKFSDLT EFR
Sbjct: 175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDEFRM 234
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+L K I + P +DWRDHGAV+ VK+QG CGSCW+FS TG +E
Sbjct: 235 MYLNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNIE 294
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G F TG+L+SLSEQ+LVDCD D C GGL ++A+E I GG+E E D
Sbjct: 295 GQWFKKTGQLLSLSEQELVDCD---------KLDQACGGGLPSNAYEAIENLGGLETETD 345
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGG 291
Y YTG SC F K+AA +++ + DE ++AA L ++GP++ +NA MQ Y G
Sbjct: 346 YSYTGHK-QSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKG 404
Query: 292 VSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
VS P C ++ DH VL+VG+G P+W IKNSWGE++GE GYY + G
Sbjct: 405 VSHPLKIFCNPWMIDHAVLLVGFGQRNGV-------PFWAIKNSWGEDYGEQGYYYLYRG 457
Query: 349 RNVCGVDSMVSS 360
+CG+ M SS
Sbjct: 458 SGLCGIHKMCSS 469
>gi|290999038|ref|XP_002682087.1| predicted protein [Naegleria gruberi]
gi|284095713|gb|EFC49343.1| predicted protein [Naegleria gruberi]
Length = 349
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 134/359 (37%), Positives = 194/359 (54%), Gaps = 58/359 (16%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-------- 90
G E LN +F FK + K YAT+EEH R+++F N+ + ++
Sbjct: 4 GAYDEKEALN---YFQHFKKLYLKRYATEEEHHRRWKIFYDNINLVNQLNIMHKPNEIAG 60
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK----APILP-----TNDLPTDF 141
P A +G+T+F D++P+EF R L LP QK P P + LP F
Sbjct: 61 KPVAQYGITQFMDMSPNEFARVKL-------LPPTKQKDINHTPTAPKEKYQIDALPESF 113
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+HGAVT VKDQ +CGSCW+FS +EGA+FL+ L S QQLVDCD
Sbjct: 114 DWREHGAVTAVKDQASCGSCWAFSTVENIEGAYFLAGHNLTKFSPQQLVDCD-------- 165
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG--------------------S 241
+ + GC GG A +YI K GG+ E YPY G +
Sbjct: 166 -NLNCGCFGGFPFIAMQYIQKRGGLATESSYPYCIPPLGNCFPCNTNKTYCPSGEYCNRT 224
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY 301
C ++ A V+ + +S +ED +AA LVK+GPL++ +NA+W+Q Y G+S P C
Sbjct: 225 CSVQNYQLVAKVAGYENVSQNEDDIAAYLVKNGPLSICLNAMWLQFYHSGISDPMYCPPD 284
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
+DH VL+VG+G+ ++ YWI+KNSWGE+WGE GY+++ G++ CG+++MV++
Sbjct: 285 IDHAVLLVGFGTH--TNWLGEKTNYWIVKNSWGESWGEKGYFRLIRGKDKCGINTMVAN 341
>gi|559532|emb|CAA57675.1| cysteine proteinase [Zea mays]
Length = 145
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 110/144 (76%), Positives = 128/144 (88%), Gaps = 4/144 (2%)
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E EKDYPYTG+DG CKFDKSKI A+V NFSV+S DE Q++AN +KHGPLA+GINA +MQ
Sbjct: 1 ESEKDYPYTGSDG-KCKFDKSKIVASVQNFSVVSVDEAQISANRIKHGPLAIGINAAYMQ 59
Query: 287 TYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
TYIGGVSCPYICG++LDHGVL+VGYG+SGFAP+R K+KPYWIIKNSWGENWGENGYYKIC
Sbjct: 60 TYIGGVSCPYICGRHLDHGVLLVGYGASGFAPMRLKDKPYWIIKNSWGENWGENGYYKIC 119
Query: 347 MG---RNVCGVDSMVSSVAAIHTT 367
G RN CGVDSMVS+V+A+H +
Sbjct: 120 RGSNVRNKCGVDSMVSTVSAVHAS 143
>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
Length = 322
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 136/322 (42%), Positives = 188/322 (58%), Gaps = 35/322 (10%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 22 SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLVRAQKLQLRDQGTARYGVTQFSDLTP 80
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L L +D Q + PT P DWR GAVT V++QG CGSCW+F
Sbjct: 81 EEFAAKYLSP----PLNSD-QVERVQPTGLKAAPERMDWRAKGAVTPVENQGECGSCWAF 135
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S G +EG F+ TG+LVSLS+QQLVDCD + GCNGG +S++ I+ G
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQQLVDCDMAAE---------GCNGGWPSSSYLEIMDMG 186
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E E DYPY G + +C +K K+ A + + V+ + E++ L +HGPL+ +NAV
Sbjct: 187 GLESENDYPYVGVE-QTCALNKEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLLNAVA 245
Query: 285 MQTYIGGV------SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
+Q Y G+ CP L+H VL VGY G + PYWIIKNSWG +WG
Sbjct: 246 LQHYQSGILHPSHKDCP---DDDLNHAVLTVGYDREG-------DMPYWIIKNSWGTDWG 295
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GY+++ G VCG++ M +S
Sbjct: 296 EKGYFRLFRGDCVCGINRMATS 317
>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 186/317 (58%), Gaps = 31/317 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F F ++K Y+++E ++ R +FK NLRR + D A HG+T+F+DLT EF
Sbjct: 30 FKKFTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKNDE-AQHGITQFADLTHEEFADM 88
Query: 113 FLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+LG +LR ++Q L + PT DW GAVT VK+QG+CGSCW+FS TG++
Sbjct: 89 YLGYKPQLR---NSQAKVSLSSTPFTAPTAIDWTTKGAVTPVKNQGSCGSCWAFSTTGSI 145
Query: 171 EGAHFLSTGE-LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
EG + L + L S SEQQLVDCD + D GCNGGLM++AF Y L++ +E E
Sbjct: 146 EGQYVLQLKQNLTSFSEQQLVDCDTK--------EDQGCNGGLMDNAFTY-LESAKLETE 196
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNF------SVISSDEDQMAANLVKHGPLAVGINAV 283
YPYT D GSCK+++S V++F ++ E+ M L GPL+V INA
Sbjct: 197 SAYPYTAVD-GSCKYNQSLGVVGVASFVDIEQGKTVADTENTMGVALDNIGPLSVAINAN 255
Query: 284 WMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Q Y GG+S P IC L+HGVLIVG GS K +W +KNSWG +WGE GY
Sbjct: 256 NLQFYAGGISNPLICNPNGLNHGVLIVGLGSE-------NGKDFWKVKNSWGASWGEKGY 308
Query: 343 YKICMGRNVCGVDSMVS 359
++I G+ CG++ VS
Sbjct: 309 FRIVRGKGKCGINRAVS 325
>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
Length = 567
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 183/319 (57%), Gaps = 34/319 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + ++K+YA E R +F NL A + Q LD +A +GVTKFSDLT EFR
Sbjct: 270 FKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQELDQGSAQYGVTKFSDLTEEEFRM 329
Query: 112 QFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L L R PA + P P +DWRDHGA+T K+QG CGSCW+FS
Sbjct: 330 FYLNPLLSSLPGRALRPAPRARGPA------PASWDWRDHGALTAAKNQGMCGSCWAFSV 383
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG FL G L++LSEQ+LVDCD + D C GGL ++A+ I GG+
Sbjct: 384 TGNVEGQWFLRRGALLTLSEQELVDCD---------TLDQACGGGLPSNAYTAIETLGGL 434
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E EKDY Y G C F K A +++ +S DE ++AA L ++GP+++ +NA MQ
Sbjct: 435 ETEKDYSYEGRK-ERCSFSPDKARAYINSSVDLSRDEQEIAAWLAENGPVSIALNAFAMQ 493
Query: 287 TYIGGVSCPY--ICGK-YLDHGVLIVGYGS-SGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Y GVS P+ +C ++DH VL+VGYG SG P+W IKNSWG +WGE GY
Sbjct: 494 FYRRGVSHPFRPLCSPWFIDHAVLLVGYGDRSGI--------PFWAIKNSWGPDWGEEGY 545
Query: 343 YKICMGRNVCGVDSMVSSV 361
Y + G CG+++M SS
Sbjct: 546 YYLYRGARACGMNTMASSA 564
>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 146/373 (39%), Positives = 199/373 (53%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC + LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 146/373 (39%), Positives = 199/373 (53%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC + LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 146/373 (39%), Positives = 199/373 (53%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC + LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 452
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 136/319 (42%), Positives = 178/319 (55%), Gaps = 26/319 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DWR GAVT VKDQG+CGSCWSFSA G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD S D+GC GG M++AFE+I+K +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCD---------SKDNGCGGGFMDNAFEWIVKENSGKV 201
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
EK YPY G + CK ++ A ++ I DED +A L +GP+AV ++A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Y GGV SC + L+HGVL+VGY S + PYWIIKNSW +WGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 343 YKICMGRNVCGVDSMVSSV 361
+I G N C V + SS
Sbjct: 312 IRIEKGTNQCLVAQLASSA 330
>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 146/373 (39%), Positives = 199/373 (53%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAIAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC + LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 447
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/319 (42%), Positives = 177/319 (55%), Gaps = 26/319 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 23 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 82
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DWR GAVT VKDQG+CGSCWSFSA G
Sbjct: 83 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 142
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD + D+GC GG M++AFE+I+K +G V
Sbjct: 143 NIEGQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKV 193
Query: 227 EREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
EK YPY DG C ++ A ++ I DED +A L +GP+AV ++A
Sbjct: 194 YTEKSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 253
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Y GGV SC + L+HGVL+VGY S + PYWIIKNSW +WGE GY
Sbjct: 254 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 303
Query: 343 YKICMGRNVCGVDSMVSSV 361
+I G N C V + SS
Sbjct: 304 IRIEKGTNQCLVAQLASSA 322
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 188/320 (58%), Gaps = 24/320 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH---GVTKFSDL 104
+AE H++ FKS K+Y +E R +F+ NL + ++ + GV +F+D+
Sbjct: 23 SAEPHWNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADM 82
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
T +EF LGL R ++ D+ DLP + DW G VT VK+QG CGSCW+F
Sbjct: 83 TNTEFSNMLLGLGGRNKIAGDSVFESS-HVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAF 141
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG+LEG F TG+LVSLSEQ LVDC + + GCNGGLM+ AF YI K G
Sbjct: 142 STTGSLEGQVFKKTGKLVSLSEQNLVDCS-------TSEGNQGCNGGLMDQAFTYIKKNG 194
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA- 282
G++ E YPYTG+D G+C+F ++K+ A VS F V S DE+ + + GP++V I+A
Sbjct: 195 GIDTEAAYPYTGSD-GTCRFLENKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDAS 253
Query: 283 -VWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
++ Q Y GGV P+ C LDHGVL+VGYG+ G K YW++KNSWG +WG
Sbjct: 254 SIFFQFYRGGVYNPWFCSSTELDHGVLVVGYGTEG-------GKDYWLVKNSWGSSWGLK 306
Query: 341 GYYKICMG-RNVCGVDSMVS 359
GY K+ +N CG+ + S
Sbjct: 307 GYIKMVRNKKNRCGIATQAS 326
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 188/320 (58%), Gaps = 33/320 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F KF + Y++ E RF+ + NL ++ Q + TA++GVT+FSD++P EF++
Sbjct: 170 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQK 229
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
L R+ ++ + + L N+LP FDWR G VT VK+QG+CGSCW+FS T
Sbjct: 230 TMLPSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPVKNQGSCGSCWAFSVT 289
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G +EG + TG+L+SLSEQ+L+DCD D GCNGGL +AF I + GG+E
Sbjct: 290 GNIEGLWAIKTGKLISLSEQELIDCDR---------IDKGCNGGLPINAFREIQRMGGLE 340
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQT 287
E YPY + G+C +S IA + + I +E M A +V+ GPL+VGI+A +
Sbjct: 341 PEDQYPYKARN-GTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDAKLLAY 399
Query: 288 YIGGV------SCPYICGKYLDHGVLIVGYG-SSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Y G+ CP +DHGVLI GYG +G PYW IKNSWG+ WGE+
Sbjct: 400 YKSGILHPSRSRCP---PSGIDHGVLITGYGVENGL--------PYWTIKNSWGDQWGED 448
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY+++ +G++VCGV +VSS
Sbjct: 449 GYFRLMLGKDVCGVSDLVSS 468
>gi|443696723|gb|ELT97360.1| hypothetical protein CAPTEDRAFT_147978 [Capitella teleta]
Length = 274
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 131/287 (45%), Positives = 177/287 (61%), Gaps = 21/287 (7%)
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
RR + ++ D A +G + F+DLT EFR+ +L + + A I P P F
Sbjct: 5 RRIQEKEQGD--ATYGASPFADLTAEEFRKNYLSPVWNVTHDPFLKPASI-PIETPPDAF 61
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDH AVT VK+QG+CGSCW+FS TG +EG + +L+SLSEQ+LVDCD
Sbjct: 62 DWRDHDAVTPVKNQGSCGSCWAFSVTGNVEGQWAIQKKKLLSLSEQELVDCDK------- 114
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
D GCNGGL A++ I++ GG+E EKDYPY G G C F+K+++ ++ ISS
Sbjct: 115 --VDLGCNGGLPLQAYKEIMRIGGLETEKDYPYEGK-GDKCVFEKAEVEVNITGAVNISS 171
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP--YICG-KYLDHGVLIVGYG-SSGFA 317
+ED M A L K+GP+++G+NA MQ Y+GGVS P ++C LDHGVLI GYG G+
Sbjct: 172 NEDDMKAWLWKNGPISIGLNANAMQFYMGGVSHPFSFLCSPSSLDHGVLITGYGIKQGW- 230
Query: 318 PIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAI 364
+ P+W IKNSWGE+WGE GYY + G VCGV+ M +S +
Sbjct: 231 ---MSDSPFWAIKNSWGESWGEKGYYLLYRGAGVCGVNQMPTSATVV 274
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 188/320 (58%), Gaps = 33/320 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F KF + Y++ E RF+ + NL ++ Q + TA++GVT+FSD++P EF++
Sbjct: 135 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQK 194
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
L R+ ++ + + L N+LP FDWR G VT VK+QG+CGSCW+FS T
Sbjct: 195 TMLPSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPVKNQGSCGSCWAFSVT 254
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G +EG + TG+L+SLSEQ+L+DCD D GCNGGL +AF I + GG+E
Sbjct: 255 GNIEGLWAIKTGKLISLSEQELIDCDR---------IDKGCNGGLPINAFREIQRMGGLE 305
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQT 287
E YPY + G+C +S IA + + I +E M A +V+ GPL+VGI+A +
Sbjct: 306 PEDQYPYKARN-GTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDAKLLAY 364
Query: 288 YIGGV------SCPYICGKYLDHGVLIVGYG-SSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Y G+ CP +DHGVLI GYG +G PYW IKNSWG+ WGE+
Sbjct: 365 YKSGILHPSRSRCP---PSGIDHGVLITGYGVENGL--------PYWTIKNSWGDQWGED 413
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY+++ +G++VCGV +VSS
Sbjct: 414 GYFRLMLGKDVCGVSDLVSS 433
>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 454
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 136/319 (42%), Positives = 176/319 (55%), Gaps = 26/319 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DW GAVT VKDQG CGSCWSFSA G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWGRKGAVTPVKDQGTCGSCWSFSAIG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD + D+GC GGLM++AFE+I+K +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
EK YPY G + CK K+ A ++ I DED +A L +GP+AV ++A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Y GGV SC + L+HGVL+VGY S + PYWIIKNSW +WGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 343 YKICMGRNVCGVDSMVSSV 361
+I G N C V SS
Sbjct: 312 IRIEKGTNQCLVAQRASSA 330
>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
Length = 467
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 139/321 (43%), Positives = 175/321 (54%), Gaps = 34/321 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK ++ + Y + E +R VF+ NL AK +P A GVT FSDLT EFR
Sbjct: 37 QFADFKQRYGRVYKSAAEEAFRLSVFRKNLLDAKLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F +R R+P D + D P DWRD GAVT VKDQG CGSCW+F
Sbjct: 97 RHHSGAAHFAAGRKRARVPVD------VGVGDAPAAVDWRDRGAVTPVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK-- 222
SA G +EG FL+ L SLSEQ LV CD + DSGC+GGLMNSAFE+I++
Sbjct: 151 SAIGNVEGQWFLAGNALTSLSEQMLVSCD---------TMDSGCDGGLMNSAFEWIVEHH 201
Query: 223 AGGVEREKDYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
G V E+ Y Y DG C+ + A ++ + DE +MA L +GPLAV +
Sbjct: 202 NGTVYTEESYRYASGDGIAQPCRTSGRTVGAVITGHVKLPPDEAKMATWLAANGPLAVAV 261
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A Y GGV + + LDHGVL+VGY S AP PYWI+KNSWG WGE+
Sbjct: 262 DASSWMFYTGGVLTSCVSNE-LDHGVLLVGYNDSA-AP------PYWIVKNSWGTLWGED 313
Query: 341 GYYKICMGRNVCGVDSMVSSV 361
GY +I G N C V SS
Sbjct: 314 GYVRIAKGTNQCLVKEEASSA 334
>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
str. Neff]
Length = 330
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 186/321 (57%), Gaps = 18/321 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKF 101
+E + AE F F +++ K+YA+ EE R R+F+ NL R + A +GV KF
Sbjct: 21 AEAGTMTAEQQFRQFAAQYGKSYAS-EEFGERLRIFRDNLDRIDALNSANTGARYGVNKF 79
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
+DLTP EF+ +L R A A + T LP+ FDWRD GAVT KDQG CG
Sbjct: 80 ADLTPKEFKATYLKGARSAGQKKAAATAKLDMTGPLPSQFDWRDKGAVTPTKDQGQCG-- 137
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS T A+E FLS +LVSL+ QQ+VDCD G+ D GC+GG +A+EY++
Sbjct: 138 WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQ-------GNGDYGCDGGDPPTAYEYVI 190
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS--DEDQMAANLVKHGPLAVG 279
KAGG++ E+ YPYT D G C F S + A +SN++ I++ +E +M L GPL++
Sbjct: 191 KAGGLDTEESYPYTAED-GQCAFKPSAVGAKISNWTYITTTKNETEMQYGLASRGPLSIC 249
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYG-SSGFAPIRFKEKPYWIIKNSWGENWG 338
++A Q YIGGV +C LDH V+I GY G+ F + W I+NSWGE+WG
Sbjct: 250 VDASSWQYYIGGVITS-LCEDSLDHCVMITGYSVQEGW---DFMKYDVWNIRNSWGEDWG 305
Query: 339 ENGYYKICMGRNVCGVDSMVS 359
GY + G N+CGV V+
Sbjct: 306 YGGYLYVQRGSNLCGVGDEVT 326
>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
Length = 325
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 186/322 (57%), Gaps = 27/322 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK + K YA +++ RF +FK NL RA++ Q+ + TA +GVT+FSDLTP
Sbjct: 27 NARELYEQFKRDYGKAYANEDDQK-RFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF +LG R+ + + P DWR GAV V+DQG+CGSCW+FS
Sbjct: 86 EEFAAMYLGS----RIDERVDRVQLNDLQTAPASVDWRKKGAVGPVEDQGSCGSCWAFSV 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
T +EG FL TG LVSLS+QQLVDCD D GC+GG ++ I + GG+
Sbjct: 142 TANVEGQWFLKTGRLVSLSKQQLVDCDR---------LDHGCSGGYPPYTYKEIKRMGGL 192
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E + YPYT +C+ D+SK+ A + + V+ +DE++ AA L +HGP++ +NA +Q
Sbjct: 193 ELQSAYPYTSWK-QACRIDRSKLVAKIDDSIVLETDEEKQAAWLAEHGPMSTCLNAGPLQ 251
Query: 287 TYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+ P +C + L+H VL VGY + PYW ++NSWG WGENGY+
Sbjct: 252 FYQSGILHPSKAMCSPEGLNHAVLTVGYDTEHGV-------PYWTVRNSWGTRWGENGYF 304
Query: 344 KICMGRNVCGVDSMVSSVAAIH 365
+I G CG+D + +S A IH
Sbjct: 305 RIYRGDGTCGIDRLTTS-AIIH 325
>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 146/373 (39%), Positives = 198/373 (53%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VK QG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC K LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSKQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
Length = 321
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 183/324 (56%), Gaps = 30/324 (9%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 22 SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTP 80
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L + Q + PT P DWR GAVT V++QG+CGSCW+F
Sbjct: 81 EEFAAKYLSAPVN-----NDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 135
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S G +EG F+ TG+LVSLS+QQLVDCD D GCNGG S++ I+ G
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD---------GCNGGWPASSYLEIMHMG 186
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+E + DYPY G C +K ++ A + + + ED AA L +HGPL+ +NA+
Sbjct: 187 GLESQDDYPYAGVK-EQCFMEKERLLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAIT 245
Query: 285 MQTYIGGVSCPYI--CGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+Q Y G+ P C L+H VL VGY G + PYWIIKNSW WGE G
Sbjct: 246 LQYYQSGIIHPSYEECSPVDLNHAVLTVGYDKEG-------DMPYWIIKNSWNVEWGEKG 298
Query: 342 YYKICMGRNVCGVDSMVSSVAAIH 365
Y+++ G CG++ M +S A IH
Sbjct: 299 YFRLYRGDGTCGINRMPTS-AIIH 321
>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 178/321 (55%), Gaps = 23/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG CGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG--GV 226
+EG ++ L SLSEQ LV CD E D GC GGLM++AF++I+ + V
Sbjct: 158 NIEGQWKVTGHNLTSLSEQMLVSCDTE---------DLGCAGGLMDNAFKWIVSSNRHNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G C+ + A + + + DE+ +A L K+GP+A+ +++
Sbjct: 209 FTEESYPYASKGGNVPPCRMSGKVVGAKIRDHVDLPKDENAIAEWLAKNGPVAIAVDSTS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q+Y GGV I K LDHGVL+VGY + + PYWIIKNSW + WGE GY +
Sbjct: 269 FQSYTGGVLTSCI-SKQLDHGVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSMVSSVAAIH 365
I G N C V + +S A +H
Sbjct: 321 IEKGTNQCLVKNYATS-AVVH 340
>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 380
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 22/316 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG CGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC GGLM+ AF++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TNDFGCEGGLMDDAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G DKS + A + + + DE+ +A L K+GP+A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPACDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q+Y GGV I ++LDHGVL+VGY + + PYWIIKNSW + WGE GY +
Sbjct: 269 FQSYTGGVLTSCI-SEHLDHGVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C + ++ SS
Sbjct: 321 IEKGTNQCLMKNLPSS 336
>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
Length = 450
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 145/373 (38%), Positives = 197/373 (52%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC + LDHGVL+VGY S PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSEQLDHGVLLVGYNDS-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 344
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 121/321 (37%), Positives = 175/321 (54%), Gaps = 30/321 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FKSKF+K Y + EH F +K + + Q+ +P A G TKFSD++P EF +
Sbjct: 33 FEEFKSKFNKYYHNEHEHHSSFHNYKTSREHIVKHQMENPNAKFGHTKFSDMSPEEFENK 92
Query: 113 FLGLNRRL---------RLPADAQKAPI-----LPTNDLPTDFDWRDHGAVTGVKDQGAC 158
L + L +L A+ K + + +DLP FDWRD G +T K Q C
Sbjct: 93 MLNFDFSLFKKAKSQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQNTC 152
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+F+ TG +E + L GEL+ SEQ L+DCD + + GC GGLM A++
Sbjct: 153 GSCWTFATTGVIESQYALKYGELLHFSEQMLLDCD---------NINQGCRGGLMTDAYQ 203
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
++ ++GG++ Y C FDK+K+ A V ++ I +E+ + LVK+GP+AV
Sbjct: 204 FLQQSGGIQTADTYGDYKNKKDICNFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVAV 263
Query: 279 GINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
GINA +Q Y GG+ P C ++H VLIVGYG + PYW+IKN WG WG
Sbjct: 264 GINARTLQFYEGGIVDPKNCDDKINHAVLIVGYGVE-------EGIPYWLIKNQWGAEWG 316
Query: 339 ENGYYKICMGRNVCGVDSMVS 359
G++K+ G+ CG+ + S
Sbjct: 317 IKGFFKLIRGKKQCGIHTYAS 337
>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 389
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 134/318 (42%), Positives = 175/318 (55%), Gaps = 26/318 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DWR GAVT VKDQG CGSCWSFSA G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD + D+GC GG M++AFE+I+K +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKV 201
Query: 227 EREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
K YPY DG C ++ A ++ I DED +A L +GP+AV ++A
Sbjct: 202 YTGKSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Y GGV SC + L+HGVL+VGY S + PYWIIKNSW +WGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 343 YKICMGRNVCGVDSMVSS 360
+I G N C V + SS
Sbjct: 312 IRIEKGTNQCLVAQLASS 329
>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 144/373 (38%), Positives = 197/373 (52%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC + LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 450
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 144/373 (38%), Positives = 197/373 (52%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC + LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 451
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 144/373 (38%), Positives = 197/373 (52%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC + LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|16076437|emb|CAC94443.1| cysteine proteinase [Betula pendula]
Length = 133
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 106/133 (79%), Positives = 123/133 (92%)
Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
DHECDPEE GSCDSGC+GGLMNSAFEY LKAGG+ RE+DYPYTGTD +CKFDKSKIAA+
Sbjct: 1 DHECDPEEQGSCDSGCSGGLMNSAFEYTLKAGGLMREEDYPYTGTDRSTCKFDKSKIAAS 60
Query: 253 VSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYG 312
VSNFSVIS DEDQ+AANLVK+GPLAV INAV+MQT++GGVSCPYIC + LDHGVL+VG+G
Sbjct: 61 VSNFSVISLDEDQIAANLVKNGPLAVAINAVFMQTHVGGVSCPYICSRRLDHGVLLVGFG 120
Query: 313 SSGFAPIRFKEKP 325
S+G++P+R KEKP
Sbjct: 121 SAGYSPVRMKEKP 133
>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
partial [Trypanosoma vivax Y486]
Length = 323
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 173/312 (55%), Gaps = 26/312 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DWR GAVT VKDQG CGSCWSFSA G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGRCGSCWSFSAIG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD + D+GC GG M++AFE+I+K +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKV 201
Query: 227 EREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
EK YPY DG C ++ A ++ I DED +A L +GP+AV ++A
Sbjct: 202 YTEKSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Y GGV SC + L+HGVL+VGY S + PYWIIKNSW +WGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 343 YKICMGRNVCGV 354
+I G N C V
Sbjct: 312 IRIEKGTNQCLV 323
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 143/364 (39%), Positives = 200/364 (54%), Gaps = 46/364 (12%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+I S+L+LL++ A+A R DG L ++ F + +K K+
Sbjct: 5 MIASTLILLVVVGATPFAIA--------RPAALEDGRA-----LEIKNMFEDWAAKHGKS 51
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR- 121
Y++ E R +F L ++ + T G+ KFSDLT +EFR +G +R R
Sbjct: 52 YSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRY 111
Query: 122 ---LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
LPA+ + + + LPT DWR GAVT +KDQG CGSCW+FSA ++E AHFL+T
Sbjct: 112 QDRLPAEDEDVDV---SSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLAT 168
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
ELVSLSEQQL+DCD + D+GC+GGLM +AF++++K GGV E YPYTG+
Sbjct: 169 KELVSLSEQQLMDCD---------TVDAGCDGGLMETAFKFVVKNGGVTTEASYPYTGS- 218
Query: 239 GGSCKFDKSKI---AAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWMQTYIGGVS 293
GSC +K I A ++ F V++ D V P+ V I + Q Y G+
Sbjct: 219 VGSCNANKVAIINKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGIL 278
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM--GRNV 351
CG LDHGVL++GYG+ G PYWIIKNSWG +WGE+G+ KI G +
Sbjct: 279 SGQ-CGDSLDHGVLLIGYGTEG-------GMPYWIIKNSWGTSWGEDGFMKIERKDGDGI 330
Query: 352 CGVD 355
CG++
Sbjct: 331 CGMN 334
>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
Length = 344
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 133/360 (36%), Positives = 196/360 (54%), Gaps = 30/360 (8%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
++L V AS N DA+I V + + + +L A +F F++K+ K YA E
Sbjct: 4 IILFFVFVFASGGFDNGVDAIIDYVTAAPQFKLQYNLERAPQYFETFQTKYKKVYADDNE 63
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
DYR+++FK NL + + +AV+ + KF+DLT +E +F GL +R PA
Sbjct: 64 RDYRYKIFKTNLEIINLKNQQNDSAVYNINKFADLTKNEVIAKFTGLG--IRSPALKNSC 121
Query: 130 -PIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
P++ P+ FDWR +T VKDQG CGSCW+FS LE + + E V LS
Sbjct: 122 EPVIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAGLESQYAIKYNEHVDLS 181
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCD + D GC GGL+++A+E I+ GG+E E+DYPY G C+
Sbjct: 182 EQQLVDCD---------TIDMGCAGGLLHTAYEEIMAMGGLEYEEDYPYRSVQ-GPCRLQ 231
Query: 246 KSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LD 303
K +V N + + ED++ L + GP+AV ++AV + Y GG+ C Y L+
Sbjct: 232 SDKFEVSVDNCYRYVLYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITS--CKNYGLN 289
Query: 304 HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAA 363
H VL+VGYG P+W++KNSWG ++GENG+ ++ N CG M++ +AA
Sbjct: 290 HAVLLVGYGIENGV-------PFWVLKNSWGSDYGENGFVRVKRNVNSCG---MINELAA 339
>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 233 bits (595), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 177/318 (55%), Gaps = 26/318 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG CGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CDP E C GG M++AF +I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQTLVS----CDPTE-----YACEGGFMDNAFRWIISSNKGKV 208
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY+ G + +C + A +S++ + DE+ +A L K+GP++V ++A
Sbjct: 209 FTEQSYPYSSGGRNVPACNMSGKVVGANISDYVDLPQDENAIAEWLAKNGPVSVIVDATS 268
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q+Y GGV SC K L+H VL+VGY + + PYWIIKNSW E WGE GY
Sbjct: 269 FQSYTGGVLTSC---LSKILNHAVLLVGYDDTS-------KPPYWIIKNSWSEKWGEKGY 318
Query: 343 YKICMGRNVCGVDSMVSS 360
+I G N C V SS
Sbjct: 319 IRIEKGTNQCLVQEYASS 336
>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 143/373 (38%), Positives = 197/373 (52%), Gaps = 55/373 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+R+R K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRVR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 273
Query: 290 GGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GG+ SC + LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 274 GGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 323
Query: 348 GRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 324 GTNQCLMNQAVSS 336
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 181/308 (58%), Gaps = 25/308 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV----HGVTKFSDLTPSE 108
F FK K KTY Q E RF +FK NLR ++ +L + G+ +F+D+T E
Sbjct: 25 FQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQEE 84
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
FR FL L+ + P +L +P DWR G VTGVKDQG CGSCW+FS TG
Sbjct: 85 FR-AFLTLSSSKK-PHFNTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTG 142
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+ E A++ G+LVSLSEQQLVDC S ++GCNGG ++ F Y+ K+ G+E
Sbjct: 143 STEAAYYRKAGKLVSLSEQQLVDC--------STDINAGCNGGYLDETFTYV-KSKGLEA 193
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVWMQT 287
E YPY GTD GSCK+ SK+ VS S+ S DE+ + + GP++V I+A ++ +
Sbjct: 194 ESTYPYKGTD-GSCKYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDATYLSS 252
Query: 288 YIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y G+ C L+HGVL+VGYG+S K YWI+KNSWG ++GE+GY+++
Sbjct: 253 YESGIYEDDWCSPSELNHGVLVVGYGTS-------NGKKYWIVKNSWGGSFGESGYFRLL 305
Query: 347 MGRNVCGV 354
G+N CGV
Sbjct: 306 RGKNECGV 313
>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
Length = 325
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 186/322 (57%), Gaps = 27/322 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK + K YA +++ RF +FK NL RA++ Q + TA +GVT+FSDLT
Sbjct: 27 NARELYEQFKRDYGKAYANEDDQK-RFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDLTN 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF +LG R+ + + P DWR+ GAV V+ QG+CGSCW+FS
Sbjct: 86 EEFAAMYLGS----RIDERVDRVQLNDLQTAPASVDWREKGAVGPVEHQGSCGSCWAFSV 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
T +EG FL TG LVSLS+QQLVDCD D GC+GG ++ I + GG+
Sbjct: 142 TANVEGQWFLKTGRLVSLSKQQLVDCDR---------LDHGCSGGYPPYTYKEIKRMGGL 192
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E + YPYTG + +C+ D+SK+ A + + V+ +E++ AA L +HGP++ +NA +Q
Sbjct: 193 ELQSAYPYTGWE-QACRLDRSKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQ 251
Query: 287 TYIGGVSCP--YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+ P Y C + L+H VL VGY + + PYW ++NSWG WGENGY+
Sbjct: 252 FYRYGILHPSEYACSPEGLNHAVLTVGYDTE-------RGVPYWTVRNSWGTRWGENGYF 304
Query: 344 KICMGRNVCGVDSMVSSVAAIH 365
+I G CG+D + +S A IH
Sbjct: 305 RIYRGDGTCGIDRLTTS-AIIH 325
>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 363
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/316 (40%), Positives = 178/316 (56%), Gaps = 22/316 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT V+D+ C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVTVSTGKAPDAVDWRKKGAVTPVRDERLCDSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ L+ CD D GC GGLM+ AF++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLLSCDTRED---------GCGGGLMDRAFQWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY TDG + +KS + A +S++ + DE+ +A L K+GP+A+ + A
Sbjct: 209 FTEQSYPYASTDGDVPRCNKSGKVVGAKISDYVDLPQDENAIAEWLAKNGPVAIAVEATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
+Q Y GGV I + LDHGVL+VGY + + PYWIIKNSWG+ WGE GY +
Sbjct: 269 LQRYTGGVLTSCI-SEQLDHGVLLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C + + SS
Sbjct: 321 IEKGTNQCLMKNYASS 336
>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
Length = 597
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 142/324 (43%), Positives = 191/324 (58%), Gaps = 24/324 (7%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTK 100
S+D + F F + +++TY T+EE +R VF +N+ RA++ Q LD TA +GVTK
Sbjct: 289 SQDFSVKMASIFKNFVTTYNRTYQTKEEAQWRLSVFASNMVRAQKIQALDHGTAQYGVTK 348
Query: 101 FSDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
FSDLT EFR +L N LR +P + P ++DWR +GAVT VKDQG CG
Sbjct: 349 FSDLTEEEFRTIYL--NPLLREVPGKKMHLAKSIGDPAPPEWDWRKNGAVTKVKDQGMCG 406
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+
Sbjct: 407 SCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYSA 457
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I GG+E E DY Y G +C F K +++ +S +E ++AA L K GP++V
Sbjct: 458 IKNLGGLETEDDYSYQG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVA 516
Query: 280 INAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
INA MQ Y G++ P +C +L DH VLIVGYG+ E P+W IKNSWG +
Sbjct: 517 INAFGMQFYRHGIAHPLRPLCSPWLIDHAVLIVGYGNRS-------EVPFWAIKNSWGTD 569
Query: 337 WGENGYYKICMGRNVCGVDSMVSS 360
WGE GYY + G CGV++M SS
Sbjct: 570 WGEKGYYYLHRGSGSCGVNTMASS 593
>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
Length = 460
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 195/345 (56%), Gaps = 26/345 (7%)
Query: 24 VNDDDAMIRQVVPSDGEQS--EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
D + ++ +P+ S +D + F F +++TY ++EE +R VF +N+
Sbjct: 132 TEDRNETLKSTLPALNRDSLPQDFSVKMASIFKKFVRTYNRTYESKEEAQWRLSVFASNM 191
Query: 82 RRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPT 139
RA++ Q LD TA +G+TKFSDLT EFR +L N LR + P D P
Sbjct: 192 VRAQKIQSLDRGTAQYGITKFSDLTEEEFRTIYL--NPLLRSEPGKKMQLAKPVEDPAPP 249
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
+DWR GAVT VKDQG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD
Sbjct: 250 QWDWRSKGAVTNVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDK----- 304
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
D C GGL ++A+ I GG+E E+DY Y G +C F K +++ +
Sbjct: 305 ----LDKACLGGLPSNAYSAIKNLGGLETEEDYTYQG-HMQACNFSAQKAKVYINDSVEL 359
Query: 260 SSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGF 316
S +E ++AA L K GP++V INA MQ Y G++ P +C +L DH VL+VGYG+
Sbjct: 360 SQNEQKLAAWLAKRGPISVAINAFGMQFYRRGIAHPLRPLCSPWLIDHAVLLVGYGNRS- 418
Query: 317 APIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
P+W IKNSWG +WGE GYY + G VCGV++M SS
Sbjct: 419 ------ATPFWAIKNSWGADWGEEGYYYLYRGSGVCGVNTMASSA 457
>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
Length = 489
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 141/322 (43%), Positives = 187/322 (58%), Gaps = 23/322 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 183 QDLAVKMASIFRNFVITYNRTYESKEEAQWRLSVFVHNMVRAQKIQALDRGTAQYGVTKF 242
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
SDLT EFR +L N LR P K + P ++DWR GAVT VKDQG CGSC
Sbjct: 243 SDLTEEEFRTTYL--NPLLREPGKKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSC 300
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL +SA+ I
Sbjct: 301 WAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------IDKACMGGLPSSAYSAIK 351
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
GG+E E DY Y G +C F K +++ +S +E ++AA L K GP++V IN
Sbjct: 352 NLGGLETEDDYSYRG-HMQACNFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAIN 410
Query: 282 AVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +WG
Sbjct: 411 AFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWG 463
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GYY + G CGV++M SS
Sbjct: 464 EKGYYYLHRGSGACGVNTMASS 485
>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
Length = 344
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 132/362 (36%), Positives = 199/362 (54%), Gaps = 34/362 (9%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
++L V+AS N +A+I V + + + +L A +F F++K+ K YA E
Sbjct: 4 IILFFVFVVASGGLDNGVNAVIDYVAAAPHFKLQYNLERAPQYFETFQTKYKKVYADDNE 63
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR---LRLPADA 126
DYR+++FK NL + + +AV+ + KF+DLT +E +F GL + L+ D
Sbjct: 64 RDYRYKIFKTNLEIINLKNQQNDSAVYNINKFADLTKNEVIAKFTGLGVKSPNLKNFCD- 122
Query: 127 QKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
P++ P+ FDWR +T VKDQG CGSCW+FS LE + + E +
Sbjct: 123 ---PLIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAGLESQYAIKYNEHID 179
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQQLVDCD + D GC GGL+++A+E I+ GGVE E+DYPY G C+
Sbjct: 180 LSEQQLVDCD---------TIDMGCAGGLLHTAYEEIMSMGGVEYEEDYPYRSVQ-GPCR 229
Query: 244 FDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY- 301
+ K +V N + I ED++ L + GP+AV ++AV + Y GG+ C Y
Sbjct: 230 IENDKFQVSVDNCYRYILYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITS--CKNYG 287
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
L+H VL+VGYG+ P+W++KNSWG ++GENG+ ++ N CG M++ +
Sbjct: 288 LNHAVLLVGYGTENGI-------PFWVLKNSWGTDYGENGFVRVKRNVNSCG---MINEL 337
Query: 362 AA 363
AA
Sbjct: 338 AA 339
>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
Length = 473
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 141/326 (43%), Positives = 191/326 (58%), Gaps = 24/326 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVT 99
Q +D F F + +++TY T+EE +R VF N+ RA++ Q LD TA +G+T
Sbjct: 164 QPQDFSGKMASIFKNFVTTYNRTYETKEETKWRMSVFANNMIRAQKLQALDQGTAQYGIT 223
Query: 100 KFSDLTPSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGAC 158
KFSDLT EFR +L N LR P + P +P D+DWR GAVT VKDQG C
Sbjct: 224 KFSDLTEEEFRTIYL--NPLLREDPGQKMRLGKAPKGPVPPDWDWRTKGAVTKVKDQGMC 281
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GG+ ++A+
Sbjct: 282 GSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD---------KVDKACMGGVPSNAYS 332
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
I GG+E E+DY Y G +C F K +++ +S +E ++AA L K+GP++V
Sbjct: 333 AIKTLGGLETEEDYSYHG-HLQACSFSAEKAKVYINDSVELSQNEYKLAAWLAKNGPISV 391
Query: 279 GINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
INA MQ Y G++ P +C +L DH VLIVGYG+ + P+W IKNSWG
Sbjct: 392 AINAFGMQFYRHGIAHPLRPLCSPWLIDHAVLIVGYGNR-------SDVPFWAIKNSWGT 444
Query: 336 NWGENGYYKICMGRNVCGVDSMVSSV 361
+WGE GYY + G CGV++M SS
Sbjct: 445 DWGEEGYYYLHRGSGACGVNTMASSA 470
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 142/362 (39%), Positives = 200/362 (55%), Gaps = 44/362 (12%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+I S+L+LL++ A+A R DG L ++ F + +K K+
Sbjct: 1 MIASTLILLVVVGATPFAIA--------RPAALEDGRA-----LEIKNMFEDWAAKHGKS 47
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR- 121
Y++ E R +F L ++ + T G+ KFSDLT +EFR +G +R R
Sbjct: 48 YSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRY 107
Query: 122 ---LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
LPA+ + + + LPT DWR GAVT +KDQG CGSCW+FSA ++E AHFL+T
Sbjct: 108 QDRLPAEDEDVDV---SSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLAT 164
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
ELVSLSEQQL+DCD + D+GC+GGLM +AF++++K GGV E YPYTG+
Sbjct: 165 KELVSLSEQQLMDCD---------TVDAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGS- 214
Query: 239 GGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWMQTYIGGVSCP 295
GSC +K+K A ++ F V++ D V P+ V I + Q Y G+
Sbjct: 215 VGSCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGI-LS 273
Query: 296 YICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM--GRNVCG 353
C LDHGVL++GYG+ G PYWIIKNSWG +WGE+G+ KI G +CG
Sbjct: 274 GKCDDSLDHGVLLIGYGTEG-------GMPYWIIKNSWGTSWGEDGFMKIERKDGDGMCG 326
Query: 354 VD 355
++
Sbjct: 327 MN 328
>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
Length = 490
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 188/329 (57%), Gaps = 34/329 (10%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY T+EE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 183 QDFSVKMASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKF 242
Query: 102 SDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
SDLT EFR +L R++RL P P ++DWR GAVT VKDQ
Sbjct: 243 SDLTEEEFRTIYLNPLLQEEPGRKMRLAKSVSSLP-------PPEWDWRKKGAVTKVKDQ 295
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD D GC GGL ++
Sbjct: 296 GMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDK---------VDKGCMGGLPSN 346
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
A+ I GG+E E+DY Y G +C F+ K +++ +S +E ++AA L + GP
Sbjct: 347 AYSAIKTLGGLETEEDYSYRG-HLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGP 405
Query: 276 LAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNS 332
++V INA MQ Y G+S P +C +L DH VL+VGYG+ P+W IKNS
Sbjct: 406 ISVAINAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-------ATPFWAIKNS 458
Query: 333 WGENWGENGYYKICMGRNVCGVDSMVSSV 361
WG +WGE GYY + G CGV+ M SS
Sbjct: 459 WGTDWGEEGYYYLYRGSGACGVNIMASSA 487
>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
familiaris]
Length = 490
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 197/343 (57%), Gaps = 25/343 (7%)
Query: 25 NDDDAMIRQVVP--SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR 82
+D + + V+P + +D + F F + +++TY T+EE ++R VF N+
Sbjct: 162 DDRNETLSSVLPLLNKDPLPQDFSVKMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMV 221
Query: 83 RAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
RA++ Q LD TA +G+TKFSDLT EFR +L R + A + + P ++
Sbjct: 222 RAQKIQALDRGTAQYGITKFSDLTEEEFRTIYLNPLLRENRGKKMRLAKSISDHAPPPEW 281
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR GAVT VKDQG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD
Sbjct: 282 DWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCD-------- 333
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
D C GGL ++A+ I+ GG+E E DY Y G +C F K +++ +S
Sbjct: 334 -KVDKACLGGLPSNAYSAIMTLGGLETEDDYSYQG-HLQACSFSAKKARVYINDSMELSQ 391
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGS-SGFA 317
+E ++AA L K GP++V INA MQ Y G+S P +C +L DH VL+VGYG+ SG
Sbjct: 392 NEQKLAAWLAKKGPISVAINAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSGI- 450
Query: 318 PIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
P+W IKNSWG +WGE GYY + G CGV++M SS
Sbjct: 451 -------PFWAIKNSWGTDWGEEGYYYLHRGSGACGVNTMASS 486
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 135/360 (37%), Positives = 189/360 (52%), Gaps = 51/360 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+ IL+SLL++ +S+ L + DG HF FK K
Sbjct: 1 MKSFILASLLVVAVSATL----------------LKEDGV-----------HFQSFKLKH 33
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGL 116
KTY Q E RF +F+ NLR+ + +H G+ KF+D+T +EF+ L
Sbjct: 34 GKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFK-AMLAT 92
Query: 117 NRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+ + A K L +P DWR VT +KDQ CGSCWSF+ G+ EGA+
Sbjct: 93 QVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYA 152
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
LSTG+L SEQQLVDC + + GC+GG ++ F YI + G+E E DYPYT
Sbjct: 153 LSTGKLTRFSEQQLVDC--------TTDLNYGCDGGYLDDTFPYI-QTNGLELESDYPYT 203
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP 295
G D GSC +D SK+ VS++ + ++E + + GP+A+ INA +Q Y G+
Sbjct: 204 GYD-GSCSYDSSKVVTKVSSYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSGIIDD 262
Query: 296 YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGV 354
C ++LDHGVL VGY S YW+IKNSWG +WGE+GY++ G+N+CGV
Sbjct: 263 KYCDPEWLDHGVLAVGYNSE-------NGLDYWLIKNSWGADWGESGYFRFLRGQNICGV 315
>gi|1136312|gb|AAB41118.1| cruzipain [Trypanosoma cruzi]
Length = 383
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 169/322 (52%), Gaps = 34/322 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ANL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTAFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
G V E YPY +G S C + A ++ + DE Q+AA L +GP+AV +
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAV 261
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A TY GGV + + LDHGVL+VGY S PYW+IKNSW WGE+
Sbjct: 262 DASSWMTYTGGVMTSCV-SEQLDHGVLLVGYNDSA-------AVPYWVIKNSWTTQWGED 313
Query: 341 GYYKICMGRNVCGVDSMVSSVA 362
GY +I G N C V SS A
Sbjct: 314 GYIRIAKGSNQCLVKEEASSAA 335
>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
Length = 443
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/333 (39%), Positives = 185/333 (55%), Gaps = 56/333 (16%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
FS FK+ ++ Y + E RF +F AN+++A +P A G +F+D++ EF+ +
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84
Query: 113 F-----------------LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
+ AD QK DWR GAVT VK+Q
Sbjct: 85 HNAARHYAAAKARRAKHTKSFTKEEIKAADGQK------------IDWRLKGAVTSVKNQ 132
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G+CGSCWSFS TG +EG + ++TG LVSLSEQ+LV CD + D+GCNGGLM++
Sbjct: 133 GSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQELVSCD---------TTDNGCNGGLMDN 183
Query: 216 AFEYIL--KAGGVEREKDYPYTGTDG--GSCKF--DKSKIAAAVSNFSVISSDEDQMAAN 269
AF +++ + G + E YPY +G +C + D + A +SNF I+ E+ MAA
Sbjct: 184 AFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAF 243
Query: 270 LVKHGPLAVGINAVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
+ +GPL++G++A Q+Y GG+ CP + +DHGVLIVGY + AP PYW
Sbjct: 244 VFNYGPLSIGVDASTWQSYAGGIITYCPDV---QIDHGVLIVGYDDT--AP-----TPYW 293
Query: 328 IIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
IIKNSW NWGE+GY ++ G N+CG+ S SS
Sbjct: 294 IIKNSWTANWGEDGYIRVAKGSNMCGLTSTPSS 326
>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
Length = 347
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 140/374 (37%), Positives = 200/374 (53%), Gaps = 57/374 (15%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
+++LI++ LLL+ L+S S ++ E F F+ K+
Sbjct: 2 IKKLIVAILLLVALASARTSNLSF------------------------EETQFREFQLKY 37
Query: 61 SKTYATQEEHDY--RFRVFKANLRRAK------RRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+K Y E H++ + FK +L+R + +R +D GV KF+DL+ EF
Sbjct: 38 NKHY---ESHEFAQKLATFKNSLKRIQELNDMAKRAKVDTE--FGVNKFADLSKEEFANY 92
Query: 113 FLGLNRRLRLPADAQK-APILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L N+ D++ AP ++LPT FDWR GAVT VKDQG CGSCWSFS TG
Sbjct: 93 YL--NKGGMESTDSETYAPDYSDKEISNLPTSFDWRTQGAVTPVKDQGQCGSCWSFSTTG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+EG FL+ +L LSEQ LVDC + D GCNGGLM A++YI++ G++
Sbjct: 151 NVEGQWFLAGNDLTGLSEQNLVDCSTKND---------GCNGGLMPLAYDYIVENNGIDT 201
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTY 288
E YPY +C+F+ + I A + + +SS+E QM NLV +GPL++ +A Q Y
Sbjct: 202 EASYPYLAIQQKNCQFNPANIGAKIDGYYNVSSNETQMQINLVNNGPLSIAADAAEWQYY 261
Query: 289 IGGVSCPY--ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
G+ ICGK LDHG+LIVGYG F + +WIIKNSW +WG +G+ I
Sbjct: 262 KKGIFSGIFGICGKNLDHGILIVGYGQQ---TTEFGTELFWIIKNSWSTDWGLSGFMLIK 318
Query: 347 MGRNVCGVDSMVSS 360
G CG++ V+S
Sbjct: 319 RGTGECGINLAVTS 332
>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
Length = 379
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 185/313 (59%), Gaps = 24/313 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 82 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 142 IYL--NPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 199
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 200 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 250
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY Y G SC F K +++ V+S +E ++AA L K GP++V INA MQ Y
Sbjct: 251 DYSYQG-HMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISVAINAFGMQFYRH 309
Query: 291 GVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +WGE GYY +
Sbjct: 310 GISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLHR 362
Query: 348 GRNVCGVDSMVSS 360
G CGV++M SS
Sbjct: 363 GSGACGVNTMASS 375
>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 139/368 (37%), Positives = 187/368 (50%), Gaps = 53/368 (14%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L L+++L+++ V A+ +++ ++ + Q F+ FK K +
Sbjct: 8 LSLAAVLVVMACLVPAATASLHAEETLASQ-------------------FAEFKQKHGRV 48
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------FLGL 116
Y + E +R VF+ NL A+ +P A GVT FSDLT EFR + F
Sbjct: 49 YESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAA 108
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R R+P + + P DWR GAVT VKDQG CGSCW+FSA G +E FL
Sbjct: 109 QERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFL 162
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPY 234
+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++ G V E YPY
Sbjct: 163 AGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 235 TGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
+G S C + A ++ + DE Q+AA L +GP+AVG++A TY GGV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVGVDASSWMTYTGGV 273
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+ + LDHGVL+VGY S PYWIIKNSW WGE GY ++ G N C
Sbjct: 274 MTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEGGYIRVAKGSNQC 325
Query: 353 GVDSMVSS 360
V SS
Sbjct: 326 LVKEEASS 333
>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
Length = 460
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 140/324 (43%), Positives = 187/324 (57%), Gaps = 24/324 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY +QEE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 153 QDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKF 212
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT-DFDWRDHGAVTGVKDQGACGS 160
SDLT EFR +L N L+ P P D+P +DWR+ GAVT VKDQG CGS
Sbjct: 213 SDLTEEEFRTIYL--NPLLKDAPGRNMRPAQPVTDVPPPQWDWRNKGAVTNVKDQGMCGS 270
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 271 CWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDK---------TDKACLGGLPSNAYSAI 321
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G +C F K +++ +S +E ++AA L K+GP+++ I
Sbjct: 322 RTLGGLETEDDYSYRGR-LQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVSIAI 380
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G+S P +C +L DH VL+VGYG+ P+W IKNSWG +W
Sbjct: 381 NAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSAI-------PFWAIKNSWGTDW 433
Query: 338 GENGYYKICMGRNVCGVDSMVSSV 361
GE GYY + G CGV+ M SS
Sbjct: 434 GEEGYYYLHRGSGACGVNIMASSA 457
>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 129/313 (41%), Positives = 176/313 (56%), Gaps = 32/313 (10%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97
Query: 110 RRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
R +L + L+ P +K + T P DWR GAVT VKDQG CGSCW+FS
Sbjct: 98 RATYLNGAKYYAAALKRP---RKVVNVSTGKAPPAIDWRKKGAVTPVKDQGKCGSCWAFS 154
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
A G +EG ++ EL SLSEQ LV CD+ D GC GG ++ A ++I+ +
Sbjct: 155 AIGNIEGQWKVAGHELTSLSEQMLVSCDN---------MDYGCRGGFLDRALKWIVSSNK 205
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
G V E+ YPY TDG +KS + A +S + DE+ +A L K+GP+A+ ++
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVD 265
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A Y GGV SC L+HGVL+VGY S + PYWIIKNSWG+ WGE
Sbjct: 266 ASSFLDYTGGVLTSCS---SDALNHGVLLVGYDDS-------SKPPYWIIKNSWGKKWGE 315
Query: 340 NGYYKICMGRNVC 352
GY ++ G N C
Sbjct: 316 EGYIRVEKGTNQC 328
>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
Length = 491
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 140/330 (42%), Positives = 192/330 (58%), Gaps = 24/330 (7%)
Query: 37 SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAV 95
+ G S+D + F F + +++TY ++EE +R +F N+ RA++ Q LD TA
Sbjct: 178 NKGPLSKDFSMQMLSVFKNFLTTYNRTYESKEETQWRLSIFINNMVRAQKIQALDQGTAR 237
Query: 96 HGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKD 154
+G+TKFSDLT EFR +L N LR + P D P ++DWR+ GAVT VK+
Sbjct: 238 YGITKFSDLTEEEFRTIYL--NPLLREDPGKKMRVAKPVGDPAPPEWDWRNKGAVTNVKN 295
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD D C GGL +
Sbjct: 296 QGMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDK---------MDKACLGGLPS 346
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
+A+ I GG+E E+DY Y G +C F K +++ +S +E ++AA L K G
Sbjct: 347 NAYSAIKNLGGLETEEDYSYQG-QMQACNFSAEKAKVYINDSVELSHNEQKLAAWLAKKG 405
Query: 275 PLAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKN 331
P++V INA MQ Y G+S P +C +L DH VLIVGYG+ + P+W IKN
Sbjct: 406 PISVAINAFGMQFYRHGISRPLRPLCTPWLIDHAVLIVGYGNR-------SDIPFWAIKN 458
Query: 332 SWGENWGENGYYKICMGRNVCGVDSMVSSV 361
SWG +WGE GYY + G CGV++M SS
Sbjct: 459 SWGTDWGEQGYYYLHRGSGACGVNTMASSA 488
>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
Length = 443
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/333 (39%), Positives = 185/333 (55%), Gaps = 56/333 (16%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
FS FK+ ++ Y + E RF +F AN+++A +P A G +F+D++ EF+ +
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84
Query: 113 F-----------------LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
+ AD QK DWR GAVT VK+Q
Sbjct: 85 HNAARHYAAAKARRAKHTKSFTKEEIKAADGQK------------IDWRLKGAVTSVKNQ 132
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G+CGSCWSFS TG +EG + ++TG LVSLSEQ+LV CD + D+GCNGGLM++
Sbjct: 133 GSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQELVSCD---------TTDNGCNGGLMDN 183
Query: 216 AFEYIL--KAGGVEREKDYPYTGTDG--GSCKF--DKSKIAAAVSNFSVISSDEDQMAAN 269
AF +++ + G + E YPY +G +C + D + A +SNF I+ E+ MAA
Sbjct: 184 AFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAF 243
Query: 270 LVKHGPLAVGINAVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
+ +GPL++G++A Q+Y GG+ CP + +DHGVLIVGY + AP PYW
Sbjct: 244 VFNYGPLSIGVDASTWQSYAGGIITYCPDV---QIDHGVLIVGYDDT--AP-----TPYW 293
Query: 328 IIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
IIKNSW NWGE+GY ++ G N+CG+ S SS
Sbjct: 294 IIKNSWTANWGEDGYIRVAKGSNMCGLTSTPSS 326
>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
Length = 338
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 188/323 (58%), Gaps = 22/323 (6%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 30 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 89
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 90 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 148
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 149 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 199
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++V I
Sbjct: 200 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 258
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +W
Sbjct: 259 NAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS-------DVPFWAIKNSWGTDW 311
Query: 338 GENGYYKICMGRNVCGVDSMVSS 360
GE GYY + G CGV++M SS
Sbjct: 312 GEKGYYYLHRGSGACGVNTMASS 334
>gi|260830531|ref|XP_002610214.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
gi|229295578|gb|EEN66224.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
Length = 274
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/296 (43%), Positives = 171/296 (57%), Gaps = 26/296 (8%)
Query: 73 RFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI 131
R+ VF+ NL++A+ Q + TA +GVTKF DLT EFRR +L + PA
Sbjct: 1 RYFVFQDNLKKAETLQDSERGTAKYGVTKFMDLTEEEFRRYYL--TPVWKAPAKPLPPAT 58
Query: 132 LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVD 191
+P D PT FDWRDHGAVT VKDQG CGSCW+FS TG +EG + G L LSEQ
Sbjct: 59 IPKKDAPTAFDWRDHGAVTEVKDQGQCGSCWAFSTTGNIEGQWAIKKGNLPDLSEQH--- 115
Query: 192 CDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAA 251
+ +S ++ I G+E EK YPY D C D SK+
Sbjct: 116 ---------TSKIESCHINPIVKRTKRSIDGKSGLESEKAYPYEAKD-EQCHMDYSKVQV 165
Query: 252 AVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICG-KYLDHGVLI 308
+++ IS DE+ MA+ L ++GP+++GINA MQ Y+GG+S P+ C + LDHGVLI
Sbjct: 166 YINSSVNISKDENDMASWLAENGPISIGINAFPMQFYMGGISHPWRIFCNPEELDHGVLI 225
Query: 309 VGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAI 364
VGYG+ E PYWIIKNSWG+NWGE GYY + G VCG+++M +S +
Sbjct: 226 VGYGTK-------DETPYWIIKNSWGKNWGEEGYYLVYRGGGVCGLNTMCTSSVVL 274
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 133/360 (36%), Positives = 189/360 (52%), Gaps = 51/360 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+ IL+SLL++ +S+ L + DG HF FK K
Sbjct: 1 MKSFILASLLVVAVSATL----------------LKEDGA-----------HFQSFKLKH 33
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGL 116
KTY Q E RF +F+ NLR+ + +H G+ KF+D+T +EF+ L
Sbjct: 34 GKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFK-AMLAT 92
Query: 117 NRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+ + A K L +P DWR VT +KDQ CGSCW+F+ G+ EGA+
Sbjct: 93 QVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYA 152
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
LSTG+L SEQQLVDC + + GC+GG ++ F YI + G+E E DYPYT
Sbjct: 153 LSTGKLTRFSEQQLVDC--------TTDLNYGCDGGYLDDTFPYI-QTNGLELESDYPYT 203
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP 295
G D G C ++ SK+ VS++ + ++E + + GP+A+ INA +Q Y G+
Sbjct: 204 GYD-GYCSYESSKVVTKVSSYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSGIIDD 262
Query: 296 YICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGV 354
C +YLDHGVL VGY S + YW+IKNSWG +WGE+GY++ G+N+CGV
Sbjct: 263 KYCDPEYLDHGVLAVGYDSE-------NGRDYWLIKNSWGADWGESGYFRFLRGQNICGV 315
>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
Length = 460
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 143/331 (43%), Positives = 186/331 (56%), Gaps = 38/331 (11%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY +QEE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 153 QDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKF 212
Query: 102 SDLTPSEFRRQFL--------GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
SDLT EFR +L G N RL P T+ P +DWR+ GAVT VK
Sbjct: 213 SDLTEEEFRTIYLNPLLKDAPGRNMRLAQPV---------TDVPPPQWDWRNKGAVTDVK 263
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
DQG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD D C GGL
Sbjct: 264 DQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDK---------TDKACLGGLP 314
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
++A+ I GG+E E DY Y G +C F K +++ +S +E ++AA L K
Sbjct: 315 SNAYSAIRTLGGLETEDDYSYRG-HLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKK 373
Query: 274 GPLAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIK 330
GP++V INA MQ Y G+S P +C +L DH VL+VGYG+ P+W IK
Sbjct: 374 GPISVAINAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-------ATPFWAIK 426
Query: 331 NSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
NSWG NWGE GYY + G CGV+ M SS
Sbjct: 427 NSWGTNWGEEGYYYLHRGSGACGVNIMASSA 457
>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
Length = 485
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 188/323 (58%), Gaps = 22/323 (6%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 236 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 294
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 295 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 345
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++V I
Sbjct: 346 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSMELSQNEQKLAAWLAKRGPISVAI 404
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +W
Sbjct: 405 NAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDW 457
Query: 338 GENGYYKICMGRNVCGVDSMVSS 360
GE GYY + G CGV++M SS
Sbjct: 458 GEKGYYYLHRGSGACGVNTMASS 480
>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
Length = 477
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 143/331 (43%), Positives = 186/331 (56%), Gaps = 38/331 (11%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY +QEE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 170 QDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKF 229
Query: 102 SDLTPSEFRRQFL--------GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
SDLT EFR +L G N RL P T+ P +DWR+ GAVT VK
Sbjct: 230 SDLTEEEFRTIYLNPLLKDAPGRNMRLAQPV---------TDVPPPQWDWRNKGAVTDVK 280
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
DQG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD D C GGL
Sbjct: 281 DQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDK---------TDKACLGGLP 331
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
++A+ I GG+E E DY Y G +C F K +++ +S +E ++AA L K
Sbjct: 332 SNAYSAIRTLGGLETEDDYSYRG-HLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKK 390
Query: 274 GPLAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIK 330
GP++V INA MQ Y G+S P +C +L DH VL+VGYG+ P+W IK
Sbjct: 391 GPISVAINAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-------ATPFWAIK 443
Query: 331 NSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
NSWG NWGE GYY + G CGV+ M SS
Sbjct: 444 NSWGTNWGEEGYYYLHRGSGACGVNIMASSA 474
>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
Length = 392
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 188/323 (58%), Gaps = 22/323 (6%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 84 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 143
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 144 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 202
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 203 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 253
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++V I
Sbjct: 254 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 312
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +W
Sbjct: 313 NAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS-------DVPFWAIKNSWGTDW 365
Query: 338 GENGYYKICMGRNVCGVDSMVSS 360
GE GYY + G CGV++M SS
Sbjct: 366 GEKGYYYLHRGSGACGVNTMASS 388
>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
Length = 444
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 174/312 (55%), Gaps = 27/312 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG CGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD D GC GGLM+ AF++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTN---------DFGCEGGLMDDAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G DKS + A + + + DE+ +A L K+GP+A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q+Y GGV I ++LDHGVL+VGY + + PYWIIKNSW + WGE GY
Sbjct: 269 FQSYTGGVLTSCI-SEHLDHGVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYSA 320
Query: 345 I-----CMGRNV 351
+ C+ +N+
Sbjct: 321 LRRHNQCLMKNL 332
>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
Length = 484
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 188/323 (58%), Gaps = 22/323 (6%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 236 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 294
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 295 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 345
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++V I
Sbjct: 346 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 404
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +W
Sbjct: 405 NAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDW 457
Query: 338 GENGYYKICMGRNVCGVDSMVSS 360
GE GYY + G CGV++M SS
Sbjct: 458 GEKGYYYLHRGSGACGVNTMASS 480
>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 129/313 (41%), Positives = 175/313 (55%), Gaps = 32/313 (10%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97
Query: 110 RRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
R +L + L+ P +K + T P DWR GAVT VKDQ CGSCW+FS
Sbjct: 98 RATYLNGAKYYAAALKRP---RKVVTVSTGKAPPAIDWRKKGAVTPVKDQRKCGSCWAFS 154
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
A G +EG ++ EL SLSEQ LV CD+ D GC GGLM+ A ++I+ +
Sbjct: 155 AIGNIEGQWKVAGHELTSLSEQMLVSCDN---------MDDGCQGGLMDRALKWIVSSNK 205
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
G V E+ YPY TDG +KS + A +S + DE+ +A L K+GP+A+ ++
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVD 265
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A Y GGV SC L+H VL+VGY S + PYWIIKNSWG+ WGE
Sbjct: 266 ASSFLDYTGGVLTSCS---SDALNHDVLLVGYDDSS-------KPPYWIIKNSWGKKWGE 315
Query: 340 NGYYKICMGRNVC 352
GY ++ G N C
Sbjct: 316 EGYIRVEKGTNQC 328
>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
Length = 327
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 189/317 (59%), Gaps = 23/317 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 26 NARQLYEEFKLKYKKSYSN-DDDEYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V DQG CGSCW+FSA
Sbjct: 85 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ IL GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 194
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GPL+ +NA+++Q
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQ 253
Query: 287 TYIGGV--SCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+ P +C + L+H VL VGYG G PYW +KNSW +GENGY+
Sbjct: 254 FYTEGILHPLPALCDAQSLNHAVLTVGYGKEG-------RLPYWTVKNSWSTMFGENGYF 306
Query: 344 KICMGRNVCGVDSMVSS 360
+I G CG++++VS+
Sbjct: 307 RIYRGDGTCGINTLVST 323
>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
Length = 316
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 189/317 (59%), Gaps = 23/317 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 15 NARQLYEEFKLKYKKSYSN-DDDEYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 73
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V DQG CGSCW+FSA
Sbjct: 74 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 132
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ IL GG+
Sbjct: 133 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 183
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GPL+ +NA+++Q
Sbjct: 184 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQ 242
Query: 287 TYIGGV--SCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+ P +C + L+H VL VGYG G PYW +KNSW +GENGY+
Sbjct: 243 FYTEGILHPLPALCDAQSLNHAVLTVGYGKEG-------RLPYWTVKNSWSTMFGENGYF 295
Query: 344 KICMGRNVCGVDSMVSS 360
+I G CG++++VS+
Sbjct: 296 RIYRGDGTCGINTLVST 312
>gi|91992514|gb|ABE72973.1| cathepsin L [Aedes aegypti]
Length = 265
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 121/275 (44%), Positives = 164/275 (59%), Gaps = 25/275 (9%)
Query: 96 HGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ-------KAPILPTNDLPTDFDWRDHGA 148
+G+T F+D+T +E+R++ L +P D KA I +LP FDWR+ GA
Sbjct: 2 YGITHFADMTSAEYRQR-----TGLVIPRDEDRNHVGNPKAEIDENMELPESFDWRELGA 56
Query: 149 VTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGC 208
V+ VK+QG CGSCW+FS G +EG H + T L SEQ+L+DCD + DS C
Sbjct: 57 VSPVKNQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCD---------AVDSAC 107
Query: 209 NGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAA 268
GG M+ A++ I K GG+E E +YPY +C F+ +++ V + +E MA
Sbjct: 108 QGGYMDDAYKAIEKIGGLELESEYPYLAKKQKTCHFNSTEVHVRVKGAVDLPKNETAMAQ 167
Query: 269 NLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGVLIVGYGSSGFAPIRFKEKP 325
LV +GP+++G+NA MQ Y GG+S P+ +C K LDHGVLIVGYG + P+ K P
Sbjct: 168 YLVANGPISIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGYGVKEY-PMFNKTMP 226
Query: 326 YWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
YWI+KNSWG WGE GYY+I G N CGV M SS
Sbjct: 227 YWIVKNSWGPKWGEQGYYRIFRGDNTCGVSEMASS 261
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 179/323 (55%), Gaps = 31/323 (9%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK K+ KTY ++ +YRF VFK NL RA + Q ++ TA +GVT+F DLT
Sbjct: 302 NARQLYEEFKQKYKKTYVNDDD-EYRFSVFKENLLRAHQLQTMEQGTAEYGVTQFFDLTS 360
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPIL-PTNDLPTD---FDWRDHGAVTGVKDQGACGSCW 162
EF+ Q+LG D Q + P+ + D FDWRDHGAV V DQG CGSCW
Sbjct: 361 QEFQIQYLGFKYE-----DMQDTEEMSPSTRVVMDEDSFDWRDHGAVGPVLDQGKCGSCW 415
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS G +EG FL TGEL+SLSEQQL+DCD + D GCNGG + ++K
Sbjct: 416 AFSTIGNIEGQWFLKTGELLSLSEQQLIDCD---------NVDEGCNGGYPPKTYGAVIK 466
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
GG+E DYPY C D+ K+ +++ V +E A L GPL+ +NA
Sbjct: 467 MGGLELNSDYPYKAL-AEKCHMDRQKLKVYINDSVVFPRNEHLQAEALKLMGPLSSALNA 525
Query: 283 VWMQTYIGGVSCPYICG---KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
++ Y G+ + + L+H VL VGYG+ PYW +KNSWG +GE
Sbjct: 526 NPLKFYKTGIMHLPVASCFPRALNHAVLTVGYGTE-------NGLPYWTVKNSWGTAFGE 578
Query: 340 NGYYKICMGRNVCGVDSMVSSVA 362
+GY++I G CG++ +VS+ A
Sbjct: 579 DGYFRIYRGGGTCGINRLVSTAA 601
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 116/205 (56%), Gaps = 22/205 (10%)
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
+FDWR HGAV V +QG CGSCW+FSA G +EG FL +GEL+ LS QQ++DCDH
Sbjct: 42 NFDWRQHGAVGPVWNQGPCGSCWAFSAVGNIEGQWFLKSGELLHLSVQQVLDCDH----- 96
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
D GCNGG + + + GG++ + DY Y G C D+SK A V N SVI
Sbjct: 97 ----VDHGCNGGYPPQVYRQVNQMGGLQLDADYSYKAAV-GKCHTDRSKFRAYV-NSSVI 150
Query: 260 SSDEDQMAANLVKH-GPLAVGINAVWMQTYIGGV--SCPYICGK-YLDHGVLIVGYGSSG 315
S +Q AN +K GPLA +NA +Q Y G+ P C L+H VL VGYG+
Sbjct: 151 LSQNEQFQANKLKTIGPLASTLNARTLQFYRKGIMHPTPSACNPGQLNHAVLTVGYGTE- 209
Query: 316 FAPIRFKEKPYWIIKNSWGENWGEN 340
+ PYWI+KNSW +GE
Sbjct: 210 ------QGMPYWIVKNSWSRGFGEQ 228
>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 139/368 (37%), Positives = 186/368 (50%), Gaps = 53/368 (14%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L L+++L+++ V A+ +++ ++ + Q F+ FK K +
Sbjct: 8 LSLAAVLVVMACLVPAATASLHAEETLASQ-------------------FAEFKQKHGRV 48
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------FLGL 116
Y + E +R VF+ NL A+ +P A GVT FSDLT EFR + F
Sbjct: 49 YESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAA 108
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R R+P + + P DWR GAVT VKDQG CGSCW+FSA G +E FL
Sbjct: 109 QERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFL 162
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPY 234
+ L +LSEQ LV CD DSGC GGLMN+AFE+I++ G V E YPY
Sbjct: 163 AGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 235 TGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
+G S C + A ++ + DE Q+AA L +GP+AV ++A TY GGV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+ + LDHGVL+VGY S PYWIIKNSW WGE+GY +I G N C
Sbjct: 274 MTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTAQWGEDGYIRIAKGSNQC 325
Query: 353 GVDSMVSS 360
V SS
Sbjct: 326 LVKEEASS 333
>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
Length = 327
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 189/317 (59%), Gaps = 23/317 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 26 NARQLYEEFKLKYKKSYSN-DDDEYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V DQG CGSCW+FSA
Sbjct: 85 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ IL GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCD---------GVDEGCNGGTPQQAFKQILGMGGL 194
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GPL+ +NA+++Q
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQ 253
Query: 287 TYIGGV--SCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+ P +C + L+H VL VGYG G PYW +KNSW +GENGY+
Sbjct: 254 FYTEGILHPLPALCDAQSLNHAVLTVGYGKEG-------RLPYWTVKNSWSTMFGENGYF 306
Query: 344 KICMGRNVCGVDSMVSS 360
+I G CG++++VS+
Sbjct: 307 RIYRGDGTCGINTLVST 323
>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
Length = 548
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 141/322 (43%), Positives = 187/322 (58%), Gaps = 24/322 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 241 QDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKF 300
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGS 160
SDLT EFR +L N LR + DL P ++DWR GAVT VKDQG CGS
Sbjct: 301 SDLTEEEFRTIYL--NPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 358
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 359 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 409
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G SC F K +++ V+S +E ++AA L K GP++V I
Sbjct: 410 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISVAI 468
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +W
Sbjct: 469 NAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDW 521
Query: 338 GENGYYKICMGRNVCGVDSMVS 359
GE GYY + G CGV++M S
Sbjct: 522 GEKGYYYLHCGSEACGVNTMAS 543
>gi|14041143|emb|CAA71554.1| cathepsin [Geodia cydonium]
Length = 322
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 179/312 (57%), Gaps = 26/312 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
+K K++K Y++QEE R RV+ +NL+ + + +F+DL P EF + G
Sbjct: 22 WKLKYNKQYSSQEEDYLRQRVWLSNLKFVEEFDSEREGYTVAMNEFADLDPREFVSHYNG 81
Query: 116 LNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
L RR P + P D LPT DWR G VTGVK+QG CGSCW+FSATG+LEG
Sbjct: 82 LRRR---PHTSSGEPCTLGEDVSALPTTVDWRTKGYVTGVKNQGQCGSCWAFSATGSLEG 138
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
HF +TG+LVSLSEQ LVDC S + GCNGGL + AF+Y++K GG++ E Y
Sbjct: 139 QHFNATGKLVSLSEQNLVDC-------SSAEGNEGCNGGLPDDAFKYVIKNGGIDTEASY 191
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINA--VWMQTYI 289
PY D C + + I + S++ I S E Q+ GP+ VGI+A + Q Y
Sbjct: 192 PYVARD-EKCHYSSANIGSTCSSYVDIESKSEAQLQVASATVGPIPVGIDASHLGFQLYD 250
Query: 290 GGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
GGV +C + LDHGVL+VGYG +KEK YW++KNSWG NWG +G +
Sbjct: 251 GGVYHSDLCSQTRLDHGVLVVGYGV-------YKEKDYWMVKNSWGTNWGISGDMMMSRN 303
Query: 349 R-NVCGVDSMVS 359
R N CG+ +M S
Sbjct: 304 RDNNCGIATMAS 315
>gi|358339045|dbj|GAA32724.2| cathepsin F, partial [Clonorchis sinensis]
Length = 271
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 171/288 (59%), Gaps = 28/288 (9%)
Query: 80 NLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
L AKR Q ++ TA +GVT+FSDLT EF+ ++L R+R + P D+
Sbjct: 1 QLAAAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMRFDGPIVSEDLTPEEDVT 56
Query: 139 TD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
D FDWR+HGAV V DQG CGSCW+FS G +EG F TG+L++LSEQQLVDCDH
Sbjct: 57 MDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDH- 115
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
D GCNGG + I K GG+E DYPYTG D G C ++SK A V++
Sbjct: 116 --------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD-GICYMNQSKFVAYVND 166
Query: 256 FSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV--SCPYICGKY-LDHGVLIVGYG 312
+V+ E A L + GPL+ +NAV +Q Y+GG+ P++C + L+H VL VGYG
Sbjct: 167 STVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIPFLCNPHGLNHAVLTVGYG 226
Query: 313 SSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
+ F PYWI+KNSWG +GE GY++I G CG++ +VS+
Sbjct: 227 TE-FG------IPYWIVKNSWGVGFGEKGYFRIFRGAGTCGINLVVST 267
>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 138/368 (37%), Positives = 186/368 (50%), Gaps = 53/368 (14%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L L+++L+++ V A+ +++ ++ + Q F+ FK K +
Sbjct: 8 LSLAAVLVVMACLVPAATASLHAEETLASQ-------------------FAEFKQKHGRV 48
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------FLGL 116
Y + E +R VF+ NL A+ +P A GVT FSDLT EFR + F
Sbjct: 49 YGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTAFSDLTREEFRSRYHNGAAHFAAA 108
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R R+P + + P DWR GAVT VKDQG CGSCW+FSA G +E FL
Sbjct: 109 QERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFL 162
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPY 234
+ L +LSEQ LV CD DSGC GGLMN+AFE+I++ G V E YPY
Sbjct: 163 AGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQENNGAVYTEDSYPY 213
Query: 235 TGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
+G S C + A ++ + DE Q+AA L +GP+AV ++A TY GGV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+ + LDHGVL+VGY S PYW+IKNSW WGE+GY +I G N C
Sbjct: 274 MTSCV-SEQLDHGVLLVGYNDSAAV-------PYWVIKNSWTTQWGEDGYIRIAKGSNQC 325
Query: 353 GVDSMVSS 360
V SS
Sbjct: 326 LVKEEASS 333
>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
Length = 302
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 184/312 (58%), Gaps = 22/312 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 5 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 64
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+L R + P + K + P ++DWR GAVT VKDQG CGSCW+FS TG +E
Sbjct: 65 IYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVE 123
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E D
Sbjct: 124 GQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETEDD 174
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGG 291
Y Y G SC F K +++ +S +E ++AA L K GP++V INA MQ Y G
Sbjct: 175 YSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHG 233
Query: 292 VSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
+S P +C +L DH VL+VGYG+ + P+W IKNSWG +WGE GYY + G
Sbjct: 234 ISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLHRG 286
Query: 349 RNVCGVDSMVSS 360
CGV++M SS
Sbjct: 287 SGACGVNTMASS 298
>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
Length = 484
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 188/323 (58%), Gaps = 22/323 (6%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 236 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 294
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG ++G FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 295 CWAFSVTGNVKGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 345
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++V I
Sbjct: 346 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 404
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +W
Sbjct: 405 NAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDW 457
Query: 338 GENGYYKICMGRNVCGVDSMVSS 360
GE GYY + G CGV++M SS
Sbjct: 458 GEKGYYYLHRGSGACGVNTMASS 480
>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
Length = 517
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 185/313 (59%), Gaps = 24/313 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 220 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 279
Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR P + K + P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 280 IYL--NSLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 337
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 338 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 388
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY Y G SC F K +++ +S +E ++AA L K GP++V INA MQ Y
Sbjct: 389 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRH 447
Query: 291 GVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +WGE GYY +
Sbjct: 448 GISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLHR 500
Query: 348 GRNVCGVDSMVSS 360
G CGV++M SS
Sbjct: 501 GSGACGVNTMASS 513
>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
Length = 379
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 140/324 (43%), Positives = 187/324 (57%), Gaps = 24/324 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F +++TY ++EE +R +F N+ RA++ Q LD TA +GVTKF
Sbjct: 72 QDLTVKMASIFRNFVITYNRTYESKEEAQWRLSIFAHNMVRAQKIQALDRGTAQYGVTKF 131
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGS 160
SDLT EFR +L N LR + DL P ++DWR GAVT VKDQG CGS
Sbjct: 132 SDLTEEEFRTIYL--NPLLREEPGKKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 189
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL +SA+ I
Sbjct: 190 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------IDKACMGGLPSSAYSAI 240
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G +C F K +++ +S +E ++AA L K GP++V I
Sbjct: 241 KNLGGLETEDDYSYRG-HMQACSFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 299
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +W
Sbjct: 300 NAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS-------DIPFWAIKNSWGTDW 352
Query: 338 GENGYYKICMGRNVCGVDSMVSSV 361
GE GYY + G CGV++M SS
Sbjct: 353 GEKGYYYLHRGSGACGVNTMASSA 376
>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
Length = 440
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 129/316 (40%), Positives = 174/316 (55%), Gaps = 22/316 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAIDWRKKGAVTPVKDQGQCHSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD D GC GG + AF++I+ + G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DFGCGGGFSDPAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G DKS + A + + + DE+ +A L K GP+A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKKGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q+Y GGV I ++LDHGVL+VGY + + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQSYTGGVLTSCI-SEHLDHGVLLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C + ++ SS
Sbjct: 321 IEKGTNQCLMKNLPSS 336
>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 138/368 (37%), Positives = 186/368 (50%), Gaps = 53/368 (14%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L L+++L+++ V A+ +++ ++ + Q F+ FK K +
Sbjct: 8 LSLAAVLVVMACLVPAATASLHAEETLASQ-------------------FAEFKQKHGRV 48
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------FLGL 116
Y + E +R VF+ NL A+ +P A GVT FSDLT EFR + F
Sbjct: 49 YGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAA 108
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R R+P + + P DWR GAVT VKDQG CGSCW+FSA G +E FL
Sbjct: 109 QERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFL 162
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPY 234
+ L +LSEQ LV CD DSGC GGLMN+AFE+I++ G V E YPY
Sbjct: 163 AGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQENNGAVYTEGSYPY 213
Query: 235 TGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
+G S C + A ++ + DE Q+AA L +GP+AV ++A TY GGV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+ + LDHGVL+VGY S PYW+IKNSW WGE+GY +I G N C
Sbjct: 274 MTSCV-SEQLDHGVLLVGYNDSAAV-------PYWVIKNSWTTQWGEDGYIRIAKGSNQC 325
Query: 353 GVDSMVSS 360
V SS
Sbjct: 326 LVKEEASS 333
>gi|19747207|gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]
Length = 500
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 166/320 (51%), Gaps = 34/320 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 70 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 129
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 130 RYHNGAVHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 183
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 184 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 234
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
G V E YPY +G S C + A ++ + DE Q+AA L +GP+AV +
Sbjct: 235 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAV 294
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A TY GGV + + LDHGVL+VGY S PYWIIKNSW WGE
Sbjct: 295 DASSWMTYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEE 346
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY +I G N C V SS
Sbjct: 347 GYIRIAKGLNQCLVKEEASS 366
>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
Length = 490
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 184/314 (58%), Gaps = 24/314 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R +F N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 193 FKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 253 IYL--NPLLREEPSNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 310
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 311 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 361
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY Y G SC F K +++ +S +E ++AA L K GP++V INA MQ Y
Sbjct: 362 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRH 420
Query: 291 GVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +WGE GYY +
Sbjct: 421 GISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLHR 473
Query: 348 GRNVCGVDSMVSSV 361
G CGV++M SS
Sbjct: 474 GSGACGVNTMASSA 487
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 186/322 (57%), Gaps = 40/322 (12%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFS 102
+N F FK+KF+K Y + EE RF VF N+ R VH V +F+
Sbjct: 24 VNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFA 83
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DLT E+R+ +L L + Q+ + N DWR GAVT +K+QG CGSCW
Sbjct: 84 DLTNEEYRQLYLRPYPTELLGRERQEVWLDGPN--AGSVDWRQKGAVTPIKNQGQCGSCW 141
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYIL 221
SFS TG++EGAH ++TG LVSLSEQQLVDC SGS + GCNGGLM++AF+YI+
Sbjct: 142 SFSTTGSVEGAHAIATGNLVSLSEQQLVDC--------SGSFGNQGCNGGLMDNAFKYII 193
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGI 280
GG++ E+DYPYT DG K +SK A ++S + V ++EDQ+AA V+ GP++V I
Sbjct: 194 SNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAA-AVEKGPVSVAI 252
Query: 281 NA--VWMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
A Q Y GV S P CG LDHGVL+VGY S YWI+KNSWG +W
Sbjct: 253 EADQQSFQMYSSGVFSGP--CGTNLDHGVLVVGYTSD-----------YWIVKNSWGASW 299
Query: 338 GENGYYKICMGRNV-----CGV 354
G+ GY I M R V CG+
Sbjct: 300 GDQGY--IMMKRGVSSAGICGI 319
>gi|11464864|gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 166/320 (51%), Gaps = 34/320 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
G V E YPY +G S C + A ++ + DE Q+AA L +GP+AV +
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAV 261
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A TY GGV + + LDHGVL+VGY S PYWIIKNSW WGE
Sbjct: 262 DASSWMTYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEE 313
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY +I G N C V SS
Sbjct: 314 GYIRIAKGSNQCLVKEEASS 333
>gi|118157|sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName:
Full=Major cysteine proteinase; Flags: Precursor
gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi]
gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi]
Length = 467
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 166/320 (51%), Gaps = 34/320 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
G V E YPY +G S C + A ++ + DE Q+AA L +GP+AV +
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAV 261
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A TY GGV + + LDHGVL+VGY S PYWIIKNSW WGE
Sbjct: 262 DASSWMTYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEE 313
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY +I G N C V SS
Sbjct: 314 GYIRIAKGSNQCLVKEEASS 333
>gi|71663163|ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 166/320 (51%), Gaps = 34/320 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAVHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
G V E YPY +G S C + A ++ + DE Q+AA L +GP+AV +
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAV 261
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A TY GGV + + LDHGVL+VGY S PYWIIKNSW WGE
Sbjct: 262 DASSWMTYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEE 313
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY +I G N C V SS
Sbjct: 314 GYIRIAKGSNQCLVKEEASS 333
>gi|343477207|emb|CCD11901.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 171/316 (54%), Gaps = 22/316 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+EG ++ EL SLSEQ LV CD + D GC GG + AF++IL G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTD---------DFGCRGGFSDPAFKWILWSNKGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G +CK + A +SN + DED + L + GP+A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPTCKMSGKVVGAKISNRLYLPEDEDMITEWLARKGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q+Y GGV I K +++G L+VGY + + PYWIIKNSW + WGE GY +
Sbjct: 269 FQSYTGGVLTSCI-SKEMNYGALLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C V ++ SS
Sbjct: 321 IEKGTNQCLVKNLPSS 336
>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
Length = 442
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/322 (39%), Positives = 182/322 (56%), Gaps = 28/322 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F FK+ ++ YA+ +E RF +F AN+++A +P A G +F+D++ EF
Sbjct: 22 EVLFRDFKTTHARNYASADEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEF 81
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTND-----LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ + + A K T + + DWR GAVT VK+QG+CGSCWSF
Sbjct: 82 QTRHNAARHYAAVMARPPKNTKTFTEEEINAAVGQKVDWRLKGAVTPVKNQGSCGSCWSF 141
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
S TG +EG H ++TG+LVSLSEQ+LV CD + D GC+GGLM++AF ++L A
Sbjct: 142 STTGNIEGQHAIATGQLVSLSEQELVSCD---------TVDDGCSGGLMDNAFGWLLSAH 192
Query: 224 -GGVEREKDYPYTGTDG--GSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
G + E YPY +G +C F+ + + A +++F I E MAA + K+GPL++
Sbjct: 193 NGQITTEASYPYVSGNGIVPACTFNSNSNPVGATITSFHDIPKTERDMAAFVFKYGPLSI 252
Query: 279 GINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
G++A Q+YIGG+ + +DHGVLIVG+ + PYWIIKNSW WG
Sbjct: 253 GVDASSWQSYIGGI-LSHCSDVQIDHGVLIVGFDDTA-------STPYWIIKNSWSSMWG 304
Query: 339 ENGYYKICMGRNVCGVDSMVSS 360
E GY ++ G N CG+ S SS
Sbjct: 305 EQGYIRVAKGSNQCGLTSFPSS 326
>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
Length = 322
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 123/306 (40%), Positives = 177/306 (57%), Gaps = 25/306 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV----HGVTKFSDLTPSE 108
F FK + K+Y Q E RF +F+AN+ ++ L + + +F+DLT E
Sbjct: 26 FETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQFTDLTQEE 85
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
F+ +LGL+ + L Q L ++PT DWR G VTGVK+QG+CGSCWSF+ TG
Sbjct: 86 FKA-YLGLHVKPVLNNTIQYE--LKGLEVPTSVDWRSAGQVTGVKNQGSCGSCWSFALTG 142
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+ EGA++ +LVSLSEQQLVDC S S + GCNGG +++ F YI + G++
Sbjct: 143 STEGAYYRKHKQLVSLSEQQLVDC--------STSINYGCNGGFLDATFPYIEQY-GLQT 193
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTY 288
E YPYTG D GSCK+D SK+ +SN+ + E ++ + GP+A+ ++A ++ +Y
Sbjct: 194 ESSYPYTGVD-GSCKYDSSKVVTKISNYVSLHGSESKVLEPVGSIGPVAITMDASYLSSY 252
Query: 289 IGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G+ C L+H VL+VGYGS + YWI+KNSWG WGE GY+++
Sbjct: 253 SSGIYAANKCTTTNLNHAVLVVGYGSQ-------NGQNYWIVKNSWGSGWGEQGYFRLLR 305
Query: 348 GRNVCG 353
G N CG
Sbjct: 306 GSNECG 311
>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
Length = 338
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 187/323 (57%), Gaps = 22/323 (6%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 30 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 89
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 90 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 148
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 149 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 199
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E DY Y G SC F K +++ +S +E ++AA L K GP++V I
Sbjct: 200 KNLGGLETVDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 258
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +W
Sbjct: 259 NAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS-------DVPFWAIKNSWGTDW 311
Query: 338 GENGYYKICMGRNVCGVDSMVSS 360
GE GYY + G CGV++M SS
Sbjct: 312 GEKGYYYLHRGSGACGVNTMASS 334
>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
Length = 320
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 181/310 (58%), Gaps = 29/310 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANL-----RRAKRRQLLDPTAVHGVTKFSDLTPS 107
F FK K +KTY T E R+ +F+A L ++ Q L+ T GV KFSD T
Sbjct: 23 FQAFKLKQNKTYKTPVEETTRYGIFQAKLLEIEEHNSRFEQGLE-TYKKGVNKFSDWTQD 81
Query: 108 EFRRQFLGLNRRLRLPADAQKA-PILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF +LGL+ + PA K P + T +P DWR G VTGVK+QG CGSCW+FS
Sbjct: 82 EFN-AYLGLHPK---PAKLGKGIPYVKTGVSVPASVDWRTEGYVTGVKNQGDCGSCWAFS 137
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG++EGA F STG+LVSLSEQQLVDC + G+ + GC+GG + F YI + G
Sbjct: 138 LTGSVEGALFKSTGKLVSLSEQQLVDCTY-------GTVNFGCDGGYLEETFPYIQET-G 189
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWM 285
+E E YPY D G+CKFD SK+ ++++ DE+ + GP++V ++A ++
Sbjct: 190 LEAEASYPYKARD-GTCKFDASKVVTKINDYVYWYGDEEALLEATATIGPISVAMDANYI 248
Query: 286 QTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
+Y GV +C L+HGVL+VGYGS YW++KNSW E+WGE+GY K
Sbjct: 249 DSYASGVFSSRLCSSDDLNHGVLVVGYGSENGV-------NYWLVKNSWAEDWGESGYLK 301
Query: 345 ICMGRNVCGV 354
+ G+N CG+
Sbjct: 302 LLRGQNECGI 311
>gi|375073980|gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei]
Length = 467
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 133/315 (42%), Positives = 170/315 (53%), Gaps = 22/315 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYKSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 QFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
++ G A+ + +P DWR GAVT VKDQG CGSCW+FSA G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGVPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVER 228
E FL+ L +LSEQ LV CD DSGC+GGLMN AFE+I++ G V
Sbjct: 157 ESQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNDAFEWIVQENDGAVYT 207
Query: 229 EKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E+ YPY +G S C + A ++ + DE Q+AA L +GP+AV ++A
Sbjct: 208 EESYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAANGPVAVAVDATSWM 267
Query: 287 TYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
TY GGV + + LDHGVL+VGY S AP+ PYWIIKNSW WGE+GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDS--APV-----PYWIIKNSWTTLWGEDGYIRIA 319
Query: 347 MGRNVCGVDSMVSSV 361
G N C V SS
Sbjct: 320 KGSNQCLVKEEASSA 334
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 133/317 (41%), Positives = 176/317 (55%), Gaps = 29/317 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y ++ E +R ++F N + AK QL V G+ K++D+ E
Sbjct: 27 WQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADMLHHE 86
Query: 109 FRRQFLGLNRRLRLPADAQKA-----PILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCW 162
F+ G N +R AQ+ I P N +P DWR HGAVT VKDQG CGSCW
Sbjct: 87 FKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHCGSCW 146
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
SFS+TG+LEG HF G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 147 SFSSTGSLEGQHFRKAGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKD 199
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGIN 281
GGV+ EK YPY G D SC F+K+ + A + F + DE+ M + GP+AV I+
Sbjct: 200 NGGVDTEKSYPYEGID-DSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAID 258
Query: 282 AV--WMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A Q Y GV + P LDHGVL+VGYG+ + YW++KNSWG WG
Sbjct: 259 ASNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDK------DGQDYWLVKNSWGTTWG 312
Query: 339 ENGYYKICMGR-NVCGV 354
+ GY K+ + N CG+
Sbjct: 313 DQGYIKMARNQDNQCGI 329
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 137/331 (41%), Positives = 182/331 (54%), Gaps = 35/331 (10%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKF 101
L+ E H +K + K YA + E +R ++F N + AK QL V G+ K+
Sbjct: 23 LIKEEWH--TYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKY 80
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN------DLPTDFDWRDHGAVTGVKDQ 155
+D+ EF+ G N LR + + T +P DWR+HGAVTGVKDQ
Sbjct: 81 ADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQ 140
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FS+TGALEG HF G LVSLSEQ LVDC + ++GCNGGLM++
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDN 193
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
AF YI GG++ EK YPY G D SC F+K+ I A + F + DE++M + G
Sbjct: 194 AFRYIKDNGGIDTEKSYPYEGID-DSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMG 252
Query: 275 PLAVGINAVW--MQTYIGGVSCPYICGKY-LDHGVLIVGYGS--SGFAPIRFKEKPYWII 329
P++V I+A Q Y GV C + LDHGVL+VGYG+ SG YW++
Sbjct: 253 PVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGM--------DYWLV 304
Query: 330 KNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
KNSWG WGE GY K+ + N CG+ + S
Sbjct: 305 KNSWGTTWGEQGYIKMARNQNNQCGIATASS 335
>gi|71406896|ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 426
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 167/322 (51%), Gaps = 34/322 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
G V E YPY +G S C + A ++ + DE Q+AA L +GP+AV +
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAV 261
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A TY GGV + + LDHGVL+VGY S PYWIIKNSW WGE
Sbjct: 262 DASSWMTYTGGVMTSCV-SEQLDHGVLLVGYNDSA-------AVPYWIIKNSWTTQWGEE 313
Query: 341 GYYKICMGRNVCGVDSMVSSVA 362
GY +I G N C V SS A
Sbjct: 314 GYIRIAKGLNQCLVKEEASSAA 335
>gi|8468607|gb|AAF75547.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 139/368 (37%), Positives = 186/368 (50%), Gaps = 53/368 (14%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L L+++L+++ V A+ +++ ++ + Q F+ FK K +
Sbjct: 8 LSLAAVLVVMACLVPAATASLHAEETLASQ-------------------FAEFKQKHGRV 48
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------FLGL 116
Y + E +R VF+ NL A+ +P A GVT FSDLT EF + F
Sbjct: 49 YESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFWSRYHNGAAHFAAA 108
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R R+P + + P DWR GAVT VKDQG CGSCW+FSA G +E FL
Sbjct: 109 QERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFL 162
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPY 234
+ L +LSEQ LV CD DSGC GGLMN+AFE+I++ G V E YPY
Sbjct: 163 AGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQENNGAVYTEGSYPY 213
Query: 235 TGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
+G S C + A ++ I DE Q+AA L +GP+AV ++A TY GGV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVEIPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+ + LDHGVL+VGY S PYW+IKNSW +WGE GY +I G N C
Sbjct: 274 MTSCV-SEQLDHGVLLVGYNDSAAV-------PYWVIKNSWTTHWGEGGYIRIAKGSNQC 325
Query: 353 GVDSMVSS 360
V VSS
Sbjct: 326 LVKEGVSS 333
>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
Length = 443
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 127/324 (39%), Positives = 180/324 (55%), Gaps = 32/324 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F FK+ ++ YA+ +E RF +F N+++A +P A G +F+D+T EF
Sbjct: 22 EVLFGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEF 81
Query: 110 RRQF----LGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ + + R P + + + DWR GAVT VK+QGACGSCWSF
Sbjct: 82 QTRHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGSCWSF 141
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
S TG +EG H ++TG+LV++SEQ+LV CD D GCNGGLM++AF +++ A
Sbjct: 142 STTGNIEGQHAIATGQLVAVSEQELVSCD---------PIDDGCNGGLMDNAFGWLISAH 192
Query: 224 -GGVEREKDYPYTGTDG----GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
G + E +YPY +G S + + A +S F I+ E+ MAA + KHGPL++
Sbjct: 193 KGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGPLSI 252
Query: 279 GINAVWMQTYIGGVS--CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
G++A Q+Y GG+ CP +DHGVLIVG+ + PYWIIKNSW N
Sbjct: 253 GVDASTWQSYAGGIMSYCPQ---DQIDHGVLIVGFDDTA-------STPYWIIKNSWTAN 302
Query: 337 WGENGYYKICMGRNVCGVDSMVSS 360
WGE GY ++ G N CG+ S SS
Sbjct: 303 WGEEGYIRVAKGSNQCGLTSHPSS 326
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 183/312 (58%), Gaps = 33/312 (10%)
Query: 62 KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y E + RF++FK NL+ + + D T G+T+F+DLT EFR +L +++
Sbjct: 53 KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110
Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
D+ K + D LP + DWR +GAV VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGEL+SLSEQ+LVDCD G ++GC+GG+MN AFE+I+K GG+E ++DYPY
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGGVS 293
D G C DK+ V+ + + D+++ V H P++V I A Q Y GV
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVM 283
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV-- 351
CG LDHGV++VGYGS+ + YWII+NSWG NWG++GY K + RN+
Sbjct: 284 TG-TCGISLDHGVVVVGYGST-------SGEDYWIIRNSWGLNWGDSGYVK--LQRNIDD 333
Query: 352 ----CGVDSMVS 359
CG+ M S
Sbjct: 334 PFGKCGIAMMPS 345
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 134/326 (41%), Positives = 192/326 (58%), Gaps = 34/326 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+P+D +D LL + F+ + K K Y+ EE +RF V+K NL +R + +
Sbjct: 31 MPTD--VGKDQLLAGQ--FAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEKNLSY 86
Query: 95 VHGVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
G+TKF+DLT EFRRQ+ G +RRL+ +A + ++ P DWR+ GAVT
Sbjct: 87 WLGLTKFADLTNEEFRRQYTGTRIDRSRRLKKGRNATGSFRYANSEAPKSIDWREKGAVT 146
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
VKDQG+CGSCW+FSA G++EG + + TG+ +SLS Q+LVDCD + + GCNG
Sbjct: 147 SVKDQGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKK--------YNQGCNG 198
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAA---AVSNFSVISSDEDQMA 267
GLM+ AF+++++ GG++ EKDYPY G DG + D +K+ A + ++ + ++++
Sbjct: 199 GLMDYAFDFVIQNGGIDTEKDYPYQGYDG---RCDVNKMNARVVTIDSYEDVPENDEEAL 255
Query: 268 ANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKP 325
V P++V I A Q Y GGV CG LDHGVL VGYGS K
Sbjct: 256 KKAVAGQPVSVAIEAGGRDFQLYSGGVFTGR-CGTDLDHGVLAVGYGSE-------KGLD 307
Query: 326 YWIIKNSWGENWGENGYYKICMGRNV 351
YWI+KNSWGE WGE+GY + M RN+
Sbjct: 308 YWIVKNSWGEYWGESGYLR--MQRNL 331
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 147/374 (39%), Positives = 203/374 (54%), Gaps = 35/374 (9%)
Query: 1 MERLILSSLLLLLLSSVL-ASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH--- 52
M R+ +S LL+L++ V ASA + D I+QVV SDG E S ++ H
Sbjct: 1 MARVSPASFLLILIACVAGASAGSSFADQNPIKQVV-SDGLRELEASVLQVIGQTRHSLA 59
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ F ++ K+Y T EE RF +F +L+ + + GV +F+DLT EFR+
Sbjct: 60 FARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNKKGLSYTLGVNEFADLTWEEFRKH 119
Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
LG + A + L LP DWR+ G VT VK+QG CGSCW+FS TGALE
Sbjct: 120 RLGAAQNC--SATLKGNHKLTNGLLPLKKDWREVGIVTPVKNQGHCGSCWTFSTTGALEA 177
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
A+ + G+ + LSEQQLVDC + + GCNGGL + AFEYI GG++ E+ Y
Sbjct: 178 AYVQAFGKAIFLSEQQLVDCARAYN-------NFGCNGGLPSQAFEYIKANGGLDTEEAY 230
Query: 233 PYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQTY 288
PYTG D G CKF I V N ++ + DE + A V+ P++V V + Y
Sbjct: 231 PYTGVD-GVCKFSSENIGVQVLDSVNITLGAEDELKDAVAFVR--PVSVAFEVVSGFRLY 287
Query: 289 IGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
GV CG ++H V+ VGYG + PYW+IKNSWG +WG+NGY+K+
Sbjct: 288 KSGVYTSDTCGNTPMDVNHAVVAVGYGVE-------NDVPYWLIKNSWGADWGDNGYFKM 340
Query: 346 CMGRNVCGVDSMVS 359
MG+N+CGV + S
Sbjct: 341 EMGKNMCGVATCAS 354
>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
Length = 428
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 179/321 (55%), Gaps = 32/321 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK+ ++ YA+ +E RF +F N+++A +P A G +F+D+T EF+ +
Sbjct: 10 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTR 69
Query: 113 F----LGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
+ R P + + + DWR GAVT VK+QGACGSCWSFS T
Sbjct: 70 HNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGSCWSFSTT 129
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GG 225
G +EG H ++TG+LV++SEQ+LV CD D GCNGGLM++AF +++ A G
Sbjct: 130 GNIEGQHAIATGQLVAVSEQELVSCD---------PIDDGCNGGLMDNAFGWLISAHKGQ 180
Query: 226 VEREKDYPYTGTDG----GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E +YPY +G S + + A +S F I+ E+ MAA + KHGPL++G++
Sbjct: 181 IATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGPLSIGVD 240
Query: 282 AVWMQTYIGGVS--CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A Q+Y GG+ CP +DHGVLIVG+ + PYWIIKNSW NWGE
Sbjct: 241 ASTWQSYAGGIMSYCPQ---DQIDHGVLIVGFDDTA-------STPYWIIKNSWTANWGE 290
Query: 340 NGYYKICMGRNVCGVDSMVSS 360
GY ++ G N CG+ S SS
Sbjct: 291 EGYIRVAKGSNQCGLTSHPSS 311
>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
Length = 335
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 170/311 (54%), Gaps = 30/311 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT +E R F+ NL + Q +P A G+TKF DL+ EF +
Sbjct: 30 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 89
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 90 YLSGATHFAKAKKFASQYYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 149
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +L+T L+SLSEQ+LV CD D GCNGGLM AF+++L + G V
Sbjct: 150 NIESKWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMGQAFDWLLNNRNGAV 200
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
YPY +G + +S I A + I S+ED MAA L +GP+A+ ++A
Sbjct: 201 YTGASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDAS 260
Query: 284 WMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+Y GGV SC GK L+HGVL+VGY +G E PYW+IKNSWGENWGE G
Sbjct: 261 AFMSYTGGVLTSCD---GKQLNHGVLLVGYNMTG-------EVPYWVIKNSWGENWGEKG 310
Query: 342 YYKICMGRNVC 352
Y ++ G N C
Sbjct: 311 YVRVRKGTNEC 321
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 183/312 (58%), Gaps = 33/312 (10%)
Query: 62 KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y E + RF++FK NL+ + + D T G+T+F+DLT EFR +L +++
Sbjct: 53 KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110
Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
D+ K + D LP + DWR +GAV VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGEL+SLSEQ+LVDCD G ++GC+GG+MN AFE+I+K GG+E ++DYPY
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGGVS 293
D G C DK+ V+ + + D+++ V H P++V I A Q Y GV
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVM 283
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV-- 351
CG LDHGV++VGYGS+ + YWII+NSWG NWG++GY K + RN+
Sbjct: 284 TG-TCGISLDHGVVVVGYGST-------SGEDYWIIRNSWGLNWGDSGYVK--LQRNIDD 333
Query: 352 ----CGVDSMVS 359
CG+ M S
Sbjct: 334 PFGKCGIAMMPS 345
>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
Length = 358
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 143/371 (38%), Positives = 201/371 (54%), Gaps = 35/371 (9%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSL 55
+ ILSS++L++L + A+A D+ IR V SDG E+S +L H F+
Sbjct: 4 KTILSSVVLVVLFAASAAANIGFDESNPIRMV--SDGLREVEESVSQILGQSRHVLSFAR 61
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F ++ K Y EE RF +FK NL + + GV +F+DLT EF+R LG
Sbjct: 62 FTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLG 121
Query: 116 LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+ A + + + LP DWR+ G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 122 AAQNC--SATLKGSHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYH 179
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ EK YPYT
Sbjct: 180 QAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYT 232
Query: 236 GTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGG 291
G D +CKF + V N ++ + DE + A LV+ P+++ + + Y G
Sbjct: 233 GKD-ETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVR--PVSIAFEVIHSFRLYKSG 289
Query: 292 VSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
V CG ++H VL VGYG PYW+IKNSWG +WG+ GY+K+ MG
Sbjct: 290 VYTDSHCGSTPMDVNHAVLAVGYGVEDGV-------PYWLIKNSWGADWGDKGYFKMEMG 342
Query: 349 RNVCGVDSMVS 359
+N+CG+ + S
Sbjct: 343 KNMCGIATCAS 353
>gi|77628008|ref|NP_001029282.1| cathepsin F precursor [Rattus norvegicus]
gi|71681040|gb|AAH99780.1| Cathepsin F [Rattus norvegicus]
gi|149062007|gb|EDM12430.1| cathepsin F, isoform CRA_a [Rattus norvegicus]
gi|159895422|gb|ABX09995.1| cathepsin F [Rattus norvegicus]
Length = 462
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 190/323 (58%), Gaps = 24/323 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY ++EE +R VF N+ RA++ Q LD TA +G+TKF
Sbjct: 155 QDFSVKMATLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKF 214
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGS 160
SDLT EF +L N L+ + + + NDL P ++DWR GAVT VKDQG CGS
Sbjct: 215 SDLTEEEFHTIYL--NPLLQKESGGKMSLAKSINDLAPPEWDWRKKGAVTEVKDQGMCGS 272
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 273 CWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYTAI 323
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G +C F +++ +S DE+++AA L + GP++V I
Sbjct: 324 KNLGGLETEDDYGYQG-HVQACNFSTQMAKVYINDSVELSRDENKIAAWLAQKGPISVAI 382
Query: 281 NAVWMQTYIGGVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G++ P+ +C ++DH VL+VGYG+ PYW IKNSWG +W
Sbjct: 383 NAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNRS-------NIPYWAIKNSWGRDW 435
Query: 338 GENGYYKICMGRNVCGVDSMVSS 360
GE GYY + G CGV++M SS
Sbjct: 436 GEEGYYYLYRGSGACGVNTMASS 458
>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 173/313 (55%), Gaps = 32/313 (10%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P E
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEL 97
Query: 110 RRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
R +L + L+ P +K + T P DWR GAVT VKDQ CGSCW+FS
Sbjct: 98 RATYLNGAKYYAAALKRP---RKVVNVSTGKAPPAVDWRKKGAVTPVKDQRKCGSCWAFS 154
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
ATG +EG ++ EL SLSEQ LV CD+ D GC GGLM+ A ++I+ +
Sbjct: 155 ATGNIEGQWKVAGHELTSLSEQMLVSCDN---------MDDGCQGGLMDRALKWIVSSNK 205
Query: 224 GGVEREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
G V E+ YPY TDG C + A +S + DE+ +A L K+GP+A+ ++
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNMSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVD 265
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A Y GGV SC L+H VL+VGY + + PYWIIKNSWG+ WGE
Sbjct: 266 ASSFLDYKGGVLTSCS---SDALNHDVLLVGYDDTS-------KPPYWIIKNSWGKKWGE 315
Query: 340 NGYYKICMGRNVC 352
GY ++ G N C
Sbjct: 316 EGYIRVEKGTNQC 328
>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
Length = 459
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 143/344 (41%), Positives = 197/344 (57%), Gaps = 28/344 (8%)
Query: 25 NDDDAMIRQVVP--SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR 82
+D + + V+P + +D + F F + +++TY TQEE +R VF N+
Sbjct: 132 DDRNETLSSVLPLLNKDPLPQDFSVKMASIFKEFVTTYNRTYGTQEEAQWRLSVFSNNMV 191
Query: 83 RAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTD 140
RA++ Q LD TA +G+TKFSDLT EFR +L N L+ + D P +
Sbjct: 192 RAQKIQALDRGTAQYGITKFSDLTEEEFRAIYL--NPLLKENRNKMMHLAKSIGDHAPPE 249
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
+DWR GAVT VK+QG CGSCW+FS TG +EG FL G+L+SLSEQ+L+DCD
Sbjct: 250 WDWRTKGAVTNVKNQGMCGSCWAFSVTGNVEGQWFLKQGDLLSLSEQELLDCD------- 302
Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
D C GGL ++A+ I GG+E E DY Y+G +C F K +++ +S
Sbjct: 303 --KVDKACLGGLPSNAYLAIKNLGGLETEDDYSYSG-HLQTCSFSAKKAKVYINDSVELS 359
Query: 261 SDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGS-SGF 316
+E ++AA L K GP++V INA MQ Y G+S P +C +L DH VL+VGYG+ SG
Sbjct: 360 QNEQKLAAWLAKKGPISVAINAFGMQFYRRGISHPLRPLCSPWLIDHAVLLVGYGNRSGI 419
Query: 317 APIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
P+W IKNSWG +WGE GYY + G CGV++M SS
Sbjct: 420 --------PFWAIKNSWGTDWGEEGYYYLYRGSGACGVNAMASS 455
>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
Length = 381
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 184/313 (58%), Gaps = 24/313 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 84 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 143
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 144 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 201
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 202 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 252
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY Y G +C F K +++ +S +E ++AA L K GP++V INA MQ Y
Sbjct: 253 DYSYRG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINAFGMQFYRH 311
Query: 291 GVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +WGE GYY +
Sbjct: 312 GISRPLRPLCSPWLIDHAVLLVGYGNRS-------DIPFWAIKNSWGTDWGEKGYYYLHR 364
Query: 348 GRNVCGVDSMVSS 360
G CGV++M SS
Sbjct: 365 GSGACGVNTMASS 377
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 128/310 (41%), Positives = 180/310 (58%), Gaps = 32/310 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRR 111
++ + +K SKTY E + RF +FK NLR + + T G+T+F+DLT E+R
Sbjct: 48 YNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRA 107
Query: 112 QFLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+FLG R ++ +Q+ + LP DWR GAV+ +KDQG+CGSCW+FS
Sbjct: 108 KFLGTKSDPKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFST 167
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
A+EG + + TGEL+SLSEQ+LVDCD S ++GCNGGLM++AF++I+ GG+
Sbjct: 168 IAAVEGVNKIVTGELISLSEQELVDCDR--------SYNAGCNGGLMDNAFQFIINNGGI 219
Query: 227 EREKDYPYTGTDGGSCKFDKSKI---AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
+ +KDYPY DG K D +K+ A + F + + ++ V H P++V I A
Sbjct: 220 DTDKDYPYQAVDG---KCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVAIEAS 276
Query: 284 WM--QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
M Q Y GV CG LDHGV+IVGYG+ YW+++NSWG +WGENG
Sbjct: 277 GMALQFYQSGVFTGE-CGSALDHGVVIVGYGTEDGI-------DYWLVRNSWGRDWGENG 328
Query: 342 YYKICMGRNV 351
Y K M RNV
Sbjct: 329 YIK--MQRNV 336
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 127/302 (42%), Positives = 172/302 (56%), Gaps = 20/302 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F + K K Y++ EEH +R+ V+K NL +R + + G+TKF+D+T EFRR
Sbjct: 45 QFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNRSYWLGLTKFADITNDEFRR 104
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
Q+ G + + ++ P DWR GAVT VKDQG+CGSCW+FSA G++E
Sbjct: 105 QYTGTRIDRSKRSKRKTGFRYADSEAPESVDWRKKGAVTTVKDQGSCGSCWAFSAIGSVE 164
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TGE VSLSEQ+LVDCD E + GCNGGLM+ AF++IL+ GG++ E D
Sbjct: 165 GINAIRTGEAVSLSEQELVDCDLE--------YNQGCNGGLMDYAFDFILENGGIDTEND 216
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYI 289
YPY G DG K+ + + + ++++ V P++V I A Q Y
Sbjct: 217 YPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYS 276
Query: 290 GGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
GGV CG LDHGVL VGYGS G YWI+KNSWGE WGE+GY + M R
Sbjct: 277 GGVFTGE-CGTDLDHGVLAVGYGSEG-------SLDYWIVKNSWGEYWGESGYLR--MQR 326
Query: 350 NV 351
N+
Sbjct: 327 NI 328
>gi|8468605|gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 138/368 (37%), Positives = 185/368 (50%), Gaps = 53/368 (14%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L L+++L+++ V A+ +++ ++ + Q F+ FK K +
Sbjct: 8 LSLAAVLVVMACLVPAATASLHAEETLASQ-------------------FAEFKQKHGRV 48
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------FLGL 116
Y + E +R VF+ NL A+ +P A GVT FSDLT EFR + F
Sbjct: 49 YESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAA 108
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R R+P + + P DWR GAVT VKDQG CGSCW+FSA G +E FL
Sbjct: 109 QERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFL 162
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPY 234
+ L +LSEQ LV CD DSGC GGLMN+AF +I++ G V E YPY
Sbjct: 163 AGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFGWIVQENNGAVYTENSYPY 213
Query: 235 TGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
+G S C + A ++ + DE Q+AA L +GP+AV ++A TY GGV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+ + LDHGVL+VGY S PYWIIKNSW WGE+GY +I G N C
Sbjct: 274 MTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTAQWGEDGYIRIAKGSNQC 325
Query: 353 GVDSMVSS 360
V SS
Sbjct: 326 LVKEEASS 333
>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
Length = 484
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 184/314 (58%), Gaps = 24/314 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 247 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 304
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 305 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 355
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY Y G +C F K +++ +S +E ++AA L K GP++V INA MQ Y
Sbjct: 356 DYSYRG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINAFGMQFYRH 414
Query: 291 GVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +WGE GYY +
Sbjct: 415 GISRPLRPLCSPWLIDHAVLLVGYGNR-------SDIPFWAIKNSWGTDWGEKGYYYLHR 467
Query: 348 GRNVCGVDSMVSSV 361
G CGV++M SS
Sbjct: 468 GSGACGVNTMASSA 481
>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
Length = 460
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 184/313 (58%), Gaps = 24/313 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 163 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 223 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 280
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 281 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 331
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY Y G +C F K +++ +S +E ++AA L K GP++V INA MQ Y
Sbjct: 332 DYSYRG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRH 390
Query: 291 GVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G+S P +C +L DH VL+VGYG+ + P+W IKNSWG +WGE GYY +
Sbjct: 391 GISRPLRPLCSPWLIDHAVLLVGYGNR-------SDIPFWAIKNSWGTDWGEKGYYYLHR 443
Query: 348 GRNVCGVDSMVSS 360
G CGV++M SS
Sbjct: 444 GSGACGVNTMASS 456
>gi|4826565|emb|CAB42884.1| cathepsin F [Mus musculus]
Length = 462
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 191/324 (58%), Gaps = 24/324 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY ++EE +R VF N+ RA++ Q LD TA +G+TKF
Sbjct: 155 QDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKF 214
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGS 160
SDLT EF +L N L+ + + +P NDL P ++DWR GAVT VK+QG CGS
Sbjct: 215 SDLTEEEFHTIYL--NPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGS 272
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 273 CWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDK---------VDKACLGGLPSNAYAAI 323
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G +C F +++ +S +E+++AA L + GP++V I
Sbjct: 324 KNLGGLETEDDYGYQG-HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAI 382
Query: 281 NAVWMQTYIGGVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G++ P+ +C ++DH VL+VGYG+ PYW IKNSWG +W
Sbjct: 383 NAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNR-------SNIPYWAIKNSWGSDW 435
Query: 338 GENGYYKICMGRNVCGVDSMVSSV 361
GE GYY + G CGV++M SS
Sbjct: 436 GEEGYYYLYRGSGACGVNTMASSA 459
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 142/361 (39%), Positives = 190/361 (52%), Gaps = 51/361 (14%)
Query: 5 ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTY 64
+L+ + L ++SVLA VAV D L N EH FK F KTY
Sbjct: 140 VLTIEMRLYIASVLALVVAVGAD------------------LTNFEH----FKEHFGKTY 177
Query: 65 ATQEEHDYRFRVFKANLRRAKRRQLLDPTA---VHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+EH R +F+ NL ++ + G+T+F+D++ +EFR+ +LGL
Sbjct: 178 EG-DEHALRQGIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAEFRQTYLGLRMNAS 236
Query: 122 LPADA---QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
A Q+ + DLP DWRD GAV+ VKDQG CGSCW+FS +GA+EG HFL
Sbjct: 237 TIAKLRKLQREVVADDRDLPEAVDWRDKGAVSPVKDQGQCGSCWAFSTSGAIEGQHFLKN 296
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GEL+SLSEQQ+VDC D GCNGG A EY+ GG+E E YPY G
Sbjct: 297 GELLSLSEQQMVDCSW---------LDFGCNGGQPMLAMEYVRFNGGLELETAYPYKGV- 346
Query: 239 GGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCP 295
GGSC DK AA ++ F + E + + K GP++VG++A Q Y G+ P
Sbjct: 347 GGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPISVGMDASGEDFQHYKSGIYNP 406
Query: 296 YICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCG 353
C LDH VL VGYG+S + YW++KNSW +WGE GY+K+ + N CG
Sbjct: 407 ESCSSIGLDHAVLAVGYGTS-------DDGDYWLVKNSWNTSWGEKGYFKLPRNKGNKCG 459
Query: 354 V 354
+
Sbjct: 460 I 460
>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
Length = 332
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 188/314 (59%), Gaps = 24/314 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + +++TY ++EE +R VF N+ RA++ Q LD TA +G+TKFSDLT EF
Sbjct: 35 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 94
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N L+ + + +P NDL P ++DWR GAVT VK+QG CGSCW+FS TG +
Sbjct: 95 IYL--NPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 152
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 153 EGQWFLNRGTLLSLSEQELLDCD---------KVDKACLGGLPSNAYAAIKNLGGLETED 203
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY Y G +C F +++ +S +E+++AA L + GP++V INA MQ Y
Sbjct: 204 DYGYQG-HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQFYRH 262
Query: 291 GVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G++ P+ +C ++DH VL+VGYG+ PYW IKNSWG +WGE GYY +
Sbjct: 263 GIAHPFRPLCSPWFIDHAVLLVGYGNR-------SNIPYWAIKNSWGSDWGEEGYYYLYR 315
Query: 348 GRNVCGVDSMVSSV 361
G CGV++M SS
Sbjct: 316 GSGACGVNTMASSA 329
>gi|9845246|ref|NP_063914.1| cathepsin F precursor [Mus musculus]
gi|12643321|sp|Q9R013.1|CATF_MOUSE RecName: Full=Cathepsin F; Flags: Precursor
gi|6467384|gb|AAF13147.1|AF136280_1 cathepsin F precursor [Mus musculus]
gi|7141165|gb|AAF37228.1|AF217224_1 cathepsin F [Mus musculus]
gi|26344728|dbj|BAC36013.1| unnamed protein product [Mus musculus]
gi|37589148|gb|AAH58758.1| Cathepsin F [Mus musculus]
gi|148701127|gb|EDL33074.1| cathepsin F, isoform CRA_b [Mus musculus]
Length = 462
Score = 224 bits (571), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 191/324 (58%), Gaps = 24/324 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY ++EE +R VF N+ RA++ Q LD TA +G+TKF
Sbjct: 155 QDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKF 214
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGS 160
SDLT EF +L N L+ + + +P NDL P ++DWR GAVT VK+QG CGS
Sbjct: 215 SDLTEEEFHTIYL--NPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGS 272
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 273 CWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDK---------VDKACLGGLPSNAYAAI 323
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G +C F +++ +S +E+++AA L + GP++V I
Sbjct: 324 KNLGGLETEDDYGYQG-HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAI 382
Query: 281 NAVWMQTYIGGVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G++ P+ +C ++DH VL+VGYG+ PYW IKNSWG +W
Sbjct: 383 NAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNR-------SNIPYWAIKNSWGSDW 435
Query: 338 GENGYYKICMGRNVCGVDSMVSSV 361
GE GYY + G CGV++M SS
Sbjct: 436 GEEGYYYLYRGSGACGVNTMASSA 459
>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
Length = 459
Score = 224 bits (571), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 185/312 (59%), Gaps = 22/312 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + +++TY T+EE +R +F +N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 162 FKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 221
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+L + ++A + + P ++DWR GAVT VKDQG CGSCW+FS TG +E
Sbjct: 222 IYLNPLLKEEPGVKMRRAKSV-GDSAPPEWDWRSKGAVTEVKDQGMCGSCWAFSVTGNVE 280
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E D
Sbjct: 281 GQWFLNRGALLSLSEQELLDCD---------KVDKACMGGLPSNAYSAIKTLGGLETEDD 331
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGG 291
Y Y G +C F K +++ ++ +E ++AA L K GP++V INA MQ Y G
Sbjct: 332 YSYHG-HLQACSFSAEKAKVYINDSVELTKNEQKLAAWLAKKGPISVAINAFGMQFYRHG 390
Query: 292 VSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
+S P +C +L DH VL+VGYG+ P+W IKNSWG +WGE GYY + G
Sbjct: 391 ISHPLRPLCSPWLIDHAVLLVGYGNRSAV-------PFWAIKNSWGTDWGEEGYYYLYRG 443
Query: 349 RNVCGVDSMVSS 360
CGV++M SS
Sbjct: 444 SGACGVNTMASS 455
>gi|11066228|gb|AAG28508.1|AF197480_1 cathepsin F [Mus musculus]
Length = 462
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 191/324 (58%), Gaps = 24/324 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY ++EE +R VF N+ RA++ Q LD TA +G+TKF
Sbjct: 155 QDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKF 214
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGS 160
SDLT EF +L N L+ + + +P NDL P ++DWR GAVT VK+QG CGS
Sbjct: 215 SDLTEEEFHTIYL--NPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGS 272
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 273 CWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDK---------VDKACLGGLPSNAYAAI 323
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G +C F +++ +S +E+++AA L + GP++V I
Sbjct: 324 KNLGGLETEDDYGYQG-HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAI 382
Query: 281 NAVWMQTYIGGVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G++ P+ +C ++DH VL+VGYG+ PYW IKNSWG +W
Sbjct: 383 NAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNR-------SNIPYWAIKNSWGSDW 435
Query: 338 GENGYYKICMGRNVCGVDSMVSSV 361
GE GYY + G CGV++M SS
Sbjct: 436 GEEGYYYLYRGSGACGVNTMASSA 459
>gi|71663165|ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi]
Length = 467
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 165/320 (51%), Gaps = 34/320 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD D GC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DFGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
G V E YPY +G S C + A ++ + DE Q+AA L +GP+AV +
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAV 261
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A TY GGV + + LDHGVL+VGY S PYWIIKNSW WGE
Sbjct: 262 DASSWMTYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEE 313
Query: 341 GYYKICMGRNVCGVDSMVSS 360
GY +I G N C V SS
Sbjct: 314 GYIRIAKGSNQCLVKEEASS 333
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 127/315 (40%), Positives = 177/315 (56%), Gaps = 29/315 (9%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV----HGVTKF 101
L N F FK K SK+Y+ Q E R +F NLR + L + V +F
Sbjct: 18 LENVGSTFQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAGLVSYNKSVNQF 77
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGS 160
+DLT EF+ +L L+ + L P + T +PT DWR G VTGVKDQG CGS
Sbjct: 78 TDLTIDEFK-AYLTLHSKPTL----NTVPYVRTGLQVPTTLDWRSQGYVTGVKDQGDCGS 132
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS G+ EGA++ STG+LVSLSEQQL+DC + + + GC+GG + F Y+
Sbjct: 133 CWAFSVVGSTEGAYYKSTGKLVSLSEQQLIDC--------TTNVNDGCDGGYLEETFPYV 184
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
+ G V E YPYTG D G+C+ +S + VS + ++ + D + A + GP++V +
Sbjct: 185 QQTGLVS-ESSYPYTGRD-GNCRISESDVVTKVSKYVLLGGEADLLEA-VGSVGPVSVAM 241
Query: 281 NAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
+A ++ +Y GV +C Y L+HGVL+VGYG+ K YW+IKNSWG WGE
Sbjct: 242 DATYIYSYASGVYESSLCSLYSLNHGVLVVGYGTQ-------DGKDYWLIKNSWGNTWGE 294
Query: 340 NGYYKICMGRNVCGV 354
GY K+ G N CG+
Sbjct: 295 QGYLKLLRGTNECGI 309
>gi|343475823|emb|CCD12886.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 126/313 (40%), Positives = 169/313 (53%), Gaps = 22/313 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+EG ++ EL SLSEQ LV CD + D GC GL + AF++IL G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCELGLKDPAFQWILWSNKGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G +C + A +SN + DED +A L + GP+A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDMSGKVVGAKISNMRYLPLDEDTIAEWLARKGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y GGV I + L++G L+VGY + + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQRYTGGVLTSCI-SRRLNYGALLVGYDDTS-------KPPYWIIKNSWGKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSM 357
I G N C V ++
Sbjct: 321 IEKGTNQCLVKNL 333
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 145/374 (38%), Positives = 205/374 (54%), Gaps = 41/374 (10%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSL 55
+ IL S++L++L + A+A D+ IR V SDG E+S +L H F+
Sbjct: 4 KTILPSVVLVILIAASAAADIGFDESNPIRMV--SDGLREIEESVVQILGQSRHVLSFAR 61
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F ++ K Y EE RF +FK NL R +++L + GV +F+DLT EF+R
Sbjct: 62 FTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNKKRL---SYKLGVNQFADLTWQEFQRN 118
Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
LG + A + + L LP DWR+ G V+ VKDQG CGSCW+FS TGALE
Sbjct: 119 KLGAAQNC--SATLKGSHKLTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEA 176
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
A+ + G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ Y
Sbjct: 177 AYHQAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAY 229
Query: 233 PYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTY 288
PYTG D G+CK+ + V N ++ + DE + A LV+ P+++ V + Y
Sbjct: 230 PYTGKD-GTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLVR--PVSIAFEVVKSFRLY 286
Query: 289 IGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
GV CG ++H VL VGYG PYW+IKNSWG +WG+ GY+K+
Sbjct: 287 KSGVYTDSHCGNTPMDVNHAVLAVGYGIEDGV-------PYWLIKNSWGADWGDKGYFKM 339
Query: 346 CMGRNVCGVDSMVS 359
MG+N+CG+ + S
Sbjct: 340 EMGKNMCGIATCAS 353
>gi|118350314|ref|XP_001008438.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89290205|gb|EAR88193.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 389
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 127/345 (36%), Positives = 189/345 (54%), Gaps = 38/345 (11%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSD 103
+L + FS FK++ K Y EE RF +F+ NL ++ Q+ + TA +G+T+FSD
Sbjct: 32 NLTQVKQLFSKFKAEHKKFYNFLEEQR-RFEIFRQNLDIISELNQVEEGTAEYGITQFSD 90
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+T EF+ Q L + R ++ + D PT +DWRDHGAVT VK+QG G+CW
Sbjct: 91 MTTEEFKSQILIPSTYARNFTGSRYHGFQKISQDAPTSYDWRDHGAVTPVKNQGTVGTCW 150
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG FL+ LVSLSE+Q+VDCD +P +G D G GG AF+Y++
Sbjct: 151 TFSTTGNIEGQWFLAGNPLVSLSEEQIVDCDGSQEP-STGHADCGVFGGWPYLAFDYVIN 209
Query: 223 AGGVEREKDYPYTGTDGG--------------------------SCKFDKSKIAAAVSNF 256
AGG+ E+ YPY +GG C+ + IAA + ++
Sbjct: 210 AGGLPSEETYPYCVGNGGCYPCPAPGYNETLCGPAVPYCNATAYPCRQGQVPIAAKIEDW 269
Query: 257 SVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSG 315
+S DED + L + GPL+V ++A ++Q Y G+S P C K L+H VL+ GYG
Sbjct: 270 KALSKDEDSIKQQLFEIGPLSVALDASYLQFYKKGISAPKFCSKTTLNHAVLLTGYGIDN 329
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
+W +KNSWG WGE GY+++ G +CG+++ V++
Sbjct: 330 GV-------EFWNVKNSWGAKWGEQGYFRLKRGVGMCGINTQVAT 367
>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 127/308 (41%), Positives = 169/308 (54%), Gaps = 22/308 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQGACGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGACGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC GGLM+ + ++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
+ YPY G +KS + A +S + DE+ +A L K+GP+A+ ++A
Sbjct: 209 FTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Y GGV I K LDH VL+VGY + + PYWIIKNSW + WGE GY +
Sbjct: 269 FLGYKGGVLTSCI-SKGLDHDVLLVGYNDT-------SKPPYWIIKNSWSKGWGEEGYIR 320
Query: 345 ICMGRNVC 352
I G N C
Sbjct: 321 IEKGTNQC 328
>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 127/308 (41%), Positives = 169/308 (54%), Gaps = 22/308 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQGACGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC GGLM+ + ++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
+ YPY G +KS + A +S + DE+ +A L K+GP+A+ ++A
Sbjct: 209 FTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Y GGV I K LDH VL+VGY + + PYWIIKNSW + WGE GY +
Sbjct: 269 FLGYKGGVLTSCI-SKGLDHDVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYIR 320
Query: 345 ICMGRNVC 352
I G N C
Sbjct: 321 IEKGTNQC 328
>gi|148701126|gb|EDL33073.1| cathepsin F, isoform CRA_a [Mus musculus]
Length = 417
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 191/324 (58%), Gaps = 24/324 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY ++EE +R VF N+ RA++ Q LD TA +G+TKF
Sbjct: 110 QDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKF 169
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGS 160
SDLT EF +L N L+ + + +P NDL P ++DWR GAVT VK+QG CGS
Sbjct: 170 SDLTEEEFHTIYL--NPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGS 227
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 228 CWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDK---------VDKACLGGLPSNAYAAI 278
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G +C F +++ +S +E+++AA L + GP++V I
Sbjct: 279 KNLGGLETEDDYGYQG-HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAI 337
Query: 281 NAVWMQTYIGGVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G++ P+ +C ++DH VL+VGYG+ PYW IKNSWG +W
Sbjct: 338 NAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNR-------SNIPYWAIKNSWGSDW 390
Query: 338 GENGYYKICMGRNVCGVDSMVSSV 361
GE GYY + G CGV++M SS
Sbjct: 391 GEEGYYYLYRGSGACGVNTMASSA 414
>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 170/311 (54%), Gaps = 30/311 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT +E R F+ NL + Q +P A G+TKF DL+ EF +
Sbjct: 38 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 98 YLSGATHFAKAKKFASQYYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +L+T L+SLSEQ+LV CD D GCNGGLM AF+++L + G V
Sbjct: 158 NIESKWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
YPY +G + +S I A + I S+ED MAA L +GP+A+ ++A
Sbjct: 209 YTGASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDAS 268
Query: 284 WMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+Y GGV SC GK L+HGVL+VGY +G E PYW+IKNSWGENWGE G
Sbjct: 269 AFMSYTGGVLTSCD---GKQLNHGVLLVGYNMTG-------EVPYWLIKNSWGENWGEKG 318
Query: 342 YYKICMGRNVC 352
Y ++ G N C
Sbjct: 319 YVRVRKGTNEC 329
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 175/320 (54%), Gaps = 27/320 (8%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTP 106
H+ L+K +K Y+ EEH R ++ NL++ + L VH G+ K++D+T
Sbjct: 26 QHWKLWKEANNKRYSDAEEH-VRRATWEGNLQKVQEHNLQADLGVHTYWLGMNKYADMTV 84
Query: 107 SEFRRQFLGLNRRLR--LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+EF + G N +R D LP DWRD G VT VKDQG CGSCW+F
Sbjct: 85 TEFVKVMNGYNATMRGQRTQDRHTFSFNSKIALPDTVDWRDKGYVTDVKDQGQCGSCWAF 144
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TGALEG HF TG+LVSLSEQ LVDC + + GCNGGLM+ AFEYI +
Sbjct: 145 STTGALEGQHFKQTGKLVSLSEQNLVDCSGK-------QGNMGCNGGLMDQAFEYIKENN 197
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINA- 282
G++ E YPY D C+F + + A + F+ I+S DE + + GP++V I+A
Sbjct: 198 GIDTEDSYPYEAVD-NQCRFKAANVGATDTGFTDITSKDESALQQAVATVGPISVAIDAG 256
Query: 283 -VWMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV + P+ LDHGVL VGYG+ K YW++KNSWGE WG+
Sbjct: 257 HTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYGTD-------SGKDYWLVKNSWGEGWGDK 309
Query: 341 GYYKICMG-RNVCGVDSMVS 359
GY K+ RN CG+ + S
Sbjct: 310 GYIKMTRNKRNQCGIATAAS 329
>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
Length = 442
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 127/308 (41%), Positives = 169/308 (54%), Gaps = 22/308 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 33 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 92
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQGACGSCW+FSA G
Sbjct: 93 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFSAIG 152
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC GGLM+ + ++I+ + G V
Sbjct: 153 NIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNKGNV 203
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
+ YPY G +KS + A +S + DE+ +A L K+GP+A+ ++A
Sbjct: 204 FTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDATS 263
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Y GGV I K LDH VL+VGY + + PYWIIKNSW + WGE GY +
Sbjct: 264 FLGYKGGVLTSCI-SKGLDHDVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYIR 315
Query: 345 ICMGRNVC 352
I G N C
Sbjct: 316 IEKGTNQC 323
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 134/358 (37%), Positives = 196/358 (54%), Gaps = 27/358 (7%)
Query: 1 MERLILSSLLLLLLSSVLASA-VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSK 59
M+ + +++L L S++++ +++ + DA S D +NA + L K
Sbjct: 1 MKLIPMATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVK-- 58
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--- 116
KTY E D RF++FK NLR D T G+ KF+DLT E+R + G+
Sbjct: 59 HGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTI 118
Query: 117 -NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+++ + + + LP DWR+ GAVT VKDQG+CGSCW+FS TG++EG +
Sbjct: 119 DDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNK 178
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ TG+L+S+SEQ+LV+CD S + GCNGGLM+ AFE+I+K GG++ E+DYPYT
Sbjct: 179 IVTGDLISVSEQELVNCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYT 230
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVS 293
G DG K K+ + ++ + +++ V + P+AV I A Q Y G+
Sbjct: 231 GKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIF 290
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG LDHGVL GYG+ K YW++KNSWG WGE GY K M RN+
Sbjct: 291 TG-SCGTALDHGVLAAGYGTE-------DGKDYWLVKNSWGAEWGEGGYLK--MERNI 338
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 178/320 (55%), Gaps = 22/320 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTP 106
N H+ FK++ +K Y + E R +F+ N + + + G+ F DLT
Sbjct: 76 NLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTN 135
Query: 107 SEFRRQFLGLNRRLRLPADAQK--APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
E+R ++LG R P+ A + D+P DWRD G VT VK+QG CGSCW+F
Sbjct: 136 KEYRERYLGYRRPENTPSKASYIFSRAEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAF 195
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SA G+LEG HF STG+LVSLSEQ LVDC PE +SGCNGG M+ AFEY+
Sbjct: 196 SAVGSLEGQHFKSTGKLVSLSEQNLVDCS---TPE----GNSGCNGGWMDQAFEYVKDNH 248
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAV 283
G++ E YPY GTD GSC F I A + F V DE+ + + GP++V I+A
Sbjct: 249 GIDTEDSYPYVGTD-GSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVSVAIDAS 307
Query: 284 WM--QTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
M Q Y GGV + P+ LDHGVL+VGYG +F+ K +W++KNSWG WG
Sbjct: 308 SMLFQFYRGGVYNVPWCSTSELDHGVLVVGYGK------QFQGKDFWMVKNSWGVGWGIY 361
Query: 341 GYYKICMGR-NVCGVDSMVS 359
GY ++ + N CG+ S S
Sbjct: 362 GYIEMSRNKGNQCGIASKAS 381
>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
Length = 327
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 130/317 (41%), Positives = 187/317 (58%), Gaps = 23/317 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 26 NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V D+G CGSCW+FSA
Sbjct: 85 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDKGDCGSCWAFSA 143
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ IL GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 194
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GP + +NA+ +Q
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPFSSALNALSLQ 253
Query: 287 TYIGGV--SCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+ P +C + L+H VL VGYG G PYW +KNSW +GENGY+
Sbjct: 254 FYTEGILHPLPALCDAQSLNHAVLTVGYGKEG-------RLPYWTVKNSWSTMFGENGYF 306
Query: 344 KICMGRNVCGVDSMVSS 360
+I G CG++++VS+
Sbjct: 307 RIYRGDGPCGINTLVST 323
>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 170/311 (54%), Gaps = 30/311 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT +E R F+ NL + Q +P A G+TKF DL+ EF +
Sbjct: 38 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +L+T L+SLSEQ+LV CD D GCNGGLM AF+++L + G V
Sbjct: 158 NIESQWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
YPY +G + +S I A + I S+ED MAA L +GP+A+ ++A
Sbjct: 209 YTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDAS 268
Query: 284 WMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+Y GGV SC GK L+HGVL+VGY +G E PYW+IKNSWGENWGE G
Sbjct: 269 AFMSYTGGVLTSCD---GKQLNHGVLLVGYNMTG-------EVPYWLIKNSWGENWGEKG 318
Query: 342 YYKICMGRNVC 352
Y ++ G N C
Sbjct: 319 YVRVRKGTNEC 329
>gi|355681647|gb|AER96812.1| cathepsin F [Mustela putorius furo]
Length = 408
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 141/352 (40%), Positives = 197/352 (55%), Gaps = 40/352 (11%)
Query: 24 VNDDDAMIRQVVPSDGEQS--EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
+D + + V+P ++ +D + F F + +++TY ++EE +R VF N+
Sbjct: 81 TDDKNETLSSVLPLLNKEPLPQDFSVKMASIFKEFVTTYNRTYESKEETQWRMSVFSNNM 140
Query: 82 RRAKRRQLLDP-TAVHGVTKFSDLTPSEFR--------RQFLGLNRRLRLPADAQKAPIL 132
RA++ Q LD TA +GVTKFSDLT EFR R++ G N RL D
Sbjct: 141 MRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNPLLREYRGKNMRL----DKSTG--- 193
Query: 133 PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDC 192
+ P+++DWR GAVT VK+QG CGSCW+FS TG +EG FL G L+SLSEQ+L+DC
Sbjct: 194 --DSAPSEWDWRRKGAVTKVKNQGMCGSCWAFSVTGNVEGQWFLKQGALLSLSEQELLDC 251
Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
D D C GGL ++A+ I GG+E E DY Y G +C F K
Sbjct: 252 DK---------VDKACLGGLPSNAYSAIKTLGGLETEDDYSYRGR-MQTCGFSPKKARVY 301
Query: 253 VSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIV 309
+++ +S +E+ +AA L + GP++V INA MQ Y G+S P +C +L DH VL+V
Sbjct: 302 INDSVELSQNEETLAAWLAEKGPISVAINAFGMQFYRHGISHPLRPLCSPWLIDHAVLLV 361
Query: 310 GYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSV 361
GYG+ P+W IKNSWG +WGE GYY + G CGV++M SS
Sbjct: 362 GYGNRSGT-------PFWAIKNSWGSDWGEEGYYYLHRGSGACGVNTMASSA 406
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 189/326 (57%), Gaps = 38/326 (11%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLT 105
+ H+ LFK + +KTY Q++ R +F+AN+++ LL + G+ F+D+T
Sbjct: 23 DEHWELFKRQHNKTY-LQKQDVGRRAIFEANIKKINAHNLLYDLGRSSYRLGLNGFADMT 81
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND-----LPTDFDWRDHGAVTGVKDQGACGS 160
P EF + R R A+ + L D +P DWR G VT VK+QG CGS
Sbjct: 82 PDEFEKY-----RGTRFEANEARVSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGS 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TGALEG HF +G+LVSLSEQ LVDC + ++GCNGGLM++AF +I
Sbjct: 137 CWAFSTTGALEGQHFRRSGDLVSLSEQMLVDC-------SAVYGNAGCNGGLMDNAFRFI 189
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQM--AANLVKHGPLA 277
AGG+E EK YPYTG D G+C FD I A ++ F V S DE+ + AA +V GP++
Sbjct: 190 KDAGGLETEKSYPYTGKD-GTCHFDARGIGAKLTGFVDVPSRDEEALKEAAGVV--GPVS 246
Query: 278 VGINAVW--MQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
V I+A Q Y GV C LDHGVL+VGYG++ K YW++KNSWG
Sbjct: 247 VAIDASGQNFQFYKDGVYDEITCSSTSLDHGVLVVGYGTT------RDGKDYWLVKNSWG 300
Query: 335 ENWGENGYYKICMGR-NVCGVDSMVS 359
+WG++GY ++ + N CG+ +M S
Sbjct: 301 SSWGQSGYIQMSRNKENQCGIATMAS 326
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 139/370 (37%), Positives = 199/370 (53%), Gaps = 39/370 (10%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M I S L LL+ SVL ++++ V ++ ++E A + + +
Sbjct: 1 MATSIKSITLALLIFSVLLISLSLG-------SVTATETTRNEAE---ARRMYERWLVEN 50
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLGLN-R 118
K Y E + RF +FK NL+ + + + T G+T+F+DLT EFR +L
Sbjct: 51 RKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKME 110
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R R+P +K + LP DWR GAV VKDQG+CGSCW+FSA GA+EG + + T
Sbjct: 111 RTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKT 170
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GEL+SLSEQ+LVDCD S + GC GGLM+ AF++I++ GG++ E+DYPY TD
Sbjct: 171 GELISLSEQELVDCDT--------SYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATD 222
Query: 239 GGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCP 295
C DK + + + ++++ + + P++V I A Q Y GV
Sbjct: 223 VNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTG 282
Query: 296 YICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV---- 351
CG LDHGV+ VGYGS G + YWI++NSWG NWGE+GY+K + RN+
Sbjct: 283 -TCGTSLDHGVVAVGYGSEG-------GQDYWIVRNSWGSNWGESGYFK--LERNIKESS 332
Query: 352 --CGVDSMVS 359
CGV M S
Sbjct: 333 GKCGVAMMAS 342
>gi|301784869|ref|XP_002927853.1| PREDICTED: cathepsin F-like [Ailuropoda melanoleuca]
Length = 394
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 141/350 (40%), Positives = 201/350 (57%), Gaps = 30/350 (8%)
Query: 23 AVNDDDAMIRQVVPSDGEQS--EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
+D + + V+P ++ +D + F F + +++TY ++EE ++R VF N
Sbjct: 65 VTDDKNETLSSVLPLLNKEPLPQDFSVRMVSIFKEFVTTYNRTYESKEEAEWRMSVFSNN 124
Query: 81 LRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDL 137
+ RA++ Q LD TA +G+TKFSDLT EFR +L N LR +K + + +
Sbjct: 125 VMRAQKIQALDRGTAQYGITKFSDLTEEEFRTIYL--NPLLR-ENRGKKMDLAKSIGDSA 181
Query: 138 PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECD 197
P ++DWR+ GAVT VKDQG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD
Sbjct: 182 PPEWDWRNKGAVTQVKDQGMCGSCWAFSVTGNVEGQWFLKRGALLSLSEQELLDCDK--- 238
Query: 198 PEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS 257
D C GGL ++A+ I GG+E E DY Y G +C F K +++
Sbjct: 239 ------VDKACLGGLPSNAYSAIKTLGGLETEDDYSYRG-HVQTCSFSSKKARVYINDSV 291
Query: 258 VISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGS- 313
+S +E ++ A L ++GP++V INA MQ Y G+S P +C +L DH VL+VGYG+
Sbjct: 292 ELSQNEQKLVAWLAQNGPISVAINAFGMQFYRRGISHPLRPLCSPWLIDHAVLLVGYGNR 351
Query: 314 SGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAA 363
SG P+W IKNSWG +WGE GYY + G CGV++M SS
Sbjct: 352 SGI--------PFWAIKNSWGTDWGEEGYYYLHRGSGACGVNTMASSAVV 393
>gi|11464866|gb|AAG35358.1|AF314930_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 132/326 (40%), Positives = 167/326 (51%), Gaps = 34/326 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
G V E YPY +G S C + A ++ + DE Q+AA L +GP+AV +
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVGLPQDEAQIAAWLAVNGPVAVAV 261
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A TY GGV + + LDHGVL+VGY S PYWIIKNS WGE
Sbjct: 262 DASSWMTYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSRTTQWGEE 313
Query: 341 GYYKICMGRNVCGVDSMVSSVAAIHT 366
GY +I G N C V SS + +
Sbjct: 314 GYIRIAKGSNQCLVKEEASSAVVLRS 339
>gi|13507095|gb|AAK28439.1| cysteine protease 3 precursor [Clonorchis sinensis]
Length = 320
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 186/315 (59%), Gaps = 26/315 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 26 NARQLYEEFKLKYKKSYSN-DDDEYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V DQG CGSCW+FSA
Sbjct: 85 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF IL GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCD---------GVDEGCNGGTPQQAFRQILGMGGL 194
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GPL+ +NA+++Q
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQ 253
Query: 287 TYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
+ P +C + L+H VL VGYG G PYW +KNSW +GENGY++I
Sbjct: 254 HPL-----PALCDAQSLNHAVLTVGYGKEG-------RLPYWTVKNSWSTMFGENGYFRI 301
Query: 346 CMGRNVCGVDSMVSS 360
G CG++++VS+
Sbjct: 302 YRGDGTCGINTLVST 316
>gi|354496134|ref|XP_003510182.1| PREDICTED: cathepsin F [Cricetulus griseus]
gi|344250261|gb|EGW06365.1| Cathepsin F [Cricetulus griseus]
Length = 462
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 135/323 (41%), Positives = 187/323 (57%), Gaps = 24/323 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F +++TY ++EE +R VF N+ +A++ + LD TA +G+TKF
Sbjct: 155 QDFSVKMTTVFKDFMITYNRTYESREETQWRLTVFTRNMVKAQKIEALDRGTAQYGITKF 214
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGS 160
SDLT EF +L N L+ ++ + ND P ++DWR GAVT VKDQG CGS
Sbjct: 215 SDLTEEEFYTIYL--NPLLQKKPGSKMSLAKSINDPAPPEWDWRKKGAVTKVKDQGMCGS 272
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GG+ ++A+ I
Sbjct: 273 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACLGGMPSNAYTAI 323
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG+E E DY Y G +C F K +++ +S +E +MAA L + GP++V I
Sbjct: 324 KSLGGLETEDDYSYKGY-VQACNFSAQKAKVYINDSVELSKNESKMAAWLAQKGPISVAI 382
Query: 281 NAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA MQ Y G++ P +C +L DH VL+VGYG+ PYW IKNSWG NW
Sbjct: 383 NAFGMQFYRHGIAHPLRPLCSPWLIDHAVLLVGYGNRS-------NTPYWAIKNSWGSNW 435
Query: 338 GENGYYKICMGRNVCGVDSMVSS 360
GE GYY + G CGV++M SS
Sbjct: 436 GEEGYYYLYRGSGACGVNTMASS 458
>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 524
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 123/319 (38%), Positives = 174/319 (54%), Gaps = 26/319 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P EF
Sbjct: 117 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 176
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R +L G +K + T P DWR GAVT VKDQG+CGSCW+F+A G
Sbjct: 177 RATYLNGAKYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAAIG 236
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + + C GG + AF++I+ + G V
Sbjct: 237 NIEGQWKIAGHELTSLSEQMLVSCD---------TTEDNCGGGFADRAFKWIVSSNKGNV 287
Query: 227 EREKDYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY DG C + A +S + DE+ +A L ++GP+A+ ++A
Sbjct: 288 FTERSYPYASIDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVAIAVDAST 347
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Y GGV SC K+++H VL+VGY + + PYWIIKNSW + WGE GY
Sbjct: 348 FLDYKGGVLTSC---SSKHVNHEVLLVGYNDTS-------KPPYWIIKNSWDKEWGEEGY 397
Query: 343 YKICMGRNVCGVDSMVSSV 361
+I G N+C + SV
Sbjct: 398 IRIEKGTNLCLMKEYARSV 416
>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
Length = 361
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 144/369 (39%), Positives = 201/369 (54%), Gaps = 33/369 (8%)
Query: 5 ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFK 57
++SS++LLL + ASA A + DD+ ++V SDG E S ++ H F+ F
Sbjct: 7 LVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFA 66
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y + EE RF F NL + + G+ KF+D + EF+R LG
Sbjct: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNKFADWSWEEFQRHRLGAA 126
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + L + LP DWR+ G V+ VKDQG CGSCW+FS TG+LE A+ +
Sbjct: 127 QNC--SATTKGNHKLTADVLPETKDWRESGIVSPVKDQGHCGSCWTFSTTGSLEAAYHQA 184
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPYTG
Sbjct: 185 FGKGISLSEQQLVDCAQAFN-------NQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 237
Query: 238 DGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVS 293
D G CKF + V N ++ + DE Q A LV+ P++V V + Y GV
Sbjct: 238 D-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR--PVSVAFEVVDGFRFYKSGVY 294
Query: 294 CPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN 350
CG ++H V+ VGYG PYW+IKNSWGENWG++GY+KI MG+N
Sbjct: 295 SSTKCGNTPMDVNHAVVAVGYGVE-------DGVPYWLIKNSWGENWGDHGYFKIKMGKN 347
Query: 351 VCGVDSMVS 359
+CG+ + S
Sbjct: 348 MCGIATCAS 356
>gi|146335576|gb|ABQ23397.1| cathepsin L [Trypanosoma carassii]
Length = 456
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 177/314 (56%), Gaps = 21/314 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK++ K+Y + E YR RVF+ +++ A+ +P A GVTKFSDLT EF+
Sbjct: 35 QFAAFKAEHGKSYTSAAEEGYRMRVFEESMKAAQAHAAANPHAKFGVTKFSDLTHEEFKT 94
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ A + P+ T P ++DWR GAVT VKDQG CGSCW+FS TG +E
Sbjct: 95 LYANGAAHFAAAAKRARRPVSVTGTAPDEWDWRKKGAVTPVKDQGHCGSCWTFSTTGNIE 154
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVERE 229
G ++ EL +LSEQ LV CD D GC+GGLM++AFE+I+ G V E
Sbjct: 155 GQWAVAGNELTNLSEQMLVSCDAR---------DYGCSGGLMDNAFEWIVNQNDGFVFTE 205
Query: 230 KDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQT 287
+ YPY G + C K+ A + + +DE++MAA L +GP+++ ++A +
Sbjct: 206 ESYPYASGSGDAPLCDVGGRKVGATIKGHVGLPNDEEKMAAWLAANGPISIAVDADSFKA 265
Query: 288 YIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
Y GGV G+ LDHGVL+VGY + PYWIIKNSWG NWGE+GY ++
Sbjct: 266 YKGGVLTGCEEGQ-LDHGVLLVGYN-------KVANPPYWIIKNSWGPNWGEHGYIRVGF 317
Query: 348 GRNVCGVDSMVSSV 361
G N C ++S S
Sbjct: 318 GTNQCNLNSYACSA 331
>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 171/310 (55%), Gaps = 26/310 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R +L G +K + T P DWR GAVT VKDQG+CGSCW+F+ATG
Sbjct: 98 RATYLNGAKYYAAALERPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAATG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + + C GG + AF++I+ + G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TTEDNCRGGFADRAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY TDG C + A +S + DE+ +A L ++GP+A+ ++A
Sbjct: 209 FTEESYPYASTDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVAIAVDAST 268
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Y GGV SC + L H VL+VGY + + PYWIIKNSW + WGE GY
Sbjct: 269 FLDYKGGVLTSCS---SEGLSHDVLLVGYNDT-------SKPPYWIIKNSWDKEWGEEGY 318
Query: 343 YKICMGRNVC 352
+I G N+C
Sbjct: 319 IRIEKGTNLC 328
>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 140/365 (38%), Positives = 198/365 (54%), Gaps = 35/365 (9%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSL 55
+ +LSS++L++L + A+A D+ IR V SDG E++ +L H F+
Sbjct: 4 KTVLSSVVLVILIAASAAADIGFDELNPIRMV--SDGLREVEETVSQILGQSRHVLTFAR 61
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F ++ K Y EE RF +FK NL + + GV +F+DLT EF+R LG
Sbjct: 62 FTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLG 121
Query: 116 LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+ A + + L LP DWR+ G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 122 AAQNC--SATLKGSHKLTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYH 179
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPY
Sbjct: 180 QAFGKGISLSEQQLVDCAGAYN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAYPYI 232
Query: 236 GTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGG 291
G D G+CKF + V N ++ + DE + A LV+ P+++ + + Y G
Sbjct: 233 GKD-GTCKFSAENVGVQVLDSVNITLGAEDELKHAVGLVR--PVSIAFEVIHSFRLYKSG 289
Query: 292 VSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
V CG ++H VL VGYG PYW+IKNSWG +WG+ GY+K+ MG
Sbjct: 290 VYTDSHCGSTPMDVNHAVLAVGYGVEDGV-------PYWLIKNSWGADWGDKGYFKMEMG 342
Query: 349 RNVCG 353
+N+CG
Sbjct: 343 KNMCG 347
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 142/376 (37%), Positives = 203/376 (53%), Gaps = 53/376 (14%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSL---FKSKFSKTYAT 66
+ LLL S+ ++ + I+ E ++LL ++ + FK K +K+Y T
Sbjct: 4 ITLLLHSIFLLGFVNSEQISQIQ-------EHPRNNLLINHPYYPVWTNFKLKHAKSYKT 56
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSDLTPSEFRRQFLGLNRRLRL 122
++E RF+VF +N + ++ + H + KF+D+T +EFR++ G +L
Sbjct: 57 KDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGF----KL 112
Query: 123 PAD---AQKAPI--------LPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
PA A+ P+ +P N +P DWR G VT VKDQG+CGSCW+FSATG+L
Sbjct: 113 PAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSL 172
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG H+ TG+LVSLSEQ LVDCD D D GCNGG M+ AF+Y+ G++ E
Sbjct: 173 EGQHYKQTGKLVSLSEQNLVDCDVNGD-------DEGCNGGYMDGAFQYVETNKGIDTEA 225
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAV--WMQT 287
YPY G D G C+F + A + F + +E + A + GP++V I+A Q
Sbjct: 226 SYPYKGRD-GRCRFKSEDVGATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQF 284
Query: 288 YIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y GV C +YLDHGVL VGY S+ K Y+I+KNSW E+WG++GY I
Sbjct: 285 YSHGVYYDRSCSPEYLDHGVLAVGYNSTK------DGKQYYIVKNSWSEDWGDDGY--IL 336
Query: 347 MGR---NVCGVDSMVS 359
M R N CG+ +M S
Sbjct: 337 MSRRKNNNCGIATMAS 352
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 148/369 (40%), Positives = 194/369 (52%), Gaps = 33/369 (8%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSED--HLLNAEHH---FSLFK 57
RL S LLL+LS +A +V DD IR V E + +L H F+ F
Sbjct: 4 RLFFVSSLLLVLSCAVAGSVF--DDSNPIRMVSDRLRELELEVVRVLGQVPHALRFARFA 61
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y T EE RF +F +L K + GV +F+D T EFR+ LG
Sbjct: 62 HRYGKKYETAEEMKLRFGIFLESLELIKSTNKQGLSYKLGVNQFADWTWEEFRKHRLGAA 121
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + + L LP DWR G V+ VKDQG CGSCW+FS TGALE A+ +
Sbjct: 122 QNC--SATTKGSHKLTDTALPESKDWRKDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQA 179
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G+ +SLSEQQLVDC G + GCNGGL + AFEYI GG++ E+ YPYTG
Sbjct: 180 HGKGISLSEQQLVDCGR-------GFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGV 232
Query: 238 DGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVS 293
D GSCKF + V N ++ + DE + A V+ P++V V + Y GV
Sbjct: 233 D-GSCKFVPENVGVQVIDSVNITLGAEDELKHAVAFVR--PVSVAFEVVSGFRLYSKGVY 289
Query: 294 CPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN 350
CG ++H VL VGYG PYW+IKNSWG NWG+NGY+K+ MG+N
Sbjct: 290 TSNSCGSTPMDVNHAVLAVGYGVE-------DGIPYWLIKNSWGGNWGDNGYFKMEMGKN 342
Query: 351 VCGVDSMVS 359
+CGV + S
Sbjct: 343 MCGVATCAS 351
>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/311 (40%), Positives = 170/311 (54%), Gaps = 30/311 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT +E R F+ NL + Q +P A G+TKF DL+ EF +
Sbjct: 38 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +L+T L+SLSEQ+LV CD D GCNGGLM AF+++L + G V
Sbjct: 158 NIESQWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
YPY +G + +S I A + I S+ED MAA L +GP+A+ ++A
Sbjct: 209 YTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDAS 268
Query: 284 WMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+Y GGV SC GK L+HGVL+VGY +G E PYW+IKNSWG+NWGE G
Sbjct: 269 AFMSYTGGVLTSCD---GKQLNHGVLLVGYNMTG-------EVPYWLIKNSWGKNWGEKG 318
Query: 342 YYKICMGRNVC 352
Y ++ G N C
Sbjct: 319 YVRVRKGTNEC 329
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 181/321 (56%), Gaps = 28/321 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLT 105
E + FK KTY EE RF +F+ N+++ + L + GV +FSDL
Sbjct: 53 EQAWKEFKILHDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLK 112
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EF + + GL ++ L D + L N+L P DWR G VT VK+QG CGSCWS
Sbjct: 113 HEEFVK-YNGL-KKTSLK-DGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWS 169
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG+LEG HF +G+LVSLSE QLVDC E GCNGGLM++AF+YI
Sbjct: 170 FSTTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNE-------GCNGGLMDNAFKYIKSV 222
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINA 282
GG+E E+DYPY G+CKFD +K+AA + V S E + + + GP++V I+A
Sbjct: 223 GGLESEEDYPYKPKQ-GTCKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDA 281
Query: 283 VW--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q+Y GGV P + LDHGVL VGYG+ + + YWI+KNSWG WGE
Sbjct: 282 SHSSFQSYAGGVYDEPECSSEQLDHGVLCVGYGTDD------QGQDYWIVKNSWGAEWGE 335
Query: 340 NGYYKICMG-RNVCGVDSMVS 359
+GY K+ +N CG+ + S
Sbjct: 336 DGYVKMSRNKKNQCGIATQAS 356
>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 359
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 140/371 (37%), Positives = 204/371 (54%), Gaps = 35/371 (9%)
Query: 3 RLIL-SSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLF 56
R IL S++LL+L+++ A ++ D+ IR V + E+S +L H F+ F
Sbjct: 5 RTILPSAVLLILIAASTAESIGF-DESNPIRMVSDRLREVEESVVQILGQSRHVISFARF 63
Query: 57 KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
++ K Y EE RF +FK NL + + GV +F+D+T EF+R LG
Sbjct: 64 AHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADMTWQEFQRTKLGA 123
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
+ A + L LP DWR+ G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 124 AQNC--SATLKGTHKLTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQ 181
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ G+ +SLSEQQLVDC +G+ ++ GCNGGL + AFEYI GG++ E+ YPYT
Sbjct: 182 AFGKGISLSEQQLVDC--------AGAFNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYT 233
Query: 236 GTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGG 291
G D G+CK+ + V N ++ + DE + A LV+ P+++ + + Y G
Sbjct: 234 GED-GTCKYSAENVGVEVLDSVNITLGAEDELKHAVGLVR--PVSIAFEVIHSFRLYKSG 290
Query: 292 VSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
V CG+ ++H VL VGYG PYW+IKNSWG +WG+ GY+K+ MG
Sbjct: 291 VYSDSHCGQTPMDVNHAVLAVGYGIEDGV-------PYWLIKNSWGADWGDKGYFKMEMG 343
Query: 349 RNVCGVDSMVS 359
+N+CG+ + S
Sbjct: 344 KNMCGIATCAS 354
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 143/371 (38%), Positives = 198/371 (53%), Gaps = 41/371 (11%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M I S L LL+ S+L ++++ V +D ++E A + + +
Sbjct: 1 MATPIKSITLALLIFSMLLISLSLG-------SVTAADTTRNEAE---ARRMYEQWLVEN 50
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFL-GLNR 118
K Y E + RF +F NL+ + + + T G+T+F+DLT EFR +L
Sbjct: 51 RKNYNGLGEKETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKME 110
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R R+P ++ + LP DWR GAV VKDQG CGSCW+FSA GA+EG + + T
Sbjct: 111 RTRVPVKGERYLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKT 170
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GEL+SLSEQ+LVDCD S + GC GGLM+ AF++I++ GG++ E+DYPYT TD
Sbjct: 171 GELISLSEQELVDCDT--------SYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATD 222
Query: 239 GGSCKFDK--SKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSC 294
C DK S++ V +DE + L P++V I A Q Y GV
Sbjct: 223 DNICNSDKKNSRVVTIDGYEDVPQNDEKSLKKALANQ-PISVAIEAGGRAFQLYKSGVFT 281
Query: 295 PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV--- 351
CG LDHGV+ VGYGS G + YWI++NSWG NWGE+GY+K + RN+
Sbjct: 282 G-TCGTSLDHGVVAVGYGSEG-------GQDYWIVRNSWGSNWGESGYFK--LERNIKES 331
Query: 352 ---CGVDSMVS 359
CGV M S
Sbjct: 332 SGKCGVAMMAS 342
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 117/306 (38%), Positives = 178/306 (58%), Gaps = 25/306 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
++ + +K K Y E + RF +FK NL+ + + G+ +F+DLT E+R
Sbjct: 47 YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSENRSYKVGLNRFADLTNEEYRSM 106
Query: 113 FLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
FLG R ++ + +++ + ++ LP DWR+ GAV +KDQG+CGSCW+FS
Sbjct: 107 FLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWAFSTV 166
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
A+EG + ++TGE++ LSEQ+LVDCD + D+GCNGGLM+ AFE+I+ GG++
Sbjct: 167 AAVEGVNQIATGEMIQLSEQELVDCDR--------TYDAGCNGGLMDYAFEFIINNGGID 218
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--M 285
E+DYPY G DG K+ +++++ + ++ V H P++V I A
Sbjct: 219 TEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAF 278
Query: 286 QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Q Y+ GV CG+ LDHGV++VGYG+ A +WI++NSWG +WGENGY I
Sbjct: 279 QLYLSGVFTGE-CGRALDHGVVVVGYGTDNGA-------DHWIVRNSWGTSWGENGY--I 328
Query: 346 CMGRNV 351
M RNV
Sbjct: 329 RMERNV 334
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 128/346 (36%), Positives = 194/346 (56%), Gaps = 22/346 (6%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
++L L +ASAV ++ + V + G +S+ +++ + L K ++ +
Sbjct: 2 VILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAW-LVKHGKAQNQNSLV 60
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-LPADAQ 127
E D RF +FK NLR + + G+T+F+DLT E+R ++LG + +Q
Sbjct: 61 EKDRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSQ 120
Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ ++LP DWR GAV VKDQG+CGSCW+FS GA+EG + + TG+L++LSEQ
Sbjct: 121 RYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 180
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
+LVDCD S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG + K+
Sbjct: 181 ELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKN 232
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHG 305
+ ++ + + ++ V H P++V I A Q Y G+ CG LDHG
Sbjct: 233 AKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIF-DGTCGTQLDHG 291
Query: 306 VLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
V+ VGYG+ K YWI++NSWG++WGE+GY K M RN+
Sbjct: 292 VVAVGYGTE-------NGKDYWIVRNSWGKSWGESGYLK--MARNI 328
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 147/366 (40%), Positives = 196/366 (53%), Gaps = 50/366 (13%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
L L+L++V+ S AV+ D + Q +S FK + SK Y ++ E
Sbjct: 3 LFLILAAVVISCQAVSFYDLVQEQ-------------------WSSFKMQHSKNYDSETE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
+R ++F N + AK +L V G+ K++D+ EF G N+ +
Sbjct: 44 ERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILK 103
Query: 123 PADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+D A I P N LP DWRD GAVT VKDQG CGSCWSFSATG+LEG HF TG
Sbjct: 104 GSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTG 163
Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
+LVSLSEQ LVDC SG ++GCNGGLM++AF YI GG++ EK YPY D
Sbjct: 164 KLVSLSEQNLVDC--------SGRYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAED 215
Query: 239 GGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGV-SC 294
C + A F I ++ED + A + GP+++ I+A Q Y GV S
Sbjct: 216 -EKCHYKAQNSGATDKGFVDIEEANEDDLKAAVATVGPVSIAIDASHETFQLYSDGVYSD 274
Query: 295 PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCG 353
P + LDHGVL+VGYG+S + YW++KNSWG +WG NGY K+ + N+CG
Sbjct: 275 PECSSQELDHGVLVVGYGTSD------DGQDYWLVKNSWGPSWGLNGYIKMARNQDNMCG 328
Query: 354 VDSMVS 359
V S S
Sbjct: 329 VASQAS 334
>gi|261328618|emb|CBH11596.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
gambiense DAL972]
Length = 404
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 174/314 (55%), Gaps = 38/314 (12%)
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ------- 112
+ K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR +
Sbjct: 2 YGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASY 61
Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
F +RLR K + T P DWR+ GAVT +KDQG CGSCW+F + G +EG
Sbjct: 62 FAAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPMKDQGQCGSCWAFYSIGNIEG 115
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREK 230
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 116 QWQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEA 166
Query: 231 DYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTY 288
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA+ ++A Y
Sbjct: 167 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 226
Query: 289 IGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
GG+ SC + LDHGVL+VGY + PYWIIKNSW WGE+GY +I
Sbjct: 227 NGGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIE 276
Query: 347 MGRNVCGVDSMVSS 360
G N C ++ VSS
Sbjct: 277 KGTNQCLMNQAVSS 290
>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 366
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 127/309 (41%), Positives = 169/309 (54%), Gaps = 26/309 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCSGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 209 YTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 268
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK L+HGVL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 269 SFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 320
Query: 344 KICMGRNVC 352
++ MG N C
Sbjct: 321 RVVMGVNAC 329
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 179/331 (54%), Gaps = 31/331 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D+GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCD---------DMDNGCSGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 268
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK L+HGVL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 269 SFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 320
Query: 344 KICMGRNVC-----GVDSMVSSVAAIHTTSS 369
++ MG N C V + V AA T++S
Sbjct: 321 RVVMGVNACLLSEYPVSAHVRESAAPGTSTS 351
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 179/319 (56%), Gaps = 28/319 (8%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
+SE+ ++ + + +K K Y E + RF +FK NL+ + T G+ +
Sbjct: 37 RSEEEVMGM---YQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNR 93
Query: 101 FSDLTPSEFRRQFLGL----NRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
F+DLT E+R +LG RR +L + + ++P LP DWR+ GAV VKDQ
Sbjct: 94 FADLTNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQ 153
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
+CGSCW+FS A+EG + + TGEL+SLSEQ+LVDCD E D GCNGGLM+
Sbjct: 154 RSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTE--------YDMGCNGGLMDY 205
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
AF++I+K GG++ EKDYPYTG DG KS ++ + + +++ V H P
Sbjct: 206 AFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQP 265
Query: 276 LAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
++V + A +Q Y+ G+ CG LDHG++ VGYG+ YWI++NSW
Sbjct: 266 VSVAVEAGGRALQLYVSGIFTGE-CGTALDHGIVAVGYGTE-------NGTDYWIVRNSW 317
Query: 334 GENWGENGYYKICMGRNVC 352
G +WGENGY I M RN+
Sbjct: 318 GSSWGENGY--IRMERNMA 334
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 184/324 (56%), Gaps = 32/324 (9%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-- 96
GE+S+D + + +K++ +++Y +E + R +F+ NLR + +
Sbjct: 36 GERSDDEV---HRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSF 92
Query: 97 --GVTKFSDLTPSEFRRQFLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
G+T+F+DLT E+R +LG+ RR + + ++DLP DWRD GAV
Sbjct: 93 RLGLTRFADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAV 152
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
VKDQG+CGSCW+FS A+EG + + TG+L+SLSEQ+LVDCD + GCN
Sbjct: 153 VDVKDQGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDT--------YYNQGCN 204
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AFE+I+ GG++ ++DYPYTG DG ++ K+ + ++ + ++++
Sbjct: 205 GGLMDYAFEFIISNGGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQK 264
Query: 270 LVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
V + P++V I A Q Y G+ Y CG LDHGV +GYGS K YW
Sbjct: 265 AVANQPVSVAIEAGGRAFQLYESGIFTGY-CGTELDHGVTAIGYGSE-------NGKYYW 316
Query: 328 IIKNSWGENWGENGYYKICMGRNV 351
I+KNSWG +WGE+GY + M RN+
Sbjct: 317 IVKNSWGSDWGESGYIR--MERNI 338
>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 332
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 127/309 (41%), Positives = 169/309 (54%), Gaps = 26/309 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCSGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 268
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK L+HGVL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 269 SFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 320
Query: 344 KICMGRNVC 352
++ MG N C
Sbjct: 321 RVVMGVNAC 329
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/350 (36%), Positives = 195/350 (55%), Gaps = 32/350 (9%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L L ++SAV ++ + V + G +SE +++ + L K +++ + E
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
D RF +FK NLR + + G+T+F+DLT E+R ++LG RR L
Sbjct: 69 KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+A+ ++LP DWR GAV VKDQG CGSCW+FS GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKY 301
K+ + ++ + + ++ V H P+++ I A Q Y G+ CG
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIF-DGSCGTQ 294
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
LDHGV+ VGYG+ K YWI++NSWG++WGE+GY + M RN+
Sbjct: 295 LDHGVVAVGYGTE-------NGKDYWIVRNSWGKSWGESGYLR--MARNI 335
>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
Length = 358
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 141/373 (37%), Positives = 199/373 (53%), Gaps = 34/373 (9%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL----NAEH--HFS 54
M R S L++L+ AS+ + DD+ IR VV + E +L ++ H F+
Sbjct: 1 MARTSFSLLIILIACVAGASSASTFDDENPIRTVVSDALREFETSILSVLGDSRHALSFA 60
Query: 55 LFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
F ++ K Y T EE RF +F NL+ + + GV F+D T EFRR L
Sbjct: 61 RFAHRYGKRYETAEETKLRFAIFSENLKLIRSHNKKGLSYTLGVNHFADWTWEEFRRHRL 120
Query: 115 GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
G + A + L LP DWR G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 121 GAAQNC--SATTKGNHKLTEEALPEMKDWRVSGIVSPVKDQGHCGSCWTFSTTGALEAAY 178
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYP 233
+ G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEY+ GG++ E+ YP
Sbjct: 179 KQAFGKGISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAYP 230
Query: 234 YTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQTYI 289
YTG + G CKF + V N ++ + DE + A V+ P++V V + Y
Sbjct: 231 YTGKN-GECKFSSENVGVQVLDSVNITLGAEDELKHAVAFVR--PVSVAFQVVNGFRLYK 287
Query: 290 GGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
GV CG+ ++H VL VGYG PYW+IKNSWG +WG++GY+K+
Sbjct: 288 EGVYTSDTCGRTPMDVNHAVLAVGYGVENGV-------PYWLIKNSWGADWGDSGYFKME 340
Query: 347 MGRNVCGVDSMVS 359
MG+N+CGV + S
Sbjct: 341 MGKNMCGVATCAS 353
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 128/358 (35%), Positives = 189/358 (52%), Gaps = 42/358 (11%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M +++ +LLLL + A+A+++ + SE+ +++ + + K
Sbjct: 1 MPSMLIPTLLLLSFTFSHATAMSIIN--------------YSENEVMDMYEEWLV---KH 43
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--- 117
K Y +E + RF+VFK NL + + T G+ KF+D+T E+R +LG
Sbjct: 44 RKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNEEYRAMYLGTRTDA 103
Query: 118 --RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
R ++ + + LP DWR GAV +KDQG CGSCW+FS A+EG +
Sbjct: 104 KRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINN 163
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ TGE VSLSEQ+LVDCD E D GCNGGLM+ AF++I++ GG++ E+DYPY
Sbjct: 164 IVTGEFVSLSEQELVDCDRE--------YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQ 215
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVS 293
G DG + K + + + S+ + V H P++V I A +Q Y GV
Sbjct: 216 GIDGTCDQTKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVF 275
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG LDHGV++VGYG+ YW+++NSWG WGE+GY+K M RNV
Sbjct: 276 TGK-CGTALDHGVVVVGYGTENGV-------DYWLVRNSWGTGWGEDGYFK--MERNV 323
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/350 (36%), Positives = 195/350 (55%), Gaps = 32/350 (9%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L L ++SAV ++ + V + G +SE +++ + L K +++ + E
Sbjct: 10 ILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
D RF +FK NLR + + G+T+F+DLT E+R ++LG RR L
Sbjct: 69 KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+A+ ++LP DWR GAV VKDQG CGSCW+FS GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKY 301
K+ + ++ + + ++ V H P+++ I A Q Y G+ CG
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIF-DGSCGTQ 294
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
LDHGV+ VGYG+ K YWI++NSWG++WGE+GY + M RN+
Sbjct: 295 LDHGVVAVGYGTE-------NGKDYWIVRNSWGKSWGESGYLR--MARNI 335
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 130/369 (35%), Positives = 196/369 (53%), Gaps = 30/369 (8%)
Query: 2 ERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD--GEQSEDHLLNAEHHFSLFKSK 59
+ + +++L+LLL +A + A+ +V PS G + + + ++
Sbjct: 9 KHITMTTLMLLLCVIAIADCIC---QAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQ 65
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA-VHGVTKFSDLTPSEFRRQFLGLNR 118
+ + Y E +RF+VFKAN R V G +F+DLT EF + GL +
Sbjct: 66 YRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRK 125
Query: 119 RLRLPADAQKAPI------LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
+P+ A++ P D DWR GAVT VK+QG CG CW+FSA GA+EG
Sbjct: 126 PAAVPSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEG 185
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
++TG LVSLSEQQ++DCD E G + GCNGG M++AF+Y++ GGV E Y
Sbjct: 186 LIMITTGNLVSLSEQQILDCD-----ESDG--NQGCNGGYMDNAFQYVVNNGGVTTEDAY 238
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIG 290
PY+ G+C+ + AA +S F + S ++ AN V + P++VG++ + Q Y G
Sbjct: 239 PYSAVQ-GTCQ--NVQPAATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQG 295
Query: 291 GVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN 350
G+ CG ++H V +GYG+ + YWI+KNSWG WGENG+ ++ MG
Sbjct: 296 GIYDGDGCGTDMNHAVTAIGYGADD------QGTQYWILKNSWGTGWGENGFMQLQMGVG 349
Query: 351 VCGVDSMVS 359
CG+ +M S
Sbjct: 350 ACGISTMAS 358
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/350 (36%), Positives = 195/350 (55%), Gaps = 32/350 (9%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L L ++SAV ++ + V + G +SE +++ + L K +++ + E
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
D RF +FK NLR + + G+T+F+DLT E+R ++LG RR L
Sbjct: 69 KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+A+ ++LP DWR GAV VKDQG CGSCW+FS GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKY 301
K+ + ++ + + ++ V H P+++ I A Q Y G+ CG
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIF-DGSCGTQ 294
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
LDHGV+ VGYG+ K YWI++NSWG++WGE+GY + M RN+
Sbjct: 295 LDHGVVAVGYGTE-------NGKDYWIVRNSWGKSWGESGYLR--MARNI 335
>gi|398010921|ref|XP_003858657.1| cathepsin L-like protease, partial [Leishmania donovani]
gi|322496866|emb|CBZ31937.1| cathepsin L-like protease, partial [Leishmania donovani]
Length = 345
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 131/313 (41%), Positives = 175/313 (55%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E + LVSLSEQQLV CD + D+GCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYG 206
Query: 225 GVEREKDYPYTGTDGGSCK-FDKSKI--AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
V EK YPYT +G + + SK+ A + + +I S+E MAA L ++GP+A+ ++
Sbjct: 207 IVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYQSGVLTSC---AGDALNHGVLLVGYNKTGGV-------PYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MGRN C
Sbjct: 317 KGYVRVAMGRNAC 329
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 128/358 (35%), Positives = 189/358 (52%), Gaps = 42/358 (11%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M +++ +LLLL + A+A+++ + SE+ +++ + + K
Sbjct: 1 MPSMLIPTLLLLSFTFSHATAMSIIN--------------YSENEVMDMYEEWLV---KH 43
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--- 117
K Y +E + RF+VFK NL + + T G+ KF+D+T E+R +LG
Sbjct: 44 RKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNKEYRAMYLGTRTDA 103
Query: 118 --RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
R ++ + + LP DWR GAV +KDQG CGSCW+FS A+EG +
Sbjct: 104 KRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINN 163
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ TGE VSLSEQ+LVDCD E D GCNGGLM+ AF++I++ GG++ E+DYPY
Sbjct: 164 IVTGEFVSLSEQELVDCDRE--------YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQ 215
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVS 293
G DG + K + + + S+ + V H P++V I A +Q Y GV
Sbjct: 216 GIDGTCDETKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVF 275
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG LDHGV++VGYG+ YW+++NSWG WGE+GY+K M RNV
Sbjct: 276 TGK-CGTALDHGVVVVGYGTENGV-------DYWLVRNSWGTGWGEDGYFK--MERNV 323
>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 479
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 175/311 (56%), Gaps = 25/311 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
A HF FK + K++ + +RF FK N++ A +P A + V+ KF+ LTP
Sbjct: 38 ASAHFMHFKKQHGKSFGEEAVEGHRFNAFKENMQTAVYLNAQNPHAHYDVSGKFAALTPQ 97
Query: 108 EFRRQFLGLNRRLR-LPADAQKAPILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF +Q+L + R L A ++A + + DWR+ GAVT VKDQG CGSCW+FS
Sbjct: 98 EFAKQYLNPDYYTRQLKAHKERAHVYEGVRGGLSAVDWREKGAVTEVKDQGLCGSCWAFS 157
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
A G +EG LS LVSLSEQ LV CD + D GCNGGLM+ A+ +I+K +
Sbjct: 158 AIGNIEGQWALSGNTLVSLSEQMLVSCD---------TVDMGCNGGLMDQAWAWIIKNHS 208
Query: 224 GGVEREKDYPYTGTDGGSCK-FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
G V E YPYT DG + K+ A +S + DED + A L K+GP+++ ++A
Sbjct: 209 GAVYTEVSYPYTSGDGSTASCLSTGKVGARISGQVSLPQDEDAIEAWLEKNGPISIAVDA 268
Query: 283 VWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GGV C Y L+HGVL+VGY +S PYWI+KNSWG +WGE+G
Sbjct: 269 TTWQLYFGGVVSN--CFAYNLNHGVLLVGYNNSA-------NPPYWIVKNSWGTSWGEHG 319
Query: 342 YYKICMGRNVC 352
Y ++ G N C
Sbjct: 320 YIRLAKGSNQC 330
>gi|15593249|gb|AAL02221.1|AF410881_1 cysteine protease CP10 precursor [Frankliniella occidentalis]
Length = 334
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 133/334 (39%), Positives = 177/334 (52%), Gaps = 30/334 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
+PSD + + H+ FK+ +KTYA E YR +VFK N +R AK L
Sbjct: 18 IPSD--------MEIQAHWESFKATHAKTYANTVEEAYRAKVFKENAIRIAKHNDLFASG 69
Query: 94 AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
V G ++++D+ E + G L+ + + DWR GAVT
Sbjct: 70 EVTFKVGYSQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVT 129
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
+KDQG CGSCWSFSATG+LEG FL LVSLSEQ LVDC + E GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
GLM+SAFEY+ GG++ E+ YPYT DG SC + + A + + V + E +
Sbjct: 183 GLMDSAFEYVESNGGIDTEESYPYTAVDGDSCLYKAANNAGVNTGYKDVQAKSESALRDA 242
Query: 270 LVKHGPLAVGINAV-W-MQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPY 326
+ K GP++V I+A W Q Y G+ C YLDHGVL VGYGS + K +
Sbjct: 243 VEKAGPVSVAIDASNWSFQMYSSGIYYESACSSDYLDHGVLAVGYGS------EWPNKEF 296
Query: 327 WIIKNSWGENWGENGYYKICMG-RNVCGVDSMVS 359
WI+KNSWG +WGE GY K+ +N CG+ + S
Sbjct: 297 WIVKNSWGTSWGEEGYIKMARNKKNNCGIATEAS 330
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 177/310 (57%), Gaps = 29/310 (9%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K + Y+ +E D R++ FK N+ + + V G+TKF+DLT E+++ +LG+
Sbjct: 39 KHDRAYSHEEFTD-RYQAFKENMDFIHKWNSQESDTVLGLTKFADLTNEEYKKHYLGIKV 97
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
N + L A AQK P DWR+ GAV+ VKDQG CGSCWSFS TGA+EGAH +
Sbjct: 98 NVKKNLNA-AQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWSFSTTGAVEGAHQI 156
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
+G +VSLSEQ LVDC + + GC GGLM +AFEYI+ GG+ E YPYT
Sbjct: 157 KSGNMVSLSEQNLVDCSGQYG-------NQGCEGGLMVNAFEYIIDNGGIATESSYPYTA 209
Query: 237 TDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAVWM--QTYIGGV- 292
G CKF KS A + + I +ED + A L K P++V I+A M Q Y GV
Sbjct: 210 AQ-GRCKFTKSMNGANIIGYKEIPQGEEDSLTAALAKQ-PVSVAIDASHMSFQLYSSGVY 267
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV- 351
P + LDHGVL VGYG+ + K Y+IIKNSWG WG++GY I M RN
Sbjct: 268 DEPACSSEALDHGVLAVGYGT-------LEGKDYYIIKNSWGPTWGQDGY--IFMSRNAQ 318
Query: 352 --CGVDSMVS 359
CGV +M S
Sbjct: 319 NQCGVATMAS 328
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 139/372 (37%), Positives = 197/372 (52%), Gaps = 42/372 (11%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M +L + L +S+ +V +D + S +HL + + LF+S
Sbjct: 1 MALSVLKTSFLTFFASLFVCSVLAHDFSIV---------GYSPEHLTSVDKLVELFESWI 51
Query: 61 S---KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
S K Y + EE +RF VFK NL+ +R + G+ +F+DL+ EF+ +FLGL
Sbjct: 52 SGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKFLGLY 111
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
++ DLP DWR GAVT VK+QG+CGSCW+FS A+EG + +
Sbjct: 112 PEFPRKKSSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV 171
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G L SLSEQQL+DCD S ++GCNGGLM+ AFE+I+ GG+ +E+DYPY
Sbjct: 172 AGNLTSLSEQQLIDCD--------TSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYL-M 222
Query: 238 DGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGV-S 293
+ G+C + ++ +S + + +++Q + H PL+V I+A Q Y GGV S
Sbjct: 223 EEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFS 282
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN--- 350
P CG LDHGV VGYGSS Y I+KNSWG WGE GY + M RN
Sbjct: 283 GP--CGTDLDHGVAAVGYGSSSGI-------DYIIVKNSWGPKWGERGYLR--MKRNTGK 331
Query: 351 ---VCGVDSMVS 359
+CG++ M S
Sbjct: 332 PEGLCGINKMAS 343
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 178/331 (53%), Gaps = 31/331 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 128 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 187
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 188 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 247
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 248 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 298
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 299 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 358
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK L+HGVL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 359 SFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 410
Query: 344 KICMGRNVC-----GVDSMVSSVAAIHTTSS 369
++ MG N C V + V AA T++S
Sbjct: 411 RVVMGVNACLLSEYPVSAHVRESAAPGTSTS 441
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 135/370 (36%), Positives = 201/370 (54%), Gaps = 40/370 (10%)
Query: 5 ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD----GEQSEDHLLNAEHHFSLFKSKF 60
I S L ++ S LAS ++ D +P+D E++E H++ H+ + K
Sbjct: 10 IAISFLFMVFSLSLASMSIIDYD-------LPADPLQSTERTEAHMMKMYEHWLV---KH 59
Query: 61 SKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LN 117
K Y E + RF +FK NLR ++ + T G+TKF+DLT E+R +LG +
Sbjct: 60 GKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKME 119
Query: 118 RRLRLPADAQKAPILPT---NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
++ +L + + + +DLP+ DWR+ GAVT VKDQG CGSCW+FS G++EG +
Sbjct: 120 KKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGIN 179
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ TG+L+SLSEQ+LVDCD + + GCNGGLM+ AFE+I+K GG++ E DYPY
Sbjct: 180 QIVTGDLISLSEQELVDCDK--------AYNQGCNGGLMDYAFEFIIKNGGIDSEADYPY 231
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGV 292
+D K+ + + + ++++ V + P++V I A Q Y GV
Sbjct: 232 RASDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGV 291
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
CG LDHGV+ VGYG+ YWI++NSWG WGE+GY + M RNV
Sbjct: 292 FTGR-CGTNLDHGVVAVGYGTENGI-------DYWIVRNSWGPKWGESGYIR--MERNVA 341
Query: 353 GVDSMVSSVA 362
D+ +A
Sbjct: 342 STDTGKCGIA 351
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 135/358 (37%), Positives = 193/358 (53%), Gaps = 34/358 (9%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG--------EQSEDHLLNAEHHFSL 55
L+ +++ LL+ +S L DDD + P + E E+H NA F
Sbjct: 67 LVAAAVSLLVFASFLIQWQG--DDDRGVFPPSPVEDHKTPVNIWEWKEEHFQNA---FGS 121
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F++ + K+YAT+EE R+ +FK NL + + F DL+ EFRR++LG
Sbjct: 122 FRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGYSYSLKMNHFGDLSREEFRRKYLG 181
Query: 116 LNRRLRLPAD----AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
N+ L ++ A + + +D+P+ DWR+ G VT VKDQ CGSCW+FSATGALE
Sbjct: 182 YNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSATGALE 241
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
GAH TGEL+SLSEQ+LVDC + GC+GG MN AF+Y++ +GG+ E+
Sbjct: 242 GAHCAKTGELLSLSEQELVDCS-------LAEGNQGCSGGEMNDAFQYVVDSGGLCSEEG 294
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYI 289
YPY D G CK K+ +S F + + + H P+++ I A + Q Y
Sbjct: 295 YPYLARD-GECKRACKKV-VTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYH 352
Query: 290 GGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GV CG LDHGVL+VGYG+ + +K +WI+KNSWG WG +GY + M
Sbjct: 353 EGV-FDASCGTDLDHGVLLVGYGTD-----KETKKDFWIMKNSWGSGWGRDGYMYMAM 404
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 180/322 (55%), Gaps = 27/322 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLT 105
+ ++ +K++ K Y + EE R +++ NL + + L T G+ +F+DL
Sbjct: 25 DEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLK 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTN---DLPTDFDWRDHGAVTGVKDQGACGSCW 162
EF G R A+ + LP+N +LP DWR G VT VKDQG CGSCW
Sbjct: 85 NEEFVAMMTGF-RVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCW 143
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG+LEG HF +TG+LVSLSEQ LVDC + E GC+GGLM+ AF+YI+K
Sbjct: 144 AFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNE-------GCDGGLMDQAFQYIIK 196
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGIN 281
AGG++ E+ YPY D G C F K+ I A V+ ++ ++SD + V H GP++V I+
Sbjct: 197 AGGIDTEESYPYKAVD-GECHFKKANIGATVTGYTDVTSDSETALQKAVAHIGPISVAID 255
Query: 282 AVWM--QTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A M Q Y GV + P LDHGVL VGYG++ YWI+KNSW E WG
Sbjct: 256 ASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGT------DYWIVKNSWAETWG 309
Query: 339 ENGYYKICMGR-NVCGVDSMVS 359
NGY + + N CG+ + S
Sbjct: 310 MNGYLWMSRNKDNQCGIATQAS 331
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 177/320 (55%), Gaps = 30/320 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLL---DPTAVHGVTKFSDLTPSEFRR 111
FK + KTY + E +R ++F N + AK Q + T V K++D+ EFR
Sbjct: 30 FKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADMLHHEFRE 89
Query: 112 QFLGLN----RRLRL--PADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
G N + LR P+ I P + LP DWR+ GAVT VKDQG CGSCW+F
Sbjct: 90 TMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQGHCGSCWAF 149
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S+TGALEG HF TG LVSLSEQ LVDC + ++GCNGGLM++AF YI G
Sbjct: 150 SSTGALEGQHFRKTGTLVSLSEQNLVDC-------SAKYGNNGCNGGLMDNAFRYIKDNG 202
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV 283
G++ EK YPY G D SC F+K + A F+ I +E +MA + GP++V I+A
Sbjct: 203 GIDTEKSYPYEGID-DSCHFNKDSVGATDRGFADIPQGNEKKMAEAVATIGPVSVAIDAS 261
Query: 284 W--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y G+ + P + LDHGVL+VGYG+ K YW++KNSWG WG+
Sbjct: 262 HESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESG------KDYWLVKNSWGTTWGDK 315
Query: 341 GYYKICMGR-NVCGVDSMVS 359
G+ K+ N CG+ S S
Sbjct: 316 GFIKMARNEDNQCGIASASS 335
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 134/323 (41%), Positives = 175/323 (54%), Gaps = 30/323 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y + E +R ++F N + AK QL V G+ K++D+ E
Sbjct: 28 WQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADMLHHE 87
Query: 109 FRRQFLGLNRRLRLPADAQKAP------ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
F G N L A A I P + LP DWR+ GAVTGVKDQG CGSC
Sbjct: 88 FHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGHCGSC 147
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF TG L+SLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 148 WAFSSTGALEGQHFRKTGTLISLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 200
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGI 280
GG++ EK YPY G D SC F+K I A F+ I DE ++A + GP++V I
Sbjct: 201 DNGGIDTEKSYPYEGID-DSCHFNKGTIGATDRGFTDIPQGDEKKLAQAVATIGPVSVAI 259
Query: 281 NAVW--MQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y GV C + LDHGVL+VGYG+ K YW++KNSWG W
Sbjct: 260 DASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDE------NGKDYWLVKNSWGTTW 313
Query: 338 GENGYYKICMG-RNVCGVDSMVS 359
G+ G+ K+ N CG+ + S
Sbjct: 314 GDKGFIKMARNDDNQCGIATASS 336
>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 358
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 142/365 (38%), Positives = 200/365 (54%), Gaps = 33/365 (9%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
+L LSS +LL+L + AS D+ I+ V + + E + +L H FS F
Sbjct: 4 KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y + EE RF VFK NL + + + +F+DLT EF+R LG
Sbjct: 64 HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + + + +P DWR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233
Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGV 292
DGG CKF I V N ++ + DE + A LV+ P++V V + Y GV
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR--PVSVAFEVVHEFRFYKKGV 290
Query: 293 SCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
CG ++H VL VGYG + PYW+IKNSWG WG+NGY+K+ MG+
Sbjct: 291 FTSNTCGNTPMDVNHAVLAVGYGVE-------DDVPYWLIKNSWGGEWGDNGYFKMEMGK 343
Query: 350 NVCGV 354
N+CGV
Sbjct: 344 NMCGV 348
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 178/331 (53%), Gaps = 31/331 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 268
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK L+HGVL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 269 SFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 320
Query: 344 KICMGRNVC-----GVDSMVSSVAAIHTTSS 369
++ MG N C V + V AA T++S
Sbjct: 321 RVVMGVNACLLSEYPVSAHVRESAAPGTSTS 351
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 175/321 (54%), Gaps = 28/321 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K + ++ E +R ++F N + AK QL V G+ K+SD+ E
Sbjct: 27 WQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDMLYHE 86
Query: 109 FRRQFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
F+ G N +R AQ I P N +P DWR HGAVT VKDQG CGSCW+
Sbjct: 87 FKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCGSCWA 146
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS+T ALEG HF G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 147 FSSTAALEGQHFRKAGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDN 199
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
GG++ EK YPY G D SC F KS + A + F + DE+ + + GP++V I+A
Sbjct: 200 GGIDTEKSYPYEGID-DSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAIDA 258
Query: 283 VW--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y GV + P + LDHGVL+VGYG+ YW++KNSWG WG+
Sbjct: 259 SHESFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGL------DYWLVKNSWGTTWGD 312
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY K+ + N CG+ + S
Sbjct: 313 QGYIKMARNQDNQCGIATASS 333
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 178/331 (53%), Gaps = 31/331 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 268
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK L+HGVL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 269 SFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 320
Query: 344 KICMGRNVC-----GVDSMVSSVAAIHTTSS 369
++ MG N C V + V AA T++S
Sbjct: 321 RVVMGVNACLLSEYPVSAHVRESAAPGTSTS 351
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 133/332 (40%), Positives = 178/332 (53%), Gaps = 32/332 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK----IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 268
Query: 283 VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Y GV I GK L+HGVL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 269 SSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 320
Query: 343 YKICMGRNVC-----GVDSMVSSVAAIHTTSS 369
++ MG N C V + V AA T++S
Sbjct: 321 VRVVMGVNACLLSEYPVSAHVRESAAPGTSTS 352
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 137/316 (43%), Positives = 175/316 (55%), Gaps = 28/316 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FK+ F K Y T EE RF +F+ L R ++ + + GV +FSD++ E+ R
Sbjct: 57 FKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEYLR 116
Query: 112 QFLGLNRRLRLPADAQ--KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
GL R R + + + L DWRD G VT VK+QG CGSCWSFS TG+
Sbjct: 117 HN-GLRRGNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGSCWSFSTTGS 175
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVER 228
LEG HF TG+L+SLSEQQLVDC SG+ + GCNGGLM++AFEYI GG+E
Sbjct: 176 LEGQHFRQTGKLISLSEQQLVDC--------SGTFGNEGCNGGLMDNAFEYIKSIGGLEG 227
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINA--VWM 285
E DYPYT G C KS A + + V S DED + L GP++V I+A
Sbjct: 228 EDDYPYTAKQ-GKCHLKKSLFKANDTGCTDVESGDEDALKDALASVGPISVAIDASHASF 286
Query: 286 QTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q+Y GGV C + LDHGVL VGYG+ YW++KNSWGE WGE GY K
Sbjct: 287 QSYDGGVYDEEECSSQNLDHGVLTVGYGTEENGG------DYWLVKNSWGEMWGEEGYIK 340
Query: 345 ICMGR-NVCGVDSMVS 359
+ + N CG+ + S
Sbjct: 341 MSRNKDNQCGIATQAS 356
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 143/376 (38%), Positives = 200/376 (53%), Gaps = 38/376 (10%)
Query: 1 MERLILSSLLLL---LLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNA---EHH-- 52
M R L L++ L +S LA D++ IRQVV + E+ +L H
Sbjct: 1 MSRFSLLLALVVAGGLFASALAGPATFADENP-IRQVVSDGLHELENAILQVVGKTRHAL 59
Query: 53 -FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ F ++ K Y + EE RF VF NL+ + + GV +F+DLT EFRR
Sbjct: 60 SFARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRR 119
Query: 112 QFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LG + + K + TN LP DWR+ G V+ VK+QG CGSCW+FS TGAL
Sbjct: 120 DRLGAAQNC---SATTKGNLKVTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGAL 176
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
E A+ + G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+
Sbjct: 177 EAAYSQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKSNGGLDTEE 229
Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQ 286
YPYTG + G CKF + V N ++ + DE + A LV+ P+++ + +
Sbjct: 230 AYPYTGKN-GLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVR--PVSIAFEVIKGFK 286
Query: 287 TYIGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y GV CG ++H VL VGYG PYW+IKNSWG +WG+NGY+
Sbjct: 287 QYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGV-------PYWLIKNSWGADWGDNGYF 339
Query: 344 KICMGRNVCGVDSMVS 359
K+ MG+N+CG+ + S
Sbjct: 340 KMEMGKNMCGIATCAS 355
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 124/307 (40%), Positives = 170/307 (55%), Gaps = 29/307 (9%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN- 117
K K+Y E + RF++FK NLR T G+ +F+DLT E+R +LG
Sbjct: 52 KHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYRSMYLGART 111
Query: 118 ---RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
RRL + + + LP DWR+ GAV GVKDQG+CGSCW+FS A+EG +
Sbjct: 112 GSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIAAVEGIN 171
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ E+DYPY
Sbjct: 172 QIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPY 223
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWM--QTYIGGV 292
DG ++ K+ + ++ + + +Q V + P++V I A M Q Y GV
Sbjct: 224 NARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEASGMAFQFYESGV 283
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV- 351
CG LDHGV VGYG+ YWI+KNSWG +WGE+GY + M RN
Sbjct: 284 FTGN-CGTALDHGVTAVGYGTE-------NSVDYWIVKNSWGSSWGESGYIR--MERNTG 333
Query: 352 ----CGV 354
CG+
Sbjct: 334 ATGKCGI 340
>gi|358339356|dbj|GAA47436.1| cathepsin L [Clonorchis sinensis]
Length = 236
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 110/240 (45%), Positives = 152/240 (63%), Gaps = 18/240 (7%)
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
++ K PT LP FDWR HG VT VKDQG CGSCW+F+ TG +EG + T +LVS
Sbjct: 8 SNRPKVTSYPTQSLPGSFDWRQHGVVTEVKDQGMCGSCWAFAVTGNIEGQWYKKTKKLVS 67
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQQL+DCD + D CNGG A+E I+K GG+ EKDYPY +C
Sbjct: 68 LSEQQLLDCDKK---------DEACNGGFPEWAYESIVKMGGLMSEKDYPYEAHK-ETCN 117
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP--YICGKY 301
+ I+A +++ +S DE ++AA L ++GP++VG+NA ++Q Y GGVS P +C +
Sbjct: 118 LKPNNISAYINDSVTLSKDEKELAAWLTENGPISVGMNANFLQFYFGGVSHPPHMLCSEQ 177
Query: 302 -LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
LDH VL+VGYG + F ++PYWI+KNSWG +WGE GY++I G CG+++ +S
Sbjct: 178 GLDHAVLLVGYGVTSFW-----QRPYWIVKNSWGRSWGEKGYFRIYRGDGTCGINADATS 232
>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
Length = 313
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 174/321 (54%), Gaps = 36/321 (11%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L E+ F+ F++++ K Y E +R +VF N+ A++ D G T F+D+T
Sbjct: 17 LRYENTFNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTVGATPFADMTN 76
Query: 107 SEFRRQFLG---LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+EF L L ++ PA PI+ DWR+ GAVT VK+Q +CGSCW+
Sbjct: 77 TEFAVSKLCGCMLKPKMTKPA----TPIM--EPAAEAVDWREKGAVTPVKNQASCGSCWA 130
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGA+EG +F++ GEL+SLSEQQLVDCDH+ SGC GGLM AFEY K
Sbjct: 131 FSATGAMEGRNFVANGELISLSEQQLVDCDHQ---------SSGCGGGLMTYAFEY-AKK 180
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA- 282
G+ +E+DYPY D CK DK + + + V GP++V + A
Sbjct: 181 KGMCKEEDYPYHAVD-EDCKDDKCTPVVFPKGYEEVPRFDGAALKQAVSQGPVSVAVEAD 239
Query: 283 -VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+ Q Y GGV CG L+HGVL VGYG+ YWI+KNSWGE+WG+ G
Sbjct: 240 SIVFQMYTGGVIDSSACGTSLNHGVLAVGYGAD-----------YWIVKNSWGESWGDKG 288
Query: 342 YYKICM---GRNVCGVDSMVS 359
Y KI G +CG++ M S
Sbjct: 289 YLKIKYTESGAGICGINQMNS 309
>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
Length = 443
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 132/331 (39%), Positives = 179/331 (54%), Gaps = 31/331 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VK+QGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D+GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCD---------DMDNGCSGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 209 YTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 268
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK L+HGVL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 269 SFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 320
Query: 344 KICMGRNVC-----GVDSMVSSVAAIHTTSS 369
++ MG N C V + V AA T++S
Sbjct: 321 RVVMGVNACLLSEYPVSAHVRESAAPGTSTS 351
>gi|161598418|gb|ABX74953.1| cysteine protease [Leishmania panamensis]
Length = 441
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 172/319 (53%), Gaps = 30/319 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANNPHARFGITKFFDLSEAEFATR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR GAVT V DQGACGSCW+FSA G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWRQMGAVTPVNDQGACGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +++T L++LSEQ+LV CD D GCNGGLM AF+++L K G V
Sbjct: 158 NIESQWYVTTHSLITLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNKNGAV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
YPY +G + +S + A + I S+ED MAA L +GP+A+ ++A
Sbjct: 209 YTGASYPYVSGNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDAS 268
Query: 284 WMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+Y GG+ SC G+ L+HGVL+VGY +G E PYW+IKNSWGENWGE G
Sbjct: 269 AFMSYTGGILTSCD---GRQLNHGVLLVGYNMTG-------EVPYWLIKNSWGENWGEKG 318
Query: 342 YYKICMGRNVCGVDSMVSS 360
Y ++ G N C + +S
Sbjct: 319 YVRVRKGTNECLIQEYPAS 337
>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 503
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 178/331 (53%), Gaps = 31/331 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 98 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 157
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 158 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 217
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 218 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 268
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 269 YTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 328
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK L+HGVL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 329 SFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 380
Query: 344 KICMGRNVC-----GVDSMVSSVAAIHTTSS 369
++ MG N C V + V AA T++S
Sbjct: 381 RVVMGVNACLLSEYPVSAHVRESAAPGTSTS 411
>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 333
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 175/323 (54%), Gaps = 22/323 (6%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D L+A+ ++ ++S + K YA EE D+R V++ N++ +R HG T
Sbjct: 22 DQSLDAQ--WNQWRSTYKKVYAVNEE-DWRRAVWEKNMKMIERHNQEYSQGKHGFTMAMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D T EFR+ G + P+ +PT DW G VT VKDQG CG
Sbjct: 79 AFGDKTNEEFRQLMNGFQSQKHKKGKLFYEPVF--GHIPTSVDWTQKGYVTPVKDQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF+Y
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWR-------EGNEGCNGGLMDNAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
+ GG++ E+ YPYT TD C+++ AA + F I E + + GP++V
Sbjct: 190 VKDNGGLDSEESYPYTATDTQDCRYNPKYSAANDTGFVDIPPQEKALMKAVATVGPISVA 249
Query: 280 INA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
I+A V Q Y G+ C ++HGVL VGYG G P + K YW++KNSWG++W
Sbjct: 250 IDAGQVSFQFYSSGIYFDPACRLTVNHGVLAVGYGFEGTDPDKNK---YWLVKNSWGKSW 306
Query: 338 GENGYYKICMGRNV-CGVDSMVS 359
G +GY KI RN CG+ S
Sbjct: 307 GADGYIKIAKDRNNHCGIARAAS 329
>gi|353441042|gb|AEQ94105.1| putative drought-inducible cysteine proteinase [Elaeis guineensis]
Length = 187
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 115/175 (65%), Positives = 138/175 (78%), Gaps = 7/175 (4%)
Query: 11 LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHL-LNAEHHFSLFKSKFSKTYATQEE 69
+ L +SV +S + +DD +I QVVP E ED L LNAE HFS F +F K+YA ++E
Sbjct: 15 VALSASVASSWPSYAEDDPLIVQVVP---ESDEDELRLNAEAHFSSFLRRFGKSYADEKE 71
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP---ADA 126
H YRF VFKANLRRA+R Q +DPTAVHG+TKFSDLTP+EFRR +LGL RL A +
Sbjct: 72 HAYRFSVFKANLRRARRHQKMDPTAVHGITKFSDLTPAEFRRTYLGLRGGRRLRRALASS 131
Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
+APILPTN+LPTDFDWRDHGAVTGVKDQG+CGSCWSFSA+GALEGA+FL+TG+L
Sbjct: 132 HEAPILPTNNLPTDFDWRDHGAVTGVKDQGSCGSCWSFSASGALEGANFLATGQL 186
>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 330
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 185/334 (55%), Gaps = 32/334 (9%)
Query: 39 GEQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV 95
G S + L + +H+ F+ + + + Y E D R+ FK NL + + V
Sbjct: 12 GIASANRLFSEQHYQNQFTNWMVRLDRAYDVFEFQD-RYNAFKNNLDLIHKWNSQGHSTV 70
Query: 96 HGVTKFSDLTPSEFRRQFLGLNRRL-RLPADAQKAPILPTNDL----PTDFDWRDHGAVT 150
GV +DL+ E+R +LG+ RLP Q+A + N + DWR GAV
Sbjct: 71 LGVNHLADLSNEEYRNLYLGVKVDASRLP---QQAASIKLNKVFAPVAASLDWRSSGAVG 127
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
VKDQG CGSCWSFS TG++EGA+ ++TG SLSEQQL+DC + E GCNG
Sbjct: 128 RVKDQGQCGSCWSFSTTGSIEGANQIATGNFASLSEQQLMDCSRDYGNE-------GCNG 180
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAAN 269
GLM++A +Y++ GG++ E+ YPYT +D +CKF+ + I A +S++ V E +AA
Sbjct: 181 GLMDAAMKYVIAQGGLDTEESYPYTMSDSYTCKFNPANIGAKISSYIDVQRGSETDLAAK 240
Query: 270 LVKHGPLAVGINAVW--MQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPY 326
L K GP++V I+A Q Y GV C Y LDHGVL VGYG+ G Y
Sbjct: 241 LNK-GPVSVAIDASHSSFQLYKSGVYYEPACSSYNLDHGVLAVGYGTEG-------SSNY 292
Query: 327 WIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
WI+KNSWG NWG +GY + + N CG+ SM S
Sbjct: 293 WIVKNSWGPNWGLSGYIWMAKDKSNHCGISSMAS 326
>gi|241062152|gb|ACS66748.1| cysteine protease [Leishmania guyanensis]
Length = 441
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 169/311 (54%), Gaps = 30/311 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANNPHARFGITKFFDLSEAEFATR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR GAVT VKDQGACGSCW+ SA G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWRQMGAVTPVKDQGACGSCWALSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +++T L++LSEQ+LV CD D GCNGGLM AF+++L K G V
Sbjct: 158 NIESQWYVTTHSLITLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNKNGAV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
YPY +G + +S + A + I S+ED MAA L +GP+A+ ++A
Sbjct: 209 YTGASYPYVSGNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDAS 268
Query: 284 WMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+Y GG+ SC G+ L+HGVL+VGY +G E PYW+IKNSWGENWGE G
Sbjct: 269 AFMSYTGGILTSCD---GRQLNHGVLLVGYNMTG-------EVPYWLIKNSWGENWGEKG 318
Query: 342 YYKICMGRNVC 352
Y ++ G N C
Sbjct: 319 YVRVRKGTNEC 329
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 118/295 (40%), Positives = 168/295 (56%), Gaps = 20/295 (6%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
K K + E D RF +FK NLR + + G+TKF+DLT E+R +LG
Sbjct: 48 KHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRL 107
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ + + + + +P DWR GAV VKDQG+CGSCW+FS GA+EG + + T
Sbjct: 108 KRKATKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVT 167
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ E+DYPY G D
Sbjct: 168 GDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVD 219
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIGGVSCPY 296
G + K+ + ++ + ++ ++ + H P++V I Q Y G+
Sbjct: 220 GRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIF-DG 278
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
ICG LDHGV+ VGYG+ K YWI+KNSWG +WGE+GY + M RN+
Sbjct: 279 ICGTDLDHGVVAVGYGTE-------NGKDYWIVKNSWGTSWGESGYIR--MERNI 324
>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
Length = 359
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 137/369 (37%), Positives = 196/369 (53%), Gaps = 31/369 (8%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
R IL S+ LL+L +V + + IR V + E+S +L H F+ F
Sbjct: 5 RTILPSVALLILIAVSTAESIGFYESNPIRMVFDRLLEVEESVVQILGQTRHVLSFARFT 64
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y EE RF +FK NL + + GV +F+D+T EF+R LG
Sbjct: 65 HRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFTDMTWQEFQRTKLGAA 124
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + L LP DWR+ G V+ VKDQG CGSCW+FS TGALE A+ +
Sbjct: 125 QNC--SATLKGTHKLTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQA 182
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPYTG
Sbjct: 183 FGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGE 235
Query: 238 DGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVS 293
D G+CK+ + V N ++ + DE + A L++ P+++ + + Y GV
Sbjct: 236 D-GTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLLR--PVSIAFEVIHSFRLYKSGVY 292
Query: 294 CPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN 350
CG+ ++H VL VGYG PYW+IKNSWG +WG+ GY+K+ MG+N
Sbjct: 293 SDSHCGQTPMDVNHAVLAVGYGIEDGV-------PYWLIKNSWGADWGDKGYFKMEMGKN 345
Query: 351 VCGVDSMVS 359
+CG+ + S
Sbjct: 346 MCGIATCAS 354
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 136/330 (41%), Positives = 178/330 (53%), Gaps = 42/330 (12%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP---TAVH----GVT 99
LN E F +K F K+Y+ E R V++AN + L+D +H G+
Sbjct: 26 LNME--FEAWKRTFGKSYSDAVEEINRRAVWEAN------KMLVDAHNGAGIHSYTLGMN 77
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQG 156
F+DLT EF+R +LG L P + +PT + LP DWR G VT VKDQG
Sbjct: 78 IFADLTHEEFKRFYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQG 137
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCWSFS TG++EG H TG+LVSLSEQ LVDC + GCNGGLM+ A
Sbjct: 138 QCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSK-------AQGNQGCNGGLMDDA 190
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GP 275
F+YI+ G++ E YPYT D G+CKF+ + + A +S+F I+ + N V GP
Sbjct: 191 FQYIITNKGIDTEASYPYTAKD-GTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGP 249
Query: 276 LAVGINAVW--MQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNS 332
++V I+A Q Y GV C LDHGVL GYG+S PYW++KNS
Sbjct: 250 VSVAIDASKNSFQLYTSGVYNEKKCSSTSLDHGVLAAGYGTS-------NGTPYWLVKNS 302
Query: 333 WGENWGENGYYKICMGRNV---CGVDSMVS 359
WG +WG+ GY I M RN CG+ + S
Sbjct: 303 WGSSWGQAGY--IWMSRNANNQCGIATSAS 330
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 135/362 (37%), Positives = 189/362 (52%), Gaps = 37/362 (10%)
Query: 1 MERLILSSLLLLL----LSSVLASAVAVNDDDAMIRQVVP-SDGEQSEDHLLNAEHHFSL 55
M L LS ++LLL +S + ++ D++ I V SD E E +
Sbjct: 1 MGFLKLSPMILLLAMIGVSYAIDMSIISYDENHHISTVSSRSDAE--------VERIYEA 52
Query: 56 FKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+ + K Q E D RF +FK NLR + + G+T+F+DLT E+R
Sbjct: 53 WMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTKNLSYKLGLTRFADLTNDEYRS 112
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+LG R+ + + + LP DWR GAV VKDQG+CGSCW+FS GA+E
Sbjct: 113 MYLGAKPVKRVLKTSDRYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVE 172
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ E D
Sbjct: 173 GINKIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEAD 224
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYI 289
YPY DG + K+ + ++ + + + + H P++V I A Q Y
Sbjct: 225 YPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYS 284
Query: 290 GGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
GV ICG LDHGV+ VGYG+ K YWI++NSWG WGE+GY K M R
Sbjct: 285 SGVF-DGICGTELDHGVVAVGYGTE-------NGKDYWIVRNSWGNRWGESGYIK--MAR 334
Query: 350 NV 351
N+
Sbjct: 335 NI 336
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 184/332 (55%), Gaps = 37/332 (11%)
Query: 44 DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
+HL N + LF+S + SK Y + EE +RF VF+ NL +R + G+ +
Sbjct: 39 EHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNE 98
Query: 101 FSDLTPSEFRRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
F+DLT EF+ ++LGL + R R P+ + + DLP DWR GAV VKDQG
Sbjct: 99 FADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQG 156
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FS A+EG + ++TG L SLSEQ+L+DCD + +SGCNGGLM+ A
Sbjct: 157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDT--------TFNSGCNGGLMDYA 208
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGP 275
F+YI+ GG+ +E DYPY + G C+ K + +S + + ++D+ + H P
Sbjct: 209 FQYIISTGGLHKEDDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQP 267
Query: 276 LAVGINAVW--MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
++V I A Q Y GGV CG LDHGV VGYGSS K Y I+KNSW
Sbjct: 268 VSVAIEASGRDFQFYKGGVFNGK-CGTDLDHGVAAVGYGSS-------KGSDYVIVKNSW 319
Query: 334 GENWGENGYYKICMGRN------VCGVDSMVS 359
G WGE G+ I M RN +CG++ M S
Sbjct: 320 GPRWGEKGF--IRMKRNTGKPEGLCGINKMAS 349
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/313 (38%), Positives = 176/313 (56%), Gaps = 22/313 (7%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
+N + L+K K+ KTY + E + R +++ N +D + V +F+DLT
Sbjct: 23 VNDAEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSSFQLEVNEFADLTA 82
Query: 107 SEFRRQFLGLNR-RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + G + R R + +P DWR G VT VK+Q CGSCW+FS
Sbjct: 83 EEFSSIYNGYGKGRNRENHENTTIYRYTGGAIPDSVDWRTKGLVTPVKNQKQCGSCWAFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG+LEGAH TG+LVSLSEQ LVDCD + D GC GGLM +AF+YI + G
Sbjct: 143 TTGSLEGAHAKKTGKLVSLSEQNLVDCDKK---------DHGCQGGLMTTAFKYIEENKG 193
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVS-NFSVISSDEDQMAANLVKHGPLAVGINAVW 284
++ E+ YPY + G C+F K I A V + S++++D + + + + GP++V ++A
Sbjct: 194 IDTEESYPYKAKN-GRCEFKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAMDASH 252
Query: 285 --MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ P IC + LDHGVL+VGYG + + YW++KNSWG+NWG G
Sbjct: 253 SSFQLYKSGIYDPKICSSRKLDHGVLVVGYG-------KEDGEEYWLVKNSWGKNWGMEG 305
Query: 342 YYKICMGRNVCGV 354
Y+KI +N+CG+
Sbjct: 306 YFKIASKKNLCGI 318
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 136/335 (40%), Positives = 184/335 (54%), Gaps = 38/335 (11%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
+S+D +++ + + K K Y E RF +FK NLR + T G+TK
Sbjct: 19 RSDDEVMSI---YKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQNRTYKVGLTK 75
Query: 101 FSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQ 155
F+DLT E+R FLG RRL + + D LP DWR GAV +KDQ
Sbjct: 76 FADLTNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQ 135
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G+CGSCW+FS A+EG + + TGEL+SLSEQ+LVDCD ++GCNGGLM+
Sbjct: 136 GSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDR--------FYNAGCNGGLMDY 187
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHG 274
AF++I+ GG++ EKDYPY G D +C DK K A ++ F + +++ V H
Sbjct: 188 AFQFIINNGGLDTEKDYPYLGND-DTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQ 246
Query: 275 PLAVGINAVWM--QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNS 332
P++V I A M Q Y GV CG LDHGV++VGYG+ K YW+++NS
Sbjct: 247 PVSVAIEASGMALQFYQSGVFTGE-CGTALDHGVVVVGYGTE-------KGLDYWLVRNS 298
Query: 333 WGENWGENGYYKICMGRNV-------CGVDSMVSS 360
WG WGE+GY K M RNV CG+ +M SS
Sbjct: 299 WGTEWGEHGYIK--MQRNVRDTYTGRCGI-AMESS 330
>gi|378943060|gb|AFC76271.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 174/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPYT +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYTSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
cathepsin; Flags: Precursor
gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
gi|225484|prf||1304284A cathepsin,prestalk
Length = 376
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/345 (37%), Positives = 178/345 (51%), Gaps = 46/345 (13%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ + KF++ Y++ E + R+ +FK+N+ D V G+ F+D+T E+R+
Sbjct: 36 FTEWTLKFNRQYSSSEFSN-RYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRK 94
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+LG +L DL P DWR AVT +KDQG CGSCWSFS TG
Sbjct: 95 TYLGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTG 154
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+ EGAH L T +LVSLSEQ LVDC PEE + GC+GGLMN+AF+YI+K G++
Sbjct: 155 STEGAHALKTKKLVSLSEQNLVDC---SGPEE----NFGCDGGLMNNAFDYIIKNKGIDT 207
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQ 286
E YPYT G +C F+KS I A + + I++ + N +HGP++V I+A Q
Sbjct: 208 ESSYPYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQ 267
Query: 287 TYIGGVSCPYICGKY-LDHGVLIVGYGSSG------------------------------ 315
Y G+ C LDHGVL+VGYG G
Sbjct: 268 LYTSGIYYEPKCSPTELDHGVLVVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDDS 327
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
+R K YWI+KNSWG +WG GY + R N CG+ S+ S
Sbjct: 328 SDSVRPKANNYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSS 372
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 129/309 (41%), Positives = 174/309 (56%), Gaps = 39/309 (12%)
Query: 62 KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y E + R ++FK NL+ + L + T G+T+F+DLT E + F+ +R L
Sbjct: 11 KNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE-PKDFMKADRYL 69
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
D LP + DWR GAV VKDQG CGSCW+FSA GA+EG + + TGE
Sbjct: 70 YKEGDI----------LPDEIDWRAKGAVVPVKDQGNCGSCWAFSAVGAVEGINQIKTGE 119
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
L+SLS+Q+L+DCD G ++GC GG+MN AFE+I+ GG+E ++DYPYT TD G
Sbjct: 120 LISLSDQELIDCDR-------GFVNAGCEGGVMNYAFEFIINNGGIESDQDYPYTATDLG 172
Query: 241 SCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGGVSCPY 296
C DK V + ++ ++++ V H P+ V I A + Y GV
Sbjct: 173 VCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKSGVFTG- 231
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV----- 351
CG YLDHGV++VGYG+S + YWII+NSWG NWGENGY K + RN+
Sbjct: 232 TCGIYLDHGVVVVGYGTS-------SGEDYWIIRNSWGLNWGENGYVK--LQRNIDDSFG 282
Query: 352 -CGVDSMVS 359
CGV M S
Sbjct: 283 KCGVAMMPS 291
>gi|394331805|gb|AFN27125.1| cysteine protease [Leishmania major]
Length = 348
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 173/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKACADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVSTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 181/314 (57%), Gaps = 29/314 (9%)
Query: 54 SLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEF 109
+LF+S K+Y E + RF++FK NLR + L++ G+ KF+DLT E+
Sbjct: 43 TLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEY 102
Query: 110 RRQFLGL---NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
R ++ G+ + R ++ A + + L LP DWR+ GAV VKDQG+CGSCW+FS
Sbjct: 103 RSKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFST 162
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
A+EG + ++TG+L++LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG+
Sbjct: 163 ISAVEGINQIATGKLITLSEQELVDCDR--------SYNEGCNGGLMDYAFEFIINNGGI 214
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW-- 284
+ + DYPYTG DG ++ K+ + ++ + + ++ + P++V I A
Sbjct: 215 DTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRD 274
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y G+ CG LDHGV++VGYG+ K YWI++NSWG +WGENGY +
Sbjct: 275 FQFYDSGIFTG-KCGIALDHGVVVVGYGTE-------NGKDYWIVRNSWGADWGENGYLR 326
Query: 345 ICMG----RNVCGV 354
+ G +CG+
Sbjct: 327 MERGISSKTGICGI 340
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 136/327 (41%), Positives = 183/327 (55%), Gaps = 38/327 (11%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDL 104
A ++ L+K K+Y EEH +R ++F ++ + R L T G+ KF+D+
Sbjct: 15 ASANWDLYKKVHGKSYGHDEEH-FRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDM 73
Query: 105 TPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
T EFR F GL + R QK L LPT DWR+ G VT VK+QG CGS
Sbjct: 74 TSEEFR-NFKGLKFDATKTKRNGTRFQKE--LLGEALPTQVDWREKGYVTPVKNQGQCGS 130
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG+LEG HF +TG+LVSLSEQ LVDC ++GCNGGLM++ F YI
Sbjct: 131 CWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRV-------EGNNGCNGGLMDNGFTYI 183
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVG 279
+ GG++ E+ YPYTG D G C F+++ + A V F V DE + A + GP++V
Sbjct: 184 QQNGGIDTEESYPYTGKD-GDCAFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVA 242
Query: 280 INAV--WMQTYIGGV----SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
I+A Q Y GV SC + LDHGVL+VGYG+ YW++KNSW
Sbjct: 243 IDASNDSFQYYKEGVYDEPSCSF---SQLDHGVLVVGYGTENGV-------DYWLVKNSW 292
Query: 334 GENWGENGYYKICMGR-NVCGVDSMVS 359
G WG++GY K+ + N CG+ SM S
Sbjct: 293 GPTWGQDGYIKMMRNKENQCGIASMAS 319
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/359 (36%), Positives = 191/359 (53%), Gaps = 37/359 (10%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTYATQE 68
L+LS+ L A+ D +++ S +HL + + LF+S K SKTY + E
Sbjct: 11 LILSATLFITYAIAHDFSIVGY--------SPEHLASMDKTIELFESWMSKHSKTYRSIE 62
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
E +RF +F NL+ + G+ +F+DL+ EF+ ++LGL ++
Sbjct: 63 EKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRSSRG 122
Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
DLP DWR GAVT VK+QG+CGSCW+FS A+EG + + TG L SLSEQ+
Sbjct: 123 FSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 182
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
L+DCD S ++GC GGLM+ AF+YI+ G+ +E+DYPY +G + +
Sbjct: 183 LIDCDR--------SFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQF 234
Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHGV 306
+S + + ++++Q + H P++V I A Q Y GG+ CG +DHGV
Sbjct: 235 EVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGR-CGTQMDHGV 293
Query: 307 LIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGVDSMVS 359
VGYGSS + Y I+KNSWG WGENGY I M RN +CG++ M S
Sbjct: 294 TAVGYGSS-------EGTDYIIVKNSWGPKWGENGY--IRMKRNTGKPEGLCGINQMAS 343
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 119/315 (37%), Positives = 176/315 (55%), Gaps = 39/315 (12%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSE 108
F FK + KTY Q E RF +F N+R + L + G+ KF+D++ E
Sbjct: 26 FQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEE 85
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTN-------DLPTDFDWRDHGAVTGVKDQGACGSC 161
F+ L A + P L T ++P+ DWR G VTGVKDQG CGSC
Sbjct: 86 FKTM---------LTLSASRKPTLETTSYVKTGVEIPSSVDWRKEGRVTGVKDQGDCGSC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG+ EGA+ +G+LVSLSEQQL+DC C +GC+GG ++ F+Y++
Sbjct: 137 WAFSITGSTEGAYARKSGKLVSLSEQQLIDC---CT-----DTSAGCDGGSLDDNFKYVM 188
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGI 280
K G++ E+ Y Y G D G+CK++ + + VS ++ I + DED + + GP++VG+
Sbjct: 189 K-DGLQSEESYTYKGED-GACKYNVASVVTKVSKYTSIPAEDEDALLEAVATVGPVSVGM 246
Query: 281 NAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
+A ++ +Y G+ C L+H +L VGYG+ K YWIIKNSWG +WGE
Sbjct: 247 DASYLSSYDSGIYEDQDCSPAGLNHAILAVGYGTE-------NGKDYWIIKNSWGASWGE 299
Query: 340 NGYYKICMGRNVCGV 354
GY+++ G+N CG+
Sbjct: 300 QGYFRLARGKNQCGI 314
>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
Length = 338
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 119/328 (36%), Positives = 176/328 (53%), Gaps = 28/328 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
L N+E F F +K+ K YA E RF VFKANL R + +A G+ +SDL+
Sbjct: 30 LSNSEVLFDEFVTKYGKVYANDAERKSRFDVFKANLAIINERNAQEESATFGINFYSDLS 89
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---------LPTDFDWRDHGAVTGVKDQG 156
+E R+ G + L D +K T LP F+WRD AVT VK Q
Sbjct: 90 SNELLRKQTGF--KTALHNDNEKKSKYCTRRVITGPSTRLLPEAFNWRDSDAVTSVKQQR 147
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FSA +E +++ + V LSEQQ+VDCD ++GCNGGLM+ A
Sbjct: 148 DCGSCWAFSAVANIESQYYIKNKQYVDLSEQQIVDCD---------PINNGCNGGLMSWA 198
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
EY++++GGV+ E+DY Y G + G CK + + + S +E+++ LV +GP+
Sbjct: 199 MEYVMRSGGVQLEEDYQYVGNE-GVCKNNSANVVQISGCVSYDLRNEERLRELLVSNGPI 257
Query: 277 AVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
+V I+ + + Y G++ L+H VL+VGYG PYW+ KNSWG +
Sbjct: 258 SVAIDVMDVTNYQSGIAKHCSVAHGLNHAVLLVGYGVQ-------NNTPYWVFKNSWGSD 310
Query: 337 WGENGYYKICMGRNVCGVDSMVSSVAAI 364
WGENGY+++ N CG+ + ++ A +
Sbjct: 311 WGENGYFRVLRDVNSCGMLNQYAATAIL 338
>gi|6649593|gb|AAF21470.1|U85983_1 cysteine proteinase [Clonorchis sinensis]
Length = 259
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 120/272 (44%), Positives = 157/272 (57%), Gaps = 25/272 (9%)
Query: 93 TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD---FDWRDHGAV 149
TA +GVT+FSDLT EF+ ++L R+R + P D+ D FDWR+HGAV
Sbjct: 5 TAHYGVTQFSDLTSEEFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAV 60
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
V DQG CGSCW+FS G + G F TG L++LSEQQLVDCD+ D GC+
Sbjct: 61 GPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDY---------LDDGCD 111
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GG + I K GG+E DYPYTG GG C DKSK A V+ +++ E A
Sbjct: 112 GGYPPQTYTAIQKMGGLELASDYPYTGV-GGICHMDKSKFVAYVNGSTILPLSEKVQAQK 170
Query: 270 LVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWI 328
L GPL+ +NA +Q Y GG+ P C ++H VL VGYG KPYWI
Sbjct: 171 LRAIGPLSSALNADTLQLYKGGIMRPKWCDPAGVNHAVLTVGYGVQ-------NGKPYWI 223
Query: 329 IKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
+KNSWGE++GE GY++I G CG++S+V++
Sbjct: 224 VKNSWGEDFGEEGYFRIYRGDGTCGINSIVTT 255
>gi|157864843|ref|XP_001681130.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124424|emb|CAJ02280.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 174/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPYT T G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYTSTFGYVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWG++WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGKDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/362 (36%), Positives = 189/362 (52%), Gaps = 37/362 (10%)
Query: 1 MERLILSSLLLLLLS-----SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSL 55
M L LS ++LLL ++ S ++ +++ + + SD E E +
Sbjct: 1 MGFLKLSPMILLLAMIGVSYAMDMSIISYDENHHITTETSRSDSE--------VERIYEA 52
Query: 56 FKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+ + K Q E D RF +FK NLR + + G+T+F+DLT E+R
Sbjct: 53 WMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRS 112
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+LG R+ + + + LP DWR GAV VKDQG+CGSCW+FS GA+E
Sbjct: 113 MYLGAKPTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVE 172
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ E D
Sbjct: 173 GINKIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEAD 224
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYI 289
YPY DG + K+ + ++ + + + + H P++V I A Q Y
Sbjct: 225 YPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYS 284
Query: 290 GGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
GV +CG LDHGV+ VGYG+ K YWI++NSWG WGE+GY K M R
Sbjct: 285 SGVF-DGLCGTELDHGVVAVGYGTE-------NGKDYWIVRNSWGNRWGESGYIK--MAR 334
Query: 350 NV 351
N+
Sbjct: 335 NI 336
>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
cysteine proteinase A-1; Flags: Precursor
gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 354
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 138/367 (37%), Positives = 198/367 (53%), Gaps = 39/367 (10%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
LL + V+ V A+I Q P D+ + A H+ FK + K + E
Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPP-----VDNFV-ASAHYGSFKKRHGKAFGGDAEEG 60
Query: 72 YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
+RF FK N++ A +P A + V+ KF+DLTP EF + +L + R D K
Sbjct: 61 HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYARHLKD-HKED 119
Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + P+ DWRD GAVT VK+QG CGSCW+FSA G +EG S LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
LV CD + D GCNGGLM+ A +I+++ G V E YPY T GG +
Sbjct: 180 MLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228
Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY- 301
D+ ++ A ++ F + DE+++A + K GP+AV ++A Q Y GGV +C +
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVS--LCLAWS 286
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDS--MVS 359
L+HGVLIVG+ + + PYWI+KNSWG +WGE GY ++ MG N C + + + +
Sbjct: 287 LNHGVLIVGFNKNA-------KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNYPVSA 339
Query: 360 SVAAIHT 366
+V + HT
Sbjct: 340 TVESPHT 346
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 172/315 (54%), Gaps = 22/315 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSE 108
+S +K+ K Y EE +R V+K N++ ++ H T F D+T E
Sbjct: 29 WSQWKATHGKLYGMDEE-GWRREVWKKNMKMIRQHNWEHSQGKHSFTVAMNGFGDMTNEE 87
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
F++ GL + +AP+ +P+ DWR+ G VT VKDQG CGSCW+FSATG
Sbjct: 88 FKQVMNGLQMQKHKKGKMFQAPLFAK--IPSSVDWREKGYVTPVKDQGPCGSCWAFSATG 145
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
ALEG F TG+LVSLSEQ LVDC + GCNGGLMN+AF+Y+ GG++
Sbjct: 146 ALEGQMFRKTGKLVSLSEQNLVDCSQ-------AEGNEGCNGGLMNNAFQYVKDNGGLDS 198
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQ 286
E+ YPY D SCK+ AA + F I E + + GP++VGI+A Q
Sbjct: 199 EESYPYHAQD-ESCKYKPQDSAANDTGFFDIPQQEKALMVAVATKGPISVGIDASHFTFQ 257
Query: 287 TYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Y G+ P + LDHGVL++GYG+ I K YWI+KNSWG NWG +GY K+
Sbjct: 258 FYHEGIYYDPDCSSEDLDHGVLVIGYGTEIGQSIN---KTYWIVKNSWGANWGIDGYIKM 314
Query: 346 CMGR-NVCGVDSMVS 359
R N CG+ +M S
Sbjct: 315 AKDRKNHCGIATMAS 329
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 176/318 (55%), Gaps = 26/318 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA-VHGVTKFSDLTPSEFR 110
+ + +++ + Y E +RF+VFKAN R V G +F+DLT EF
Sbjct: 58 RYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFA 117
Query: 111 RQFLGLNRRLRLPADAQKAPILPTN-------DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+ GL + +P+ A++ P + D DWR GAVT VK+QG CG CW+
Sbjct: 118 AMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWA 177
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSA GA+EG ++TG LVSLSEQQ++DCD E G + GCNGG M++AF+Y++
Sbjct: 178 FSAVGAMEGLIMITTGNLVSLSEQQILDCD-----ESDG--NQGCNGGYMDNAFQYVINN 230
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN-- 281
GGV E YPY+ G+C+ + AA +S F + S ++ AN V + P++VG++
Sbjct: 231 GGVTTEDAYPYSAVQ-GTCQ--NVQPAATISGFQDLPSGDENALANAVANQPVSVGVDGG 287
Query: 282 AVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+ Q Y GG+ CG ++H V +GYG+ + YWI+KNSWG WGENG
Sbjct: 288 SSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADD------QGTQYWILKNSWGTGWGENG 341
Query: 342 YYKICMGRNVCGVDSMVS 359
+ ++ MG CG+ +M S
Sbjct: 342 FMQLQMGVGACGISTMAS 359
>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
Length = 354
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 176/316 (55%), Gaps = 30/316 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
F+ F S+F K+Y ++EE R+ +F NLR R+ ++ L T V F+D T EF+
Sbjct: 55 FARFVSRFGKSYQSEEEMKERYEIFSQNLRFIRSHNKKRLPYTL--SVNHFADWTWEEFK 112
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
R LG + + L LP DWR G V+ VKDQG+CGSCW+FS TGAL
Sbjct: 113 RHRLGAAQNCSATLNGNHK--LTDAVLPPTKDWRKEGIVSSVKDQGSCGSCWTFSTTGAL 170
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
E A+ + G+ +SLSEQQLVDC + + GC+GGL + AFEYI GG+E E+
Sbjct: 171 EAAYAQAFGKSISLSEQQLVDCAGPFN-------NFGCHGGLPSQAFEYIKYNGGLETEE 223
Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQ 286
YPYTG D G CKF +A V N ++ + DE + A V+ P++V V
Sbjct: 224 AYPYTGKD-GVCKFSAENVAVQVLDSVNITLGAEDELKHAVAFVR--PVSVAFQVVNGFH 280
Query: 287 TYIGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y GV CG ++H VL VGYG PYW+IKNSWGE+WGENGY+
Sbjct: 281 FYENGVFTSDTCGSTSQDVNHAVLAVGYGVENGV-------PYWLIKNSWGESWGENGYF 333
Query: 344 KICMGRNVCGVDSMVS 359
K+ +G+N+CGV + S
Sbjct: 334 KMELGKNMCGVATCAS 349
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 178/321 (55%), Gaps = 24/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H++L+K SK Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 25 DEHWNLWKDWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHTYSLGMNHFGDMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G +L+ + + + N L P DWRD G VT VKDQG CGSCW+
Sbjct: 84 HEEFRQIMNGY--KLKSQRKLRGSLFMEPNFLEAPRSVDWRDKGYVTPVKDQGQCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG HF TG LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 142 FSTTGAMEGQHFRKTGTLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
GG++ E+ YPY GTD G C +D S +A + F V S E + + GP++V I+A
Sbjct: 195 GGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSGSERALMKAVASVGPVSVAIDA 254
Query: 283 VW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C + LDHGVL+VGY GF K YWI+KNSW ENWG+
Sbjct: 255 GHESFQFYHSGIYYDKECSSEELDHGVLVVGY---GFEGKDVDGKKYWIVKNSWSENWGD 311
Query: 340 NGY-YKICMGRNVCGVDSMVS 359
GY Y +N CG+ + S
Sbjct: 312 KGYIYMAKDKKNHCGIATAAS 332
>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
Length = 358
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 140/375 (37%), Positives = 199/375 (53%), Gaps = 39/375 (10%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNA------EHHFS 54
M R + L+ L S++LA A D+ +I Q V + E LL HF+
Sbjct: 1 MARFLAFLALVFLSSAILARANHAFDEANLI-QSVTERIDSLETSLLGVLGQTRNALHFA 59
Query: 55 LFKSKFSKTYATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F ++ K Y + EE RF +F NL R RR L P + G+ +++D++ EFR
Sbjct: 60 RFAHRYGKRYQSVEEMKLRFAIFMENLELIRSTNRRGL--PYKL-GINRYADMSWEEFRA 116
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
LG + A + + LP DWR+ G V+ VKDQG+CGSCW+FS TGALE
Sbjct: 117 SRLGAAQNC--SATLKGNHKMTDELLPKTKDWREDGIVSPVKDQGSCGSCWTFSTTGALE 174
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
A+ +TG+ +SLSEQQLVDC + + + GCNGGL + AFEYI GG++ E+
Sbjct: 175 AAYTQATGKGISLSEQQLVDCAYAFN-------NFGCNGGLPSQAFEYIKYNGGLDTEES 227
Query: 232 YPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQT 287
YPY G + G C F + V N ++ + DE A LV+ P+++ V +
Sbjct: 228 YPYAGVN-GFCHFKPENVGVKVVESVNITLGAEDELLHAVGLVR--PVSIAFEVVSGFRF 284
Query: 288 YIGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Y GGV CG+ ++H VL VGYG PYW+IKNSWGE WG +GY+K
Sbjct: 285 YKGGVYTSDTCGRTQMDVNHAVLAVGYGVE-------NGVPYWLIKNSWGEEWGVDGYFK 337
Query: 345 ICMGRNVCGVDSMVS 359
+ +G+N+CG+ + S
Sbjct: 338 MELGKNMCGIATCAS 352
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 143/370 (38%), Positives = 198/370 (53%), Gaps = 47/370 (12%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSK 62
S L+ + S++L SA+A D I P + L + E LF+S + SK
Sbjct: 11 FSLLVAISASALLCSALA---RDFSIVGYTP-------EQLTSTEKLLELFESWMSEHSK 60
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---- 118
Y + EE +RF VF+ NL +R + G+ +F+DLT EF+ ++LGL +
Sbjct: 61 VYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFS 120
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R R P+ + + DLP DWR GAV VKDQG CGSCW+FS A+EG + ++T
Sbjct: 121 RKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITT 178
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L SLSEQ+L+DCD + +SGCNGGLM+ AF+YI+ GG+ +E DYPY +
Sbjct: 179 GNLSSLSEQELIDCDT--------TFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYL-ME 229
Query: 239 GGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCP 295
G C+ K + +S + + ++D+ + H P++V I A Q Y GGV
Sbjct: 230 EGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNG 289
Query: 296 YICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN----- 350
CG LDHGV VGYGSS K Y I+KNSWG WGE G+ I M RN
Sbjct: 290 Q-CGTDLDHGVAAVGYGSS-------KGSDYVIVKNSWGPRWGEKGF--IRMKRNTGKPE 339
Query: 351 -VCGVDSMVS 359
+CG++ M S
Sbjct: 340 GLCGINKMAS 349
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 135/355 (38%), Positives = 204/355 (57%), Gaps = 33/355 (9%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIR--QVVPSDG-EQSEDHLLNAEHHFSLFKSKFSKTY 64
S+L L +V+++A A +D ++I Q P+ G +SED + + F + K K+Y
Sbjct: 5 SILFTFLFAVVSAAAAAAEDMSIITYDQQHPAKGLVRSEDEV---KEMFESWLVKHGKSY 61
Query: 65 ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLGLNR---RL 120
+E D RF++F+ NL+ + L+ + G+ +F+D+T E+R +LG R R
Sbjct: 62 NAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEEYRTGYLGAKRDASRN 121
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ + + + + + LP DWR+ GAVTGVKDQG+CGSCW+FS A+EG + L+TG
Sbjct: 122 MVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAFSTIAAVEGVNQLATGN 181
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG- 239
L+SLSEQ+LVDCD + + GCNGG M AF++I+K GG++ E+DYPYTG DG
Sbjct: 182 LISLSEQELVDCDRK--------INQGCNGGDMGYAFQFIIKNGGIDSEEDYPYTGKDGK 233
Query: 240 -GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPY 296
S + + +K+ A++ + + + ++ V + P++V I A Q Y G+
Sbjct: 234 CDSYRQNNAKV-ASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDFQLYSSGIFTG- 291
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG LDHGV VGYG+ YWI+KNSWG+ WGE GY + M RNV
Sbjct: 292 SCGTDLDHGVAAVGYGTENGV-------DYWIVKNSWGDYWGEKGYVR--MQRNV 337
>gi|15593255|gb|AAL02223.1|AF410883_1 cysteine protease CP19 precursor [Frankliniella occidentalis]
Length = 334
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/334 (39%), Positives = 175/334 (52%), Gaps = 30/334 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
+PSD + + H+ FK+ +KTYA E YR +VFK N +R AK L
Sbjct: 18 IPSD--------MEIQAHWESFKATHAKTYANAVEEAYRAKVFKENAIRIAKHNDLFASG 69
Query: 94 AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
V G +++D+ E + G L+ + + DWR GA T
Sbjct: 70 EVTFKVGYNQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAAT 129
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
+KDQG CGSCWSFSATG+LEG FL LVSLSEQ LVDC + E GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
GLM+SAFEY+ GG++ E+ YPYT DG SC + + A + + V + E +
Sbjct: 183 GLMDSAFEYVKSNGGIDTEESYPYTAVDGDSCLYRAANNAGVNTGYKDVQAKSESALRDA 242
Query: 270 LVKHGPLAVGINAV-W-MQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPY 326
+ K GP++V I+A W Q Y G+ C YLDHGVL VGYGS + K +
Sbjct: 243 VEKVGPVSVAIDASNWSFQMYSSGIYYESACSSDYLDHGVLAVGYGS------EWPNKEF 296
Query: 327 WIIKNSWGENWGENGYYKICMG-RNVCGVDSMVS 359
WI+KNSWG +WGE GY K+ +N CG+ + S
Sbjct: 297 WIVKNSWGTSWGEEGYIKMARNKKNNCGIATEAS 330
>gi|343472970|emb|CCD15012.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 382
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 170/314 (54%), Gaps = 26/314 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAIDWRKKGAVTPVKDQGQCDSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC GG + AF++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TNDFGCGGGFSDPAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G DKS + A + + + DE+ +A L K+GP+A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKNGPVAIAVDATS 268
Query: 285 MQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q+Y GGV SC K ++ VL+VGY + + PYWIIKNSW + WGE GY
Sbjct: 269 FQSYTGGVLTSC---ISKEMNSAVLLVGYDDTS-------KPPYWIIKNSWSKGWGEKGY 318
Query: 343 YKICMGRNVCGVDS 356
+I G N C V +
Sbjct: 319 IRIEKGTNQCLVKN 332
>gi|375073984|gb|AFA34859.1| cathepsin L-like protein [Trypanosoma rangeli]
Length = 467
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 168/316 (53%), Gaps = 24/316 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
HF+ FK + K Y + E +R VFK NL A+ +P A GVT FSDLT EFR
Sbjct: 37 HFAAFKQRHGKVYRSAAEEAFRLGVFKENLLLARLHAAANPHASFGVTPFSDLTREEFRS 96
Query: 112 QF---LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
++ + A + P DWR GAVT VKDQG CGSCW+FS G
Sbjct: 97 RYHNAAAHFAAAQKRARVPVEVEVEVGGAPAAVDWRARGAVTAVKDQGECGSCWAFSTIG 156
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+EG L+ L SLSEQ LV CD+ D+GC+GGLM++AF++I+ G V
Sbjct: 157 NIEGQWHLAGNPLTSLSEQMLVSCDNA---------DNGCDGGLMDNAFDWIVGKNNGTV 207
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E Y Y G S K D S + A +S + DED+MAA L +GPLA+ ++A
Sbjct: 208 YTEASYSYVSGGGNSQKCDMSGHVVGAVISGHVDLPKDEDKMAAWLAANGPLAIAVDATS 267
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
+Y GGV I + LDHGV++VGY S PYWIIKNSWG +WGE GY +
Sbjct: 268 FMSYTGGVLTNCISDQ-LDHGVVLVGYNDS-------SNPPYWIIKNSWGADWGEGGYIR 319
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C V++ S
Sbjct: 320 IQKGTNQCLVNNYACS 335
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 189/354 (53%), Gaps = 30/354 (8%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQ---SEDHLLNAEHHFSLFKSK 59
+L+ S+ ++L L+ ++ S+ AM ++ D S + + K
Sbjct: 2 KLLNSATVILFLTMIVVSS-------AMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVK 54
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
K + E D RF +FK NLR + + G+TKF+DLT E+R +LG +
Sbjct: 55 HGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLK 114
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ + + + + +P DWR GAV VKDQG+CGSCW+FS GA+EG + + TG
Sbjct: 115 RKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTG 174
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+L++LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E+DYPY G DG
Sbjct: 175 DLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDG 226
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIGGVSCPYI 297
+ K+ + + + ++ ++ + H P++V I Q Y G+ I
Sbjct: 227 RCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIF-DGI 285
Query: 298 CGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG LDHGV+ VGYG+ K YWI+KNSWG +WGE+GY + M RN+
Sbjct: 286 CGTDLDHGVVAVGYGTE-------NGKDYWIVKNSWGTSWGESGYIR--MERNI 330
>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
Length = 337
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 130/369 (35%), Positives = 203/369 (55%), Gaps = 43/369 (11%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
++ +L+LLL L SAV + D QVV + + ++ +A +F F S+++K Y+
Sbjct: 1 MNKILILLL---LVSAVLTSHD-----QVVAVTIKPNLYNINSAPLYFEKFISQYNKQYS 52
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+++E YR+ +F+ N+ + + +AV+ + +F+D+T +E +NR L +
Sbjct: 53 SEDEKKYRYNIFRHNIESINAKNSRNDSAVYKINRFADMTKNEV------VNRHTGLASG 106
Query: 126 AQKAPILPT--------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
A T P +FDWR++ VT VKDQG CG+CW+F+ GALE + +
Sbjct: 107 DIGANFCETIVVDGPGQRQRPANFDWRNYNKVTSVKDQGMCGACWAFAGLGALESQYAIK 166
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
L+ L+EQQLVDCD D GC+GGL+++A+E I+ GGVE+E DYPY
Sbjct: 167 YDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMHIGGVEQEYDYPYKAV 217
Query: 238 DGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKH-GPLAVGINAVWMQTYIGGVSCP 295
C K A V N + + E+++ +L++H GP+A+ ++AV + Y GGV
Sbjct: 218 R-LPCAVKPHKFAVGVRNCYRYVLLSEERL-EDLLRHVGPIAIAVDAVDLTDYYGGV-IS 274
Query: 296 YICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVD 355
+ L+H VL+VGYG PYW IKNSWG ++GENGY +I G N CG+
Sbjct: 275 FCENNGLNHAVLLVGYGIE-------NNVPYWTIKNSWGSDYGENGYVRIRRGVNSCGMI 327
Query: 356 SMVSSVAAI 364
+ ++S A I
Sbjct: 328 NELASSAQI 336
>gi|71084302|gb|AAZ23596.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 129/313 (41%), Positives = 169/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FS
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSV 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ L +LSEQQLV CD DSGC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHRLTALSEQQLVSCD---------DMDSGCGGGLMTQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY T G C + A + + +I S+E MAA L K GP+++G++
Sbjct: 207 TMFTEDSYPYVSTFGYVPECTNSSQLVPGARIDGYVMIESNETVMAAWLAKSGPISIGVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GGV SC GK L+HGVL+VGY +G E PYW+IKNSWGENWGE
Sbjct: 267 ASSFMSYHGGVLTSC---AGKQLNHGVLLVGYNMTG-------EVPYWVIKNSWGENWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVTMGVNAC 329
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 133/321 (41%), Positives = 184/321 (57%), Gaps = 43/321 (13%)
Query: 55 LFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFR 110
L+KS + K Y E + RF +FK NLR + T G+ KF+DLT E+R
Sbjct: 45 LYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYR 104
Query: 111 RQFLGLN----RRL---RLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
+FLG RRL ++P+ A +A ++LP +WRDHGAV+ VKDQG+CGSC
Sbjct: 105 AKFLGTRTDPRRRLMKSKIPSSRYAHRA----GDNLPDSVNWRDHGAVSRVKDQGSCGSC 160
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FSA A+EG + + +GEL+SLSEQ+LVDCD S D+GCNGGLM+ AF++I+
Sbjct: 161 WAFSAIAAVEGINKIVSGELISLSEQELVDCDR--------SYDAGCNGGLMDYAFQFII 212
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
GG++ EKDYPY G + K+ ++ + + ++E+ + V H P+++ I
Sbjct: 213 DNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVSIAIE 271
Query: 282 A--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A Q Y GV CG LDHGV+ VGYGS + YWI++NSWG NWGE
Sbjct: 272 AGGRAFQLYESGVFNGE-CGLALDHGVVAVGYGSDDNG------QDYWIVRNSWGGNWGE 324
Query: 340 NGYYKICMGRNV------CGV 354
NGY I M RN+ CG+
Sbjct: 325 NGY--IRMERNINANTGKCGI 343
>gi|348564702|ref|XP_003468143.1| PREDICTED: cathepsin F-like [Cavia porcellus]
Length = 462
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 181/314 (57%), Gaps = 26/314 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + +++TY ++EE +R VF N+ A++ Q LD TA +GVTKFSDLT EFR
Sbjct: 165 FKKFVATYNRTYESKEETQWRLSVFTRNMILAQKIQALDRGTAQYGVTKFSDLTEEEFRT 224
Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR P+ + + + P ++DWR GAVT VK+QG CGSCW+FS TG +
Sbjct: 225 IYL--NPLLREHPSKTMRQAKIVHDSAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 282
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL G L+SLSEQ+L+DCD D C GGL +A+ I GG+E E
Sbjct: 283 EGQWFLKKGTLLSLSEQELLDCD---------KVDKACMGGLPINAYSAIKSLGGLETED 333
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY Y G +C F K +++ +S +E +AA L GP+++ INA MQ Y
Sbjct: 334 DYSYQG-HMEACNFSAKKAKVYINDSVELSKNEQYLAAWLAVKGPISIAINAFGMQFYRH 392
Query: 291 GVSCPY--ICGK-YLDHGVLIVGYGS-SGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
G++ P +C ++DH +LIVGYG SG P+W IKNSWG +WGE GYY +
Sbjct: 393 GIAHPLQPLCSPWFIDHAMLIVGYGKRSGV--------PFWAIKNSWGTDWGEEGYYYLH 444
Query: 347 MGRNVCGVDSMVSS 360
G CGV+ M SS
Sbjct: 445 RGSRSCGVNVMASS 458
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 125/327 (38%), Positives = 173/327 (52%), Gaps = 27/327 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTK 100
++ A F+ FKS++ K Y + YR +V+K N + + R + + T +
Sbjct: 15 YIAEAASEFTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVTYKMALNH 74
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKA-PILPTND--LPTDFDWRDHGAVTGVKDQGA 157
+D+ P EF FLG NR LR + P D + + DWR GA++ VKDQG
Sbjct: 75 LADMHPREFMATFLGFNRSLRATNKVPEGIPFRHNKDAVIQKEVDWRQKGAISPVKDQGH 134
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALE FL G VSLSEQ L+DC ++GC GGLM AF
Sbjct: 135 CGSCWAFSSTGALEAHTFLKKGRRVSLSEQNLIDCS-------LNYGNNGCEGGLMEQAF 187
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+Y+ G++ E+ YPY G D C+F K+ + A + F I S DE + + GPL
Sbjct: 188 QYVRDNDGIDTEEAYPYEGED-SECRFKKNNVGATDAGFVTIPSGDEQALMEAVATQGPL 246
Query: 277 AVGINAV--WMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
++ I+A Q Y GV P LDHGVL+VGYG K++ YW++KNSW
Sbjct: 247 SIAIDASNPSFQFYSEGVYYEPECSSAQLDHGVLLVGYGVE-------KDQKYWLVKNSW 299
Query: 334 GENWGENGYYKICMGR-NVCGVDSMVS 359
E WGENGY K+ + N CG+ + S
Sbjct: 300 SEQWGENGYIKMARNKDNNCGIATQAS 326
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 117/286 (40%), Positives = 164/286 (57%), Gaps = 20/286 (6%)
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
EE D RF +FK NLR + + G+T+F+DLT E+R +LG + R+ +
Sbjct: 68 EEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNEEYRSIYLGAKSKKRVLKTSD 127
Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + +P DWR GAV VKDQG+CGSCW+FS GA+EG + + TG+L+SLSEQ
Sbjct: 128 RYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQ 187
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
+LVDCD S + GCNGGLM+ AFE+I+K GG++ E+DYPY DG + K+
Sbjct: 188 ELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQTRKN 239
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHG 305
+ + + + + + + P++V I A Q Y GV ICG LDHG
Sbjct: 240 AKVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVF-DGICGTELDHG 298
Query: 306 VLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
V+ VGYG+ K YWI++NSWG +WGE+GY K M RN+
Sbjct: 299 VVAVGYGTE-------NGKDYWIVRNSWGGSWGESGYIK--MARNI 335
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 173/321 (53%), Gaps = 30/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ K+Y + E RF++F N L A+ + V G+ +F DL P
Sbjct: 26 QWEAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R F G R A + P N LP DWR+ GAVT VK+QG CGSCW+
Sbjct: 86 EFARMFNGY--RGARTAGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG+LEG HFL TG LVSLSEQ LVDC E G + GC GGLM++AF+YI
Sbjct: 144 FSTTGSLEGQHFLKTGVLVSLSEQNLVDC-----SETFG--NHGCEGGLMDNAFQYIKAN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINA 282
GG++ EK YPY D G C+F K + A + F I ED + + GP++V I+A
Sbjct: 197 GGIDTEKSYPYEAED-GECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVSVAIDA 255
Query: 283 VW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y GV C + LDHGVL+VGYG K YW++KNSW E+WG+
Sbjct: 256 SHSSFQLYSEGVYDETECSSEQLDHGVLVVGYGVE-------DGKKYWLVKNSWAESWGD 308
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
NGY K+ + N CG+ S S
Sbjct: 309 NGYIKMSRDKDNQCGIASAAS 329
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 182/321 (56%), Gaps = 43/321 (13%)
Query: 55 LFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFR 110
L+KS + K Y E + RF +FK NLR + T G+ KF+DLT E+R
Sbjct: 44 LYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYR 103
Query: 111 RQFLGLN----RRL---RLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
+FLG RRL ++P+ A +A ++LP DWRDHGAV+ VKDQG+CGSC
Sbjct: 104 AKFLGTRTDPRRRLMKSKIPSSRYAHRA----GDNLPDSVDWRDHGAVSPVKDQGSCGSC 159
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS +EG + + +GELVSLSEQ+LVDCD S D+GCNGGLM+ AF++I+
Sbjct: 160 WAFSTIATVEGINKIVSGELVSLSEQELVDCDR--------SYDAGCNGGLMDYAFQFIM 211
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
GG++ EKDYPY G + K+ ++ + + ++E+ + V H P+++ I
Sbjct: 212 DNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVSIAIE 270
Query: 282 A--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A Q Y GV CG LDHGV+ VGYG+ + YWI++NSWG NWGE
Sbjct: 271 AGGRAFQLYESGVFNGE-CGLALDHGVVAVGYGTDDNG------QDYWIVRNSWGSNWGE 323
Query: 340 NGYYKICMGRNV------CGV 354
NGY I M RN+ CG+
Sbjct: 324 NGY--IRMERNINANTGKCGI 342
>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
Length = 354
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 138/367 (37%), Positives = 199/367 (54%), Gaps = 39/367 (10%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
LL + V+ V A+I Q P+ D+ + A H+ FK + SK + E
Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPA-----VDNFV-ASAHYGSFKKRHSKAFGGDAEEG 60
Query: 72 YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
+RF FK N++ A +P A + V+ KF+DLTP EF + +L + D K
Sbjct: 61 HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYTSHLKD-HKED 119
Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + P+ DWRD GAVT VK+QG CGSCW+FSA G +EG S LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
LV CD + D GCNGGLM+ A +I+++ G V E YPY T GG +
Sbjct: 180 MLVSCD---------NVDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228
Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY- 301
D+ ++ A ++ F + DE+++A + K GP+AV ++A Q Y GGV +C +
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIADWVEKRGPVAVAVDATTWQLYFGGVVS--LCLAWS 286
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDS--MVS 359
L+HGVLIVG+ + + PYWI+KNSWG +WGE GY ++ MG N C + + + +
Sbjct: 287 LNHGVLIVGFNKNA-------KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNYPVSA 339
Query: 360 SVAAIHT 366
+V + HT
Sbjct: 340 TVESPHT 346
>gi|332326585|gb|AEE42616.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 170/313 (54%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VKBQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKBQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--G 224
G +E ++ L LSEQQLV CD + DSGC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHRLXXLSEQQLVSCDDK---------DSGCXGGLMTQAFEWLLRXMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + +I S+E MAA L K GP+++G++
Sbjct: 207 TMFTEDSYPYVSSTGDVPECTNSSELVPGARIDGYVMIESNETVMAAWLAKSGPISIGVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC GK+L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYESGVLTSC---AGKHLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVTMGVNAC 329
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 173/307 (56%), Gaps = 33/307 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRR 111
FKS +SK+Y ++ R F+ANL + +H GV +F+DLT EF
Sbjct: 1 FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
++ +P + P + + DWR GAVT +K+QG CGSCWSFS TG+ E
Sbjct: 61 LYVPSKFNRTMPYNTVYLPATSEDSV----DWRTKGAVTPIKNQGQCGSCWSFSTTGSTE 116
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREK 230
GAH ++TG LVSLSEQQLVDC SGS + GCNGGLM+ AF+YI+ G++ E+
Sbjct: 117 GAHAIATGNLVSLSEQQLVDC--------SGSFGNQGCNGGLMDDAFKYIISNKGLDTEE 168
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINA--VWMQT 287
DYPYT DG K ++K AA +S++S V ++EDQ+AA + K GP++V I A Q
Sbjct: 169 DYPYTAQDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAK-GPVSVAIEADQSGFQL 227
Query: 288 YIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
Y GV CG LDHGVL+VGY YWI+KNSWG WG GY +
Sbjct: 228 YKSGV-FDGNCGTNLDHGVLVVGY-----------TDDYWIVKNSWGTTWGVEGYINMKR 275
Query: 348 GRNVCGV 354
G + G+
Sbjct: 276 GVSASGI 282
>gi|394331743|gb|AFN27094.1| cysteine protease [Leishmania major]
Length = 348
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 173/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|340503366|gb|EGR29962.1| hypothetical protein IMG5_145110 [Ichthyophthirius multifiliis]
Length = 1095
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 107/275 (38%), Positives = 159/275 (57%), Gaps = 23/275 (8%)
Query: 93 TAVHGVTKFSDLTPSEFRRQFLGLNRR--LRLPADAQK--APILP----TNDLPTDFDWR 144
+AV G TKFSDL+P +F ++ L LN++ L++ + +K PI ++P FDWR
Sbjct: 831 SAVFGHTKFSDLSPQQFAQKHLKLNQKKLLQVKKETKKLTTPIQQDITVEENVPEQFDWR 890
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
D VT K Q CGSCW+FS TG +E + + +LV SEQQLVDCD
Sbjct: 891 DRNVVTEPKYQNTCGSCWTFSTTGVIESQYAIKHQKLVPFSEQQLVDCD---------DI 941
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDED 264
+ GC+GGLM A++Y+ ++GG+E +DY CKFD +K+ A + + I DE+
Sbjct: 942 NDGCHGGLMTDAYKYLQQSGGLEFAEDYGDYKNKKEKCKFDLNKVQAKIKEWQQIDEDEE 1001
Query: 265 QMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
+ L ++GP+A G+NA +Q Y G+ P C ++H +LIVGYG + +
Sbjct: 1002 IIKKQLYQNGPIAAGVNARLLQFYKSGIFDPKECDSDINHAILIVGYG------VEKDGQ 1055
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
YWIIKN WG++WG +GY+K+ G+ CG+ + S
Sbjct: 1056 KYWIIKNQWGKDWGMDGYFKLARGKKQCGIHTYAS 1090
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 170/311 (54%), Gaps = 25/311 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
+KS K+Y+ E R +++ NL + KR D + + DLT EFR +LG
Sbjct: 30 WKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYLG 89
Query: 116 LNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
+ + P+N +P+ DW G VTGVK+QG CGSCW+FS TG++EG H
Sbjct: 90 VRAHHNSTKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQH 149
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYP 233
F TG LVSLSEQ L+DC SGS ++GC GGLM++AF YI GG++ E YP
Sbjct: 150 FRKTGSLVSLSEQNLIDC--------SGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYP 201
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQ-MAANLVKHGPLAVGINAVWMQTYIGGV 292
Y G GSC F S + A V+ + I +Q + + + GP++V ++A Q Y GV
Sbjct: 202 YLGQQ-GSCHFSSSHVGARVTGYQDIPQGSEQALQSAVATVGPVSVAVDASQWQFYSSGV 260
Query: 293 -SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-- 349
PY LDHGVL++GYG+ + + YW++KNSWG +WG GY I M R
Sbjct: 261 YDNPYCSSTQLDHGVLVIGYGN-------YNGQDYWLVKNSWGYSWGVEGY--IMMSRNK 311
Query: 350 -NVCGVDSMVS 359
N CG+ S S
Sbjct: 312 NNQCGIASSAS 322
>gi|2677828|gb|AAB97142.1| cysteine protease [Prunus armeniaca]
Length = 358
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 143/371 (38%), Positives = 203/371 (54%), Gaps = 38/371 (10%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEH---HFSLF 56
L+LS+ L+L+ S A+A + D+ IR V SDG EQ +L HF+ F
Sbjct: 6 LVLSAALVLVAISCGAAASSF-DESNPIRLV--SDGLRELEQQVVQVLGNSRRALHFARF 62
Query: 57 KSKFSKTYATQEEHDYRFRVFKAN--LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
++ K Y + EE R+ +F N L R+ ++ L T V +F+D + EFRRQ L
Sbjct: 63 AHRYGKKYESVEEMKLRYEIFSENKKLIRSTNKKGLPYTL--AVNRFADWSWEEFRRQRL 120
Query: 115 GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
G + A + + L LP +WR+ G VT VKDQG CGSCW+FS TGALE A+
Sbjct: 121 GAAQNC--SATTKGSHELTDAVLPESKNWREEGIVTPVKDQGHCGSCWTFSTTGALEAAY 178
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E YP
Sbjct: 179 VQAFRKQISLSEQQLVDC--------AGAFNNFGCHGGLPSQAFEYIKYNGGLDTEAAYP 230
Query: 234 YTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGG 291
Y GTD G+CKF + V + ++ DE ++ + P++V V + Y G
Sbjct: 231 YVGTD-GACKFSAENVGVQVLDSVNITLGDEQELKHAVAFVRPVSVAFQVVKSFRIYKSG 289
Query: 292 VSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
V CG ++H VL VGYG G P+W+IKNSWGE+WG+NGY+K+ G
Sbjct: 290 VYTSDTCGSSPMDVNHAVLAVGYGEEGGV-------PFWLIKNSWGESWGDNGYFKMEFG 342
Query: 349 RNVCGVDSMVS 359
+N+CGV + S
Sbjct: 343 KNMCGVATCAS 353
>gi|157864851|ref|XP_001681134.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124428|emb|CAJ02284.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|378943050|gb|AFC76266.1| cathepsin L-like protease [Leishmania major]
gi|378943052|gb|AFC76267.1| cathepsin L-like protease [Leishmania major]
gi|378943054|gb|AFC76268.1| cathepsin L-like protease [Leishmania major]
gi|378943058|gb|AFC76270.1| cathepsin L-like protease [Leishmania major]
gi|394331737|gb|AFN27091.1| cysteine protease [Leishmania major]
gi|394331741|gb|AFN27093.1| cysteine protease [Leishmania major]
gi|394331747|gb|AFN27096.1| cysteine protease [Leishmania major]
gi|394331749|gb|AFN27097.1| cysteine protease [Leishmania major]
gi|394331751|gb|AFN27098.1| cysteine protease [Leishmania major]
gi|394331753|gb|AFN27099.1| cysteine protease [Leishmania major]
gi|394331755|gb|AFN27100.1| cysteine protease [Leishmania major]
gi|394331757|gb|AFN27101.1| cysteine protease [Leishmania major]
gi|394331759|gb|AFN27102.1| cysteine protease [Leishmania major]
gi|394331761|gb|AFN27103.1| cysteine protease [Leishmania major]
gi|394331763|gb|AFN27104.1| cysteine protease [Leishmania major]
gi|394331765|gb|AFN27105.1| cysteine protease [Leishmania major]
gi|394331767|gb|AFN27106.1| cysteine protease [Leishmania major]
gi|394331769|gb|AFN27107.1| cysteine protease [Leishmania major]
gi|394331771|gb|AFN27108.1| cysteine protease [Leishmania major]
gi|394331773|gb|AFN27109.1| cysteine protease [Leishmania major]
gi|394331775|gb|AFN27110.1| cysteine protease [Leishmania major]
gi|394331777|gb|AFN27111.1| cysteine protease [Leishmania major]
gi|394331779|gb|AFN27112.1| cysteine protease [Leishmania major]
gi|394331781|gb|AFN27113.1| cysteine protease [Leishmania major]
gi|394331783|gb|AFN27114.1| cysteine protease [Leishmania major]
gi|394331785|gb|AFN27115.1| cysteine protease [Leishmania major]
gi|394331787|gb|AFN27116.1| cysteine protease [Leishmania major]
gi|394331789|gb|AFN27117.1| cysteine protease [Leishmania major]
gi|394331791|gb|AFN27118.1| cysteine protease [Leishmania major]
gi|394331793|gb|AFN27119.1| cysteine protease [Leishmania major]
gi|394331795|gb|AFN27120.1| cysteine protease [Leishmania major]
gi|394331797|gb|AFN27121.1| cysteine protease [Leishmania major]
gi|394331799|gb|AFN27122.1| cysteine protease [Leishmania major]
gi|394331801|gb|AFN27123.1| cysteine protease [Leishmania major]
gi|394331803|gb|AFN27124.1| cysteine protease [Leishmania major]
Length = 348
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 173/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|378943046|gb|AFC76264.1| cathepsin L-like protease [Leishmania major]
gi|378943056|gb|AFC76269.1| cathepsin L-like protease [Leishmania major]
gi|394331745|gb|AFN27095.1| cysteine protease [Leishmania major]
Length = 348
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 173/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|394331739|gb|AFN27092.1| cysteine protease [Leishmania major]
Length = 348
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 173/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHCRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
Length = 467
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 135/368 (36%), Positives = 183/368 (49%), Gaps = 53/368 (14%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L L+++L+++ V A+ +++ ++ + Q F+ FK K +
Sbjct: 8 LSLAAVLVVMACLVPAATASLHAEETLASQ-------------------FAEFKQKHGRV 48
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------FLGL 116
Y + E +R VF+ANL A+ +P A GVT FSDLT EFR + F
Sbjct: 49 YGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAA 108
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R R+P D + P DWR+ GAVT VK+QG CGSCW+F+A G +EG FL
Sbjct: 109 EERARVPVDVEVV------GAPAAKDWREEGAVTAVKNQGICGSCWAFAAIGNIEGQWFL 162
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPY 234
+ L LSEQ LV CD+ +SGC GGL + AFE+I++ G V E YPY
Sbjct: 163 AGNPLTRLSEQMLVSCDNT---------NSGCGGGLSSKAFEWIVQENNGAVYTEDSYPY 213
Query: 235 TGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
G CK + A ++ + DE Q+AA+ GPL+V ++A Y GGV
Sbjct: 214 HSCIGIKLPCKDSDRTVGATITGHVELPQDEAQIAASGAVKGPLSVAVDASSWFFYTGGV 273
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+ K L H VL+VGY S PYWIIKNSW +WGE GY +I G N C
Sbjct: 274 LTNCV-SKRLSHAVLLVGYNDSAAV-------PYWIIKNSWTTHWGEGGYIRIAKGSNQC 325
Query: 353 GVDSMVSS 360
V VSS
Sbjct: 326 LVKEEVSS 333
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 175/320 (54%), Gaps = 30/320 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
FK + K Y E +R ++F N + AK Q V V K++DL EFR+
Sbjct: 32 FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 112 QFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
G N ++LR D+ K I P + LP DWR GAVT VKDQG CGSCW+F
Sbjct: 92 LMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI G
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV 283
G++ EK YPY D SC F+K I A F+ I DE +MA + GP+AV I+A
Sbjct: 205 GIDTEKSYPYEAID-DSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDAS 263
Query: 284 W--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV + P + LDHGVL+VGYG+ YW++KNSWG WG+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGD------DYWLVKNSWGTTWGDK 317
Query: 341 GYYKICMGR-NVCGVDSMVS 359
G+ K+ + N CG+ S S
Sbjct: 318 GFIKMLRNKDNQCGIASASS 337
>gi|15824691|gb|AAL09443.1| cysteine protease [Leishmania donovani]
Length = 443
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 174/313 (55%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E LVSLSEQQLV CD + D+GCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWARVGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYG 206
Query: 225 GVEREKDYPYTGTDGGSCK-FDKSKI--AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
V EK YPYT +G + + SK+ A + + +I S+E MAA L ++GP+A+ ++
Sbjct: 207 IVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYQSGVLTSC---AGDALNHGVLLVGYNKTGGV-------PYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG+N C
Sbjct: 317 KGYVRVAMGKNAC 329
>gi|157864845|ref|XP_001681131.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124425|emb|CAJ02281.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 172/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E M A L K+GP+++ +
Sbjct: 207 TVSTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|1581746|prf||2117247B Cys protease:ISOTYPE=2
Length = 467
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 163/316 (51%), Gaps = 24/316 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK + K Y + E +R VFK NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHASFGVTPFSDLTREEFRS 96
Query: 112 QF---LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
++ + + P DWR GAVT +KDQG CGSCW+FS G
Sbjct: 97 RYHNAAAHFAAAQKRVRVPVEVEVEVGGAPAAVDWRARGAVTAIKDQGGCGSCWAFSTIG 156
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+EG L+ L LSEQ LV CD+ D+GC+GGLM+SAF++I+ G V
Sbjct: 157 NIEGQWHLAGNPLTGLSEQMLVSCDNA---------DNGCDGGLMDSAFDWIVGQNNGSV 207
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E Y Y G D +C + A +S + DED+MAA L +GPLA+ ++A
Sbjct: 208 YTEASYSYVSGGGDSQTCNMSSHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATS 267
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
+Y GGV + + LDHGV++VGY S PYWIIKNSWG +WGE GY +
Sbjct: 268 FMSYTGGVLTNCVSDQ-LDHGVVLVGYNDS-------SNPPYWIIKNSWGADWGEEGYIR 319
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C V + S
Sbjct: 320 IQKGTNQCLVKNYACS 335
>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 126/326 (38%), Positives = 173/326 (53%), Gaps = 21/326 (6%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-- 99
++ H + + + +KS + K YA EE D+R V++ N++ +R HG T
Sbjct: 18 AQKHDESLDEQWYQWKSLYKKPYAANEE-DWRRAVWEKNMKMIERHNQEYSQGKHGFTMT 76
Query: 100 --KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
F D+T EFR+ G + R+ P+ +P DW G VT VKDQG
Sbjct: 77 MNAFGDMTNEEFRQVMNGFQNQKRIQGKLLYEPVF--GHIPKSVDWTQKGYVTPVKDQGQ 134
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF
Sbjct: 135 CGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRR-------EGNEGCNGGLMDNAF 187
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+YI GG++ E+ YPYT D C+++ AA + F I E + + GP++
Sbjct: 188 QYIKDNGGLDSEESYPYTAMDKQDCRYNPKYSAANDTGFVDIPPQEKALMKAVATVGPIS 247
Query: 278 VGINA--VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
V ++A Q Y G+ C K L+HGVL+VGY GF I YW++KNSWG
Sbjct: 248 VAVDAGHESFQFYKSGIYYDSNCSSKDLNHGVLVVGY---GFEGIDSANNRYWLVKNSWG 304
Query: 335 ENWGENGYYKICMGRNV-CGVDSMVS 359
WG +GY K+ RN CG+ + S
Sbjct: 305 TGWGTDGYIKMAKDRNNHCGIATAAS 330
>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
Length = 362
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 148/377 (39%), Positives = 208/377 (55%), Gaps = 48/377 (12%)
Query: 1 MERLILSSLLLLLLSSVLASAV-----AVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH- 52
M RL + + +L+LL +V + + D++ IR V S D E S L+ H
Sbjct: 1 MARLSVVAAVLILLCAVASGEADHHFRSSFDEENPIRLVSDSIRDLESSVLRLIGDTRHA 60
Query: 53 --FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSE 108
F+ F ++ K+Y T +E RF +F NL+ R+ R+ L T V +F+D T E
Sbjct: 61 HSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTL--AVNQFADWTWEE 118
Query: 109 FRRQFLGL--NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
FRR LG N L + + ++ LP DWR+ G V+ +KDQG CGSCW+FS
Sbjct: 119 FRRHRLGAAQNCSATLKGNHKLTDVI----LPETKDWREDGIVSPIKDQGHCGSCWTFST 174
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGG 225
TGALE A+ + G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG
Sbjct: 175 TGALEAAYAQAFGKGISLSEQQLVDC--------AGAFNNFGCHGGLPSQAFEYIKYNGG 226
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINA 282
++ E+ YPYTG D G+CKF I V N ++ + DE + A V+ P++V
Sbjct: 227 LDTEEAYPYTGLD-GTCKFSSENIGVQVLDSVNITLGAEDELKHAVAFVR--PVSVAFEV 283
Query: 283 VW-MQTYIGGVSCPYICGKY---LDHGVLIVGYG-SSGFAPIRFKEKPYWIIKNSWGENW 337
V + Y GV CG ++H VL VGYG G A YW+IKNSWGENW
Sbjct: 284 VHDFRFYKKGVYTSGTCGSTPMDVNHAVLAVGYGVEDGVA--------YWLIKNSWGENW 335
Query: 338 GENGYYKICMGRNVCGV 354
G+NGY+K+ +G+N+CGV
Sbjct: 336 GDNGYFKMELGKNMCGV 352
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 116/295 (39%), Positives = 167/295 (56%), Gaps = 20/295 (6%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
K K + E D RF +FK NLR + + G+TKF+DLT E+R +LG
Sbjct: 48 KHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRL 107
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ + + + + + +P DWR GAV VKDQG+CGSCW+FS GA+EG + + T
Sbjct: 108 KRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVT 167
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E+DYPY G D
Sbjct: 168 GDLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVD 219
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIGGVSCPY 296
G + K+ + + + ++ ++ + H P++V I Q Y G+
Sbjct: 220 GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIF-DG 278
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
ICG LDHGV+ VGYG+ K YWI+KNSWG +WGE+GY + M RN+
Sbjct: 279 ICGTDLDHGVVAVGYGTE-------NGKDYWIVKNSWGTSWGESGYIR--MERNI 324
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 130/322 (40%), Positives = 174/322 (54%), Gaps = 32/322 (9%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFS 102
L+ E + FK K K Y+ +EE+ R +F+ NL+ + T H GV +F+
Sbjct: 18 LSFESQWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTIESHNQEADTGKHSYWLGVNQFA 76
Query: 103 DLTPSEFRRQFLG---LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
D+T +E+ Q +G + L +P + DWRD G VT +KDQG CG
Sbjct: 77 DMTHAEYLNQVIGGCLITSNLTKTGSRATYRYMPNMQVNDTVDWRDKGLVTDIKDQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG+LEG H +TG LVSLSEQ LVDC + + GC GG M+ F+Y
Sbjct: 137 SCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQ-------EGNKGCEGGDMDQGFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAV 278
I++ G++ E+ YPY + CKFD S I A +S+F+ V S DED + GP++V
Sbjct: 190 IIQNKGIDTEQCYPYKAKN-HRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPISV 248
Query: 279 GINAVW--MQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
GI+A Q Y GV + C LDHGVL+VGYG+ G K YW++KNSWG
Sbjct: 249 GIDASHQSFQFYSSGVYNEFECSSTKLDHGVLVVGYGTYG-------SKDYWLVKNSWGT 301
Query: 336 NWGENGYYKICMGR---NVCGV 354
WG GY I M R N CGV
Sbjct: 302 VWGNEGY--IMMSRNKDNQCGV 321
>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
Length = 324
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 118/325 (36%), Positives = 180/325 (55%), Gaps = 28/325 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F +F+K Y ++ E RF++F+ NL + D A + + KFSDL+
Sbjct: 21 LLKAPNYFEEFVLQFNKNYGSEIEKLRRFKIFQHNLNEIINKNQNDSAAKYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+
Sbjct: 81 KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPFEFDWRRLNKVTNVKNQGVCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+A +LE + +L+ LSEQQ++DCD S D+GCNGGL+++AFE +
Sbjct: 137 CWAFAALASLESQFAMKHNQLIDLSEQQMIDCD---------SVDAGCNGGLLHTAFEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVG 279
+K GGV+ EKDYPY + +C+ + +K V + + I E+++ L GP+ +
Sbjct: 188 IKMGGVQLEKDYPYEAAN-NNCRMNSNKFLVKVKDCYRYIIVYEEKLKDLLRSVGPIPMA 246
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE
Sbjct: 247 IDAADIVNYKQGI-IKYCLNSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGE 298
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAI 364
+GY+++ N CG+ + ++S A I
Sbjct: 299 SGYFRLQQNINACGMRNELASTAVI 323
>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 168/316 (53%), Gaps = 22/316 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+F+ G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC G M++AF++I+ G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFMDTAFKWIVSPNDGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G +C + A + + I +E+ +A L K+GP+A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPACNKSGKVVGANIDDHVHILDNENAIAEWLAKNGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y GGV I K ++ L+VGY + + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQRYTGGVLTSCI-SKEVNSAALLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C + VSS
Sbjct: 321 IEKGTNQCRMKDYVSS 336
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 180/319 (56%), Gaps = 26/319 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA-------VHGVTKFSDL 104
+ L+K K Y++++E YR +++AN ++ +L+ A + F+DL
Sbjct: 22 EWELWKRTNGKDYSSEKEELYRQTIWEAN-----KKIVLEHNANADKWGWTLEMNAFADL 76
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
SEF + G R R ++A + + N LP DWR GAVT VK+Q CGSCW+F
Sbjct: 77 ESSEFAAMYNGYRRSAR-KSNATRYHVPTGNALPDTVDWRTKGAVTPVKNQKQCGSCWAF 135
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG+LEG FL G L SLSEQQLVDC + + GC GGLM++AF+YI G
Sbjct: 136 STTGSLEGQTFLKKGTLPSLSEQQLVDCSDKYG-------NHGCQGGLMDNAFKYIEANG 188
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDE-DQMAANLVKHGPLAVGINAV 283
G++ E YPY + G C+F +S +AA + + I D+ D + + GP++V ++A
Sbjct: 189 GIDSEASYPYEAKN-GKCRFQQSAVAATCTGYKDIPHDDIDGLQDAVANVGPISVAMDAS 247
Query: 284 W--MQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV P +C LDHGVL VGYG+ + + +EKPYW++KNSWG +WG+
Sbjct: 248 HSSFQLYAAGVYDPLLCSSTRLDHGVLAVGYGTEP-SGLFHEEKPYWLVKNSWGPDWGQQ 306
Query: 341 GYYKICMGRNVCGVDSMVS 359
GY+KI N CG+ + S
Sbjct: 307 GYFKIVRKDNKCGIATDAS 325
>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 357
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 140/363 (38%), Positives = 198/363 (54%), Gaps = 33/363 (9%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
+L LSS +LL+L + AS D+ I+ V + + E + +L H FS F
Sbjct: 4 KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y + EE RF VFK NL + + + +F+DLT EF+R LG
Sbjct: 64 HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + + + +P DWR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233
Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGV 292
DGG CKF I V N ++ + DE + A LV+ P++V V + Y GV
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR--PVSVAFEVVHEFRFYKKGV 290
Query: 293 SCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
CG ++H VL VGYG + PYW+IKNSWG WG+NGY+K+ MG+
Sbjct: 291 FTSNTCGNTPMDVNHAVLAVGYGVE-------DDVPYWLIKNSWGGEWGDNGYFKMEMGK 343
Query: 350 NVC 352
N+C
Sbjct: 344 NMC 346
>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 168/316 (53%), Gaps = 22/316 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+F+ G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC G M++AF++I+ G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFMDTAFKWIVSPNDGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G +C + A + + I +E+ +A L K+GP+A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPACNKSGKVVGANIRDHVHILDNENAIAEWLAKNGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y GGV I K ++ L+VGY + + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQRYTGGVLTSCI-SKEVNSAALLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C + VSS
Sbjct: 321 IEKGTNQCRMKDYVSS 336
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 191/348 (54%), Gaps = 36/348 (10%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNA---EHH---FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
D+ IRQ+V + E+ +L H F+ F ++ K Y T EE RF VF N
Sbjct: 29 DENPIRQIVSDGLHELENGILQVVGKTRHALLFARFAHRYGKRYETVEEIKQRFEVFLDN 88
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPT 139
L+ + + GV +F+D+T EFRR LG + + K + TN LP
Sbjct: 89 LKMIRSHNKKGLSYKLGVNEFTDITWDEFRRDRLGAAQNC---SATTKGNLKLTNVVLPE 145
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DWR+ G V+ VK+QG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC
Sbjct: 146 TKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYGQAFGKGISLSEQQLVDC------- 198
Query: 200 ESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
+G+ ++ GCNGGL + AFEYI GG++ E+ YPYTG + G CKF + V N
Sbjct: 199 -AGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKN-GLCKFSSENVGVKVIDSVN 256
Query: 256 FSVISSDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVSCPYICGKY---LDHGVLIVGY 311
++ + DE + A LV+ P+++ + + Y GV CG ++H VL VGY
Sbjct: 257 ITLGAEDELKYAVALVR--PVSIAFEVIKGFKQYKSGVYTSTECGNTPMDVNHAVLAVGY 314
Query: 312 GSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
G PYW+IKNSWG +WG+NGY+K+ MG+N+CG+ + S
Sbjct: 315 GVENGV-------PYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCAS 355
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 131/326 (40%), Positives = 176/326 (53%), Gaps = 30/326 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLT 105
+ + FK + K Y + E +R ++F N + AK +L + V + K++D+
Sbjct: 24 QEQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHKVAKXNKLYEMGLVSYKLKINKYADML 83
Query: 106 PSEFRRQFLGLNRRLRLP-----ADAQKAP-ILPTN-DLPTDFDWRDHGAVTGVKDQGAC 158
EF G NR P D Q A I P N P + DWR+HGAVT VKDQG C
Sbjct: 84 HHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAPANVKFPENVDWREHGAVTXVKDQGHC 143
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCWSFSATGALEG HF T +LVSLSEQ LVDC + + GCNGGLM++AF+
Sbjct: 144 GSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDC-------STKFGNDGCNGGLMDNAFK 196
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
Y+ G++ E YPY D C ++ A F + + DE+++ A + GP++
Sbjct: 197 YVKYNHGIDTEASYPYHADD-EKCHYNPKTSGATDRGFVDIPTGDEEKLMAAVATVGPVS 255
Query: 278 VGINAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
V I+A Q Y GV P + LDHGVL+VGYG+ + YWI+KNSWG
Sbjct: 256 VAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGTDE------NGQDYWIVKNSWG 309
Query: 335 ENWGENGYYKICMGR-NVCGVDSMVS 359
E+WGE GY K+ R N CG+ + S
Sbjct: 310 ESWGEQGYIKMARNRDNNCGIATQAS 335
>gi|157864853|ref|XP_001681135.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|157864857|ref|XP_001681137.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124429|emb|CAJ02285.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124431|emb|CAJ02287.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 173/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
Length = 377
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 140/363 (38%), Positives = 198/363 (54%), Gaps = 33/363 (9%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
+L LSS +LL+L + AS D+ I+ V + + E + +L H FS F
Sbjct: 4 KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y + EE RF VFK NL + + + +F+DLT EF+R LG
Sbjct: 64 HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + + + +P DWR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233
Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGV 292
DGG CKF I V N ++ + DE + A LV+ P++V V + Y GV
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR--PVSVAFEVVHEFRFYKKGV 290
Query: 293 SCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
CG ++H VL VGYG + PYW+IKNSWG WG+NGY+K+ MG+
Sbjct: 291 FTSNTCGNTPMDVNHAVLAVGYGVE-------DDVPYWLIKNSWGGEWGDNGYFKMEMGK 343
Query: 350 NVC 352
N+C
Sbjct: 344 NMC 346
>gi|339896953|ref|XP_003392238.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|14349351|gb|AAC38832.2| cysteine protease [Leishmania chagasi]
gi|17384031|emb|CAD12393.1| cysteine proteinase [Leishmania infantum]
gi|321398984|emb|CBZ08377.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 443
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 174/313 (55%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E + LVSLSEQQLV CD + D+GCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYG 206
Query: 225 GVEREKDYPYTGTDGGSCK-FDKSKI--AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
V EK YPYT +G + + SK+ A + + +I S+E MAA L ++GP+A+ ++
Sbjct: 207 IVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYQSGVLTSCA---GDALNHGVLLVGYNKTGGV-------PYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVVMGLNAC 329
>gi|343472974|emb|CCD15016.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 166/316 (52%), Gaps = 22/316 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSATG
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSATG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC G + AF +I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTD---------DLGCRDGFPDIAFNWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G DKS + A + + ++ DED +A L + GP A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLARDEDMIAEWLARKGPAAITVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y GGV I K ++ L+VGY + + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQRYTGGVLTSCI-SKEMNSAALLVGYDDTS-------KPPYWIIKNSWGKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C V S
Sbjct: 321 IEKGTNQCLVQEYARS 336
>gi|126021|sp|P25775.1|LMCPA_LEIME RecName: Full=Cysteine proteinase A; Flags: Precursor
gi|9573|emb|CAA44094.1| cysteine proteinase [Leishmania mexicana]
Length = 354
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 137/367 (37%), Positives = 198/367 (53%), Gaps = 39/367 (10%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
LL + V+ V A+I Q P D+ + A H+ FK + K + E
Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPP-----VDNFV-ASAHYGSFKKRHGKAFGGDAEEG 60
Query: 72 YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
+RF FK N++ A +P A + V+ KF+DLTP EF + +L + R + K
Sbjct: 61 HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYARHLKN-HKED 119
Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + P+ DWRD GAVT VK+QG CGSCW+FSA G +EG S LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
LV CD + D GCNGGLM+ A +I+++ G V E YPY T GG +
Sbjct: 180 MLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228
Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY- 301
D+ ++ A ++ F + DE+++A + K GP+AV ++A Q Y GGV +C +
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVS--LCLAWS 286
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDS--MVS 359
L+HGVLIVG+ + + PYWI+KNSWG +WGE GY ++ MG N C + + + +
Sbjct: 287 LNHGVLIVGFNKNA-------KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNYPVSA 339
Query: 360 SVAAIHT 366
+V + HT
Sbjct: 340 TVESPHT 346
>gi|332326581|gb|AEE42614.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 168/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ L +LSEQQLV CD + DSGC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHRLTALSEQQLVSCDDK---------DSGCGGGLMTQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G +C + A + + I S E MAA L K GP+++ ++
Sbjct: 207 TMXTEDSYPYVSSTGDVPACTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC GK L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYXSGVLTSC---AGKXLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVTMGVNAC 329
>gi|1848231|gb|AAB48120.1| cathepsin L-like protease [Leishmania major]
Length = 443
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 173/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|394331735|gb|AFN27090.1| cysteine protease [Leishmania major]
Length = 348
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 172/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|371781445|emb|CCA95082.1| putative responsive to dehydration 19, partial [Ginkgo biloba]
Length = 130
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 101/128 (78%), Positives = 116/128 (90%), Gaps = 2/128 (1%)
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
Y LKAGG+E+E+DYPYTGTDG +CKFD K+ AAVSNFSV+S DEDQ+AANLVK+GPL+V
Sbjct: 4 YALKAGGLEKEEDYPYTGTDG-TCKFDDKKVVAAVSNFSVVSIDEDQIAANLVKNGPLSV 62
Query: 279 GINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
GINAV+MQTYIGGVSCPYIC K LDHGVL+VGYGS+G+APIR K+KPYWIIKNSWG NW
Sbjct: 63 GINAVFMQTYIGGVSCPYICSKRNLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGANW 122
Query: 338 GENGYYKI 345
GE GYYK+
Sbjct: 123 GEQGYYKL 130
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 133/294 (45%), Positives = 172/294 (58%), Gaps = 33/294 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
F FK+ F K Y + EE RF +F NL R +H GV +F+DLT E
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEE 79
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+R+ +L L + Q+ + N DWR GAVT +K+QG CGSCWSFS TG
Sbjct: 80 YRQLYLRPYPTELLGRERQEVWLDGPN--AGSVDWRQKGAVTPIKNQGQCGSCWSFSTTG 137
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVE 227
++EGAH ++TG LVSLSEQQLVDC SGS + GCNGGLM++AF+YI+ GG++
Sbjct: 138 SVEGAHAIATGNLVSLSEQQLVDC--------SGSFGNQGCNGGLMDNAFKYIISNGGLD 189
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINA--VW 284
E+DYPYT DG K +SK A ++S + V ++EDQ+AA V+ GP++V I A
Sbjct: 190 TEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAA-AVEKGPVSVAIEADQQS 248
Query: 285 MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
Q Y GV S P CG LDHGVL+VGY S YWI+KNSWG +W
Sbjct: 249 FQMYSSGVFSGP--CGTNLDHGVLVVGYTSD-----------YWIVKNSWGASW 289
>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 118/326 (36%), Positives = 180/326 (55%), Gaps = 28/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F KF+K Y+++ E RF++F+ NL + D TA + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGL----ALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQL+DCD+ D+GCNGGL+++A+E +
Sbjct: 137 CWAFATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
++ GGV+ E DYPY G+DG + + I+ E+++ L GP+ V I
Sbjct: 188 MQMGGVQAENDYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAI 247
Query: 281 NAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
+A + Y G+ C Y L+H VL+VGYG PYWI+KN+WGE+WGE
Sbjct: 248 DASDIVNYRRGIM--RYCSNYGLNHAVLLVGYGVEN-------NVPYWILKNTWGEDWGE 298
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
GY+++ N CG+ + + + A I+
Sbjct: 299 QGYFRVQQNINACGIRNELLASAEIY 324
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 119/297 (40%), Positives = 161/297 (54%), Gaps = 26/297 (8%)
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
EEH RF +FK N++ D G+ KF+DL+ EF+ ++G LR + Q
Sbjct: 62 EEHAERFEIFKENVKYIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMDLRGDREVQ 121
Query: 128 KAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ N LP DWR GAV VK+QG CGSCW+FS ++EG ++++TG LVSLS
Sbjct: 122 SGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLS 181
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT--GTDGGSCK 243
EQQLVDC E +SGCNGGLM++AF+YI+ GG+ E +YPYT T+ S K
Sbjct: 182 EQQLVDCSTE---------NSGCNGGLMDTAFQYIINNGGIVTEDNYPYTAEATECSSTK 232
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKY 301
+ + F + ++ +Q V H P++V I A Q Y GV CG
Sbjct: 233 INSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQDFQFYSTGVFTGK-CGTA 291
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG----RNVCGV 354
LDHGV+ VGYG+S P YWI++NSWG WGE GY ++ G CG+
Sbjct: 292 LDHGVVAVGYGTS---PEGIN---YWIVRNSWGPKWGEEGYIRMQQGIEAAEGKCGI 342
>gi|229596051|ref|XP_001013456.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225565626|gb|EAR93211.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 315
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 176/312 (56%), Gaps = 35/312 (11%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPS 107
N + +S FK+K++K YA + YR +F NL+ + T +G+T+F D+T
Sbjct: 35 NIQALWSAFKTKYNKKYADPDFERYRIEIFTENLKVVESN-----TKNYGITQFMDITRE 89
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EF++ +L L + L A +P ND + DW GAVT VKDQG CGSCWSFS T
Sbjct: 90 EFKQTYLTLKMKNGLKA----SPFAKFNDAGVEIDWTTKGAVTPVKDQGQCGSCWSFSTT 145
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GA+EGA FLST +L SLSEQ LVDC S + GCNGGLM++AF++I + G+
Sbjct: 146 GAVEGALFLSTKKLTSLSEQYLVDC--------SKDGNEGCNGGLMDTAFDFISQH-GIP 196
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQT 287
E YPY D G+CK +S+ + I D + N ++ P+A+ ++A Q
Sbjct: 197 TEAAYPYKAVD-GTCKMTSGPY--KISSHTDIQDCNDLL--NKIQKQPIAIAVDANNFQY 251
Query: 288 YIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
Y + CG LDHGVL+VGY +SG YW +KNSWG NWGE+G+ ++
Sbjct: 252 YQKDIFSD--CGTELDHGVLLVGYSASG---------KYWKVKNSWGPNWGESGFIRLAA 300
Query: 348 GRNVCGVDSMVS 359
G N CG+ +M S
Sbjct: 301 G-NTCGLCNMAS 311
>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
Full=Senescence-associated gene product 2; Flags:
Precursor
gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 358
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 186/348 (53%), Gaps = 35/348 (10%)
Query: 26 DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
D+ IR V SDG E+S +L H F+ F ++ K Y EE RF +FK
Sbjct: 27 DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
NL + + GV +F+DLT EF+R LG + A + + + LP
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DWR+ G V+ VKDQG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
+ GCNGGL + AFEYI GG++ EK YPYTG D +CKF + V N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254
Query: 256 FSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVSCPYICGKY---LDHGVLIVGY 311
++ + DE + A LV+ P+++ + + Y GV CG ++H VL VGY
Sbjct: 255 ITLGAEDELKHAVGLVR--PVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312
Query: 312 GSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
G PYW+IKNSWG +WG+ GY+K+ MG+N+CG+ + S
Sbjct: 313 GVEDGV-------PYWLIKNSWGADWGDKGYFKMEMGKNMCGIATCAS 353
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 123/316 (38%), Positives = 173/316 (54%), Gaps = 31/316 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + +K K+Y++ E R +F L ++ + T G+ KFSDLT +EFR
Sbjct: 2 FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++G + + P + P + + LPT DWR GAVT +KDQG CGSCW+FSA
Sbjct: 62 NYVG---KFKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
++E AHFL+T ELVSLSEQQL+DCD + D GC GG AF+++++ GGV
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWM 285
E+ YPYTG GSC +K+K+ ++ + ++ D V P+ VGI +
Sbjct: 170 TEEAYPYTGF-AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNF 227
Query: 286 QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Q Y G+ C DH VL++GYG+ G PYWIIKNSWG +WGENG+ KI
Sbjct: 228 QNYRSGILSGQ-CSNSRDHAVLVIGYGTEG-------GMPYWIIKNSWGTSWGENGFMKI 279
Query: 346 CM--GRNVCGVDSMVS 359
G +CG++ S
Sbjct: 280 KKKDGEGMCGMNGQSS 295
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 120/310 (38%), Positives = 172/310 (55%), Gaps = 24/310 (7%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--- 115
K K+Y E + RF +FK NLR + ++ T G+ +F+DLT E+R ++LG
Sbjct: 60 KHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGLNRFADLTNEEYRSRYLGRRD 119
Query: 116 -LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
R LR + + DLP DWR+ GAV VKDQG CGSCW+FS A+EG +
Sbjct: 120 ETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIAAVEGIN 179
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
++TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E+DYPY
Sbjct: 180 QIATGDLISLSEQELVDCDK--------SYNQGCNGGLMDYAFEFIINNGGIDSEEDYPY 231
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGV 292
D K+ ++ + + ++++ V + P++V I A Q Y GV
Sbjct: 232 RAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGV 291
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
CG LDHGV+ VGYG+ YWI++NSWG NWGE+GY K + RN+
Sbjct: 292 FTGQ-CGTQLDHGVVAVGYGTENSV-------DYWIVRNSWGPNWGESGYIK--LERNLA 341
Query: 353 GVDSMVSSVA 362
G ++ +A
Sbjct: 342 GTETGKCGIA 351
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 125/324 (38%), Positives = 178/324 (54%), Gaps = 28/324 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ + + +TY E + RF VF+ NLR
Sbjct: 31 IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LG+ R + + N DLP DWR GAV
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
VKDQG+CGSCW+FS A+EG + + TG+++SLSEQ+LVDCD S + GCN
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AFE+I+ GG++ E+DYPY GTDG K+ + ++ + ++ ++
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQK 259
Query: 270 LVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
V + P++V I A Q Y G+ CG LDHGV VGYG+ K YW
Sbjct: 260 AVANQPISVAIEAGGRAFQLYNSGIFTG-TCGTALDHGVTAVGYGTE-------NGKDYW 311
Query: 328 IIKNSWGENWGENGYYKICMGRNV 351
I+KNSWG +WGE+GY + M RN+
Sbjct: 312 IVKNSWGSSWGESGYVR--MERNI 333
>gi|157864847|ref|XP_001681132.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124426|emb|CAJ02282.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 173/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAVKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 356
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 145/376 (38%), Positives = 203/376 (53%), Gaps = 42/376 (11%)
Query: 1 MERLILSSLLLLLLSSVLASAVA---VNDDDAMIRQVVPSDGEQSEDHLLNAEHH----- 52
M RL SL+L+L++ + A+A+A D IRQVV D + E+ +L
Sbjct: 1 MSRL---SLVLILVAGLFATALAGPATFADKNPIRQVVFPD--ELENGILQVVGQTRSAL 55
Query: 53 -FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ F + K Y + EE RF +F NL+ + + G+ +F+DLT EFR+
Sbjct: 56 SFARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFRK 115
Query: 112 QFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LG ++ + K + TN LP DWR G V+ VK QG CGSCW+FS TGAL
Sbjct: 116 HKLGASQNC---SATTKGNLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGAL 172
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
E A+ + G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+
Sbjct: 173 EAAYAQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKFNGGLDTEE 225
Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQ 286
YPYTG + G CKF ++ I V N ++ + E + A LV+ P++V V +
Sbjct: 226 AYPYTGKN-GICKFSQANIGVKVISSVNITLGAEYELKYAVALVR--PVSVAFEVVKGFK 282
Query: 287 TYIGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y GV CG ++H VL VGYG PYW+IKNSWG +WGE+GY+
Sbjct: 283 QYKSGVYASTECGDTPMDVNHAVLAVGYGVE-------NGTPYWLIKNSWGADWGEDGYF 335
Query: 344 KICMGRNVCGVDSMVS 359
K+ MG+N+CGV + S
Sbjct: 336 KMEMGKNMCGVATCAS 351
>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
Length = 360
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 142/376 (37%), Positives = 198/376 (52%), Gaps = 38/376 (10%)
Query: 1 MERLILSSLLLL---LLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNA----EHHF 53
M R L L++ L +S LA D++ IRQVV + E+ +L H
Sbjct: 1 MSRFSLLLALVVAGGLFASALAGPATFADENP-IRQVVSDGLHELENAILQVVGKTRHAL 59
Query: 54 S--LFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
S F ++ K Y + EE RF VF NL+ + + GV +F+DLT EFRR
Sbjct: 60 SSARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRR 119
Query: 112 QFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LG + + K + TN LP WR+ G V+ VK+QG CGSCW+FS TGAL
Sbjct: 120 DRLGAAQNC---SATTKGNLKVTNVVLPETKGWREAGIVSPVKNQGKCGSCWTFSTTGAL 176
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
E A+ + G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+
Sbjct: 177 EAAYSQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKSNGGLDTEE 229
Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQ 286
YPYTG + G CKF + V N ++ + DE + A LV+ P+++ + +
Sbjct: 230 AYPYTGKN-GLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVR--PVSIAFEVIKGFK 286
Query: 287 TYIGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y GV CG ++H VL VGYG PYW+IKNSWG +WG+NGY+
Sbjct: 287 QYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGV-------PYWLIKNSWGADWGDNGYF 339
Query: 344 KICMGRNVCGVDSMVS 359
K+ MG+N+CG+ + S
Sbjct: 340 KMEMGKNMCGIATCAS 355
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 128/354 (36%), Positives = 195/354 (55%), Gaps = 37/354 (10%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSED--HLLNAEHHFSLFKS--- 58
+ +++++LL ++SA+ ++ ++ D ++ L E S+++
Sbjct: 13 MTMAAIVLLFTVFAVSSALDMS--------IISYDSAHADKAATLRTEEELMSMYEQWLV 64
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL- 116
K K Y E + RF++FK NLR D T G+ +F+DLT E+R ++LG
Sbjct: 65 KHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTK 124
Query: 117 ---NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
NRRL + AP + + LP DWR GAV VKDQG CGSCW+FSA GA+EG
Sbjct: 125 IDPNRRLGKTPSNRYAPRV-GDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGI 183
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TGEL+SLSEQ+LVDCD + GCNGGLM+ AFE+I+ GG++ ++DYP
Sbjct: 184 NKIVTGELISLSEQELVDCDT--------GYNQGCNGGLMDYAFEFIINNGGIDSDEDYP 235
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIGG 291
Y G DG + K+ ++ ++ + + ++ V + P++V I Q Y+ G
Sbjct: 236 YRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSG 295
Query: 292 VSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
V CG LDHGV+ VGYG++ K YWI++NSWG +WGE+GY ++
Sbjct: 296 VFTGR-CGTALDHGVVAVGYGTA-------KGHDYWIVRNSWGSSWGEDGYIRL 341
>gi|300121328|emb|CBK21708.2| unnamed protein product [Blastocystis hominis]
Length = 318
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 128/302 (42%), Positives = 171/302 (56%), Gaps = 21/302 (6%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
+L N+E F+ + SK+ KTYA EE YR RVF NL + K + GV KF+D+
Sbjct: 16 NLRNSE--FTSYMSKYGKTYAAPEEARYRLRVFNDNLLKIKEHNAKNLPWTLGVNKFADV 73
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ EF +F G + + Q + D+P DWR+ GAVT VK+QG CGSCW+F
Sbjct: 74 SAEEFAYKFCGCAKDPKTRGTRQTTLV---GDVPARVDWREQGAVTPVKNQGMCGSCWAF 130
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG EGA+FL TG LVSLSEQQLVDC DPE + GC+GG SA +Y+ K
Sbjct: 131 STTGTTEGAYFLKTGNLVSLSEQQLVDCAR--DPEYE---NFGCSGGWPWSAVDYVTKH- 184
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAA-AVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
G+ E+DYPY G D CK K+A +V + DED +A + K P+++ ++A
Sbjct: 185 GLCTEEDYPYKGVD-AECKESSCKVAVQSVDKVQLPVGDEDSLAVAVSKT-PVSIVLDAT 242
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
MQ Y G+ C + ++H VL VGY ++ YWIIKNSWG +WGE GY
Sbjct: 243 AMQLYDKGIITR--CSESINHAVLAVGYDKDAETGLK-----YWIIKNSWGADWGEEGYC 295
Query: 344 KI 345
+I
Sbjct: 296 RI 297
>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 167/316 (52%), Gaps = 22/316 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+F+ G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD D GC G M++AF++I+ + G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DLGCRAGFMDTAFKWIVSSNNGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E+ YPY G +C + A + + I +E+ +A L K GP+A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPTCNKSGKVVGANIDDHVHILDNENAIAEWLAKKGPVAIAVDATS 268
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q+Y GGV I K ++ L+VGY + + PYWIIKNSW + WGE GY +
Sbjct: 269 FQSYTGGVLTSCI-SKEVNSAALLVGYDDTS-------KPPYWIIKNSWSKGWGEEGYIR 320
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C + VSS
Sbjct: 321 IEKGTNQCRMKEYVSS 336
>gi|228244|prf||1801240B Cys protease 2
Length = 323
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 171/323 (52%), Gaps = 25/323 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK K+ + Y EE YR +F+ N + K+ + + T + KF
Sbjct: 13 LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
D+T EF G R P P T T+ DWR GAVT VKDQG CGSC
Sbjct: 73 GDMTLEEFNAVMKGNIPRRSAPVSV-FYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSC 131
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG+LEG HFL TG L+SL+EQQLVDC P+ GCNGG MN AF+YI
Sbjct: 132 WAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQ-------GCNGGWMNDAFDYIK 184
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGI 280
G++ E YPY D GSC+FD + +AA S + I+S + V+ GP++V I
Sbjct: 185 ANNGIDTEASYPYEARD-GSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTI 243
Query: 281 NAVW--MQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y GV C YLDH VL VGYGS G + +W++KNSW +W
Sbjct: 244 DAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEG-------GQDFWLVKNSWATSW 296
Query: 338 GENGYYKICMGR-NVCGVDSMVS 359
G+ GY K+ R N CG+ ++ S
Sbjct: 297 GDAGYIKMSRNRNNNCGIATVAS 319
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 176/320 (55%), Gaps = 30/320 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
FK + K Y E +R ++F N + AK Q V V K++DL EFR+
Sbjct: 32 FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 112 QFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSCW+F
Sbjct: 92 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI G
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV 283
G++ EK YPY D SC F+K I A F+ I DE +MA + GP+AV I+A
Sbjct: 205 GIDTEKSYPYEAID-DSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDAS 263
Query: 284 W--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV + P + LDHGVL+VG+G+ + YW++KNSWG WG+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESG------EDYWLVKNSWGTTWGDK 317
Query: 341 GYYKICMGR-NVCGVDSMVS 359
G+ K+ + N CG+ S S
Sbjct: 318 GFIKMLRNKENQCGIASASS 337
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 177/323 (54%), Gaps = 26/323 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLT 105
+ + FK +K Y ++ E +R ++F N AK +L V G+ K++D+
Sbjct: 24 QEQWGAFKMTHNKQYQSETEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83
Query: 106 PSEFRRQFLGLNRR---LRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGS 160
EF + G NR LR LP + LP DWRD GAVT VKDQG CGS
Sbjct: 84 HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CWSFSATG+LEG HF +G+LVSLSEQ LVDC E+ G ++GCNGGLM++AF YI
Sbjct: 144 CWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRYI 196
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG++ E+ YPY D K+K A + S +ED++ + + GP++V I
Sbjct: 197 KANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAI 256
Query: 281 NAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y GGV P LDHGVL+VGYG+ YW++KNSWG++W
Sbjct: 257 DASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGT------DYWLVKNSWGKSW 310
Query: 338 GENGYYKICMGR-NVCGVDSMVS 359
G+ GY K+ R N CG+ + S
Sbjct: 311 GDQGYIKMARNRNNNCGIATEAS 333
>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
Length = 350
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 138/359 (38%), Positives = 199/359 (55%), Gaps = 35/359 (9%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
SLL++L A+A D IR V SD E+ ++ H F+ F +++ K Y
Sbjct: 5 SLLIVLFCVASAAAGFSFHDSNPIRMV--SDVEEQLLQVIGESRHAVSFARFANRYGKRY 62
Query: 65 ATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+ +E RF++F NL R + +R+L + GV F+D T EFR LG +
Sbjct: 63 DSVDEMKLRFKIFSENLELIRSSNKRRL---SYKLGVNHFADWTWEEFRSHRLGAAQNC- 118
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
A + + +LP + DWR G V+GVKDQG+CGSCW+FS TGALE A+ + G+
Sbjct: 119 -SATLKGNHKITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKN 177
Query: 182 VSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
+SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG+E E+ YPYTG++ G
Sbjct: 178 ISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSN-G 228
Query: 241 SCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVSCPYIC 298
CKF +A V + ++ ED++ + P++V V + Y GV C
Sbjct: 229 LCKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYTSTAC 288
Query: 299 GKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGV 354
G ++H VL VGYG PYW+IKNSWG +WG++GY+K+ MG+N+CGV
Sbjct: 289 GSTPMDVNHAVLAVGYGIE-------DGIPYWLIKNSWGGDWGDHGYFKMEMGKNMCGV 340
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 121/309 (39%), Positives = 171/309 (55%), Gaps = 29/309 (9%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K Y E + RF +FK NLR + + G+ +F+DLT E+R FLG N ++
Sbjct: 56 KAYNAIGEKERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMK 115
Query: 122 LPADAQKA---PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ + K+ + LP DWR+ GAV+ VKDQG CGSCW+FS A+EG + + T
Sbjct: 116 ERSASTKSDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVT 175
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GEL+SLSEQ+LVDCD S + GCNGGLM+ F++I+ GG++ E+DYPY D
Sbjct: 176 GELISLSEQELVDCDK--------SYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVD 227
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPY 296
G +F K+ +++ + + D++ V + P++V I A Q Y GV +
Sbjct: 228 GTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGH 287
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV----- 351
CG LDHGV+ VGYG+ YW ++NSWG WGENGY K + RN+
Sbjct: 288 -CGTNLDHGVVAVGYGTENGV-------DYWTVRNSWGPKWGENGYIK--LERNINATSG 337
Query: 352 -CGVDSMVS 359
CG+ SM S
Sbjct: 338 KCGIASMAS 346
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 133/323 (41%), Positives = 176/323 (54%), Gaps = 44/323 (13%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSEFR- 110
F K+ + Y ++ E + R +F N R LL + + G+ FSD T SE
Sbjct: 70 FLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSDKTNSELDV 129
Query: 111 -RQFLGLNRRLR-----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
R F ++ R +P DA AP P + DWR GAVT VK+QG CGSCW+F
Sbjct: 130 LRGFRHSSKASRSGSQYIPFDA--AP-------PAEVDWRTKGAVTPVKNQGDCGSCWAF 180
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SATG +EG H+L+TG+LVSLSEQQLVDC S + GC+GGLM+ AFEY+ +
Sbjct: 181 SATGGIEGQHYLATGKLVSLSEQQLVDCS---------SSNDGCDGGLMDLAFEYVKEHK 231
Query: 225 GVEREKDYPYTGTDGG---SCKFDKSKIAAAVSNFSVISSDEDQMAANLVK-HGPLAVGI 280
G++ E YPY + G C FD A V+ + I ++ + V HGP++VGI
Sbjct: 232 GIDTEVHYPYVSGNTGYARQCSFDPKYAAVNVTGYVDIPEGQELLLQQAVGFHGPISVGI 291
Query: 281 NAVW--MQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
NA Y G+ + C + LDHGVL+VGYG PYW+IKNSWGE+W
Sbjct: 292 NAGLPSFMAYESGIYSDHRCNPHDLDHGVLVVGYGVDNGV-------PYWLIKNSWGEDW 344
Query: 338 GENGYYKICMGR-NVCGVDSMVS 359
GENGY +I N+CGV +M S
Sbjct: 345 GENGYVRILRNHNNLCGVATMAS 367
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 175/320 (54%), Gaps = 30/320 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
FK + K Y E +R ++F N + AK Q V V K++DL EFR+
Sbjct: 32 FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 112 QFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
G N ++LR D+ K I P + LP DWR GAVT VKDQG CGSCW+F
Sbjct: 92 LMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGHCGSCWAF 151
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI G
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV 283
G++ EK YPY D SC F+K I A F+ I DE +MA + GP++V I+A
Sbjct: 205 GIDTEKSYPYEAID-DSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 284 W--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV + P + LDHGVL+VG+G+ YW++KNSWG WG+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGD------DYWLVKNSWGTTWGDK 317
Query: 341 GYYKICMGR-NVCGVDSMVS 359
G+ K+ + N CG+ S S
Sbjct: 318 GFIKMLRNKDNQCGIASASS 337
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/327 (38%), Positives = 180/327 (55%), Gaps = 34/327 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+S++ A ++ + + +TY E + R++VF+ NLR
Sbjct: 31 IVSYGERSDEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 87
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG R +L A A DLP DWR
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAAD---NEDLPESVDWRAK 144
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG+CGSCW+FS A+EG + + TG+L+SLSEQ+LVDCD S +
Sbjct: 145 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 196
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ EKDYPY GTDG K+ + ++ + +++++
Sbjct: 197 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 256
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
V + P++V I A Q Y G+ CG LDHGV VGYG+ K
Sbjct: 257 LQKAVANQPVSVAIEAAGTAFQLYSSGIFTG-SCGTALDHGVTAVGYGTE-------NGK 308
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNV 351
YWI+KNSWG +WGE+GY + M RN+
Sbjct: 309 DYWIVKNSWGSSWGESGYVR--MERNI 333
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 124/324 (38%), Positives = 178/324 (54%), Gaps = 28/324 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ + + +TY E + RF VF+ NLR
Sbjct: 31 IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LG+ R + + N DLP DWR GAV
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG+CGSCW+FS A+EG + + TG+++SLSEQ+LVDCD S + GCN
Sbjct: 148 AEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AFE+I+ GG++ E+DYPY GTDG K+ + ++ + ++ ++
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQK 259
Query: 270 LVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
V + P++V I A Q Y G+ CG LDHGV VGYG+ K YW
Sbjct: 260 AVANQPISVAIEAGGRAFQLYNSGIFTG-TCGTALDHGVTAVGYGTE-------NGKDYW 311
Query: 328 IIKNSWGENWGENGYYKICMGRNV 351
I+KNSWG +WGE+GY + M RN+
Sbjct: 312 IVKNSWGSSWGESGYVR--MERNI 333
>gi|157864855|ref|XP_001681136.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124430|emb|CAJ02286.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 172/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E M A L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 169/321 (52%), Gaps = 31/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW 284
++ EK YPY D G C+F K + A + + I + ED + + GP++V I+A
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASH 256
Query: 285 --MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV P + LDHGVL+VGYG G K YW++KNSW E+WG+ G
Sbjct: 257 SSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQG 309
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ S S
Sbjct: 310 Y--ILMSRDNNNQCGIASQAS 328
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 173/315 (54%), Gaps = 23/315 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ ++ + K Y E + RF +FK NLR +D + G+ +F+DLT E++
Sbjct: 51 YEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKVGLNRFADLTNEEYKAM 110
Query: 113 FLG--LNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
FLG + R+ R L +Q+ +DLP + DWR+ GAV VKDQG CGSCW+FS GA
Sbjct: 111 FLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTVGA 170
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TGEL+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E
Sbjct: 171 VEGINQIVTGELISLSEQELVDCDK--------SYNQGCNGGLMDYAFEFIINNGGIDTE 222
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQT 287
+DYPY +D K+ + + + +++ V H P++V I A Q
Sbjct: 223 EDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQL 282
Query: 288 YIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
Y GV CG LDHGV+ VGYG+ YWI++NSWG WGE+GY + M
Sbjct: 283 YKSGVFTGR-CGTELDHGVVAVGYGTENGV-------NYWIVRNSWGSAWGESGYIR--M 332
Query: 348 GRNVCGVDSMVSSVA 362
RNV + +A
Sbjct: 333 ERNVANTKTGKCGIA 347
>gi|157864849|ref|XP_001681133.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124427|emb|CAJ02283.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 173/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWG++WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGKDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 177/320 (55%), Gaps = 30/320 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
FK + K Y + E +R ++F N + AK Q V V K++DL EFR+
Sbjct: 66 FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 125
Query: 112 QFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSCW+F
Sbjct: 126 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 185
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI G
Sbjct: 186 SSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNG 238
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV 283
G++ EK YPY D SC F+K + A F+ I DE +MA + GP++V I+A
Sbjct: 239 GIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 297
Query: 284 W--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV + P + LDHGVL+VG+G+ + YW++KNSWG WG+
Sbjct: 298 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESG------EDYWLVKNSWGTTWGDK 351
Query: 341 GYYKICMGR-NVCGVDSMVS 359
G+ K+ + N CG+ S S
Sbjct: 352 GFIKMLRNKENQCGIASASS 371
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 175/319 (54%), Gaps = 24/319 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+KS SK Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 HWELWKSWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGTHSYRLGMNHFGDMTHE 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G R+ A+ + L N L P DWRD+G VT VKDQG CGSCW+FS
Sbjct: 86 EFRQLMNGYKRKAE--TKARGSLFLEPNFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFS 143
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ G
Sbjct: 144 TTGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQG 196
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINA-- 282
++ E YPY GTD C +D + + + F I S +++ V GP++V I+A
Sbjct: 197 LDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGH 256
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ C + LDHGVL+VGY GF K YWI+KNSW E WG+ G
Sbjct: 257 ESFQFYQSGIYYEKECSSEELDHGVLVVGY---GFQGEDVDGKKYWIVKNSWSEKWGDKG 313
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y + R N CG+ + S
Sbjct: 314 YIYMAKDRKNHCGIATAAS 332
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/359 (36%), Positives = 189/359 (52%), Gaps = 37/359 (10%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTYATQE 68
L+LS+ L A D +++ S +HL + + LF+S K SK Y + E
Sbjct: 11 LILSATLFITYATAHDFSIVGY--------SPEHLASMDKTIELFESWMSKHSKAYRSIE 62
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
E +RF +F NL+ + G+ +F+DL+ EF+ ++LGL ++
Sbjct: 63 EKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRSSRG 122
Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
DLP DWR GAVT VK+QG+CGSCW+FS A+EG + + TG L SLSEQ+
Sbjct: 123 FSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 182
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
L+DCD S ++GC GGLM+ AF+YI+ G+ +E+DYPY +G + +
Sbjct: 183 LIDCDR--------SFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQF 234
Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHGV 306
+S + + ++++Q + H P++V I A Q Y GG+ CG +DHGV
Sbjct: 235 EVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGR-CGTQMDHGV 293
Query: 307 LIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGVDSMVS 359
VGYGSS + Y I+KNSWG WGENGY I M RN +CG++ M S
Sbjct: 294 TAVGYGSS-------EGTDYIIVKNSWGPKWGENGY--IRMKRNTGKPEGLCGINQMAS 343
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 177/320 (55%), Gaps = 30/320 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
FK + K Y + E +R ++F N + AK Q V V K++DL EFR+
Sbjct: 62 FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 121
Query: 112 QFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSCW+F
Sbjct: 122 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 181
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI G
Sbjct: 182 SSTGALEGQHFRKSGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV 283
G++ EK YPY D SC F+K + A F+ I DE +MA + GP++V I+A
Sbjct: 235 GIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 293
Query: 284 W--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV + P + LDHGVL+VG+G+ + YW++KNSWG WG+
Sbjct: 294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESG------EDYWLVKNSWGTTWGDK 347
Query: 341 GYYKICMGR-NVCGVDSMVS 359
G+ K+ + N CG+ S S
Sbjct: 348 GFIKMLRNKENQCGIASASS 367
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 131/352 (37%), Positives = 190/352 (53%), Gaps = 35/352 (9%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQS----EDHLLNAEHHFSLFKSKFSKTY 64
+LL +S L+SA D I S G +S +D ++ + + K K Y
Sbjct: 2 FMLLFFASTLSSA-----SDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLV---KHGKAY 53
Query: 65 ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---NRRLR 121
+ E + RF VFK NLR + T G+ +F+DLT E+R +LG RR +
Sbjct: 54 NSLGEKERRFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSMYLGALSGIRRNK 113
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
L + + + LP DWR GAV GVKDQG+CGSCW+FSA A+EG + + TG+L
Sbjct: 114 LRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDL 173
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
+SLSEQ+LVDCD+ S + GCNGGLM+ FE+I+ GG++ E+DYPY DG
Sbjct: 174 ISLSEQELVDCDN--------SYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRC 225
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICG 299
+ K+ ++ ++ + + + V + P++V I A Q Y GV CG
Sbjct: 226 DTYRKNARVVSIDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGR-CG 284
Query: 300 KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
LDHGV+ VGYG+ + YWI++NSWG++WGE+GY + M RN+
Sbjct: 285 TALDHGVVAVGYGTE-------NGQDYWIVRNSWGKSWGESGYLR--MARNI 327
>gi|1749812|emb|CAA90237.1| cysteine proteinase LmCPB1 [Leishmania mexicana]
Length = 359
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/333 (39%), Positives = 178/333 (53%), Gaps = 35/333 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFCAR 97
Query: 113 FLGLNRRLRLPADAQKAPI------LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A + P + +P DWR+ GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKRHTPQHYPKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G
Sbjct: 156 VGNIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNG 206
Query: 225 GVEREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY +G + S + A + +I S E MAA L K+GP+A+ ++
Sbjct: 207 HLYTEDSYPYVSGNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALD 266
Query: 282 AVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
A +Y GV I GK ++H VL+VGY +G E PYW+IKNSWG +WGE G
Sbjct: 267 ASSFMSYKSGVLTACI-GKQVNHAVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQG 318
Query: 342 YYKICMGRNVC-----GVDSMVSSVAAIHTTSS 369
Y ++ MG N C V + V AA T++S
Sbjct: 319 YVRVVMGVNACLLSEYPVSAHVRESAAPGTSTS 351
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/349 (36%), Positives = 193/349 (55%), Gaps = 30/349 (8%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ ++++LLL ++SA+ D + + +S++ L++ + + K K
Sbjct: 36 MAMATILLLFTVFAVSSAL---DMSIISYDNAHAATSRSDEELMSMYEQWLV---KHGKV 89
Query: 64 YATQEEHDYRFRVFKANLRRAK-RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NR 118
Y E + RF++FK NLR D T G+ +F+DLT E+R ++LG NR
Sbjct: 90 YNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNR 149
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
RL + AP + + LP DWR GAV VKDQG CGSCW+FSA GA+EG + + T
Sbjct: 150 RLGKTPSNRYAPRV-GDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVT 208
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GEL+SLSEQ+LVDCD + GCNGGLM+ AFE+I+ GG++ E+DYPY G D
Sbjct: 209 GELISLSEQELVDCDT--------GYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVD 260
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIGGVSCPY 296
G + K+ ++ ++ + + ++ V + P++V I Q Y+ GV
Sbjct: 261 GRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGR 320
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
CG LDHGV+ VGYG++ YWI++NSWG +WGE+GY ++
Sbjct: 321 -CGTALDHGVVAVGYGTA-------NGHDYWIVRNSWGPSWGEDGYIRL 361
>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 323
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 171/323 (52%), Gaps = 25/323 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK K+ + Y EE YR +F+ N + K+ + + T + KF
Sbjct: 13 LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
D+T EF G R P P T T+ DWR GAVT VKDQG CGSC
Sbjct: 73 GDMTLEEFNAVMKGNIPRRSAPVSV-FYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSC 131
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG+LEG HFL TG L+SL+EQQLVDC P+ GCNGG MN AF+YI
Sbjct: 132 WAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQ-------GCNGGWMNDAFDYIK 184
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGI 280
G++ E YPY D GSC+FD + +AA S + I+S + V+ GP++V I
Sbjct: 185 ANNGIDTEAAYPYEARD-GSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTI 243
Query: 281 NAVW--MQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y GV C YLDH VL VGYGS G + +W++KNSW +W
Sbjct: 244 DAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEG-------GQDFWLVKNSWATSW 296
Query: 338 GENGYYKICMGR-NVCGVDSMVS 359
G+ GY K+ R N CG+ ++ S
Sbjct: 297 GDAGYIKMSRNRNNNCGIATVAS 319
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/318 (39%), Positives = 175/318 (55%), Gaps = 22/318 (6%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L+ + + +K KTY T EE D R ++ NL K+ + + + F+DLT
Sbjct: 21 LSQDRQWHAWKDFHGKTY-TGEEEDLRRAIWNDNLEIVKKHNAENHSYKLDMNHFADLTV 79
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+EF+++F+G + P L LP + DWRD G VT VK+QG CGSCW+FS+
Sbjct: 80 TEFKQRFMGYRAASNSTGGSTFLP-LSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSS 138
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG+LEG HF TG+LVSLSEQ LVDC + ++GC GGLM+ AF+YI G+
Sbjct: 139 TGSLEGQHFRKTGKLVSLSEQNLVDCSKKYG-------NNGCEGGLMDYAFKYIKNNDGI 191
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINA--V 283
+ E+ YPYT D G C F + A V+ ++ V E + + + GP++V I+A
Sbjct: 192 DTEQSYPYTARD-GQCHFKPGSVGATVTGYTDVQRGSEGDLQSAVATVGPISVAIDAGHS 250
Query: 284 WMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GV S P LDHGVL VGYG+ K YW++KNSWGE WG NGY
Sbjct: 251 SFQLYKTGVYSEPDCSSTQLDHGVLAVGYGAE-------DGKDYWLVKNSWGEGWGMNGY 303
Query: 343 YKICMGR-NVCGVDSMVS 359
K+ + N CG+ + S
Sbjct: 304 IKMSRNKDNQCGIATQAS 321
>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 168/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G++E L+ L +LSEQQLV CD + D+GC GGLM AFE++L+ G
Sbjct: 156 VGSIESQWALAGHRLTALSEQQLVSCDDK---------DNGCAGGLMLQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K+GP+++ ++
Sbjct: 207 TMFTEDSYPYVSSTGYVPECSNSSQLVPGARIDGYLTIESSETVMAAWLAKNGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGENWGE
Sbjct: 267 ASSFMSYQSGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGENWGE 316
Query: 340 NGYYKICMGRNVC 352
NGY ++ MG N C
Sbjct: 317 NGYVRVTMGVNAC 329
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 170/321 (52%), Gaps = 31/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI + G
Sbjct: 145 ATGSLEGRHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKENDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW 284
++ EK YPY D G C+F K + A + + I + ED + + GP++V I+A
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASH 256
Query: 285 --MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV P + LDHGVL+VGYG G K YW++KNSW E+WG+ G
Sbjct: 257 SSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQG 309
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ S S
Sbjct: 310 Y--ILMSRDNNNQCGIASQAS 328
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 135/355 (38%), Positives = 190/355 (53%), Gaps = 46/355 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQ--SEDHLLNAEHHFSLFKSKF 60
R + SL+LL++ A+ D +V +G Q S+D +L+ H +
Sbjct: 6 RALGLSLVLLVI------AIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWL---ETH 56
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG---LN 117
S+ Y + E +RF++FK N + G+ KFSDLT EFR Q+LG +N
Sbjct: 57 SRVYRSLSEKHHRFQIFKENFLYIHAHNKQQKSYWLGLNKFSDLTHQEFRAQYLGTKPVN 116
Query: 118 RRLR----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
R+ + + D + P + DWR GAVT VKDQGACGSCW+FSA G++EG
Sbjct: 117 RQRKEANFMYEDVEAEPKV---------DWRLKGAVTDVKDQGACGSCWAFSAVGSVEGV 167
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TGELVSLSEQ+LVDCD + + GCNGGLM+ AFE+I+K GG++ EKDYP
Sbjct: 168 NAIKTGELVSLSEQELVDCDRK--------QNQGCNGGLMDYAFEFIIKNGGIDTEKDYP 219
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGG 291
Y DG + ++ + ++ + + + + P++V I A Q Y GG
Sbjct: 220 YKARDGRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRDFQHYQGG 279
Query: 292 V-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
V + P CG LDHGVL VGYG+ YWI+KNSWG WGE GY ++
Sbjct: 280 VFTGP--CGSELDHGVLAVGYGTDDDGV------NYWIVKNSWGPGWGEKGYIRM 326
>gi|332326587|gb|AEE42617.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 167/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ L +LSEQQLV CD + DSGCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHRLTALSEQQLVSCDDK---------DSGCNGGLMTQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K GP+++ ++
Sbjct: 207 TMLTEDSYPYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYESGVLTSC---AGDALNHGVLLVGYNXTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVTMGVNAC 329
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/309 (41%), Positives = 173/309 (55%), Gaps = 30/309 (9%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLGLN-RR 119
K Y E D RF +F NL+ + + + G+T+F+DLT EFR +L R
Sbjct: 46 KNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADLTNEEFRAIYLRSKMER 105
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
R +++ + LP + DWR GAV VKDQG+CGSCW+FSA GA+EG + + TG
Sbjct: 106 TRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSCWAFSAIGAVEGINQIKTG 165
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
ELVSLSEQ+LVDCD S ++GC GGLM+ AF++I+ GG++ E+DYPYT TD
Sbjct: 166 ELVSLSEQELVDCDT--------SYNNGCGGGLMDYAFQFIISNGGIDTEEDYPYTATDD 217
Query: 240 GSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPY 296
C DK + + + +E+ + L P++V I A Q Y GV
Sbjct: 218 NICNTDKKNTRVVTIDGYEDVPENENSLKKALANQ-PISVAIEAGGRGFQLYKSGVFTG- 275
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV----- 351
CG LDHGV+ VGYG+S + + YWII+NSWG NWGE+GY K + RN+
Sbjct: 276 TCGTALDHGVVAVGYGTS-------EGQDYWIIRNSWGSNWGESGYIK--LQRNIKDSSG 326
Query: 352 -CGVDSMVS 359
CGV M S
Sbjct: 327 KCGVAMMAS 335
>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
Length = 350
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 137/359 (38%), Positives = 199/359 (55%), Gaps = 35/359 (9%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
SLL++L A+A D IR V SD E+ ++ H F+ F +++ K Y
Sbjct: 5 SLLIVLFCVASAAAGFSFHDSNPIRMV--SDVEEQLLQVIGESRHAVSFARFANRYGKRY 62
Query: 65 ATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+ +E RF++F N+ R + +R+L + GV F+D T EFR LG +
Sbjct: 63 DSVDEMKLRFKIFSENIELIRSSNKRRL---SYKLGVNHFADWTWEEFRSHRLGAAQNC- 118
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
A + + +LP + DWR G V+GVKDQG+CGSCW+FS TGALE A+ + G+
Sbjct: 119 -SATLKGNHKITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKN 177
Query: 182 VSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
+SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG+E E+ YPYTG++ G
Sbjct: 178 ISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSN-G 228
Query: 241 SCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVSCPYIC 298
CKF +A V + ++ ED++ + P++V V + Y GV C
Sbjct: 229 LCKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYTSTAC 288
Query: 299 GKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGV 354
G ++H VL VGYG PYW+IKNSWG +WG++GY+K+ MG+N+CGV
Sbjct: 289 GSTPMDVNHAVLAVGYGIE-------DGIPYWLIKNSWGGDWGDHGYFKMEMGKNMCGV 340
>gi|394331814|gb|AFN27126.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 168/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPYAVDWRKKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G++E L+ L +LSEQQLV CD + DSGC GGLM AFE++L+ G
Sbjct: 156 VGSIESQWALAGHRLTALSEQQLVSCDDK---------DSGCGGGLMLQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K+GP+++ ++
Sbjct: 207 TMFTEDSYPYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYESGVLTSC---AGDTLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
NGY ++ MG N C
Sbjct: 317 NGYVRVTMGVNAC 329
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 174/302 (57%), Gaps = 29/302 (9%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGL 116
K K+Y E + RF++FK NLR DP + G+ +F+DLT E+R ++LG
Sbjct: 55 KHGKSYNALGEKETRFQIFKDNLRYIDNHNA-DPDRSYELGLNRFADLTNEEYRAKYLGT 113
Query: 117 NRRLRLPADAQK-----APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
R P ++ AP+ +LP DWR+ GAV VKDQG+CGSCW+FSA GA+E
Sbjct: 114 KSRESRPKLSKGPSDRYAPV-EGEELPDSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVE 172
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + ++TGEL++LSEQ+LVDCD S + GC GGLM+ AF +I+K GG++ + D
Sbjct: 173 GINQITTGELITLSEQELVDCDR--------SYNEGCEGGLMDYAFNFIIKNGGIDSDLD 224
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWM--QTYI 289
YPYTG DG + ++ + ++ + +++ + P++V I A M Q Y+
Sbjct: 225 YPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQLYV 284
Query: 290 GGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
G+ CG +DHGV++VGYGS + YWI++NSWG WGE GY K M R
Sbjct: 285 SGIFTG-KCGTAVDHGVVVVGYGSE-------EGMDYWIVRNSWGAAWGEAGYLK--MQR 334
Query: 350 NV 351
NV
Sbjct: 335 NV 336
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 177/320 (55%), Gaps = 30/320 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
FK + K Y + E +R ++F N + AK Q V V K++DL EFR+
Sbjct: 32 FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 112 QFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSCW+F
Sbjct: 92 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI G
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV 283
G++ EK YPY D SC F+K + A F+ I DE +MA + GP++V I+A
Sbjct: 205 GIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 284 W--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV + P + LDHGVL+VG+G+ + YW++KNSWG WG+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESG------EDYWLVKNSWGTTWGDK 317
Query: 341 GYYKICMGR-NVCGVDSMVS 359
G+ K+ + N CG+ S S
Sbjct: 318 GFIKMLRNKENQCGIASASS 337
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 175/316 (55%), Gaps = 31/316 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + +K K+Y++ E R +F L ++ L + T G+ KFSDLT +EFR
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++G + + P + P + + LPT DWR GAVT +KDQG CGSCW+FSA
Sbjct: 62 NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
++E AHFL+T ELVSLSEQQL+DCD + D GC GG AF+++++ GGV
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWM 285
E+ YPYTG GSC +K+K+ ++ + ++ D V P+ VGI +
Sbjct: 170 TEEAYPYTGF-AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNF 227
Query: 286 QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Q Y G+ + C DH VL++GYG+ G PYWIIKNSWG +WGE+G+ +I
Sbjct: 228 QNYRSGILSGH-CSNSRDHAVLVIGYGTEG-------GMPYWIIKNSWGTSWGEDGFMRI 279
Query: 346 CM--GRNVCGVDSMVS 359
G +CG++ S
Sbjct: 280 KKEDGEGMCGMNGQSS 295
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 175/316 (55%), Gaps = 31/316 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + +K K+Y++ E R +F L ++ L + T G+ KFSDLT +EFR
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++G + + P + P + + LPT DWR GAVT +KDQG CGSCW+FSA
Sbjct: 62 NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
++E AHFL+T ELVSLSEQQL+DCD + D GC GG AF+++++ GGV
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWM 285
E+ YPYTG GSC +K+K+ ++ + ++ D V P+ VGI +
Sbjct: 170 TEEAYPYTGF-AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNF 227
Query: 286 QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Q Y G+ + C DH VL++GYG+ G PYWIIKNSWG +WGE+G+ +I
Sbjct: 228 QNYRSGILSGH-CSNSRDHAVLVIGYGTEG-------GMPYWIIKNSWGTSWGEDGFMRI 279
Query: 346 CM--GRNVCGVDSMVS 359
G +CG++ S
Sbjct: 280 KKKDGEGMCGMNGQSS 295
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 133/334 (39%), Positives = 179/334 (53%), Gaps = 26/334 (7%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH- 96
G Q+ + + FK +K Y + E +R ++F N AK +L V
Sbjct: 13 GSQAVSFFDLVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSF 72
Query: 97 --GVTKFSDLTPSEFRRQFLGLNRR---LRLPADAQKAPILPTND--LPTDFDWRDHGAV 149
G+ K++D+ EF + G NR LR LP + LP DWRD GAV
Sbjct: 73 KLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAV 132
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T VKDQG CGSCWSFSATG+LEG HF +G+LVSLSEQ LVDC E+ G ++GCN
Sbjct: 133 TPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC-----SEKFG--NNGCN 185
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM++AF YI GG++ E+ YPY D K+K A + S +ED++ +
Sbjct: 186 GGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSA 245
Query: 270 LVKHGPLAVGINAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
+ GP++V I+A Q Y GGV P LDHGVL+VGYG+ Y
Sbjct: 246 VATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGT------DY 299
Query: 327 WIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
W++KNSWG++WG+ GY K+ R N CG+ + S
Sbjct: 300 WLVKNSWGKSWGDQGYIKMARNRDNNCGIATEAS 333
>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 117/326 (35%), Positives = 179/326 (54%), Gaps = 28/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F KF+K Y+++ E RF++F+ NL + D TA + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGL----ALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQL+DCD+ D+GCNGGL+++A+E +
Sbjct: 137 CWAFATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
++ GGV+ E DYPY G+DG + + I+ E+++ L GP+ V I
Sbjct: 188 MQMGGVQAENDYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAI 247
Query: 281 NAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
+A + Y G+ C Y +H VL+VGYG PYWI+KN+WGE+WGE
Sbjct: 248 DASDIVNYRRGIM--RYCSNYGFNHAVLLVGYGVEN-------NVPYWILKNTWGEDWGE 298
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
GY+++ N CG+ + + + A I+
Sbjct: 299 QGYFRVQQNINACGIRNELLASAEIY 324
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/327 (38%), Positives = 179/327 (54%), Gaps = 34/327 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+S + A ++ + + +TY E + R++VF+ NLR
Sbjct: 26 IVSYGERSXEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 82
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG R +L A A DLP DWR
Sbjct: 83 VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAAD---NEDLPESVDWRAK 139
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG+CGSCW+FS A+EG + + TG+L+SLSEQ+LVDCD S +
Sbjct: 140 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 191
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ EKDYPY GTDG K+ + ++ + +++++
Sbjct: 192 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 251
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
V + P++V I A Q Y G+ CG LDHGV VGYG+ K
Sbjct: 252 LQKAVANQPVSVAIEAAGTAFQLYSSGIFTG-SCGTALDHGVTAVGYGTE-------NGK 303
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNV 351
YWI+KNSWG +WGE+GY + M RN+
Sbjct: 304 DYWIVKNSWGSSWGESGYVR--MERNI 328
>gi|15824693|gb|AAL09444.1| cysteine protease [Leishmania donovani]
Length = 394
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 175/313 (55%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E + LVSLSEQQLV CD + D+GCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYG 206
Query: 225 GVEREKDYPYTGTDGGSCK-FDKSKI--AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
V EK YPYT +G + + SK+ A + + +I S+E MAA L ++GP+A+G++
Sbjct: 207 IVFTEKSYPYTSGNGDVAECLNSSKLVPGARIDGYVMIPSNETVMAAWLAENGPIAIGVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY ++G PY +IKNSWGE+WGE
Sbjct: 267 ASSFMSYQSGVLTSC---AGDALNHGVLLVGYNTTGGV-------PYCVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVAMGLNAC 329
>gi|401416326|ref|XP_003872658.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|14348750|emb|CAC41275.1| CPB2 protein [Leishmania mexicana]
gi|322488882|emb|CBZ24132.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 359
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/309 (39%), Positives = 168/309 (54%), Gaps = 26/309 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQG CGSCW+FS+ G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGECGSCWAFSSVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + + +I S E MAA L K+GP+A+ ++A
Sbjct: 209 YTEDSYPYVSGNGYLPECSNSSELVVGAQIDSHVLIGSSEKAMAAWLAKNGPIAIALDAS 268
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK ++H VL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 269 SFMSYKSGVLTACI-GKEVNHAVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 320
Query: 344 KICMGRNVC 352
++ MG N C
Sbjct: 321 RVVMGVNAC 329
>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
Length = 365
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 125/328 (38%), Positives = 180/328 (54%), Gaps = 38/328 (11%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDP 92
+L +E +F F +++K+Y +E+ YR+ VFK NL + ++ R+ L
Sbjct: 47 NLDQSEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLST 106
Query: 93 TAVHGVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGA 148
+A GV KFSD TP E FL L++ L + + P LP +DWRD
Sbjct: 107 SAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPNIRLPDYYDWRDTNK 165
Query: 149 VTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGC 208
VT +KDQG CGSCW+F A G +E + + +L+ LSEQQL+DCD D GC
Sbjct: 166 VTPIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGC 216
Query: 209 NGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMA 267
NGGLM+ AF+ +L GGVE E DYPY G++ C D KIA +++ F DE+++
Sbjct: 217 NGGLMHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLK 275
Query: 268 ANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPY 326
+ GP+A+ ++A+ + Y G+ C Y L+H VL++G+G PY
Sbjct: 276 ELVYTTGPVAIAVDAMDIINYRRGILNQ--CHIYDLNHAVLLIGWGIEN-------NVPY 326
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGV 354
WIIKNSWGE+WGENGY ++ N CG+
Sbjct: 327 WIIKNSWGEDWGENGYLRVRRNVNACGL 354
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 187/355 (52%), Gaps = 26/355 (7%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAE--HHFSLFKS 58
M+ LS + L++ +++S D I + ++S N E + +
Sbjct: 1 MDSNTLSPAMKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLV 60
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K+Y E D RF +FK NL+ L+ T G+T+F+DLT E+R +FLG
Sbjct: 61 KHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKI 120
Query: 117 --NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
NRR++ ++ P + LP DWR GAV GVKDQ +CGSCW+FSA A+EG
Sbjct: 121 DPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEG 180
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+ + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E DY
Sbjct: 181 INKIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIISNGGIDSEDDY 232
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIG 290
PY DG + K+ + ++ + + ++ V + P+AV + Q Y
Sbjct: 233 PYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEY 292
Query: 291 GVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
GV CG LDHGV VGYG+ K YWI++NSWG +WGE GY ++
Sbjct: 293 GVFTGR-CGTALDHGVAAVGYGTE-------NGKDYWIVRNSWGGSWGEQGYIRL 339
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/316 (41%), Positives = 176/316 (55%), Gaps = 26/316 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F +K+ +YAT E R +++ANL ++ + V KF+DLT EF +
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81
Query: 113 FLGLNRRLRLPADAQKAPI-LPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+LGL + A LP LP DWR G VT +KDQG CGSCWSFS TG++
Sbjct: 82 YLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTGSV 141
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG H TG+LVSLSEQ LVDC S ++GCNGGLM+ AF+YI+ G++ E
Sbjct: 142 EGQHARKTGQLVSLSEQNLVDC-------SSAQGNAGCNGGLMDQAFQYIISNNGIDTES 194
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINAVW--MQT 287
YPYT D G+C+F+ + + A V+++ I+S + N V GP++V I+A Q
Sbjct: 195 SYPYTAQD-GTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQF 253
Query: 288 YIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y GV + P LDHGVL VGYG+SG YW++KNSWG +WG++GY I
Sbjct: 254 YSSGVYNEPACSSSQLDHGVLAVGYGTSG-------SSDYWLVKNSWGTSWGQSGY--IW 304
Query: 347 MGRNV---CGVDSMVS 359
M RN CG+ + S
Sbjct: 305 MTRNSNNQCGIATAAS 320
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 136/331 (41%), Positives = 174/331 (52%), Gaps = 32/331 (9%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---G 97
S +L E + FKS+ +K Y++ E RF++F N L AK V
Sbjct: 18 SSQEILRTE--WEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLA 75
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQ 155
+ KF DL P EF + G + P ND LPT DWR GAVT VK+Q
Sbjct: 76 MNKFGDLLPHEFAKMVNGYRGKQNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQ 135
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FS TG+LEG HF TG+LVSLSEQ LVDC + + GCNGGLM++
Sbjct: 136 GQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFG-------NQGCNGGLMDN 188
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
F+YI GG++ E+ +PYT D G CKF K+ + A + F + ED + + G
Sbjct: 189 GFQYIKANGGIDTEESHPYTAQD-GDCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVG 247
Query: 275 PLAVGINAVW--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKN 331
P++V I+A Q Y GV P LDHGVL VGYG K YW++KN
Sbjct: 248 PVSVAIDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGVK-------NGKKYWLVKN 300
Query: 332 SWGENWGENGYYKICMGR---NVCGVDSMVS 359
SWG +WG+NGY I M R N CG+ S S
Sbjct: 301 SWGGDWGDNGY--ILMSRDKDNQCGIASSAS 329
>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
Australia]
Length = 367
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 125/328 (38%), Positives = 180/328 (54%), Gaps = 38/328 (11%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDP 92
+L +E +F F +++K+Y +E+ YR+ VFK NL + ++ R+ L
Sbjct: 49 NLDQSEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLST 108
Query: 93 TAVHGVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGA 148
+A GV KFSD TP E FL L++ L + + P LP +DWRD
Sbjct: 109 SAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPNIRLPDYYDWRDTNK 167
Query: 149 VTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGC 208
VT +KDQG CGSCW+F A G +E + + +L+ LSEQQL+DCD D GC
Sbjct: 168 VTPIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGC 218
Query: 209 NGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMA 267
NGGLM+ AF+ +L GGVE E DYPY G++ C D KIA +++ F DE+++
Sbjct: 219 NGGLMHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLK 277
Query: 268 ANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPY 326
+ GP+A+ ++A+ + Y G+ C Y L+H VL++G+G PY
Sbjct: 278 ELVYTTGPVAIAVDAMDIINYRRGILNQ--CHIYDLNHAVLLIGWGIEN-------NVPY 328
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGV 354
WIIKNSWGE+WGENGY ++ N CG+
Sbjct: 329 WIIKNSWGEDWGENGYLRVRRNVNACGL 356
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 187/355 (52%), Gaps = 26/355 (7%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAE--HHFSLFKS 58
M+ LS + L++ +++S D I + ++S N E + +
Sbjct: 1 MDSNTLSPAMKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLV 60
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K+Y E D RF +FK NL+ L+ T G+T+F+DLT E+R +FLG
Sbjct: 61 KHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKI 120
Query: 117 --NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
NRR++ ++ P + LP DWR GAV GVKDQ +CGSCW+FSA A+EG
Sbjct: 121 DPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEG 180
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+ + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E DY
Sbjct: 181 INKIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIISNGGIDSEDDY 232
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIG 290
PY DG + K+ + ++ + + ++ V + P+AV + Q Y
Sbjct: 233 PYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEY 292
Query: 291 GVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
GV CG LDHGV VGYG+ K YWI++NSWG +WGE GY ++
Sbjct: 293 GVFTGR-CGTALDHGVAAVGYGTE-------NGKDYWIVRNSWGGSWGEQGYIRL 339
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 119/325 (36%), Positives = 183/325 (56%), Gaps = 28/325 (8%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVHGVTK 100
S D ++ A + L K K+Y E + RF++FK N L ++ D + G+ +
Sbjct: 35 STDDVIMAAYESWLVK--HGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNR 92
Query: 101 FSDLTPSEFRRQFLGL---NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
F+DLT E+R ++ G+ + R ++ +Q+ L LP DWR+HGAV VKDQG
Sbjct: 93 FADLTNEEYRSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQ 152
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS A+EG + ++TG+L++LSEQ+LVDCD S + GCNGGLM+ AF
Sbjct: 153 CGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDR--------SYNEGCNGGLMDDAF 204
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++I+ GG++ + DYPYTG DG ++ K+ + ++ + +++ + P++
Sbjct: 205 QFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPIS 264
Query: 278 VGINAVW--MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
V I A Q Y G+ CG LDHGV++VGYG+ K YWI++NSWG
Sbjct: 265 VAIEASGRDFQFYDSGIFTG-KCGTDLDHGVVVVGYGTE-------NGKDYWIVRNSWGA 316
Query: 336 NWGENGYYKICMG----RNVCGVDS 356
+WGE GY ++ G +CG+ S
Sbjct: 317 DWGEKGYLRMERGISSKAGICGITS 341
>gi|401430108|ref|XP_003879535.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491914|emb|CBZ40911.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 359
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/309 (39%), Positives = 167/309 (54%), Gaps = 26/309 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQG CGSCW+FS+ G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGECGSCWAFSSVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
E YPY +G + S + A + +I S E MAA L K+GP+A+ ++A
Sbjct: 209 YTEDSYPYVSGNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 268
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV I GK ++H VL+VGY +G E PYW+IKNSWG +WGE GY
Sbjct: 269 SFMSYKSGVLTACI-GKQVNHAVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 320
Query: 344 KICMGRNVC 352
++ MG N C
Sbjct: 321 RVVMGVNAC 329
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/325 (39%), Positives = 177/325 (54%), Gaps = 36/325 (11%)
Query: 53 FSLFK---SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
S++K +K K Y E RF +FK NLR + T G+TKF+DLT E+
Sbjct: 1 MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEY 60
Query: 110 RRQFLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
R FLG R ++ + +++ + LP DWR GAV +KDQG+CGSCW+F
Sbjct: 61 RAMFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAF 120
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S A+EG + + TGEL+SLSEQ+LVDCD + ++GCNGGLM+ AF++I+ G
Sbjct: 121 STVAAVEGINQIVTGELISLSEQELVDCDR--------TYNAGCNGGLMDYAFQFIINNG 172
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G++ EKDYPY G D K A ++ F + +++ V H P++V I A
Sbjct: 173 GLDTEKDYPYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASG 232
Query: 285 M--QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
M Q Y GV CG LDHGV++VGY S YW+++NSWG WGE+GY
Sbjct: 233 MALQFYQSGVFTGE-CGTALDHGVVVVGYASENGL-------DYWLVRNSWGTEWGEHGY 284
Query: 343 YKICMGRNV-------CGVDSMVSS 360
K M RNV CG+ +M SS
Sbjct: 285 IK--MQRNVGDTYTGRCGI-AMESS 306
>gi|394331822|gb|AFN27130.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 168/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPYAVDWRKKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G++E L+ L +LSEQQLV CD + DSGC GGLM AFE++L+ G
Sbjct: 156 VGSIESQWALAGHRLTALSEQQLVSCDDK---------DSGCGGGLMLQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K+GP+++ ++
Sbjct: 207 TMFTEDSYPYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYESGVLTSC---AGITLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
NGY ++ MG N C
Sbjct: 317 NGYVRVTMGVNAC 329
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/331 (38%), Positives = 185/331 (55%), Gaps = 31/331 (9%)
Query: 40 EQSEDHLLNAEHH----FSLFKSKFSKTYATQ-EEHDYRFRVFKANLRRAKRRQLLDPTA 94
EQ E LL+A+ + F + +++K YA +E + RF V+ NL +
Sbjct: 28 EQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTSH 87
Query: 95 VHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA-QKAPIL----PTNDLPTDFDWRDHGAV 149
+ F+DLT EFR + LG + + R ++ Q +P + N LPT+ DWR GAV
Sbjct: 88 WLHLNAFADLTTDEFRNR-LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAV 146
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T VK+QG CGSCW+F+ TG++EG + + TGEL SLSEQ+LVDCD + D GC+
Sbjct: 147 TEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLSEQELVDCDTD--------EDRGCS 198
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ A+++I+K GG++ E DYPYT DG K++ + + I +++
Sbjct: 199 GGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKK 258
Query: 270 LVKHGPLAVGI--NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
H P+AV I +A Q Y GGV CG L+HGVL+VGYG F YW
Sbjct: 259 AAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLNHGVLVVGYGKDP----HFGN--YW 312
Query: 328 IIKNSWGENWGENGYYKICMG----RNVCGV 354
I+KNSWG WG+NGY ++ MG + +CG+
Sbjct: 313 IVKNSWGPEWGDNGYIRLRMGAEDVQGMCGI 343
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/346 (38%), Positives = 183/346 (52%), Gaps = 31/346 (8%)
Query: 19 ASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFK 78
AS ++ D DA P + +DH ++ F F+ +K YAT+EE R+ +FK
Sbjct: 61 ASPSSITDGDAKY----PEKIWEWKDHHFQSQ--FYQFQRDHNKFYATEEERLKRYAIFK 114
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR-RLRLPADAQKAPI--LPTN 135
NL + + V + KF DLT EFR+++LG + LR P + + N
Sbjct: 115 NNLTYIHNHNMQGYSYVLKMNKFGDLTLEEFRQRYLGYKKPDLRTPPREVDTTLESVEDN 174
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
D+PT DWR G VT VKDQG CGSCW+FSATGA+EG + TG+LV+LS+QQLVDC
Sbjct: 175 DIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRF 234
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
+ GC+GG M AFEY+++ GG+ ++YPY D G CK + A ++
Sbjct: 235 LG-------NQGCDGGRMEEAFEYVVENGGICSGENYPYMRKD-GVCKSSQCTSVATITG 286
Query: 256 F-SVISSDEDQMAANLVKHGPLAVGI--NAVWMQTYIGGV-SCPYICGKYLDHGVLIVGY 311
+ SV E M L P++V I N Q Y G+ P CG LDHGVL+VGY
Sbjct: 287 YRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGIFDAP--CGTNLDHGVLLVGY 344
Query: 312 GSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR---NVCGV 354
+ + YWI+KNSWG WG+ GY + M + CGV
Sbjct: 345 SAETAG-----QGDYWIMKNSWGAAWGKGGYMLMAMHKGPAGQCGV 385
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/362 (37%), Positives = 196/362 (54%), Gaps = 33/362 (9%)
Query: 7 SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
S L L S L +++AV D +++ S+ +S D L+ F + S+ K Y +
Sbjct: 6 SKALFLACSFCLFASLAVAGDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYQS 60
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA 126
EE +RF +FK NL+ R + G+ +F+DL+ EF+ ++LGL ++
Sbjct: 61 IEEKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRES 120
Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
+ +LP DWR GAVT VK+QG+CGSCW+FS A+EG + + TG L SLSE
Sbjct: 121 PEEFTYKDFELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSE 180
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
Q+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E+DYPY + G+C+ K
Sbjct: 181 QELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGTCEMTK 231
Query: 247 SKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLD 303
+ +S + + + +Q + + PL+V I A Q Y GGV + CG LD
Sbjct: 232 EETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYSGGVFDGH-CGSDLD 290
Query: 304 HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGVDSM 357
HGV VGYG+S K Y I+KNSWG WGE GY I M RN +CG+ M
Sbjct: 291 HGVAAVGYGTS-------KGVNYIIVKNSWGSKWGEKGY--IRMRRNIGKPEGICGIYKM 341
Query: 358 VS 359
S
Sbjct: 342 AS 343
>gi|378943048|gb|AFC76265.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 172/312 (55%), Gaps = 32/312 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ + F +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQRRLANFERNLELMREHQARNPHARFGITKFFDLSEAVFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ +LV LSEQQLV CDH D+GC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIA--AAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
V EK YPY +G C + S++A A + + + S E MAA L K+GP+++ +
Sbjct: 207 TVFTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAV 265
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A +Y GV I G+ L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 266 DASSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEK 317
Query: 341 GYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 143/372 (38%), Positives = 197/372 (52%), Gaps = 41/372 (11%)
Query: 11 LLLLSSVLA--SAVAVNDDDAMIRQVVPSDGEQSEDHLLNA---------EHHFSLFKSK 59
+L + SVLA S V +++ + + + H+L A E + FK
Sbjct: 26 VLWIVSVLAVVSGANVQNENVQWFDLESAQKHPEQLHILKAQTGINYQPYEQAWKEFKIL 85
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSEFRRQFLG 115
K+Y EE RF +F+ N+ R ++ L + GV +F+DL +EF F G
Sbjct: 86 HDKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKSYYLGVNQFTDLEYAEFV-NFNG 144
Query: 116 LNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
L ++ + + + L N++ P DWR G VT VK+QGACGSCW+FSATG+LEG
Sbjct: 145 L--KMTNLNNTKCSSHLSANNIVVPDSVDWRSKGYVTKVKNQGACGSCWAFSATGSLEGQ 202
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDY 232
+F G+LV LSE QLVDC SGS + GCNGG M +AF+Y+ GG+E E DY
Sbjct: 203 YFRKNGKLVPLSESQLVDC--------SGSFGNEGCNGGFMENAFKYVKSVGGIESESDY 254
Query: 233 PYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYI 289
PY +C FDK+K+ A VS V S E + + + GP++V I+A Q Y
Sbjct: 255 PYKARQ-RTCAFDKTKVIATVSGCVDVESGSESSLKEVVSEVGPVSVAIDAGHSSFQLYA 313
Query: 290 GGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
GGV +C L+HGVL VGYG+S + K YWI+KNSWG WG GY K+
Sbjct: 314 GGVYDEPLCSTSRLNHGVLCVGYGTS------LQGKDYWIVKNSWGVRWGVEGYIKMSRN 367
Query: 349 R-NVCGVDSMVS 359
+ N CG+ S S
Sbjct: 368 KNNQCGIASEAS 379
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 169/321 (52%), Gaps = 31/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G +R R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HRGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW 284
++ EK YPY D G C+F K + A + + I + E + + GP++V I+A
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASH 256
Query: 285 --MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV P + LDHGVL+VGYG G K YW++KNSW E+WG+ G
Sbjct: 257 SSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQG 309
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ S S
Sbjct: 310 Y--ILMSRDNNNQCGIASQAS 328
>gi|332326589|gb|AEE42618.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 167/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHHRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ L +LSEQQLV CD + DSGCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHRLTALSEQQLVSCDDK---------DSGCNGGLMTQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K GP+++ ++
Sbjct: 207 TMLTEDSYPYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYESGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVTMGVNAC 329
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/360 (34%), Positives = 191/360 (53%), Gaps = 30/360 (8%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M IL++ + +LL +A + P +Q + + F + +
Sbjct: 1 MTSTILTTTIFILLMLCNTCVIASESE-------CPPTHKQKSSDVEAMKKRFDGWVKRH 53
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
+ Y +E + RF +++AN++ + + + KF+DLT EF+ ++GL+ RL
Sbjct: 54 GRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLSTRL 113
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
R + DLP DWR GAVT + DQG CG CW+F+A A+EG + + +G+
Sbjct: 114 RSHNTGFRYD--EHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGK 171
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
L+SLSEQ+L+DCD + S + GC GGLM +A+ +I++ GG+ E+DYPY G D G
Sbjct: 172 LISLSEQELIDCDVK-------SGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVD-G 223
Query: 241 SCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYI 297
+CK +K + AA++S + + +D + H P++V I+A Q Y GV I
Sbjct: 224 TCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSG-I 282
Query: 298 CGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
CGK L+HGV +VGYG YWI+KNSWG +WGE+GY I M R+ + M
Sbjct: 283 CGKQLNHGVTVVGYGKETI-------NKYWIVKNSWGADWGESGY--IRMKRDTLSKEGM 333
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 123/316 (38%), Positives = 174/316 (55%), Gaps = 31/316 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + +K K+Y++ E R VF L ++ + T G+ KFSDLT +EFR
Sbjct: 2 FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++G + + P + P + + LPT DWR GAVT +KDQG CGSCW+FSA
Sbjct: 62 NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
++E AHFL+T ELVSLSEQQL+DCD + D GC GG + AF+++++ GGV
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPDDAFKFVVENGGVT 169
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWM 285
E+ YPYTG GSC +K+K+ ++ + ++ D V P+ VGI +
Sbjct: 170 TEEAYPYTGF-AGSCNTNKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNF 227
Query: 286 QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Q Y G+ C DH VL++GYG+ G PYWIIKNSWG +WGE+G+ KI
Sbjct: 228 QNYRSGILSGQCCNS-RDHAVLVIGYGTEG-------GMPYWIIKNSWGTSWGEDGFMKI 279
Query: 346 CM--GRNVCGVDSMVS 359
G +CG++ S
Sbjct: 280 KKKDGEGMCGMNGQSS 295
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 133/334 (39%), Positives = 179/334 (53%), Gaps = 26/334 (7%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH- 96
G Q+ + + FK +K Y + E +R ++F N AK +L V
Sbjct: 13 GSQAVSFFDLVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSF 72
Query: 97 --GVTKFSDLTPSEFRRQFLGLNRR---LRLPADAQKAPILPTND--LPTDFDWRDHGAV 149
G+ K++D+ EF + G NR LR LP + LP DWRD GAV
Sbjct: 73 KLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAV 132
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T VKDQG CGSCWSFSATG+LEG HF +G+LVSLSEQ LVDC E+ G ++GCN
Sbjct: 133 TPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC-----SEKFG--NNGCN 185
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM++AF YI GG++ E+ YPY D K+K A + S +ED++ +
Sbjct: 186 GGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSA 245
Query: 270 LVKHGPLAVGINAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
+ GP++V I+A Q Y GGV P LDHGVL+VGYG+ Y
Sbjct: 246 VATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDDGT------DY 299
Query: 327 WIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
W++KNSWG++WG+ GY K+ R N CG+ + S
Sbjct: 300 WLVKNSWGKSWGDQGYIKMARNRDNNCGIATEAS 333
>gi|332326593|gb|AEE42620.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 167/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ L +LSEQQLV CD + DSGC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAGHRLTALSEQQLVSCDDK---------DSGCGGGLMTQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K GP+++G++
Sbjct: 207 TMFTEDSYPYVSSXGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIGVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYESGVLTSC---AGBXLNHGVLLVGYNXTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVAMGVNAC 329
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 175/320 (54%), Gaps = 30/320 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
FK + K Y E +R ++F N + AK Q V V K++DL EFR+
Sbjct: 32 FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91
Query: 112 QFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSCW+F
Sbjct: 92 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI G
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV 283
G++ EK YPY D SC F+K I A F+ I DE +MA + GP++V I+A
Sbjct: 205 GIDTEKSYPYEAID-DSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 284 W--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV + P + LDHGVL+VG+G+ YW++KNSWG WG+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGD------DYWLVKNSWGTTWGDK 317
Query: 341 GYYKICMGR-NVCGVDSMVS 359
G+ K+ + N CG+ S S
Sbjct: 318 GFIKMLRNKENQCGIASASS 337
>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
Length = 335
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 122/329 (37%), Positives = 183/329 (55%), Gaps = 30/329 (9%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQLLD-PTAVHGVTKF 101
+L A +F F ++K Y + E + R+ +FK NL AK D PTA +G+ KF
Sbjct: 27 NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYGINKF 86
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACG 159
SDL+ SE +F GL+ R ++ K +L P + P FDWR+ VT +K+QGACG
Sbjct: 87 SDLSKSELIAKFTGLSIPQR-ASNFCKTIVLNQPPDKGPLHFDWREQNKVTSIKNQGACG 145
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
+CW+F+ ++E + LV LSEQQL+DCD S D GCNGGL+++AFE
Sbjct: 146 ACWAFATLASVESQFAMRHNRLVDLSEQQLIDCD---------SVDMGCNGGLLHTAFEE 196
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I++ GGV+ E DYP+ G D C D+ + + + V + + +E+++ L GP+
Sbjct: 197 IIRMGGVQAELDYPFVGRD-RRCGVDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIP 255
Query: 278 VGINAVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
+ I+A + Y GV SC L+H VL+VGYG PYW KN+WG+
Sbjct: 256 MAIDAADIVNYYRGVISSCE---NNGLNHAVLLVGYGVENGV-------PYWAFKNTWGD 305
Query: 336 NWGENGYYKICMGRNVCGVDSMVSSVAAI 364
+WGENGY+++ N CG+ + ++S A +
Sbjct: 306 DWGENGYFRVRQNINACGMVNDLASTAVL 334
>gi|157868354|ref|XP_001682730.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
gi|68126185|emb|CAJ07238.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
Length = 354
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 170/311 (54%), Gaps = 25/311 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
A H+ FK + K++ + +RF FK N++ A +P A + V+ KF+DLTP
Sbjct: 38 ASAHYGRFKERHGKSFGEDADEGHRFNAFKQNMQTAYFLNTHNPHAHYDVSGKFADLTPQ 97
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + +L + D ++ + + L DWR+ GAVT VK+QG CGSCW+FS
Sbjct: 98 EFAKLYLNPDYYAHRGKDYKEHVHVDDSVLSGAMSVDWREKGAVTPVKNQGMCGSCWAFS 157
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
A G +E L LVSLSEQ LV CD D GCNGGLM+ A E+I++
Sbjct: 158 AIGNIESQWALKNHSLVSLSEQMLVSCD---------DIDDGCNGGLMDQAMEWIIQHHN 208
Query: 224 GGVEREKDYPYTGTDGGSCK-FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
G V EK YPY G S DK + A +S + + DE +AA + K GP+AV ++A
Sbjct: 209 GTVPTEKSYPYASAGGTSPPCHDKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDA 268
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GGV +C G L+HGVL+VG+ + + PYWI+KNSWG +WGE G
Sbjct: 269 TTWQLYFGGVVT--LCFGLSLNHGVLVVGFN-------KRAKPPYWIVKNSWGTSWGEKG 319
Query: 342 YYKICMGRNVC 352
Y ++ MG N C
Sbjct: 320 YIRLAMGSNQC 330
>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
Length = 331
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 116/315 (36%), Positives = 169/315 (53%), Gaps = 21/315 (6%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTPSEF 109
F+ F ++ K+YA+ EE + RF +F NL + + G+TKF+D++ EF
Sbjct: 32 EQFNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEF 91
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH-GAVTGVKDQGACGSCWSFSATG 168
+ + L N + P P+ FDWR+ G VT V DQG CGSCW+FSAT
Sbjct: 92 QSRVLMSNPPPPPTEKPYRGPKFEGFTAPSTFDWRNKPGVVTPVYDQGQCGSCWAFSATE 151
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+E L+ +L LS QQ+VDC D GC GG + A++Y++ A G++
Sbjct: 152 NIESQWALAGHKLTGLSMQQIVDCSW---------WDDGCGGGFPSYAYDYVIDAPGLDA 202
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD--EDQMAANLVKHGPLAVGINAVWMQ 286
+YPYT GGSC F +S++ A +S+++ ++D E QMA L +HGP++V ++A
Sbjct: 203 LANYPYTAV-GGSCAFKESQVVAKISSWTYTTTDSNEHQMANYLAQHGPISVCVDAESWP 261
Query: 287 TYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
+Y GGV CG +DH VL VGY + PYWII+NSWG +WG GY +
Sbjct: 262 SYTGGVYRASACGTSIDHCVLAVGYNLTA-------NPPYWIIRNSWGTSWGLEGYMHLE 314
Query: 347 MGRNVCGVDSMVSSV 361
G + C V M +S
Sbjct: 315 FGTDACAVAEMTTSA 329
>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
Length = 353
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 139/345 (40%), Positives = 185/345 (53%), Gaps = 33/345 (9%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFKANLR 82
DD IR + SD E ++ H F+ F + K Y + +E RFR+F NL+
Sbjct: 26 DDANPIR--LASDLESQVLDVIGQSRHALSFARFARRHGKRYRSVDEIRNRFRIFSDNLK 83
Query: 83 RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFD 142
+ T GV F+D T EF R LG + A + L LP + D
Sbjct: 84 LIRSTNRRSLTYTLGVNHFADWTWEEFTRHKLGAPQNC--SATLKGNHRLTDAVLPDEKD 141
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR G V+ VKDQG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +G
Sbjct: 142 WRKEGIVSQVKDQGNCGSCWTFSTTGALEAAYAQAFGKNISLSEQQLVDC--------AG 193
Query: 203 SCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SNFSV 258
+ ++ GCNGGL + AFEYI GG++ E+ YPYTG D G CKF +A V N ++
Sbjct: 194 AFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD-GVCKFTAKNVAVRVIDSINITL 252
Query: 259 ISSDEDQMAANLVKHGPLAVGIN-AVWMQTYIGGVSCPYICGKY---LDHGVLIVGYGSS 314
+ DE + A V+ P++V A + Y GV ICG ++H VL VGYG
Sbjct: 253 GAEDELKQAVAFVR--PVSVAFEVAKDFRFYNNGVYTSTICGSTPMDVNHAVLAVGYGVE 310
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
PYWIIKNSWG NWG+NGY+K+ +G+N+CGV + S
Sbjct: 311 -------DGVPYWIIKNSWGSNWGDNGYFKMELGKNMCGVATCAS 348
>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 134/344 (38%), Positives = 189/344 (54%), Gaps = 36/344 (10%)
Query: 31 IRQVVPSDGEQSEDHLLN----AEHHFSL--FKSKFSKTYATQEEHDYRFRVFKANLRRA 84
IRQVV + E+ +L + H S F ++ K Y + EE RF VF NL+
Sbjct: 33 IRQVVSDGLHELENGILQVVGQSRHALSFVRFAHRYGKRYESVEEIKQRFEVFLDNLKMI 92
Query: 85 KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDW 143
+ + GV +F+DLT EFRR LG + + K + TN LP DW
Sbjct: 93 RSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQNC---SATTKGNVKLTNAVLPETKDW 149
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
R+ G V+ VK+QG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +G+
Sbjct: 150 REDGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDC--------AGA 201
Query: 204 CDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SNFSVI 259
++ GCNGGL + AFEYI GG++ E+ YPYTG + G CKF + V N ++
Sbjct: 202 FNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKN-GLCKFSSENVGVKVIDSVNITLG 260
Query: 260 SSDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVSCPYICGKY---LDHGVLIVGYGSSG 315
+ DE + A LV+ P+++ + + Y GV CG ++H VL VGYG
Sbjct: 261 AEDELKYAVALVR--PVSIAFEVIKGFKQYKSGVYSSTECGNTPMDVNHAVLAVGYGVEN 318
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
PYW+IKNSWG +WG++GY+K+ MG+N+CG+ + S
Sbjct: 319 GV-------PYWLIKNSWGADWGDDGYFKMEMGKNMCGIATCAS 355
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 129/316 (40%), Positives = 175/316 (55%), Gaps = 34/316 (10%)
Query: 62 KTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLN 117
K Y + E +R ++F N + AK QL V V K++D+ EFR+ G N
Sbjct: 114 KNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMNGFN 173
Query: 118 ----RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+ LR ++ K + + LP DWRD GAVTGVKDQG CGSCW+FS+TGAL
Sbjct: 174 YTLHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGAL 233
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG H+ +G LVSLSEQ LVDC + ++GCNGGLM++AF YI GG++ EK
Sbjct: 234 EGQHYRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNGGIDTEK 286
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVW--MQT 287
YPY D SC F+K I A F + +E ++A + GP++V I+A Q
Sbjct: 287 SYPYEALD-DSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQF 345
Query: 288 YIGGVSCPYIC-GKYLDHGVLIVGYGS--SGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Y GV C + LDHGVL+VG+G+ SG + YW++KNSWG WG+ G+ K
Sbjct: 346 YSEGVYVEPACDAQNLDHGVLVVGFGTDESG--------QDYWLVKNSWGTTWGDKGFIK 397
Query: 345 ICMGR-NVCGVDSMVS 359
+ + N CG+ S S
Sbjct: 398 MLRNKDNQCGIASASS 413
>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 361
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 183/342 (53%), Gaps = 35/342 (10%)
Query: 26 DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
D+ IR V SDG E+S +L H F+ F ++ K Y EE RF +FK
Sbjct: 27 DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
NL + + GV +F+DLT EF+R LG + A + + + LP
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DWR+ G V+ VKDQG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
+ GCNGGL + AFEYI GG++ EK YPYTG D +CKF + V N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254
Query: 256 FSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVSCPYICGKY---LDHGVLIVGY 311
++ + DE + A LV+ P+++ + + Y GV CG ++H VL VGY
Sbjct: 255 ITLGAEDELKHAVGLVR--PVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312
Query: 312 GSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCG 353
G PYW+IKNSWG +WG+ GY+K+ MG+N+CG
Sbjct: 313 GVEDGV-------PYWLIKNSWGADWGDKGYFKMEMGKNMCG 347
>gi|1581747|prf||2117247C Cys protease:ISOTYPE=3
Length = 469
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 125/318 (39%), Positives = 166/318 (52%), Gaps = 26/318 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK + K Y + E +R VFK NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAAFKQRHGKVYGSAAEETFRLGVFKENLLFARLHAAANPHASFGVTPFSDLTREEFRS 96
Query: 112 QF-----LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
++ + R+ + + P DWR GAVT +KDQG C SCW+FS
Sbjct: 97 RYHNAAAHFAAAQKRVRVPVEVEVEVEVGGAPAAVDWRARGAVTAIKDQGNCSSCWAFST 156
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--G 224
G +EG L+ L LSEQ LV CD+ D+GC+GGLM+SAF++I++ G
Sbjct: 157 IGNIEGQWHLAGNPLTGLSEQMLVSCDNA---------DNGCDGGLMDSAFDWIVEQNNG 207
Query: 225 GVEREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
V E Y Y G D +C + A +S + DED+MAA L +GPLA+ ++A
Sbjct: 208 SVYTEASYSYVSGGGDSQTCDMSDHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDA 267
Query: 283 VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+Y GGV + + LDHGV++VGY S PYWIIKNSWG +WGE GY
Sbjct: 268 TSFMSYTGGVLTNCVSDQ-LDHGVVLVGYNDS-------SNPPYWIIKNSWGADWGEEGY 319
Query: 343 YKICMGRNVCGVDSMVSS 360
+I G N C V + S
Sbjct: 320 IRIQKGTNQCLVKNYACS 337
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 171/321 (53%), Gaps = 31/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVF-KANLRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F +++L A+ V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW 284
++ EK YPY D G C+F K + A + + I + ED + + GP++V I+A
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASH 256
Query: 285 --MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV P + LDHGVL+VGYG G K YW++KNSW E+WG+ G
Sbjct: 257 SSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQG 309
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ S S
Sbjct: 310 Y--ILMSRDNNNQCGIASQAS 328
>gi|82659048|gb|ABB88697.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 168/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G++E L+ L +LSEQQLV CD + D+GC GGLM AFE++L+ G
Sbjct: 156 VGSIESQWALAGHGLTALSEQQLVSCDDK---------DNGCGGGLMLQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K+GP+++ ++
Sbjct: 207 TMFTEDSYPYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYQSGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
NGY ++ MG N C
Sbjct: 317 NGYVRVTMGVNAC 329
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 135/344 (39%), Positives = 193/344 (56%), Gaps = 47/344 (13%)
Query: 40 EQSEDHLLNAEHHFSLF---KSKFSKTYATQEEHDYRFRVFKANL-----RRAKRRQLLD 91
E D L+ E +F K K K Y EE + RF FK NL R AKR+
Sbjct: 33 EHEIDAFLSEERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKW 92
Query: 92 PTAVHGVTKFSDLTPSEFRRQFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
V G+ KF+D++ EFR+ +L +N+ + L + ++ + + D P+ DWR++
Sbjct: 93 EHHV-GLNKFADMSNEEFRKAYLSKVKKPINKGITLSRNMRRK--VQSCDAPSSLDWRNY 149
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
G VT VKDQG+CGSCW+FS+TGA+EG + L TG+L+SLSEQ+LV+CD + +
Sbjct: 150 GVVTAVKDQGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECD---------TSNY 200
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK--SKIAAAVSNFSVISSDED 264
GC GG M+ AFE+++ GG++ E DYPYTG D G+C K +K+ + V SD
Sbjct: 201 GCEGGYMDYAFEWVINNGGIDSESDYPYTGVD-GTCNTTKEETKVVSIDGYQDVEQSDSA 259
Query: 265 QMAANLVKHGPLAVGIN--AVWMQTYIGGV---SCPYICGKYLDHGVLIVGYGSSGFAPI 319
+ A V P++VGI+ A+ Q Y GG+ SC +DH VLIVGYGS
Sbjct: 260 LLCA--VAQQPVSVGIDGSAIDFQLYTGGIYDGSCSDDPDD-IDHAVLIVGYGSE----- 311
Query: 320 RFKEKPYWIIKNSWGENWGENGYYKIC----MGRNVCGVDSMVS 359
+ YWI+KNSWG +WG +GY+ + + VC V++M S
Sbjct: 312 --DSEEYWIVKNSWGTSWGIDGYFYLKRDTDLPYGVCAVNAMAS 353
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 174/319 (54%), Gaps = 28/319 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F + SK Y ++E RF ++++N++ L +F+D+T SEF
Sbjct: 40 KQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEF 99
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+ FLGLN Q+ P ++P DWR GAVT +++QG CG CW+FSA A
Sbjct: 100 KAHFLGLNTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAA 159
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TG LVSLSEQQL+DCD G+ + GC+GGLM +AFE+I GG+ E
Sbjct: 160 IEGINKIKTGNLVSLSEQQLIDCD-------VGTYNKGCSGGLMETAFEFIKTNGGLATE 212
Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQ 286
DYPYTG + G+C +KSK + + ++ +E + + P++VGI+A Q
Sbjct: 213 TDYPYTGIE-GTCDQEKSKNKVVTIQGYQKVAQNEASLQIAAAQQ-PVSVGIDAGGFIFQ 270
Query: 287 TYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y GV Y CG L+HGV +VGYG G ++ YWI+KNSWG WGE GY I
Sbjct: 271 LYSSGVFTNY-CGTNLNHGVTVVGYGVEG-------DQKYWIVKNSWGTGWGEEGY--IR 320
Query: 347 MGRNV------CGVDSMVS 359
M R V CG+ M S
Sbjct: 321 MERGVSEDTGKCGIAMMAS 339
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 179/324 (55%), Gaps = 39/324 (12%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRR---QLLDPTAVHGVTKFSDLTPSEFRR 111
FK + K Y + E R +++ N L+ A+ +L T + K+ D+ EF+
Sbjct: 31 FKMEHKKCYKHEAEERLRMKIYMKNKLQIAQHNCDYELKKVTYRLKINKYGDMLNHEFKN 90
Query: 112 QFLGLNRRL-------RLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
G NR + RLP A A I P N +LP DWR GAVT VKDQG CGSCW+
Sbjct: 91 MLNGYNRTINHTLRNERLPVGA--AFIEPCNVELPKMVDWRKCGAVTEVKDQGHCGSCWA 148
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILK 222
FSATG+LEG HF TG LVSLSEQ L+DC SGS ++GCNGGLM+ AF YI
Sbjct: 149 FSATGSLEGQHFRRTGVLVSLSEQNLIDC--------SGSYGNNGCNGGLMDQAFSYIKD 200
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAVGIN 281
G++ EK YPY G D C++DK A+ F I DE ++ A + GP++V I+
Sbjct: 201 NKGLDTEKTYPYEGED-DKCRYDKRSSGASDVGFVDIPVGDEQKLKAAVATVGPVSVAID 259
Query: 282 AVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A Q Y G+ P LDHGVL+VGYG+ + + YWI+KNSWGE+WG
Sbjct: 260 ASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDE------EGRDYWIVKNSWGESWG 313
Query: 339 ENGYYKICMGRNV---CGVDSMVS 359
E GY K M RN+ CG+ S S
Sbjct: 314 EKGYIK--MARNIDNHCGIASSAS 335
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 181/337 (53%), Gaps = 25/337 (7%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
I ++ SD Q D + A + L K Y E + RF +FK NLR
Sbjct: 42 IPEIPHSDAHQRPDEEVAALYESWLVH--HGKAYNAIGEKERRFEIFKDNLRFIDEHNRE 99
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLG--LNRRLRL-PADAQKAPILPTNDLPTDFDWRDHG 147
T G+T+F+DLT E+R +FLG +R+ RL A + + +DLP D DWR G
Sbjct: 100 SRTYKVGLTRFADLTNEEYRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKG 159
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AV VKDQG CGSCW+FS+ A+EG + + TGEL+ LSEQ+LVDCD S + G
Sbjct: 160 AVATVKDQGQCGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDK--------SFNMG 211
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLM+ AF++I+ GG++ E+DYPY G D K+ + + + +++
Sbjct: 212 CNGGLMDYAFQFIIGNGGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSL 271
Query: 268 ANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKP 325
V + P++V I A Q Y GV CG LDHGV+ VGYG+
Sbjct: 272 KKAVANQPVSVAIEAGGRAFQLYQSGVFTGR-CGTDLDHGVVAVGYGTD-------NGTD 323
Query: 326 YWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
YWI++NSWG++WGE+GY + + RNV + + +A
Sbjct: 324 YWIVRNSWGKDWGESGYIR--LERNVANITTGKCGIA 358
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 128/346 (36%), Positives = 182/346 (52%), Gaps = 24/346 (6%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L+LL V A + A + Q + D + A + L K K Y E
Sbjct: 1 MLMLLFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKH--GKNYNALGE 58
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--RRLRLPADAQ 127
+ RF +FK NL + + T G+ +F+DLT EFR +LG + RLP +
Sbjct: 59 KEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD 118
Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + LP DWR GAV VKDQG CGSCW+FS A+EG + + TG+L++LSEQ
Sbjct: 119 RYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQ 178
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
+LVDCD S + GCNGGLM+ AFE+I+ GG++ E DYPY G DG + K+
Sbjct: 179 ELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKN 230
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHG 305
++ ++ + +++ V + P++V I Q Y GV CG LDHG
Sbjct: 231 AKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGE-CGTSLDHG 289
Query: 306 VLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
V VGYG+ K K YWI++NSWG++WGE+GY + M RN+
Sbjct: 290 VAAVGYGTE-------KGKDYWIVRNSWGKSWGESGYIR--MERNI 326
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 127/354 (35%), Positives = 197/354 (55%), Gaps = 38/354 (10%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTY 64
S+LL+L+ S L+SA D ++I +++ H + +L++S + K+Y
Sbjct: 11 SILLMLIFSTLSSA----SDMSIISY------DETHIHRRTDDEVSALYESWLIEHGKSY 60
Query: 65 ATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RL 120
E D RF++FK NLR ++ + + + G+TKF+DLT E+R +LG R
Sbjct: 61 NALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRK 120
Query: 121 RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+L + + D LP DWR+ G + GVKDQG+CGSCW+FSA A+E + + TG
Sbjct: 121 KLSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTG 180
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L+SLSEQ+LVDCD S + GC+GGLM+ AFE+++K GG++ E+DYPY +G
Sbjct: 181 NLISLSEQELVDCDR--------SYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNG 232
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYI 297
++ K+ + ++ + + ++ V H P+++ + A Q Y G+
Sbjct: 233 VCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGK- 291
Query: 298 CGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG +DHGV+I GYG+ YWI++NSWG NWGENGY ++ RNV
Sbjct: 292 CGTAVDHGVVIAGYGTE-------NGMDYWIVRNSWGANWGENGYLRV--QRNV 336
>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
Length = 323
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 118/325 (36%), Positives = 181/325 (55%), Gaps = 29/325 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K IL P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVIILDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGI 280
K GGV+ E DYPY D +C+ + +K V + + I E+++ L GP+ + I
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE+
Sbjct: 247 DAADIVNYKQGI-IKYCFNSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGED 298
Query: 341 GYYKICMGRNVCGVDSMVSSVAAIH 365
G++++ N CG+ + ++S A I+
Sbjct: 299 GFFRVQQNINACGMRNELASTAVIY 323
>gi|332326583|gb|AEE42615.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 166/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHHRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ L LSEQQLV CD + DSGCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVADHRLXXLSEQQLVSCDDK---------DSGCNGGLMTQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDGG--SCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K GP+++ ++
Sbjct: 207 TMLTEDSYPYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYESGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVTMGVNAC 329
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 137/359 (38%), Positives = 192/359 (53%), Gaps = 39/359 (10%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KF 60
L S L+LSS ++ D+ + S ++ D LL SL++S K
Sbjct: 18 LFFSLASFLMLSSASDMSIITYDETHGLN----SPPLRTHDQLL------SLYESWLVKH 67
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQ-LLDPTAVHGVTKFSDLTPSEFRRQFLG--LN 117
K Y E + RF +FK N+ R + + + G+ KF+DLT E+R +L +
Sbjct: 68 HKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSGKMM 127
Query: 118 RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
+R R D ++ D LP DWRD GAV VKDQG CGSCW+FS GA+EG +
Sbjct: 128 KRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVEGIN 187
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ TGEL+SLSEQ+LVDCD+ + GCNGGLM+ AFE+I+K GG++ E DYPY
Sbjct: 188 KIVTGELISLSEQELVDCDN--------GYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPY 239
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGV 292
G DG + K+ ++ + + ++++ V H P++V I A Q Y GV
Sbjct: 240 KGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGV 299
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG LDHGV+ VGYGS K YWI++NSWG +WGE+GY + + RNV
Sbjct: 300 FTGQ-CGTELDHGVVAVGYGSE-------NGKDYWIVRNSWGPDWGESGYIR--LERNV 348
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 175/318 (55%), Gaps = 33/318 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K++ K Y + EE R +++ NL R + L T G+ +F+DL EF
Sbjct: 31 WKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFADLQNKEFVA 90
Query: 112 QFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
G R A+ + LP N+ LP DWR G VT VKDQG CGSCW+FSATG
Sbjct: 91 MMTGF-RVNGTSKAAKGSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATG 149
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+LEG HF TG+LVSLSEQ LVDC + + GCNGGLM+ AF+YI+ AGG++
Sbjct: 150 SLEGQHFKKTGKLVSLSEQNLVDCSDK---------NYGCNGGLMDRAFQYIIDAGGIDT 200
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINA--VWM 285
E+ YPY D G+C F + + A V+ ++ ++S ++ V H GP++V I+A
Sbjct: 201 EESYPYIAMD-GNCHFKTANVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFSF 259
Query: 286 QTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y GV + P LDHGVL VGYG++ YWI+KNSW E WG NGY
Sbjct: 260 QLYQSGVYNEPGCSSTLLDHGVLAVGYGTT------IDGTDYWIVKNSWAETWGMNGY-- 311
Query: 345 ICMGR---NVCGVDSMVS 359
I M R N CG+ + S
Sbjct: 312 IWMSRNKDNQCGIATQAS 329
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 136/344 (39%), Positives = 178/344 (51%), Gaps = 32/344 (9%)
Query: 29 AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRR 87
A++ +V + + +L E + FKS KTY + E RF++F N L AK
Sbjct: 5 ALLCAIVAAATAATSQEILRTE--WEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHN 62
Query: 88 QLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFD 142
V G+ +F+DL P EF + G + + P ND LP D
Sbjct: 63 VKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQGKRLAGRGSTYLPPANLNDSSLPKTVD 122
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR GAVT VKDQG CGSCW+FS+TG+LEG HFL TG+LVSLSEQ LVDC S
Sbjct: 123 WRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDC-------SSA 175
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISS 261
+ GCNGGLM+++F YI GG++ E YPY D G C++ K + A + F +
Sbjct: 176 YGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEAED-GDCRYKKEDVGATDTGFVDIKEG 234
Query: 262 DEDQMAANLVKHGPLAVGINAVW--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAP 318
E + + GP++V I+A Q Y GV P + LDHGVL VGYG
Sbjct: 235 SEKDLQKAVATVGPVSVAIDASQQSFQLYSEGVYDEPNCSSESLDHGVLAVGYGVK---- 290
Query: 319 IRFKEKPYWIIKNSWGENWGENGYYKICMGR---NVCGVDSMVS 359
K YW++KNSW E WG++GY I M R N CG+ S S
Sbjct: 291 ---NGKKYWLVKNSWAETWGQDGY--ILMSRDKNNQCGIASSAS 329
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 139/362 (38%), Positives = 196/362 (54%), Gaps = 37/362 (10%)
Query: 11 LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEH 70
L L ++ L+ +VA + D +++ P D E S D L+ F + S F K Y T EE
Sbjct: 14 LALSAATLSLSVAASHDYSIV-GYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEK 68
Query: 71 DYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
RF VFK NL+ + G+ +F+DL+ EF++ +LGL + + +
Sbjct: 69 LLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYA 128
Query: 131 ILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
D+ P DWR GAV VK+QG+CGSCW+FS A+EG + + TG L +LSEQ
Sbjct: 129 EFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQ 188
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--D 245
+L+DCD + ++GCNGGLM+ AFEYI+K GG+ +E+DYPY+ + G+C+ D
Sbjct: 189 ELIDCDT--------TYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKD 239
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGGVSCPYICGKYLD 303
+S+ + V ++DE + L H PL+V I+A Q Y G CG LD
Sbjct: 240 ESETVTIDGHQDVPTNDEKSLLKALA-HQPLSVAIDASGREFQFYSGVSVFDGRCGVDLD 298
Query: 304 HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGVDSM 357
HGV VGYGSS K Y I+KNSWG WGE GY I + RN +CG++ M
Sbjct: 299 HGVAAVGYGSS-------KGSDYIIVKNSWGPKWGEKGY--IRLKRNTGKPEGLCGINKM 349
Query: 358 VS 359
S
Sbjct: 350 AS 351
>gi|332326591|gb|AEE42619.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 168/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYWRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VKBQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKBQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E ++ LV LSEQQLV CD + DSGC GGLM AFE++L+ G
Sbjct: 156 VGNIESQWAVAXHGLVRLSEQQLVSCDDK---------DSGCGGGLMTQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + +I S E MAA L K GP+++ ++
Sbjct: 207 TMFTEDSYPYVSSTGDVPECTNSSELVPGARIDGYVMIESXETVMAAWLAKSGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC GK L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASPFMSYESGVLTSC---VGKXLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVTMGVNAC 329
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 128/324 (39%), Positives = 176/324 (54%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D NA+ H +KS + Y T EE ++R V++ N+R + HG T
Sbjct: 22 DQTFNAQWH--QWKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H+ + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG+
Sbjct: 249 MDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDK---YWLVKNSWGKE 305
Query: 337 WGENGYYKICMGRNV-CGVDSMVS 359
WG +GY KI RN CG+ + S
Sbjct: 306 WGMDGYIKIAKDRNNHCGLATAAS 329
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 170/317 (53%), Gaps = 26/317 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E + ++K +K Y+ + E + R+ ++K N+ R + + F D+T +EF
Sbjct: 24 ESSWYVWKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEF 83
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
R + GL L + + P DWR G VT VK+QG CGSCW+FS+TGA
Sbjct: 84 RAKMNGL--LLHKHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGA 141
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LEG HF TG LVSLSEQ LVDC + ++GCNGGLM++AF YI GG++ E
Sbjct: 142 LEGQHFKKTGRLVSLSEQNLVDCSTDYG-------NNGCNGGLMDNAFSYIKANGGIDTE 194
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVWM--Q 286
YPY G D G+C++ KS I A + F + DED + + GP++V I+A M Q
Sbjct: 195 TGYPYEGQD-GTCRYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQ 253
Query: 287 TYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Y GV P LDHGVL+VGYG+ K YW++KNSWG WG GY I
Sbjct: 254 FYHSGVYDEPQCSPSALDHGVLVVGYGTD-------NGKDYWLVKNSWGTGWGTEGY--I 304
Query: 346 CMGR---NVCGVDSMVS 359
M R N CG+ S S
Sbjct: 305 YMSRNNQNQCGIASKAS 321
>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 135/319 (42%), Positives = 175/319 (54%), Gaps = 32/319 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKAN---LRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
+K +KTYAT E R R+F N +R R L T + F+DLT EF
Sbjct: 33 WKVANNKTYATLREEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEEFAE 92
Query: 112 QFLGLNRRLR--LPADAQKAPI-LPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++L L + + D + PT L P DWR G VT +KDQG CGSCW+FSAT
Sbjct: 93 KYLTLKQTPMEGIWQDMSTQYVERPTRMLVPDSIDWRKKGLVTPIKDQGDCGSCWAFSAT 152
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GALEG TG+L+SLSEQQLVDC + + + GCNGG MN AF Y ++ G E
Sbjct: 153 GALEGQLKRKTGKLISLSEQQLVDC-------STYTGNEGCNGGDMNDAFRYWMR-NGAE 204
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAV--W 284
E DYPYT D G CKF+ SK+ VS F V EDQ+ ++ + GP++V I+A
Sbjct: 205 SESDYPYTAMD-GKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSVAIDATSSG 263
Query: 285 MQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+ C +YLDH VL+VGY + + YWI+KNSWGE+WG+ GY
Sbjct: 264 FMLYKKGIYQDNTCSQQYLDHAVLVVGYDADK------TRQKYWIVKNSWGEDWGQRGY- 316
Query: 344 KICMGR---NVCGVDSMVS 359
I M R N+CG+ +M S
Sbjct: 317 -IWMARDKGNMCGIATMAS 334
>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 180/326 (55%), Gaps = 28/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F F+K Y+++ E +RF++F+ NL + L D +A + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGLS----LPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + +L++LSEQQL+DCD D GC+GGL+++A+E +
Sbjct: 137 CWAFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVG 279
+ GG++ E DYPY + G C+ + +K V + I+ E+++ L GP+ V
Sbjct: 188 MNMGGIQAENDYPYEANN-GDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVA 246
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y G+ Y L+H VL+VGY P+WI+KN+WG +WGE
Sbjct: 247 IDASDIVNYKRGI-MKYCANHGLNHAVLLVGYAVQNGV-------PFWILKNTWGADWGE 298
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
GY+++ N CG+ + + S A I+
Sbjct: 299 QGYFRVQQNINACGIQNELPSSAEIY 324
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 168/321 (52%), Gaps = 31/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNGYHGS-RKSGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 TTGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAVGINAVW 284
++ EK YPY D G C+F K + A + + I + ED + + GP++V I+A
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGCEDDLKKAVATVGPISVAIDASH 256
Query: 285 --MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV P + LDHGVL+VGYG G K YW++KNSW E+WG+ G
Sbjct: 257 SSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQG 309
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ S S
Sbjct: 310 Y--ILMSRDNNNQCGIASQAS 328
>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
Length = 334
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 128/324 (39%), Positives = 176/324 (54%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D NA+ H +KS + Y T EE ++R V++ N+R + HG T
Sbjct: 22 DQTFNAQWH--QWKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H+ + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEYAVANDTGFVDIPQQEKALMKPVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG+
Sbjct: 249 MDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDK---YWLVKNSWGKE 305
Query: 337 WGENGYYKICMGRNV-CGVDSMVS 359
WG +GY KI RN CG+ + S
Sbjct: 306 WGMDGYIKIAKDRNNHCGLATAAS 329
>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
Length = 327
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 116/328 (35%), Positives = 178/328 (54%), Gaps = 30/328 (9%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
+L ++E F F K++K+Y+++EE +F FK N+R + L +AV+ + +SD+
Sbjct: 17 NLNDSEKLFEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINEKNSLSNSAVYDINFYSDM 76
Query: 105 TPSEFRRQFLGLNRRLR---------LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
+E R+ G L+ + + + P LP FDWRD +T VK+Q
Sbjct: 77 NKNELLRKQTGFKINLKKNNLDLSWNIKCNKKLINGNPAVLLPDSFDWRDRHVITSVKNQ 136
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
CGSCW+FS +E + + +L+ LSEQQLV+CD + ++GCNGGLM+
Sbjct: 137 RDCGSCWAFSTIANIESLYAIKYNKLLDLSEQQLVNCDEQ---------NNGCNGGLMHW 187
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
A E I++ GGV E D+PYT +D G CK + + N I S+ED++ L+ +GP
Sbjct: 188 AMEEIIRQGGVSNETDFPYTASD-GFCKRKQGFVNINGCN-QFILSNEDRLRELLIFNGP 245
Query: 276 LAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
+++ I+ + + Y G+S L+H VL+VGYG PYWI+KNSWG
Sbjct: 246 ISIAIDVIDVIDYSQGISSTCRNDNGLNHAVLLVGYGVKN-------NIPYWILKNSWGS 298
Query: 336 NWGENGYYKICMGRNVCGVDSMVSSVAA 363
WGENGY+++ N CG M++ AA
Sbjct: 299 QWGENGYFRVQRNINSCG---MINDYAA 323
>gi|260819200|ref|XP_002604925.1| hypothetical protein BRAFLDRAFT_77225 [Branchiostoma floridae]
gi|229290254|gb|EEN60935.1| hypothetical protein BRAFLDRAFT_77225 [Branchiostoma floridae]
Length = 520
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 127/345 (36%), Positives = 184/345 (53%), Gaps = 38/345 (11%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
H S +K + ++ Y T +E RF F+ NL + ++ +F+D++ EFR
Sbjct: 173 HFASQWKHEHNRRYKTADEEKARFATFQDNLLKIEKLNAEYSGTEFATNQFADMSEEEFR 232
Query: 111 RQFLGLNRRLRLPADAQKAPIL-PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+ L R + NDLP ++W DHGAVT +KDQG+ GSCW+FS
Sbjct: 233 SKILMRPRPPPQHPRERYLRDYGEVNDLPEAYNWVDHGAVTPIKDQGSAGSCWAFSTIEN 292
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LEG FL+ L +LS +Q+VDCD DP ++G+ D G GG AF+YI + GG+E+E
Sbjct: 293 LEGQWFLTKHPLTNLSVEQVVDCDDNTDP-KTGNADCGVFGGWPYLAFQYIKRVGGIEKE 351
Query: 230 KDYPYTGTDGG-----------------------------SCKF--DKSKIAAA--VSNF 256
+DYPY GG SC F DKSK V+++
Sbjct: 352 EDYPYCSGLGGEKGTCFPCPAPAYNTSMCGPAVSYCNETESCGFRLDKSKFIPGLQVTDW 411
Query: 257 SVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG-KYLDHGVLIVGYGSSG 315
+ I ++E +A L+K GPL+V +NAV +Q Y GV P+ C K LDH VL+ G+G
Sbjct: 412 AAIDTNETTIAVQLMKIGPLSVALNAVLLQFYHRGVFEPHFCDPKSLDHAVLLTGWGVE- 470
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
I ++KPYWI+KNSWG+ WG +GY+ I G CG+++ V++
Sbjct: 471 -KTIFGEKKPYWIVKNSWGKKWGMDGYFYIKRGVGQCGINTQVAT 514
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 67/137 (48%), Gaps = 34/137 (24%)
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG--------------------- 240
G+ D G GG AF+YI + GG+E+E+DYPY GG
Sbjct: 20 GNADCGVFGGWPYLAFQYIKRVGGIEKEEDYPYCSGLGGEKGTCFPCPAPAYNASMCGPA 79
Query: 241 --------SCKF--DKSKIAAA--VSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTY 288
SC F DKSK V++++ I ++E +A L+K GPL+V +NAV +Q Y
Sbjct: 80 VSYCNETESCGFRLDKSKFIPGLQVTDWAAIDTNETTIAVQLMKIGPLSVALNAVLLQFY 139
Query: 289 IGGVSCPYICG-KYLDH 304
GV P+ C K LDH
Sbjct: 140 HRGVFEPHFCDPKSLDH 156
>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
Length = 356
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 120/329 (36%), Positives = 184/329 (55%), Gaps = 30/329 (9%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQLLD-PTAVHGVTKF 101
+L A +F F ++K Y + E + R+ +FK NL AK D PTA + + KF
Sbjct: 48 NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACG 159
SDL+ SE +F GL+ R+ ++ K IL P + P FDWR+ VT +K+QGACG
Sbjct: 108 SDLSKSELIAKFTGLSIPERV-SNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACG 166
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
+CW+F+ ++E + L+ LSEQQL+DCD S D GCNGGL+++AFE
Sbjct: 167 ACWAFATLASVESQFAMRHNRLIDLSEQQLIDCD---------SVDMGCNGGLLHTAFEE 217
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I++ GGV+ E DYP+ G + C D+ + + + V + + +E+++ L GP+
Sbjct: 218 IMRMGGVQTELDYPFVGRN-RRCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIP 276
Query: 278 VGINAVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
+ I+A + Y GV SC L+H VL+VGYG PYW+ KN+WG+
Sbjct: 277 MAIDAADIVNYYRGVISSCE---NNGLNHAVLLVGYGVENGV-------PYWVFKNTWGD 326
Query: 336 NWGENGYYKICMGRNVCGVDSMVSSVAAI 364
+WGENGY+++ N CG+ + ++S A +
Sbjct: 327 DWGENGYFRVRQNVNACGMVNDLASTAVL 355
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 131/325 (40%), Positives = 184/325 (56%), Gaps = 32/325 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + K KTY ++EE R ++FK N + L+ + T + F+DLT EF+
Sbjct: 32 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91
Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LGL+ A K L + +P DWR GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 92 SRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 151
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TG+L+SLSEQ+L+DCD S ++GCNGGLM+ AFE+++K G++ EK
Sbjct: 152 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWMQT 287
DYPY D G+CK DK K + +++ + S++++ V P++VGI + Q
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262
Query: 288 YIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y G+ S P C LDH VLIVGYGS YWI+KNSWG++WG +G+
Sbjct: 263 YSSGIFSGP--CSTSLDHAVLIVGYGSQNGV-------DYWIVKNSWGKSWGMDGFMH-- 311
Query: 347 MGRN------VCGVDSMVSSVAAIH 365
M RN VCG++ + S H
Sbjct: 312 MQRNTENSDGVCGINMLASYPIKTH 336
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 124/305 (40%), Positives = 174/305 (57%), Gaps = 23/305 (7%)
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
SK+Y + EE +R+ V++ N + + + T+ + KF DLT +EF + F GL
Sbjct: 38 SKSY-SNEEFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKFGDLTNAEFNKLFKGLAFDY 96
Query: 121 RLPADAQKA-PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
A+ A +P L DFDWR GAVT VK+QG CGSCWSFS TG+ EGA+FL TG
Sbjct: 97 SFHANKAAAEKAVPAPGLSADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTG 156
Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
L SLSEQ L+DC SGS ++GCNGGLM+ AFEYI+ G++ E YPY T
Sbjct: 157 RLTSLSEQNLIDC--------SGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPYQ-TA 207
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPY 296
+C+++ + ++++++ +SS ++ N V P +V I+A Q Y GGV
Sbjct: 208 QYTCQYNPANSGGSLTSYTDVSSGDENALLNAVATEPTSVAIDASHNSFQFYSGGVYYES 267
Query: 297 ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGV 354
C LDHGVL VG+G+ + YW++KNSWG +WG GY K+ R N CG+
Sbjct: 268 ACSSTQLDHGVLAVGWGTE-------DGQDYWLVKNSWGADWGLAGYIKMARNRSNNCGI 320
Query: 355 DSMVS 359
+ S
Sbjct: 321 ATSAS 325
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 133/322 (41%), Positives = 169/322 (52%), Gaps = 32/322 (9%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK KF + Y EE YR VF NL+ K+ + + T + +F
Sbjct: 13 LAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQF 72
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP--TDFDWRDHGAVTGVKDQGACG 159
SDLT EF G LR P A T+ P T+ DWR G VT VKDQG CG
Sbjct: 73 SDLTNDEFNSMMKGYKTSLR-PKPV--AVFTSTDAAPETTEVDWRTKGCVTHVKDQGQCG 129
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC--DSGCNGGLMNSAF 217
SCW+FSATG+LEG HFL GELVSL+EQQLVDC +G + GCNGG +N AF
Sbjct: 130 SCWAFSATGSLEGQHFLKYGELVSLAEQQLVDC--------AGGIYYNQGCNGGWVNQAF 181
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
+YI GG++ E YPY D +C+F+ + +AA S F S+ E GP+
Sbjct: 182 KYIKANGGIDTESSYPYEARD-NTCRFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPI 240
Query: 277 AVGINAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
+V I+A Q+Y GV P LDH VL VGYGS G + +W++KNSW
Sbjct: 241 SVAIDAAHRSFQSYSSGVYYEPSCSSSQLDHAVLAVGYGSEG-------GQDFWLVKNSW 293
Query: 334 GENWGENGYYKICMGR-NVCGV 354
G +WG GY + R N CG+
Sbjct: 294 GTSWGSAGYINMARNRNNNCGI 315
>gi|351710879|gb|EHB13798.1| Cathepsin F [Heterocephalus glaber]
Length = 482
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 131/313 (41%), Positives = 179/313 (57%), Gaps = 24/313 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F + +++TY +++E +R VF N+ A+R Q LD TA +GVTKFSDLT EFR
Sbjct: 185 FKNFVATYNRTYESKKEAQWRLSVFTRNMVLAQRIQALDHGTAQYGVTKFSDLTEEEFRT 244
Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR P + P ++DWR GAVT VK+QG CGSCW+FS TG +
Sbjct: 245 IYL--NPLLREEPGKKMHLAKAVRDPAPLEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 302
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GG ++A+ I GG+E E
Sbjct: 303 EGQWFLNRGTLLSLSEQELLDCD---------KMDKACMGGFPSNAYLAIKSLGGLETED 353
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIG 290
DY Y G +C F K +++ +S +E ++AA L GP++V INA MQ Y
Sbjct: 354 DYSYQG-HMKACNFSAKKAKVYINDSVELSKNEQKLAAWLAVKGPISVAINAFGMQFYRH 412
Query: 291 GVSCPY--ICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
G++ P +C ++DH +L+VGYG+ P+W IKNSWG +WGE GYY +
Sbjct: 413 GIAHPLRPLCSPWFIDHAMLVVGYGNR-------SNVPFWAIKNSWGTDWGEEGYYYLHR 465
Query: 348 GRNVCGVDSMVSS 360
G CGV+ M SS
Sbjct: 466 GSGACGVNIMASS 478
>gi|394331820|gb|AFN27129.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 167/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G++E L+ L +LSEQQLV CD + D+GC GGLM AFE++L+ G
Sbjct: 156 VGSIESQWALAGHRLTALSEQQLVSCDDK---------DNGCRGGLMLQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K+GP+++ ++
Sbjct: 207 TMFTEDSYPYVSSTGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+V Y +G E PYW+IKNSWGENWGE
Sbjct: 267 ASSFMSYQSGVLTSC---AGMPLNHGVLLVWYNRTG-------EVPYWVIKNSWGENWGE 316
Query: 340 NGYYKICMGRNVC 352
NGY ++ MG N C
Sbjct: 317 NGYVRVTMGVNAC 329
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 123/332 (37%), Positives = 182/332 (54%), Gaps = 30/332 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
++ F + +F K Y E RF +FK+N+ + V G+ +DLT E+
Sbjct: 178 KNEFENWIDRFEKKYDVSE-FKKRFSIFKSNMDFVHSWNSKNSQTVLGLNHLADLTNLEY 236
Query: 110 RRQFLGLNRR--LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
R+ +LG +++ L P + + + + DWR GAV+ +KDQG CGSCWSFS T
Sbjct: 237 RQFYLGTHKKAVLGTPGNHEVSNLQSVFGDSATVDWRQKGAVSPIKDQGQCGSCWSFSTT 296
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G++EGAH + +G +V LSEQ LVDC + + GCNGGLM+ AFEYI+ G++
Sbjct: 297 GSVEGAHQIKSGNMVELSEQNLVDC-------STSEGNMGCNGGLMDYAFEYIITNNGID 349
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINAVW-- 284
E YPYT + G +CK++K+ A +S++ I++ + A+ VK+ GP++V I+A
Sbjct: 350 TESSYPYTASSGTTCKYNKANSGATISSYKNITAGSESDLADAVKNAGPVSVAIDASHNS 409
Query: 285 MQTYIGGVSCPYICGKY-LDHGVLIVGYGSS---------GFAPIRFK------EKPYWI 328
Q Y G+ C LDHGVL+VGYGS + +R K K YWI
Sbjct: 410 FQLYSHGIYYDASCSSVNLDHGVLVVGYGSGTPDSDSRVHKGSQVRVKVPKTDDTKNYWI 469
Query: 329 IKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
+KNSWG +WG+ G+ + R N CG+ S S
Sbjct: 470 VKNSWGTSWGDKGFIYMSKDRDNNCGIASCAS 501
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 118/294 (40%), Positives = 170/294 (57%), Gaps = 25/294 (8%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K Y E D RF++FK NLR ++ + T G+ +F+DLT E+R ++LG
Sbjct: 46 KHGKLYNALGEKDKRFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYRARYLGTKI 105
Query: 117 --NRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
NRRL R P++ + T LP DWR GAV VKDQ +CGSCW+FSA GA+EG
Sbjct: 106 DPNRRLGRTPSNRYAPRVGET--LPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGI 163
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TG+L+SLSEQ+LVDCD + GCNGGLM+ AFE+I+K GG++ E+DYP
Sbjct: 164 NKIVTGDLISLSEQELVDCDT--------GYNMGCNGGLMDYAFEFIIKNGGIDSEEDYP 215
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIGG 291
Y G DG ++ K+ ++ + +++ ++ V + P++V + Q Y G
Sbjct: 216 YKGVDGRCDEYRKNAKVVSIDGYEDVNTYDELALKKAVANQPVSVAVEGGGREFQLYSSG 275
Query: 292 VSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
V CG LDHGV+ VGYG+ +WI++NSWG +WGE GY ++
Sbjct: 276 VFTGR-CGTALDHGVVAVGYGTD-------NGHDFWIVRNSWGADWGEEGYIRL 321
>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 357
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 133/341 (39%), Positives = 182/341 (53%), Gaps = 35/341 (10%)
Query: 26 DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
D+ IR V SDG E+S +L H F+ F ++ K Y EE RF +FK
Sbjct: 27 DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
NL + + GV +F+DLT EF+R LG + A + + + LP
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DWR+ G V+ VKDQG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
+ GCNGGL + AFEYI GG++ EK YPYTG D +CKF + V N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254
Query: 256 FSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVSCPYICGKY---LDHGVLIVGY 311
++ + DE + A LV+ P+++ + + Y GV CG ++H VL VGY
Sbjct: 255 ITLGAEDELKHAVGLVR--PVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312
Query: 312 GSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
G PYW+IKNSWG +WG+ GY+K+ MG+N+C
Sbjct: 313 GVEDGV-------PYWLIKNSWGADWGDKGYFKMEMGKNMC 346
>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
Length = 335
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 177/319 (55%), Gaps = 22/319 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++H++L+K+ K+YA +EE +R +++ NLR + L H G+ +F D+T
Sbjct: 26 DNHWNLWKNWHKKSYAPKEE-GWRRVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + ++ AP + P DWR G VT VKDQG CGSCW+FS
Sbjct: 85 NEEFRQLMNGYKNQKKIRGSTFLAP--NNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG H+ +TG+++SLSEQ LVDC + GCNGGLM+ AF+Y+ GG
Sbjct: 143 TTGALEGQHYRNTGKMISLSEQNLVDC-------SRAQGNQGCNGGLMDQAFQYVKDNGG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINA-- 282
++ E YPYT D C +D + +A + F ++S+ ++ N V GP++V ++A
Sbjct: 196 IDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSESEKDLMNAVASVGPVSVAVDAGH 255
Query: 283 VWMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ P + LDHGVL+VGYG G K YWI+KNSW E WG +G
Sbjct: 256 QSFQFYKSGIYYEPECSSEDLDHGVLVVGYGFEGEDE---DGKKYWIVKNSWSEKWGNDG 312
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y I R N CG+ + S
Sbjct: 313 YIYIAKDRHNHCGIATAAS 331
>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
Length = 367
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 124/328 (37%), Positives = 180/328 (54%), Gaps = 38/328 (11%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDP 92
+L +E +F F +++K+Y +E+ YR+ VFK NL + ++ R+ L
Sbjct: 49 NLDQSEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLST 108
Query: 93 TAVHGVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGA 148
+A GV KFSD TP E FL L++ L + + P LP +DWRD
Sbjct: 109 SAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPDIRLPDYYDWRDTNK 167
Query: 149 VTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGC 208
VT +KDQG CGSCW+F A G +E + + +L+ LSEQQL+DCD D GC
Sbjct: 168 VTPIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGC 218
Query: 209 NGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMA 267
NGGLM+ AF+ +L GGVE E DYPY G++ C D KIA +++ F DE+++
Sbjct: 219 NGGLMHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLK 277
Query: 268 ANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPY 326
+ GP+A+ ++A+ + Y G+ C Y L+H VL++G+G PY
Sbjct: 278 ELVYTTGPVAIAVDAMDIINYRRGILNQ--CHIYDLNHAVLLIGWGIEN-------NVPY 328
Query: 327 WIIKNSWGENWGENGYYKICMGRNVCGV 354
WIIKNSWGE+WGENG+ ++ N CG+
Sbjct: 329 WIIKNSWGEDWGENGFLRVRRNVNACGL 356
>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
Length = 364
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 202/365 (55%), Gaps = 30/365 (8%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+I++ LL LL ++++A+ +D + P+ ++ +A +F F S+++K
Sbjct: 25 IIMNKSLLFLL--LVSTALTRQNDAVHTPTIKPT-----LYNINSAPLYFEKFISQYNKH 77
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
Y ++E YR+ +F+ N+ + + +AV+ + +F+D+T +E + GL L
Sbjct: 78 YKNEDEKKYRYNIFRHNIESINHKNSRNDSAVYKINRFADMTKNEVVIRHTGLASG-ELG 136
Query: 124 ADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ + ++ PT FDWR VT VKDQG CG+CW+F+ GALE + +
Sbjct: 137 VNFCETIVVDGPGQRQRPTSFDWRTLNKVTSVKDQGMCGACWAFAGLGALESQYAIKYDR 196
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
L+ LSEQQLVDCDH D GC+GGL+++A+E I++ GGVE++ DYPY +
Sbjct: 197 LIDLSEQQLVDCDH---------VDMGCDGGLIHTAYEEIMRMGGVEQDFDYPYRA-ERQ 246
Query: 241 SCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG 299
C K AA V S + + +E+++ L GP+A+ ++AV + Y GG+ +
Sbjct: 247 PCALKPHKFAAGVRSCYRYVLLNEERLEDLLRHVGPIAIAVDAVDITDYYGGI-VSFCEN 305
Query: 300 KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
L+H VL+VGYG PYWI+KNSWG ++GE+GY ++ G N CG+ + ++
Sbjct: 306 NGLNHAVLLVGYGVE-------NNVPYWILKNSWGSDYGEDGYVRVRRGVNSCGMINELA 358
Query: 360 SVAAI 364
S A +
Sbjct: 359 SSAQV 363
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 139/370 (37%), Positives = 192/370 (51%), Gaps = 49/370 (13%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+ + L+L L +++A A AV+ + + Q E H EH K Y
Sbjct: 1 MRTALILPLLALVAVAQAVSYAEVI----------QEEWHTFKLEHR---------KNYQ 41
Query: 66 TQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLN---- 117
+ E +R ++F N + AK QL AV V K++D+ EF G N
Sbjct: 42 DETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLH 101
Query: 118 RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
++LR ++ K + + LP DWR GAVT VKDQG CGSCW+FS+TGALEG H
Sbjct: 102 KQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQH 161
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ +G LVSLSEQ LVDC + ++GCNGGLM++AF YI GG++ EK YPY
Sbjct: 162 YRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 214
Query: 235 TGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGG 291
D SC F+K I A F + +E +MA + GP+AV I+A Q Y G
Sbjct: 215 EAID-DSCHFNKGTIGATDRGFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEG 273
Query: 292 VSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR- 349
V C + LDHGVL+VG+G+ + YW++KNSWG WG+ G+ K+ +
Sbjct: 274 VYNEPACDAQNLDHGVLVVGFGTDESG------QDYWLVKNSWGTTWGDKGFIKMLRNKE 327
Query: 350 NVCGVDSMVS 359
N CG+ S S
Sbjct: 328 NQCGIASASS 337
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/327 (39%), Positives = 185/327 (56%), Gaps = 34/327 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + + KTY ++EE R ++FK N + L+ + T + F+DLT EF+
Sbjct: 32 FDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91
Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LGL+ A K L N +P DWR GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 92 SRLGLSVSASSLIMASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 151
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TG+L+SLSEQ+L+DCD S ++GCNGGLM+ AFE+++K G++ EK
Sbjct: 152 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGI----NAVWM 285
DYPY D G+CK DK K + +++ + S++++ V P++VGI A +
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQL 262
Query: 286 QTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
+ + G+ S P C LDH VLIVGYGS YWI+KNSWG++WG +G+
Sbjct: 263 YSRVSGIFSGP--CSTSLDHAVLIVGYGSQNGV-------DYWIVKNSWGKSWGMDGFMH 313
Query: 345 ICMGRN------VCGVDSMVSSVAAIH 365
M RN +CG++ + S H
Sbjct: 314 --MQRNTGNSEGICGINMLASYPIKTH 338
>gi|313224805|emb|CBY20597.1| unnamed protein product [Oikopleura dioica]
Length = 343
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 173/313 (55%), Gaps = 21/313 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F ++ +FSK Y T EE R + F N Q D T G+ +DLT SEF+
Sbjct: 42 FRQYEVEFSKMYETAEERRIRAQTFSKNFEMITSHNQREDVTWTMGLNFDADLTFSEFQS 101
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
++L +++ A + + + LP +FDWR+HG V+ VK+QG CGSCW+FS TG LE
Sbjct: 102 RYLMVSQDC--SATSTRDLDIDILSLPENFDWREHGGVSPVKNQGHCGSCWTFSTTGCLE 159
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
AH + + +LSEQQLVDC + D + GCNGGL + AFEYI GG+E E+D
Sbjct: 160 SAHLIHHKKAYNLSEQQLVDCAQDFD-------NHGCNGGLPSHAFEYIHYVGGLEEEQD 212
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAV-WMQTYI 289
Y Y + G C+FD +K A V F++ +DEDQ+ L P++V V + Y
Sbjct: 213 YSYHAEE-GLCEFDPTKTAGTVREVFNITETDEDQLTIALAYFNPVSVAFEVVDGFRFYK 271
Query: 290 GGVSCPYICG---KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
GV C + ++H VL VGYG + E PY+I+KNSWG WG+ G++KI
Sbjct: 272 EGVYQSDTCKSGPEDVNHAVLAVGYGM-----CKKCETPYFIVKNSWGAEWGDEGFFKIK 326
Query: 347 MGRNVCGVDSMVS 359
G N+CG+ + S
Sbjct: 327 RGENMCGIATCAS 339
>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
Length = 324
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 115/326 (35%), Positives = 179/326 (54%), Gaps = 28/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F F+K Y+++ E +RF++F+ NL + L D +A + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGLS----LPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + +L++LSEQQL+DCD D GC+GGL+++A+E +
Sbjct: 137 CWAFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVG 279
+ GG++ E DYPY + G C+ + +K V + + E+++ L GPL V
Sbjct: 188 MNMGGIQAENDYPYEANN-GDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPLPVA 246
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y GV Y L+H VL+VGY P+WI+KN+WG +WGE
Sbjct: 247 IDASDIVNYKRGV-IRYCANHGLNHAVLLVGYAVENGV-------PFWILKNTWGTDWGE 298
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
GY+++ N CG+ + + S A I+
Sbjct: 299 QGYFRVQQNINACGIQNELPSSAEIY 324
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/297 (39%), Positives = 165/297 (55%), Gaps = 22/297 (7%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN- 117
K K Y E + RF +FK NL + + T G+ +F+DLT EFR +LG
Sbjct: 57 KHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRT 116
Query: 118 -RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
+ RLP + + + LP DWR GAV VKDQG CGSCW+FS A+EG + +
Sbjct: 117 GHKKRLPKTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKI 176
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+L++LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E DYPY G
Sbjct: 177 VTGDLIALSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLG 228
Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSC 294
DG + K+ ++ ++ + +++ V + P++V I Q Y GV
Sbjct: 229 RDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFT 288
Query: 295 PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG LDHGV VGYG+ K K YWI++NSWG++WGE+GY + M RN+
Sbjct: 289 GE-CGTSLDHGVAAVGYGTE-------KGKDYWIVRNSWGKSWGESGYIR--MERNI 335
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 174/317 (54%), Gaps = 26/317 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F+ +K+ ++ YA+ +E R ++ +NL + G+ +F DL EF
Sbjct: 21 FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80
Query: 112 QFLGLN-RRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
++LG+ + + LP LP DWR G VT VK+QG CGSCWSFS TG+
Sbjct: 81 KYLGVRFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG H TG LVSLSEQ LVDC + E GCNGGLM+ AFEYI+K GG++ E
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNE-------GCNGGLMDDAFEYIIKNGGIDTE 193
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINA--VWMQ 286
YPYT T G+CKF+ + I A V+++ +I+ E + + GP++V I+A + Q
Sbjct: 194 ASYPYTATT-GTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQ 252
Query: 287 TYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Y GV C LDHGVL VGYG+S + K YW++KNSWG WG+ GY I
Sbjct: 253 FYFTGVYNEKKCSTTQLDHGVLAVGYGTST------EGKDYWLVKNSWGATWGKAGY--I 304
Query: 346 CMGRNV---CGVDSMVS 359
M RN CG+ + S
Sbjct: 305 WMSRNADNQCGIATSAS 321
>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 347
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 134/362 (37%), Positives = 192/362 (53%), Gaps = 39/362 (10%)
Query: 9 LLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+LL LL+S +++ + DD ++ + V E F + K KTYAT
Sbjct: 10 ILLFLLASFTDVSLSFDPLDDFVMSESVQRAAE------------FERWTIKHKKTYATA 57
Query: 68 EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN-RRLRLPAD 125
EE+++R RV+ AN KR + P + +F+DLT +EF+R +L + + R
Sbjct: 58 EEYNWRLRVYTANHYYVKRLNEGHGPATEFELNQFADLTFAEFKRIYLSSSSQHCRATTG 117
Query: 126 AQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
+ P+ N + P DWR +T V+DQG+CGSCW+FSAT L L TG+L+SL
Sbjct: 118 NFQMPVKKNNVEDPVAIDWRKRNVITPVRDQGSCGSCWAFSATSCLSAHLALKTGQLISL 177
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
S+QQL+DC + + GC GGL + AFEYI GG+E E+DYPY + C F
Sbjct: 178 SKQQLLDCSRSFN-------NRGCKGGLPSQAFEYIRYNGGIESERDYPYKDRE-EKCHF 229
Query: 245 DKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVSCPYICGK 300
S +AA V+ NF+ ED +A L GP+++GI++ TY G+ +C K
Sbjct: 230 KPSLVAATVTGVVNFT--QGAEDDIAVALANIGPVSIGIHSTKSFATYKKGIYQGKLCSK 287
Query: 301 ---YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSM 357
++H VLIVGY + + YWI KNSWG NWG NGY+ I G N CG+ +
Sbjct: 288 NPRKINHAVLIVGYDQTA------SGEKYWIGKNSWGTNWGMNGYFWIRRGHNACGLATC 341
Query: 358 VS 359
S
Sbjct: 342 AS 343
>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
Length = 325
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/325 (36%), Positives = 180/325 (55%), Gaps = 26/325 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F + ++K Y +E YR+++FK NL + ++ AV + KFSD++
Sbjct: 20 LLKAPDYFESFVANYNKMYNDTQEKAYRYKIFKHNLEEINIKNQVEDHAVFSINKFSDMS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
SE ++ GL+ + + +A IL P N P +FDWR + AVT V+ QG CGSCW+
Sbjct: 80 KSEIISKYTGLSLPSLMQENFCRAIILDGPPNKAPINFDWRQYNAVTPVRVQGNCGSCWA 139
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS +E + + + +SLS QQLVDCD + + GC GGL+++A E I+ A
Sbjct: 140 FSTLAGIESQYSIKYNKQISLSVQQLVDCD---------TSNMGCAGGLLHTALEQIINA 190
Query: 224 -GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGIN 281
GGV +E+DYPY G D C + A V + I +E+++ L GP+ V I+
Sbjct: 191 GGGVLQEEDYPYKGVD-KQCNLPHNNFAVQVLGCYRYIVMNEEKLKDVLRAVGPIPVAID 249
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A + Y G+ +C Y L+H VL+VGYG PYW +KN+WG++WGE
Sbjct: 250 AASIVDYSRGIIRTCTYYG---LNHAVLLVGYGVQDGV-------PYWTLKNTWGDDWGE 299
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAI 364
+GY+++ N CG+ + ++S A I
Sbjct: 300 HGYFRVRQNVNSCGIINDLASTAVI 324
>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 181/325 (55%), Gaps = 29/325 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGI 280
K GGV+ E DYPY D +C+ + +K V + + I E+++ L GP+ + I
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE+
Sbjct: 247 DAADIVNYKQGI-IKYCFNSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGED 298
Query: 341 GYYKICMGRNVCGVDSMVSSVAAIH 365
G++++ N CG+ + ++S A I+
Sbjct: 299 GFFRVQQNINACGMRNELASTAVIY 323
>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
Length = 336
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 125/348 (35%), Positives = 187/348 (53%), Gaps = 32/348 (9%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQV-VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+++ +L ++ +A A +D + + +V P E +L F F +++K Y ++
Sbjct: 1 MIVFVLCAISFTAAAPQNDVSDVEKVRKPVFYSMDEAPIL-----FENFIREYNKKYDSK 55
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
E+ + RF++F NL+R AVHG+ KF+DL+ EF++ + G D
Sbjct: 56 EKEE-RFKIFVNNLKRINDLNHKSTNAVHGINKFTDLSKEEFKKFYTGFKPDKSFLDDNI 114
Query: 128 KAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
K P + ++ P FDWRD G VT VK+QG CGSCW+FS G +E + + G LV LS
Sbjct: 115 KKPSQLSFNITAPPAFDWRDKGVVTRVKNQGTCGSCWAFSTIGNVESVNAIKHGNLVELS 174
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCD S D C+ GL ++A +Y++ G + E+ YPY G +C +D
Sbjct: 175 EQQLVDCD---------SKDEACDSGLPDNAQQYLVSHGAIS-EQSYPYKGY-AANCTYD 223
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV---SCPYICGKYL 302
S++ +SNF + E QMA L PL++ I A + TY G+ C + L
Sbjct: 224 SSQVVVRLSNFEKVVLSECQMAEKLYSTAPLSIVIAAEVLGTYTKGILVNECEQ--SQDL 281
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN 350
+H VL+VGYG+ G +WI+KNSWG NWGE GY++I G N
Sbjct: 282 NHAVLLVGYGNEG-------GTNFWILKNSWGTNWGEGGYFRIKRGVN 322
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 139/370 (37%), Positives = 192/370 (51%), Gaps = 49/370 (13%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+ + L+L L +++A A AV+ + + Q E H EH K Y
Sbjct: 1 MRTALILPLLALVAVAQAVSYAEVI----------QEEWHTFKLEHR---------KNYQ 41
Query: 66 TQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLN---- 117
+ E +R ++F N + AK QL AV V K++D+ EF G N
Sbjct: 42 DETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLH 101
Query: 118 RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
++LR ++ K + + LP DWR GAVT VKDQG CGSCW+FS+TGALEG H
Sbjct: 102 KQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQH 161
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ +G LVSLSEQ LVDC + ++GCNGGLM++AF YI GG++ EK YPY
Sbjct: 162 YRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 214
Query: 235 TGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGG 291
D SC F+K I A F + +E +MA + GP+AV I+A Q Y G
Sbjct: 215 EAID-DSCHFNKGSIGATDRGFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEG 273
Query: 292 VSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR- 349
V C + LDHGVL+VG+G+ + YW++KNSWG WG+ G+ K+ +
Sbjct: 274 VYNEPACDAQNLDHGVLVVGFGTDESG------EDYWLVKNSWGTTWGDKGFIKMLRNKE 327
Query: 350 NVCGVDSMVS 359
N CG+ S S
Sbjct: 328 NQCGIASASS 337
>gi|118360450|ref|XP_001013459.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89295226|gb|EAR93214.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 320
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 182/315 (57%), Gaps = 39/315 (12%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTP 106
N + +S FK+ ++K YA + YR VF NL+ ++D + G+TKF DLT
Sbjct: 38 NIKTLWSTFKNSYNKKYADPDFEQYRIEVFTENLK------IIDSNCQNFGITKFMDLTQ 91
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF--DWRDHGAVTGVKDQGACGSCWSF 164
EF++ +L L + + ++ P ND D DW GAVT VKDQG CGSCWSF
Sbjct: 92 EEFKQTYLTLKTKKYI----EEIPETVFNDSNGDIEIDWTMKGAVTPVKDQGKCGSCWSF 147
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TGA+EGAHFLS+ ELVSLSEQ L+DC S + + GCNGGLM++AF++I +
Sbjct: 148 STTGAVEGAHFLSSNELVSLSEQYLIDC--------SKNGNEGCNGGLMDTAFDFIAQ-N 198
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G+ E YPY D G+CK +S++ I S D ++ ++ P+A+ ++A
Sbjct: 199 GIPTENAYPYKALD-GTCKMTTG--PYKISSYQNIISCNDLLSK--LQKQPIAIAVDANN 253
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y G+ CGK LDHGVL+VGY S K+K +W +KNSWG +WGE+GY +
Sbjct: 254 FQFYTKGIFSK--CGKNLDHGVLLVGYSS--------KDK-FWKVKNSWGSSWGEDGYIR 302
Query: 345 ICMGRNVCGVDSMVS 359
+ G N CG+ + S
Sbjct: 303 LSAG-NTCGLCNQAS 316
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 133/322 (41%), Positives = 179/322 (55%), Gaps = 32/322 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP----TAVHGVTKFSDLT 105
E ++++FK+K +KTY+ E+ R+ +++ NL++ + L T G K++D+T
Sbjct: 19 EANWAIFKAKHNKTYSGDEDIIRRY-IWQTNLQKIEAHNELYAKGLSTYFLGENKYADMT 77
Query: 106 PSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EFRR GL L P D + + LPT DWR G VT VKDQG CGSCW+F
Sbjct: 78 NEEFRRTLSGLRVDKELTPGDFVSG--MFKDSLPTAVDWRKEGYVTEVKDQGQCGSCWAF 135
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG+LEG HF +T +LVSLSE LVDC + + GCNGGLM++AF+YI
Sbjct: 136 STTGSLEGQHFKATKQLVSLSESNLVDCSKKWG-------NQGCNGGLMDNAFKYIADNK 188
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAV 283
G++ EK YPY D C F K+ + A + I+S ED + + GP++V I+A
Sbjct: 189 GIDTEKSYPYKPED-RKCNFKKANVGATDKLYKDITSGSEDALQEAVATIGPISVAIDAS 247
Query: 284 W--MQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GGV C K LDHGVL VGY S YWI+KNSWG++WG +
Sbjct: 248 HDSFQLYSGGVYNEKACSTKTLDHGVLAVGYDSK-------NGDDYWIVKNSWGKSWGID 300
Query: 341 GYYKICMGR---NVCGVDSMVS 359
GY I M R N CG+ +M S
Sbjct: 301 GY--IWMSRNKKNQCGIATMAS 320
>gi|1834307|dbj|BAA09820.1| cysteine proteinase [Spirometra erinaceieuropaei]
gi|1834309|dbj|BAA09821.1| cysteine proteinase [Spirometra erinaceieuropaei]
Length = 336
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 179/315 (56%), Gaps = 28/315 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRR-QLLDPTAVHGVTKFSDLTPSEFR 110
+K F K Y + EE +R R F NL R +R Q L+ AV + FSDLTP EF
Sbjct: 35 WKLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAVR-LNDFSDLTPGEFA 93
Query: 111 RQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
++L L + ++A +P + LP +WR+ GAVT VK+QG CGSCWSFSA GA
Sbjct: 94 ERYLCLRGIVLTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSANGA 153
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EGA + TG L SLSEQQL+DC + + GCNGGLM AF+Y + GVE E
Sbjct: 154 IEGAIQIKTGALRSLSEQQLMDCSWDYG-------NQGCNGGLMPQAFQYAQRY-GVEAE 205
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV--WMQ 286
DY YT D G C++ + + A V+ ++ + DE + + GP++VGI+A
Sbjct: 206 VDYRYTERD-GVCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFM 264
Query: 287 TYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
+Y GV C Y +DHGVL+VGYG+ YW++KNSWG +WGE+GY K+
Sbjct: 265 SYSHGVFVSKTCSPYAIDHGVLVVGYGAE-------NGDAYWLVKNSWGSSWGEDGYLKM 317
Query: 346 CMGR-NVCGVDSMVS 359
R N+CG+ SM S
Sbjct: 318 ARNRNNMCGIASMAS 332
>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
Length = 338
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 124/356 (34%), Positives = 184/356 (51%), Gaps = 32/356 (8%)
Query: 16 SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
S++ V + D +R++ G++ L A F F ++K Y E+ + RF+
Sbjct: 8 SMVHVLVLFSIDQCKVREL----GQRRLYSLEEAPTLFEQFIKDYNKEYDESEKEE-RFK 62
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
+F NL+ AV+G+ KFSDL+ EF + + GL R + K LP +
Sbjct: 63 IFVNNLKDINAMNERSSNAVYGINKFSDLSKEEFIKYYTGLKREESPSNEDHKKTDLPES 122
Query: 136 ---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDC 192
P FDWR G V+ +K+Q CGSCW+FSA +E H + TG+L+ +SEQQL+DC
Sbjct: 123 FNVTAPDQFDWRKKGVVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDC 182
Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
D DSGC+GGL A Y + A G K YPY + G C++D SK+
Sbjct: 183 D---------KYDSGCSGGLPWDALRYFV-ANGAMSLKSYPYVAKE-GKCRYDSSKVEIR 231
Query: 253 VSNFSVISS-DEDQMAANLVKHGPLAVGINAVWMQTYIGGV---SCPYICGKYLDHGVLI 308
+ + + S EDQ+ +L GPL++ I+ ++ Y+GG+ C +C ++H VL+
Sbjct: 232 LKGYKIFSKISEDQIKEHLYNIGPLSIAIDVSPIKPYVGGIVMEECHEVCQ--VNHAVLL 289
Query: 309 VGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAI 364
VGYG YWI+KNSWG NWGENGY+++ G N + S + A I
Sbjct: 290 VGYGKEYSV-------EYWIVKNSWGPNWGENGYFRMERGVNCLLLTSTGITTAVI 338
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 171/317 (53%), Gaps = 29/317 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSEFRR 111
F + + + Y EH+ RF++F N R + + + G+ +FSD T E +R
Sbjct: 69 FMTTYKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSYTMGINEFSDKTDEELKR 128
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
L D K I P++ DWR+ GAVT VK+QG CGSCW+FSATGA+E
Sbjct: 129 -LRCFRGSLNASRDGSKY-ITIAAPPPSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIE 186
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G +FL+TG LVSLSEQQLVDC E ++ CNGGLM++AF+Y+ + G++ E
Sbjct: 187 GQNFLATGNLVSLSEQQLVDCSSEYG-------NNACNGGLMDNAFKYVKDSNGIDTEAS 239
Query: 232 YPY----TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINAVW-- 284
YPY TG +C+F+ + V+ + + + V H GP++V INA
Sbjct: 240 YPYVSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYGPISVAINAGLPS 299
Query: 285 MQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GV C LDHGVL+VGYG PYW+IKNSWG +WGENGY
Sbjct: 300 FMSYKSGVYSDDQCSSDDLDHGVLLVGYGEE-------NGIPYWLIKNSWGPHWGENGYV 352
Query: 344 KICMG-RNVCGVDSMVS 359
KI N+CGV SM S
Sbjct: 353 KILRDHNNLCGVASMAS 369
>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 181/326 (55%), Gaps = 28/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
+L A ++F F KF+K+Y+++ E RF++F+ NL + D TA + + KF+DL+
Sbjct: 21 VLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHNDSTAQYEINKFADLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGLS----LPLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + + ++LSEQQL+DCD D+GC+GGL+++AFE +
Sbjct: 137 CWAFATLGSLESQFAIKHNQFINLSEQQLIDCDF---------VDAGCDGGLLHTAFEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVG 279
+ GG++ E DYPY + G C+ + +K V + I+ E+++ L GP+ V
Sbjct: 188 MNMGGIQAESDYPYEANN-GDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVA 246
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y G+ Y L+H VL+VGY P+WI+KN+WG +WGE
Sbjct: 247 IDASDIVNYKRGIM-KYCANHGLNHAVLLVGYAVENGV-------PFWILKNTWGADWGE 298
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
GY+++ N CG+ + + S A I+
Sbjct: 299 QGYFRVQQNINACGIQNELPSSAEIY 324
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 128/329 (38%), Positives = 171/329 (51%), Gaps = 22/329 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSD 103
N + + +K+ + Y T EE +R V++ N++ + HG T F D
Sbjct: 24 NLDTQWYQWKATHRRLYGTNEE-GWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ + + + P+L +LP DWR G VT VK+Q CGSCW+
Sbjct: 83 MTNEEFRQVMVCFRNQKHKNRKVFRGPLL--LNLPKSVDWRKKGYVTPVKNQKQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC H + GCNGG MN+AF+Y+ +
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSHP-------QGNQGCNGGFMNNAFQYVKEN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG++ E YPY D GSCK+ A + F VI + E ++ + GP++V ++A
Sbjct: 194 GGLDSEASYPYVAKD-GSCKYKPENSVANDTGFVVIPAHEKELMKAVATVGPISVAVDAS 252
Query: 284 W--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y G+ C K LDHGVL+VGY GF YW+IKNSWG WG N
Sbjct: 253 HSSFQFYKSGIYFEQDCSSKNLDHGVLVVGY---GFEGTNSNNNNYWLIKNSWGPEWGSN 309
Query: 341 GYYKICMGRNV-CGVDSMVSSVAAIHTTS 368
GY KI RN CG+ + S T S
Sbjct: 310 GYIKIAKDRNNHCGIATAASYPIVWKTPS 338
>gi|15128493|dbj|BAB62718.1| plerocercoid growth factor/cysteine protease [Spirometra
erinaceieuropaei]
gi|15130639|dbj|BAB62799.1| plerocercoid growth factor-2/cysteine protease [Spirometra
erinaceieuropaei]
Length = 336
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 179/315 (56%), Gaps = 28/315 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRR-QLLDPTAVHGVTKFSDLTPSEFR 110
+K F K Y + EE +R R F NL R +R Q L+ AV + FSDLTP EF
Sbjct: 35 WKLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAVR-LNDFSDLTPGEFA 93
Query: 111 RQFLGLNRRLRLPADAQKAPILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
++L L + ++A +P +LP +WR+ GAVT VK+QG CGSCWSFSA GA
Sbjct: 94 ERYLCLRGIVLTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSANGA 153
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EGA + TG L SLSEQQL+DC + + GCNGGLM AF+Y + GVE E
Sbjct: 154 IEGAIQIKTGALRSLSEQQLMDCSWDYG-------NQGCNGGLMPQAFQYAQRY-GVEAE 205
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAV--WMQ 286
DY YT D G C++ + + A V+ ++ + DE + + GP++VGI+A
Sbjct: 206 VDYRYTERD-GVCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFM 264
Query: 287 TYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
+Y GV C Y +DHGVL+VGYG+ + YW++KNSWG +WGE GY K+
Sbjct: 265 SYSHGVFVSKTCSPYAIDHGVLVVGYGAE-------NGEAYWLVKNSWGSSWGEGGYVKM 317
Query: 346 CMGR-NVCGVDSMVS 359
R N+CG+ SM S
Sbjct: 318 ARNRNNMCGIASMAS 332
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 173/318 (54%), Gaps = 31/318 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + SK K Y + EE +RF VF+ NL R + G+ +F+DL+ EF+ +
Sbjct: 404 FESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSHEEFKSK 463
Query: 113 FLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+LGL D + + DLP DWR GAVT VK+QGACGSCW+FS A+E
Sbjct: 464 YLGLRAEFPRSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWAFSTVAAVE 523
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TG L +LSEQ+L+DCD + +SGCNGGLM+ AF +I GG+ +E D
Sbjct: 524 GINQIVTGNLTTLSEQELIDCD--------TTFNSGCNGGLMDYAFAFIASNGGLHKEDD 575
Query: 232 YPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTY 288
YPY + G+C+ K + +S + + +++ + H PL+V I A Q Y
Sbjct: 576 YPYL-MEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFY 634
Query: 289 IGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GGV + P CG LDHGV VGYGSS K Y I+KNSWG WGE GY I M
Sbjct: 635 SGGVFNGP--CGTELDHGVAAVGYGSS-------KGLDYIIVKNSWGPKWGEKGY--IRM 683
Query: 348 GRN------VCGVDSMVS 359
RN +CG++ M S
Sbjct: 684 KRNTGKTEGLCGINKMAS 701
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 135/367 (36%), Positives = 197/367 (53%), Gaps = 41/367 (11%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
S L+L S L +++A D +++ S+ +S D L+ F + SK K Y
Sbjct: 5 FSKALVLACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSKHGKIYQ 59
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLR 121
+ EE RF +FK NL+ R + G+ +F+DL+ EF+ ++LGL +RR
Sbjct: 60 SIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRE 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
P + + +LP DWR GAV VK+QG+CGSCW+FS A+EG + + TG L
Sbjct: 120 SPEEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 175
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
SLSEQ+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E+DYPY + G+
Sbjct: 176 TSLSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGT 226
Query: 242 CKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYIC 298
C+ K + +S + + + +Q + + PL+V I A Q Y GGV + C
Sbjct: 227 CEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGH-C 285
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VC 352
G LDHGV VGYG++ K Y I+KNSWG WGE GY I M RN +C
Sbjct: 286 GSDLDHGVAAVGYGTA-------KGVDYIIVKNSWGSKWGEKGY--IRMRRNIGKPEGIC 336
Query: 353 GVDSMVS 359
G+ M S
Sbjct: 337 GIYKMAS 343
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/316 (38%), Positives = 182/316 (57%), Gaps = 33/316 (10%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F F++K+ K Y + E +YR +V N+ ++ + + G+T F+D+T +EF
Sbjct: 24 EKLFQTFEAKYGKNYLSSE-REYRKKVLAYNMDWIEKFNSDEHSFTLGMTPFADMTNTEF 82
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDLPTD-FDWRDHGAVTGVKDQGACGSCWSFSATG 168
L ++ P + ++A +L N++ + DWR+ GAVT VK+QG+CGSCW+FSATG
Sbjct: 83 ATS--KLCGCMKKPLNHKQARVL--NNMAVESIDWREKGAVTPVKNQGSCGSCWAFSATG 138
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
ALEG +F++TG+LVSLSEQQLVDCD E D+GC GG M++AFEY++K G+
Sbjct: 139 ALEGGNFVATGKLVSLSEQQLVDCDTE---------DAGCGGGFMDTAFEYVMKK-GLCT 188
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQ 286
E+DYPY D CK D+ +++ + + +++ + P++V I A Q
Sbjct: 189 EEDYPYHAKD-EDCKDDQCTSVISITGYEDVPANDGVALKQALTKAPVSVAIQADSFVFQ 247
Query: 287 TYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y GGV +CG L+HGVL VGY K Y I+KNSWG +WG+ GY KI
Sbjct: 248 MYTGGVLDSDMCGTSLNHGVLAVGYA-----------KEYIIVKNSWGASWGDKGYVKIA 296
Query: 347 ---MGRNVCGVDSMVS 359
G +CG++ S
Sbjct: 297 HRDQGEGICGINMAAS 312
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/363 (35%), Positives = 192/363 (52%), Gaps = 38/363 (10%)
Query: 1 MERLILSSLLLLLLSS----VLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLF 56
M + I+++LL L SS + S + ++ + + SD ED + N + ++
Sbjct: 1 MAKTIITTLLFALFSSLSYAIDMSIIDYKNNHYARKWTLQSD----EDQVKN---RYEMW 53
Query: 57 KSKFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
++ + Y E + RF +FK NLR + + T G+ +F+DLT E+R +LG
Sbjct: 54 LAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRTMYLG 113
Query: 116 LN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
R ++ +Q+ P +P DWR GAV +K+QG+CGSCW+FS A+
Sbjct: 114 TKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAV 173
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TGE+++LSEQ+LVDCD +SGCNGGLM+ AFE+I+ GG++ EK
Sbjct: 174 EGINQIVTGEMITLSEQELVDCDR--------VQNSGCNGGLMDYAFEFIISNGGMDTEK 225
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTY 288
YPY G +G K+ ++ + + +E + V H P+ V I A Q Y
Sbjct: 226 HYPYRGVEGRCDPVRKNYKVVSIDGYEDVPRNERAL-QKAVAHQPVCVAIEASGRAFQLY 284
Query: 289 IGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
GV CG+ +DHGV++VGYGS YWI++NSWG WGENGY K M
Sbjct: 285 SSGVFTGE-CGEEVDHGVVVVGYGSEDGV-------DYWIVRNSWGTKWGENGYVK--ME 334
Query: 349 RNV 351
RNV
Sbjct: 335 RNV 337
>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 181/325 (55%), Gaps = 29/325 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGI 280
K GGV+ E DYPY D +C+ + +K V + + I E+++ L GP+ + I
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE+
Sbjct: 247 DAADIVNYKQGI-IKYCFDSGLNHAVLLVGYGVEN-------NVPYWTFKNTWGTDWGED 298
Query: 341 GYYKICMGRNVCGVDSMVSSVAAIH 365
G++++ N CG+ + ++S A I+
Sbjct: 299 GFFRVQQNINACGMRNELASTAVIY 323
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/325 (40%), Positives = 184/325 (56%), Gaps = 32/325 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + K KTY ++EE R ++FK N + L+ + T + F+DLT EF+
Sbjct: 32 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91
Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LGL+ A K L + +P DWR GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 92 SRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 151
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TG+L+SLSEQ+L+DCD S ++GCNGGLM+ AFE+++K G++ EK
Sbjct: 152 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWMQT 287
DYPY D G+CK DK K + +++ + S++++ V P++VGI + Q
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262
Query: 288 YIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y G+ S P C LDH VLIVGYGS YWI+KNSWG++WG +G+
Sbjct: 263 YSRGIFSGP--CSTSLDHAVLIVGYGSQNGV-------DYWIVKNSWGKSWGMDGFMH-- 311
Query: 347 MGRN------VCGVDSMVSSVAAIH 365
M RN VCG++ + S H
Sbjct: 312 MQRNTENSDGVCGINMLASYPIKTH 336
>gi|44844204|emb|CAF32698.1| cysteine proteinase [Leishmania infantum]
Length = 443
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/313 (41%), Positives = 171/313 (54%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK GACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKXXGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E + LVSLSEQQLV CD + D+GCNGGLM AFE +L+ G
Sbjct: 156 VGNIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEXLLRHMYG 206
Query: 225 GVEREKDYPYTGTDGGSCK-FDKSKI--AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
V EK YPYT +G + + SK+ A + + +I S+E MAA L ++GP+A+ ++
Sbjct: 207 IVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYQSGVLTSCA---GDALNHGVLLVGYNKTGGV-------PYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVVMGXNAC 329
>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
Length = 340
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 182/320 (56%), Gaps = 23/320 (7%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSE 108
A +F F ++++K Y +++E YR+ +F+ N+ ++ + +AV+ + +F+D+T +E
Sbjct: 39 APLYFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNE 98
Query: 109 FRRQFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
+ GL L A+ + ++ P +FDWR VT VKDQG CG+CW+F+
Sbjct: 99 IVIRHTGLASG-ELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCGACWAFA 157
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
GALE + + L+ L+EQQLVDCD D GC+GGL+++A+E I++ GG
Sbjct: 158 GLGALESQYAIKYDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMRMGG 208
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAVW 284
VE+E DYPY + C K AA V N + + +E+++ L GP+A+ ++AV
Sbjct: 209 VEQEFDYPYK-AERQPCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVD 267
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
+ Y GG+ + L+H VL+VGYG PYWIIKNSWG ++GE+GY +
Sbjct: 268 LTDYYGGI-VSFCKNNGLNHAVLLVGYGVE-------NNVPYWIIKNSWGSDYGEDGYVR 319
Query: 345 ICMGRNVCGVDSMVSSVAAI 364
+ G N CG+ + ++S A +
Sbjct: 320 VRRGVNSCGMINELASSAQV 339
>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
Length = 339
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 182/320 (56%), Gaps = 23/320 (7%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSE 108
A +F F ++++K Y +++E YR+ +F+ N+ ++ + +AV+ + +F+D+T +E
Sbjct: 38 APLYFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNE 97
Query: 109 FRRQFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
+ GL L A+ + ++ P +FDWR VT VKDQG CG+CW+F+
Sbjct: 98 IVIRHTGLASG-ELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCGACWAFA 156
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
GALE + + L+ L+EQQLVDCD D GC+GGL+++A+E I++ GG
Sbjct: 157 GLGALESQYAIKYDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMRMGG 207
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAVW 284
VE+E DYPY + C K AA V N + + +E+++ L GP+A+ ++AV
Sbjct: 208 VEQEFDYPYK-AERQPCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVD 266
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
+ Y GG+ + L+H VL+VGYG PYWIIKNSWG ++GE+GY +
Sbjct: 267 LTDYYGGI-VSFCKNNGLNHAVLLVGYGVE-------NNVPYWIIKNSWGSDYGEDGYVR 318
Query: 345 ICMGRNVCGVDSMVSSVAAI 364
+ G N CG+ + ++S A +
Sbjct: 319 VRRGVNSCGMINELASSAQV 338
>gi|14349349|gb|AAC38833.2| cysteine protease [Leishmania chagasi]
Length = 353
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 167/311 (53%), Gaps = 25/311 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
A H+ FK + K + E RF FK N++ A +P A + V+ KF+DLTP
Sbjct: 37 ASAHYGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQ 96
Query: 108 EFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + +L N R D ++ + DWR+ G VT VK+QG CGSCW+F+
Sbjct: 97 EFAKLYLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFA 156
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
TG +EG L LVSLSEQ LV CD + D GCNGGLM A ++I+
Sbjct: 157 TTGNIEGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMQQAMQWIINDHN 207
Query: 224 GGVEREKDYPYTGTDGGSCK-FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
G V E YPYT G D + A ++ + + DE+++AA + K+GP+AV ++A
Sbjct: 208 GTVPTEDSYPYTSAGGTRPPCHDNGTVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDA 267
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GGV +C G L+HGVL+VG+ R + PYWI+KNSWG +WGE G
Sbjct: 268 TTWQLYFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKG 318
Query: 342 YYKICMGRNVC 352
Y ++ MG N C
Sbjct: 319 YIRLAMGSNQC 329
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 141/370 (38%), Positives = 199/370 (53%), Gaps = 41/370 (11%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFK---SKF 60
+ LS LLLL + + VA N D +++ SE+ L + E LF+ +K
Sbjct: 8 MKLSGALLLL---CVGACVARNSDFSIVGY--------SEEDLSSNERLVELFEKWLAKH 56
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR- 119
K YA+ EE +RF VFK NL+ + + G+ +F+DLT EF+ +LGL+
Sbjct: 57 QKAYASFEEKLHRFEVFKDNLKHIDKINREVTSYWLGLNEFADLTHDEFKAAYLGLDAAP 116
Query: 120 -LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R + + + + +DLP DWR GAVT VK+QG CGSCW+FS A+EG + + T
Sbjct: 117 ARRGSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVT 176
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L +LSEQ+L+DC S +SGCNGGLM+ AF YI +GG+ E+ YPY +
Sbjct: 177 GNLTALSEQELIDC--------SVDGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYL-ME 227
Query: 239 GGSCKFDKSKIAAAV--SNFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGGV-S 293
GSC K + AV S + + ++++Q + H P++V I A Q Y GGV
Sbjct: 228 EGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFD 287
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI----CMGR 349
P CG LDHGV VGYGS + K Y I++NSWG WGE GY ++ G
Sbjct: 288 GP--CGAQLDHGVAAVGYGSD-----KGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSNGE 340
Query: 350 NVCGVDSMVS 359
+CG++ M S
Sbjct: 341 GLCGINKMAS 350
>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 181/325 (55%), Gaps = 29/325 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGI 280
K GGV+ E DYPY D +C+ + +K V + + I E+++ L GP+ + I
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE+
Sbjct: 247 DAADIVNYKQGI-IKYCFDSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGED 298
Query: 341 GYYKICMGRNVCGVDSMVSSVAAIH 365
G++++ N CG+ + ++S A I+
Sbjct: 299 GFFRVQQNINACGMRNELASTAVIY 323
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 177/323 (54%), Gaps = 33/323 (10%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++ +++FK +++K Y +EE R V+++NL L H G+ ++ D+T
Sbjct: 24 DNEWNIFKKQYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMT 82
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPI-LPTN---DLPTDFDWRDHGAVTGVKDQGACGSC 161
EF + G R+ AP+ +P N DLP DWR G VT +K+QG CGSC
Sbjct: 83 NEEFTKTMNGY----RMRNKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSC 138
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
WSFSATG+LEG F TG+LVSLSEQ LVDC + + GC GGLM+ AF YI
Sbjct: 139 WSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKK-------QGNHGCEGGLMDDAFTYIK 191
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGI 280
G++ E YPY D G C+F + + A + F + + DE+ + + GP++V I
Sbjct: 192 ANNGIDTEASYPYKARD-GKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAI 250
Query: 281 NAVWM--QTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A M Q Y GV + C + LDHGVL VGYG+ K YW++KNSWGE+W
Sbjct: 251 DASHMSFQLYRTGVYHDWFCSQTKLDHGVLAVGYGTE-------DSKDYWLVKNSWGESW 303
Query: 338 GENGYYKICMG-RNVCGVDSMVS 359
G+ GY ++ RN CG+ + S
Sbjct: 304 GQKGYIQMSRNRRNNCGIATSAS 326
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 168/321 (52%), Gaps = 31/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW 284
++ EK YPY D G C+F K + A + + I + E + + GP++V I+A
Sbjct: 198 IDTEKSYPYKAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASH 256
Query: 285 --MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV P + LDHGVL+VGYG G K YW++KNSW E+WG+ G
Sbjct: 257 SSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQG 309
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ S S
Sbjct: 310 Y--ILMSRDNNNQCGIASQAS 328
>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
Length = 323
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 181/325 (55%), Gaps = 29/325 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGI 280
K GGV+ E DYPY D +C+ + +K V + + I E+++ L GP+ + I
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPIPMAI 246
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE+
Sbjct: 247 DAADIVNYKQGI-IKYCFDSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGED 298
Query: 341 GYYKICMGRNVCGVDSMVSSVAAIH 365
G++++ N CG+ + ++S A I+
Sbjct: 299 GFFRVQQNINACGMRNELASTAVIY 323
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 168/302 (55%), Gaps = 21/302 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ + K K Y E+ +RF V+K NL + + + T G+TKF+DLT EFRR
Sbjct: 53 QFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSET-NRTYSLGLTKFADLTNEEFRR 111
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ G A + ++ P DWR +GAVT VKDQG+CGSCW+FSA G++E
Sbjct: 112 MYTGTRIDRSRRAKRRTGFRYADSEAPESVDWRKNGAVTSVKDQGSCGSCWAFSAVGSVE 171
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + GE VSLSEQ+LVDCD E + GCNGGLM+ AF++I++ GG++ EKD
Sbjct: 172 GINAIRNGEAVSLSEQELVDCDLE--------YNQGCNGGLMDYAFDFIIQNGGIDTEKD 223
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYI 289
YPY G DG K+ + + + ++++ V P++V I A Q Y
Sbjct: 224 YPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYA 283
Query: 290 GGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
GV CG LDHGVL VGYG+ YWI+KNSWGE WGE+GY + M R
Sbjct: 284 QGVF-SGECGTDLDHGVLAVGYGTEDGV-------DYWIVKNSWGEYWGESGYLR--MKR 333
Query: 350 NV 351
N+
Sbjct: 334 NM 335
>gi|15824704|gb|AAL09448.1| cysteine protease [Leishmania donovani]
Length = 353
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 167/311 (53%), Gaps = 25/311 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
A H+ FK + K + E RF FK N++ A +P A + V+ KF+DLTP
Sbjct: 37 ASAHYGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQ 96
Query: 108 EFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + +L N R D ++ + DWR+ G VT VK+QG CGSCW+F+
Sbjct: 97 EFAKLYLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFA 156
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
TG +EG L LVSLSEQ LV CD + D GCNGGLM A ++I+
Sbjct: 157 TTGNIEGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMEQAMQWIINDHN 207
Query: 224 GGVEREKDYPYTGTDGGSCK-FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
G V E YPYT G D + A ++ + + DE+++AA + K+GP+AV ++A
Sbjct: 208 GTVPTEDSYPYTSAGGTRPPCHDNGTVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDA 267
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GGV +C G L+HGVL+VG+ R + PYWI+KNSWG +WGE G
Sbjct: 268 TTWQLYFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKG 318
Query: 342 YYKICMGRNVC 352
Y ++ MG N C
Sbjct: 319 YIRLAMGSNQC 329
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 168/321 (52%), Gaps = 31/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW 284
++ EK YPY D G C+F K + A + + I + E + + GP++V I+A
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASH 256
Query: 285 --MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV P + LDHGVL+VGYG G K YW++KNSW E+WG+ G
Sbjct: 257 SSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQG 309
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ S S
Sbjct: 310 Y--ILMSRDNNNQCGIASQAS 328
>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
Length = 357
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 181/323 (56%), Gaps = 45/323 (13%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR-----RAKRRQLLDPTA-VHGVTKFSDLTP 106
F L++ + Y +E RF +F +NL AKR P+ + G+ F+D +P
Sbjct: 52 FQLWRKEHGLVYKDLKEMAKRFEIFLSNLNYIIEFNAKRS---SPSGYLLGLNNFADWSP 108
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
SEF+ +L L +P D+ P+L + P DWR+ AVT +K+QG+CGSCW+
Sbjct: 109 SEFQEIYL---HSLDMPTDSAPKLNGPLL-SCIAPASLDWRNKVAVTAIKNQGSCGSCWA 164
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSA GA+EG H ++TGEL+SLSEQ+LV+CD GCNGG +N AF++++
Sbjct: 165 FSAAGAIEGIHAITTGELISLSEQELVNCDR---------VSKGCNGGWVNKAFDWVISN 215
Query: 224 GGVEREKDYPYTGTDGGSCKFDKS-KIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
GG+ E +YPYTG DGG+C DK I A + + + ++ + ++VK P+++ +NA
Sbjct: 216 GGITLEAEYPYTGKDGGNCNSDKQVPIKATIDGYEQVEQSDNGLLCSIVKQ-PISICLNA 274
Query: 283 VWMQTYIGGVSCPYIC---GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C KY +H VLIVGY SS + YWI+KNSWG WG
Sbjct: 275 TDFQLYESGIFDGQQCSSSSKYTNHCVLIVGYDSS-------NGEDYWIVKNSWGTKWGI 327
Query: 340 NGYYKICMGRN------VCGVDS 356
NGY I + RN VCG+++
Sbjct: 328 NGY--IWIKRNTGLPYGVCGMNA 348
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/317 (37%), Positives = 173/317 (54%), Gaps = 24/317 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F + SK Y ++E RF ++++N++ L +F+D+T SEF
Sbjct: 40 KQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEF 99
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+ FLGLN Q+ P ++P DWR GAVT +++QG CG CW+FSA A
Sbjct: 100 KAHFLGLNTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAA 159
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TG LVSLSEQQL+DCD G+ + GC+GGLM +AFE+I GG+ E
Sbjct: 160 IEGINKIKTGNLVSLSEQQLIDCD-------VGTYNKGCSGGLMETAFEFIKSNGGLTTE 212
Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQ 286
DYPYTG + G+C +K+K + + ++ +E + + P++VGI+A Q
Sbjct: 213 TDYPYTGIE-GTCDQEKAKNKVVTIQGYQKVAQNEASLQIAAAQQ-PVSVGIDAGGFIFQ 270
Query: 287 TYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y GV Y CG L+HGV +VGYG G ++ YWI+KNSWG WGE GY ++
Sbjct: 271 LYSSGVFTSY-CGTNLNHGVTVVGYGVEG-------DQKYWIVKNSWGTGWGEEGYIRME 322
Query: 347 MG----RNVCGVDSMVS 359
G CG+ + S
Sbjct: 323 RGISEDTGKCGIAMLAS 339
>gi|398014254|ref|XP_003860318.1| cysteine peptidase A (CBA) [Leishmania donovani]
gi|13518086|gb|AAK27384.1| cysteine proteinase-like protein [Leishmania donovani]
gi|322498538|emb|CBZ33611.1| cysteine peptidase A (CBA) [Leishmania donovani]
Length = 354
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 167/311 (53%), Gaps = 25/311 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
A H+ FK + K + E RF FK N++ A +P A + V+ KF+DLTP
Sbjct: 38 ASAHYGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQ 97
Query: 108 EFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + +L N R D ++ + DWR+ G VT VK+QG CGSCW+F+
Sbjct: 98 EFAKLYLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFA 157
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
TG +EG L LVSLSEQ LV CD + D GCNGGLM A ++I+
Sbjct: 158 TTGNIEGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMEQAMQWIINDHN 208
Query: 224 GGVEREKDYPYTGTDGGSCK-FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
G V E YPYT G D + A ++ + + DE+++AA + K+GP+AV ++A
Sbjct: 209 GTVPTEDSYPYTSAGGTRPPCHDNGTVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDA 268
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GGV +C G L+HGVL+VG+ R + PYWI+KNSWG +WGE G
Sbjct: 269 TTWQLYFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKG 319
Query: 342 YYKICMGRNVC 352
Y ++ MG N C
Sbjct: 320 YIRLAMGSNQC 330
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 137/345 (39%), Positives = 186/345 (53%), Gaps = 37/345 (10%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D I P D E S D L+ F + S F K Y T EE RF VFK NL+
Sbjct: 30 DYSIVGYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDET 85
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWR 144
+ G+ +F+DL+ EF++ +LGL + + + D+ P DWR
Sbjct: 86 NKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWR 145
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
GAV VK+QG+CGSCW+FS A+EG + + TG L +LSEQ+L+DCD +
Sbjct: 146 KKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT--------TY 197
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--DKSKIAAAVSNFSVISSD 262
++GCNGGLM+ AFEYI+K GG+ +E+DYPY+ + G+C+ D+S+ + V ++D
Sbjct: 198 NNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKDESETVTINGHQDVPTND 256
Query: 263 EDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIR 320
E + L H PL+V I+A Q Y GGV CG LDHGV VGYGSS
Sbjct: 257 EKSLLKALA-HQPLSVAIDASGREFQFYSGGV-FDGRCGVDLDHGVAAVGYGSS------ 308
Query: 321 FKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGVDSMVS 359
K Y I+KNSWG WGE GY I + RN +CG++ M S
Sbjct: 309 -KGSDYIIVKNSWGPKWGEKGY--IRLKRNTGKPEGLCGINKMAS 350
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 168/321 (52%), Gaps = 31/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW 284
++ EK YPY D G C+F K + A + + I + E + + GP++V I+A
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASH 256
Query: 285 --MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV P + LDHGVL+VGYG G K YW++KNSW E+WG+ G
Sbjct: 257 SSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQG 309
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ S S
Sbjct: 310 Y--ILMSRDNNNQCGIASQAS 328
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 171/321 (53%), Gaps = 24/321 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
FK+KF ++Y +EE R VF N++ T GV +F+DLT EF + ++G
Sbjct: 22 FKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEEFSKTYMG 81
Query: 116 LNRRLRLPADAQKAP--ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+ + DA + LPT DW GAVT VK+QG CGSCWSFS TG+LEGA
Sbjct: 82 FKKPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTTGSLEGA 141
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ +STG+LVSLSEQQ VDC + GCNGGLM+SAF+Y +A + E+ YP
Sbjct: 142 NEISTGKLVSLSEQQFVDCAGTYG-------NQGCNGGLMDSAFKYA-EANALCTEQSYP 193
Query: 234 YTGTDGGSCKFDKSKIAAA---VSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTY 288
Y GTD GSC+ A VS + +SSD +Q + V P+++ I A Q Y
Sbjct: 194 YKGTD-GSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVFQLY 252
Query: 289 IGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
GGV CG LDHGVL VGYG+ YW +KNSWG WG +GY + G
Sbjct: 253 SGGV-LTGACGASLDHGVLAVGYGT-------LSGTDYWKVKNSWGSTWGMSGYVLLQRG 304
Query: 349 RNVCGVDSMVSSVAAIHTTSS 369
+ G ++S + T S
Sbjct: 305 KGGSGECGLLSEPSYPQVTGS 325
>gi|146084829|ref|XP_001465113.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
gi|134069209|emb|CAM67356.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
Length = 354
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/317 (38%), Positives = 169/317 (53%), Gaps = 25/317 (7%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
A H+ FK + K + E RF FK N++ A +P A + V+ KF+DLTP
Sbjct: 38 ASAHYGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQ 97
Query: 108 EFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + +L N R D ++ + DWR+ G VT VK+QG CGSCW+F+
Sbjct: 98 EFAKLYLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFA 157
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
TG +EG L LVSLSEQ LV CD + D GCNGGLM A ++I+
Sbjct: 158 TTGNIEGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMQQAMQWIINDHN 208
Query: 224 GGVEREKDYPYTGTDGGSCK-FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
G V E YPYT G D + A + + + DE+++AA + K+GP+AV ++A
Sbjct: 209 GTVPTEDSYPYTSAGGTRPPCHDNGTVGAKIKGYMSLPHDEEEIAAYVGKNGPVAVAVDA 268
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GGV +C G L+HGVL+VG+ R + PYWI+KNSWG +WGE G
Sbjct: 269 TTWQLYFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKG 319
Query: 342 YYKICMGRNVCGVDSMV 358
Y ++ MG N C + + V
Sbjct: 320 YIRLAMGSNQCLLKNYV 336
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/327 (38%), Positives = 179/327 (54%), Gaps = 34/327 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+S++ A ++ + + +TY E + R++VF+ NLR
Sbjct: 29 IVSYGERSDEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 85
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG R +L A A DLP DWR
Sbjct: 86 VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAAD---NEDLPESVDWRAK 142
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG+ GSCW+FS A+EG + + TG+L+SLSEQ+LVDCD S +
Sbjct: 143 GAVAEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 194
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ EKDYPY GTDG K+ + ++ + +++++
Sbjct: 195 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 254
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
V + P++V I A Q Y G+ CG LDHGV VGYG+ K
Sbjct: 255 LQKAVANQPVSVAIEAAGTQFQLYSSGIFTG-SCGTALDHGVTAVGYGTE-------NGK 306
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNV 351
YWI+KNSWG +WGE+GY + M RN+
Sbjct: 307 DYWIVKNSWGSSWGESGYVR--MERNI 331
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/359 (37%), Positives = 193/359 (53%), Gaps = 46/359 (12%)
Query: 7 SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
++LL L ++ +AS AV+ D P G F+ + + K+YA
Sbjct: 4 TTLLALCVALFVASTFAVSHD--------PLTGV------------FADWMQEHQKSYAN 43
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD- 125
EE YR+ V++ N + + + + KF DLT +EF + F GL+ + AD
Sbjct: 44 -EEFVYRWNVWRENYLYIEAHNHQNKSFHLAMNKFGDLTNAEFNKLFKGLS----ITADQ 98
Query: 126 -AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
Q++ I P LP DFDWR GAVT VK+QG CGSCWSFS TG+ EGA+FL G L SL
Sbjct: 99 AKQESDIAPAPGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSL 158
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
SEQ LVDC + + GCNGGLM+ AFEYI++ G++ E+ YPY + G+C++
Sbjct: 159 SEQNLVDC-------STSYGNHGCNGGLMDYAFEYIIRNKGIDTEESYPYHASQ-GTCRY 210
Query: 245 DKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGV-SCPYICGKY 301
+K + +++ + S + N V P +V I+A Q Y GGV P
Sbjct: 211 NKQHSGGELVSYTNVPSGNEGALLNAVATQPTSVAIDASHSSFQFYKGGVYDEPACSSSR 270
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
LDHGVL VG+G +R K YW++KNSWG +WG +GY ++ + N CG+ + S
Sbjct: 271 LDHGVLAVGWG------VR-DGKDYWLVKNSWGADWGLSGYIEMSRNKHNQCGIATAAS 322
>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
Length = 324
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 118/327 (36%), Positives = 181/327 (55%), Gaps = 30/327 (9%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F KF+K Y+++ E +RF++F+ NL + D TA + + KFSDL+
Sbjct: 21 LLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQNDSTAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + IL P + P +FDWR VT VK+QG CG+
Sbjct: 81 KEEAISKYTGLS----LPHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + L++LSEQQ +DCD ++GC+GGL+++AFE
Sbjct: 137 CWAFATLGSLESQFAIKYNRLINLSEQQFIDCDR---------VNAGCDGGLLHTAFESA 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAVG 279
++ GGV+ E DYPY T G C+ + ++ V S I E+++ L GP+ V
Sbjct: 188 MEMGGVQMESDYPYE-TANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPIPVA 246
Query: 280 INAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
I+A + Y G+ C + L+H VL+VGY PYWI+KN+WG +WG
Sbjct: 247 IDASDIVNYRRGIMRQ--CANHGLNHAVLLVGYAVEN-------NIPYWILKNTWGTDWG 297
Query: 339 ENGYYKICMGRNVCGVDSMVSSVAAIH 365
E+GY+++ N CG+ + + S A I+
Sbjct: 298 EDGYFRVQQNINACGIRNELVSSAEIY 324
>gi|17384029|emb|CAD12392.1| cysteine proteinase [Leishmania infantum]
Length = 354
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/317 (38%), Positives = 169/317 (53%), Gaps = 25/317 (7%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
A H+ FK + K + E RF FK N++ A +P A + V+ KF+DLTP
Sbjct: 38 ASAHYGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQ 97
Query: 108 EFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + +L N R D ++ + DWR+ G VT VK+QG CGSCW+F+
Sbjct: 98 EFAKLYLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFA 157
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
TG +EG L LVSLSEQ LV CD + D GCNGGLM A ++I+
Sbjct: 158 TTGNIEGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMQQAMQWIINDHN 208
Query: 224 GGVEREKDYPYTGTDGGSCK-FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
G V E YPYT G D + A + + + DE+++AA + K+GP+AV ++A
Sbjct: 209 GTVPTEDSYPYTSAGGTRPPCHDNGTVGAKIKGYMSLPHDEEEIAAYVGKNGPVAVAVDA 268
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GGV +C G L+HGVL+VG+ R + PYWI+KNSWG +WGE G
Sbjct: 269 TTRQLYFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKG 319
Query: 342 YYKICMGRNVCGVDSMV 358
Y ++ MG N C + + V
Sbjct: 320 YIRLAMGSNQCLLKNYV 336
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 132/357 (36%), Positives = 185/357 (51%), Gaps = 33/357 (9%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIR--QVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
+LL LS L+SA D ++I Q + D + A + L K K Y
Sbjct: 12 FVLLFLSFTLSSA----SDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQ--GKVYNA 65
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RRLRLP 123
E + RF+VFK NLR + T G+ F+DLT E+R +LG +R RL
Sbjct: 66 LGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLR 125
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+ + LP DWR GAV VKDQG+CGSCW+FS A+EG + + TG+L+S
Sbjct: 126 KTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLIS 185
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E+DYPY DG
Sbjct: 186 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDT 237
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKY 301
+ K+ + ++ + + + V + P++V I A Q Y G+ CG
Sbjct: 238 YRKNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGR-CGTQ 296
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN----VCGV 354
LDHGV VGYG+ K YWI++NSWG++WGENGY ++ N +CG+
Sbjct: 297 LDHGVAAVGYGTE-------NGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICGI 346
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 175/315 (55%), Gaps = 27/315 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FK + + Y EE + RF +FK NL+ K+ L + G+ +F+D+ EFR
Sbjct: 45 FKKQHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR- 103
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+ GL R + Q + L L P + DWR G VT VK+QG CGSCWSFS TG+
Sbjct: 104 MYNGLRRDYNYSREVQCSNHLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGS 163
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LEG HF +G+LVSLSEQQLVDC + E GCNGGLM+ AFEYI+ GG+E E
Sbjct: 164 LEGQHFHKSGKLVSLSEQQLVDCSGKFGNE-------GCNGGLMDQAFEYIITNGGIETE 216
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAVW--MQ 286
++YPY C F KS++AA S V S DE + ++ + GP+++ I+A Q
Sbjct: 217 EEYPYDARQ-ERCHFKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQ 275
Query: 287 TYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Y GGV P LDHGVL+VGYG+ + YW++KNSWG WG GY K+
Sbjct: 276 LYSGGVYDEPKCSSTELDHGVLVVGYGTD-------DGQDYWLVKNSWGTTWGLEGYVKM 328
Query: 346 CMGR-NVCGVDSMVS 359
+ N CGV + S
Sbjct: 329 SRNQDNQCGVATQAS 343
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/361 (35%), Positives = 191/361 (52%), Gaps = 51/361 (14%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+LL S+ ASA++ SDGE E + L+ +K K Y
Sbjct: 9 ALLSFFFLSISASALSRR-----------SDGEVRE--------IYDLWLAKHGKAYNGI 49
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN-----RRLRL 122
+E + RF++FK NL+ + T G+ F+DLT E+R +LG R ++
Sbjct: 50 DEREKRFQIFKENLKFIDDHNSENRTYKVGLNMFADLTNEEYRALYLGTRSPPARRVMKA 109
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
+++ + + LP DWR GAV VK+QG+CGSCW+FS A+EG + + TGEL+
Sbjct: 110 KTASRRYAVNNLDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELI 169
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQ+LV CD + +SGCNGGLM+ AF++I+ GG++ E+DYPY DG
Sbjct: 170 SLSEQELVSCDKK--------YNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCD 221
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGK 300
K+ ++ + + +++++ V H P++V I A + +Q Y GV CG
Sbjct: 222 PTRKNAKVVSIDAYEDVPANDEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGK-CGS 280
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV-------CG 353
LDHGV+ VGYG YW+++NSWG +WGE+GY+K + RNV CG
Sbjct: 281 ALDHGVVAVGYGKENGV-------DYWLVRNSWGTSWGEDGYFK--LERNVKHITEGKCG 331
Query: 354 V 354
+
Sbjct: 332 I 332
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 173/322 (53%), Gaps = 33/322 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ K+Y ++ E R+++F N L AK V G+ +F DL P
Sbjct: 6 QWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPH 65
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 66 EFAKMFNGYHGE-RKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFS 124
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAG 224
ATG+LEG HFL +G+LVSLSEQ L+DC SGS + GC GGLM++AF+YI
Sbjct: 125 ATGSLEGQHFLKSGKLVSLSEQNLIDC--------SGSFGNEGCGGGLMDNAFKYIKAND 176
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAV 283
G++ E+ YPY D G C+F K + A + F + ED + + GP++V I+A
Sbjct: 177 GIDTEESYPYEAMD-GDCRFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDAS 235
Query: 284 W--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV P + LDHGVL VGYG K YW++KNSW E WG+N
Sbjct: 236 HSSFQLYSEGVYDEPNCSSEELDHGVLAVGYGVK-------NGKKYWLVKNSWAETWGDN 288
Query: 341 GYYKICMGR---NVCGVDSMVS 359
GY I M R N CG+ S S
Sbjct: 289 GY--ILMSRDKDNQCGIASSAS 308
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 173/318 (54%), Gaps = 33/318 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K++ K Y + EE R +++ NL + + L T G+ +F+DL EF
Sbjct: 31 WKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQFTDLQNEEFVA 90
Query: 112 QFLGLNRRLRLPADAQKAPILPTN---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
G R A+ + LP N +LP DWR G VT VKDQG CGSCW+FS TG
Sbjct: 91 MMTGF-RVSGTSKAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTG 149
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
++EG HF +TG+LVSLSEQ LVDC D+GC+GG M+ AF+YI+ AGG++
Sbjct: 150 SVEGQHFKATGKLVSLSEQNLVDCSGR---------DAGCDGGFMDRAFQYIIDAGGIDT 200
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINAVWM-- 285
E YPY D G C F K+ + A V+ ++ ++S ++ V H GP++V I+A M
Sbjct: 201 EASYPYKAVD-GKCHFKKANVGATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHMSF 259
Query: 286 QTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y GV C LDHGVL VGYG+S YWI+KNSW E WG NGY
Sbjct: 260 QHYKSGVYNEPGCDSTVLDHGVLAVGYGTSS------DGTDYWIVKNSWAETWGMNGY-- 311
Query: 345 ICMGR---NVCGVDSMVS 359
+ M R N CG+ + S
Sbjct: 312 VWMSRNKDNQCGIATNAS 329
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/314 (38%), Positives = 176/314 (56%), Gaps = 30/314 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + K K+Y T +E R+ VF+ N+ + + G+ +DLT EF++
Sbjct: 32 FQNWMVKHQKSY-TNDEFGSRYSVFQDNMDIVAKWNQKGSNTILGLNVMADLTNEEFKKL 90
Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
+LG + +K ++ + LP DWR +GAVT VK+QG CG C++FS TG++EG
Sbjct: 91 YLGTKANVTY----KKKTLVGVSGLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEG 146
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVEREKD 231
H +++ +LV LSEQQ++DC SGS ++GC+GGLM ++FEYI+ GG++ E
Sbjct: 147 IHEITSQQLVPLSEQQILDC--------SGSEGNNGCDGGLMTNSFEYIIAVGGLDTEAS 198
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYI 289
YPYTG + G CKF+K I A ++ + + S + V P++V I+A Q Y
Sbjct: 199 YPYTG-EVGKCKFNKKNIGATITGYKNVESGSESDLQTAVAAQPVSVAIDASQSSFQLYA 257
Query: 290 GGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
GV P LDHGVL VGYGS + YWI+KNSWG +WGENG+ I M
Sbjct: 258 SGVYYEPECSSTQLDHGVLAVGYGSQ-------SGQDYWIVKNSWGADWGENGF--ILMA 308
Query: 349 RNV---CGVDSMVS 359
RN CG+ +M S
Sbjct: 309 RNKDNNCGIATMAS 322
>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 176/319 (55%), Gaps = 22/319 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++H++L+K+ K+YA +EE +R +++ NLR + L H G+ +F D+T
Sbjct: 26 DNHWNLWKNWHKKSYAPKEE-GWRRVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + ++ AP + P DWR G VT VKDQG CGSCW+FS
Sbjct: 85 NEEFRQLMNGYKNQKKIRGSTFLAP--NNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG H+ +TG+++SLSEQ LVDC + GCNGGLM+ AF+Y+ GG
Sbjct: 143 TTGALEGQHYRNTGKMISLSEQNLVDC-------SRAQGNQGCNGGLMDQAFQYVKDNGG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINA-- 282
++ E YPYT D C +D + +A + F ++S ++ N V GP++V ++A
Sbjct: 196 IDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSGSEKDLMNAVASVGPVSVAVDAGH 255
Query: 283 VWMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ P + LDHGVL+VGYG G K YWI+KNSW E WG +G
Sbjct: 256 QSFQFYKSGIYYEPECSSEDLDHGVLVVGYGFEGEDE---DGKKYWIVKNSWSEKWGNDG 312
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y I R N CG+ + S
Sbjct: 313 YIYIAKDRHNHCGIATAAS 331
>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 142/363 (39%), Positives = 193/363 (53%), Gaps = 50/363 (13%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
LLL+L++V+ S AV+ D + Q +S FK + SK Y ++ E
Sbjct: 3 LLLILAAVVISCQAVSFYDLVQEQ-------------------WSSFKMQHSKNYDSETE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
+R ++F N + AK +L V G+ K++D+ EF G N+ +
Sbjct: 44 ERFRMKIFMENAHKVAKHSKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILK 103
Query: 123 PADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+D A I P N LP DWRD GAVT VKDQG CGSCWSFS +G+LEG HF TG
Sbjct: 104 GSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQHFRKTG 163
Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
+LVSLSEQ LVDC SG ++GCNGGLM++AF YI GG++ E+ YPY D
Sbjct: 164 KLVSLSEQNLVDC--------SGRYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAED 215
Query: 239 GGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW--MQTYIGGV-SC 294
C + A F I +ED + A + GP+++ I+A + Q Y GV S
Sbjct: 216 -EKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPISIAIDASYETFQLYSDGVYSD 274
Query: 295 PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCG 353
P + LDHGVL+VGYG+S + YW++KNSW + G NGY K+ + N+CG
Sbjct: 275 PECISQELDHGVLVVGYGTSD------DGQDYWLVKNSWRPSCGLNGYIKMARNQDNMCG 328
Query: 354 VDS 356
V S
Sbjct: 329 VAS 331
>gi|149755237|ref|XP_001495795.1| PREDICTED: cathepsin L1-like [Equus caballus]
Length = 339
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 169/312 (54%), Gaps = 23/312 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRR 111
+K+ + Y +E +R V++ N+R + HG T F D+T EFR+
Sbjct: 32 WKATHRRLYGVNKEA-WRRAVWEKNMRMIELHNQEYSQGKHGFTMAMNAFGDMTNEEFRQ 90
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
GL+ + + P+ + +LP DWR G VT VK+QG CGSCW+FSATGALE
Sbjct: 91 VMNGLHNQTHKKGRVFREPL--SAELPKSVDWRKKGYVTPVKNQGLCGSCWAFSATGALE 148
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G F TG+LVSLSEQ LVDC + GC+GGLM+ AF+Y+ GG++ EK
Sbjct: 149 GQMFRKTGKLVSLSEQNLVDCSW-------AQGNEGCSGGLMDYAFQYVKDNGGLDSEKS 201
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYI 289
YPY D G CK+ AA + F I E + + GP++ GI+A Q Y
Sbjct: 202 YPYLAED-GFCKYKPEYSAANDTGFLDIQQQEKFLMEAVATVGPISAGIDASLESFQFYK 260
Query: 290 GGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
G+ P KYLDHGVL+VGYG G + YW++KNSWGE+WG NGY K+
Sbjct: 261 EGIYYDPDCSSKYLDHGVLVVGYGFEG----KDSRNKYWLVKNSWGEDWGMNGYIKMAKD 316
Query: 349 R-NVCGVDSMVS 359
R N CG+ +M S
Sbjct: 317 RENHCGIATMAS 328
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 140/367 (38%), Positives = 195/367 (53%), Gaps = 51/367 (13%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L+LL + +A+A AV+ + ++V + ++ FK + K Y ++ E
Sbjct: 3 ILILLMAFVAAANAVS-----LYELVKEE--------------WNAFKLQHRKNYDSETE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
R +++ N + AK Q D V K++DL EF + G NR + L
Sbjct: 44 ERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGFNRTDSKKSL 103
Query: 123 PADAQKAPIL---PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P+ P N ++PT DWR GAVT VKDQG CGSCWSFSATGALEG HF T
Sbjct: 104 KGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKT 163
Query: 179 GELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G+LVSLSEQ LVDC SG ++GCNGG+M+ AF+YI GG++ EK YPY
Sbjct: 164 GKLVSLSEQNLVDC--------SGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAI 215
Query: 238 DGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSC 294
D +C F+ + A + + DE+ + L GP+++ I+A Q Y GV
Sbjct: 216 D-DTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYY 274
Query: 295 PYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVC 352
C + LDHGVL VGYG+S + + YW++KNSWG WG+ GY K+ R N C
Sbjct: 275 EPQCDSENLDHGVLAVGYGTSE------EGEDYWLVKNSWGTTWGDQGYVKMARNRDNHC 328
Query: 353 GVDSMVS 359
GV + S
Sbjct: 329 GVATCAS 335
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 118/312 (37%), Positives = 168/312 (53%), Gaps = 25/312 (8%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--- 115
K K Y E D RF++FK NL + T + G+ KF+D+T E+R +LG
Sbjct: 45 KHQKVYNGLREKDQRFQIFKDNLNFIDEHNAQNYTYIVGLNKFADMTNEEYRDMYLGTRS 104
Query: 116 -LNRR-LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+ RR ++ + + LP DWR GA+T +KDQG+CGSCW+FS +E
Sbjct: 105 DIKRRIMKNKITGHRYAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAI 164
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TG+LVSLSEQ+LVDCD + + GCNGGLM+ AFE+I+ GG++ ++ YP
Sbjct: 165 NKIVTGKLVSLSEQELVDCDR--------AFNEGCNGGLMDYAFEFIIGNGGIDTDQHYP 216
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGG 291
Y G +G K ++ + + S+ + V H P++V I A +Q Y G
Sbjct: 217 YKGFEGRCDPTRKKAKIVSIDGYEDVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSG 276
Query: 292 VSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
V CG LDH V+IVGYGS YW+++NSWG NWGE+GY+K M RNV
Sbjct: 277 VFTGK-CGTSLDHAVVIVGYGSE-------NGLDYWLVRNSWGTNWGEDGYFK--MERNV 326
Query: 352 CGVDSMVSSVAA 363
G + +A
Sbjct: 327 KGTHTGKCGIAV 338
>gi|258406688|gb|ACV72067.1| putative cysteine protease [Lathyrus sativus]
Length = 350
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 139/361 (38%), Positives = 196/361 (54%), Gaps = 39/361 (10%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
SLL++L A+A D IR V SD E+ ++ H F+ F +++ K Y
Sbjct: 5 SLLIVLFCVTTAAAGFSFHDSNPIRMV--SDAEEQLLQVIGESRHAVSFARFANRYGKLY 62
Query: 65 ATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+ +E RF++F NL R +R+L + GV F+D T EF+ LG +
Sbjct: 63 DSVDEMKLRFKIFSENLELIRSTNKRRL---SYKLGVNHFADWTWEEFKSHRLGAAQNC- 118
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
A + + +LP + DWR G V+ VKDQG CGSCW+FS TGALE A+ + G+
Sbjct: 119 -SATLKGNHKITDANLPDEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKN 177
Query: 182 VSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
+SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG+E E+ YPYTG++ G
Sbjct: 178 ISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLETEETYPYTGSN-G 228
Query: 241 SCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVSCPY 296
CKF +A V N ++ S DE + A + P++V V + Y GV
Sbjct: 229 LCKFTSENVALKVLGSVNITLGSEDELKHAVAFAR--PVSVAFEVVHDFRLYKSGVYTST 286
Query: 297 ICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCG 353
CG ++H VL VGYG PYW IKNSWG +WG++GY+K+ MG+N+CG
Sbjct: 287 ACGNTPMDVNHAVLAVGYGIE-------DGIPYWHIKNSWGGDWGDHGYFKMEMGKNMCG 339
Query: 354 V 354
V
Sbjct: 340 V 340
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 135/364 (37%), Positives = 192/364 (52%), Gaps = 34/364 (9%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
S+ LL +S + + A D +++ D S D L + F + SK K+Y
Sbjct: 6 FSNFFLLFISMAVFAYSAFARDFSIVG--YSPDDLTSMDKLTDL---FESWMSKHGKSYR 60
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EE +RF VF+ NL+ + G+ +F+DL+ EF+R++LGL L D
Sbjct: 61 SFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLKIELPKRRD 120
Query: 126 A-QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
+ ++ DLP DWR GAV VK+QGACGSCW+FS A+EG + + TG L +L
Sbjct: 121 SPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTAL 180
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
SEQ+L+DCD ++GCNGGLM+ AF +I+ GG+ +E+DYPY + G+C
Sbjct: 181 SEQELIDCDK--------PFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV-MEEGTCGE 231
Query: 245 DKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGGVSCPYICGKY 301
K ++ +S + + D +Q + + PL+V I A Q Y GG+ + CG
Sbjct: 232 KKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGH-CGTE 290
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV------CGVD 355
LDHGV VGYG+S K Y +KNSWG WGE GY I M RNV CG+
Sbjct: 291 LDHGVAAVGYGTS-------KGVDYITVKNSWGSKWGEKGY--IRMKRNVGKPEGICGIY 341
Query: 356 SMVS 359
M S
Sbjct: 342 KMAS 345
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 124/320 (38%), Positives = 175/320 (54%), Gaps = 22/320 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSD 103
N + ++ +K+ K Y EE +R V++ N++ + H + F D
Sbjct: 24 NLDARWTRWKAANGKLYNKDEEV-WRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
LT EF++ GL +++ P + +LP + P+ DWR+ G VT VKDQG CGSCW+
Sbjct: 83 LTNEEFKQVMNGL--KIQNPREGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC ++GCNGGLM++AF Y+
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AEGNAGCNGGLMDNAFRYVKDN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA- 282
GG++ E+ YPY D G CK+ + AA + F+ I DE+ + ++ GP++V I+A
Sbjct: 194 GGLDSEESYPYLAQD-GRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPISVAIDAS 252
Query: 283 --VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+ Y G P + LDHGVL+VGYGS + K YWI+KNSWG WG
Sbjct: 253 LDTFRFYYKGIYYDPNCSSEDLDHGVLVVGYGSD---EREAENKNYWIVKNSWGTQWGMQ 309
Query: 341 GYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 310 GYILMAKDRGNHCGIATSAS 329
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 131/326 (40%), Positives = 175/326 (53%), Gaps = 30/326 (9%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDL 104
+ + FK K Y ++ E +R ++F N + AK +L V GV K+SD+
Sbjct: 23 VQEQWGAFKVTHKKQYESETEERFRMKIFMENAHKVAKHNKLYAQGLVSFKLGVNKYSDM 82
Query: 105 TPSEFRRQFLGLNRRLRLPA-----DAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGAC 158
EF G NR + P D I P N +LP DWR GAVT VKDQG C
Sbjct: 83 LNHEFVHTLNGYNRS-KTPLRSGELDESITFIPPANVELPKQIDWRKLGAVTPVKDQGQC 141
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCWSFS TG+LEG HF + +LVSLSEQ L+DC E+ G ++GCNGGLM++AF
Sbjct: 142 GSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCS-----EKYG--NNGCNGGLMDNAFR 194
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLA 277
YI GG++ E+ YPY D C + A F I S DE+++ A + GP++
Sbjct: 195 YIKDNGGIDTEQSYPYKAED-EKCHYKPRNKGATDRGFVDIESGDEEKLKAAVATVGPIS 253
Query: 278 VGINAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
V I+A Q Y GV P + LDHGVL+VGYG+ YW++KNSWG
Sbjct: 254 VAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDEDG------NDYWLVKNSWG 307
Query: 335 ENWGENGYYKICMGR-NVCGVDSMVS 359
++WG+ GY K+ R N CG+ + S
Sbjct: 308 DSWGDQGYIKMARNRDNNCGIATQAS 333
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 128/323 (39%), Positives = 175/323 (54%), Gaps = 30/323 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
++ FK + K YA E +R ++F N AK Q V + K++D+ E
Sbjct: 29 WNTFKLEHRKNYADSTEETFRMKIFNENKHHIAKHNQRYATGEVSYKLALNKYADMLHHE 88
Query: 109 FRRQFLGLN----RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSC 161
FR G N ++LR ++ + + LPT DWR GAVT VKDQG CGSC
Sbjct: 89 FRETMNGFNYTLHKQLRSTDESFTGVTFISPEHVKLPTAVDWRTKGAVTEVKDQGHCGSC 148
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGA+EG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF Y+
Sbjct: 149 WAFSSTGAIEGQHFRKSGTLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYVK 201
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGI 280
GG++ EK Y Y G D SC FDK+ I A F+ I +E ++A + GP++V I
Sbjct: 202 DNGGIDTEKSYAYEGID-DSCHFDKNSIGATDRGFADIPQGNEKKLAQAVATIGPVSVAI 260
Query: 281 NAVW--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y GV P + LDHGVL+VGYG+ YW++KNSWG W
Sbjct: 261 DASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYGTEKDGS------DYWLVKNSWGTTW 314
Query: 338 GENGYYKICMGR-NVCGVDSMVS 359
G+ G+ K+ + N CG+ S S
Sbjct: 315 GDKGFIKMSRNKENQCGIASASS 337
>gi|15617524|ref|NP_258322.1| cathepsin-like cysteine proteinase [Spodoptera litura NPV]
gi|37077642|sp|Q91BH1.1|CATV_NPVST RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15553260|gb|AAL01738.1|AF325155_50 cathepsin-like cysteine proteinase [Spodoptera litura NPV]
Length = 337
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 123/326 (37%), Positives = 172/326 (52%), Gaps = 33/326 (10%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSE 108
A ++ F + +K Y T ++ D F FK NL + AV+G+ KFSD+
Sbjct: 29 ASVYYENFIKQHNKEYTTPDQRDAAFVNFKRNLADMNAMNNVSNQAVYGINKFSDIDKIT 88
Query: 109 FRRQFLGLNRRLRLPADAQKAPIL---------PTNDLPTDFDWRDHGAVTGVKDQGACG 159
F + GL L D+ P P+ P FDWR VT VK+QG CG
Sbjct: 89 FVNEHAGLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKVTKVKEQGVCG 148
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+F+A G +E + + L+ LSEQQL+DCD D GC+GGLM+ AF+
Sbjct: 149 SCWAFAAIGNIESQYAIMHDSLIDLSEQQLLDCDR---------VDQGCDGGLMHLAFQE 199
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAV 278
I++ GGVE E DYPY G + +C+ SK+A +S+ + DE ++ L K+GP+AV
Sbjct: 200 IIRIGGVEHEIDYPYQGIE-YACRLAPSKLAVRLSHCYQYDLRDERKLLELLYKNGPIAV 258
Query: 279 GINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
I+ V + Y G++ +C L+H VL+VGYG + PYWI KNSWG NW
Sbjct: 259 AIDCVDIIDYRSGIAT--VCNDNGLNHAVLLVGYGIE-------NDTPYWIFKNSWGSNW 309
Query: 338 GENGYYKICMGRNVCGVDSMVSSVAA 363
GENGY++ N CG M++ AA
Sbjct: 310 GENGYFRARRNINACG---MLNEFAA 332
>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
Length = 323
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 116/326 (35%), Positives = 181/326 (55%), Gaps = 29/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F +F+K Y ++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 21 LLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKDQND-SAKYEINKFSDLS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+
Sbjct: 80 KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGA 135
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQ++DCD D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVG 279
+K GGV+ E DYPY D +C+ + +K V + + I+ E+++ L GP+ +
Sbjct: 187 IKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMA 245
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE
Sbjct: 246 IDAADIVNYKQGI-IKYCFNSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGE 297
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
+G++++ N CG+ + ++S A I+
Sbjct: 298 DGFFRVQQNINACGMRNELASTAVIY 323
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 143/364 (39%), Positives = 183/364 (50%), Gaps = 50/364 (13%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L L LL +++A VA N + + Q + FK+ K+Y +
Sbjct: 2 LRLSLLCAIVAVTVAANSHEILRTQ-------------------WEAFKTTHKKSYESHM 42
Query: 69 EHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
E RF++F N L AK V G+ +F DL EF + F G R R
Sbjct: 43 EELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGY-RGQRTSR 101
Query: 125 DAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
+ P ND LP+ DWR GAVT VKDQG CGSCW+FSATG+LEG HFL GELV
Sbjct: 102 GSTFMPPANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELV 161
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQ LVDC ++GC GGLM++AF+YI G++ E+ YPY D C
Sbjct: 162 SLSEQNLVDCSQSFG-------NNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMD-DKC 213
Query: 243 KFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAVGINA--VWMQTYIGGV-SCPYIC 298
+F K + A + F I ED + + GP++V I+A Q Y GV P
Sbjct: 214 RFKKEDVGATDTGFVDIEGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECS 273
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR---NVCGVD 355
+ LDHGVL VGYG K YW++KNSWG +WG+NGY I M R N CG+
Sbjct: 274 SEELDHGVLAVGYGVK-------DGKKYWLVKNSWGGSWGDNGY--ILMSRDKNNQCGIA 324
Query: 356 SMVS 359
S S
Sbjct: 325 SAAS 328
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 174/321 (54%), Gaps = 24/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+KS +K Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 25 DEHWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G R+ + + + N L P DWRD+G VT VKDQG CGSCW+
Sbjct: 84 HEEFRQIMYGYKRKSE--RKFKGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 142 FSTTGAMEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
G++ E YPY GTD C +D +A + F + S E + + GP++V I+A
Sbjct: 195 QGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVSVAIDA 254
Query: 283 --VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C + LDHGVL+VGY GF K YWI+KNSW E WG+
Sbjct: 255 GHESFQFYQSGIYYEKECSSEELDHGVLVVGY---GFEGEDVDGKKYWIVKNSWSEKWGD 311
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 312 KGYIYMAKDRKNHCGIATAAS 332
>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
Length = 323
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 116/326 (35%), Positives = 181/326 (55%), Gaps = 29/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F +F+K Y ++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 21 LLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+
Sbjct: 80 KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGA 135
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQ++DCD D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVG 279
+K GGV+ E DYPY D +C+ + +K V + + I+ E+++ L GP+ +
Sbjct: 187 IKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMA 245
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE
Sbjct: 246 IDAADIVNYKQGI-IKYCFNSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGE 297
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
+G++++ N CG+ + ++S A I+
Sbjct: 298 DGFFRVQQNINACGMRNELASTAVIY 323
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 171/309 (55%), Gaps = 25/309 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRR 111
+K K+ K+Y + E R RV+++NL+ ++ +L G+ ++DL EF
Sbjct: 22 WKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEFMA 81
Query: 112 -QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+ G + + + Q L LP+ DWR+ G VT VKDQG CGSCW+FSATG+L
Sbjct: 82 LKGSGGLLQAKDKSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWTFSATGSL 141
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG HF TG L+SLSEQQLVDC + GCNGGLM SA++YI GGVE E
Sbjct: 142 EGQHFAKTGNLLSLSEQQLVDCAGRYG-------NYGCNGGLMESAYDYIKGVGGVELES 194
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAVGINA--VWMQT 287
YPYT D G CKFD+SK+ A + VI DE + + GP+AV I+A Q
Sbjct: 195 AYPYTARD-GRCKFDRSKVVATCKGYVVIPVGDEQALMQAVGTIGPVAVSIDASGYSFQL 253
Query: 288 YIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y GV C LDHGVL VGYG+ G + YW++KNSWG WG+ GY K+
Sbjct: 254 YESGVYDFRRCSSTNLDHGVLAVGYGTEG-------GQNYWLVKNSWGPGWGDQGYIKMS 306
Query: 347 MGR-NVCGV 354
+ N CG+
Sbjct: 307 KDKNNQCGI 315
>gi|71084306|gb|AAZ23598.1| cysteine protease [Leishmania major]
Length = 327
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 170/311 (54%), Gaps = 25/311 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
A H+ FK + K++ + +RF FK N++ A +P A + V+ KF+DLTP
Sbjct: 11 ASAHYGRFKERHGKSFGEDADEGHRFNAFKQNMQTAYFLNTHNPHAHYDVSGKFADLTPQ 70
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + +L + R D ++ + + L DWR+ AVT VK+QG CGSCW+FS
Sbjct: 71 EFAKLYLNPDYYARRGKDYKEHVHVDDSVLSGAMSVDWREKVAVTPVKNQGMCGSCWAFS 130
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
A G +E L LVSLSEQ LV CD D GCNGGLM+ A E+I++
Sbjct: 131 AIGNIESQWALKNHSLVSLSEQMLVSCD---------DIDDGCNGGLMDQAMEWIIQHHN 181
Query: 224 GGVEREKDYPYTGTDGGSCK-FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
G V E+ YPY G S DK + A +S + + DE +AA + K GP+AV ++A
Sbjct: 182 GTVPTEESYPYASAGGTSPPCHDKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDA 241
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GGV +C G L+HGVL+VG+ + + PYWI+KNSWG +WGE G
Sbjct: 242 TTWQLYFGGVVT--LCFGWSLNHGVLVVGFN-------KRAKPPYWIVKNSWGTSWGEKG 292
Query: 342 YYKICMGRNVC 352
Y ++ MG N C
Sbjct: 293 YIRLAMGSNQC 303
>gi|438000427|ref|YP_007250532.1| v-cath protein [Thysanoplusia orichalcea NPV]
gi|429842964|gb|AGA16276.1| v-cath protein [Thysanoplusia orichalcea NPV]
Length = 323
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 118/326 (36%), Positives = 183/326 (56%), Gaps = 29/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 21 LLKAPNYFEEFVHRFNKNYSSETEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q K IL P P DFDWR VT VK+QG CG+
Sbjct: 80 KDETIAKYTGLS----LPTQTQNFCKVIILDQPPGKGPLDFDWRRLNKVTNVKNQGTCGA 135
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + + +L++LSEQQ++DCD D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQYAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVG 279
+K GGV+ E DYPY + + +K A V + + ++ E+++ L GP+ +
Sbjct: 187 IKMGGVQLESDYPYEANNNNCRM-NGNKFAVRVKDCYRYVTVYEEKLKDLLRVAGPIPMA 245
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y GV Y L+H VL+VGYG P+WI KN+WG +WGE
Sbjct: 246 IDAADIVNYKQGV-IRYCFNSGLNHAVLLVGYGVEN-------NIPFWIFKNTWGTDWGE 297
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
+GY+++ N CG+ + ++S+A I+
Sbjct: 298 DGYFRVQQNINACGMRNELASIATIY 323
>gi|394331824|gb|AFN27131.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 166/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G++E L+ L +LSEQQLV CD + DSGC LM AFE++L+ G
Sbjct: 156 VGSIESQWALAGHRLTALSEQQLVSCDDK---------DSGCRARLMLQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K+GP+++ ++
Sbjct: 207 TMFTEDSYPYVSSTGYVPECSNSIQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGENWGE
Sbjct: 267 ASSFMSYQRGVVTSC---AGMPLNHGVLLVGYNRTG-------EVPYWVIKNSWGENWGE 316
Query: 340 NGYYKICMGRNVC 352
NGY ++ MG N C
Sbjct: 317 NGYVRVTMGVNAC 329
>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
Length = 337
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 114/320 (35%), Positives = 181/320 (56%), Gaps = 23/320 (7%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSE 108
A +F F ++++K Y T++E YR+ +F+ N+ + + +A++ + +F+D+T +E
Sbjct: 36 APLYFEKFIAQYNKKYKTEDEKKYRYNIFRHNMESINHKNSRNDSAIYKINRFADMTKNE 95
Query: 109 FRRQFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
+ GL L A+ + ++ PT FDWR VT VKDQG CG+CW+F+
Sbjct: 96 VVIRHTGLASG-ELGANFCETIVVDGPAQRQRPTSFDWRTLNKVTSVKDQGMCGACWAFA 154
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
GALE + + L+ L+EQQLVDCD S D GC+GGL+++A+E I+ GG
Sbjct: 155 GLGALESQYAIKYDRLIDLAEQQLVDCD---------SVDMGCDGGLIHTAYEQIMHMGG 205
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
VE+E DYPY + C K AA V S + + +E+++ L GP+A+ ++AV
Sbjct: 206 VEQEFDYPYRA-ERQPCALKPHKFAAGVRSCYRYVLLNEERLEDLLRYVGPIAIAVDAVD 264
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
+ Y GG+ + L+H VL+VGYG P+WIIKNSWG ++GE+GY +
Sbjct: 265 LTDYYGGI-VSFCENNGLNHAVLLVGYGVE-------NNVPFWIIKNSWGSDYGEDGYVR 316
Query: 345 ICMGRNVCGVDSMVSSVAAI 364
+ G N CG+ + ++S A +
Sbjct: 317 VRRGVNSCGMINELASSAQV 336
>gi|22549430|ref|NP_689203.1| cath gene product [Mamestra configurata NPV-B]
gi|215401259|ref|YP_002332563.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|22476609|gb|AAM95015.1| putative cysteine proteinase [Mamestra configurata NPV-B]
gi|198448759|gb|ACH88549.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|390165231|gb|AFL64878.1| cathepsin [Mamestra brassicae MNPV]
gi|401665635|gb|AFP95747.1| putative cysteine proteinase [Mamestra brassicae MNPV]
Length = 341
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 116/323 (35%), Positives = 180/323 (55%), Gaps = 35/323 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+F F ++++K Y++++E YR+ +F+ N+ + + +AV+ + +F+D+T +E
Sbjct: 43 YFEKFITQYNKQYSSEDEKKYRYNIFRHNIESINAKNSRNDSAVYKINRFADMTKNEV-- 100
Query: 112 QFLGLNRRLRLPADAQKAPILPT--------NDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+NR L + A T P +FDWR++ VT VKDQG CG+CW+
Sbjct: 101 ----VNRHTGLASGDTGANFCETIVVDGPGQRQRPANFDWRNYNKVTSVKDQGMCGACWA 156
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
F+ GALE + + L+ L+EQQLVDCD D GC+GGL+++A+E I+
Sbjct: 157 FAGLGALESQYAIKYDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMHI 207
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKH-GPLAVGIN 281
GGVE+E DYPY C K A V N + + E+++ +L++H GP+A+ ++
Sbjct: 208 GGVEQEYDYPYKAVR-LPCAVKPHKFAVGVRNCYRYVLLSEERL-EDLLRHVGPIAIAVD 265
Query: 282 AVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
AV + Y GGV + L+H VL+VGYG PYW IKNSWG ++GENG
Sbjct: 266 AVDLTDYYGGV-ISFCENNGLNHAVLLVGYGVE-------NNVPYWTIKNSWGPDYGENG 317
Query: 342 YYKICMGRNVCGVDSMVSSVAAI 364
Y +I G N CG+ + ++S A I
Sbjct: 318 YVRIRRGVNSCGMINELASSAQI 340
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/327 (39%), Positives = 176/327 (53%), Gaps = 29/327 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
+LL E H LFK+ K Y +Q E +R +++ N + + +L + + + K
Sbjct: 25 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N ++P DWR+ GA+T VKDQG
Sbjct: 83 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 142
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALEG F TG+L+SLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
+YI G++ E YPY D C+++ A F + S +ED++ A + GP+
Sbjct: 196 QYIKDNKGIDTENTYPYEAED-DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 254
Query: 277 AVGINAVW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
+V I+A Q Y GV C LDHGVL+VGYGS K YW++KNSW
Sbjct: 255 SVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDN-------GKDYWLVKNSW 307
Query: 334 GENWGENGYYKICMGR-NVCGVDSMVS 359
E+WG+ GY KI R N CGV + S
Sbjct: 308 SEHWGDEGYIKIARNRKNHCGVATAAS 334
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 134/366 (36%), Positives = 193/366 (52%), Gaps = 45/366 (12%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+ S++ LL +VLA V S + +L+AE + +FK +K Y
Sbjct: 1 MKSVVALLFLAVLAMGQTV-----------------SFNKILDAE--WFIFKLHHNKVYK 41
Query: 66 TQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+ E YR +++ N R+ ++ +L + T G+ K+ D+ EF G N+ +
Sbjct: 42 SPVEEGYRMKIYMDNKRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVNTLNGFNKSVT 101
Query: 122 LPADAQKAPIL-PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ + + P N LP + DW GAVT VKDQG CGSCW+FS+TGALEG HF STG
Sbjct: 102 AGIETEGVTFISPANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALEGQHFRSTG 161
Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
LVSLSEQ L+DC SG ++GCNGGLM+ AF+YI G++ EK YPY +
Sbjct: 162 YLVSLSEQNLIDC--------SGKYGNNGCNGGLMDYAFQYIKDNKGLDTEKTYPYE-AE 212
Query: 239 GGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSC- 294
C+++ A + + DE+++ A + GP++V I+A Q Y GV
Sbjct: 213 NDRCRYNPRNSGATDKGYVDIPQGDEEKLKAAVATIGPISVAIDASHESFQLYSEGVYYD 272
Query: 295 PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV-CG 353
P + LDHGVLIVGYG+ YW++KNSWG+ WG+ GY K+ +N CG
Sbjct: 273 PDCSAENLDHGVLIVGYGTD-----ETSGHDYWLVKNSWGKTWGQKGYIKMARNKNNHCG 327
Query: 354 VDSMVS 359
+ S S
Sbjct: 328 IASSAS 333
>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
Length = 350
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 135/358 (37%), Positives = 190/358 (53%), Gaps = 33/358 (9%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
+LL++ A+A D IR V SD E+ ++ H F+ F +++ K Y
Sbjct: 5 TLLIVFFCVATAAAGLSFHDSNPIRMV--SDMEKQLLQVIGESRHAVSFARFANRYGKRY 62
Query: 65 ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--NRRLRL 122
T +E RF++F NL+ + GV F+D T EFR LG N L
Sbjct: 63 DTVDEMKRRFKIFSENLQLIESTNKKRLGYTLGVNHFADWTWEEFRSHRLGAAQNCSATL 122
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
+ + ++ LP + DWR G V+ VKDQG CGSCW+FS TGALE A+ + G+ +
Sbjct: 123 KGNHRITDVV----LPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNI 178
Query: 183 SLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
SLSEQQLVDC +G+ ++ GCNGGL + AFEYI GG+E E+ YPYTG + G
Sbjct: 179 SLSEQQLVDC--------AGAFNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTGQN-GP 229
Query: 242 CKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVSCPYICG 299
CKF +A V + ++ ED++ + P++V V + Y GV CG
Sbjct: 230 CKFTSEDVAVQVLGSVNITLGAEDELKHAVAFARPVSVAFEVVDDFRLYKKGVYTSTTCG 289
Query: 300 KY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGV 354
++H VL VGYG PYW+IKNSWG WG++GY+K+ MG+N+CGV
Sbjct: 290 NTPMDVNHAVLAVGYGIE-------DGVPYWLIKNSWGGEWGDHGYFKMEMGKNMCGV 340
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 123/320 (38%), Positives = 176/320 (55%), Gaps = 22/320 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSD 103
N + ++ +K+ K Y ++E +R V++ N++ + H + F D
Sbjct: 24 NLDARWTRWKAANGKLY-NKDEEVWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
LT EF++ GL +++ P + +LP + P+ DWR+ G VT VKDQG CGSCW+
Sbjct: 83 LTNEEFKQVMNGL--KIQNPREGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC ++GCNGGLM++AF Y+
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AEGNAGCNGGLMDNAFRYVKDN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA- 282
GG++ E+ YPY D G CK+ + AA + F+ I DE+ + ++ GP++V I+A
Sbjct: 194 GGLDSEESYPYLAQD-GRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPISVAIDAS 252
Query: 283 --VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+ Y G P + LDHGVL+VGYGS + K YWI+KNSWG WG
Sbjct: 253 LDTFRFYYKGIYYDPNCSSEDLDHGVLVVGYGSD---EREAENKNYWIVKNSWGTQWGMQ 309
Query: 341 GYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 310 GYILMAKDRGNHCGIATSAS 329
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 48/96 (50%), Gaps = 6/96 (6%)
Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSC-PYICGKYLDHGV 306
AA V+ + E+ + + GP++ I A Q G+ P + LDHGV
Sbjct: 391 AADVTGPVNVPQQEEAVMLAVAAGGPVSAAIRASLGSFQFCKEGIYYDPNCSSEDLDHGV 450
Query: 307 LIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
L+VGYGS + K YWI+KNSWG +WG GY
Sbjct: 451 LVVGYGSD---EREAENKNYWIVKNSWGTDWGLQGY 483
>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
Length = 334
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 174/324 (53%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E+ + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEEALMKAVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG
Sbjct: 249 MDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSE 305
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG GY KI R N CG+ + S
Sbjct: 306 WGMEGYIKIAKDRDNHCGLATAAS 329
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 129/325 (39%), Positives = 175/325 (53%), Gaps = 32/325 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
E F +K KF ++Y T E R +++ N + +L + G+T+F+D+
Sbjct: 24 EMEFHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMD 83
Query: 106 PSEFRRQF-LGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSC 161
E++ LG R A + + + LPT DWRD G VTGVKDQ CGSC
Sbjct: 84 NEEYKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSC 143
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FSATG+LEG +F TG+LVSLSEQQLVDC + + GCNGGLM+ AF+YI
Sbjct: 144 WAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYG-------NMGCNGGLMDYAFKYIQ 196
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGI 280
+ GG++ EK YPY D G C+F + A + + V DED + + GP++VGI
Sbjct: 197 ENGGIDTEKSYPYEAED-GQCRFKPENVGAKCTGYVDVTVGDEDALKEAVATIGPVSVGI 255
Query: 281 NAVW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y GV C + LDHGVL VGYG+ + YW++KNSWG W
Sbjct: 256 DASHSSFQLYDSGVYDEQDCSSQDLDHGVLAVGYGTD-------NGQDYWLVKNSWGLGW 308
Query: 338 GENGYYKICMGR---NVCGVDSMVS 359
G+ GY I M R N CG+ + S
Sbjct: 309 GQEGY--IMMSRNKDNQCGIATAAS 331
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 144/371 (38%), Positives = 190/371 (51%), Gaps = 54/371 (14%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
L LLL S LA+A AV+ I +V + ++ FK + K Y ++ E
Sbjct: 3 LFLLLVSFLAAANAVS-----IFNLVKEE--------------WNAFKLQHRKKYDSESE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRL----R 121
R +++ N + AK Q D V K++DL EF G NR +
Sbjct: 44 ERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSK 103
Query: 122 LPADAQ----KAPIL---PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
L Q + PI P N D+PT DWR+ GAVT VKDQG CGSCWSFSATGALEG
Sbjct: 104 LLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQ 163
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
HF TG+LVSLSEQ LVDC + ++GCNGGLM++AF+Y+ G++ EK YP
Sbjct: 164 HFRKTGKLVSLSEQNLVDCS-------TKYGNNGCNGGLMDNAFQYVKDNKGIDTEKAYP 216
Query: 234 YTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIG 290
Y D C ++ I A F + DE + L GP++V I+A Q Y
Sbjct: 217 YEAID-DECHYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSE 275
Query: 291 GVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
GV C + LDHGVL VGYG++ + YW++KNSWG WG+ GY K+ R
Sbjct: 276 GVYYEPQCDSEQLDHGVLAVGYGTTEDG------EDYWLVKNSWGTTWGDQGYVKMARNR 329
Query: 350 -NVCGVDSMVS 359
N CG+ + S
Sbjct: 330 ENHCGIATTAS 340
>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 142/363 (39%), Positives = 193/363 (53%), Gaps = 50/363 (13%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
LLL+L++V+ S AV+ D + Q +S FK + SK Y ++ E
Sbjct: 3 LLLILAAVVISCQAVSFYDLVQEQ-------------------WSSFKMQHSKNYDSETE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
+R ++F N + AK +L V G+ K++D+ EF G N+ +
Sbjct: 44 ERFRMKIFMENDHKVAKHSKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILK 103
Query: 123 PADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+D A I P N LP DWRD GAVT VKDQG CGSCWSFS +G+LEG HF TG
Sbjct: 104 GSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQHFRKTG 163
Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
+LVSLSEQ LVDC SG ++GCNGGLM++AF YI GG++ E+ YPY D
Sbjct: 164 KLVSLSEQNLVDC--------SGRYGNTGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAED 215
Query: 239 GGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW--MQTYIGGV-SC 294
C + A F I +ED + A + GP+++ I+A + Q Y GV S
Sbjct: 216 -EKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPVSIAIDASYETFQLYSDGVYSD 274
Query: 295 PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCG 353
P + LDHGVL+VGYG+S + YW++KNSW + G NGY K+ + N+CG
Sbjct: 275 PECSSQELDHGVLVVGYGTSDDG------QDYWLVKNSWRPSCGLNGYIKMARNQDNMCG 328
Query: 354 VDS 356
V S
Sbjct: 329 VAS 331
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 132/339 (38%), Positives = 176/339 (51%), Gaps = 53/339 (15%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL---------LDPTAVHGVTK 100
E F + ++ K YAT EE R VF N P+ +
Sbjct: 38 EALFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNA 97
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--------NDLPTDFDWRDHGAVTGV 152
F+DLT EFR LG R+ A A ++P P +P DWR++GAVT V
Sbjct: 98 FADLTHEEFRAARLG---RIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKV 154
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
KDQG+CG+CWSFSATGA+EG + + TG LVSLSEQ+L+DCD S +SGC GGL
Sbjct: 155 KDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDR--------SYNSGCGGGL 206
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
M+ A+++++K GG++ E+DYPY DG K K + +S + S+++ + V
Sbjct: 207 MDYAYKFVVKNGGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVA 266
Query: 273 HGPLAVGINA------VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPY 326
P++VGI ++ Q I CP LDH VLIVGYGS G K Y
Sbjct: 267 QQPVSVGICGSARAFQLYSQQGIFDGPCP----TSLDHAVLIVGYGSEG-------GKDY 315
Query: 327 WIIKNSWGENWGENGYYKICMGRN------VCGVDSMVS 359
WI+KNSWGE+WG GY M RN VCG++ M S
Sbjct: 316 WIVKNSWGESWGMKGYMH--MHRNTGDSKGVCGINMMAS 352
>gi|281200606|gb|EFA74824.1| cysteine proteinase 5 precursor [Polysphondylium pallidum PN500]
Length = 307
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 167/303 (55%), Gaps = 17/303 (5%)
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RRLRL 122
T +E RF +FK N+ + + V G+ +D++ E++R +LG + + R
Sbjct: 9 TAQEFGTRFNIFKKNMDFVHKWNAKGSSTVLGLNSMADISNEEYQRVYLGTHIDASQFRQ 68
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
A + K + DWR GAVT +K+QG CGSCWSFS TG+ EGAHF+ TG LV
Sbjct: 69 QAASHKLG-RTFKVQAANVDWRAKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFIKTGNLV 127
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQ L+DC PE + GCNGGLM +AFEYI+K G++ E YPY DG C
Sbjct: 128 SLSEQNLMDCS---KPEG----NQGCNGGLMTAAFEYIIKNNGIDTESSYPYKAEDGKKC 180
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGK 300
++ + AA +S++ +++ + A GP++V I+A Q Y GV C +
Sbjct: 181 LYNPANSAATLSSYVNVTTGSESDLAVKSGLGPVSVAIDASHNSFQLYSSGVYYEPKCSQ 240
Query: 301 -YLDHGVLIVGYGSSGF--APIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDS 356
LDHGVL+VGYGS A + +WI+KNSWG WG GY + R N CG+ +
Sbjct: 241 TQLDHGVLVVGYGSDALPSAGVSAGSGDWWIVKNSWGTTWGVEGYIYMSRNRNNNCGIAT 300
Query: 357 MVS 359
M S
Sbjct: 301 MAS 303
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 168/321 (52%), Gaps = 31/321 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ K+Y + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINAVW 284
++ EK YPY D G C+F K + A + + I + E + + GP++V I+A
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASH 256
Query: 285 --MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV P + LDHGVL+VGYG G K YW++KNSW E+WG+ G
Sbjct: 257 SSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQG 309
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ S S
Sbjct: 310 Y--ILMSRDNNNQCGIASQAS 328
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 124/316 (39%), Positives = 172/316 (54%), Gaps = 32/316 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRR 111
+K++ K+Y +E R ++AN + V G T +F DL SEF+
Sbjct: 25 WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHN--QHAGVFGYTLKMNQFGDLENSEFKS 82
Query: 112 QFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+ G R P + P +P DLP DW G VT VK+QG CGSCWSFSATG
Sbjct: 83 LYNGY-RMSNAPRKGK--PFVPAARVQDLPASVDWSKKGWVTPVKNQGQCGSCWSFSATG 139
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
++EG HF +TG L+SLSEQ LVDC + + GCNGGLM+ AFEY++K G++
Sbjct: 140 SMEGQHFNATGTLMSLSEQNLVDC-------SAAEGNHGCNGGLMDDAFEYVIKNNGIDT 192
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAVGINA--VWM 285
E YPY D +CKF+ + + A +S + ++ D E + + GP++V I+A +
Sbjct: 193 EASYPYRAVD-STCKFNTADVGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISF 251
Query: 286 QTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y GV P IC LDHGVL VGYG+ G K YW++KNSWG +WG +GY +
Sbjct: 252 QFYSSGVYDPLICSSTNLDHGVLAVGYGTDG-------SKDYWLVKNSWGASWGMSGYIE 304
Query: 345 ICMGR-NVCGVDSMVS 359
+ N CG+ + S
Sbjct: 305 MVRNHNNKCGIATSAS 320
>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
Length = 352
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 132/343 (38%), Positives = 181/343 (52%), Gaps = 29/343 (8%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFKANLR 82
+D IR V SD E+ ++ H F+ F SK+ K Y + EE +RFR+F NL
Sbjct: 25 EDSNPIRLV--SDLEEQVLQVIGQTRHAVSFARFASKYGKRYDSVEEIQHRFRIFSENLE 82
Query: 83 RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFD 142
K + G+ F+DL+ EFR Q LG + L LP + D
Sbjct: 83 LIKSTNKKRLSYKLGLNHFADLSWDEFRTQKLGAAQNCSATLIGNHK--LTDAVLPAEKD 140
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR V+ VKDQ CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +G
Sbjct: 141 WRKESIVSEVKDQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQLVDC--------AG 192
Query: 203 SCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVIS 260
+ ++ GCNGGL + AFEYI GG+ EK+YPYT D +CKF +A V + ++
Sbjct: 193 AFNNFGCNGGLPSQAFEYIKYNGGIALEKEYPYTAKD-EACKFTAENVAVRVLDSVNITL 251
Query: 261 SDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVSCPYICGKY---LDHGVLIVGYGSSGF 316
ED++ + P++V V + Y GV CG ++H VL VGYG
Sbjct: 252 GAEDELKHAVAFARPVSVAFQVVDGFRLYKEGVYTSDTCGNTPMDVNHAVLAVGYGVE-- 309
Query: 317 APIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
PYWIIKNSWG WG++GY+K+ +G+N+CGV + S
Sbjct: 310 -----NNVPYWIIKNSWGSTWGDHGYFKMELGKNMCGVATCAS 347
>gi|55735421|gb|AAV59468.1| cathepsin [Bombyx mori NPV]
Length = 323
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 115/325 (35%), Positives = 181/325 (55%), Gaps = 29/325 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ +LE + +L++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGI 280
K GGV+ E DYPY D +C+ + +K V + + I+ E+++ L GP+ + I
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAI 246
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE+
Sbjct: 247 DAADIVNYKQGI-IKYCFDSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGED 298
Query: 341 GYYKICMGRNVCGVDSMVSSVAAIH 365
G++++ N CG+ + ++S A I+
Sbjct: 299 GFFRVQQNINACGMRNELASTAVIY 323
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 169/317 (53%), Gaps = 22/317 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+K+ K+Y +EE +R +++ NLR + L H G+ +F D+T
Sbjct: 28 HWHLWKNWHKKSYLPKEE-GWRRVLWEKNLRTIEFHNLDHSLGKHSYRLGMNQFGDMTNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EFR+ G + + AP + P DWR+ G VT VKDQG CGSCW+FS T
Sbjct: 87 EFRQLMNGYKNQKMIKGSTFLAP--NNFEAPKTVDWREKGYVTPVKDQGQCGSCWAFSTT 144
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GALEG H+ G+L+SLSEQ LVDC + GCNGGLM+ AF+Y+ GG++
Sbjct: 145 GALEGQHYRKAGKLISLSEQNLVDC-------SRAQGNQGCNGGLMDQAFQYVKDNGGID 197
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--VW 284
E YPYT D C +D + +A + F V S E + + GP++V ++A
Sbjct: 198 SEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSGSEKDLMKAVASVGPVSVAVDAGHKS 257
Query: 285 MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Q Y G+ P + LDHGVL+VGY GF K YWI+KNSW E WG NGY
Sbjct: 258 FQFYQSGIYYDPECSSEDLDHGVLVVGY---GFEGEDVDGKRYWIVKNSWSEKWGNNGYI 314
Query: 344 KICMGR-NVCGVDSMVS 359
KI R N CG+ + S
Sbjct: 315 KIAKDRHNHCGIATAAS 331
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 174/321 (54%), Gaps = 24/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+KS +K Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 25 DEHWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G R+ + + + N L P DWRD+G VT VKDQG CGSCW+
Sbjct: 84 HEEFRQIMNGYKRKSE--RKFKGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 142 FSTTGAMEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
G++ E YPY GTD C +D +A + F + S E + + GP++V I+A
Sbjct: 195 QGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVSVAIDA 254
Query: 283 --VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C + LDHGVL+VGY GF K YWI+KNSW E WG+
Sbjct: 255 GHESFQFYQSGIYYEKECSSEELDHGVLVVGY---GFEGEDVDGKKYWIVKNSWSEKWGD 311
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 312 KGYIYMAKDRKNHCGIATAAS 332
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 176/324 (54%), Gaps = 28/324 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K+Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AF++I+ GG++ E DYPY G D K+ + ++ ++ + +
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQK 253
Query: 270 LVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
V + P++V I A Q Y G+ CG LDHGV VGYG+ K YW
Sbjct: 254 AVANQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE-------NGKDYW 305
Query: 328 IIKNSWGENWGENGYYKICMGRNV 351
I++NSWG++WGE+GY + M RN+
Sbjct: 306 IVRNSWGKSWGESGYVR--MERNI 327
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 136/364 (37%), Positives = 192/364 (52%), Gaps = 35/364 (9%)
Query: 7 SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
S L+L S L ++A D +++ S+ +S D L+ F + S+ K Y T
Sbjct: 6 SKTLVLTCSLCLFLSLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYET 60
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL--RLPA 124
EE RF VFK NL+ R + G+ +F+DL+ EF+ ++LGL L R +
Sbjct: 61 IEEKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVNLSQRRES 120
Query: 125 DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
++ DLP DWR GAVT VK+QG CGSCW+FS A+EG + + TG L SL
Sbjct: 121 SNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSL 180
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
SEQ+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E DYPY + +C+
Sbjct: 181 SEQELIDCD--------TTYNNGCNGGLMDYAFSFIVQNGGLHKEDDYPYI-MEESTCEM 231
Query: 245 DKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKY 301
K + N + + + +Q + + PL+V I A Q Y GGV + CG
Sbjct: 232 KKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGH-CGSD 290
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGVD 355
LDHGV VGYG+S K Y I+KNSWG WGE G+ I M RN +CG+
Sbjct: 291 LDHGVSAVGYGTS-------KNLDYIIVKNSWGAKWGEKGF--IRMKRNIGKPEGICGLY 341
Query: 356 SMVS 359
M S
Sbjct: 342 KMAS 345
>gi|23577865|ref|NP_703114.1| viral cathepsin [Rachiplusia ou MNPV]
gi|37077115|sp|Q8B9D5.1|CATV_NPVR1 RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|23476510|gb|AAN28057.1| viral cathepsin [Rachiplusia ou MNPV]
Length = 323
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 116/326 (35%), Positives = 180/326 (55%), Gaps = 29/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F +F+K Y ++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 21 LLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIIIKNQND-SAKYEINKFSDLS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+
Sbjct: 80 KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGA 135
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQ++DCD D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVG 279
+K GGV+ E DYPY D +C+ + +K V + + I+ E+++ L GP+ +
Sbjct: 187 IKMGGVQLESDYPYEA-DNNNCRMNTNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMA 245
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE
Sbjct: 246 IDAADIVNYKQGI-IKYCFNSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGE 297
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
G++++ N CG+ + ++S A I+
Sbjct: 298 EGFFRVQQNINACGMRNELASTAVIY 323
>gi|327358519|gb|AEA51106.1| cathepsin F, partial [Oryzias melastigma]
Length = 255
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 121/267 (45%), Positives = 160/267 (59%), Gaps = 22/267 (8%)
Query: 99 TKFSDLTPSEFRRQFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
TKFSDLT EF +L L + L + + AP + + +DWRDHGAV+ VK+QG
Sbjct: 4 TKFSDLTEEEFHSAYLNPLLSQWTLHREMKPAPPAKSPAPDS-WDWRDHGAVSPVKNQGM 62
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TG +EG FL G L+SLSEQ+LVDCD D C GGL ++A+
Sbjct: 63 CGSCWAFSVTGNIEGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAY 113
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E I K GG+E E DY YTG C F K+AA +++ + DE ++AA L ++GP++
Sbjct: 114 EAIEKLGGLETETDYSYTGKK-QRCDFTNRKVAAYINSSVELPKDEKEIAAWLAENGPIS 172
Query: 278 VGINAVWMQTYIGGVSCPY--ICGKYL-DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
V +NA MQ Y GVS P+ C ++ DH VL+VGYG P+W IKNSWG
Sbjct: 173 VALNAFAMQFYKKGVSHPWKIFCNPWMIDHAVLLVGYGER-------NGIPFWAIKNSWG 225
Query: 335 ENWGENGYYKICMGRNVCGVDSMVSSV 361
E++GE GYY + G N CG++ M SS
Sbjct: 226 EDYGEQGYYYLHRGSNACGINKMGSSA 252
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 130/325 (40%), Positives = 182/325 (56%), Gaps = 35/325 (10%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA-VHGVTKFSDLTP 106
N F ++ ++ K+Y++ EE YR VF N LD ++ + ++DLT
Sbjct: 24 NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83
Query: 107 SEFRRQFLGLNRRLR--LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF+ LG + LR P Q+ P LP D+P DWR GAVT VKDQG+CG+CWSF
Sbjct: 84 HEFKVSRLGFSPALRNFRPVLPQE-PSLP-RDVPDSLDWRKKGAVTAVKDQGSCGACWSF 141
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SATGA+EG + + TG L+SLSEQ+L+DCD S +SGC GGLM+ A+++++
Sbjct: 142 SATGAMEGINQIMTGSLISLSEQELIDCDR--------SYNSGCGGGLMDYAYQFVISNH 193
Query: 225 GVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--N 281
G++ E DYPY D GSC+ DK + + ++ I S+++ V P++VGI +
Sbjct: 194 GIDTENDYPYQARD-GSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGS 252
Query: 282 AVWMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y G+ S P C LDH VLIVGYGS YWI+KNSWG++WG +
Sbjct: 253 ERAFQLYSKGIFSGP--CSTSLDHAVLIVGYGSENGV-------DYWIVKNSWGKSWGMD 303
Query: 341 GYYKICMGRN------VCGVDSMVS 359
GY M RN VCG++ + S
Sbjct: 304 GYMH--MQRNSGNSEGVCGINKLAS 326
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 171/309 (55%), Gaps = 41/309 (13%)
Query: 62 KTYATQEEHDYRFRVFKANL--------RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
K Y + E+ RF++FK N+ RR L G+ KF+DLT SEFR +
Sbjct: 47 KAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSL-------GLNKFADLTNSEFRGLY 99
Query: 114 LGLNRRLRLPADAQK-APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
+G RL+ PA + I D T DWR G VT +KDQG CGSCW+FSA A+EG
Sbjct: 100 VG---RLQRPAPFHEVGDIALVADTATSVDWRKKGGVTEIKDQGDCGSCWAFSAVAAVEG 156
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
FLSTG LVSLSEQ+LVDCD + + GC+GG+M+ AF+Y+++ GG+ + +Y
Sbjct: 157 LTFLSTGTLVSLSEQELVDCDT--------TVNQGCDGGIMDYAFQYMIRNGGITSQSNY 208
Query: 233 PYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYI 289
PY G+C DK K AA ++ F I +++ V + P++V I A Q Y
Sbjct: 209 PYRALR-GACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYS 267
Query: 290 GGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM-- 347
GV CG LDHGV IVGYG+ + YW++KNSWG WGE+GY ++
Sbjct: 268 SGVFTGE-CGSNLDHGVAIVGYGTDAGG------RQYWLVKNSWGSGWGESGYVRMERQG 320
Query: 348 -GRNVCGVD 355
G VCG++
Sbjct: 321 PGAGVCGIN 329
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 124/322 (38%), Positives = 175/322 (54%), Gaps = 29/322 (9%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTP 106
A + L+ ++ ++Y EH+ RFRVF NLR A + D G+ +F+DLT
Sbjct: 50 ARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTN 109
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EFR FLG R A ++ +LP DWR+ GAV VK+QG CGSCW+FSA
Sbjct: 110 EEFRATFLGAKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
+E + L TGE+++LSEQ+LV+C + +SGCNGGLM+ AF++I+K GG+
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVEC-------STNGQNSGCNGGLMDDAFDFIIKNGGI 222
Query: 227 EREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--V 283
+ E DYPY D G C ++ ++ F + ++++ V H P++V I A
Sbjct: 223 DTEDDYPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGR 281
Query: 284 WMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Q Y GV CG LDHGV+ VGYG+ K YWI++NSWG WGE+GY
Sbjct: 282 EFQLYHSGVFSGR-CGTSLDHGVVAVGYGTD-------NGKDYWIVRNSWGPKWGESGYV 333
Query: 344 KICMGRNV------CGVDSMVS 359
+ M RN+ CG+ M S
Sbjct: 334 R--MERNINVTTGKCGIAMMAS 353
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 176/324 (54%), Gaps = 28/324 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K+Y E + R+ F+ NLR
Sbjct: 26 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 82
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 83 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 142
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 143 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 194
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AF++I+ GG++ E DYPY G D K+ + ++ ++ + +
Sbjct: 195 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQK 254
Query: 270 LVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
V + P++V I A Q Y G+ CG LDHGV VGYG+ K YW
Sbjct: 255 AVANQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE-------NGKDYW 306
Query: 328 IIKNSWGENWGENGYYKICMGRNV 351
I++NSWG++WGE+GY + M RN+
Sbjct: 307 IVRNSWGKSWGESGYVR--MERNI 328
>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
Length = 347
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 172/318 (54%), Gaps = 31/318 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K K+ K Y +EE+DY F N+ +L T G+ +DL SE+R+
Sbjct: 43 YKIKYDKHYDPEEENDY-MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRK 101
Query: 112 QFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
L R RL D+ + ++P N +P DWR+H VT VK+QG CGSCW+FSA
Sbjct: 102 --LNGYRHRRLFGDSMRKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSA 159
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF +TG+LVSLSEQ LVDC + + GCNGGLM+ AFEYI G+
Sbjct: 160 TGALEGQHFRATGKLVSLSEQNLVDC-------STKYGNHGCNGGLMDLAFEYIKDNHGI 212
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--V 283
+ E+ YPY G + C F K I A F + DED + + GP+++ I+A
Sbjct: 213 DTEEGYPYVGKE-MRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPISIAIDAGHR 271
Query: 284 WMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GV C + LDHGVL+VGYG+ A YWIIKNSWG WGE GY
Sbjct: 272 SFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAG------DYWIIKNSWGTKWGEKGY 325
Query: 343 YKICMGRNV-CGVDSMVS 359
+I RN CGV + S
Sbjct: 326 VRIARNRNNHCGVATKAS 343
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 135/347 (38%), Positives = 185/347 (53%), Gaps = 34/347 (9%)
Query: 27 DDAMIRQVVPSDGEQSED---HLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
D+ IRQVV + E H++ H F+ F ++ K Y + EE RF +F N
Sbjct: 27 DENPIRQVVSDSFHELESGILHVVGQTRHALSFARFARRYGKRYDSVEEIKQRFDIFLDN 86
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
L + GV +FSDLT EFRR LG + A + L LP
Sbjct: 87 LEMINSHNDKGLSYKLGVNEFSDLTWDEFRRDRLGAAQNC--SATTKGNLKLRDAVLPET 144
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
DWR+ G V+ VK+QG CGSCW+FS TGALE A+ G+ +SLSEQQLVDC
Sbjct: 145 KDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYTQKFGKGISLSEQQLVDC-------- 196
Query: 201 SGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS---NF 256
+G+ ++ GCNGGL + AFEYI GG+E E+ YPYTG + G CKF + V+ N
Sbjct: 197 AGAFNNFGCNGGLPSQAFEYIKSNGGLETEEAYPYTGKN-GLCKFSSQNVGVKVTDSVNI 255
Query: 257 SVISSDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVSCPYICGKY---LDHGVLIVGYG 312
++ + DE + A LV+ P++V V + Y GV CG ++H VL VGYG
Sbjct: 256 TLGAEDELKYAVALVR--PVSVAFEVVKGFKQYKSGVYTSTECGTTPMDVNHAVLAVGYG 313
Query: 313 SSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
P+W+IKNSWG +WG+N Y+K+ MG ++CG+ + S
Sbjct: 314 VE-------YGVPFWLIKNSWGADWGDNAYFKMEMGNDMCGIATCAS 353
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 120/310 (38%), Positives = 174/310 (56%), Gaps = 26/310 (8%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG-- 115
K K Y + RF +FK NLR + + ++ + G+ KF+DL+ E++ FLG
Sbjct: 13 KHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLGGR 72
Query: 116 -LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
+ R +D K + ++LP DWR+ GAV VKDQG CGSCW+FS A+EG +
Sbjct: 73 MVRDRKGFESDRFKYGV--GDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGIN 130
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
++TG+L+SLSEQ+LVDCD + GCNGG M+ AFE+I+K GG++ E DYPY
Sbjct: 131 QIATGDLISLSEQELVDCDK--------GFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPY 182
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGV 292
G DG + K+ ++ F + ++++ V H P++V I A Q Y G+
Sbjct: 183 KGVDGQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGI 242
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+CG LDHGV+ VGYG+ K YWI++NSWG NWGENGY + + RNV
Sbjct: 243 F-NGLCGTDLDHGVVAVGYGTE-------DGKDYWIVRNSWGPNWGENGYIR--LERNVA 292
Query: 353 GVDSMVSSVA 362
++ +A
Sbjct: 293 STNTGKCGIA 302
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 171/321 (53%), Gaps = 28/321 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +K SK Y EE +R +++ NL++ + L +H G+ F D+T
Sbjct: 28 HWDQWKKWHSKKYHATEE-GWRRVIWEKNLKKIEMHNLEHSMGIHTYRLGMNHFGDMTHE 86
Query: 108 EFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G +RR R + I ++P DWR+ G VT VKDQG CGSCW+
Sbjct: 87 EFRQVMNGFKHKKDRRFRGSLFMEPNFI----EVPNKLDWREKGYVTPVKDQGECGSCWA 142
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGALEG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+
Sbjct: 143 FSTTGALEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDQ 195
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
G++ E+ YPY GTD C FD AA + F + S E + + GP++V I+A
Sbjct: 196 NGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDA 255
Query: 283 --VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C + LDHGVL VGY GF K YWI+KNSW ENWG+
Sbjct: 256 GHESFQFYQSGIYYEKECSSEELDHGVLAVGY---GFEGEDVDGKKYWIVKNSWSENWGD 312
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 313 KGYIYMAKDRHNHCGIATAAS 333
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 169/323 (52%), Gaps = 27/323 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK+++ + Y +E YR RVF+ N + K+ + + T + +F
Sbjct: 5 LATASPSWEHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQF 64
Query: 102 SDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
D+T EF G + R P A P + D DWR GAVT VKDQG CGS
Sbjct: 65 GDMTNEEFNAVMKGYKKGSRGEPTTVFTAEGRP---MAADVDWRTKGAVTPVKDQGQCGS 121
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FSATG+LEG HFL ELVSLSEQ+LVDC E + GC GG M SAF+YI
Sbjct: 122 CWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYG-------NDGCGGGWMTSAFDYI 174
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG++ E YPY D SC+FD + I A + F + E+ + + GP++V I
Sbjct: 175 KDNGGIDTESSYPYEAQD-RSCRFDANSIGATCTGFVEVQHTEEALHEAVSDIGPISVAI 233
Query: 281 NA--VWMQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y GV C LDHGVL VGYG+ + YW++KNSWG W
Sbjct: 234 DASHFSFQFYSSGVYYEKKCSPTNLDHGVLAVGYGTE-------STEDYWLVKNSWGSGW 286
Query: 338 GENGYYKICMGR-NVCGVDSMVS 359
G+ GY K+ R N CG+ S S
Sbjct: 287 GDAGYIKMSRNRDNNCGIASEPS 309
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 130/359 (36%), Positives = 190/359 (52%), Gaps = 32/359 (8%)
Query: 11 LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEH 70
LS LA +++ D + QV E++E L + ++ K+ K Y E
Sbjct: 14 FYFLSVCLAIDMSIIDYNLKHGQVP----ERTEAETLRL---YEMWLVKYGKAYNALGEK 66
Query: 71 DYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLPADAQ 127
+ RF +FK NL+ + + +P+ G+ KF+DL+ E+R +LG ++ + RL +
Sbjct: 67 ERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGPK 126
Query: 128 KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A L +DLP DWR+ GAV VKDQG CGSCW+FS GA+EG + + TG L SLS
Sbjct: 127 SARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLS 186
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQ+LVDCD + GCNGGLM+ AFE+I+K GG++ E+DYPY D
Sbjct: 187 EQELVDCDK--------VYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNR 238
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLD 303
K+ + + + ++++ V + P++V I A Q Y GV CG LD
Sbjct: 239 KNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTG-SCGTQLD 297
Query: 304 HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
HGV+ VGYG+ YW+++NSWG WGENGY + M RNV ++ +A
Sbjct: 298 HGVVAVGYGTENGV-------DYWVVRNSWGPAWGENGYIR--MERNVASTETGKCGIA 347
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 173/324 (53%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDYAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG
Sbjct: 249 MDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSE 305
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG GY KI R N CG+ + S
Sbjct: 306 WGMEGYIKIAKDRDNHCGLATAAS 329
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 173/324 (53%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANGTGFVDIPQQEKALMKAVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG
Sbjct: 249 MDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSE 305
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG GY KI R N CG+ + S
Sbjct: 306 WGMEGYIKIAKDRDNHCGLATAAS 329
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 174/313 (55%), Gaps = 27/313 (8%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTP 106
+ + L+K+ + K+Y T EE YR ++ N K + HG T F DLT
Sbjct: 25 NEWELWKATYGKSYLTLEEEKYRRDTWEENSLLIKTHNT--DSDKHGYTLEMNSFGDLTS 82
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+EF + G + L + + N +P+ DWRD VT VK+QG CGSCW+FS
Sbjct: 83 AEFSSLYNGYRQNLETSGSVFSSSL--RNAMPSSLDWRDKKVVTDVKNQGKCGSCWAFST 140
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG+LEG H L TG LVSLSEQQL+DC + ++GC+GG M SAF+YI AGG
Sbjct: 141 TGSLEGLHALKTGHLVSLSEQQLMDCSVKYG-------NNGCDGGNMRSAFQYIKDAGGD 193
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINA--V 283
+ E+ YPYT + SC+FD K+ A + I S DE + L + GP++V ++A
Sbjct: 194 DTEESYPYTAKN-ESCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMDAGLK 252
Query: 284 WMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y G+ Y+C +L+HGV ++GYG S PYW++KNSWG++WG +GY
Sbjct: 253 TFQFYKKGIYSDYLCSNTHLNHGVTLIGYGESSDGS------PYWLVKNSWGKDWGIDGY 306
Query: 343 YKIC-MGRNVCGV 354
+ + N+CGV
Sbjct: 307 FMLARYVGNMCGV 319
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 173/324 (53%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG
Sbjct: 249 MDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSE 305
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG GY KI R N CG+ + S
Sbjct: 306 WGMEGYIKIAKDRDNHCGLATAAS 329
>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
Length = 352
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 172/318 (54%), Gaps = 31/318 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K K+ K Y +EE+DY F N+ +L T G+ +DL SE+R+
Sbjct: 48 YKIKYDKHYDPEEENDY-MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRK 106
Query: 112 QFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
L R RL D+ + ++P N +P DWR+H VT VK+QG CGSCW+FSA
Sbjct: 107 --LNGYRHRRLFGDSMRKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSA 164
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF +TG+LVSLSEQ LVDC + + GCNGGLM+ AFEYI G+
Sbjct: 165 TGALEGQHFRATGKLVSLSEQNLVDCS-------TKYGNHGCNGGLMDLAFEYIKDNHGI 217
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--V 283
+ E+ YPY G + C F K I A F + DED + + GP+++ I+A
Sbjct: 218 DTEEGYPYVGKE-MRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPISIAIDAGHR 276
Query: 284 WMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GV C + LDHGVL+VGYG+ A YWIIKNSWG WGE GY
Sbjct: 277 SFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAG------DYWIIKNSWGTKWGEKGY 330
Query: 343 YKICMGRNV-CGVDSMVS 359
+I RN CGV + S
Sbjct: 331 VRIARNRNNHCGVATKAS 348
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 179/326 (54%), Gaps = 35/326 (10%)
Query: 51 HHFSLFK------SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
HH L K +K+ K YA+ EE +RF VFK NL T G+ F+DL
Sbjct: 58 HHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTTYWLGLNAFADL 117
Query: 105 TPSEFRRQFLGLNR-RLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T EF+ +LGL + + D++ + + +D+P DWR GAVT VK+QG CGSCW
Sbjct: 118 THDEFKATYLGLRQPETKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSCW 177
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS A+EG + + TG L SLSEQ+LVDC S ++GCNGG+M++AF YI
Sbjct: 178 AFSTVAAVEGINQIVTGNLTSLSEQELVDC--------STDGNNGCNGGVMDNAFSYIAS 229
Query: 223 AGGVEREKDYPYTGTDGGSC--KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI 280
+GG+ E+ YPY + G C K + +S + + ++++Q + H PL+V I
Sbjct: 230 SGGLRTEEAYPYL-MEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAI 288
Query: 281 NAV--WMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
A Q Y GGV + P CG LDHGV VGYGSS K + Y I+KNSWG +W
Sbjct: 289 EASGRHFQFYSGGVFNGP--CGSELDHGVAAVGYGSS-------KGQDYIIVKNSWGSHW 339
Query: 338 GENGYYKICMG----RNVCGVDSMVS 359
GE GY ++ G +CG++ M S
Sbjct: 340 GEKGYIRMKRGTGKPEGLCGINKMAS 365
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 132/305 (43%), Positives = 169/305 (55%), Gaps = 35/305 (11%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LNR-RLRLPAD 125
E R+ +FK NLR + G+ F+DLT EFR Q G +R R R +
Sbjct: 81 EKATRYGIFKDNLRFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSYE 140
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ + DLP DWR+ GAV GVKDQG+CGSCW+FSA A+EG + L+TGELVSLS
Sbjct: 141 EFRYGSVQLKDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLS 200
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQ+LVDCD D GCNGGLM+ AF +++K GG++ E DYPY G G C D
Sbjct: 201 EQELVDCDK--------GEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGY-GTRC--D 249
Query: 246 KSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGK 300
+SK+ A V + + +++ V H P++V I+A MQ Y G+ CG
Sbjct: 250 RSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGR-CGT 308
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGV 354
LDHGV VGYG + K YWIIKNSWG NWGE GY K M RN +CG+
Sbjct: 309 DLDHGVTNVGYG-------KEDGKAYWIIKNSWGSNWGEKGYIK--MARNTGLAAGLCGI 359
Query: 355 DSMVS 359
+ S
Sbjct: 360 NMEAS 364
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 175/324 (54%), Gaps = 28/324 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AF++I+ GG++ E DYPY G D K+ + ++ ++ + +
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQK 253
Query: 270 LVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
V + P++V I A Q Y G+ CG LDHGV VGYG+ K YW
Sbjct: 254 AVANQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE-------NGKDYW 305
Query: 328 IIKNSWGENWGENGYYKICMGRNV 351
I++NSWG++WGE+GY + M RN+
Sbjct: 306 IVRNSWGKSWGESGYVR--MERNI 327
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 132/305 (43%), Positives = 169/305 (55%), Gaps = 35/305 (11%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LNR-RLRLPAD 125
E R+ +FK NLR + G+ F+DLT EFR Q G +R R R +
Sbjct: 81 EKATRYGIFKDNLRFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSHE 140
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ + DLP DWR+ GAV GVKDQG+CGSCW+FSA A+EG + L+TGELVSLS
Sbjct: 141 EFRYGSVQLKDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLS 200
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQ+LVDCD D GCNGGLM+ AF +++K GG++ E DYPY G G C D
Sbjct: 201 EQELVDCDK--------GEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGY-GTRC--D 249
Query: 246 KSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGK 300
+SK+ A V + + +++ V H P++V I+A MQ Y G+ CG
Sbjct: 250 RSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGR-CGT 308
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGV 354
LDHGV VGYG + K YWIIKNSWG NWGE GY K M RN +CG+
Sbjct: 309 DLDHGVTNVGYG-------KEDGKAYWIIKNSWGSNWGEKGYVK--MARNTGLAAGLCGI 359
Query: 355 DSMVS 359
+ S
Sbjct: 360 NMEAS 364
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 173/319 (54%), Gaps = 23/319 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+KS SK Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 HWDLWKSWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHPYRLGMNHFGDMTHE 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G +R + + + + N L P DWRD G VT VKDQG CGSCW+FS
Sbjct: 86 EFRQIMNGYKQR-KTERKFKGSLFMEPNFLEAPRALDWRDKGYVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ G
Sbjct: 145 TTGALEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA-- 282
++ E YPY GTD C +D + +A + F V S E + + GP++V I+A
Sbjct: 198 LDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDVPSGKERALMKAVAAVGPVSVAIDAGH 257
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ C + LDHGVL+VGYG G K YWI+KNSW E WG+ G
Sbjct: 258 ESFQFYQSGIYYEKDCSSEELDHGVLVVGYGYEG---EDVDGKKYWIVKNSWSEKWGDKG 314
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y + R N CG+ + S
Sbjct: 315 YIYMAKDRKNHCGIATAAS 333
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 126/310 (40%), Positives = 173/310 (55%), Gaps = 29/310 (9%)
Query: 58 SKFSKTYA-TQEEH-DYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
S+ + YA QE+H + RF VFK N+ R + T + +F+DLT EFR + G
Sbjct: 42 SQHGRVYADEQEDHKNKRFNVFKENVERIEEFND-GKTFKLAINQFADLTNEEFRASYNG 100
Query: 116 LNRRLRLPADAQK-APILPTN---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ L + K P N LP DWR GAVT VK+QG CG CW+FSA A+E
Sbjct: 101 FKGPMVLSSQITKPTPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIE 160
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G +STG+L+SLSEQ+LVDCD + D GC GGLM++AFE+I+ GG+ E +
Sbjct: 161 GITQISTGKLISLSEQELVDCD-------TKGIDHGCEGGLMDTAFEFIINNGGLTTESN 213
Query: 232 YPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTY 288
YPY G D G+C F+K+ IA +++ + + ++++Q V H P++V I A Q Y
Sbjct: 214 YPYKGED-GTCNFNKTNPIAVSITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFY 272
Query: 289 IGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK---- 344
GV CG LDH V VGYG S YWI+KNSWG WGE+GY +
Sbjct: 273 SSGVFTGE-CGTELDHAVTAVGYGESE------DGSKYWIVKNSWGTKWGESGYIEMQKD 325
Query: 345 ICMGRNVCGV 354
I + + +CG+
Sbjct: 326 IKVKQGLCGI 335
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 115/294 (39%), Positives = 166/294 (56%), Gaps = 23/294 (7%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K Y E D RF +FK NLR + T G+ +F+DLT E+R ++LG
Sbjct: 10 KHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEYRARYLGTRI 69
Query: 117 --NRR-LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
NRR ++ + + ++LP DWR+ AV VKDQG CGSCW+FS GA+EG
Sbjct: 70 DPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAFSTIGAVEGI 129
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TG+L+SLSEQ+LVDCD S + GCNGGLM+ A+E+I+ GG++ E+DYP
Sbjct: 130 NKIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAYEFIINNGGIDSEEDYP 181
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIGG 291
Y DG ++ K+ + ++ + ++++ V + P++V I Q Y+ G
Sbjct: 182 YRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYVSG 241
Query: 292 VSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
V CG LDHGV+ VGYGS K YWI++NSWG +WGE GY ++
Sbjct: 242 VFTGR-CGTALDHGVVAVGYGS-------VKGHDYWIVRNSWGASWGEEGYVRL 287
>gi|394331818|gb|AFN27128.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 125/313 (39%), Positives = 165/313 (52%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G++E L+ L +LSE LV C + +SGC GGLM AFE++L+ G
Sbjct: 156 VGSIESQWALAGHRLTALSEHHLVSCHDK---------NSGCTGGLMLQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E MAA L K+GP+++ ++
Sbjct: 207 TMFTEDSYPYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGENWGE
Sbjct: 267 ASSFMSYQSGVLTSC---AGISLNHGVLLVGYNRTG-------EVPYWVIKNSWGENWGE 316
Query: 340 NGYYKICMGRNVC 352
NGY ++ MG N C
Sbjct: 317 NGYVRVTMGVNAC 329
>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 123/316 (38%), Positives = 168/316 (53%), Gaps = 21/316 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPS 107
++ ++S + K YA EE D+R V++ N++ +R HG T F D+T
Sbjct: 28 QWNQWRSTYKKPYAVNEE-DWRRAVWEKNVKMIERHNQEYSQGKHGFTMAMNAFGDMTNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EFR+ G + P+ +PT DW G VT VK+QG CGSCW+FSAT
Sbjct: 87 EFRQVMNGFQNQKHKKGKLFYEPVF--GHIPTSVDWTQKGYVTPVKNQGQCGSCWAFSAT 144
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF+Y+ GG++
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSRR-------EGNEGCNGGLMDNAFQYVQDNGGLD 197
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWM 285
E+ YPY TD +C + AA + F I E + + GP++V I+A
Sbjct: 198 SEESYPYLATDTHTCNYKPECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHESF 257
Query: 286 QTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y G+ P K LDHGVL+VGY GF + +WI+KNSWG +WG NGY K
Sbjct: 258 QFYKSGIYYEPGCSSKDLDHGVLLVGY---GFEGKDSENNKFWIVKNSWGTSWGTNGYVK 314
Query: 345 ICMGRNV-CGVDSMVS 359
+ +N CG+ + S
Sbjct: 315 MAKDQNNHCGIATAAS 330
>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
Length = 324
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 180/326 (55%), Gaps = 28/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F KF+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 21 LLKAPSYFEEFLHKFNKNYSSESEKLRRFKIFQHNLEEIINKNQNDTSAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGLS----LPLQKQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + +L++LSEQQL+DCD D GC+GGL+++A+E +
Sbjct: 137 CWAFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDVGCDGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVG 279
+ GG++ E DYPY + G C+ + +K V + ++ E+++ L GP+ V
Sbjct: 188 MNMGGIQAENDYPYEANN-GPCRVNAAKFVVRVKKCYRYVTLFEEKLKDLLRIVGPIPVA 246
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y G+ Y L+H VL+VGYG P+WI+KN+WG +WGE
Sbjct: 247 IDASDIVGYKRGI-IRYCENHGLNHAVLLVGYGVENGI-------PFWILKNTWGADWGE 298
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
GY+++ N CG+ + + S A I+
Sbjct: 299 QGYFRVQQNINACGIKNELPSSAEIY 324
>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 174/314 (55%), Gaps = 25/314 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSE 108
+H F+ F +K+ K+Y T+EE+D+R ++FK NL + + D T G+ KF+D T +E
Sbjct: 40 DHAFTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRLGLNKFADYTEAE 99
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
++R LG + K P ND +W + GAVT VKDQG CGSCWSFSATG
Sbjct: 100 YKR-LLGFGGQKNKNPRNIKVLGAPKND---GVNWVEQGAVTPVKDQGQCGSCWSFSATG 155
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EG + G L SLSEQQLVDC + GC GG M+ AF+Y+ + +E
Sbjct: 156 AMEGHAKIQFGTLYSLSEQQLVDCSQ-------AEGNEGCGGGWMDQAFQYVEQT-ALET 207
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWM--Q 286
E YPY D +C+ + + S V ++ +++ A L K GP++V I A M Q
Sbjct: 208 EDQYPYEAVD-DTCRASSAGVVKVDSFVDVTPNNVNELKAALDK-GPVSVAIEADQMVFQ 265
Query: 287 TYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y GGV CG LDHGVL VGYG+ + Y+++KNSWG +WGE GY KI
Sbjct: 266 FYSGGVINDASCGTTLDHGVLAVGYGNE-------SGQDYFLVKNSWGASWGEEGYVKIA 318
Query: 347 MG-RNVCGVDSMVS 359
N+CG+ S S
Sbjct: 319 ASPDNICGILSQAS 332
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 128/361 (35%), Positives = 192/361 (53%), Gaps = 39/361 (10%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
S+ LL S++L + A++ +++ R + D ++ A + L + K+Y +
Sbjct: 11 SMSLLFFSTLLILSSALDIKNSVQR---------TNDQVM-AMYESWLVEQ--GKSYNSL 58
Query: 68 EEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA 126
+E + RF +FK NLR + + G+ +F+DLT E+R +LG +
Sbjct: 59 DEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSN 118
Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
+ P + LP DWR GAV GVKDQG C SCW+FSA A+EG + + TG L+SLSE
Sbjct: 119 RYVPKVGV-VLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSE 177
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
Q+LVDC GCN G MN AF++I+ GG+ E +YPYT DG + K
Sbjct: 178 QELVDCGRTQRTR-------GCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRK 230
Query: 247 SKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDH 304
++ + N+ + ++ + + N V + P+ VG+ + + Y G+ Y CG +DH
Sbjct: 231 NQRYVTIDNYEQLPANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGY-CGTAIDH 289
Query: 305 GVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV-----CGVDSMVS 359
GV IVGYG+ + YWI+KNSWG NWGENGY +I RN+ CG+ +MV
Sbjct: 290 GVTIVGYGTE-------RGLDYWIVKNSWGTNWGENGYIRI--QRNIGGAGKCGI-AMVP 339
Query: 360 S 360
S
Sbjct: 340 S 340
>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
Length = 333
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 130/334 (38%), Positives = 174/334 (52%), Gaps = 31/334 (9%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
+PSD + + H+ FK+ +KTYA E YR +VFK N +R AK
Sbjct: 18 IPSD--------MEIQAHWESFKATHAKTYANAAEEAYRAKVFKENAIRIAKHNDRFASG 69
Query: 94 AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
V G +++D+ E + G L+ + + DWR GAVT
Sbjct: 70 EVTFKVGYNQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVT 129
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
+KDQG CGSCWSFSATG+LEG FL LVSLSEQ LVDC + E GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
GLM+SAFEY+ GG++ E+ YPYT D G+C + + A + + V + E +
Sbjct: 183 GLMDSAFEYVKSYGGIDTEESYPYTAED-GTCLYKAANNAGVNTGYKDVQAKSESALRDA 241
Query: 270 LVKHGPLAVGINAV-W-MQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPY 326
+ K GP++V I+A W Q Y G+ C LDHGVL VGYGS + K +
Sbjct: 242 VEKVGPVSVAIDASNWSFQMYTSGIYYEPACSSDSLDHGVLAVGYGS------EWPNKEF 295
Query: 327 WIIKNSWGENWGENGYYKICMG-RNVCGVDSMVS 359
WI+KNSWG +WGE GY K+ +N CG+ + S
Sbjct: 296 WIVKNSWGTSWGEEGYIKMARNKKNNCGIATEAS 329
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 176/331 (53%), Gaps = 30/331 (9%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAV---HG 97
S +L AE +S FK+K K+Y ++ E +R +++ N + AK + V
Sbjct: 18 SYQEVLGAE--WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMA 75
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVK 153
+ +F D+ EF G R + + P N LP DWR GAVT VK
Sbjct: 76 MNEFGDMLHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVK 135
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+QG CGSCW+FSATG+LEG HF +G +VSLSEQ LVDC + ++GC GGLM
Sbjct: 136 NQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFG-------NNGCEGGLM 188
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
++AF+YI G++ EK YPY GTD G+C F KS + A S F + E Q+ +
Sbjct: 189 DNAFKYIRANKGIDTEKSYPYNGTD-GTCHFKKSTVGATDSGFVDIKEGSETQLKKAVAT 247
Query: 273 HGPLAVGINAVW--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
GP++V I+A Q Y GV P + LDHGVL+VGYG+ YW++
Sbjct: 248 VGPISVAIDASHESFQFYSDGVYDEPECDSESLDHGVLVVGYGT-------LNGTDYWLV 300
Query: 330 KNSWGENWGENGYYKICMG-RNVCGVDSMVS 359
KNSWG WG+ GY ++ +N CG+ S S
Sbjct: 301 KNSWGTTWGDEGYIRMSRNKKNQCGIASSAS 331
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 162/293 (55%), Gaps = 26/293 (8%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ YA E + R+ VFK N+ R +R + T V +F+DLT EFR + G
Sbjct: 47 RVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGN 106
Query: 120 LRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
L + + + ++ LP DWR GAVT +KDQG CGSCW+FSA A+EG
Sbjct: 107 SVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQ 166
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ G+L+SLSEQ+LVDCD + D GC GGLM++AF Y + GG+ E +YPY
Sbjct: 167 IKKGKLISLSEQELVDCD---------TNDGGCMGGLMDTAFNYTITIGGLTSESNYPYK 217
Query: 236 GTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGV 292
T+ G+C F+K+K IA ++ F + +++++ V H P+++GI + Q Y GV
Sbjct: 218 STN-GTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGV 276
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
C +LDHGV VGYG S YWI+KNSWG WGE GY +I
Sbjct: 277 FSGE-CTTHLDHGVTAVGYGRSK------NGLKYWILKNSWGPKWGERGYMRI 322
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/323 (38%), Positives = 179/323 (55%), Gaps = 31/323 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSE 108
++ FK K Y ++ E +R ++F N + ++ +L + + G+ K+ D+ E
Sbjct: 28 WNTFKVTHRKAYDSKIEESFRMKIFMENWHKIALHNQKYELNEVSYKLGMNKYGDMLHHE 87
Query: 109 FRRQFLGLNRRLRLPADAQKAPI-----LPTN-DLPTDFDWRDHGAVTGVKDQGACGSCW 162
F G N+ + AQ+ PI P N ++P+ DWR HGAVT +KDQG CGSCW
Sbjct: 88 FINTLNGFNKSVSAQLRAQRRPIGSRFIEPANVEIPSSVDWRTHGAVTPIKDQGHCGSCW 147
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYIL 221
SFSATGALEG H+ TG+LVSLSEQ L+DC SG ++GCNGGLM+ AF+YI
Sbjct: 148 SFSATGALEGQHYRITGKLVSLSEQNLIDC--------SGRYGNNGCNGGLMDQAFQYIK 199
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGI 280
G++ E YPY + C+++ A S + + +E ++ A + GP++V I
Sbjct: 200 DNHGLDTEISYPYE-AENDKCRYNPRNNGATDSGYVDIPEGNEKKLKAAVATIGPVSVAI 258
Query: 281 NAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y GV P + LDHGVL+VGYG+ ++ YW++KNSWG W
Sbjct: 259 DASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGTDD------NDQDYWLVKNSWGVTW 312
Query: 338 GENGYYKICMGR-NVCGVDSMVS 359
G+ GY K+ + N CG+ S S
Sbjct: 313 GDEGYIKMARNKDNHCGIASSAS 335
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 129/327 (39%), Positives = 175/327 (53%), Gaps = 29/327 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
+LL E H LFK+ K Y +Q E +R +++ N + + +L + + + K
Sbjct: 21 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAMNK 78
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N +P DWR+ GA+T VKDQG
Sbjct: 79 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVTVPESVDWREKGAITPVKDQGQ 138
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALEG F TG+LVSLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 139 CGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 191
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
+YI G++ E YPY D C+++ A F + S +ED++ A + GP+
Sbjct: 192 QYIKDNKGIDTENTYPYEAED-DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 250
Query: 277 AVGINAVW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
+V I+A Q Y GV C LDHGVL+VGYGS K YW++KNSW
Sbjct: 251 SVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSD-------NGKDYWLVKNSW 303
Query: 334 GENWGENGYYKICMGR-NVCGVDSMVS 359
E+WG+ GY K+ R N CGV S S
Sbjct: 304 SEHWGDEGYIKMARNRKNHCGVASAAS 330
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/330 (38%), Positives = 175/330 (53%), Gaps = 30/330 (9%)
Query: 29 AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK-RR 87
A+ + VPS+ + + F+ F ++SK Y + E RF FKAN+ +
Sbjct: 26 ALFSEEVPSE--------VMLQDMFTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHN 76
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
L + + G+ +F+DL+ EF+ ++ G R A + PT DWR
Sbjct: 77 TLANASYTMGLNEFADLSFEEFKGKYFGYKHVEREFARSNNLH-QEVEAAPTSIDWRTSN 135
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGE-LVSLSEQQLVDCDHECDPEESGSCDS 206
AVT +KDQG CGSCW+FSATG++EGA L L SLSEQQLVDC + D+
Sbjct: 136 AVTPIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCS-------TSYGDA 188
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFEYI+ G+ E YPY G GG C+ +K+ V S DE +
Sbjct: 189 GCNGGLMDYAFEYIIANKGICAESAYPYKGV-GGLCQKSCTKVVTISGYKDVASGDEASL 247
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
+ GP++V I A Q Y GV CG LDHGVL VGYG++G +
Sbjct: 248 LNAVGTVGPVSVAIEADQAGFQFYSSGVFSG-TCGHNLDHGVLAVGYGTTG-------SQ 299
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNVCGV 354
YWI+KNSWG +WGE+GY ++ +N CG+
Sbjct: 300 DYWIVKNSWGTSWGESGYIRMIRNKNQCGI 329
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 129/371 (34%), Positives = 195/371 (52%), Gaps = 35/371 (9%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
+ R LS LL++ ++ A +++ D ++ +++D ++ + + K
Sbjct: 3 LHRSSLSLFLLMIFTASSAVDMSIVSYD---QRHADKSSWRTDDEVM---AMYEAWLVKH 56
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---- 116
K Y E + RF +FK NLR + T G+ +F+DLT E+R +LG+
Sbjct: 57 GKAYNALGEKEKRFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGA 116
Query: 117 ---NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
R++ +D A + + LP DWR GAV GVKDQG+CGSCW+FS A+EG
Sbjct: 117 TRVTRKVSRKSDRFAARV--GDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGI 174
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E+DYP
Sbjct: 175 NQIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDSEEDYP 226
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGG 291
Y D ++ K+ ++ + + +++ V P++V I A Q Y G
Sbjct: 227 YRAADQKCDQYRKNANVVSIDGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSG 286
Query: 292 VSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
V CG LDHGV VGYG+ + YWI+ NSWG+NWGE+GY + M RN+
Sbjct: 287 VFTGK-CGTSLDHGVAAVGYGTE-------NGQDYWIVGNSWGKNWGEDGYIR--MERNL 336
Query: 352 CGVDSMVSSVA 362
G S +A
Sbjct: 337 AGSSSGKCGIA 347
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 160/293 (54%), Gaps = 26/293 (8%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLD--PTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ YA E + R+ VFK N+ +R + T V +F+DLT EFR + G
Sbjct: 46 RVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN 105
Query: 120 LRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
L + + + ++ LP DWR GAVT +KDQG+CGSCW+FSA A+EG
Sbjct: 106 SVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQ 165
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ G+L+SLSEQ+LVDCD + D GC GG MNSAF Y + GG+ E +YPY
Sbjct: 166 IKKGKLISLSEQELVDCD---------TNDDGCMGGYMNSAFNYTMTTGGLTSESNYPYK 216
Query: 236 GTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWMQTYIGGV 292
TD G+C +K+K IA ++ F + +++++ V H P+++GI Q Y GV
Sbjct: 217 STD-GTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGV 275
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
C +LDHGV +VGYG S YWI+KNSWG WGE GY +I
Sbjct: 276 FSGE-CSTHLDHGVAVVGYGKSS------NGSKYWILKNSWGPKWGERGYMRI 321
>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 132/368 (35%), Positives = 183/368 (49%), Gaps = 53/368 (14%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L L+++L+++ V A+ +++ ++ + Q F+ FK K +
Sbjct: 8 LSLAAVLVVMACLVPAATASLHAEETLASQ-------------------FAEFKQKHGRV 48
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------FLGL 116
Y + E +R VF+ANL A+ +P A GVT FSDLT EFR + F
Sbjct: 49 YGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAA 108
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R R+P D + P DWR+ GAVT VK+QG CGSCW+F+A G +E FL
Sbjct: 109 QERARVPVDVEFV------GAPAAKDWREEGAVTAVKNQGMCGSCWAFAAIGNIECQWFL 162
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGVEREKDYPY 234
+ L LSEQ LV CD+ +SGC GG AF++I+ G V E+ YPY
Sbjct: 163 AGNPLTRLSEQMLVSCDNT---------NSGCGGGWPLVAFKWIVDRNNGTVYTEESYPY 213
Query: 235 TGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
G S C + A ++ + I DE+ +AA L +GP+AV ++A Y GGV
Sbjct: 214 HSCIGISPPCTTSGHTVGATITGYVTIPRDENGIAAWLAVNGPVAVVVDASSWIFYTGGV 273
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+ K L H VL+VGY S P+WIIKNSW +WGE+GY +I G N C
Sbjct: 274 MTSCV-SKQLSHAVLLVGYNDSA-------TVPHWIIKNSWTTHWGEDGYIRIAKGSNQC 325
Query: 353 GVDSMVSS 360
V VSS
Sbjct: 326 LVKEGVSS 333
>gi|288548564|gb|ADC52430.1| cathepsin L1 cysteine protease [Pinctada fucata]
Length = 331
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 122/319 (38%), Positives = 169/319 (52%), Gaps = 25/319 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ ++++K F+K Y EE R V++ N+ ++ H G +++D+T
Sbjct: 25 DQEWAIYKDMFAKNYVADEERMRRL-VWEDNIDYIEKHNRRADRGEHKFWLGTNEYADMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF+ G + D +P DLP DWRD G VT VK+QG CGSCWSFS
Sbjct: 84 IDEFKAIMNGFIMQNGTKGDTYMSPS-NIGDLPDKVDWRDKGYVTPVKNQGHCGSCWSFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HF STG+LVSLSEQ L+DC + + GC GGLM+ AFEYI K G
Sbjct: 143 ATGSLEGQHFKSTGKLVSLSEQNLIDCSKK-------EGNHGCKGGLMDFAFEYIQKNDG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGINA-- 282
++ E+ YPYT DG C+F K+ + A + E + + GP++V ++A
Sbjct: 196 IDTEQSYPYTAKDGIECRFKKADVGATDKGKVDLPRQSEKALQEAVATVGPISVAMDAGH 255
Query: 283 VWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ +C LDHGVL VGYGS G E YW++KNSWG WG G
Sbjct: 256 RSFQLYKRGIYTEPMCSSTKLDHGVLAVGYGSEG-------EGDYWLVKNSWGATWGMEG 308
Query: 342 YYKICMG-RNVCGVDSMVS 359
++ + RN CG+ + S
Sbjct: 309 FFMLARNHRNECGIATQAS 327
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 131/359 (36%), Positives = 194/359 (54%), Gaps = 27/359 (7%)
Query: 11 LLLLSSVLA-SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+LLL +VLA SA+A + A + + ED + + L+ ++ K Y E
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAI--MELYELWLAQHKKAYNGLGE 60
Query: 70 HDYRFRVFKAN-LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLP-AD 125
RF VFK N L + +P+ G+ +F+DL+ EF+ +LG L+ + RL +
Sbjct: 61 KQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNSP 120
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ + DLP DWR+ GAVT VKDQG+CGSCW+FS A+EG + + TG L SLS
Sbjct: 121 SPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQ+LVDCD S + GCNGGLM+ AF++I+ GG++ E DYPY DG +
Sbjct: 181 EQELVDCDT--------SYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYR 232
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGGVSCPYICGKYLD 303
K+ + ++ + ++++ + P++V I A Q Y GV CG LD
Sbjct: 233 KNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTS-TCGTQLD 291
Query: 304 HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
HGV +VGYGS YWI+KNSWG++WGE G+ + + RN+ GV + + +A
Sbjct: 292 HGVTLVGYGSE-------SGTDYWIVKNSWGKSWGEKGFIR--LQRNIEGVSTGMCGIA 341
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/327 (37%), Positives = 178/327 (54%), Gaps = 34/327 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE++++ A ++ + + +TY + R++VF+ NLR
Sbjct: 29 IVSYGERTDEE---ARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAG 85
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+ +LG R +L A A DLP DWR
Sbjct: 86 VHSFRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGARYHAAD---NEDLPESVDWRAK 142
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG+CG+CW+FS A+EG + + TG+L+SLSEQ+LVDCD S +
Sbjct: 143 GAVAEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 194
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ EKDYPY GTDG K+ + ++ + +++++
Sbjct: 195 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 254
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
V + P++V I A Q Y G+ CG LDHGV VGYG+ K
Sbjct: 255 LQKAVANQPVSVAIEAAGTAFQLYSSGIFTG-SCGTRLDHGVTAVGYGTE-------NGK 306
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNV 351
YWI+KNSWG +WGE+GY + M RN+
Sbjct: 307 DYWIVKNSWGSSWGESGYVR--MERNI 331
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 173/324 (53%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRIIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG
Sbjct: 249 MDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSE 305
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG GY KI R N CG+ + S
Sbjct: 306 WGMEGYIKIAKDRDNHCGLATAAS 329
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 134/367 (36%), Positives = 190/367 (51%), Gaps = 50/367 (13%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L L L+ +VLA+A A+ S L+N E ++ FK + +K Y
Sbjct: 3 LFLFLIVAVLATAQAI-----------------SFFELVNQE--WTTFKMEHNKVYKNDV 43
Query: 69 EHDYRFRVFKANLRRAKRR----QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
E +R ++F N + + ++ + + K+ D+ EF G N+ +
Sbjct: 44 EERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQL 103
Query: 125 DAQKAPIL-----PTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+++ PI P N LP DWR+HGAVT VKDQG CGSCWSFSATGALEG HF T
Sbjct: 104 RSERLPIAASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRT 163
Query: 179 GELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G L+ LSEQ L+DC SG ++GCNGGLM+ AF+YI G++ E YPY
Sbjct: 164 GILIPLSEQNLIDC--------SGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYE-A 214
Query: 238 DGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSC 294
+ C+++ + A V + +E ++ A + GP++V I+A Q Y GV
Sbjct: 215 ENDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYY 274
Query: 295 -PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVC 352
P + LDHGVL VGYG+ + YW++KNSWGE WG+NGY K+ + N C
Sbjct: 275 EPECSSENLDHGVLAVGYGTDE------NGQDYWLVKNSWGETWGDNGYIKMARNKLNHC 328
Query: 353 GVDSMVS 359
G+ S S
Sbjct: 329 GIASTAS 335
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 134/367 (36%), Positives = 191/367 (52%), Gaps = 50/367 (13%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L LLL+ ++LA+A A+ S L+N E ++ FK + +K Y
Sbjct: 3 LFLLLIVAILATAQAI-----------------SFFELVNQE--WTTFKMEHNKVYKNDI 43
Query: 69 EHDYRFRVFKANLRRAKRR----QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
E +R ++F N + + ++ + + K+ D+ EF G N+ +
Sbjct: 44 EERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQL 103
Query: 125 DAQKAPI-----LPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+++ PI P N LP DWR+HGAVT VKDQG CGSCWSFSATGALEG HF T
Sbjct: 104 RSERLPIGASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRT 163
Query: 179 GELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G L+ LSEQ L+DC SG ++GCNGGLM+ AF+YI G++ E YPY
Sbjct: 164 GILIPLSEQNLIDC--------SGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYE-A 214
Query: 238 DGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSC 294
+ C+++ + A V + +E ++ A + GP++V I+A Q Y GV
Sbjct: 215 ENDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYY 274
Query: 295 -PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVC 352
P + LDHGVL VGYG+ + YW++KNSWGE WG+NGY K+ + N C
Sbjct: 275 EPECSSENLDHGVLAVGYGTDE------NGQDYWLVKNSWGETWGDNGYIKMARNKLNHC 328
Query: 353 GVDSMVS 359
G+ S S
Sbjct: 329 GIASTAS 335
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/324 (39%), Positives = 171/324 (52%), Gaps = 25/324 (7%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDL 104
+ ++ FK + K Y ++ E +R ++F N + + L ++ + K+ DL
Sbjct: 23 VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDL 82
Query: 105 TPSEFRRQFLGLNRRL----RLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACG 159
EF G NR R I P + D+P DWR GAVT VKDQG CG
Sbjct: 83 LHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDIPDTVDWRQEGAVTPVKDQGHCG 142
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCWSFSATGALEG HF T +LVSLSEQ LVDC S ++GCNGGLM++AF Y
Sbjct: 143 SCWSFSATGALEGQHFRQTKKLVSLSEQNLVDC-------SSRFGNNGCNGGLMDNAFRY 195
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I GG++ E YPY G D K++ A + S DED++ A + GP+++
Sbjct: 196 IKNNGGIDTEAAYPYMGEDEKFRYSAKNRGATDKGFVDIPSGDEDKLKAAVATVGPISIA 255
Query: 280 INAVW--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
I+A Q Y GV S P LDHGVL+VGYG+ + YW++KNSWG+
Sbjct: 256 IDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGM-----DYWLVKNSWGDT 310
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG +GY K+ + N CGV + S
Sbjct: 311 WGLDGYIKMARNQDNQCGVATQAS 334
>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
Length = 347
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 171/318 (53%), Gaps = 31/318 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K K+ K Y +EE+DY F N+ +L T G+ +DL SE+R+
Sbjct: 43 YKIKYDKHYDPEEENDY-MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRK 101
Query: 112 QFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
L R RL D+ + ++P N P DWR+H VT VK+QG CGSCW+FSA
Sbjct: 102 --LNGYRHRRLFGDSMRKNGTKFLVPFNVKAPDSVDWREHNLVTPVKNQGMCGSCWAFSA 159
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF +TG+LVSLSEQ LVDC + + GCNGGLM+ AFEYI G+
Sbjct: 160 TGALEGQHFRATGKLVSLSEQNLVDC-------STKYGNHGCNGGLMDLAFEYIKDNHGI 212
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--V 283
+ E+ YPY G + C F K I A F + DED + + GP+++ I+A
Sbjct: 213 DTEEGYPYVGKE-MRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPISIAIDAGHR 271
Query: 284 WMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GV C + LDHGVL+VGYG+ A YWIIKNSWG WGE GY
Sbjct: 272 SFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAG------DYWIIKNSWGTKWGEKGY 325
Query: 343 YKICMGRNV-CGVDSMVS 359
+I RN CGV + S
Sbjct: 326 VRIARNRNNHCGVATKAS 343
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 160/293 (54%), Gaps = 26/293 (8%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ YA E + R+ VFK N+ +R + T V +F+DLT EFR + G
Sbjct: 40 RVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN 99
Query: 120 LRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
L + + + ++ LP DWR GAVT +KDQG+CGSCW+FSA A+EG
Sbjct: 100 SVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQ 159
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ G+L+SLSEQ+LVDCD + D GC GG MNSAF Y + GG+ E +YPY
Sbjct: 160 IKKGKLISLSEQELVDCD---------TNDDGCMGGYMNSAFNYTMTTGGLTSESNYPYK 210
Query: 236 GTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWMQTYIGGV 292
TD G+C +K+K IA ++ F + +++++ V H P+++GI Q Y GV
Sbjct: 211 STD-GTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGV 269
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
C +LDHGV +VGYG S YWI+KNSWG WGE GY +I
Sbjct: 270 FSGE-CSTHLDHGVAVVGYGKSSNGS------KYWILKNSWGPKWGERGYMRI 315
>gi|15593252|gb|AAL02222.1|AF410882_1 cysteine protease CP14 precursor [Frankliniella occidentalis]
Length = 333
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 130/334 (38%), Positives = 174/334 (52%), Gaps = 31/334 (9%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
+PSD + + H+ FK+ +KTYA E YR +VFK N +R AK
Sbjct: 18 IPSD--------MEIQAHWESFKATHAKTYANAVEEAYRAKVFKENAIRIAKHNDRFASG 69
Query: 94 AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
V G +++D+ E + G L+ + + DWR GAVT
Sbjct: 70 EVTFKVGYNQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVT 129
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
+KDQG CGSCWSFSATG+LEG FL LVSLSEQ LVDC + E GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
GLM+SAFEY+ GG++ E+ YPYT D G+C + + A + + V + E +
Sbjct: 183 GLMDSAFEYVKSNGGIDTEESYPYTAED-GTCLYKAANNAGVNTGYKDVQAKSESALRDA 241
Query: 270 LVKHGPLAVGINAV-W-MQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPY 326
+ K GP++V I+A W Q Y G+ C LDHGVL VGYGS + K +
Sbjct: 242 VEKVGPVSVAIDASNWSFQMYTSGIYYEPACSSDSLDHGVLAVGYGS------EWPNKEF 295
Query: 327 WIIKNSWGENWGENGYYKICMG-RNVCGVDSMVS 359
WI+KNSWG +WGE GY K+ +N CG+ + S
Sbjct: 296 WIVKNSWGTSWGEEGYIKMARNKKNNCGIATEAS 329
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/327 (39%), Positives = 175/327 (53%), Gaps = 29/327 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
+LL E H LFK+ K Y +Q E R +++ N + + +L + + + K
Sbjct: 25 NLLADEWH--LFKATHKKEYPSQLEEKLRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N ++P DWR+ GA+T VKDQG
Sbjct: 83 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 142
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALEG F TG+LVSLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
+YI G++ E YPY D G C+++ A F + S +ED++ A + GP+
Sbjct: 196 QYIKDNKGIDTENTYPYEAED-GVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 254
Query: 277 AVGINAVW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
+V I+A Q Y G C LDHGVL+VGYGS + YW++KNSW
Sbjct: 255 SVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYGSDN-------GEDYWLVKNSW 307
Query: 334 GENWGENGYYKICMGR-NVCGVDSMVS 359
E+WG+ GY KI R N CGV + S
Sbjct: 308 SEHWGDEGYIKIARNRKNHCGVATAAS 334
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 116/297 (39%), Positives = 164/297 (55%), Gaps = 26/297 (8%)
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQFLG 115
++ + YA E + R+ VFK N+ R +R + T V +F+DLT EFR + G
Sbjct: 37 TEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTG 96
Query: 116 LNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
L + + + ++ LP DWR GAVT +KDQG CGSCW+FSA A+E
Sbjct: 97 FKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIE 156
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + G+L+SLSEQ+LVDCD + D GC GGLM++AF Y + GG+ E +
Sbjct: 157 GVAQIKKGKLISLSEQELVDCD---------TNDGGCMGGLMDTAFNYTITIGGLTSESN 207
Query: 232 YPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTY 288
YPY T+ G+C F+K+K IA ++ F + +++++ V H P+++GI + Q Y
Sbjct: 208 YPYKSTN-GTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFY 266
Query: 289 IGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
GV C +LDHGV VGYG S YWI+KNSWG WGE GY +I
Sbjct: 267 SSGVFSGE-CTTHLDHGVTAVGYGRSKNGL------KYWILKNSWGPKWGERGYMRI 316
>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 173/314 (55%), Gaps = 25/314 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSE 108
+H F+ F +K+ K+Y T+EE+D+R ++FK NL + D T G+ KF+D T +E
Sbjct: 40 DHAFTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARNDVTYRLGLNKFADYTEAE 99
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
++R LG + K P ND +W + GAVT VKDQG CGSCWSFSATG
Sbjct: 100 YKR-LLGFGGQKNKNPRNIKVLGAPKND---GVNWVEQGAVTPVKDQGQCGSCWSFSATG 155
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EG + G L SLSEQQLVDC + GC GG M+ AF+Y+ + +E
Sbjct: 156 AMEGHAKIQFGTLYSLSEQQLVDCSQ-------AEGNEGCGGGWMDQAFQYVEQT-ALET 207
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWM--Q 286
E YPY D +C+ + + S V ++ +++ A L K GP++V I A M Q
Sbjct: 208 EDQYPYEAVD-DTCRASSAGVVKVDSFVDVTPNNVNELKAALDK-GPVSVAIEADQMVFQ 265
Query: 287 TYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y GGV CG LDHGVL VGYG+ + Y+++KNSWG +WGE GY KI
Sbjct: 266 FYSGGVINDASCGTTLDHGVLAVGYGNE-------SGQDYFLVKNSWGASWGEEGYVKIA 318
Query: 347 MG-RNVCGVDSMVS 359
N+CG+ S S
Sbjct: 319 ASPDNICGILSQAS 332
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 129/335 (38%), Positives = 181/335 (54%), Gaps = 45/335 (13%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + K KTY ++EE R ++FK N + L+ + T + F+DLT EF+
Sbjct: 30 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 89
Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LGL+ A K L + +P DWR GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 90 SRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 149
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TG+L+SLSEQ+L+DCD S ++GCNGGLM+ AFE+++K G++ EK
Sbjct: 150 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 201
Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA------- 282
DYPY D G+CK DK K + +++ + S++++ V P++VGI
Sbjct: 202 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 260
Query: 283 ------VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
+ MQ G C LDH VLIVGYGS YWI+KNSWG++
Sbjct: 261 YSSKFYLLMQGIFSGP-----CSTSLDHAVLIVGYGSQNGV-------DYWIVKNSWGKS 308
Query: 337 WGENGYYKICMGRN------VCGVDSMVSSVAAIH 365
WG +G+ M RN VCG++ + S H
Sbjct: 309 WGMDGFMH--MQRNTENSDGVCGINMLASYPIKTH 341
>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 337
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 129/367 (35%), Positives = 187/367 (50%), Gaps = 57/367 (15%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+L LL S +A + +DD+ M F+++ K+ KTY+T
Sbjct: 9 ALFFLLASFTVALPFSPSDDEVMAES-------------------FNMWMKKYEKTYSTM 49
Query: 68 EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL-------GLNRR 119
EE++ R RV+ +N ++ + P + + +FSDLT +EF++ +L N
Sbjct: 50 EEYNERLRVYTSNYYYIEQLNKEHGPHTEYELNQFSDLTFAEFKKIYLTEPQHCSATNGN 109
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ P +A+ P DWR+ +T VKDQG CGSCW+FS TG LE H + TG
Sbjct: 110 FQKPVNARD---------PVAVDWREKNVITPVKDQGKCGSCWTFSTTGCLEAHHAIKTG 160
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+L+SLSEQQLVDC + + GCNGGL + AFEYI GG+E E +Y YT D
Sbjct: 161 QLISLSEQQLVDCAGAFN-------NHGCNGGLPSQAFEYIKYNGGIESESNYNYTAKD- 212
Query: 240 GSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAVGINAVW-MQTYIGGVSCPYI 297
G C+F+ S +AA VS+ I+ D E + + GP+++ Q Y GV I
Sbjct: 213 GVCRFNSSLVAATVSDVVNITKDAEGDIGTAVANVGPVSIAFEVTKSFQHYKKGVYQGEI 272
Query: 298 --CGKYLD---HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
C + D H VL+VGY + + YWI+KNSW +WG +GY+ I G N C
Sbjct: 273 EVCSQSPDKVNHAVLVVGYNQTKLG------EEYWIVKNSWSASWGMDGYFWIRRGHNAC 326
Query: 353 GVDSMVS 359
G+ + S
Sbjct: 327 GLATCAS 333
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 137/370 (37%), Positives = 191/370 (51%), Gaps = 59/370 (15%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
+LLL+L +V++ A A V+P + E + ++K + K Y T+
Sbjct: 1 MLLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R +F+ N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96
Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
K P+L + ND LP DWR+ V+ VKDQG CGSCW+FS TG+LEG H
Sbjct: 97 ---KKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206
Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGV- 292
TD CKFD S + A + + V SS+E + + GP++V I+A Q Y GV
Sbjct: 207 TDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVSVAIDAGHESFQFYSSGVY 266
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR--- 349
P + LDHGVL+VGYG A + +WI+KNSWG NWG+ GY I M R
Sbjct: 267 DEPQCSTEQLDHGVLVVGYG----AMNDNSHQAFWIVKNSWGPNWGDQGY--IMMSRNKN 320
Query: 350 NVCGVDSMVS 359
N CG+ + S
Sbjct: 321 NQCGIATSAS 330
>gi|1581745|prf||2117247A Cys protease:ISOTYPE=1
Length = 467
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/316 (38%), Positives = 162/316 (51%), Gaps = 24/316 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK + K Y + E +R VFK NL A+ +P A VT FSDLT EFR
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHASFAVTPFSDLTREEFRS 96
Query: 112 QF---LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
++ + + P DWR GAVT +KDQG C SCW+FS G
Sbjct: 97 RYHNAAAHFAAAQKRVRVPVEVEVEVGGPPAAVDWRARGAVTAIKDQGNCSSCWAFSTIG 156
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG L+ L LSEQ LV CD+ D+GC+GGLM+SAF++I++ G V
Sbjct: 157 NIEGQWHLAGNPLTGLSEQMLVSCDNA---------DNGCDGGLMDSAFDWIVEQNNGSV 207
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
E Y Y G D +C + A +S + DED+MAA L +GPLA+ ++A
Sbjct: 208 YTEASYSYVSGGGDSQTCDMSDHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATS 267
Query: 285 MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
+Y GGV + + LDHGV++VGY S PYWIIKNSWG +WGE GY +
Sbjct: 268 FMSYTGGVLTNCVSDQ-LDHGVVLVGYNDS-------SNPPYWIIKNSWGADWGEEGYIR 319
Query: 345 ICMGRNVCGVDSMVSS 360
I G N C V + S
Sbjct: 320 IQKGTNQCLVKNYACS 335
>gi|27819101|gb|AAO23117.1| cysteine proteinase [Bombyx mori NPV]
Length = 323
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 116/325 (35%), Positives = 180/325 (55%), Gaps = 29/325 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++ CD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIGCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGI 280
K GGV+ E DYPY D +C+ + +K V + + I E+++ L GP+ + I
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE+
Sbjct: 247 DAADIVNYKQGI-IKYCFDSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGED 298
Query: 341 GYYKICMGRNVCGVDSMVSSVAAIH 365
G++++ N CG+ + ++S A I+
Sbjct: 299 GFFRVQQNINACGMRNELASTAVIY 323
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/288 (39%), Positives = 164/288 (56%), Gaps = 23/288 (7%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RRLRLPAD 125
E + RF+VFK NLR + + G+ +F+DLT E+R +LG +R RL
Sbjct: 70 EKERRFQVFKDNLRFIDEHNSENRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRS 129
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ + + LP DWR GAV VKDQG+CGSCW+FS A+EG + + TG+L+SLS
Sbjct: 130 SNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLS 189
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQ+LVDCD S + GCNGGLM+ AF++I+ GG++ E+DYPY DG +
Sbjct: 190 EQELVDCDR--------SYNEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYR 241
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLD 303
K+ + N+ + ++++ V + P++V I A Q Y G+ CG LD
Sbjct: 242 KNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQFYQSGIFTGR-CGTALD 300
Query: 304 HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
HGV VGYG+ K YWI++NSWG++WGE+GY + M RN+
Sbjct: 301 HGVAAVGYGTE-------NGKDYWIVRNSWGKSWGESGYIR--MERNI 339
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/306 (37%), Positives = 167/306 (54%), Gaps = 26/306 (8%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K+Y + +E + RF +FK NLR + + G+ +F+DLT E+R +LG
Sbjct: 51 KSYNSLDEKEMRFEIFKDNLRIIDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFKSGP 110
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ + P + + LP DWR GAV GVK+QG C SCW+FSA A+EG + + TG
Sbjct: 111 KAKVSNRYVPKV-GDVLPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGN 169
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
L+SLSEQ+LVDC GCN G M AF++I+ GG+ E +YPYT DG
Sbjct: 170 LLSLSEQELVDCGRT-------QSTRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQ 222
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYIC 298
++ +++ + ++ + S+ + N V H P++VG+ + + Y G+ Y C
Sbjct: 223 CNRYLQNQKYVTIDDYENVPSNNEWALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQY-C 281
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV-----CG 353
G +DHGV IVGYG+ + YWI+KNSWG NWGENGY +I RN+ CG
Sbjct: 282 GTAIDHGVTIVGYGTE-------RGLDYWIVKNSWGTNWGENGYIRI--QRNIGGAGKCG 332
Query: 354 VDSMVS 359
+ M S
Sbjct: 333 IARMAS 338
>gi|339244637|ref|XP_003378244.1| cathepsin F [Trichinella spiralis]
gi|316972865|gb|EFV56511.1| cathepsin F [Trichinella spiralis]
Length = 317
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 112/287 (39%), Positives = 162/287 (56%), Gaps = 29/287 (10%)
Query: 93 TAVHGVTKFSDLTPSEFRRQFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTG 151
TA++G T F+D+T EFR+ +L L LP Q+ +L D P FDWR++ VT
Sbjct: 10 TAIYGPTIFADMTQDEFRKTYLNMLETSALLPK--QRIALLKV-DRPNKFDWRNYNVVTK 66
Query: 152 VKDQ----------GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
VK Q G CGS W+FS +E A + G+L+SLSEQQ++DCD
Sbjct: 67 VKRQVWHKMQKKFLGKCGSSWAFSTIANIESAWAIKFGDLISLSEQQIIDCD-------- 118
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
+ GC GG A+ I++ GV+ E DYPYTG G SCK +K KI +++ ++
Sbjct: 119 -KINRGCRGGQPLKAYHEIIRMSGVQAESDYPYTGLHG-SCKLNKEKIKVYINDTVLLHK 176
Query: 262 DEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICG---KYLDHGVLIVGYGSSGFAP 318
+E +A L +HGP+AV +NA + Y G+ P +L+HG I+GYG +
Sbjct: 177 NETTIANYLYEHGPVAVRMNADILMLYRKGIIKPTKSSCNPNFLNHGATIIGYGKESW-- 234
Query: 319 IRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVAAIH 365
+ + PYWIIKNSWG +WGENGY+++ G CGV+ MV+S++ +
Sbjct: 235 LHWWSNPYWIIKNSWGVDWGENGYFRLYRGNEACGVNRMVTSMSEMQ 281
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 120/324 (37%), Positives = 176/324 (54%), Gaps = 28/324 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K+Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+E + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AF++I+ GG++ E DYPY G D K+ + ++ ++ + +
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQK 253
Query: 270 LVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
V++ P++V I A Q Y G+ CG LDHGV VGYG+ K YW
Sbjct: 254 AVRNQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE-------NGKDYW 305
Query: 328 IIKNSWGENWGENGYYKICMGRNV 351
I++NSWG++WGE+GY + M RN+
Sbjct: 306 IVRNSWGKSWGESGYVR--MERNI 327
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 175/321 (54%), Gaps = 31/321 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H++LFK+ F K Y+T EE R ++AN+ ++ L +H G+ ++DLT
Sbjct: 25 DSHWALFKTTFGKQYSTAEEITRRL-AWEANVAIIRQHNLEHDLGLHTYTLGLNNYADLT 83
Query: 106 PSEFRRQFLGLNRRLRLPADA-QKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+EF + GL A ++ + P +LPT DWR G VT +KDQG CGSCW+
Sbjct: 84 NAEFNQVMNGLRVNASQTKSANRRTYVAPVGVELPTSVDWRTKGYVTPIKDQGQCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS+TG+LEG HF TG+LVSLSEQ L DC + + GCNGGLM+ AF YI +
Sbjct: 144 FSSTGSLEGQHFAKTGQLVSLSEQNLTDCSQK-------QGNMGCNGGLMDQAFTYIKEN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAVGINA 282
G++ E YPY D C F + + A + ++ I+ DE+ + + + GP++V I+A
Sbjct: 197 NGIDTESSYPYKAVD-EKCHFKAADVGATDTGYTDIAQQDENALQSAIATVGPISVAIDA 255
Query: 283 VW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G C LDHGVL VGY S K Y+I+KNSWG +WG+
Sbjct: 256 SHSSFQLYRSGAYNERACSATQLDHGVLAVGYDSE-------DGKDYYIVKNSWGTSWGQ 308
Query: 340 NGYYKICMGR---NVCGVDSM 357
GY I M R N CG+ +M
Sbjct: 309 KGY--IWMTRNKNNQCGIATM 327
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/323 (39%), Positives = 173/323 (53%), Gaps = 30/323 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y + E +R ++F N + AK Q V V K++D+ E
Sbjct: 27 WQTFKLEHRKNYVDETEERFRLKIFNENKHKIAKHNQRYASGEVSFKMAVNKYADMLHHE 86
Query: 109 FRRQFLGLN----RRLRL--PADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
F G N ++LR P+ I P + +P DWR GAVT VKDQG CGSC
Sbjct: 87 FHTTMNGFNYTLHKQLRASDPSFVGVTFISPEHVKIPKSVDWRSKGAVTEVKDQGHCGSC 146
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF G L+SLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 147 WAFSSTGALEGQHFRKAGTLISLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 199
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGI 280
GG++ EK YPY G D SC F+K+ I A + + DE +MA + GP++V I
Sbjct: 200 DNGGIDTEKSYPYEGID-DSCHFNKATIGATDRGSVDIPQGDEKKMAEAVATIGPVSVAI 258
Query: 281 NAVW--MQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y G+ C + LDHGVL+VGYG+ + YW++KNSWG W
Sbjct: 259 DASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTDESG------QDYWLVKNSWGTTW 312
Query: 338 GENGYYKICM-GRNVCGVDSMVS 359
G+ G+ K+ N CG+ S S
Sbjct: 313 GDKGFIKMARNADNQCGIASASS 335
>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
Length = 343
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 116/362 (32%), Positives = 202/362 (55%), Gaps = 28/362 (7%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+L++LL+ + L + D+ ++ + + S ++ +A +F F S+++K Y +
Sbjct: 4 TLIILLVVNALLNW----RDNELVDAAGTAANKPSLYNINSAPQYFEQFISQYNKQYKNE 59
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
E +RF +F N+ ++ + +AV+ + +F+D+T +E + GL L ++
Sbjct: 60 AEKRHRFNIFMHNIEEINQKNSRNDSAVYKINRFADMTKNEVVIRHTGLASIGELNSNFC 119
Query: 128 KAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
+ ++ P+ FDWR + VT VKDQ CG+CW+F++ GALE + + L+ L
Sbjct: 120 ETVVVDGPGQRQRPSSFDWRTYNKVTSVKDQSMCGACWAFASLGALESQYAIKYDRLIDL 179
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
+EQQLVDCD D GC+GGL+++A+E I++ GGVE+E DYPY + C
Sbjct: 180 AEQQLVDCDF---------VDMGCDGGLIHTAYEQIMQMGGVEQEFDYPYRA-ERQPCAL 229
Query: 245 DKSKIAAAVSN-FSVISSDEDQMAANLVKH-GPLAVGINAVWMQTYIGGVSCPYICGKYL 302
K AA V F + +E+++ +L++H GP+A+ ++AV + Y GG+ + L
Sbjct: 230 KPHKFAAGVRKCFRYVLRNEERL-EDLLRHVGPIAIAVDAVDLTDYYGGI-VSFCENNGL 287
Query: 303 DHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
+H VL+VGYG P+W +KNSWG ++GE+GY ++ G N CG+ + ++S A
Sbjct: 288 NHAVLLVGYGVE-------NNVPFWTLKNSWGSDYGEDGYVRVRRGVNSCGLVNELASSA 340
Query: 363 AI 364
+
Sbjct: 341 QV 342
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 127/321 (39%), Positives = 168/321 (52%), Gaps = 41/321 (12%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--------GVTKFSDLTPS 107
+K + K Y + E R +++AN R+ +D H G+ +F+DL S
Sbjct: 25 WKKEHGKVYNSDREELTRHIIWQAN------RKYVDEHNAHAEKFGFTVGMNQFADLESS 78
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EF R + G N + + K DLPT DWR G VT +K+QG CGSCW+FSA
Sbjct: 79 EFGRLYNGYNNKPSMKKAQSKVFSTKVGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAV 138
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
LEG HF +TG LVSLSEQ LVDC + + GCNGGLM++AF+Y++K GG++
Sbjct: 139 AGLEGQHFNATGTLVSLSEQNLVDCS-------TAEGNQGCNGGLMDNAFQYVIKNGGID 191
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI--SSDEDQMAANLVKHGPLAVGINA--V 283
E YPY D CKF+ + + + S FS I E + + GP++V I+A
Sbjct: 192 TEASYPYKAVD-QKCKFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHT 250
Query: 284 WMQTYIGGVSCPYICGKY-LDHGVLIVGY-GSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV C + LDHGV VGY SSG A YWI+KNSWG WG+ G
Sbjct: 251 SFQLYKSGVYSESACSQTSLDHGVTAVGYDSSSGVA--------YWIVKNSWGTTWGQAG 302
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M R N CG+ + S
Sbjct: 303 Y--IWMSRNKNNQCGIATAAS 321
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 118/302 (39%), Positives = 168/302 (55%), Gaps = 27/302 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR-----RAKRRQLLDPTAVHGVTKFSDLTPS 107
F L+K K K Y EE + R FK NL+ KR+ L+ G+ KF+DL+
Sbjct: 50 FKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKV--GLNKFADLSNE 107
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EFR +L ++ + +K L T D P+ DWR+ G VT VKDQG CGSCWSFS T
Sbjct: 108 EFREMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDCGSCWSFSTT 167
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GA+E + + TG+L+SLSEQ+LVDCD + + GC GG M+SAF++++ GG++
Sbjct: 168 GAIEAINAIVTGDLISLSEQELVDCDT--------TNNYGCEGGDMDSAFQWVIGNGGID 219
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWM 285
E DYPYTG DG + K ++ + + + + V+ P++VG++ A+
Sbjct: 220 TEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQ-PISVGMDGSALDF 278
Query: 286 QTYIGGVSCPYICG--KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Q Y GG+ G +DH +LIVGYGS ++ YWI+KNSWG WG GY+
Sbjct: 279 QLYTGGIYDGDCSGDPNDIDHAILIVGYGSE-------NDEDYWIVKNSWGTEWGMEGYF 331
Query: 344 KI 345
I
Sbjct: 332 YI 333
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/319 (38%), Positives = 173/319 (54%), Gaps = 27/319 (8%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTP 106
+ + FK+++ K Y + +E YR V++ N + T +F D+T
Sbjct: 20 NEWQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTT 79
Query: 107 SEFRRQFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
E G L+ ++P P++ ++LP DWRD GAVT VKDQ ACGSCW+FS
Sbjct: 80 EEINAAMNGFLSAGKKVPRGTMYQPLV--DELPDTVDWRDKGAVTPVKDQKACGSCWAFS 137
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFLSTG+LVSLSEQ LVDC + + GC GGLM++AF YI G
Sbjct: 138 ATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYG-------NFGCGGGLMDNAFRYIKDNNG 190
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAVGINA-- 282
++ E+ YPY + G C+F+ + A +S++ I ED + + + GP++V I+A
Sbjct: 191 IDTEESYPYEAKN-GPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDAST 249
Query: 283 VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Y G+ C +LDHGVL VGYG+ YW++KNSW E WG++G
Sbjct: 250 STFHFYSRGIYYDEKCSSSFLDHGVLAVGYGTD-------DSSDYWLVKNSWNETWGDSG 302
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y K+ R N CG+ S S
Sbjct: 303 YIKMSRNRNNNCGIASQAS 321
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 171/318 (53%), Gaps = 21/318 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+KS SK Y +EE +R V++ NL+ + L H G+ +F D+T
Sbjct: 43 HWQLWKSWHSKDYHEREE-SWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAE 101
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EFR+ G + + P+ + P DWR+ G VT VKDQG CGSCW+FS
Sbjct: 102 EFRQLMNGYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFST 161
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ GG+
Sbjct: 162 TGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNQGCNGGLMDQAFQYVQDNGGI 214
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--V 283
+ E+ YPYT D C++ AA + F + E + + GP++V I+A
Sbjct: 215 DSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHS 274
Query: 284 WMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y G+ P + LDHGVL+VGY GF K YWI+KNSWGE WG+ GY
Sbjct: 275 SFQFYQSGIYYEPDCSSEDLDHGVLVVGY---GFEGEDVDGKKYWIVKNSWGEKWGDKGY 331
Query: 343 YKICMGR-NVCGVDSMVS 359
+ R N CG+ + S
Sbjct: 332 IYMAKDRKNHCGIATAAS 349
>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
Length = 323
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 117/326 (35%), Positives = 180/326 (55%), Gaps = 29/326 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
+L A ++F F +++K Y ++ E R+++F+ NL + D TAV+ + KFSDL+
Sbjct: 21 ILKAPNYFEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDIITKNRND-TAVYKINKFSDLS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P P +FDWR +T VK+QG CG+
Sbjct: 80 KDETIAKYTGLS----LPLHTQNFCEVVVLDRPPGKGPLEFDWRRFNKITSVKNQGMCGA 135
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE ++ L++LSEQQ++DCD S D GC GGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIAHDRLINLSEQQMIDCD---------SVDVGCEGGLLHTAFEAI 186
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVG 279
+ GGV+ E DYPY ++ C+ D +K V + I+ E+++ L GP+ V
Sbjct: 187 ISMGGVQIENDYPYESSN-NYCRMDPTKFVVGVKQCNRYITIYEEKLKDVLRLAGPIPVA 245
Query: 280 INAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
I+A + Y G+ Y L+H VL+VGYG PYWI+KNSWG +WGE
Sbjct: 246 IDASDILNYEQGI-IKYCANNGLNHAVLLVGYGVEN-------NVPYWILKNSWGTDWGE 297
Query: 340 NGYYKICMGRNVCGVDSMVSSVAAIH 365
G++KI N CG+ + ++S A I+
Sbjct: 298 QGFFKIQQNVNACGIKNELASTAEIN 323
>gi|118197532|ref|YP_874244.1| cathepsin [Ectropis obliqua NPV]
gi|113472527|gb|ABI35734.1| cathepsin [Ectropis obliqua NPV]
Length = 299
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 176/316 (55%), Gaps = 23/316 (7%)
Query: 55 LFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
+F + ++K Y E R+ +F+ NLR + L+ +AV+ + KFSDL+ SE ++
Sbjct: 1 MFVANYNKMYDDDLEKTKRYSIFRDNLRDINIKNKLNGSAVYRINKFSDLSTSEIVLKYT 60
Query: 115 GLN--RRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
GL+ RL + K +L P P +FDWR VT +K+QG CG+CW+F+ ++
Sbjct: 61 GLSVPPTERLTTNFCKTIVLDQPPGKGPLNFDWRHQNKVTSIKNQGVCGACWAFATLASI 120
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
E + + ++LSEQQ++DCD+ D GC+GGL+++AFE +++ GGV+ E
Sbjct: 121 ESQYAIKHNVQINLSEQQMIDCDY---------VDMGCDGGLLHTAFEQMIEMGGVKHEH 171
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAVWMQTYI 289
+YPY G + +C+ + A + + I E+++ L GP+ + I+A + Y
Sbjct: 172 EYPYEGIN-MNCRLNDDNFAVKIIGCYRYIVLQEEKLKDLLRAVGPIPIAIDASGIANYY 230
Query: 290 GGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
GV Y L+H VL+VGYG PYW IKN+WGE+WGENGY+++
Sbjct: 231 QGV-INYCENHGLNHAVLLVGYGVE-------NNIPYWTIKNTWGEDWGENGYFRVRQNI 282
Query: 350 NVCGVDSMVSSVAAIH 365
N CG+ + ++S A +H
Sbjct: 283 NACGMTNELASSAVLH 298
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 118/309 (38%), Positives = 170/309 (55%), Gaps = 25/309 (8%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-----GVTKFSDLT 105
+H + K K Y E + RF +F+ NL + + G+ KF+DLT
Sbjct: 3 YHLQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLT 62
Query: 106 PSEFRRQFLGLNRRLRLPA-DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EFRR + G+ R + + + + + ++LP DWR GAV+ VKDQG CGSCW+F
Sbjct: 63 NDEFRRIYFGVKRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SA GA+EG + + TG+L++LSEQ+LVDCD S +SGC+GGLM+ AF +I+ G
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDT--------SYNSGCDGGLMDYAFRFIINNG 174
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
G++ +KDYPY TDG K+ + + ++ ++ V H P+ + I A
Sbjct: 175 GIDTDKDYPYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGG 234
Query: 285 --MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GV CG LDHGV+ VGYG++ K YWI++NSWG++WGE+GY
Sbjct: 235 RDFQLYKSGVFTG-SCGTSLDHGVVAVGYGTTDDG------KDYWIVRNSWGDDWGEDGY 287
Query: 343 YKICMGRNV 351
I M RN
Sbjct: 288 --IRMERNT 294
>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 327
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 129/314 (41%), Positives = 169/314 (53%), Gaps = 26/314 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRR 111
+K+ K+YA+ EE R +++ NLR + +H +TKF+DL EF
Sbjct: 26 WKNTHGKSYASHEELK-RQLIWEKNLRVVTQHNYEYDEGLHTYTMAMTKFADLENDEFAA 84
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+L R+ P+ + PT DWR G VT VK+Q CGSCW+FS TG+LE
Sbjct: 85 MYLPRMRKDSRNGFCSAQPVGGFVENPTSIDWRTRGYVTPVKNQLQCGSCWAFSTTGSLE 144
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G HF T LVSLSEQQL+DC + D GC GG+M+ AF+YI AGGVE E D
Sbjct: 145 GQHFAKTKNLVSLSEQQLMDCSFK-------EGDEGCGGGIMDYAFDYIFLAGGVESEAD 197
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINA--VWMQTY 288
YPY + C+FD S IAA ++ V S E Q+ + GP++V I+A + Q Y
Sbjct: 198 YPYEARN-DHCRFDNSSIAATLTGCVDVTSGSETQLEKAVGSIGPVSVAIDASHISFQLY 256
Query: 289 IGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE-NGYYKIC 346
GV+ +C LDHGVL VGYG+ YWI+KNSWGE WG NGY K+
Sbjct: 257 GSGVNYEPMCSTTTLDHGVLAVGYGAD-------NGNEYWIVKNSWGEGWGHLNGYIKMS 309
Query: 347 MGR-NVCGVDSMVS 359
R N CG+ + S
Sbjct: 310 KNRNNNCGIATQAS 323
>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
Length = 334
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 173/324 (53%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG
Sbjct: 249 MDASHPSLQFYSLGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSE 305
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG GY KI R N CG+ + S
Sbjct: 306 WGMEGYIKIAKDRDNHCGLATAAS 329
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 138/362 (38%), Positives = 187/362 (51%), Gaps = 42/362 (11%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L LL+L++ L+S ++ DA + + H+ L+KS SK Y +E
Sbjct: 2 LPLLVLTACLSSVLSAPVLDAQLNE------------------HWDLWKSWHSKKYHEKE 43
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLG--LNRRLRL 122
E +R V++ NL++ + L H G+ F D+T EFR+ G L + +
Sbjct: 44 E-GWRRMVWEKNLQKIELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLKTQRKF 102
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P T P+ DWR+ G VT VKDQG CGSCW+FS TGALEG F TG+LV
Sbjct: 103 TGSLFMEPNFMT--APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLV 160
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQ LVDC PE + GC GGLM+ AF+Y+ G++ E YPYTGTD C
Sbjct: 161 SLSEQNLVDCSR---PE----GNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPC 213
Query: 243 KFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYIC- 298
+D +A + F V S E + + GP++V I+A Q Y G+ C
Sbjct: 214 HYDPLYNSANDTGFVDVPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECS 273
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDSM 357
+ LDHGVL VGYG G + K +WI+KNSWGE WG+ GY + R N CG+ +
Sbjct: 274 SEELDHGVLAVGYGFEGEDKMG---KKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATA 330
Query: 358 VS 359
S
Sbjct: 331 AS 332
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/327 (39%), Positives = 175/327 (53%), Gaps = 29/327 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
+LL E H LFK+ K Y +Q E +R +++ N + + +L + + + K
Sbjct: 25 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N ++P DWR GA+T VKDQG
Sbjct: 83 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWRVKGAITPVKDQGQ 142
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALEG F TG+L+SLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+YI G++ E YPY D C+++ A F I S +ED++ A + GP+
Sbjct: 196 QYIKDNKGIDTENTYPYEAED-NVCRYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGPV 254
Query: 277 AVGINAVW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
+V I+A Q Y GV C LDHGVL+VGYGS K YW++KNSW
Sbjct: 255 SVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDN-------GKDYWLVKNSW 307
Query: 334 GENWGENGYYKICMGR-NVCGVDSMVS 359
E+WG+ GY KI R N CG+ + S
Sbjct: 308 SEHWGDEGYIKIARNRKNHCGIATAAS 334
>gi|47224192|emb|CAG13112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 327
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 171/313 (54%), Gaps = 27/313 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E HF + + +K Y+ QE H R ++F N RR ++ + + G+ +FSD+T +EF
Sbjct: 26 EQHFKSWMALHNKAYSVQEFHQ-RLQIFTENKRRIEKHNGGNHSFTMGLNQFSDMTFAEF 84
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGA-VTGVKDQGACGSCWSFSAT 167
R++FL + A K + TN P DWR G VT VK+QGACGSCW+FS T
Sbjct: 85 RKRFLWSEPQ---NCSATKGSYMKTNSPQPESIDWRTKGNYVTPVKNQGACGSCWTFSTT 141
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G LE ++TG+LV LSEQQLVDC + + + GCNGGL + AFEYI G+
Sbjct: 142 GCLESVTAINTGKLVPLSEQQLVDCAWDFN-------NHGCNGGLPSQAFEYIKYNKGLM 194
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAV--W 284
E YPYT + G CK+ AA V N ++ + DE M + H P++ +
Sbjct: 195 TESGYPYTAFE-GKCKYKPELAAAFVKNVVNITAYDEKGMEDAVATHNPVSFAFEVTDDF 253
Query: 285 MQTYIGGVSCPYICGKYLD---HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
M Y GGV C K D H VL VGYG++ PYWI+KNSWG WGENG
Sbjct: 254 MH-YKGGVYSSSRCHKTTDKVNHAVLAVGYGNNN------SSVPYWIVKNSWGPYWGENG 306
Query: 342 YYKICMGRNVCGV 354
Y+ I G+N+CG+
Sbjct: 307 YFLIERGKNMCGL 319
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/324 (38%), Positives = 177/324 (54%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVT 99
DH LN + + L+K+ K Y EE +R V+K N++ + H +
Sbjct: 22 DHSLNTQ--WELWKAVHRKPYDLNEE-GWRKAVWKKNMKMIELHNQEYSQGKHSFSMAMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F DLT EFR+ G R+ I + +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDLTSEEFRQMMNGFQRQENKKGKVFHETIFAS--IPPSVDWREKGYVTPVKNQGKCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TGALEG F TG+LVSLSEQ LVDC PE + GC+GGLM++AF+Y
Sbjct: 137 SCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSQ---PE----GNRGCHGGLMDNAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
+L GG++ E+ YPYTG G+C ++ AA + F + E+ + + GP++V
Sbjct: 190 VLDVGGLDSEESYPYTGLV-GTCNYNPKNSAANETGFVDLPKQENALMKAVATLGPISVA 248
Query: 280 INAV--WMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A Q Y G+ C + +DHGVL+VGY GF + YW++KNSWG++
Sbjct: 249 VDASNPSFQFYKSGIYYEPKCKSESVDHGVLVVGY---GFEGADSDDNKYWLVKNSWGKH 305
Query: 337 WGENGYYKICMGRNV-CGVDSMVS 359
WG NGY K+ +N CG+ +M S
Sbjct: 306 WGINGYIKMAKDQNNHCGIATMAS 329
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 171/314 (54%), Gaps = 36/314 (11%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
K +K Y E +D +++ FK N+ + V G+ +F+DLT E+++ +LG++
Sbjct: 40 KHNKAYHHHEFND-KYQTFKDNMDFIHNWNSKESDTVLGLNRFADLTNEEYKKTYLGMSI 98
Query: 119 RLRLPADAQKAPILPTNDL-------PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ L A+ +P N L P+ DWR +GAV VKDQG CGSCW+F+ TGA+E
Sbjct: 99 NVNLRANQ-----VPMNGLNFERFTGPSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVE 153
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
GAH + TG +V+ SEQ LVDC ++GC+GGLM SAF+YI+ G+ E+
Sbjct: 154 GAHQIKTGNMVTFSEQHLVDCSGRYG-------NNGCDGGLMTSAFKYIIDNDGIATEEA 206
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYI 289
YPYT T C ++ + + A+S + + + + P+AV I+A + Q Y
Sbjct: 207 YPYTATQ-NRCVYNTTMLGTAISGYKDVPRGSESALTAAISKQPVAVAIDASPITFQLYK 265
Query: 290 GGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
GV C Y L+HGVL VGYG+ + K Y+I+KNSW E WG GY I M
Sbjct: 266 SGVYQEATCSSYRLNHGVLAVGYGT-------LEGKDYYIVKNSWAETWGNQGY--ILMA 316
Query: 349 RNV---CGVDSMVS 359
RN CG+ +M S
Sbjct: 317 RNANNHCGIATMAS 330
>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
Length = 334
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/324 (38%), Positives = 173/324 (53%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG
Sbjct: 249 MDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSE 305
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG GY +I R N CG+ + S
Sbjct: 306 WGMEGYIEIAKDRDNHCGLATAAS 329
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 175/319 (54%), Gaps = 31/319 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDLTPSE 108
+ +FK+ KTY Q E +R ++F N ++ + + + + + + F DL E
Sbjct: 27 WHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHE 86
Query: 109 FRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
F+ L ++ D ++ L P+N +LP DWR GAVT VKDQG CGSCWSFS
Sbjct: 87 FK----ALMNGFKMSPDTKRNGELYFPSNSNLPKTVDWRQKGAVTPVKDQGQCGSCWSFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG FL TG+LVSLSEQ LVDC + ++GC GGLM+ AF+Y+ G
Sbjct: 143 ATGSLEGQVFLKTGKLVSLSEQNLVDC-------STSYGNNGCEGGLMDQAFQYVSDNKG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGINAVW 284
++ E YPY + +C+F K+K+ + + + DE + L GP++V I+A
Sbjct: 196 IDTEASYPYEARE-NTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPISVAIDANH 254
Query: 285 --MQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV C Y LDHGVL VGYG+ + YW++KNSWG +WGENG
Sbjct: 255 GSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGTE-------NGQDYWLVKNSWGPSWGENG 307
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y KI N CG+ SM S
Sbjct: 308 YIKIARNHSNHCGIASMAS 326
>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
Length = 333
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 170/320 (53%), Gaps = 22/320 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSD 103
N + + +K+ + Y+T EE +R V++ N++ + HG T F D
Sbjct: 24 NLDTQWYQWKATHRRLYSTNEE-GWRRAVWEKNMKMIELHNGEYSRGKHGFTMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ + + + P+L DLP DWR G VT VK+Q CGSCW+
Sbjct: 83 MTNEEFRQVMVCFRNQKHKNGKVFRGPLLL--DLPKSVDWRKKGYVTPVKNQKQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC P+ + GCNGG MN AF Y+ +
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR---PQG----NQGCNGGFMNYAFRYVKEN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV 283
GG++ E YPY D G CK+ A + F VI + E ++ + GP++V ++A
Sbjct: 194 GGLDSEASYPYEAKD-GICKYKPENSVANDTGFVVIPTHEKELMKAVATVGPISVAVDAS 252
Query: 284 W--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y G+ C K LDHGVL+VGY GF K+ YW+IKNSWG WG N
Sbjct: 253 HSSFQFYKSGIYFEKKCSSKNLDHGVLVVGY---GFEGANSKDNKYWLIKNSWGPEWGLN 309
Query: 341 GYYKICMGRNV-CGVDSMVS 359
GY KI +N CG+ + S
Sbjct: 310 GYIKIAKDQNNHCGIATAAS 329
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/326 (39%), Positives = 169/326 (51%), Gaps = 38/326 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR----QLLDPTAVHGVTKFSDLTPSE 108
+ FK +F + Y E RF +F AN + Q T GV +F+D T E
Sbjct: 60 WKFFKIQFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVNEFTDKTDYE 119
Query: 109 FRRQFLGLNRRLRLPADA--QKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWS 163
++ R ++ + A K ++ LP+ DWR GAVT VK+QG CGSCW+
Sbjct: 120 LKKL-----RGYKVTSGAIRHKGSTFIRSEHTKLPSKVDWRREGAVTDVKNQGQCGSCWA 174
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG H+ T LV+LSEQQLVDC ++GC+GGLMNSAFEY+
Sbjct: 175 FSTTGAIEGQHYRKTNRLVNLSEQQLVDCS-------KSYGNNGCSGGLMNSAFEYVRDN 227
Query: 224 GGVEREKDYPYT---GTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVG 279
G++ E YPY GT+ C F+ S I A V+ + ++ DE + + GP++V
Sbjct: 228 EGIDSEISYPYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGPVSVA 287
Query: 280 INAVW--MQTYIGGVSCPYICG---KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
INA Y G+ C LDHGVL+VGYG + YW+IKNSWG
Sbjct: 288 INAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEEN-------GRSYWLIKNSWG 340
Query: 335 ENWGENGYYKICMG-RNVCGVDSMVS 359
E WGE GY KI G N+CGV S S
Sbjct: 341 EEWGEKGYIKISKGSHNMCGVASAAS 366
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/308 (37%), Positives = 177/308 (57%), Gaps = 25/308 (8%)
Query: 53 FSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTPSE 108
++F+S ++ K+Y E + RF +FK NLR ++ + G+ +FSDLT +E
Sbjct: 45 IAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDAE 104
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+ +LG +R+ + + + LP DWR GAV GVK+QG CGSCW+F++
Sbjct: 105 YSSIYLGTKFNIRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWTFASIA 164
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EG + + TG L+SLSEQ++VDC + ++GCNGG ++ A+++I+ GG+
Sbjct: 165 AVEGINKIVTGNLISLSEQEIVDCQRKYP-------NNGCNGGTLSGAYQFIINNGGINT 217
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGI--NAVWMQ 286
E +YPYTG DG + K+K + + + S+ ++ V P++V I N+ +
Sbjct: 218 EANYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFK 277
Query: 287 TYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
+Y G+ + P CG +DHGV IVGYG+ G K YWI++NSWG NWGE+GY +
Sbjct: 278 SYKSGIFNGP--CGPRIDHGVTIVGYGTEG-------GKDYWIVRNSWGPNWGESGY--V 326
Query: 346 CMGRNVCG 353
M RNV G
Sbjct: 327 RMQRNVGG 334
>gi|116779845|gb|ABK21448.1| unknown [Picea sitchensis]
gi|116791731|gb|ABK26088.1| unknown [Picea sitchensis]
gi|224286276|gb|ACN40847.1| unknown [Picea sitchensis]
Length = 357
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 130/372 (34%), Positives = 200/372 (53%), Gaps = 38/372 (10%)
Query: 6 LSSLLLLLLSSVLASAVAVND----DDAMIRQVVPSDGEQSEDHLLN------AEHHFSL 55
++ +L ++LS++LA A+AV+ ++ +V + E L F+
Sbjct: 1 MARILAIVLSTLLALAIAVSAARSFEETEYIDMVTDKIQNLESSLFKILGTNPKSVQFAE 60
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F ++ K Y + + +RF F N+ + R ++ + +F+D+T EF Q+LG
Sbjct: 61 FALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLAINEFADITWEEFHGQYLG 120
Query: 116 LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
++ K PT DWR+ G V+ VK+Q CGSCW+FS TGALE A+
Sbjct: 121 ASQNCSATKSNHK---FTDAQPPTKKDWREEGIVSPVKNQAHCGSCWTFSTTGALEAAYT 177
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPY 234
+TG+ V LSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E+ YPY
Sbjct: 178 QATGKTVILSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPY 229
Query: 235 TGTDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIG 290
T D G C +D + + V+ N S+ + DE + A LV+ P++V + + Y
Sbjct: 230 TAKD-GVCNYDVNNVGVKVADSVNISLGAEDELKSAVGLVR--PVSVAFQVIQDFRFYKE 286
Query: 291 GVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GV CG+ ++H VL VGYG S + P+WIIKNSWG++WG GY+K+ M
Sbjct: 287 GVFTSTTCGQGPMDVNHAVLAVGYGVSE------EGTPHWIIKNSWGKSWGVEGYFKMEM 340
Query: 348 GRNVCGVDSMVS 359
G+N+CGV + S
Sbjct: 341 GKNMCGVATCAS 352
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/336 (36%), Positives = 177/336 (52%), Gaps = 40/336 (11%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ + ++ + ++ TY E + RF F+ NLR +
Sbjct: 28 IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG +R +L A Q A ++LP DWR
Sbjct: 85 VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKK 141
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG CGSCW+FSA A+EG + + TG+++ LSEQ+LVDCD S +
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQ 193
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ E+DYPY D K+ + + + + ++
Sbjct: 194 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKS 253
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
V + P++V I A Q Y G+ CG LDHGV VGYG+ K
Sbjct: 254 LQKAVANQPISVAIEAGGRAFQLYKSGIFTG-TCGTALDHGVAAVGYGTE-------NGK 305
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNV------CGV 354
YW+++NSWG WGE+GY I M RN+ CG+
Sbjct: 306 DYWLVRNSWGSVWGEDGY--IRMERNIKASSGKCGI 339
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 139/355 (39%), Positives = 186/355 (52%), Gaps = 37/355 (10%)
Query: 14 LSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ---EEH 70
LS L S V V A+ Q +P D E L + E +SL++ K+ +A ++
Sbjct: 4 LSYALLSVVLVLGSVALA-QSIPFD----EKDLASEESLWSLYE-KWRAHHAVSRDLDDT 57
Query: 71 DYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLPAD 125
D RF VFK N++ Q D T + KF D+T EFR + G + LR D
Sbjct: 58 DKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDMTNQEFRSTYAGSKIDHHMTLRGVKD 117
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A + +DLPT DWR+ GAVTGVKDQG CGSCW+FS A+EG + + T ELVSLS
Sbjct: 118 AGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTNELVSLS 177
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCD + +SGCNGGLM+ AF++I GG+ E YPY + SC +
Sbjct: 178 EQQLVDCDTK---------NSGCNGGLMDYAFDFIKNNGGLSSEDSYPYL-AEQKSCGSE 227
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLD 303
+ + + + + + V + P++V I A Q Y GV + CG LD
Sbjct: 228 ANSAVVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQFYSQGVFSGH-CGTELD 286
Query: 304 HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG----RNVCGV 354
HGV VGYG + K YWI+KNSWGE WGE+GY ++ G R CG+
Sbjct: 287 HGVAAVGYG------VDDDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRGKCGI 335
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 135/345 (39%), Positives = 181/345 (52%), Gaps = 30/345 (8%)
Query: 33 QVVPSDGEQSEDHLL-------NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
QV+P E S + L + H+ L+KS K Y +EE +R V++ NL+ +
Sbjct: 107 QVIPVTKENSTETLHCRWQVDPELDGHWQLWKSWHRKDYHEREE-GWRRVVWEKNLKMIE 165
Query: 86 RRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PT 139
L H G+ +F D+T EFR+ G + + + + L N L P
Sbjct: 166 IHNLDHALGKHSYKLGMNQFGDMTTEEFRQLMNGYVHK-KSERKYRGSQFLEPNFLEAPR 224
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DWR+ G VT VKDQG CGSCW+FS TGALEG HF TG+LVSLSEQ LVDC PE
Sbjct: 225 SVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSR---PE 281
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SV 258
+ GCNGGLM+ AF+Y+ GG++ E+ YPYT D C++ AA + F +
Sbjct: 282 ----GNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDI 337
Query: 259 ISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSG 315
E + + GP++V I+A Q Y G+ P + LDHGVL+VGY G
Sbjct: 338 PQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGY---G 394
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
F K YWI+KNSWGE WG+ GY + R N CG+ + S
Sbjct: 395 FEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAAS 439
>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
Length = 334
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 125/324 (38%), Positives = 173/324 (53%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK++G CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNKGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++V
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A +Q Y G+ P K LDHGVL+VGYG G + K YW++KNSWG
Sbjct: 249 MDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSE 305
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG GY KI R N CG+ + S
Sbjct: 306 WGMEGYIKIAKDRDNHCGLATAAS 329
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 135/354 (38%), Positives = 188/354 (53%), Gaps = 41/354 (11%)
Query: 28 DAMIRQVVPSDGEQSEDH------LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
DA+++Q + +D +S H L+N E + FK + K Y + E +R ++F N
Sbjct: 7 DAVVQQKLTND--ESRTHAVSFFELVNQE--WMTFKMEHKKVYKSDVEERFRMKIFMDNK 62
Query: 82 RR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAP-----IL 132
+ AK + V + K+ D+ EF G N+ + +++ P I
Sbjct: 63 HKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIE 122
Query: 133 PTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVD 191
P N LP DWR GAVT VKDQG CGSCWSFSATGALEG HF TG LVSLSEQ L+D
Sbjct: 123 PANVVLPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLID 182
Query: 192 CDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
C SG ++GCNGGLM+ AF+YI G++ E YPY + C+++ +
Sbjct: 183 C--------SGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYE-AENDKCRYNPANSG 233
Query: 251 AA-VSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSC-PYICGKYLDHGV 306
A V + + DE + A + GP++V I+A Q Y GV P + LDHGV
Sbjct: 234 AIDVGYIDIPTGDEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGV 293
Query: 307 LIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
L++GYG++ + YW++KNSWGE WG NGY K+ + N CG+ S S
Sbjct: 294 LVIGYGTNENG------QDYWLVKNSWGETWGNNGYIKMARNKLNHCGIASSAS 341
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 170/311 (54%), Gaps = 25/311 (8%)
Query: 60 FSKTYATQEEHDYRFR------VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
F+K + +YRF +++ N+ R + + + + +F DLT +EF R F
Sbjct: 30 FAKWMRENTKSNYRFVYSNEEFIYRWNVWRDEEHNRQNKSYFLAMNQFGDLTNAEFNRLF 89
Query: 114 LGLNRRLRLPADAQKA-PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
GL A A P P +P++FDWR GAVT VK+QG CGSCWSFS TG+ EG
Sbjct: 90 KGLAFDYSKHAKIHTAAPEAPATGIPSEFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEG 149
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
A+FL TG LVSLSEQ L+DC ++GCNGGLM+ AFEYI+ G++ E Y
Sbjct: 150 ANFLKTGRLVSLSEQNLIDCSVSYG-------NNGCNGGLMDYAFEYIINNRGIDTEASY 202
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIG 290
PY +C+++ + +++ ++ ++S ++ N P++V I+A Q Y G
Sbjct: 203 PYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKEPVSVAIDASHNSFQFYSG 262
Query: 291 GVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
GV C LDHGVL+VG+GS + +W +KNSWG +WG NGY K+ +
Sbjct: 263 GVYYESACSSTQLDHGVLVVGWGSE-------NGQDFWWVKNSWGASWGLNGYIKMSRNQ 315
Query: 350 -NVCGVDSMVS 359
N CG+ + S
Sbjct: 316 NNNCGIATAAS 326
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 131/324 (40%), Positives = 176/324 (54%), Gaps = 32/324 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
++ FK + K Y ++ E R +++ N + AK Q D V K++DL E
Sbjct: 27 WNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEE 86
Query: 109 FRRQFLGLNR---RLRLPADAQKAPIL---PTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
F + G NR + L + P+ P N ++PT DWR GAVT VKDQG CGSC
Sbjct: 87 FVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSC 146
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYI 220
WSFSATGALEG HF TG+LVSLSEQ LVDC SG ++GCNGG+M+ AF+YI
Sbjct: 147 WSFSATGALEGQHFRKTGKLVSLSEQNLVDC--------SGKYGNNGCNGGMMDYAFQYI 198
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVG 279
GG++ EK YPY D +C F+ + A + + DE+ + L GP+++
Sbjct: 199 KDNGGIDTEKSYPYEAID-DTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 280 INAVW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
I+A Q Y GV C + LDHGVL VGYG+S + + YW++KNSWG
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSE------EGEDYWLVKNSWGTT 311
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG+ GY K+ N CGV + S
Sbjct: 312 WGDQGYVKMARNHDNHCGVATCAS 335
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/342 (36%), Positives = 179/342 (52%), Gaps = 34/342 (9%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
+PSDG+ D + + + + ++ KT + D RF +FK NLR
Sbjct: 33 LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEN 90
Query: 91 DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
+ A + G+TKF+DLT E+R+ +LG RR+ + + N ++P D
Sbjct: 91 NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR GAV +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
S + GCNGGLM+ AF++I+K GG+ EKDYPY G G F K+ ++ + + +
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262
Query: 263 EDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIR 320
++ + + P++V I A Q Y G+ CG LDH V+ VGYGS
Sbjct: 263 DETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGS-CGTNLDHAVVAVGYGSENGV--- 318
Query: 321 FKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
YWI++NSWG WGE GY I M RN+ S +A
Sbjct: 319 ----DYWIVRNSWGPRWGEEGY--IRMERNLAASKSGKCGIA 354
>gi|2352469|gb|AAC00067.1| cysteine protease [Trypanosoma cruzi]
Length = 471
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 133/368 (36%), Positives = 181/368 (49%), Gaps = 55/368 (14%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
++L+++L+++ V A+ +++ ++ + Q F+ FK K +
Sbjct: 8 VLLAAVLVVMACLVPAATASLHAEETLTSQ-------------------FAEFKQKHGRV 48
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------FLGL 116
Y + VF+ NL A+ +P A GVT FSDLT EFR + F
Sbjct: 49 YESAARR-LPLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAA 107
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R R+P + P DWR GAVT VKDQG CGSCW+FSA G +E FL
Sbjct: 108 QERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFL 161
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPY 234
+ L +LSEQ LV CD D GC+GGLMN+AFE+I++ G V E YPY
Sbjct: 162 AGHPLTNLSEQMLVSCDKT---------DFGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 212
Query: 235 TGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV 292
+G S C + A ++ + DE Q+AA + +GP+AV ++A TY GGV
Sbjct: 213 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAACVAVNGPVAVAVDASSWMTYTGGV 272
Query: 293 SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
+ + LDHGVL+VGY S PYWIIKNSW GE GY +I G N C
Sbjct: 273 MTSCV-SEQLDHGVLLVGYNDSA-------AVPYWIIKNSWTTQ-GEEGYIRIAKGSNQC 323
Query: 353 GVDSMVSS 360
V SS
Sbjct: 324 LVKEEASS 331
>gi|323451241|gb|EGB07119.1| hypothetical protein AURANDRAFT_54023 [Aureococcus anophagefferens]
Length = 377
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 175/319 (54%), Gaps = 23/319 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGV-TKFSDLTPSEFRR 111
F F +KF KTY T EE +R VF N + G+ +F+D T EF
Sbjct: 65 FMTFMTKFEKTYETVEEWAHRLTVFAQNAKIVLEHDAKAEGFALGLDNQFADWTAEEFA- 123
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ L+ R + P+ A + PT DWR G V +K+QG+CGSCW+FS ++E
Sbjct: 124 SYQKLHSRPK-PSQAGATHEVSDKAAPTAVDWRTEGVVADIKNQGSCGSCWTFSTVVSIE 182
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVERE 229
GA TG+LV+LSEQ LVDC + + C GC+GGLM++AF+YI+K GG++ E
Sbjct: 183 GAAARKTGKLVTLSEQNLVDCVKKDQIDGGDECCMGCSGGLMDNAFDYIIKNQDGGIDTE 242
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINAV--WMQ 286
Y YTG D G+C FDK+ + A +SN++ V DE +A L GP+++ ++A W Q
Sbjct: 243 ASYGYTGKD-GTCAFDKANVGATISNWTDVAVGDEVALADALANAGPVSIALDASKQW-Q 300
Query: 287 TYIGGVSCPY-ICG-----KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Y GG+ P I G + DHGV IVGYG+ YW I+NSWG WGE+
Sbjct: 301 LYSGGILKPRSILGCSSDPTHADHGVAIVGYGTD-------DGVDYWWIRNSWGTTWGES 353
Query: 341 GYYKICMGRNVCGVDSMVS 359
GY ++ G N CGV + S
Sbjct: 354 GYMRLERGVNACGVANFAS 372
>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
Length = 358
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/317 (41%), Positives = 179/317 (56%), Gaps = 32/317 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
FS F + K Y +++E RF +F NL R+ R+ L T V F+DLT EF+
Sbjct: 59 FSRFVYRHGKRYQSEDEMKMRFAIFSENLDFIRSTNRKGLSYTLA--VNDFADLTWQEFQ 116
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+ LG + A + L LP DWR+ G V+ VK+QG CGSCW+FS TGAL
Sbjct: 117 KHRLGAAQNC--SATTKGNHKLTGVALPDTKDWREVGIVSPVKNQGHCGSCWTFSTTGAL 174
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVERE 229
E A+ + G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG+E E
Sbjct: 175 EAAYHQAFGKGISLSEQQLVDC--------AGAFNNFGCHGGLPSQAFEYIKYNGGLETE 226
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAV-WM 285
+ YPYTG D G+CKF + V N ++ + DE + A LV+ P++V V
Sbjct: 227 EAYPYTGED-GACKFSSENVGIQVLDSVNITLGAEDELKEAVGLVR--PVSVAFEVVSGF 283
Query: 286 QTYIGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+ Y GV CG ++H VL VGYG PYW++KNSWGENWG++GY
Sbjct: 284 RFYKSGVYTSDTCGSTPMDVNHAVLAVGYGVE-------DGVPYWLVKNSWGENWGDHGY 336
Query: 343 YKICMGRNVCGVDSMVS 359
+K+ MG+N+CGV + S
Sbjct: 337 FKMEMGKNMCGVATCAS 353
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/342 (36%), Positives = 179/342 (52%), Gaps = 34/342 (9%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
+PSDG+ D + + + + ++ KT + D RF +FK NLR
Sbjct: 33 LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNED 90
Query: 91 DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
+ A + G+TKF+DLT E+R+ +LG RR+ + + N ++P D
Sbjct: 91 NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR GAV +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
S + GCNGGLM+ AF++I+K GG+ EKDYPY G G F K+ ++ + + +
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262
Query: 263 EDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIR 320
++ + + P++V I A Q Y G+ CG LDH V+ VGYGS
Sbjct: 263 DETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGS-CGTNLDHAVVAVGYGSENGV--- 318
Query: 321 FKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
YWI++NSWG WGE GY I M RN+ S +A
Sbjct: 319 ----DYWIVRNSWGPRWGEEGY--IRMERNLAASKSGKCGIA 354
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 132/365 (36%), Positives = 196/365 (53%), Gaps = 41/365 (11%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+L+L+ S L +++A D +++ S+ +S D L+ F + S+ K Y
Sbjct: 8 ALVLIACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYENI 62
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLP 123
EE RF +FK NL+ R + G+++F+DL+ EF ++LGL +RR P
Sbjct: 63 EEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHREFNNKYLGLKVDYSRRRESP 122
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+ + +LP DWR GAV VK+QG+CGSCW+FS A+EG + + TG L S
Sbjct: 123 EEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 178
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E+DYPY + G+C+
Sbjct: 179 LSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGACE 229
Query: 244 FDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGK 300
K + +S + + + +Q + + PL+V I A Q Y GGV + CG
Sbjct: 230 MTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGH-CGS 288
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGV 354
LDHGV VGYG++ K Y +KNSWG WGE GY I M RN +CG+
Sbjct: 289 DLDHGVAAVGYGTA-------KGVDYITVKNSWGSKWGEKGY--IRMRRNIGKPEGICGI 339
Query: 355 DSMVS 359
M S
Sbjct: 340 YKMAS 344
>gi|394331826|gb|AFN27132.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 124/313 (39%), Positives = 166/313 (53%), Gaps = 34/313 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G++E L+ L +LSEQQLV CD + D+GC+GGLM AFE++L+ G
Sbjct: 156 VGSIESQWALAGHGLTALSEQQLVSCDDK---------DNGCSGGLMLQAFEWLLRNMNG 206
Query: 225 GVEREKDYPYTGTDG--GSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ E YPY + G C + A + + I S E A L K+GP+++ ++
Sbjct: 207 TMFTEDSYPYVSSSGYVPECSNSSQLVPGARIEGYMTIESSETVKGAWLAKNGPISIAVD 266
Query: 282 AVWMQTYIGGV--SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A +Y GV SC G L+HGVL+VGY +G E PYW+IKNSWGE+WGE
Sbjct: 267 ASSFMSYQSGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGEDWGE 316
Query: 340 NGYYKICMGRNVC 352
GY ++ MG N C
Sbjct: 317 KGYVRVTMGVNAC 329
>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 329
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 128/329 (38%), Positives = 171/329 (51%), Gaps = 39/329 (11%)
Query: 47 LNAEHHFSLFKSKFSKTYATQE------EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
L AE H+ +++F+ Q+ E R+ FK NL R ++ G T
Sbjct: 19 LFAEKHY---QNQFTNWMVVQDRQYDAYEFRTRYSAFKDNLDFIHRWNAVNKETELGATV 75
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT------NDLPTDFDWRDHGAVTGVKD 154
F+DLT E+R +LG+N DA P + + DWR++GAV VKD
Sbjct: 76 FADLTNEEYRAVYLGMN------VDASNFAAQPATLDQVYQPVRSTLDWRNNGAVGRVKD 129
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG CGSCW+FS TGA+EGAH ++TG VSLSEQQL+DC + GC GGLM+
Sbjct: 130 QGQCGSCWAFSTTGAVEGAHQIATGNFVSLSEQQLMDCSRSYG-------NHGCQGGLMD 182
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
SA YI+K GG+ E+ YPY D +CK++ + A +S +S I + A + G
Sbjct: 183 SAMSYIVKQGGINTEESYPYEMRDSYTCKYNPANNGAKLSGYSNIKRGSEADLAAKLNIG 242
Query: 275 PLAVGINAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKN 331
P+A+ ++A Q Y GV P L HGVL VGYG+ G YWI+KN
Sbjct: 243 PVAIALDASHSSFQLYKSGVFYDPACSSTSLSHGVLAVGYGTEG-------SSAYWIVKN 295
Query: 332 SWGENWGENGYYKICMGRNV-CGVDSMVS 359
SWG WG+ GY I RN CGV +M S
Sbjct: 296 SWGTRWGDAGYIWIAKDRNNHCGVATMSS 324
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 134/360 (37%), Positives = 192/360 (53%), Gaps = 34/360 (9%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
L+L S L ++A D +++ S+ +S D L+ F + S+ K Y T EE
Sbjct: 9 LVLTCSLCLFLSLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYETIEE 63
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
RF VFK NL+ R + G+ +F+DL+ EF+ ++LGL L ++ +
Sbjct: 64 KLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRESSEE 123
Query: 130 PILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
+ DLP DWR GAVT VK+QG CGSCW+FS A+EG + + TG L SLSEQ+
Sbjct: 124 EFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 183
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS- 247
L+DCD + ++GCNGGLM+ AF +I+K GG+ +E+DYPY + +C+ K
Sbjct: 184 LIDCDT--------TYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYI-MEESTCEMKKEV 234
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHG 305
++ + + + +Q + + PL+V I A Q Y GGV + CG LDHG
Sbjct: 235 SEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGH-CGSELDHG 293
Query: 306 VLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGVDSMVS 359
V VGYG+S K Y I+KNSWG WGE G+ I M RN +CG+ M S
Sbjct: 294 VSAVGYGTS-------KGLDYIIVKNSWGAKWGEKGF--IRMKRNIGKSEGICGLYKMAS 344
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/330 (38%), Positives = 175/330 (53%), Gaps = 30/330 (9%)
Query: 29 AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK-RR 87
A+ + VPS+ + + F+ F ++SK Y + E RF FKAN+ +
Sbjct: 26 ALFSEEVPSE--------VMLQDMFTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHN 76
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
L + + G+ +F+DL+ EF+ ++ G R A + PT DWR
Sbjct: 77 TLANASYTMGLNEFADLSFEEFKGKYFGYKHVEREFARSNNLH-QEVEAAPTSIDWRTSN 135
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGE-LVSLSEQQLVDCDHECDPEESGSCDS 206
AVT +KDQG CGSCW+FSATG++EGA L L SLSEQQLVDC + ++
Sbjct: 136 AVTPIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCS-------TSYGNA 188
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFEYI+ G+ E YPY G GG C+ +K+ V S DE +
Sbjct: 189 GCNGGLMDYAFEYIIANKGICAESAYPYKGV-GGLCQKSCTKVVTISGYKDVASGDEASL 247
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
+ GP++V I A Q Y GV CG LDHGVL VGYG++G +
Sbjct: 248 LNAVGTVGPVSVAIEADQAGFQFYSSGVFSG-TCGHNLDHGVLAVGYGTTG-------SQ 299
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNVCGV 354
YWI+KNSWG +WGE+GY ++ +N CG+
Sbjct: 300 DYWIVKNSWGTSWGESGYIRMIRNKNQCGI 329
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 125/346 (36%), Positives = 182/346 (52%), Gaps = 27/346 (7%)
Query: 4 LILSSLLLLLLSSV-LASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
+ LS +LL+ S + +A+ + I+Q V S E F + +
Sbjct: 1 MRLSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREA-----------FDFWVQTLKR 49
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
YA+ EE++ RF V+ NLR + + ++DL+ E+R + LG N L
Sbjct: 50 AYASAEEYERRFDVWLDNLRFVHEYNAGHTSHWLSMGVYADLSQDEYRSKALGYNADLHE 109
Query: 123 PADAQKAPILPTNDLP-TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
+ AP L +P + DW GAVT VK+Q CGSCW+FS TGA+EGA ++TG+L
Sbjct: 110 ERPLRAAPFLYEGTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKL 169
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
SLSEQ LVDCD E D+GC+GGLM+ AFE+I+K GG++ E DYPYT +G
Sbjct: 170 ASLSEQMLVDCDRE--------RDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMC 221
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICG 299
+ + ++ + +++ V + P++V I A Q Y GGV CG
Sbjct: 222 QDNKMRRHVVTIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVF-DAECG 280
Query: 300 KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
LDHGVL+VGYG++ PYW++KNSWG WG+ GY ++
Sbjct: 281 TALDHGVLVVGYGTASNGTHHL---PYWLVKNSWGAEWGDKGYIRL 323
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/327 (38%), Positives = 174/327 (53%), Gaps = 29/327 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
+LL E H LFK+ K Y +Q E +R +++ N + + +L + + + K
Sbjct: 21 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILFEKGEKSYQVAMNK 78
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N ++P DWR+ GA+T VKDQG
Sbjct: 79 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 138
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CG CW+FS+TGALEG F TG+LVSL EQ L+DC + E GCNGGLM+ AF
Sbjct: 139 CGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGNE-------GCNGGLMDQAF 191
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
+YI G++ E YPY D C+++ A F + S +ED++ A + GP+
Sbjct: 192 QYIKDNKGIDTENTYPYEAED-DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 250
Query: 277 AVGINAVW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
+V I+A Q Y GV C LDHGVL+VGYGS K YW++KNSW
Sbjct: 251 SVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDN-------GKDYWLVKNSW 303
Query: 334 GENWGENGYYKICMGR-NVCGVDSMVS 359
E+WG+ GY KI R N CGV + S
Sbjct: 304 SEHWGDQGYIKIARNRKNHCGVATAAS 330
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 132/365 (36%), Positives = 195/365 (53%), Gaps = 41/365 (11%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+L+L+ S L +++A D +++ S+ +S D L+ F + S+ K Y
Sbjct: 8 ALVLIACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYENI 62
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLP 123
EE RF +FK NL+ R + G+ +F+DL+ EF ++LGL +RR P
Sbjct: 63 EEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHREFNNKYLGLKVDYSRRRESP 122
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+ + +LP DWR GAV VK+QG+CGSCW+FS A+EG + + TG L S
Sbjct: 123 EEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 178
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E+DYPY + G+C+
Sbjct: 179 LSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGTCE 229
Query: 244 FDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGK 300
K + +S + + + +Q + + PL+V I A Q Y GGV + CG
Sbjct: 230 MTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGH-CGS 288
Query: 301 YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN------VCGV 354
LDHGV VGYG++ K Y +KNSWG WGE GY I M RN +CG+
Sbjct: 289 DLDHGVAAVGYGTA-------KGVDYITVKNSWGSKWGEKGY--IRMRRNIGKPEGICGI 339
Query: 355 DSMVS 359
M S
Sbjct: 340 YKMAS 344
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 172/312 (55%), Gaps = 29/312 (9%)
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
SK K+Y + EE +RF VF+ NL+ + G+ +F+DL+ EF+R++LGL
Sbjct: 2 SKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLK 61
Query: 118 RRLRLPADA-QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
L D+ ++ DLP DWR GAV VK+QGACGSCW+FS A+EG + +
Sbjct: 62 IELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQI 121
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG L +LSEQ+L+DCD ++GCNGGLM+ AF +I+ GG+ +E+DYPY
Sbjct: 122 VTGNLTALSEQELIDCDK--------PFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV- 172
Query: 237 TDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGGVS 293
+ G+C K ++ +S + + D +Q + + PL+V I A Q Y GG+
Sbjct: 173 MEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIF 232
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV-- 351
+ CG LDHGV VGYG+S K Y +KNSWG WGE GY I M RNV
Sbjct: 233 NGH-CGTELDHGVAAVGYGTS-------KGVDYITVKNSWGSKWGEKGY--IRMKRNVGK 282
Query: 352 ----CGVDSMVS 359
CG+ M S
Sbjct: 283 PEGICGIYKMAS 294
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 126/342 (36%), Positives = 178/342 (52%), Gaps = 34/342 (9%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
+PSDG+ D + + + + ++ KT + D RF +FK NLR
Sbjct: 33 LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEN 90
Query: 91 DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
+ A + G+TKF+DLT E+R+ +LG RR+ + + N ++P D
Sbjct: 91 NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR GAV +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
S + GCNGGLM+ AF++I+K GG+ EKDYPY G G F K+ ++ + + +
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262
Query: 263 EDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIR 320
++ + + P+ V I A Q Y G+ CG LDH V+ VGYGS
Sbjct: 263 DETALKKAISYQPVRVAIEAGGRIFQHYQSGIFTGS-CGTNLDHAVVAVGYGSENGV--- 318
Query: 321 FKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSSVA 362
YWI++NSWG WGE GY I M RN+ S +A
Sbjct: 319 ----DYWIVRNSWGPRWGEEGY--IRMERNLAASKSGKCGIA 354
>gi|33242884|gb|AAQ01146.1| cathepsin [Petromyzon marinus]
Length = 333
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 121/317 (38%), Positives = 174/317 (54%), Gaps = 26/317 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL---DPTAVH-GVTKFSDLTPS 107
+ +KS + K Y +++E +R VF+ NL+R + LL + H G+ K+SDL
Sbjct: 26 QWDTWKSTYGKHYGSEQEDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAP--ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
E+ + +G LR + AP + ++LP DWR G VT VK+QG CGS W+FS
Sbjct: 86 EYHEKVVGRFWNLRNGTRRRGAPFPLRSMDNLPEQVDWRLKGYVTPVKEQGLCGSSWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HF +TG L SLSEQQLVDC ++GCNGG A +YI+ G
Sbjct: 146 ATGSLEGQHFAATGNLTSLSEQQLVDC-------TKSYYNNGCNGGRSERALQYIIDNNG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI--SSDEDQMAANLVKHGPLAVGINAV 283
++ E YPY D G C+F + +A S++ + SS+E+ + + GP+A+ +NA
Sbjct: 199 IDSELSYPYEHAD-GKCRFKPANVATKCSSYQFVEPSSNEEVLRQAVASVGPIAIAMNAD 257
Query: 284 W--MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
+ Y G+ C K +H +L+VGYGS +WI+KNSWGE+WGE G
Sbjct: 258 LDTFKHYKSGLFNEPSCDKSPNHAMLVVGYGS-------LSGNDFWIVKNSWGEDWGEKG 310
Query: 342 Y-YKICMGRNVCGVDSM 357
Y Y I N CG+ S+
Sbjct: 311 YIYMIRNKDNQCGIASI 327
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 126/326 (38%), Positives = 180/326 (55%), Gaps = 39/326 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRR 111
+ L+ ++ KTY E + RFR+F NL+ L + G+ +F+DLT E+R
Sbjct: 36 YELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGLNQFADLTNEEYRS 95
Query: 112 QFLGLN-RRLRLPADAQKAPI-----LPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSF 164
+LG R A Q+ I + N++ P DWR+ GAV+ VK+QG CGSCW+F
Sbjct: 96 MYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVSPVKNQGGCGSCWAF 155
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S ++EG + + TG+L+SLSEQ+LVDCD++ +SGCNGG M+ AF++I+ G
Sbjct: 156 STVASVEGINKIVTGDLISLSEQELVDCDNK--------YNSGCNGGSMDYAFQFIVSNG 207
Query: 225 GVEREKDYPYTGTDGGSCK--FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA 282
G++ E DYPY G G C +K+KI ++ + + ++ V H P++VGI A
Sbjct: 208 GIDSESDYPYKGV-GAVCDPVRNKAKI-VSIDGYEDVPPMNEKALMKAVAHQPVSVGIEA 265
Query: 283 VW--MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y GV CG LDHGV++VGYGS K YWI++NSWG WGE+
Sbjct: 266 SGRAFQLYTSGVLTGS-CGTNLDHGVVVVGYGSE-------NGKDYWIVRNSWGPEWGED 317
Query: 341 GYYKICMGRN-------VCGVDSMVS 359
GY I M RN +CG+ M S
Sbjct: 318 GY--IRMERNMVDTPVGMCGITLMAS 341
>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
Length = 360
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 136/377 (36%), Positives = 190/377 (50%), Gaps = 54/377 (14%)
Query: 10 LLLLLSSVLASAVAVND----DDAMIRQVVPSDGEQSEDHLLNA------EHHFSLFKSK 59
L +L VLA AV + D IR V E + A F+ F +
Sbjct: 6 LFVLAVVVLADTAAVVNSGFADSNPIRPVTDRAASALESTVFAALGRTRDALRFARFAVR 65
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--- 116
+ K+Y + E RFR+F +L+ + + G+ +F+D++ EFR LG
Sbjct: 66 YGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRATRLGAAQN 125
Query: 117 -------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
N R+R A A LP DWR+ G V+ VK+QG CGSCW+FS TGA
Sbjct: 126 CSATLTGNHRMRAAAVA----------LPETKDWREDGIVSPVKNQGHCGSCWTFSTTGA 175
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LE A+ +TG+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E
Sbjct: 176 LEAAYTQATGKPISLSEQQLVDCGFAFN-------NFGCNGGLPSQAFEYIKYNGGLDTE 228
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAVW-M 285
+ YPY G + G CKF + V N ++ + DE + A LV+ P++V +
Sbjct: 229 ESYPYQGVN-GICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVR--PVSVAFEVITGF 285
Query: 286 QTYIGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
+ Y GV CG ++H VL VGYG PYW+IKNSWG +WG+ GY
Sbjct: 286 RLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVE-------DGVPYWLIKNSWGADWGDEGY 338
Query: 343 YKICMGRNVCGVDSMVS 359
+K+ MG+N+CGV + S
Sbjct: 339 FKMEMGKNMCGVATCAS 355
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 127/318 (39%), Positives = 171/318 (53%), Gaps = 21/318 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+KS +K Y +EE +R V++ NL+ + L H G+ +F D+T
Sbjct: 9 HWQLWKSWHNKDYHEREE-SWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTE 67
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EFR+ G + + P+ + P DWR+ G VT VKDQG CGSCW+FS
Sbjct: 68 EFRQLMNGYAHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFST 127
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ GG+
Sbjct: 128 TGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNQGCNGGLMDQAFQYVQDNGGI 180
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--V 283
+ E+ YPYT D C++ AA + F + E + + GP++V I+A
Sbjct: 181 DSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHS 240
Query: 284 WMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y G+ P + LDHGVL+VGY GF K YWI+KNSWGE WG+ GY
Sbjct: 241 SFQFYQSGIYYEPDCSSEDLDHGVLVVGY---GFEGEDVDGKKYWIVKNSWGEKWGDKGY 297
Query: 343 YKICMGR-NVCGVDSMVS 359
+ R N CG+ + S
Sbjct: 298 IYMAKDRKNHCGIATAAS 315
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 114/305 (37%), Positives = 169/305 (55%), Gaps = 25/305 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ L+ ++ + Y +E RF VFK N + + G+ +F+DL+ EF+
Sbjct: 42 YELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+LG +RL P +++ DLP DWR+ GAVT VKDQG+CGSCW+FS
Sbjct: 102 YLGAKLDTKKRLSRPP-SRRYQYSDGEDLPESIDWREKGAVTSVKDQGSCGSCWAFSTVA 160
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EG + + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++
Sbjct: 161 AVEGINQIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIINNGGLDS 212
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQ 286
E+DYPYT DG + K+ + ++ + ++++ + P++V I A Q
Sbjct: 213 EEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGREFQ 272
Query: 287 TYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKIC 346
Y GV CG LDHGV +VGYGS YW +KNSWG++WGE G+ +
Sbjct: 273 FYDSGVFTS-TCGTQLDHGVTLVGYGSE-------SGTDYWTVKNSWGKSWGEEGFIR-- 322
Query: 347 MGRNV 351
+ RN+
Sbjct: 323 LQRNI 327
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 133/370 (35%), Positives = 191/370 (51%), Gaps = 30/370 (8%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R + + + L L++ V + + N+ ++ + + + L+ AE +S FK+ K
Sbjct: 2 RPLEALIRLFLVTHVPLNGIWKNEGFVVLGCLFVTAAAITHQELVGAE--WSAFKALHGK 59
Query: 63 TYATQEEHDYRFRVFKANLRRAKR--RQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLNR 118
Y ++ E YR +++ N + R + + A + + +F DL EF G R
Sbjct: 60 EYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKR 119
Query: 119 RLRLPADAQKAPILPT----NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
R I P LP DWR GAVT VK+QG CGSCW+FS TG+LEG H
Sbjct: 120 NYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQH 179
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F TG +VSLSEQ LVDC + ++GC GGLM++AF+YI GG++ E YPY
Sbjct: 180 FRKTGRMVSLSEQNLVDCSGKFG-------NNGCEGGLMDNAFKYIKANGGIDTELSYPY 232
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINAVW--MQTYIGG 291
GTD G C F+KS + A + F I +Q+ V GP++V I+A Q Y G
Sbjct: 233 NGTD-GICHFEKSDVGATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQG 291
Query: 292 V-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR- 349
V P + LDHGVL+VGYG+ + YW++KNSWG WG++GY + +
Sbjct: 292 VYDEPECSSESLDHGVLVVGYGTK-------DGQDYWLVKNSWGTTWGDDGYIYMTRNKE 344
Query: 350 NVCGVDSMVS 359
N CG+ S S
Sbjct: 345 NQCGIASSAS 354
>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
Length = 308
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 122/312 (39%), Positives = 168/312 (53%), Gaps = 22/312 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSDLTPSEFRR 111
+KS + Y T EE ++R +++ N+R + HG + F D+T EFR+
Sbjct: 6 WKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQ 64
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
G + + P++ +P DWR+ G VT VK+QG CGSCW+FSA+G LE
Sbjct: 65 VVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLE 122
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+YI + GG++ E+
Sbjct: 123 GQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKENGGLDSEES 175
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYI 289
YPY D GSCK+ A + F I E + + GP++V ++A +Q Y
Sbjct: 176 YPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYS 234
Query: 290 GGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
G+ P K LDHGVL+VGYG G + K YW++KNSWG WG GY KI
Sbjct: 235 SGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWGMEGYIKIAKD 291
Query: 349 R-NVCGVDSMVS 359
R N CG+ + S
Sbjct: 292 RDNHCGLATAAS 303
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 175/317 (55%), Gaps = 34/317 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + K K+Y T +E R+ +F+ N+ + + G+ +DLT E++R
Sbjct: 32 FQNWMVKHQKSY-TNDEFGSRYTIFQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRI 90
Query: 113 FLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+LG ++ P I+ D+ P DWR +GAVT VK+QG CG C+SFS TG+
Sbjct: 91 YLGTKTTVKKPN-----LIIGVTDVSKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGS 145
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVER 228
+EG H +++ +LVSLSEQQ++DC SGS ++GC+GGLM ++FEYI+ GG++
Sbjct: 146 VEGIHEITSKQLVSLSEQQILDC--------SGSEGNNGCDGGLMTNSFEYIIAVGGLDT 197
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQ 286
E YPY G G CKF+K+ I A ++ + + S + V P++V I+A Q
Sbjct: 198 EASYPYEGVV-GKCKFNKANIGATITGYKNVKSGSESDLQTAVAAQPVSVAIDASQNSFQ 256
Query: 287 TYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Y GV P LDHGVL VGYGS + YWI+KNSWG +WGE G+ I
Sbjct: 257 LYSSGVYYEPACSSTQLDHGVLAVGYGSQ-------SGQDYWIVKNSWGADWGEKGF--I 307
Query: 346 CMGRNV---CGVDSMVS 359
M RN CG+ +M S
Sbjct: 308 LMARNKHNNCGIATMAS 324
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 134/362 (37%), Positives = 188/362 (51%), Gaps = 37/362 (10%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQ--------SEDHLLNAEHHFSL 55
L+ +++ LL+ +S L +DD + P + Q E H +A FS
Sbjct: 65 LVAAAVSLLVFASFLIQWQG--EDDRAVFPPSPVEDHQPPANIWEWKEAHFQDA---FSS 119
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F++ ++K+YAT+EE R+ +FK NL + + F DL+ EFRR++LG
Sbjct: 120 FQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLG 179
Query: 116 LNRRLRLPAD-----AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+ L + + +LP+ +LP DWR G VT VKDQ CGSCW+FS TGAL
Sbjct: 180 FKKSRNLKSHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGAL 238
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EGAH TG+LVSLSEQ+L+DC + C+GG MN AF+Y+L +GG+ E
Sbjct: 239 EGAHCAKTGKLVSLSEQELMDCSR-------AEGNQSCSGGEMNDAFQYVLDSGGICSED 291
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINAVWM--QT 287
YPY D C+ + + F V E M A L K P+++ I A M Q
Sbjct: 292 AYPYLARD-EECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMPFQF 349
Query: 288 YIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
Y GV CG LDHGVL+VGYG+ + +K +WI+KNSWG WG +GY + M
Sbjct: 350 YHEGV-FDASCGTDLDHGVLLVGYGTD-----KESKKDFWIMKNSWGTGWGRDGYMYMAM 403
Query: 348 GR 349
+
Sbjct: 404 HK 405
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 130/336 (38%), Positives = 171/336 (50%), Gaps = 40/336 (11%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDL 104
+ + FK + K Y ++ E++YR VF NL + L + + DL
Sbjct: 24 VQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDL 83
Query: 105 TPSEFRRQF------LGLNRRLR-------LPADAQK--APILPTN----DLPTDFDWRD 145
T EF R + L + L LP D Q LPTN DLPTD DWR
Sbjct: 84 TKDEFMRIYTVNMPQLPQSENLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWRQ 143
Query: 146 HGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCD 205
GAVT VK+Q CGSCWSFSATGALE F T +L+SLSEQQLVDC +
Sbjct: 144 KGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYG-------N 196
Query: 206 SGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQ 265
GC+GG M+ AF YI + GG++ E+ YPYT D G C + AA VS ++ E+Q
Sbjct: 197 HGCHGGWMHWAFGYIKENGGIDTEQSYPYTAKD-GRCAYKPGNKAATVSQVIMVPRGENQ 255
Query: 266 MAANLVKHGPLAVGINAVW-MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
+AA + GP+++ Q Y GV CG L+H +L VGYGS G K
Sbjct: 256 LAAKVSSVGPISIAAEVSHKFQFYHSGVYDEPQCGHSLNHAMLAVGYGSMG-------GK 308
Query: 325 PYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
+W++KNSWG WG+ GY ++ + N CG+ M S
Sbjct: 309 NFWLVKNSWGTGWGDQGYIRMAKDKNNQCGIALMAS 344
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 182/335 (54%), Gaps = 38/335 (11%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEE-HDYRFRVFKANLRRAKRRQLLDPTAVH-GVT 99
S D L+ E ++ + +KF K A+ D+RF FK N R + + G+
Sbjct: 4 SSDSDLSGE--YASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLN 61
Query: 100 KFSDLTPSEFRRQFLGLNRRL------RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
+FSDLT EFR++FLGL L ++P D+ DLP DWR HGAVT K
Sbjct: 62 QFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPK 121
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
DQG+CG CW+F+ TGA+EG + + TG+LVSLSEQ+L+DCD + D GC+GGLM
Sbjct: 122 DQGSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKK--------ADKGCDGGLM 173
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVK 272
+A+++I++ GG++ E DYPY ++ C K + A+ + I ++Q V
Sbjct: 174 ENAYQFIVENGGLDTETDYPYHASE-SHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVA 232
Query: 273 HGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIK 330
P++V I Q Y GV + CG+ ++HGVLIVGYG+ YWI+K
Sbjct: 233 KQPVSVAIEGASKDFQHYASGVFTGH-CGEEINHGVLIVGYGTE-------DGLDYWIVK 284
Query: 331 NSWGENWGENGYYKICMGRN------VCGVDSMVS 359
NSW WG+ G+ K M RN +C ++++ S
Sbjct: 285 NSWAATWGDGGFVK--MQRNTGKRGGLCSINTLAS 317
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 124/360 (34%), Positives = 191/360 (53%), Gaps = 38/360 (10%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
S+ LL S++L ++A++ ++++ R + D ++ + + + K+Y +
Sbjct: 9 SMSLLFFSTLLILSLALDIENSVQR---------TNDQVM---AMYESWLVEQGKSYNSL 56
Query: 68 EEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA 126
+E + RF +FK NLR + + G+ +F+DLT E+R +LGL +
Sbjct: 57 DEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDVSN 116
Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
+ P + LP DWR GAV GVK+QG C SCW+FSA A+EG + + TG L+SLSE
Sbjct: 117 EYMPKV-GEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSE 175
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
Q+LVDC + GCN GLM AF++I+ GG+ E +YPYT DG K
Sbjct: 176 QELVDCGRTQRTK-------GCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLK 228
Query: 247 SKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDH 304
++ + N+ + S+ + V + P++VG+ + + Y G+ + CG +DH
Sbjct: 229 NQKYVTIDNYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGF-CGTAVDH 287
Query: 305 GVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV-----CGVDSMVS 359
GV IVGYG+ + YWI+KNSWG NWGENGY +I RN+ CG+ M S
Sbjct: 288 GVTIVGYGTE-------RGMDYWIVKNSWGTNWGENGYIRI--QRNIGGAGKCGIARMPS 338
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 120/302 (39%), Positives = 162/302 (53%), Gaps = 34/302 (11%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
+ D RF +FK NLR + A + G+T F++LT E+R +LG RR+
Sbjct: 24 QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83
Query: 123 PADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ ND+ P DWR GAV +KDQG CGSCW+FS A+EG + + TGE
Sbjct: 84 AKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQ+LVDCD S + GCNGGLM+ AF++I+K GG+ EKDYPY GT+G
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGK 195
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYIC 298
K+ + + + S ++ V + P++V I+A Q Y G+ C
Sbjct: 196 CNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGK-C 254
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV------C 352
G +DH V+ VGYGS YWI++NSWG WGE+GY I M RNV C
Sbjct: 255 GTNMDHAVVAVGYGSENGV-------DYWIVRNSWGTRWGEDGY--IRMERNVASKSGKC 305
Query: 353 GV 354
G+
Sbjct: 306 GI 307
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 193/355 (54%), Gaps = 32/355 (9%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMI---RQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
L + +L++L +VLA + A+ D ++I R G +S++ +++ + + K K
Sbjct: 7 LMATILIVLFTVLAVSSAL--DMSIISYDRSHADKSGWKSDEEVMSIYEEWLV---KHGK 61
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RR 119
Y EE + RF++FK NL + ++ T G+ +FSDL+ E+R ++LG R
Sbjct: 62 VYNAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKVGLNRFSDLSNEEYRSKYLGTKIDPSR 121
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ + +P + N LP DWR GAV VK+Q C CW+FSA A+EG + + TG
Sbjct: 122 MMARPSRRYSPRVADN-LPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTG 180
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L +LSEQ+L+DCD + ++GC+GGL++ AFE+I+ GG++ E+DYP+ G DG
Sbjct: 181 NLTALSEQELLDCDR--------TVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADG 232
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYI 297
++ + A + + + + ++ V + P++V I A Q Y G+
Sbjct: 233 ICDQYKINARAVTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTG-T 291
Query: 298 CGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVC 352
CG +DHGV VGYG+ YWI+KNSWGENWGE GY + M RN+
Sbjct: 292 CGTSIDHGVTAVGYGTENGI-------DYWIVKNSWGENWGEAGY--VGMERNIA 337
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 163/312 (52%), Gaps = 33/312 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FK+KF K YA EE +R VF L+ +R + T + FSDLT E
Sbjct: 23 FKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEVLA 82
Query: 112 QFLGLNRRLR----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
G+ RR LP A PT + D DWR+ GAVT VKDQG CGSCW+FSA
Sbjct: 83 TKTGMTRRRHPLSVLPKSA------PTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAV 136
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
ALEGAHFL TG+LVSLSEQ LVDC S + GCNGG A++YI+ G++
Sbjct: 137 AALEGAHFLKTGDLVSLSEQNLVDC-------SSSYGNQGCNGGWPYQAYQYIIANRGID 189
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--VW 284
E YPY D +C++D I A VS++ S DE + + GP++V I+A
Sbjct: 190 TESSYPYKAID-DNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSS 248
Query: 285 MQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
+Y GGV C Y +H V VGYG+ YWI+KNSWG WGE+GY
Sbjct: 249 FGSYGGGVYYEPNCDSWYANHAVTAVGYGTDA------NGGDYWIVKNSWGAWWGESGYI 302
Query: 344 KICMGR-NVCGV 354
K+ R N C +
Sbjct: 303 KMARNRDNNCAI 314
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 129/325 (39%), Positives = 174/325 (53%), Gaps = 32/325 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++H+ +K+ K Y +EE +R V++ NL++ + L H G+ +F D+T
Sbjct: 26 DNHWEQWKNWHGKKYHEKEE-GWRRMVWEKNLQKIELHNLEHSMGTHTYRLGMNRFGDMT 84
Query: 106 PSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACG 159
EFR+ G RR R + + N ++P DWR+ G VT VKDQG CG
Sbjct: 85 HEEFRQVMNGYKHKKERRFR------GSLFMEPNFLEVPNSLDWREKGYVTPVKDQGECG 138
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 139 SCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQY 191
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAV 278
I G++ E+ YPY GTD C +D AA + F + S E + + GP++V
Sbjct: 192 IKDQNGLDSEESYPYVGTDDQPCHYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVSV 251
Query: 279 GINA--VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
I+A Q Y G+ C + LDHGVL VGY GF K YWI+KNSW E
Sbjct: 252 AIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGY---GFEGEDVDGKKYWIVKNSWSE 308
Query: 336 NWGENGYYKICMGR-NVCGVDSMVS 359
NWG+ GY + R N CG+ + S
Sbjct: 309 NWGDKGYVYMAKDRHNHCGIATAAS 333
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 121/355 (34%), Positives = 187/355 (52%), Gaps = 29/355 (8%)
Query: 7 SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD--GEQSEDHLLNAEHHFSLFKSKFSKTY 64
S +L++L+ L +A D + SD +S+ + N + + K +
Sbjct: 8 SPMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNI 67
Query: 65 ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------R 118
E+ D RF +FK NL+ + T G+ +F+DL+ E+R ++LG
Sbjct: 68 DGSEK-DKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMM 126
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R + + + LP DWR GAV VKDQG+CGSCW+FS A+EG + + T
Sbjct: 127 MARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVT 186
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GELVSLSEQ+LVDCD + ++GC+GGLM AFE+I+ GG++ ++DYPY G D
Sbjct: 187 GELVSLSEQELVDCDR--------TVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVD 238
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPY 296
G ++ K+ ++ ++ + + ++ V + P++V I A Q Y+ G+
Sbjct: 239 GKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGK 298
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG LDHGV VGYG+ YWI++NSWG++WGE+GY + M RN+
Sbjct: 299 -CGTALDHGVTAVGYGTENGV-------DYWIVRNSWGKSWGESGYVR--MERNL 343
>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
Length = 335
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 172/320 (53%), Gaps = 24/320 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++H+ +K KTYA +EE +R +++ NL+ + L H G+ +F D+T
Sbjct: 26 DNHWYSWKDWHKKTYAPKEE-GWRRVLWEKNLKMIEFHNLDHSLGKHSYRLGMNQFGDMT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF++ G + + AP + P DWR G VT VKDQG CGSCW+FS
Sbjct: 85 NEEFKQLMNGYKNQKMIRGSTFLAP--NNFEAPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG H+ T +L+SLSEQ LVDC + GCNGGLM+ AF+Y+ GG
Sbjct: 143 TTGALEGQHYRKTSKLISLSEQNLVDC-------SRAQGNEGCNGGLMDQAFQYVKDNGG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS--DEDQMAANLVKHGPLAVGINA- 282
++ E YPYT D C +D + +A + F + S ++D M A + GP++V I+A
Sbjct: 196 IDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCEKDLMKA-VASVGPVSVAIDAG 254
Query: 283 -VWMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y G+ P + LDHGVL+VGY GF K YWI+KNSW E WG+N
Sbjct: 255 HQSFQFYQSGIYYEPECSSEDLDHGVLVVGY---GFESEDVDGKKYWIVKNSWSEKWGDN 311
Query: 341 GYYKICMGR-NVCGVDSMVS 359
GY I R N CG+ + S
Sbjct: 312 GYINIAKDRHNHCGIATAAS 331
>gi|47779249|gb|AAT38521.1| cysteine protease [Bombyx mori NPV]
Length = 323
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 115/325 (35%), Positives = 179/325 (55%), Gaps = 29/325 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEANC 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGI 280
+ GGV+ E DYPY D +C+ + +K V + + I E+++ L GP+ + I
Sbjct: 188 RMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 281 NAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+A + Y G+ Y L+H VL+VGYG PYW KN+WG +WGE+
Sbjct: 247 DAADIVNYKQGI-IKYCFNSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGED 298
Query: 341 GYYKICMGRNVCGVDSMVSSVAAIH 365
G++++ N CG+ + ++S A I+
Sbjct: 299 GFFRVQQNINACGMRNELASTAVIY 323
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 126/322 (39%), Positives = 172/322 (53%), Gaps = 30/322 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
F ++ KF +TY++ E R + + N + +L + G+T F+D+
Sbjct: 25 EFHAWRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENE 84
Query: 108 EFRRQF----LGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCW 162
E++R LG + LP LP N DLP DWRD G VT VKDQ CGSCW
Sbjct: 85 EYKRLISQGCLG-SFNASLPRRGSTFFRLPENKDLPAAVDWRDKGYVTDVKDQKQCGSCW 143
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSATG+LEG F TG+LVSLSEQQLVDC + + GC GGLM+ AF YI
Sbjct: 144 AFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYG-------NMGCGGGLMDDAFRYIQA 196
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGIN 281
GG++ E+ YPY D G C++ + A + + +SS DED + + GP++VGI+
Sbjct: 197 TGGIDTEESYPYEAED-GECRYKPDAVGATCTGYVDVSSGDEDALQEAVATIGPISVGID 255
Query: 282 A--VWMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A + Q Y G+ P LDHGVL VGYGS + YW++KNSWG WG
Sbjct: 256 ASHISFQLYESGLYDEPQCSSSELDHGVLAVGYGSE-------NGQDYWLVKNSWGLTWG 308
Query: 339 ENGYYKICMGR-NVCGVDSMVS 359
+ GY K+ + N CG+ + S
Sbjct: 309 DQGYIKMSKNKSNQCGIATAAS 330
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 172/317 (54%), Gaps = 33/317 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRR 111
+K K+ K+Y + E R RV+++NL+ ++ +L G+ ++DL +
Sbjct: 22 WKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADL----YNE 77
Query: 112 QFLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+F+ L + + + Q L LP+ DWR+ G VT VKDQG CGSCWSFSA
Sbjct: 78 EFMALKGSSGILQAKDQSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFSA 137
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG+LEG HF TG LVSLSEQQLVDC + GC+GGLM SA++YI AGGV
Sbjct: 138 TGSLEGQHFAKTGTLVSLSEQQLVDCSWSYG-------NYGCSGGLMESAYDYIRDAGGV 190
Query: 227 EREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW- 284
+ E YPYT + G C FD+SK +A + ++ S DE + + GP+AV I+A
Sbjct: 191 QLESAYPYTAQN-GRCHFDQSKAVATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGY 249
Query: 285 -MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GV C LDHGVL GYG+ G YW++KNSWG WG GY
Sbjct: 250 DFQLYESGVYDRSRCSSSSLDHGVLAAGYGTEG-------GNDYWLVKNSWGPGWGAQGY 302
Query: 343 YKICMGR-NVCGVDSMV 358
K+ + N CG+ +M
Sbjct: 303 IKMSRNKSNQCGIATMA 319
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 122/336 (36%), Positives = 176/336 (52%), Gaps = 40/336 (11%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDG------------GSCKFDKSKIAAAVSNFS 257
GGLM+ AF++I+ GG++ E DYPY G D F K+ + ++
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYE 253
Query: 258 VISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSG 315
++ + + V + P++V I A Q Y G+ CG LDHGV VGYG+
Sbjct: 254 DVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE- 311
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
K YWI++NSWG++WGE+GY + M RN+
Sbjct: 312 ------NGKDYWIVRNSWGKSWGESGYVR--MERNI 339
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 165/310 (53%), Gaps = 22/310 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
+KS K Y Q E D+R VF N++ T + +FSDLT EF + + G
Sbjct: 28 WKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHNA-KSTFKMAINEFSDLTRKEFVKTYNG 86
Query: 116 LNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
++ + + P N ++PT+ DWR G VT +K+QG CGSCW+FS TG+LEG H
Sbjct: 87 YRLSMKKSTNKPSTFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQH 146
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F TG+LVSLSEQ L+DC + + GC GG M+ AFEYI G++ E YPY
Sbjct: 147 FRKTGKLVSLSEQNLIDC-------SAAEGNDGCGGGFMDDAFEYIKLNNGIDTEASYPY 199
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINA---VWMQTYIG 290
G D C++ K+ A + + I ED + A + GP++V I+A + + G
Sbjct: 200 EGRD-DICRYKKTNKGAIDTGYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHMYHTG 258
Query: 291 GVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR- 349
P LDHGVL+VGYG+ + YW++KNSWG +WG NGY K+ R
Sbjct: 259 VYHEPECSQTVLDHGVLVVGYGTE-------NGEDYWLVKNSWGTDWGMNGYIKMSRNRS 311
Query: 350 NVCGVDSMVS 359
N CG+ + S
Sbjct: 312 NNCGIATNAS 321
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 112/298 (37%), Positives = 163/298 (54%), Gaps = 21/298 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ + +K K+Y E + RF++FK NLR + T G+ +F+DLT E+R
Sbjct: 53 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSM 112
Query: 113 FLGLN---RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+LG +R + + + LP DWR GAV VKDQG+CGSCW+FS A
Sbjct: 113 YLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAA 172
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E
Sbjct: 173 VEGINKIVTGGLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDSE 224
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQT 287
+DYPY +DG ++ K+ + + + ++++ V + P++V I A Q
Sbjct: 225 EDYPYKASDGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQL 284
Query: 288 YIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Y G+ CG LDHGV VGYG+ YWI+KNSWG +WGE GY ++
Sbjct: 285 YQSGIFTGR-CGTALDHGVTAVGYGTENGV-------DYWIVKNSWGASWGEEGYIRM 334
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 172/318 (54%), Gaps = 31/318 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRR 111
FK+ + Y EE R VF+ NL++ + L G+ +F+D+ EF
Sbjct: 47 FKTVHERNYGETEEMQ-RKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFAS 105
Query: 112 QFLG--LNRRLRLPADAQK---APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
G +N R ++ +P +P + LP + DWR G VT +KDQG CGSCWSFS
Sbjct: 106 VVNGFRMNNRTKVRDHLHSHYISPAIPVS-LPAEVDWRKEGYVTPIKDQGHCGSCWSFST 164
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF TG+LVSLSEQ L+DC + ++GCNGG+M+ AF+YI G
Sbjct: 165 TGALEGQHFRKTGKLVSLSEQNLIDC-------STSYGNNGCNGGVMDYAFQYIKDNDGD 217
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINA--V 283
+ E YPY D G C+F K + A + ++ + DE++M + GP++V I+A
Sbjct: 218 DTEDSYPYEAAD-GPCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHT 276
Query: 284 WMQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GV C + LDHGVL+VGYG+ + YW++KNSWG WG+ GY
Sbjct: 277 SFQMYQSGVYDEVECDPEGLDHGVLVVGYGTE-------LGQDYWLVKNSWGTKWGDEGY 329
Query: 343 YKICMGR-NVCGVDSMVS 359
K+ + N CG+ SM S
Sbjct: 330 IKMSRNKNNQCGISSMAS 347
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 126/328 (38%), Positives = 178/328 (54%), Gaps = 34/328 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR----QLLDPTAVHGVTKFSDLTPSE 108
+ LFK++ K Y E +R ++F N ++ + Q + G+ K+SD+ E
Sbjct: 27 WQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRGEVGYKLGLNKYSDMLHHE 86
Query: 109 FRRQFLGLNRRLRLP---ADAQKAP------ILPTN-DLPTDFDWRDHGAVTGVKDQGAC 158
F F G N+ + P ++ K I P N LP DW GAVT VKDQG C
Sbjct: 87 FINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPANVKLPKHVDWVKLGAVTPVKDQGHC 146
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+FSATGALEG HF T LVSLSEQ L+DC E ++GCNGGLM+ AF+
Sbjct: 147 GSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTE-------EGNNGCNGGLMDQAFQ 199
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLA 277
Y+ GG++ E+ YPY G + C+++ A + ++ V DED + + + GP++
Sbjct: 200 YVRINGGIDTERSYPYEGNN-DVCRYEPENSGAIDTGYTDVPLGDEDALKSAVATVGPVS 258
Query: 278 VGINAVW--MQTYIGGVSCPYICG---KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNS 332
V I+A Q Y GV C + LDHGVL+VGYG+ ++ YW++KNS
Sbjct: 259 VAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTD-----EETQQDYWLVKNS 313
Query: 333 WGENWGENGYYKICM-GRNVCGVDSMVS 359
WG++WGENGY K+ N CG+ + S
Sbjct: 314 WGDSWGENGYIKMARNADNQCGIATQPS 341
>gi|148908373|gb|ABR17300.1| unknown [Picea sitchensis]
Length = 357
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 129/372 (34%), Positives = 200/372 (53%), Gaps = 38/372 (10%)
Query: 6 LSSLLLLLLSSVLASAVAVND----DDAMIRQVVPSDGEQSEDHLLN------AEHHFSL 55
++ +L ++LS++LA A+AV+ ++ +V + E L F+
Sbjct: 1 MARILAIVLSTLLALAIAVSAARSFEETEYIDMVTDKIQNLESSLFKILGTNPKSVQFAE 60
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F ++ K Y + + +RF F N+ + R ++ + +F+D+T EF Q+LG
Sbjct: 61 FALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLAINEFADITWEEFHGQYLG 120
Query: 116 LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
++ K PT DWR+ G V+ VK+Q CGSCW+FS TGALE A+
Sbjct: 121 ASQNCSATKSNHK---FTDAQPPTKKDWREEGIVSPVKNQAHCGSCWTFSTTGALEAAYT 177
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPY 234
+TG+ V LSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E+ YPY
Sbjct: 178 QATGKTVILSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPY 229
Query: 235 TGTDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIG 290
T D G C +D + + V+ N S+ + D+ + A LV+ P++V + + Y
Sbjct: 230 TAKD-GVCNYDVNNVGVKVADSVNISLGAEDKLKSAVGLVR--PVSVAFQVIQDFRFYKE 286
Query: 291 GVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM 347
GV CG+ ++H VL VGYG S + P+WIIKNSWG++WG GY+K+ M
Sbjct: 287 GVFTSTTCGQGPMDVNHAVLAVGYGVSE------EGTPHWIIKNSWGKSWGVEGYFKMEM 340
Query: 348 GRNVCGVDSMVS 359
G+N+CGV + S
Sbjct: 341 GKNMCGVATCAS 352
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 112/298 (37%), Positives = 163/298 (54%), Gaps = 21/298 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ + +K K+Y E + RF++FK NLR + T G+ +F+DLT E+R
Sbjct: 51 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSM 110
Query: 113 FLGLN---RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+LG +R + + + LP DWR GAV VKDQG+CGSCW+FS A
Sbjct: 111 YLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAA 170
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E
Sbjct: 171 VEGINKIVTGGLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDSE 222
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQT 287
+DYPY +DG ++ K+ + + + ++++ V + P++V I A Q
Sbjct: 223 EDYPYKASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQL 282
Query: 288 YIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Y G+ CG LDHGV VGYG+ YWI+KNSWG +WGE GY ++
Sbjct: 283 YQSGIFTGR-CGTALDHGVTAVGYGTENGV-------DYWIVKNSWGASWGEEGYIRM 332
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 127/321 (39%), Positives = 171/321 (53%), Gaps = 24/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H++L+KS +K Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 DEHWNLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + + L N L P DWRD G VT VKDQG CGSCW+
Sbjct: 86 NEEFRQLMNGYKHKAERKVKG--SLFLEPNFLEAPRSLDWRDKGYVTPVKDQGQCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG++V LSEQ LV+C PE + GCNGGLM+ AF+Y+
Sbjct: 144 FSATGALEGQQFRKTGKMVQLSEQNLVECSR---PE----GNEGCNGGLMDQAFQYVKDN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
G++ E+ YPY GTD C +D A + F + S E + + GP++V I+A
Sbjct: 197 QGLDSEESYPYLGTDDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAVTAVGPISVAIDA 256
Query: 283 --VWMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ P + LDHGVL+VGY GF K YWI+KNSW E WG+
Sbjct: 257 GHESFQFYQSGIYYEPECSSEELDHGVLLVGY---GFEGEDVDGKKYWIVKNSWSEKWGD 313
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 314 KGYVYMAKDRQNHCGIATAAS 334
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 130/335 (38%), Positives = 180/335 (53%), Gaps = 36/335 (10%)
Query: 42 SEDHLLNAEHHFSLFK---SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGV 98
SE+ L + + LF+ +K K YA+ EE +RF VFK NL+ + + G+
Sbjct: 136 SEEDLSSNDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVTSYWLGL 195
Query: 99 TKFSDLTPSEFRRQFLGLNRRLRLPADAQ------KAPILPTNDLPTDFDWRDHGAVTGV 152
+F+DLT EF+ +LGL PA A+ K + +DLP DWR GAVT V
Sbjct: 196 NEFADLTHEEFKATYLGLAP----PAPARESRGSFKYEDVSADDLPKSVDWRTKGAVTEV 251
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
K+QG CGSCW+FS A+EG + + TG L +LSEQ+L+DC S ++GCNGGL
Sbjct: 252 KNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDC--------SVDGNNGCNGGL 303
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLV 271
M+ AF YI +GG+ E+ YPY +G KS+ A +S + + + +Q +
Sbjct: 304 MDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKAL 363
Query: 272 KHGPLAVGINAV--WMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWI 328
H P++V I A Q Y GGV P CG LDHGV VGYGS + K Y I
Sbjct: 364 AHQPVSVAIEASGRHFQFYSGGVFDGP--CGTQLDHGVAAVGYGSD-----KGKGHDYII 416
Query: 329 IKNSWGENWGENGYYKI----CMGRNVCGVDSMVS 359
++NSWG WGE GY ++ G +CG++ M S
Sbjct: 417 VRNSWGAKWGEKGYIRMKRGTGKGEGLCGINKMAS 451
>gi|169659203|dbj|BAG12786.1| putative cysteine protease [Sorogena stoianovitchae]
Length = 293
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 174/310 (56%), Gaps = 34/310 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
+ +++KTY E+ +R +F ++R + + G+ +F+DLT EF +LG
Sbjct: 9 LEGEYNKTYGGAEDK-HRLALFAESVRIVETENAKGHSYTLGLNQFADLTTEEFSSLYLG 67
Query: 116 LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
L L A ++ +L D + DWR GAVT VKDQ +CGSCW+FSATGA+EGA
Sbjct: 68 L--VLENKVQASESVVLQDGDSEENVDWRQKGAVTPVKDQKSCGSCWAFSATGAMEGALV 125
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
STG+L++LSEQQLVDC +C+ GCNGGLM +AF+Y+L G EKDYPY
Sbjct: 126 KSTGKLINLSEQQLVDCVTKCN---------GCNGGLMTAAFDYVLGRGRAT-EKDYPYK 175
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVSC 294
G D G CK ++ + ++ + + + A PL+V +NA +Q Y GV
Sbjct: 176 GVD-GRCK--QTATDNKIKGYNNVPQN-NYKALKAAVASPLSVAVNAAGTIQRYKSGV-I 230
Query: 295 PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN---- 350
CG LDHGVL VGY + + YWI+KNSWG +GENGY+++ MG
Sbjct: 231 DANCGTRLDHGVLAVGY----------QGEDYWIVKNSWGNGYGENGYFRVKMGTQNGGA 280
Query: 351 -VCGVDSMVS 359
VCG++ M +
Sbjct: 281 GVCGINMMAA 290
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 127/321 (39%), Positives = 171/321 (53%), Gaps = 24/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+KS SK Y ++E +R V++ NL++ + L H G+ F D+T
Sbjct: 26 DEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKKIEMHNLEHSLGKHSYSLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + R + + L N++ P DWR+ G VT VKDQG CGSCW+
Sbjct: 86 NEEFRQVMNGYKLQQR---KFKGSLFLEPNNMEAPKQVDWREEGYVTPVKDQGQCGSCWA 142
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG F T +LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 143 FSTTGAMEGQMFRKTQKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 195
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
G++ E+ YPY GTD C + AA + F + S E + + GP++V I+A
Sbjct: 196 SGLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSGKEHALMKAIASVGPVSVAIDA 255
Query: 283 --VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C + LDHGVL VGY GF K YWI+KNSW E WG+
Sbjct: 256 GHESFQFYQSGIYYEKECSSEELDHGVLAVGY---GFEGEDVDGKKYWIVKNSWSEKWGD 312
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 313 KGYILMAKDRKNHCGIATAAS 333
>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
Length = 359
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 131/347 (37%), Positives = 182/347 (52%), Gaps = 33/347 (9%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNA----EH--HFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
D IR V E +L A H F+ F + K+Y + E RFR+F +
Sbjct: 26 DSNPIRPVTERAASAVESTVLGALGRTRHALRFARFAVRHGKSYGSAAEVQRRFRIFSES 85
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
L + + G+ +FSD+T EF+ LG + A + N LP
Sbjct: 86 LDEVRSTNRKGLSYKLGINRFSDMTWEEFQATKLGAAQTCSATL-AGNHLMRDANALPET 144
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
DWR+ G V+ VKDQ +CGSCW+FS TGALE A+ +TG+ +SLSEQQLVDC
Sbjct: 145 KDWRETGIVSPVKDQASCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDC-------- 196
Query: 201 SGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS---NF 256
+G+ ++ GCNGGL + AFEYI GG++ E+ YPY G + G CK+ A V+ N
Sbjct: 197 AGAYNNFGCNGGLPSQAFEYIKYNGGIDTEESYPYKGVN-GVCKYRPENAAVQVADSVNI 255
Query: 257 SVISSDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVSCPYICGKYLD---HGVLIVGYG 312
++ + DE + A LV+ P++V + + Y GV CG D H VL VGYG
Sbjct: 256 TLNAEDELKNAVGLVR--PVSVAFEVIDGFKQYKSGVYTSDHCGTTPDDVNHAVLAVGYG 313
Query: 313 SSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
PYW+IKNSWG +WGE+GY+K+ MG+N+C V + S
Sbjct: 314 VENGV-------PYWLIKNSWGADWGEDGYFKMEMGKNMCAVATCAS 353
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 127/301 (42%), Positives = 169/301 (56%), Gaps = 27/301 (8%)
Query: 74 FRVFKANL-----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN-RRLRLPAD-- 125
F VF+ NL + Q L + G+ F+ LT EF Q+LG + P
Sbjct: 52 FEVFQKNLDMIMKHNEEYNQGLQSYEM-GLNGFAHLTFEEFSAQYLGYGGAEVEQPKTRR 110
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A K +++P DWR+ GAV VK+QGACGSCW+FSA ALEGAHFL++GEL+SLS
Sbjct: 111 AGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGELISLS 170
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVEREKDYPYTGTDGGSCK 243
EQQLVDC + + GC GG M++AFEY + G + EKDYPY G D G CK
Sbjct: 171 EQQLVDCSKKFG-------NHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMD-GKCK 222
Query: 244 FDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINA-VWMQTYIGGV--SCPYICG 299
F + A +S ++ V +E + + GP++V I+A +Q Y+ GV C
Sbjct: 223 FSADGVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHAGAALQFYLRGVFNGVAGTCF 282
Query: 300 KYLDHGVLIVGYGSSGFAPIRFKEK-PYWIIKNSWGENWGENGYYKICMGRNVCGVDSMV 358
L+HGV VGYG+ A +RF K YWIIKNSWG WGE G+ + G+N+CGV +
Sbjct: 283 GPLNHGVTAVGYGT---ASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGA 339
Query: 359 S 359
S
Sbjct: 340 S 340
>gi|328866326|gb|EGG14711.1| hypothetical protein DFA_10969 [Dictyostelium fasciculatum]
Length = 369
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 132/346 (38%), Positives = 188/346 (54%), Gaps = 45/346 (13%)
Query: 46 LLNAEHHFSLFKS---KFSKTYATQEEHDY--RFRVFKANLRRAKRRQLLDPTAVHGV-- 98
L + E + + FK +F K Y E H++ RF +FK N+ K D + H +
Sbjct: 33 LFSHEQYTTEFKGWVGQFEKNY---ESHEFLNRFDIFKKNMDYIKTWN--DKSVDHKLEL 87
Query: 99 TKFSDLTPSEFRRQFLG--LNRRLRL---PADAQ-----KAPILPTNDLPTDFDWRDHGA 148
+DLT E++R +LG +N LR+ AD + K+ D P + DWR GA
Sbjct: 88 NTLADLTDKEYQRLYLGTKVNGALRVGLNHADERDFGHIKSVFSNVKDNP-NVDWRKQGA 146
Query: 149 VTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGC 208
V+ VK+QG CGSCWSFS+TGA+EGAH + TGE++SLSEQQLVDC ++GC
Sbjct: 147 VSHVKNQGQCGSCWSFSSTGAIEGAHAIKTGEMISLSEQQLVDCSKRYG-------NNGC 199
Query: 209 NGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMA 267
NGGLM AF+Y++ AGG+E E+ YPYT TD +C F+ + ++S+ I + +E +
Sbjct: 200 NGGLMTLAFDYVIDAGGLESEEAYPYTTTDTSACMFNSTNAVTSISDHQNIRAGNEKHLE 259
Query: 268 ANLVKHGPLAVGINAV--WMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSG--------- 315
L GP++V I+A + Y G+ P LDHGVL VG+G
Sbjct: 260 TVLRNVGPVSVAIDASPRSFRFYKSGIFYAPECSSSQLDHGVLAVGFGKGNPESNFENKV 319
Query: 316 -FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
F K Y+I+KNSWG +WG NG+ + R N CG+ +M +
Sbjct: 320 SFIHDDTKNNEYYIVKNSWGSDWGSNGFIYMSKNRKNNCGIATMAT 365
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 123/323 (38%), Positives = 172/323 (53%), Gaps = 31/323 (9%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEF 109
H + + K K Y E + RF++FK NLR + D + G+ KF+DLT E+
Sbjct: 46 HVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEY 105
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTN--------DLPTDFDWRDHGAVTGVKDQGACGSC 161
R FLG R R P + T+ +LP DWR+ GAVT +KDQG CGSC
Sbjct: 106 RAMFLGT--RTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQCGSC 163
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS GA+EG + + TG L SLSEQ+LVDCD + GCNGGLM+ AFE+I+
Sbjct: 164 WAFSTVGAVEGINQIVTGNLTSLSEQELVDCDR--------GYNMGCNGGLMDYAFEFIV 215
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ GG++ E+DYPY D K+ + + + +++++ V + P++V I
Sbjct: 216 QNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIE 275
Query: 282 AVWM--QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A M Q Y GV CG LDHGV+ VGYG+ YW+++NSWG WGE
Sbjct: 276 AGGMEFQLYQSGVFTGR-CGTNLDHGVVAVGYGTE-------NGTDYWLVRNSWGSAWGE 327
Query: 340 NGYYKICMGRNVCGVDSMVSSVA 362
NGY K + RNV ++ +A
Sbjct: 328 NGYIK--LERNVQNTETGKCGIA 348
>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
Length = 343
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 135/355 (38%), Positives = 188/355 (52%), Gaps = 34/355 (9%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+LL++ A+A D IR V SD E+ ++ S F +++ K Y T
Sbjct: 5 TLLIVFFCVATAAAGLSFHDSNPIRMV--SDMEEQLLQVIGE----SRFANRYGKRYDTV 58
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--NRRLRLPAD 125
+E RF++F NL+ K GV F+D T EFR LG N L +
Sbjct: 59 DEMKRRFKIFSENLQLIKSTNKKRLGYTLGVNHFADWTWEEFRSHRLGAAQNCSATLKGN 118
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ ++ LP + DWR G V+ VKDQG CGSCW+FS TGALE A+ + G+ +SLS
Sbjct: 119 HRITDVV----LPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNISLS 174
Query: 186 EQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
EQQLVDC +G+ ++ GCNGGL + AFEYI GG+E E+ YPYTG + G CKF
Sbjct: 175 EQQLVDC--------AGAYNNFGCNGGLPSQAFEYIKYNGGLETEEVYPYTGQN-GLCKF 225
Query: 245 DKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVSCPYICGKY- 301
+A V + ++ ED++ + P++V V + Y GV CG
Sbjct: 226 TSENVAVQVLGSVNITLGAEDELKHAVAFARPVSVAFQVVDDFRLYKKGVYTGTTCGSTP 285
Query: 302 --LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGV 354
++H VL VGYG PYW+IKNSWG WG++GY+K+ MG+N+CGV
Sbjct: 286 MDVNHAVLAVGYGIE-------DGVPYWLIKNSWGGEWGDHGYFKMEMGKNMCGV 333
>gi|1185457|gb|AAA87848.1| cathepsin L, partial [Schistosoma japonicum]
Length = 224
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 105/234 (44%), Positives = 147/234 (62%), Gaps = 19/234 (8%)
Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
P D+P +FDWR+ GAVT VK+QG CGSCW+FS TG +E F TG+L+SLSEQQL
Sbjct: 3 PRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQL 62
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCD S D GCNGGL ++A+E I++ GG+ E +YPY + C +
Sbjct: 63 VDCD---------SLDDGCNGGLPSNAYESIIRMGGLMLEDNYPYDAKN-EKCHLKVGNV 112
Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--ICGKY-LDHGV 306
AA +++ ++ DE ++A L H ++VG+NA+ +Q Y G+S P+ C KY LDH V
Sbjct: 113 AAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFCSKYLLDHAV 172
Query: 307 LIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
L+VGYG S K +P+WI+KNSWG WGE GY+++ G CG+++ +S
Sbjct: 173 LLVGYGVSE------KNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINTGATS 220
>gi|281211531|gb|EFA85693.1| cysteine protease [Polysphondylium pallidum PN500]
Length = 366
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 177/339 (52%), Gaps = 47/339 (13%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ + KF + Y+ E ++ FK+N+ + V + +D +P E+++
Sbjct: 27 FTDWTHKFQRLYSNNEFLK-KYHTFKSNMDYVHSWNAKNSDTVLELNHLADHSPEEYKKF 85
Query: 113 FLGLNRRLRLPADAQKAPI---LPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
+LG R + + Q I L T D DWR GAV+ +KDQG CGSCWSFS T
Sbjct: 86 YLGT-RVKHIHFNVQGTHINTQLSTVFEDSGATVDWRKKGAVSPIKDQGQCGSCWSFSTT 144
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G++EGAH + TG +V LSEQ LVDC S + GCNGGLMN+AF+YI+ G++
Sbjct: 145 GSVEGAHQIKTGNMVELSEQNLVDC-------SSAEGNMGCNGGLMNNAFDYIISNHGID 197
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINAVW-- 284
E+ YPYT G CKF+K+ + A +S++ I+ + AN VK GP++V I+A
Sbjct: 198 TEQSYPYTANTGSVCKFNKTNVGATISSYKSITPGSETDLANAVKTAGPVSVAIDASHRS 257
Query: 285 MQTYIGGVSCPYICGKY-LDHGVLIVGYGSS----------------------GFAPIRF 321
Q Y G+ ++C LDHGVL+VGYGS G ++
Sbjct: 258 FQLYSHGIYYEWLCSSTRLDHGVLVVGYGSGNPPNSDMDHMILKKTAKTDHYHGKKSLKV 317
Query: 322 KE------KPYWIIKNSWGENWGENGYYKICMGR-NVCG 353
++ K YWI+KNSW + WG+ GY + R N CG
Sbjct: 318 EKVDTTSSKNYWIVKNSWSDTWGDKGYIYMSKDRKNNCG 356
>gi|9634237|ref|NP_037776.1| ORF16 cathepsin [Spodoptera exigua MNPV]
gi|37077857|sp|Q9J8B9.1|CATV_NPVSE RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|6960476|gb|AAF33546.1|AF169823_16 ORF16 cathepsin [Spodoptera exigua MNPV]
Length = 337
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 179/326 (54%), Gaps = 35/326 (10%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSE 108
A +F F ++++K Y +++E YR+ +F+ N+ ++ + +AV+ + +F+D+ +E
Sbjct: 36 APLYFEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMPKNE 95
Query: 109 FRRQF-------LGLN--RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
+ LGLN + + AQ+ P FDWR +T VKDQG CG
Sbjct: 96 IVIRHTGLASGELGLNFCETIVVDGPAQRQR-------PVSFDWRSMNKITSVKDQGMCG 148
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
+CW F++ GALE + + L+ LSEQQLVDCD D GC+GGL+++A+E
Sbjct: 149 ACWRFASLGALESQYAIKYDRLIDLSEQQLVDCDF---------VDMGCDGGLIHTAYEQ 199
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAV 278
I+K GGVE+E DY Y + C K A V N + + +E+++ L GP+A+
Sbjct: 200 IMKMGGVEQEFDYSYKA-ERQPCALKPHKFATGVRNCYRYVILNEERLEDLLRYVGPIAI 258
Query: 279 GINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
++AV + Y GG+ + L+H VL+VGYG PYWIIKNSWG ++G
Sbjct: 259 AVDAVDLTDYYGGI-VSFCENNGLNHAVLLVGYGVEN-------NVPYWIIKNSWGSDYG 310
Query: 339 ENGYYKICMGRNVCGVDSMVSSVAAI 364
E+GY ++ G N CG+ + ++S A +
Sbjct: 311 EDGYVRVRRGVNSCGMINELASSAQV 336
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 127/324 (39%), Positives = 174/324 (53%), Gaps = 31/324 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
++ +K + K Y ++ E R +++ N + AK Q + V K++DL E
Sbjct: 27 WNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLHEE 86
Query: 109 FRRQFLGLNR-RLRLPA------DAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGS 160
F + G NR + P D I P N ++P DWR+ GAVT VKDQG CGS
Sbjct: 87 FVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHCGS 146
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CWSFSATGALEG HF TG+LVSLSEQ LVDC + ++GCNGG+M+ AF+YI
Sbjct: 147 CWSFSATGALEGQHFRKTGKLVSLSEQNLVDCS-------TKYGNNGCNGGMMDFAFQYI 199
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVG 279
GG++ EK YPY D +C ++ + A F + DE + + GP++V
Sbjct: 200 KDNGGIDTEKAYPYEAID-DTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVA 258
Query: 280 INAVW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
I+A Q Y GV C + LDHGVL VGYG+S + + YW++KNSWG
Sbjct: 259 IDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSE------EGEDYWLVKNSWGTT 312
Query: 337 WGENGYYKICMGR-NVCGVDSMVS 359
WG+ GY K+ R N CG+ + S
Sbjct: 313 WGDQGYVKMARNRDNHCGIATAAS 336
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 118/302 (39%), Positives = 165/302 (54%), Gaps = 34/302 (11%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
+ D RF +FK NLR + A + G+T F++LT E+R +LG RR+
Sbjct: 24 QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83
Query: 123 PADA--QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ + + + +++P DWR GAV +KDQG CGSCW+FS A+EG + + TGE
Sbjct: 84 AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQ+LVDCD S + GCNGGLM+ AF++I+K GG+ EKDYPY GT+G
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGK 195
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYIC 298
K+ + + + S ++ V + P++V I+A Q Y G+ C
Sbjct: 196 CNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGK-C 254
Query: 299 GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV------C 352
G +DH V+ VGYGS YWI++NSWG WGE+GY I M RNV C
Sbjct: 255 GTNMDHAVVAVGYGSENGV-------DYWIVRNSWGTRWGEDGY--IRMERNVASKSGKC 305
Query: 353 GV 354
G+
Sbjct: 306 GI 307
>gi|146078033|ref|XP_001463431.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|134067516|emb|CAM65796.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 381
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 166/311 (53%), Gaps = 43/311 (13%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E + LVSLSEQQLV CD + D+GCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYG 206
Query: 225 GVEREKDYPYTGTDGGSCK-FDKSKI--AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
V EK YPYT +G + + SK+ A + + +I S+E MAA L ++GP+A+ ++
Sbjct: 207 IVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVD 266
Query: 282 AVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
A +Y GVL+VGY +G PYW+IKNSWGE+WGE G
Sbjct: 267 ASSFMSY--------------QSGVLLVGYNKTGGV-------PYWVIKNSWGEDWGEKG 305
Query: 342 YYKICMGRNVC 352
Y ++ MG N C
Sbjct: 306 YVRVAMGLNAC 316
>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 370
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 128/326 (39%), Positives = 169/326 (51%), Gaps = 38/326 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR----QLLDPTAVHGVTKFSDLTPSE 108
+ FK +F + Y E RF +F AN + Q T GV +F+D T E
Sbjct: 60 WKFFKIQFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVNEFTDKTDYE 119
Query: 109 FRRQFLGLNRRLRLPADA--QKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWS 163
++ R ++ + A K ++ LP+ DWR GAVT VK+QG CGSCW+
Sbjct: 120 LKKL-----RGYKVTSGAIRHKGSTFIRSEHTKLPSKVDWRREGAVTDVKNQGQCGSCWA 174
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG H+ T LV+LSEQQLVDC ++GC+GGLMNSAFEY+
Sbjct: 175 FSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYG-------NNGCSGGLMNSAFEYVRDN 227
Query: 224 GGVEREKDYPYT---GTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVG 279
G++ E YPY GT+ C F+ S I A V+ + ++ DE + + GP++V
Sbjct: 228 EGIDSEISYPYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGPVSVA 287
Query: 280 INAVW--MQTYIGGVSCPYICG---KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
INA Y G+ C LDHGVL+VGYG + YW+IKNSWG
Sbjct: 288 INAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEE-------NGRSYWLIKNSWG 340
Query: 335 ENWGENGYYKICMG-RNVCGVDSMVS 359
E WGE GY KI G N+CGV S S
Sbjct: 341 EEWGEKGYIKISKGSHNMCGVASAAS 366
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 121/292 (41%), Positives = 164/292 (56%), Gaps = 30/292 (10%)
Query: 74 FRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILP 133
FR ANLR + + + G+T+F+DLT +EF +R + + +
Sbjct: 48 FRCHLANLRVIEAHNAGNSSFTMGITQFADLTAAEFS----AYVKRFPMNVTRPRNEVWI 103
Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
T + DWR AVT +K+QG CGSCWSFS TG++EGAH ++TG+LVSLSEQQL+DC
Sbjct: 104 TEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCS 163
Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
+ + GCNGGLM+ AFEY++ GG++ E+DYPYT DG + K AA +
Sbjct: 164 -------TRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEI 216
Query: 254 SNF-SVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVG 310
F +V EDQ+AA V GP++V I A Q Y GV CG LDHGVL+VG
Sbjct: 217 HGFRNVPKEHEDQLAA-AVSIGPVSVAIEADQAGFQHYTSGVF-DGKCGTSLDHGVLVVG 274
Query: 311 YGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG---RNVCGVDSMVS 359
Y YWI+KNSWG++WGE GY ++ G + +CG+ S
Sbjct: 275 YSDD-----------YWIVKNSWGKSWGEEGYIRLKRGVDKKGMCGITMQAS 315
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 129/328 (39%), Positives = 179/328 (54%), Gaps = 32/328 (9%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVT 99
D L+++ H +K++ +TYA E+ +R ++ NL+ + L H G+
Sbjct: 22 DQTLDSQWH--QWKAQHRRTYAANED-GWRRATWEKNLKMIEMHNLEYSAGKHSFQLGMN 78
Query: 100 KFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
KF D+T EF++ G N + R + P+L LP DWR+ G VT VK+QG
Sbjct: 79 KFGDMTTEEFKQVMNGYNSNGSQKRTKGSLYREPLLA--QLPKSVDWREKGYVTPVKNQG 136
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FSATG+LEG F T +LVSLSEQ LVDC + ++GC+GGLM++A
Sbjct: 137 QCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCS-------TSEGNNGCSGGLMDNA 189
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGP 275
FEY+ GG++ E+ YPY G D CK+ A V+ F I S +E + + GP
Sbjct: 190 FEYVKNNGGIDTEQAYPYLGQD-NECKYRAECSGANVTGFVDIPSMNERALMKAVANVGP 248
Query: 276 LAVGINA--VWMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNS 332
++V I+A Q Y GV P LDHGVL+VGYGS G + YWI+KNS
Sbjct: 249 ISVAIDAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIG-------KDEYWIVKNS 301
Query: 333 WGENWGENGYYKICMGRNV-CGVDSMVS 359
WGE WG+ GY + RN CG+ + S
Sbjct: 302 WGEEWGKKGYVLMAKFRNNHCGIATAAS 329
>gi|323457344|gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens]
Length = 346
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 176/331 (53%), Gaps = 37/331 (11%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR---RQLLDPTAVHGVTKFSDLTP 106
E F LFKS + K+Y + E RF +F ANLR+ + +++ + A GVT+F DLT
Sbjct: 17 ESLFELFKSDYVKSYNSTEAEAERFTIFSANLRKTEALNAQRVDEDDAEFGVTQFMDLTE 76
Query: 107 SEFRRQFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWR--DHGAVTGVKDQGACGSCWS 163
+EF+ Q+L + L D AP P DWR G V+ VKDQG CGSCW+
Sbjct: 77 AEFKAQYLNYVPSEQVLAEDVYAAP--EGFAAPGSLDWRTKQSGVVSDVKDQGQCGSCWA 134
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSAT +E L+ + + + QQ+V CD D GCNGG +A+ Y+ KA
Sbjct: 135 FSATEQIESEWVLAGNDPLVFAPQQIVSCDK---------VDQGCNGGNTETAYAYVEKA 185
Query: 224 GGVEREKDYPY-TGTDGGSCKFDKSKIAAA-VSNFSVI----------SSDEDQMAANLV 271
GG+ E YPY +GT G + + K + A V +FS + DED+MAA L
Sbjct: 186 GGMALESAYPYKSGTSGNTGRCKKFETAGGDVESFSYVVPECKKGKCNDQDEDKMAAALA 245
Query: 272 KHGPLAVGINAVWMQTYIGGVSCPYICGKY----LDHGVLIVGY-GSSGFAPI---RFKE 323
HGP ++ +NA QTY GV CG + LDH V +VGY G +G A K+
Sbjct: 246 SHGPASICVNAGAWQTYTKGVMTNLQCGSHAANALDHCVQVVGYTGYTGDAKACGKGLKD 305
Query: 324 KPYWIIKNSWGENWGENGYYKICMGRNVCGV 354
K W ++NSWG +WG GY ++ MG+N CG+
Sbjct: 306 KCVWNVRNSWGTSWGYQGYIRVQMGKNACGI 336
>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 336
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/319 (38%), Positives = 167/319 (52%), Gaps = 33/319 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
HF+ F +K+ K Y T EE +RF F +++ + + V +F+D+T EFR
Sbjct: 28 HFAGFAAKYKKEYKTVEELKHRFVTFLESVKLVETHNKGQHSYSLAVNEFADMTFEEFRD 87
Query: 112 QFLGLNRRLRLPADAQKAP------ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
RL Q +L LP DWR+ G V+ VK+Q +CGSCW+FS
Sbjct: 88 S--------RLMKGEQNCSATVGNHVLTGESLPKTKDWREEGIVSQVKNQASCGSCWTFS 139
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALE AH +TG++V LSEQQLVDC E + + GC GGL + AFEYI GG
Sbjct: 140 TTGALEAAHAQATGKMVLLSEQQLVDCAGEFN-------NFGCGGGLPSQAFEYIRYNGG 192
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAVW 284
++ E YPY D C+F K+ I A V + ++ E Q+ + P++V V
Sbjct: 193 IDTEDSYPYNAKD-SQCRFHKNTIGAQVWDVVNITEGAETQLKHAIATMRPVSVAFEVVH 251
Query: 285 -MQTYIGGVSCPYICG---KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
+ Y GGV C + ++H VL VGYG PYWIIKNSWG +WG N
Sbjct: 252 DFRLYNGGVYTSLNCHTGPQTVNHAVLAVGYGEDENGV------PYWIIKNSWGADWGMN 305
Query: 341 GYYKICMGRNVCGVDSMVS 359
GY+ + MG+N+CGV + S
Sbjct: 306 GYFNMEMGKNMCGVATCAS 324
>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
Length = 336
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 171/319 (53%), Gaps = 25/319 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +K +K Y +EE +R V++ NL++ + L H + F D+
Sbjct: 28 HWQQWKEWHNKDYHEKEE-GWRRMVWEKNLKKIELHNLEHSLGKHSYRLAMNHFGDMPHE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G ++R + + + N L P+ DWR+ G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQVMNGYKHKVR---KIRGSLFMEPNFLEAPSKLDWREKGYVTPVKDQGQCGSCWAFS 143
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI GG
Sbjct: 144 TTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNGG 196
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVW 284
++ EK YPY GTD C +D S AA + F + S E + + GP++V I+A
Sbjct: 197 LDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIPSGKEHALMKAVTAVGPVSVAIDAGH 256
Query: 285 --MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ C + LDHGVL+VGYG G K YWI+KNSW E WG G
Sbjct: 257 ESFQFYQSGIYYEADCSSEDLDHGVLVVGYGYEG---ENVDGKKYWIVKNSWSEQWGNKG 313
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y + R N CG+ + S
Sbjct: 314 YIYMAKDRHNHCGIATAAS 332
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/326 (37%), Positives = 176/326 (53%), Gaps = 41/326 (12%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
+ L+ ++ + Y E D RFRVF NLR A + + G+ +F+DLT EFR
Sbjct: 109 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 168
Query: 111 RQFLGLNRRLRLPADAQKAPILP--------TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+LG R+PA ++ + +LP DWR+ GAV VK+QG CGSCW
Sbjct: 169 AAYLGA----RIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 224
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA ++E + + TGE+V+LSEQ+LV+C + +SGCNGGLM++AF++I+K
Sbjct: 225 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFIIK 277
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
GG++ E DYPY D G C ++ ++ F + ++++ V H P++V I
Sbjct: 278 NGGIDTEGDYPYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIE 336
Query: 282 A--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A Q Y GV C LDHGV+ VGYG+ K YWI++NSWG WGE
Sbjct: 337 AGGREFQLYKAGVF-TGTCTTNLDHGVVAVGYGTE-------NGKDYWIVRNSWGAKWGE 388
Query: 340 NGYYKICMGRNV------CGVDSMVS 359
+GY + M RNV CG+ M S
Sbjct: 389 DGYIR--MERNVNATTGKCGIAMMAS 412
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 125/366 (34%), Positives = 195/366 (53%), Gaps = 34/366 (9%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R + ++L LL+ VL++ + D A S G + E F ++ SK K
Sbjct: 5 RPVCMTILFLLIVFVLSAPSSAMDLPAT------SGGHNRSNE--EVEFIFQMWMSKHGK 56
Query: 63 TYATQ-EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR-RL 120
TY E + RF+ FK NLR + + + G+T+F+DLT E+R F G + +
Sbjct: 57 TYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQ 116
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
R +++ L + LP DWR GAV+ +KDQG C SCW+FS A+EG + + TGE
Sbjct: 117 RNLKTSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGE 176
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNG-GLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L+SLSEQ+LVDC+ ++GC G GLM++AF++++ G++ EKDYPY GT G
Sbjct: 177 LISLSEQELVDCNL---------VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQG 227
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY--I 297
+ + + ++ + ++++ V H P++VG++ Q ++ SC Y
Sbjct: 228 SCNRKQVHLLVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKK-SQEFMLYRSCIYNGP 286
Query: 298 CGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG----RNVCG 353
CG LDH ++IVGYGS + YWI++NSWG WG+ GY KI + +CG
Sbjct: 287 CGTNLDHALVIVGYGSE-------NGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCG 339
Query: 354 VDSMVS 359
+ + S
Sbjct: 340 IAMLAS 345
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 120/321 (37%), Positives = 171/321 (53%), Gaps = 32/321 (9%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ + + + ++ +TY E + RF VF+ NLR +
Sbjct: 27 IVSYGERSEEEV---RRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAG 83
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTNDLPTDFDWRDH 146
+H G+ +F+DLT E+R +LG+ R RL Q A +LP DWR+
Sbjct: 84 LHSFRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSGRYQAAD---NEELPESVDWREK 140
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG CGSCW+FSA A+EG + + TG++++LSEQ+LVDCD S +
Sbjct: 141 GAVAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDT--------SYNQ 192
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ E+DYPY D K+ + + + + +
Sbjct: 193 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELS 252
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
V + P++V I A Q Y G+ CG LDHGV VGYGS K
Sbjct: 253 LKKAVANQPISVAIEAGGRAFQLYKSGIFTGR-CGTALDHGVTAVGYGSE-------NGK 304
Query: 325 PYWIIKNSWGENWGENGYYKI 345
YWI+KNSWG WGE+GY ++
Sbjct: 305 DYWIVKNSWGTVWGEDGYVRL 325
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 170/319 (53%), Gaps = 23/319 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +KS K+Y +EE +R V++ +LR + L H G+ F D+
Sbjct: 28 HWEQWKSWHGKSYEQKEE-TWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + Q + L N ++P DWRD G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQLMNGYKYK-QTHKKLQGSHFLEPNFQEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF TG+LVSLSEQ LV+C PE + GCNGGLM+ AF+Y+ GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVEC---SKPE----GNEGCNGGLMDQAFQYVKDNGG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA-- 282
++ E YPY GTD C ++ AA + F + S E + + GP++V I+A
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGH 258
Query: 283 VWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ C LDHGVL+VGY G K YWI+KNSW E WG+NG
Sbjct: 259 TSFQFYQSGIYFEAECSSTDLDHGVLVVGY---GVEKRDTDGKKYWIVKNSWSEKWGQNG 315
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y + + N CG+ + S
Sbjct: 316 YILMAKDKDNHCGIATAAS 334
>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
Length = 363
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/325 (38%), Positives = 173/325 (53%), Gaps = 44/325 (13%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ F ++ K+Y + E RFR+F +L+ + + G+ +FSD++ EFR
Sbjct: 61 RFARFAVRYGKSYESAAEVQKRFRIFSESLQLVRSTNRKGLSYRLGINRFSDMSWEEFRA 120
Query: 112 QFLGL----------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
LG N R+R A A LP DWR+ G V+ VK+QG CGSC
Sbjct: 121 TRLGAAQNCSATLAGNHRMRAAAVA----------LPKTKDWREDGIVSPVKNQGHCGSC 170
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TGALE A+ +TG+ +SLSEQQLVDC + + GCNGGL + AFEYI
Sbjct: 171 WTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPFN-------NFGCNGGLPSQAFEYIK 223
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAV 278
GG++ E+ YPY G + G C F + V N ++ + DE + A LV+ P++V
Sbjct: 224 YNGGLDTEESYPYKGVN-GICDFKAENVGVKVLDSVNITLGAEDELKDAVALVR--PVSV 280
Query: 279 GINAV-WMQTYIGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
V + Y GV CG ++H VL VGYG PYW+IKNSWG
Sbjct: 281 AFQVVNGFRQYKSGVYTSDSCGNTPMDVNHAVLAVGYGVENGV-------PYWLIKNSWG 333
Query: 335 ENWGENGYYKICMGRNVCGVDSMVS 359
+WG+ GY+K+ MG+N+CGV + S
Sbjct: 334 ADWGDKGYFKMEMGKNMCGVATCAS 358
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/321 (38%), Positives = 167/321 (52%), Gaps = 25/321 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKF 101
E H L + +K K Y +E RF++FK+N+ + + + + G+ KF
Sbjct: 29 ELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKSYMLGINKF 88
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
+DLT EFR + G R L LP+ DWR GAVT +KDQG CGSC
Sbjct: 89 ADLTNEEFRAFWNGYKRPLGASRKITPFKYENVTALPSSIDWRSKGAVTPIKDQGVCGSC 148
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FSA A EG H L TG+LVSLSEQ+LVDCD + D GC GGLM AF++I
Sbjct: 149 WAFSAVAATEGIHKLRTGKLVSLSEQELVDCDVKGQ-------DKGCQGGLMVDAFKFIK 201
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
+ GG+ E +YPY G DG ++ A ++ + + + + V + P++V I+
Sbjct: 202 RHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAVPKNSEAALLKAVANQPVSVAID 261
Query: 282 A--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A + Q Y G+ ICGK ++HGV VGYG S YWI+KNSWG WGE
Sbjct: 262 AGSLSFQFYRSGIFTG-ICGKDINHGVAAVGYGRSNSGS------KYWIVKNSWGTEWGE 314
Query: 340 NGYYKICMGRNV------CGV 354
GY I M R+V CG+
Sbjct: 315 KGY--IRMKRDVRSKEGLCGI 333
>gi|358339355|dbj|GAA47435.1| cathepsin F [Clonorchis sinensis]
Length = 1157
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 113/276 (40%), Positives = 160/276 (57%), Gaps = 21/276 (7%)
Query: 80 NLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
N+++A+ Q L+ TA++GVT+FSDLT EF+ FLGL + + +P
Sbjct: 654 NIKQAEFYQTLERGTALYGVTQFSDLTGEEFQETFLGLRLDEQYSKSQSYVKKKHSVSIP 713
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
++DWR +GAV V DQG CGSCW+FS G +EG F TG+LVSLS+QQLVDCD
Sbjct: 714 ENYDWRPYGAVGPVLDQGHCGSCWAFSVIGNIEGQWFRKTGQLVSLSKQQLVDCDRS--- 770
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
GC GG + ++ I + GG+E E DY YTG D G C + K A V++
Sbjct: 771 ------SRGCGGGYPPATYDSIRRIGGLEIELDYRYTGRD-GVCHQNPRKFVAYVNSSVA 823
Query: 259 ISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCP---YICGKYLDHGVLIVGYGSSG 315
++ DE+ +A L HGP+++ +NA +Q Y+ G+ P Y K + H VL VG+G+ G
Sbjct: 824 LTKDENTIAEWLSYHGPISMALNARLLQFYVSGIMHPPAAYCPVKDISHAVLSVGFGTKG 883
Query: 316 FAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
P+WI+KNSWG WGE GY++I G ++
Sbjct: 884 -------NVPFWIVKNSWGTLWGEEGYFRIYRGDDM 912
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/206 (43%), Positives = 116/206 (56%), Gaps = 21/206 (10%)
Query: 138 PTD-FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC 196
P D FDWRD+GAV V DQ CG+ W+FSA G +EG +F+ L+SLSEQQLVDCD
Sbjct: 463 PQDSFDWRDYGAVGPVLDQDRCGASWAFSAIGNIEGQYFMRVHRLLSLSEQQLVDCDR-- 520
Query: 197 DPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF 256
D GC GG AFE I + GG+E E DYPY G +C+ + + +++
Sbjct: 521 -------IDQGCAGGTPYGAFEGIQQLGGLELEADYPYLGHQ-DNCQSNPLRFVVSINGS 572
Query: 257 SVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYI--CGKY-LDHGVLIVGYGS 313
+ DEDQ+A L HGPL+VGIN +Q Y G+ P C ++H L VG+G
Sbjct: 573 VQLPKDEDQIAQYLFDHGPLSVGINGALLQYYSSGIMQPLWDNCNPAEMNHAGLAVGFGF 632
Query: 314 SGFAPIRFKEKPYWIIKNSWGENWGE 339
++ PYW IKNSWG WGE
Sbjct: 633 E-------QDVPYWTIKNSWGMLWGE 651
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 117/209 (55%), Gaps = 15/209 (7%)
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL-RLPADAQKAPILPTNDLPTD 140
R + RQL + ++ + + +E FL L R R P+ A + ++P
Sbjct: 947 RELRERQLYEEFKLN----YGKVYENEGMFYFLYLGARFDREPSRAGSMVVDDLGEIPER 1002
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
FDWR+ GAV ++DQG CGSCW+FS G +EG F TG+L++LSEQQL+DCD
Sbjct: 1003 FDWRELGAVGPIQDQGDCGSCWAFSTIGNIEGQWFKKTGQLLTLSEQQLIDCD------- 1055
Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
S D GC GG + I+K GG+E DYPY D G CK ++SK A V+ V+
Sbjct: 1056 --SVDDGCGGGYPPDTYGDIVKMGGLELNADYPYIAAD-GVCKMERSKFRAYVNKSLVLP 1112
Query: 261 SDEDQMAANLVKHGPLAVGINAVWMQTYI 289
+ EDQ A L K+GPL+ GINA ++Q I
Sbjct: 1113 TKEDQQAVWLSKNGPLSAGINADYLQVVI 1141
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 117/242 (48%), Gaps = 44/242 (18%)
Query: 108 EFRRQFLGLNR-RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EFRR +L P D + + LP+ FDWR++GAV V++QG CGSCW+ SA
Sbjct: 190 EFRRLYLTYKSPDEHEPID--RIHVQEVGQLPSYFDWREYGAVGPVRNQGQCGSCWAISA 247
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
++VDCDH D GC+GG A+E + + GG+
Sbjct: 248 ---------------------EVVDCDH---------ADHGCSGGFPIHAYECVQRLGGL 277
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQ 286
E YPY G C+ D A ++ + D +Q+A L GPL+V ++A +Q
Sbjct: 278 ELAVRYPYVGYQ-QYCQADPRYFVAYINGSVALPKDSEQIAKFLATFGPLSVVLDARLLQ 336
Query: 287 TYIGGVSCP---YICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYY 343
Y G+ P Y + L+H VL VG+G+ + PYWIIKNSWGE WGE
Sbjct: 337 YYRSGILNPSVAYCNPEELNHAVLSVGFGTE-------QGIPYWIIKNSWGEQWGEQHLT 389
Query: 344 KI 345
K+
Sbjct: 390 KL 391
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 86/157 (54%), Gaps = 20/157 (12%)
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
QQLVDCDH D GC GG AF + + GG++ DYPY + +C+F+
Sbjct: 23 QQLVDCDH---------VDRGCEGGFPLDAFMAVQRLGGLQLSIDYPYIASRQ-ACQFNP 72
Query: 247 SKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGV---SCPYICGKYLD 303
+ A V+ F+ + +E +A L ++GPL+VG+N+ ++ Y G+ + + L+
Sbjct: 73 KQAVAFVTGFAALPRNELLIAEYLHRNGPLSVGLNSRTLKFYNSGILNLAAEQCDPEALN 132
Query: 304 HGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
H L VG+G+ + P+WIIKN++G++WGE
Sbjct: 133 HAALAVGFGTD-------ESTPFWIIKNTFGKDWGEQ 162
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 129/337 (38%), Positives = 184/337 (54%), Gaps = 40/337 (11%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFS---KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV 95
G SED L + + LF+S S K Y + EE +RF +FK NL+ R +
Sbjct: 32 GYSSED-LKSMDKLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSNYW 90
Query: 96 HGVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTG 151
G+ +F+DL+ EF+ ++LGL +RR P + + +LP DWR GAVT
Sbjct: 91 LGLNEFADLSHQEFKNKYLGLKVDYSRRRESPEEFTYKDV----ELPKSVDWRKKGAVTQ 146
Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
VK+QG+CGSCW+FS A+EG + + TG L SLSEQ+L+DCD + ++GCNGG
Sbjct: 147 VKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDR--------TYNNGCNGG 198
Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANL 270
LM+ AF +I++ G+ +E+DYPY + G+C+ K + +S + + + +Q
Sbjct: 199 LMDYAFSFIVENDGLHKEEDYPYI-MEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKA 257
Query: 271 VKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWI 328
+ + PL+V I A Q Y GGV + CG LDHGV VGYG++ K Y
Sbjct: 258 LANQPLSVAIEASGRDFQFYSGGVFDGH-CGSDLDHGVAAVGYGTA-------KGVDYIT 309
Query: 329 IKNSWGENWGENGYYKICMGRN------VCGVDSMVS 359
+KNSWG WGE GY I M RN +CG+ M S
Sbjct: 310 VKNSWGSKWGEKGY--IRMRRNIGKPEGICGIYKMAS 344
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 194/354 (54%), Gaps = 38/354 (10%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTY 64
SLLL+L+ S L+SA D ++I +++ H + + +L++S + K+Y
Sbjct: 11 SLLLMLIFSTLSSA----SDMSIISY------DETHIHHRSDDEVSALYESWLIEHGKSY 60
Query: 65 ATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RL 120
E D RF++FK NL+ ++ + + + G+TKF+DLT E+R +LG R
Sbjct: 61 NALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRR 120
Query: 121 RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+L + + D LP DWRD G + GVKDQG+CGSCW+FSA A+E + + TG
Sbjct: 121 KLSKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTG 180
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L+SLSEQ+LVDCD S + GC+GGLM+ AFE+++ GG++ E+DYPY +
Sbjct: 181 NLISLSEQELVDCDK--------SYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERND 232
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYI 297
++ K+ + ++ + + ++ V H P+++ I A +Q Y G+
Sbjct: 233 VCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGK- 291
Query: 298 CGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG +DHGV+ GYGS YWI++NSWG WGE GY ++ RNV
Sbjct: 292 CGTAVDHGVVAAGYGSE-------NGMDYWIVRNSWGAKWGEKGYLRV--QRNV 336
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 171/321 (53%), Gaps = 27/321 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +KS K+Y +EE +R V++ +LR + L H G+ F D+
Sbjct: 28 HWEQWKSWHGKSYEQKEE-TWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + Q + L N ++P DWRD G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQLMNGYKYK-QTHKKLQGSHFLEPNFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF TG+LVSLSEQ LV+C PE + GCNGGLM+ AF+Y+ GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVEC---SKPE----GNEGCNGGLMDQAFQYVKDNGG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA-- 282
++ E YPY GTD C ++ AA + F + S E + + GP++V I+A
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGH 258
Query: 283 VWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ C LDHGVL+VGY G K YWI+KNSW E WG+NG
Sbjct: 259 TSFQFYQSGIYFEAECSSTDLDHGVLVVGY---GVEKRDTDGKKYWIVKNSWSEKWGQNG 315
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M + N CG+ + S
Sbjct: 316 Y--ILMAKDKDNHCGIATAAS 334
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 128/327 (39%), Positives = 170/327 (51%), Gaps = 34/327 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
++ FK + K Y ++ E R +++ N + AK Q D V K++DL E
Sbjct: 28 WTAFKLQHRKKYDSETEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEE 87
Query: 109 FRRQFLGLNRRLR----------LPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGA 157
F G NR + P + I P N D+PT DWR GAVT VKDQG
Sbjct: 88 FVHTLNGFNRSVSGKGQLLRGELKPIEEPVTWIEPANVDVPTAMDWRTKGAVTQVKDQGH 147
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCWSFSATGALEG HF TG+LVSLSEQ LVDC + ++GCNGG+M+ AF
Sbjct: 148 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYG-------NNGCNGGMMDFAF 200
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
+YI G++ EK YPY D C ++ + A F + +E + L GP+
Sbjct: 201 QYIKDNKGIDTEKSYPYEAID-DECHYNPKAVGATDKGFVDIPQGNEKALMKALATVGPV 259
Query: 277 AVGINAVW--MQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSW 333
+V I+A Q Y GV C + LDHGVL VGYG++ + YW++KNSW
Sbjct: 260 SVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDG------EDYWLVKNSW 313
Query: 334 GENWGENGYYKICMGR-NVCGVDSMVS 359
G WG+ GY K+ R N CG+ + S
Sbjct: 314 GTTWGDQGYVKMARNRDNHCGIATTAS 340
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/335 (37%), Positives = 181/335 (54%), Gaps = 38/335 (11%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEE-HDYRFRVFKANLRRAKRRQLLDPTAVH-GVT 99
S D L+ E ++ + +KF K A+ D RF FK N R + + G+
Sbjct: 4 SSDSDLSGE--YASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLN 61
Query: 100 KFSDLTPSEFRRQFLGLNRRL------RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
+FSDLT EFR++FLGL L ++P D+ DLP DWR HGAVT K
Sbjct: 62 QFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPK 121
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
DQG+CG CW+F+ TGA+EG + + TG+L+SLSEQ+L+DCD + D GC+GGLM
Sbjct: 122 DQGSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKK--------ADKGCDGGLM 173
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVK 272
+A+++I++ GG++ E DYPY ++ C K + A+ + I ++Q V
Sbjct: 174 ENAYQFIVENGGLDTETDYPYHASE-SHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVA 232
Query: 273 HGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIK 330
P++V I Q Y GV + CG+ ++HGVLIVGYG+ YWI+K
Sbjct: 233 KQPVSVAIEGASKDFQHYASGVFTGH-CGEEINHGVLIVGYGTE-------DGLDYWIVK 284
Query: 331 NSWGENWGENGYYKICMGRN------VCGVDSMVS 359
NSW WG+ G+ K M RN +C ++++ S
Sbjct: 285 NSWAATWGDGGFVK--MQRNTGKRGGLCSINTLAS 317
>gi|328869030|gb|EGG17408.1| cysteine protease [Dictyostelium fasciculatum]
Length = 379
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 131/342 (38%), Positives = 186/342 (54%), Gaps = 52/342 (15%)
Query: 59 KFSKTYATQEEHDY--RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG- 115
+F K+Y E D+ RF VFK N+ V + +F+D+T E+RR +LG
Sbjct: 45 RFEKSY---ESFDFLQRFAVFKTNMDYVHEWNSKKLPTVLELNQFADITNQEYRRLYLGT 101
Query: 116 -LNRR--LRLPADAQKA----PILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFS 165
+N R L P + + + +D + DWR GAV+ +K+QG CGSCWSFS
Sbjct: 102 RINARHLLGTPGTHEMSNNFGKVFGDDDSDSSGATVDWRAKGAVSPIKNQGQCGSCWSFS 161
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAG 224
TG++EGAH++STG++V LSEQ LVDC SGS + GC GGLMN AF+YI+K
Sbjct: 162 TTGSVEGAHYISTGKMVPLSEQNLVDC--------SGSEGNMGCQGGLMNLAFDYIIKNE 213
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINAV 283
G++ E YPY+ G C F+K+ + A +S++ I+S ++ A+ VK+ GP++V I+A
Sbjct: 214 GIDTEDSYPYSAETGKKCLFNKTNVGATISSYKNITSGDESNLADAVKNAGPVSVAIDAS 273
Query: 284 W--MQTYIGGVSCPYICGKY-LDHGVLIVGYGS-------------SGFAPIRFKEK--- 324
Q Y G+ C LDHGVL+VGYGS SG + F +
Sbjct: 274 HNSFQLYSHGIYYEKDCSSVNLDHGVLVVGYGSGDPSSLANNVGGRSGPKMVVFNNRMVK 333
Query: 325 ------PYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
YWI+KNSWG WG +G+ + M R N CG+ + S
Sbjct: 334 TPSSNGDYWIVKNSWGSTWGSHGFIFMSMNRDNNCGIATSAS 375
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 169/314 (53%), Gaps = 28/314 (8%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
+D+ L + + +K + YA +E + R+ VFK N+ R +R + T V +
Sbjct: 29 DDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQ 88
Query: 101 FSDLTPSEFRRQFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
F+DLT EFR + G L+ + + + + + LP DWR GAVT +K+
Sbjct: 89 FADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKN 148
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG CG CW+FSA A+EGA + G+L+SLSEQQLVDCD D GC+GGLM+
Sbjct: 149 QGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDCDTN---------DFGCSGGLMD 199
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
+AFE+I+ GG+ E +YPY G D +CK +K A +++ + + ++++ V H
Sbjct: 200 TAFEHIMATGGLTTESNYPYKGKD-ATCKIKNTKPTATSITGYEDVPVNDEKALMKAVAH 258
Query: 274 GPLAVGIN--AVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKN 331
P+++GI Q Y GV C YLDH V VGYG S YWIIKN
Sbjct: 259 QPVSIGIEGGGFDFQFYGSGVFTGE-CTTYLDHAVTAVGYGQSSNGS------KYWIIKN 311
Query: 332 SWGENWGENGYYKI 345
SWG WGE+GY +I
Sbjct: 312 SWGTKWGESGYMRI 325
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/334 (37%), Positives = 178/334 (53%), Gaps = 34/334 (10%)
Query: 42 SEDHLLNAEHHFSLFK---SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGV 98
SE+ L + + LF+ +K+ K YA+ EE RF VFK NL + G+
Sbjct: 37 SEEDLASHDRLIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTSYWLGL 96
Query: 99 TKFSDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
+F+DLT EF+ +LGL + ++ + + ++P + DWR AVT V
Sbjct: 97 NEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEV 156
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
K+QG CGSCW+FS A+EG + + TG L SLSEQ+L+DC S ++GCNGGL
Sbjct: 157 KNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDC--------STDGNNGCNGGL 208
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
M+ AF YI GG+ E+ YPY + G C K +S + + ++++Q +
Sbjct: 209 MDYAFSYIASTGGLRTEEAYPYA-MEEGDCDEGKGAAVVTISGYEDVPANDEQALVKALA 267
Query: 273 HGPLAVGINAV--WMQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
H P++V I A Q Y GGV P CG+ LDHGV VGYG+S K + Y I+
Sbjct: 268 HQPVSVAIEASGRHFQFYSGGVFDGP--CGEQLDHGVTAVGYGTS-------KGQDYIIV 318
Query: 330 KNSWGENWGENGYYKI----CMGRNVCGVDSMVS 359
KNSWG +WGE GY ++ G +CG++ M S
Sbjct: 319 KNSWGPHWGEKGYIRMKRGTGKGEGLCGINKMAS 352
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 127/331 (38%), Positives = 173/331 (52%), Gaps = 30/331 (9%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAV---HG 97
S +L AE +S FK+K K+Y ++ E +R +++ N + AK + V
Sbjct: 18 SYQEVLGAE--WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMA 75
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVK 153
+ +F D+ EF G R + + P N LP DWR GAVT VK
Sbjct: 76 MNEFGDMLHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVK 135
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+QG CGSCW+FSATG+LEG HF +G +VSLSEQ LV C + ++GC GGLM
Sbjct: 136 NQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFG-------NNGCEGGLM 188
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
+ AF+YI G++ EK YPY GTD G+C F KS + A S F + E Q+ +
Sbjct: 189 DDAFKYIRANKGIDTEKSYPYNGTD-GTCHFKKSTVGATDSGFVDIKEGSETQLKKAVAT 247
Query: 273 HGPLAVGINAVW--MQTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWII 329
GP++V I+A Q Y GV P + LDHGVL+VGYG+ YW +
Sbjct: 248 VGPISVAIDASHESFQFYSDGVYDEPECDSESLDHGVLVVGYGT-------LNGTDYWFV 300
Query: 330 KNSWGENWGENGYYKICMG-RNVCGVDSMVS 359
KNSWG WG+ GY ++ +N CG+ S S
Sbjct: 301 KNSWGTTWGDEGYIRMSRNKKNQCGIASSAS 331
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/357 (34%), Positives = 188/357 (52%), Gaps = 38/357 (10%)
Query: 11 LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEH 70
LL S++L + A++ ++++ R + D ++ + + + K+Y + +E
Sbjct: 12 LLFFSTLLILSSAIDIENSVQR---------TNDQVM---AMYESWLVEHGKSYNSLDEK 59
Query: 71 DYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
+ RF +FK NLR + + G+ +F+DLT E+R +LGL R + Q
Sbjct: 60 EMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKRGPKTDVSNQYM 119
Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
P + + LP DWR GAV GVK+QG C SCW+FSA A+EG + + TG L+SLSEQ+L
Sbjct: 120 PKV-GDALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQEL 178
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDC GCN GLM AF++I+ GG+ E +YPYT DG K++
Sbjct: 179 VDCGRT-------QITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQK 231
Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKYLDHGVL 307
+ ++ + S+ + V + P++VG+ + + Y G+ CG +DHGV
Sbjct: 232 YVTIDSYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGS-CGTAVDHGVT 290
Query: 308 IVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV-----CGVDSMVS 359
IVGYG+ + YWI+KNSWG NWGE+GY +I RN+ CG+ M S
Sbjct: 291 IVGYGTE-------RGMDYWIVKNSWGTNWGESGYIRI--QRNIGGAGKCGIAKMPS 338
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 176/325 (54%), Gaps = 32/325 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+S FK + SK Y ++ E +R +++ N R AK Q + AV K++D+ E
Sbjct: 27 WSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADMLSHE 86
Query: 109 FRRQFLGLNRRLRLPADA-----QKAP---ILPTN-DLPTDFDWRDHGAVTGVKDQGACG 159
F G N+ L+ P + P I P + P DWR GAVT VKDQG CG
Sbjct: 87 FVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVTEVKDQGKCG 146
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TGALEG HF TG LVSLSEQ L+DC + ++GCNGGLM++AF+Y
Sbjct: 147 SCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDC-------SAAYGNNGCNGGLMDNAFKY 199
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAV 278
I GG++ EK YPY G D C+++ K+ A V + DE+++ + GP++V
Sbjct: 200 IKDNGGIDTEKAYPYEGVD-DKCRYNAKNSGADDVGFVDIPQGDEEKLMQAVATVGPVSV 258
Query: 279 GINAVW--MQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
I+A Q Y GV C LDHGV++VGYG+ + YW++KNSWG
Sbjct: 259 AIDASQESFQFYSDGVYYDENCSSTDLDHGVMVVGYGTDE------QGGDYWLVKNSWGR 312
Query: 336 NWGENGYYKICMGRNV-CGVDSMVS 359
WG+ GY K+ +N CG+ S S
Sbjct: 313 TWGDLGYIKMARNKNNHCGIASSAS 337
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/326 (37%), Positives = 176/326 (53%), Gaps = 41/326 (12%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
+ L+ ++ + Y E D RFRVF NLR A + + G+ +F+DLT EFR
Sbjct: 52 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 111
Query: 111 RQFLGLNRRLRLPADAQKAPILP--------TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+LG R+PA ++ + +LP DWR+ GAV VK+QG CGSCW
Sbjct: 112 AAYLGA----RIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 167
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA ++E + + TGE+V+LSEQ+LV+C + +SGCNGGLM++AF++I+K
Sbjct: 168 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFIIK 220
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
GG++ E DYPY D G C ++ ++ F + ++++ V H P++V I
Sbjct: 221 NGGIDTEGDYPYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIE 279
Query: 282 A--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A Q Y GV C LDHGV+ VGYG+ K YWI++NSWG WGE
Sbjct: 280 AGGREFQLYKAGVF-TGTCTTNLDHGVVAVGYGTE-------NGKDYWIVRNSWGAKWGE 331
Query: 340 NGYYKICMGRNV------CGVDSMVS 359
+GY + M RNV CG+ M S
Sbjct: 332 DGYIR--MERNVNATTGKCGIAMMAS 355
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 171/321 (53%), Gaps = 24/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+K SK Y +EE +R V++ NLR+ + L H G+ F D+T
Sbjct: 25 DQHWQLWKGWHSKNYHEKEE-GWRRLVWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G RR + + + N L P DWRD G VT VKDQG CGSCW+
Sbjct: 84 HEEFRQIMNGYKRREQRKYSG--SLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGALEG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+
Sbjct: 142 FSTTGALEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDN 194
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINA 282
G++ E YPY GTD C+++ A + F I S +++ V GP++V I+A
Sbjct: 195 QGLDSEDFYPYKGTDDQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDA 254
Query: 283 --VWMQTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C LDHGVL+VGY GF K YWI+KNSW E WG+
Sbjct: 255 GHESFQFYQSGIYFEKECSSDELDHGVLVVGY---GFEGEDVDGKKYWIVKNSWSEKWGD 311
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
G+ + R N CG+ + S
Sbjct: 312 KGFIYMAKDRHNHCGIATAAS 332
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 125/328 (38%), Positives = 180/328 (54%), Gaps = 47/328 (14%)
Query: 44 DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLR-----RAKRRQLLDPTAV 95
+ + E F LF++ + + Y QEE RF++F++NLR AKR+ PT
Sbjct: 33 EQFASEEEVFQLFQAWQKEHKREYGNQEEKAKRFQIFQSNLRYINEMNAKRK---SPTTQ 89
Query: 96 H--GVTKFSDLTPSEFRRQFLGLNRRLRLP-------ADAQKAPILPTNDLPTDFDWRDH 146
H G+ KF+D++P EF + +L + + +P QK ++LP DWRD
Sbjct: 90 HRLGLNKFADMSPEEFMKTYL---KEIEMPYSNLESRKKLQKGDDADCDNLPHSVDWRDK 146
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT V+DQG C S W+FS TGA+EG + + TG LVSLS QQ+VDCD
Sbjct: 147 GAVTEVRDQGKCQSHWAFSVTGAIEGINKIVTGNLVSLSVQQVVDCD---------PASH 197
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GC GG +AF Y+++ GG++ E YPYT + G+CK + +K+ ++ N V+ E+ +
Sbjct: 198 GCAGGFYFNAFGYVIENGGIDTEAHYPYTAQN-GTCKANANKV-VSIDNLLVVVGPEEAL 255
Query: 267 AANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGV---LIVGYGSSGFAPIRFKE 323
+ K P++V I+A +Q Y GGV C K LIVGYGS G
Sbjct: 256 LCRVSKQ-PVSVSIDATGLQFYAGGVYGGENCSKNSTKATLVCLIVGYGSVG-------G 307
Query: 324 KPYWIIKNSWGENWGENGYYKICMGRNV 351
+ YWI+KNSWG++WGE GY + + RNV
Sbjct: 308 EDYWIVKNSWGKDWGEEGY--LLIKRNV 333
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 165/319 (51%), Gaps = 24/319 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ + +FK + +K Y +E YR VF + ++ L VH G+ +++D+
Sbjct: 19 DREWGMFKVRHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMP 78
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R G + + P P DLP DWR G VT VK+QG CGSCW+FS
Sbjct: 79 NEEFVRVMNGYKMQEQRPKAPTYMPPSNVGDLPATVDWRTKGYVTEVKNQGQCGSCWAFS 138
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
+TG+LEG F +L+SLSEQ LVDC E + GC GGLM+ AF YI G
Sbjct: 139 STGSLEGQTFKKYNKLISLSEQNLVDCSTE-------QGNMGCGGGLMDQAFTYIKVNDG 191
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAVW 284
++ E YPY G C+F+K+ + A + ++ I S E + + + GP+AV I+A
Sbjct: 192 IDTETSYPYEAAS-GKCRFNKANVGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDASH 250
Query: 285 M--QTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
M Q Y GV C + LDHGVL VGYG+ K YW++KNSWG WG+ G
Sbjct: 251 MSFQLYKSGVYHYIFCSQTRLDHGVLAVGYGTD-------SGKDYWLVKNSWGATWGQQG 303
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y + R N CG+ + S
Sbjct: 304 YIMMSRNRDNNCGIATQAS 322
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 171/321 (53%), Gaps = 27/321 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +KS K+Y +EE +R V++ +LR + L H G+ F D+
Sbjct: 28 HWEQWKSWHGKSYEQKEE-TWRRMVWEEHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + Q + L N ++P DWRD G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQLMNGYKYK-QTHKKLQGSHFLEPNFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF TG+LVSLSEQ LV+C PE + GCNGGLM+ AF+Y+ GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVEC---SKPE----GNEGCNGGLMDQAFQYVKDNGG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA-- 282
++ E YPY GTD C ++ AA + F + S E + + GP++V I+A
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGH 258
Query: 283 VWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y G+ C LDHGVL+VGY G K YWI+KNSW E WG+NG
Sbjct: 259 TSFQFYQSGIYFEAECSSTDLDHGVLVVGY---GVEKRDTDGKKYWIVKNSWSEKWGQNG 315
Query: 342 YYKICMGR---NVCGVDSMVS 359
Y I M + N CG+ + S
Sbjct: 316 Y--ILMAKDKDNHCGIATAAS 334
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 138/352 (39%), Positives = 180/352 (51%), Gaps = 49/352 (13%)
Query: 13 LLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDY 72
+ SS +A+AV V A +V P D+++ F+ FK+K+ K Y E
Sbjct: 1 MKSSCIAAAVLV----AAGHEVPP------PDYMM----MFNNFKTKYGKVYNGINEDAV 46
Query: 73 RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA-PI 131
RF +FKAN+ + T GV +F+DLT E + GL PA P
Sbjct: 47 RFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEELAASYTGLK-----PASLWSGLPR 101
Query: 132 LPTND-----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
L T++ L + DW G VT VK+QG CGSCWSFS TGALEGA LSTG LVSLSE
Sbjct: 102 LSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSE 161
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
QQ VDCD + DSGCNGG M++AF + K + E YPYT TD G+C
Sbjct: 162 QQFVDCD---------TTDSGCNGGWMDNAFSFA-KKNSICTEGSYPYTATD-GTCNLSG 210
Query: 247 SKIA---AAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKY 301
++ V ++ +S+D +Q + V P+++ I A Q Y GV CG
Sbjct: 211 CQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYSSGV-LTASCGTR 269
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCG 353
LDHGVL VGYGS YW +KNSWG +WGE GY ++ G+ G
Sbjct: 270 LDHGVLAVGYGSE-------AGTDYWKVKNSWGSSWGEQGYVRLQRGKGGAG 314
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 133/362 (36%), Positives = 190/362 (52%), Gaps = 31/362 (8%)
Query: 7 SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
S L+L S L ++A D +++ S+ +S D L+ F + S+ K Y T
Sbjct: 6 SKTLVLTCSLCLFLSLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYET 60
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL--RLPA 124
EE RF VFK NL+ R + G+ +F+DL+ EF+ ++LGL L R +
Sbjct: 61 IEEKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRES 120
Query: 125 DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
++ DLP DWR GAVT VK+QG CGSCW+FS A+EG + + TG L SL
Sbjct: 121 SNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSL 180
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
SEQ+L+DCD + ++GCNGGLM+ AF +I + GG+ +E+DYPY + +C+
Sbjct: 181 SEQELIDCDT--------TYNNGCNGGLMDYAFSFIGQNGGLHKEEDYPYI-MEESTCEM 231
Query: 245 DKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPYICGKY 301
K + N + + + +Q + + PL+V I A Q Y GGV + CG
Sbjct: 232 KKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGH-CGSD 290
Query: 302 LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK----ICMGRNVCGVDSM 357
LDHGV VGYG+S K Y I+KNSWG WGE G+ + I +CG+ M
Sbjct: 291 LDHGVSAVGYGTS-------KNLDYIIVKNSWGAKWGEKGFIRMKRDIGKPEGICGLYKM 343
Query: 358 VS 359
S
Sbjct: 344 AS 345
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/323 (37%), Positives = 172/323 (53%), Gaps = 34/323 (10%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-- 96
GE+SE+ + ++ + ++ TY E + RF F+ NLR + VH
Sbjct: 31 GERSEEEV---RRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSF 87
Query: 97 --GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
G+ +F+DLT E+R +LG +R +L A Q A ++LP DWR GAV
Sbjct: 88 RLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKKGAVG 144
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
VKDQG CGSCW+FSA A+EG + + TG+++ LSEQ+LVDCD S + GCNG
Sbjct: 145 AVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQGCNG 196
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANL 270
GLM+ AFE+I+ GG++ E+DYPY D K+ + + + + ++
Sbjct: 197 GLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKA 256
Query: 271 VKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWI 328
V + P++V I A Q Y G+ CG LDHGV VGYG+ K YW+
Sbjct: 257 VANQPISVAIEAGGRAFQLYKSGIFTG-TCGTALDHGVAAVGYGTE-------NGKDYWL 308
Query: 329 IKNSWGENWGENGYYKICMGRNV 351
++NSWG WGENGY + M RN+
Sbjct: 309 VRNSWGSVWGENGYIR--MERNI 329
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/327 (36%), Positives = 174/327 (53%), Gaps = 34/327 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ + ++ + ++ TY E + RF F+ NLR +
Sbjct: 28 IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG +R +L A Q A ++LP DWR
Sbjct: 85 VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKK 141
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG CGSCW+FSA A+EG + + TG+++ LSEQ+LVDCD S +
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQ 193
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ E+DYPY D K+ + + + + ++
Sbjct: 194 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKS 253
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
V + P++V I A Q Y G+ CG LDHGV VGYG+ K
Sbjct: 254 LQKAVANQPISVAIEAGGRAFQLYKSGIFTG-TCGTALDHGVAAVGYGTE-------NGK 305
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNV 351
YW+++NSWG WGE+GY + M RN+
Sbjct: 306 DYWLVRNSWGSVWGEDGYIR--MERNI 330
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 139/364 (38%), Positives = 188/364 (51%), Gaps = 34/364 (9%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
L+LL +SV AS + + D IR Q D + +K F K+Y EE
Sbjct: 7 LVLLCASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKL---WDDYKESFGKSYNKDEE 63
Query: 70 HDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+DY F N+ + +L T G+ +DL S++R+ L R R D
Sbjct: 64 NDY-MEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRK--LNGYRHRRNFGD 120
Query: 126 AQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ ++ + P N ++P DWRD G VT VK+QG CGSCW+FSATGALEG H ++G+
Sbjct: 121 SMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGK 180
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
+VSLSEQ LVDC + + GCNGGLM+ AFEYI G++ E+ YPY G +
Sbjct: 181 MVSLSEQNLVDC-------STKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRE-T 232
Query: 241 SCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYI 297
C F K I A F + DE+ + + GP+++ I+A Q Y GV
Sbjct: 233 KCHFKKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEE 292
Query: 298 C-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVD 355
C + LDHGVL+VGYG+ A YW+IKNSWG WGE GY +I R N CGV
Sbjct: 293 CSSEELDHGVLLVGYGTDPEAG------DYWLIKNSWGPGWGEKGYIRIARNRSNHCGVA 346
Query: 356 SMVS 359
+ S
Sbjct: 347 TKAS 350
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 121/310 (39%), Positives = 170/310 (54%), Gaps = 25/310 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
+K +K Y+ E R+ ++K N RR + L + + +F D+T SEF+
Sbjct: 30 WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFK----A 85
Query: 116 LNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
N L + P N + P DWR+ G VT VKDQG CGSCW+FS TG+LEG H
Sbjct: 86 FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQH 145
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F TG+LVSLSEQ LVDC + ++GC+GGLM++AF YI + G++ E YPY
Sbjct: 146 FKKTGKLVSLSEQNLVDC-------STAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPY 198
Query: 235 TGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGG 291
T D G C F KS +AA + F + +E+++ + GP++V I+A Q Y G
Sbjct: 199 TAED-GKCVFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSSG 257
Query: 292 V-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICM-GR 349
V + P LDHGVL+VGYG+ K YW++KNSW +WG+ GY K+ +
Sbjct: 258 VYNEPSCSSTELDHGVLVVGYGTE-------SGKDYWLVKNSWNTSWGDKGYIKMRRNAK 310
Query: 350 NVCGVDSMVS 359
N CG+ + S
Sbjct: 311 NQCGIATKAS 320
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/326 (37%), Positives = 176/326 (53%), Gaps = 41/326 (12%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
+ L+ ++ + Y E D RFRVF NLR A + + G+ +F+DLT EFR
Sbjct: 49 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 108
Query: 111 RQFLGLNRRLRLPADAQKAPILP--------TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+LG R+PA ++ + +LP DWR+ GAV VK+QG CGSCW
Sbjct: 109 AAYLGA----RIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 164
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA ++E + + TGE+V+LSEQ+LV+C + +SGCNGGLM++AF++I+K
Sbjct: 165 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFIIK 217
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
GG++ E DYPY D G C ++ ++ F + ++++ V H P++V I
Sbjct: 218 NGGIDTEGDYPYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIE 276
Query: 282 A--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
A Q Y GV C LDHGV+ VGYG+ K YWI++NSWG WGE
Sbjct: 277 AGGREFQLYKAGVF-SGTCTTNLDHGVVAVGYGTE-------NGKDYWIVRNSWGAKWGE 328
Query: 340 NGYYKICMGRNV------CGVDSMVS 359
+GY + M RNV CG+ M S
Sbjct: 329 DGYIR--MERNVNATTGKCGIAMMAS 352
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 134/368 (36%), Positives = 196/368 (53%), Gaps = 36/368 (9%)
Query: 5 ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFK---SKFS 61
+ S L + +L + + VA N D +++ SE+ L + + LF+ +K
Sbjct: 1 MASKLSVAVLLLCVGACVARNSDFSIVGY--------SEEDLSSHDRLVELFEKWLAKHQ 52
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K YA+ EE +RF VFK NL+ + G+ +F+DLT EF+ +LGL+
Sbjct: 53 KAYASFEEKLHRFEVFKDNLKLIDEINREVTSYWLGLNEFADLTHDEFKTTYLGLSPPPA 112
Query: 122 LPADAQ--KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ ++ + + +DLP DWR GAVT VK+QG CGSCW+FS A+EG + + TG
Sbjct: 113 RRSSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTG 172
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L +LSEQ+L+DC S +SGCNGG+M+ AF YI +GG+ E+ YPY +G
Sbjct: 173 NLTALSEQELIDC--------SVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEG 224
Query: 240 GSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIGGV-SCP 295
KS+ A ++S + + + ++Q + H P++V I A Q Y GGV P
Sbjct: 225 SCGDGKKSESEAVSISGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGP 284
Query: 296 YICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG----RNV 351
CG LDHGV VGYGS + K Y I+KNSWG WGE GY ++ G +
Sbjct: 285 --CGAQLDHGVAAVGYGSD-----KGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGL 337
Query: 352 CGVDSMVS 359
CG++ M S
Sbjct: 338 CGINKMAS 345
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 166/310 (53%), Gaps = 32/310 (10%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
+ K Y + D RF+VFK NL + L+ T G+ KF+D+T E+R +LG
Sbjct: 44 RHQKGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTK 103
Query: 118 -----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
R ++ + + + LP DWR GAV +KDQG+CGSCW+FS +E
Sbjct: 104 SNAKRRLMKTKSTGHRYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEA 163
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+ + TG+ VSLSEQ+LVDCD + + GCNGGLM+ AFE+I++ GG++ +KDY
Sbjct: 164 INKIVTGKFVSLSEQELVDCDR--------AYNEGCNGGLMDYAFEFIIQNGGIDTDKDY 215
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAV--WMQTYIG 290
PY G DG K+ + + + ++ V H P++V I A +Q Y
Sbjct: 216 PYRGFDGICDPTKKNAKVVNIDGYEDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQS 275
Query: 291 GVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN 350
GV CG LDHGV++VGYGS YW+++NSWG WGE+GY+K M RN
Sbjct: 276 GVFTG-KCGTSLDHGVVVVGYGSENGV-------DYWLVRNSWGTGWGEDGYFK--MQRN 325
Query: 351 V------CGV 354
V CG+
Sbjct: 326 VRTSTGKCGI 335
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 129/368 (35%), Positives = 197/368 (53%), Gaps = 37/368 (10%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R + ++L LL+ VL++ + D A S G + E F ++ SK K
Sbjct: 5 RPVCMTILFLLIVFVLSAPSSAMDLPAT------SGGHNRSNE--EVEFIFQMWMSKHGK 56
Query: 63 TYATQ-EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR-RL 120
TY E + RF+ FK NLR + + + G+T+F+DLT E+R F G + +
Sbjct: 57 TYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQ 116
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
R +++ L + LP DWR GAV+ +KDQG C SCW+FS A+EG + + TGE
Sbjct: 117 RNLKTSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGE 176
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNG-GLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L+SLSEQ+LVDC+ ++GC G GLM++AF++++ G++ EKDYPY GT
Sbjct: 177 LISLSEQELVDCNL---------VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQ- 226
Query: 240 GSC--KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPY- 296
GSC K S + ++ + ++++ V H P++VG++ Q ++ SC Y
Sbjct: 227 GSCNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKK-SQEFMLYRSCIYN 285
Query: 297 -ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG----RNV 351
CG LDH ++IVGYGS + YWI++NSWG WG+ GY KI + +
Sbjct: 286 GPCGTNLDHALVIVGYGSE-------NGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGL 338
Query: 352 CGVDSMVS 359
CG+ + S
Sbjct: 339 CGIAMLAS 346
>gi|426219849|ref|XP_004004130.1| PREDICTED: cathepsin L1 isoform 1 [Ovis aries]
gi|426219851|ref|XP_004004131.1| PREDICTED: cathepsin L1 isoform 2 [Ovis aries]
Length = 334
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 168/320 (52%), Gaps = 21/320 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
N + H+ +K+ + Y EE +R V++ N + HG + F D
Sbjct: 24 NLDAHWHQWKATHRRLYGMNEE-GWRRAVWEKNKKIIDLHNQEYSQGKHGFSMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ G + R + P+L D+P DW G VT VK+QG CGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKRKKGKLFREPLLI--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC P+ + GCNGGLM++AF+YI +
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR---PQG----NQGCNGGLMDNAFQYIKEN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA- 282
GG++ E+ YPY TD SC + AA + F I E + + GP++V I+A
Sbjct: 194 GGLDSEESYPYLATDTSSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAG 253
Query: 283 -VWMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGEN 340
Q Y G+ P K LDHGVL+VGY GF +WI+KNSWG WG N
Sbjct: 254 HASFQFYKSGIYYDPDCSSKDLDHGVLVVGY---GFEGTDSNNNKFWIVKNSWGPEWGWN 310
Query: 341 GYYKICMGRNV-CGVDSMVS 359
GY K+ +N CG+ + S
Sbjct: 311 GYVKMAKDQNNHCGIATAAS 330
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/357 (34%), Positives = 189/357 (52%), Gaps = 26/357 (7%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
+L+ S+ + LL ++ ++ A++ S D + A + L K K
Sbjct: 2 KLLSPSMAIALLFALFVASSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVK--HGK 59
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLGLNRRLR 121
+Y E + RF++FK NLR + + G+ +F+DLT E+R +LG + +
Sbjct: 60 SYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPK 119
Query: 122 LPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
L + AP + + LP DWR GAV +KDQG+CGSCW+FS A+EG + + TG
Sbjct: 120 LSKVKSDRYAPRV-GDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTG 178
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
EL++LSEQ+LVDCD S + GC+GGLM+ FE+I+ GG++ +KDYPY G D
Sbjct: 179 ELITLSEQELVDCDK--------SYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDA 230
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN--AVWMQTYIGGVSCPYI 297
++ K+ + ++ + + ++ V P++VGI Q Y G+
Sbjct: 231 RCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTG-K 289
Query: 298 CGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGV 354
CG LDHGV +VGYG+ K K YWI++NSWG +WGE GY + M RN+ G
Sbjct: 290 CGTALDHGVNVVGYGTE-------KGKDYWIVRNSWGSSWGEAGYIR--MERNLAGT 337
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 123/309 (39%), Positives = 165/309 (53%), Gaps = 25/309 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA----VHGVTKF 101
+L+AE + FK + +K Y EE R +F N + K L T GV +F
Sbjct: 34 VLDAEVAWHKFKLEHNKVYVGIEEESLRKTIFATNYKFIKDHNALHATGEKSFTVGVNEF 93
Query: 102 SDLTPSEFRRQFLGLN-RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
+D+T EF + GL R+ +P + LP + DWR G V+ VK+QG+CGS
Sbjct: 94 ADMTVHEFAQMMNGLKPDSTRVSGSTYLSPNIDA-PLPVEVDWRTKGLVSEVKNQGSCGS 152
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG+LEG H TG +V LSEQ LVDC + + GCNGGLM +AF+YI
Sbjct: 153 CWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDC-------STSYGNDGCNGGLMTNAFKYI 205
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVG 279
G++ E+ YPY G D G CKF K+K+ A V+ F I + +E ++ L GP++V
Sbjct: 206 KDNKGIDTEEAYPYAGRD-GDCKFKKNKVGATVTGFVEIPAGNEKKLQEALATVGPVSVA 264
Query: 280 INA---VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
I+A +M G P LDHGVL VGYGS K Y+I+KNSWG
Sbjct: 265 IDANHQSFMLYKSGVYDEPECDSAQLDHGVLAVGYGS-------IHGKDYYIVKNSWGTT 317
Query: 337 WGENGYYKI 345
WGE GY +
Sbjct: 318 WGEQGYIRF 326
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 171/319 (53%), Gaps = 22/319 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +KS SK Y +EE +R +++ NL+ + L H G+ F D+T
Sbjct: 27 HWLSWKSWHSKKYHEKEE-GWRRMIWEKNLKMIELHNLDHSLGKHSYRLGMNHFGDMTNE 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G ++ R + + L N L P DWR+ G VT VKDQG CGSCW+FS
Sbjct: 86 EFRQVMNGF-KQSRSQRKYKGSQFLEPNFLQAPKSVDWREKGYVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATGALEG HF TG+LVSLSEQ L+DC PE + GCNGGLM+ AF+YI G
Sbjct: 145 ATGALEGQHFRKTGKLVSLSEQNLIDC---SGPE----GNQGCNGGLMDQAFQYIKDNNG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINA-- 282
++ E+ YPY G D C + +A + F I ++ V GP++V I+A
Sbjct: 198 IDSEESYPYIGKDDEDCLYKPEYNSANDTGFVDIPEGRERALMKAVAAVGPISVAIDASH 257
Query: 283 VWMQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV C + LDHGVL+VGYG G +K YWI+KNSW E WG+ G
Sbjct: 258 TSFQFYESGVYYEPQCNSEELDHGVLVVGYGYEGTDDDN--KKRYWIVKNSWSEKWGDQG 315
Query: 342 YYKICMGR-NVCGVDSMVS 359
Y + R N CG+ S S
Sbjct: 316 YIHMAKDRSNNCGIASAAS 334
>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 135/368 (36%), Positives = 195/368 (52%), Gaps = 34/368 (9%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKF 60
L++SS+L LL S+ ++ ++ + D E S +L F+ F +
Sbjct: 7 LVVSSILFLLCCVAAGSSFDESNPIKLVSDRL-HDFESSFVKVLGQSRRALSFARFAHRH 65
Query: 61 SKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
K Y T+ E RF +F +L R+ ++ L T G+ +F+D T EF++ LG +
Sbjct: 66 GKRYETEGEMKLRFAIFSESLDLIRSTNKKGLPYTL--GLNQFADWTWQEFQKYRLGAAQ 123
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
A + L LP DWR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 124 NC--SATTRGNHKLTNALLPETKDWREEGIVSPVKNQGHCGSCWTFSTTGALEAAYHQAF 181
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPYTG D
Sbjct: 182 GKGISLSEQQLVDCARAFN-------NFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKD 234
Query: 239 GGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVSC 294
+CKF + V N ++ + DE + A V+ P++V V + Y GV
Sbjct: 235 -DACKFSSENVGVRVVESVNITLGAEDELKHAVAFVR--PVSVAFEVVGSFRLYKEGVYT 291
Query: 295 PYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNV 351
CG ++H VL VGYG PYW+IKNSWGE+WG+NGY+K+ MG+N+
Sbjct: 292 TSTCGSTPMDVNHAVLAVGYGVE-------NGIPYWLIKNSWGEDWGDNGYFKMEMGKNM 344
Query: 352 CGVDSMVS 359
CG+ + S
Sbjct: 345 CGIATCAS 352
>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
Length = 333
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 123/324 (37%), Positives = 175/324 (54%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVT 99
DH L+ + + L+K+ K Y EE +R V+K N++ + H +
Sbjct: 22 DHSLDTQ--WKLWKAAHRKPYDLNEE-GWRKAVWKKNMKMIELHNQEYSQGKHSFSMAMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR G R+ I + +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRHTMNGFQRQKNKKGKEFHETIFAS--IPPSVDWREKGYVTPVKNQGKCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSATGALEG F TG+LVSLSEQ LVDC PE + GC+GG +++AF+Y
Sbjct: 137 SCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQ---PE----GNRGCHGGFIDNAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
+L GG++ E+ YPYTG G+C ++ + AA + F + E + + GP++V
Sbjct: 190 VLDVGGLDSEESYPYTGLV-GTCLYNPNNSAANETGFVDLPKQEKALMKAVANLGPISVA 248
Query: 280 INA--VWMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
++A Q Y G+ P + +DH VL+VGY GF + YW++KNSWGE+
Sbjct: 249 VDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVVGY---GFEGADSDDNKYWLVKNSWGEH 305
Query: 337 WGENGYYKICMGRNV-CGVDSMVS 359
WG NGY K+ RN CG+ +M S
Sbjct: 306 WGMNGYIKMAKDRNNHCGIATMAS 329
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 121/299 (40%), Positives = 164/299 (54%), Gaps = 27/299 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVF-----KANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
+K+K+ KTY + E R ++ K A+ Q L + G+ F+D+ EFR
Sbjct: 30 YKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKL-GLNSFADMHNGEFR 88
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+ G R P ++ + LP DWR GAVT +K+QG CGSCW+FS TG+L
Sbjct: 89 KMMNGYRRGT--PRNSVVVHVESNITLPASVDWRTKGAVTPIKNQGQCGSCWAFSTTGSL 146
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG H L G+LVSLSEQ+LVDC + + GC+GGLM+ AF YI K G++ E+
Sbjct: 147 EGQHALKKGKLVSLSEQELVDC-------SAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQ 199
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA-VW-MQT 287
YPYTG D G+C F KS +AA V+ F V S E + GP++V I+A W Q
Sbjct: 200 SYPYTGED-GTCSFKKSDVAATVTGFVDVTSGSESGLQDASATIGPISVAIDASSWDFQL 258
Query: 288 YIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKI 345
Y GV C LDHGVL+VGYG+ YW++KNSWG +WG +GY ++
Sbjct: 259 YESGVYDVSDCSTTELDHGVLVVGYGTD-------DGTAYWLVKNSWGTDWGHHGYIQM 310
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 176/325 (54%), Gaps = 32/325 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+S FK + Y ++ E ++R +++ + AK Q + V G+ K+ D+ E
Sbjct: 27 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 86
Query: 109 FRRQFLGLNR------RLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACG 159
F + G N+ L + + + I P N LP DWR HGAVT +KDQG CG
Sbjct: 87 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 146
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCWSFS TGALEG HF +G LVSLSEQ L+DC E+ G ++GCNGGLM++AF+Y
Sbjct: 147 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-----SEQYG--NNGCNGGLMDNAFKY 199
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAV 278
I GG++ E+ YPY G D C+++ A F I DE ++ + GP++V
Sbjct: 200 IKDNGGIDTEQTYPYEGVD-DKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSV 258
Query: 279 GINA--VWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
I+A Q Y GV C LDHGVL+VGYG+ + YW++KNSWG
Sbjct: 259 AIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDE------QGVDYWLVKNSWGR 312
Query: 336 NWGENGYYKICMGR-NVCGVDSMVS 359
+WGE GY K+ + N CG+ S S
Sbjct: 313 SWGELGYIKMIRNKNNRCGIASSAS 337
>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
Length = 301
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 168/310 (54%), Gaps = 24/310 (7%)
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGL 116
SK Y +EE +R V++ NL++ + L H G+ F D+T EFR+ G
Sbjct: 1 SKKYHEKEE-GWRRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGY 59
Query: 117 NRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
R+ + + + N L P DWRD+G VT VKDQG CGSCW+FS TGALEG H
Sbjct: 60 KRKPQRKFTG--SLFMEPNFLEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQH 117
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI G++ E YPY
Sbjct: 118 FRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNQGLDSEDSYPY 170
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINA--VWMQTYIGG 291
GTD C +D +A + F I S +++ V GP++V I+A Q Y G
Sbjct: 171 LGTDDQPCHYDPKYNSANDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG 230
Query: 292 VSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR- 349
+ C + LDHGVL+VGY GF K YWI+KNSW E WG+ GY + R
Sbjct: 231 IYYEKDCSSEELDHGVLVVGY---GFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRK 287
Query: 350 NVCGVDSMVS 359
N CG+ + S
Sbjct: 288 NHCGIATAAS 297
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 128/321 (39%), Positives = 171/321 (53%), Gaps = 24/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
E H+ L+K+ SK+Y EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKSYHESEE-GWRRMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + + + + N L P DWR+ G VT VKDQG+CGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
G++ E+ YPY GTD C + A + F + S E M + GP++V I+A
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVSVAIDA 256
Query: 283 --VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C + LDHGVL+VGY GF K YWI+KNSW E WG+
Sbjct: 257 GHESFQFYESGIYYEKECSSEELDHGVLVVGY---GFEGEDVDGKKYWIVKNSWSEKWGD 313
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 314 KGYIYMAKDRKNHCGIATASS 334
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 132/342 (38%), Positives = 175/342 (51%), Gaps = 57/342 (16%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKAN--------------LRRAKRRQLLDPTAV 95
E F + ++ K YAT EE R VF N P+
Sbjct: 33 EAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYT 92
Query: 96 HGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT-------NDLPTDFDWRDHGA 148
+ F+DLT EFR LG R+ P A ++ P +P DWR GA
Sbjct: 93 LALNAFADLTHEEFRAARLG---RI-APGAALRSRAAPVYWGLGGGAAVPDALDWRKSGA 148
Query: 149 VTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGC 208
VT VKDQG+CG+CWSFSATGA+EG + + TG LVSLSEQ+L+DCD S +SGC
Sbjct: 149 VTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDR--------SYNSGC 200
Query: 209 NGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAA 268
GGLM+ A+++++K GG++ E+DYPY DG K K + ++ + S+++ +
Sbjct: 201 GGGLMDYAYKFVIKNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLL 260
Query: 269 NLVKHGPLAVGI--NAVWMQTYIGGV---SCPYICGKYLDHGVLIVGYGSSGFAPIRFKE 323
V P++VGI +A Q Y G+ CP LDH VLIVGYGS G
Sbjct: 261 QAVAQQPVSVGICGSARAFQLYYQGIFDGPCP----TSLDHAVLIVGYGSEG-------G 309
Query: 324 KPYWIIKNSWGENWGENGYYKICMGRN------VCGVDSMVS 359
K YWI+KNSWGE+WG GY M RN VCG++ M S
Sbjct: 310 KDYWIVKNSWGESWGMKGYMH--MHRNTGDSKGVCGINMMAS 349
>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
Length = 385
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 130/356 (36%), Positives = 182/356 (51%), Gaps = 46/356 (12%)
Query: 34 VVPSDGEQSEDHLLNAEHHFSL------FKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
+ P D +D ++ + +F+L F + + + Y EH+ RF++F N R +
Sbjct: 42 LTPLDSMHMQD-VIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRISKH 100
Query: 88 QLL----DPTAVHGVTKFSD-----------LTPSEFRRQFLGLNRRLRLPADAQKAPIL 132
+ + G+ +FSD E ++ L D K I
Sbjct: 101 NVRFIQGQVSYTMGINEFSDKVIGLIIHTICFQTDEELKRLRCFRGSLNASRDGSKY-IT 159
Query: 133 PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDC 192
P++ DWR+ GAVT VK+QG CGSCW+FSATGA+EG +FL+TG LVSLSEQQLVDC
Sbjct: 160 IAAPPPSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDC 219
Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY----TGTDGGSCKFDKSK 248
E ++ CNGGLM++AF+Y+ + G++ E YPY TG +C+F+ +
Sbjct: 220 SSEYG-------NNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLKE 272
Query: 249 IAAAVSNFSVISSDEDQMAANLVKH-GPLAVGINAVW--MQTYIGGVSCPYICGK-YLDH 304
V+ + + + V H GP++V INA +Y GV C LDH
Sbjct: 273 AVVRVTGYIDLPRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDH 332
Query: 305 GVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
GVL+VGYG PYW+IKNSWG +WGENGY KI N+CGV SM S
Sbjct: 333 GVLLVGYGEE-------NGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMAS 381
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 172/321 (53%), Gaps = 28/321 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
F +K KF ++Y + E +R +++ N + +L + G+T F+D+
Sbjct: 25 EFHAWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENE 84
Query: 108 EFRR---QFLGLNRRLRLPADAQKAPILPT-NDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
E++R Q + LP LP DLP DWRD G VT VKDQ CGSCW+
Sbjct: 85 EYKRVISQGCLHSFNASLPRRGSTFFRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSCWA 144
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATG+LEG HF TG LVSLSEQQLVDC + + GC GGLM+ AF+YI
Sbjct: 145 FSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYG-------NMGCMGGLMDYAFQYIQAN 197
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINA 282
GG++ E+ YPY + G C+++ I A + ++ +S DED + + GP++VGI+A
Sbjct: 198 GGIDTEESYPYE-AENGKCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGPISVGIDA 256
Query: 283 VWM--QTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
M Q Y GV C LDHGVL VGYG+ YW++KNSWG WG+
Sbjct: 257 SQMSFQFYESGVYNEPDCSSLELDHGVLAVGYGTE-------DGNDYWLVKNSWGLEWGD 309
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY K+ + N CG+ + S
Sbjct: 310 KGYIKMSRNKSNQCGIATAAS 330
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 133/365 (36%), Positives = 182/365 (49%), Gaps = 43/365 (11%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M L LL+ L VLA D A R++ S + + + +K
Sbjct: 1 MALLCKGQFLLIALFFVLAMWA----DQASTRELHESTMVERHEKWM----------AKH 46
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
K Y EE RF++FK N+ + + + + G+ +F+DLT EFR + G R
Sbjct: 47 GKVYKDDEEKLRRFQIFKNNVEFIESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKR- 105
Query: 120 LRLPADAQK--APILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
P DA + P N LP DWR GAVT +KDQ CGSCW+FSA A EG H
Sbjct: 106 ---PLDASRIVTPFKYENVTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHK 162
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
L TG+LVSLSEQ+LVDCD + + D GC GGLM AF++I + GG+ E +Y Y
Sbjct: 163 LRTGKLVSLSEQELVDCDVKGE-------DKGCQGGLMEDAFKFIKRNGGITTEANYAYR 215
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVWM--QTYIGGVS 293
G DG ++ A ++ + V+ + + V H P++V I+A M Q Y G+
Sbjct: 216 GRDGKCDTKKEASHVAKITGYQVVPENSEAALLKAVAHQPVSVSIDAGSMSFQFYQSGIY 275
Query: 294 CPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK----ICMGR 349
CG L+HGV VGYG+S YWI+KNSWG WGE GY + I +
Sbjct: 276 AGS-CGSDLNHGVAAVGYGTSSSGS------KYWIVKNSWGPEWGERGYVRMKRDITSRK 328
Query: 350 NVCGV 354
+CG+
Sbjct: 329 GLCGI 333
>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
Length = 336
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 132/366 (36%), Positives = 184/366 (50%), Gaps = 49/366 (13%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
++LL+L +V+ A A V+P + E + ++K + K Y T+
Sbjct: 1 MMLLILGAVITMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R F+ N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFTFEKNTIKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVKVN 99
Query: 125 DAQKAPILPTND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ ND LP DWR+ V+ VKDQG CGSCW+FS TG+LEG H TG+
Sbjct: 100 KPLLGSEVGDNDDNGTLPKSVDWRNSAMVSEVKDQGECGSCWAFSTTGSLEGQHANKTGK 159
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT TD
Sbjct: 160 LVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDK 212
Query: 241 SCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGV-SCPY 296
CKFD S + A + + V S +E + + GP++V I+A Q Y GV P
Sbjct: 213 PCKFDNSSVGATLIGYKDVKSGNEHALKRAVATVGPISVAIDAGHESFQFYSSGVYDEPQ 272
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR---NVCG 353
+ LDHGVL+VGYG A + +WI+KNSWG NWG+ GY I M R N CG
Sbjct: 273 CSSEQLDHGVLVVGYG----AMNDNSHQAFWIVKNSWGPNWGDQGY--IMMSRNKDNQCG 326
Query: 354 VDSMVS 359
+ + S
Sbjct: 327 IATSAS 332
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 126/318 (39%), Positives = 171/318 (53%), Gaps = 27/318 (8%)
Query: 40 EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT 99
E E H +A FS F++ ++K+YAT+EE R+ +FK NL + +
Sbjct: 106 EWKEAHFQDA---FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMN 162
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPAD-----AQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
F DL+ EFRR++LG + L + + +LP+ +LP DWR G VT VKD
Sbjct: 163 HFGDLSRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKD 221
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
Q CGSCW+FS TGALEGAH TG+LVSLSEQ+L+DC + C+GG MN
Sbjct: 222 QRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSR-------AEGNQSCSGGEMN 274
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKH 273
AF+Y+L +GG+ E YPY D C+ + + F V E M A L K
Sbjct: 275 DAFQYVLDSGGICSEDAYPYLARD-EECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK- 332
Query: 274 GPLAVGINAVWM--QTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKN 331
P+++ I A M Q Y GV CG LDHGVL+VGYG+ + +K +WI+KN
Sbjct: 333 SPVSIAIEADQMPFQFYHEGV-FDASCGTDLDHGVLLVGYGTD-----KESKKDFWIMKN 386
Query: 332 SWGENWGENGYYKICMGR 349
SWG WG +GY + M +
Sbjct: 387 SWGTGWGRDGYMYMAMHK 404
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 119/327 (36%), Positives = 174/327 (53%), Gaps = 34/327 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ + ++ + S+ +TY E + RF VF+ NLR +
Sbjct: 26 IVSYGERSEEEV---RRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAG 82
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
+H G+ +F+DLT E+R +LG +R +L A Q +LP DWR
Sbjct: 83 LHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQADD---NEELPETVDWRKK 139
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV +KDQG CGSCW+FSA A+EG + + TG+++ LSEQ+LVDCD S +
Sbjct: 140 GAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNE 191
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ E+DYPY D K+ + + + + ++
Sbjct: 192 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKS 251
Query: 267 AANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEK 324
V + P++V I A Q Y G+ CG LDHGV VGYG+ K
Sbjct: 252 LQKAVANQPISVAIEAGGRAFQLYKSGIFTG-TCGTALDHGVAAVGYGTE-------NGK 303
Query: 325 PYWIIKNSWGENWGENGYYKICMGRNV 351
YW+++NSWG WGE+GY + M RN+
Sbjct: 304 DYWLVRNSWGTVWGEDGYIR--MERNI 328
>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
Length = 352
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 131/343 (38%), Positives = 179/343 (52%), Gaps = 29/343 (8%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFKANLR 82
+D IR V SD E+ ++ H F+ F SK+ K Y + EE +RFR+F NL
Sbjct: 25 EDSNPIRLV--SDLEEQVLQVIGQTRHAASFARFASKYGKRYDSVEEIQHRFRIFSENLE 82
Query: 83 RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFD 142
K + G+ F+DL+ EFR Q LG + L L + D
Sbjct: 83 LIKSTNKKRLSYKLGLNHFADLSWDEFRTQKLGAAQNCSATLIGNHK--LTDAVLSAEKD 140
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR V+ VKDQ CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +G
Sbjct: 141 WRKESIVSEVKDQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQLVDC--------AG 192
Query: 203 SCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVIS 260
+ ++ GCNGGL + AFEYI GG+ EK+YPYT D S KF +A V + ++
Sbjct: 193 AFNNFGCNGGLPSQAFEYIKYNGGIALEKEYPYTAKDEAS-KFTAENVAVRVLDSVNITL 251
Query: 261 SDEDQMAANLVKHGPLAVGINAV-WMQTYIGGVSCPYICGKY---LDHGVLIVGYGSSGF 316
ED++ + P++V V + Y GV CG ++H VL VGYG
Sbjct: 252 GAEDELKHAVAFARPVSVAFQVVDGFRLYKEGVYTSDTCGNTPMDVNHAVLAVGYGVE-- 309
Query: 317 APIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
PYWIIKNSWG WG++GY+K+ +G+N+CGV + S
Sbjct: 310 -----NNVPYWIIKNSWGSTWGDHGYFKMELGKNMCGVATCAS 347
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 139/364 (38%), Positives = 188/364 (51%), Gaps = 34/364 (9%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
L+LL +SV AS + + D IR Q D + +K F K+Y EE
Sbjct: 7 LVLLCASVFASIDSGSRRDHTIRLHRVKSLRQKIDEAFKL---WDDYKEAFGKSYNKDEE 63
Query: 70 HDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+DY F N+ + +L T G+ +DL S++R+ L R R D
Sbjct: 64 NDY-MEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRK--LNGYRHRRNFGD 120
Query: 126 AQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ ++ + P N ++P DWRD G VT VK+QG CGSCW+FSATGALEG H ++G+
Sbjct: 121 SMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGK 180
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
+VSLSEQ LVDC + + GCNGGLM+ AFEYI G++ E+ YPY G +
Sbjct: 181 MVSLSEQNLVDC-------STKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRE-T 232
Query: 241 SCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYI 297
C F K I A F + DE+ + + GP+++ I+A Q Y GV
Sbjct: 233 KCHFKKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEE 292
Query: 298 C-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVD 355
C + LDHGVL+VGYG+ A YW+IKNSWG WGE GY +I R N CGV
Sbjct: 293 CSSEELDHGVLLVGYGTDPEAG------DYWLIKNSWGPGWGEKGYIRIARNRSNHCGVA 346
Query: 356 SMVS 359
+ S
Sbjct: 347 TKAS 350
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 121/312 (38%), Positives = 168/312 (53%), Gaps = 37/312 (11%)
Query: 58 SKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEF---RRQ 112
+++ K Y +E + RF +F+ N++ A P + GV +F+DLT EF R +
Sbjct: 44 ARYGKVYKDLQEKEKRFNIFQENVKYIEASNNAGNKPYKL-GVNQFTDLTNKEFIATRNK 102
Query: 113 FLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
F G + + + ++ P+ DWR GAVT VK+QG CG CW+FSA A
Sbjct: 103 FKG-----HMSSSITRTTTFKYENVTAPSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAAT 157
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG H LSTG LVSLSEQ+LVDCD + D GC GGLM+ AF++I++ GG+ E
Sbjct: 158 EGIHKLSTGNLVSLSEQELVDCD-------TSGADQGCQGGLMDDAFKFIIQNGGLNTEA 210
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTY 288
YPY G DG ++ A ++ + + S+ +Q V + P++V I+A Q Y
Sbjct: 211 QYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNEQALQQAVANQPISVAIDASGSDFQNY 270
Query: 289 IGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
GV CG LDHGV +VGYG S YW++KNSWGE+WGE GY I M
Sbjct: 271 QSGVFTGS-CGTQLDHGVAVVGYGVSD------DGTKYWLVKNSWGEDWGEEGY--IRMQ 321
Query: 349 RNV------CGV 354
R+V CG+
Sbjct: 322 RDVEAPEGLCGI 333
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 125/324 (38%), Positives = 171/324 (52%), Gaps = 24/324 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
+ NA+ H +KS + + Y T EE ++R V++ N++ + HG T
Sbjct: 22 NQTFNAQWH--KWKSTYRRLYGTNEE-EWRRAVWEKNMKMIELHNGEYSEGKHGYTMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ LP DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQLVNGYKHQKHRKGKVFQEPLML--QLPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA GALEG L TG LVSLSEQ LVDC + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQ-------AEGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVG 279
+L G++ E+ YPY D G+CK+ AA + + I E + + GP+A+
Sbjct: 190 VLNNKGLDSEESYPYEAKD-GTCKYKPEFAAANDTGYVDIPQLEKALMKAVATVGPIAIA 248
Query: 280 INAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGEN 336
I+A Q Y G+ P K LDHGVL+VGY GF +K YWI+KNSWG +
Sbjct: 249 IDASHPSFQFYSSGIYYEPNCSSKELDHGVLVVGY---GFEGTDSNKKKYWIVKNSWGSS 305
Query: 337 WGENGYYKICMGRNV-CGVDSMVS 359
WG G++ I +N CGV + S
Sbjct: 306 WGMGGFFHIAKDKNNHCGVATAAS 329
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 126/323 (39%), Positives = 175/323 (54%), Gaps = 35/323 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKF-------SDLT 105
F+LFK K Y + E YR ++F N +R ++ + G F +D+
Sbjct: 27 FTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKH---NSRYKQGKVSFKLKLNHLADML 83
Query: 106 PSEFRRQFLGLNRRLRLPADA-QKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCW 162
E+ +LG N+ + + Q +P L + DWR GAVT VK+QG CGSCW
Sbjct: 84 IHEYSDVYLGFNKSSKANNNKLQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCW 143
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYIL 221
+FS TGALEG +F TG+LVSLSEQ LVDC SGS ++GC GGLM++AF+YI
Sbjct: 144 AFSTTGALEGQNFRKTGKLVSLSEQNLVDC--------SGSYGNNGCEGGLMDNAFQYIK 195
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGI 280
+ G++ EK YPY G D +C+F K+ I A S F + DE+ + + GP++V I
Sbjct: 196 ENHGIDTEKSYPYEGED-ETCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPISVAI 254
Query: 281 NAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENW 337
+A Q Y GV P + LDHGVL+VGYG + YW++KNSWG W
Sbjct: 255 DASHQSFQFYSEGVYYEPECSSENLDHGVLVVGYGVE-------DNQKYWLVKNSWGTQW 307
Query: 338 GENGYYKICMGR-NVCGVDSMVS 359
G+ GY K+ + N CG+ + S
Sbjct: 308 GDGGYIKMARDQDNNCGIATQAS 330
>gi|391333248|ref|XP_003741031.1| PREDICTED: uncharacterized protein LOC100898636 [Metaseiulus
occidentalis]
Length = 642
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 120/313 (38%), Positives = 177/313 (56%), Gaps = 30/313 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVH---GVTKFSDLTPSE 108
+ L+K K+Y +EE R R+F+ N+ LL D V G+++ +D TP+E
Sbjct: 19 WELYKRIHGKSYDVEEE-SMRRRIFEKNVAMINAHNLLHDLKQVSYRMGLSRLTDATPAE 77
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT---NDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
+ LN LP + L T DLP DW G VT VKDQG CG+CW+F+
Sbjct: 78 VQ-ALKCLN--FTLPNKTSRKSTLGTLQRQDLPEAVDWTQQGYVTPVKDQGKCGACWTFA 134
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATGA+EG HF +TG LVSLSEQ ++DC + +GC+GGL AF+Y+ +GG
Sbjct: 135 ATGAIEGQHFKATGNLVSLSEQNILDCVKT-------ATSNGCSGGLFVEAFDYLKNSGG 187
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAVGINA-- 282
++ E+ YPY + GG+C+F + +AA VS + IS+ +E ++ + GP++VGI++
Sbjct: 188 IDAEESYPYEAS-GGTCRFRQDSVAATVSGYQAISAGNEAELQEAVATIGPISVGIDSGH 246
Query: 283 VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGY 342
Q Y GG+ C ++L H VL+VGYG+ + YW++KNSWG ++G GY
Sbjct: 247 PGFQHYTGGIYYEPECTEHLSHAVLVVGYGTE-------NGEDYWLVKNSWGASYGLQGY 299
Query: 343 YKICMGR-NVCGV 354
K+ R N CG+
Sbjct: 300 IKMARNRNNNCGI 312
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 125/326 (38%), Positives = 170/326 (52%), Gaps = 30/326 (9%)
Query: 46 LLNAEH-HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVH---GVTK 100
LL H + L+K +K Y E+ R R+F+ N+ LL D V G+++
Sbjct: 331 LLKFSHADWDLYKRVQNKNYGVAED-SMRRRIFEKNVAMINGHNLLHDLKRVSYRMGLSR 389
Query: 101 FSDLTPSEFR-RQFLGLNRRLRLPADAQKA-PILPTNDLPTDFDWRDHGAVTGVKDQGAC 158
F+D TP E R + L +N + ++ + ++DL DWR G VT VK+QG C
Sbjct: 390 FTDSTPEEMRAMRCLNINVSMTTGGPHEEVFDAIESSDLSEAIDWRQQGYVTPVKNQGNC 449
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+FSATGA+EG HF +TG L SLSEQ LVDC E GC+GG AF+
Sbjct: 450 GSCWAFSATGAVEGQHFKATGRLESLSEQNLVDCVKE---------SKGCDGGFFEQAFQ 500
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
YI GG+ E YPY D GSC+F + I A VS + I E + + GP++
Sbjct: 501 YIKDNGGINTEDSYPYEAFD-GSCRFREDSIGATVSGYQTIPKGSEADLQKAVSTIGPIS 559
Query: 278 VGINAV--WMQTYIGGVSC-PYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
V I+ Q Y GV P LDH VL+VGYGS G + YW++KNSWG
Sbjct: 560 VAIDVSNPSFQNYREGVYYEPSCSSSNLDHAVLVVGYGSDG-------GEDYWLVKNSWG 612
Query: 335 ENWGENGYYKICMGR-NVCGVDSMVS 359
++GE GY ++ + N CG+ S +
Sbjct: 613 TSFGEQGYVRMARNKGNNCGIASAAA 638
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 138/370 (37%), Positives = 191/370 (51%), Gaps = 42/370 (11%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ LS LL L + + D +++ P D S D L+ F + S K
Sbjct: 1 MALSKLLPLAMCMSFFVVTSFGKDFSIV-GYWPED-LTSMDRLIEL---FEEWISNHGKI 55
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRR 119
Y T EE +RF VFK NL+ + GV +F+DLT EF+ +LGL +R
Sbjct: 56 YETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRT 115
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ P + ++ DLP DWR GAVT VK+QG+CGSCW+FS A+EG + + G
Sbjct: 116 RQSPEEFTYKDVV---DLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGG 172
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L SLSEQ+L+DCD ++GC+GGLM+ AF +I+ +GG+ +E+DYPY +
Sbjct: 173 NLTSLSEQELIDCDR--------PYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVE- 223
Query: 240 GSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGV-SCP 295
+C K ++ +S + + + + + H PL+V I A Q Y GGV P
Sbjct: 224 STCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGP 283
Query: 296 YICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRN----- 350
CG LDHGV VGYGSS K Y I+KNSWG WGE GY I M RN
Sbjct: 284 --CGTQLDHGVTAVGYGSS-------KGVDYIIVKNSWGPKWGEKGY--IRMKRNTGKPA 332
Query: 351 -VCGVDSMVS 359
+CG++ M S
Sbjct: 333 GLCGINKMAS 342
>gi|155970232|gb|ABU41785.1| cysteine protease [Rosa x borboniana]
Length = 357
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 130/348 (37%), Positives = 188/348 (54%), Gaps = 35/348 (10%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNA------EHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
D+ + IR +VP + ED ++ F+ F ++ K Y + EE RF +F
Sbjct: 26 DESSPIR-LVPDGLRELEDQVVQVLGQVCHVRSFARFAYRYEKRYESVEEMGRRFEIFAE 84
Query: 80 N--LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL 137
N L R+ R+ L + GV +F+D T EF+R LG + A + L
Sbjct: 85 NKKLIRSTNRKGL--SYKLGVNRFADWTWEEFQRHRLGAAQNCS--ATTKGNHKLTDAVP 140
Query: 138 PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECD 197
P +WRD G VT VKDQG CGSCW+FS TGALE A+ + G+ +S SEQQLVDC
Sbjct: 141 PLTKNWRDEGIVTPVKDQGHCGSCWTFSTTGALEAAYVQAFGKQISPSEQQLVDC----- 195
Query: 198 PEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SN 255
+G+ ++ GC+GGL + AFEYI GG++ E+ YPYT D G+CKF + V +
Sbjct: 196 ---AGAFNNFGCSGGLPSQAFEYIKYNGGLDTEQAYPYTAVD-GACKFSSENVGVRVLDS 251
Query: 256 FSVISSDEDQMAANLVKHGPLAVGINAVW-MQTYIGGVSCPYICGKY---LDHGVLIVGY 311
++ +DE+++ + P++V V + Y GV CG ++H VL VGY
Sbjct: 252 VNITLNDEEELKHAVAFVRPVSVAFQVVQDFRLYKSGVYTSETCGNTPMDVNHAVLAVGY 311
Query: 312 GSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
G PYW+IKNSWG++WG+NGY+K+ G+N+CGV + S
Sbjct: 312 GVENGV-------PYWLIKNSWGQSWGDNGYFKMEYGKNMCGVATCAS 352
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 163/312 (52%), Gaps = 35/312 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ FK+K+ K Y E RF +FKAN+ + T GV +F+DLT EF
Sbjct: 27 FNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEEFAAS 86
Query: 113 FLGLNRRLRLPADAQKA-PILPTND-----LPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+ GL PA P L T++ L + DW G VT VK+QG CGSCWSFS
Sbjct: 87 YTGLK-----PASLWSGLPRLSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFST 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEGA LSTG LVSLSEQQ DCD + DSGCNGG M++AF + K +
Sbjct: 142 TGALEGAWALSTGNLVSLSEQQFEDCD---------TTDSGCNGGWMDNAFSFA-KKNSI 191
Query: 227 EREKDYPYTGTDGGSCKFDKSKIA---AAVSNFSVISSDEDQMAANLVKHGPLAVGINA- 282
E YPYT TD G+C ++ V ++ +S+D +Q + V P+++ I A
Sbjct: 192 CTEGSYPYTATD-GTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEAD 250
Query: 283 -VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Q Y GV CG LDHGVL VGYGS YW +KNSWG +WGE G
Sbjct: 251 QYSFQLYSSGV-LTASCGTRLDHGVLAVGYGSE-------AGTDYWKVKNSWGSSWGEQG 302
Query: 342 YYKICMGRNVCG 353
Y ++ G+ G
Sbjct: 303 YVRLQRGKGGAG 314
>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
Length = 329
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 119/311 (38%), Positives = 172/311 (55%), Gaps = 25/311 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
+K K++++Y EE R +++ N+ K + +F+DLT E+R+ +LG
Sbjct: 33 WKLKYNRSYGLDEE--LRKKIWANNMLYVKEFNAEGHSYKLAANQFADLTNLEYRQIYLG 90
Query: 116 LNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
+ RL + + DLPT DWR G VT VK+QG CGSCWSFSATG+LEG
Sbjct: 91 YDNEARLSRKREGKVFQRKMKDEDLPTTVDWRSKGVVTPVKNQGQCGSCWSFSATGSLEG 150
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+ + +G+LVS SEQ+LVDC + + GC GGLM+ AF+Y + E+E DY
Sbjct: 151 QYAIKSGKLVSFSEQELVDCS-------TSLGNHGCQGGLMDYAFKY-WETNLAEKESDY 202
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDE-DQMAANLVKHGPLAVGINA--VWMQTYI 289
YT + G CK++ S+F+ I S+ D + + GP+AV ++A Q Y
Sbjct: 203 TYTAKN-GKCKYNAQLGVTKDSSFTDIPSENCDALKEAVANKGPIAVAMDASHTSFQMYH 261
Query: 290 GGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG 348
G+ P++C K LDHGVL+VGYG+ YW+IKNSWG WG +GY+KI M
Sbjct: 262 SGIYTPFLCSKTKLDHGVLVVGYGTDNGV-------DYWLIKNSWGMAWGMDGYFKIEMK 314
Query: 349 RNVCGVDSMVS 359
+ CG+ + S
Sbjct: 315 SDKCGICTQAS 325
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 171/321 (53%), Gaps = 24/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
E H+ L+K+ SK Y EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKHYHESEE-GWRRMVWEKNLKKIEIHNLEHTMGKHSYRLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + + + + N L P DWR+ G VT VKDQG+CGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
G++ E+ YPY GTD C + AA + F + S E M + GP++V I+A
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSVAIDA 256
Query: 283 --VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C + LDHGVL+VGY GF K YWI+KNSW E WG+
Sbjct: 257 GHESFQFYESGIYYEKECSSEELDHGVLVVGY---GFEGEDVDGKKYWIVKNSWSEKWGD 313
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 314 KGYIYMAKDRKNHCGIATASS 334
>gi|340504799|gb|EGR31212.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 250
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 99/223 (44%), Positives = 137/223 (61%), Gaps = 16/223 (7%)
Query: 137 LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC 196
LP+ FDWR+ G +T VK Q CG CW+F+ TG +E + L +LV+ SEQQL+DCD
Sbjct: 39 LPSYFDWREQGIITPVKYQDTCGGCWTFATTGVIESQYALKYNKLVNFSEQQLIDCD--- 95
Query: 197 DPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF 256
S + GC GGLM A++ I + GG+E +DY G CK D +K++A V N+
Sbjct: 96 ------SINDGCRGGLMTDAYKAIQEMGGLETSEDYGEYLNSKGQCKIDSNKVSAKVINW 149
Query: 257 SVISSDEDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGF 316
IS DE+ + LV++GP+AVG+NA ++Q Y GG+ P +C ++H VLIVGYG
Sbjct: 150 YQISEDEEAIRRELVQNGPIAVGVNARFLQFYQGGILDPKLCDDSINHAVLIVGYGEE-- 207
Query: 317 APIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVS 359
K YWIIKN WG++WG NGY+K+ G+ CGV + S
Sbjct: 208 -----NGKKYWIIKNQWGKSWGINGYFKLVRGKKQCGVHTYAS 245
>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
Length = 360
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 122/325 (37%), Positives = 174/325 (53%), Gaps = 44/325 (13%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ F ++ K+Y + E RFR+F +L+ + + G+ +F+D++ EFR
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRA 117
Query: 112 QFLGL----------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
LG N R+R A A LP DWR+ G V+ VK+QG CGSC
Sbjct: 118 TRLGAAQNCSATLTGNHRMRAAAVA----------LPETKDWREDGIVSPVKNQGHCGSC 167
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TGALE A+ +TG+ +SLSEQQL+DC + + GCNGGL + AFEYI
Sbjct: 168 WTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFN-------NFGCNGGLPSQAFEYIK 220
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAV 278
GG++ E+ YPY G + G CKF + V N ++ + DE + A LV+ P++V
Sbjct: 221 YNGGLDTEESYPYQGVN-GICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVR--PVSV 277
Query: 279 GINAVW-MQTYIGGVSCPYICGKY---LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWG 334
+ + Y GV CG ++H VL VGYG PYW+IKNSWG
Sbjct: 278 AFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVE-------DGVPYWLIKNSWG 330
Query: 335 ENWGENGYYKICMGRNVCGVDSMVS 359
+WG+ GY+K+ MG+N+CGV + S
Sbjct: 331 ADWGDEGYFKMEMGKNMCGVATCAS 355
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 123/323 (38%), Positives = 177/323 (54%), Gaps = 29/323 (8%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTK 100
S+D ++ A H + +++S+ Y E RF VFKAN++ + GV +
Sbjct: 121 SDDSVMVARHE--QWMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQ 178
Query: 101 FSDLTPSEFR--RQFLGL-NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
F+DLT EFR + GL + +++P + + + LPT DWR GAVT +KDQG
Sbjct: 179 FADLTNDEFRSTKTNKGLKSSNMKIPTGFRYENV-SADALPTTIDWRTKGAVTPIKDQGQ 237
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CG CW+FSA A EG +STG+LVSL+EQ+LVDCD + D GC GGLM+ AF
Sbjct: 238 CGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGE-------DQGCEGGLMDDAF 290
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++I+K GG+ E YPYT D G CK S AA + + + ++++ V + P++
Sbjct: 291 KFIIKNGGLTTESSYPYTAAD-GKCK-SGSNSAATIKGYEDVPANDEAALMKAVANQPVS 348
Query: 278 VGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
V ++ + Q Y GGV CG LDHG+ +GYG + YW++KNSWG
Sbjct: 349 VAVDGGDMTFQFYSGGVMTGS-CGTDLDHGIAAIGYGKTS------DGTKYWLMKNSWGT 401
Query: 336 NWGENGYYK----ICMGRNVCGV 354
WGENGY + I R +CG+
Sbjct: 402 TWGENGYLRMEKDISDKRGMCGL 424
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 171/317 (53%), Gaps = 32/317 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
FK + K Y ++ E +R +++ N + AK QL + V G K++D+ EF +
Sbjct: 31 FKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDMLHHEFIQ 90
Query: 112 QFLGLNRRLR-------LPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCW 162
G NR + D + A +P + P DW GAVT VKDQG CGSCW
Sbjct: 91 AMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQGKCGSCW 150
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TGALEG HF +G LVSLSEQ L+DC S ++GCNGGLM++AF+YI
Sbjct: 151 AFSTTGALEGQHFRKSGYLVSLSEQNLIDC-------SSTYGNNGCNGGLMDNAFKYIKD 203
Query: 223 AGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
GG++ EK YPY G D C+++ K+ A V + S DE+++ + GP++V I+
Sbjct: 204 NGGIDTEKTYPYEGVD-DKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVSVAID 262
Query: 282 AVW--MQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWG 338
A Q Y GGV C LDHGVL+VGYG+ YW++KNSW WG
Sbjct: 263 ASQNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGTDEAGG------DYWLVKNSWSRTWG 316
Query: 339 ENGYYKICMGR-NVCGV 354
E GY K+ R N CG+
Sbjct: 317 ELGYIKMARNRDNHCGI 333
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 171/321 (53%), Gaps = 24/321 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
E H+ L+K+ SK Y EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKNYHASEE-GWRRMVWEKNLKKIEIHNLEHTMGKHSHRLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + + + + N L P DWR+ G VT VKDQG+CGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 144 FSTTGAMEGQPFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAVGINA 282
G++ E+ YPY GTD C + AA + F + S E M + GP++V I+A
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSVAIDA 256
Query: 283 --VWMQTYIGGVSCPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGE 339
Q Y G+ C + LDHGVL+VGY GF K YWI+KNSW E WG+
Sbjct: 257 GHESFQFYESGIYYEKECSSEELDHGVLVVGY---GFEGEDVDGKKYWIVKNSWSEKWGD 313
Query: 340 NGYYKICMGR-NVCGVDSMVS 359
GY + R N CG+ + S
Sbjct: 314 KGYIYMAKDRKNHCGIATASS 334
>gi|12024965|gb|AAG45727.1| cathepsin L-like cysteine protease [Leishmania chagasi]
Length = 381
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 166/311 (53%), Gaps = 43/311 (13%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VK+QGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E + LVSLSEQQLV CD + D+GCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYG 206
Query: 225 GVEREKDYPYTGTDGGSCK-FDKSKI--AAAVSNFSVISSDEDQMAANLVKHGPLAVGIN 281
V EK YPYT +G + + SK+ A + + +I S+E MAA L ++GP+A+ ++
Sbjct: 207 IVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVD 266
Query: 282 AVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
A +Y GVL+VGY +G PYW+IKNSWGE+WGE G
Sbjct: 267 ASSFMSY--------------QSGVLLVGYNKTGGV-------PYWVIKNSWGEDWGEKG 305
Query: 342 YYKICMGRNVC 352
Y ++ MG N C
Sbjct: 306 YVRVAMGLNAC 316
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 122/319 (38%), Positives = 168/319 (52%), Gaps = 25/319 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLT 105
+ + L+ K Y +EE R +++ NL ++ L D + G+ ++ D+T
Sbjct: 24 DSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMT 82
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR G R + P DLP DWR G VT +K+QG CGSCWSFS
Sbjct: 83 NEEFRSTMNGYKMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG F TG+L SLSEQ LVDC + + GC GGLM+ AF+YI G
Sbjct: 143 ATGSLEGQTFKKTGKLPSLSEQNLVDCSQK-------QGNHGCQGGLMDDAFQYIKDNNG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAVGINAVW 284
++ E YPY + G C+F+ + + A S F+ I S E + + + GP+AV I+A
Sbjct: 196 IDTESSYPYEAKN-GKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASH 254
Query: 285 M--QTYIGGVSCPYICGK-YLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
M Q Y GV + C + LDHGVL VGYG+ K YW++KNSWGE+WG+ G
Sbjct: 255 MSFQLYKSGVYHEFFCSETRLDHGVLAVGYGTE-------SGKDYWLVKNSWGESWGQKG 307
Query: 342 YYKICMG-RNVCGVDSMVS 359
Y + RN CG+ + S
Sbjct: 308 YIMMSRNKRNNCGIATSAS 326
>gi|298713906|emb|CBJ33775.1| Cathepsin-like proteinase [Ectocarpus siliculosus]
Length = 462
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 136/367 (37%), Positives = 176/367 (47%), Gaps = 53/367 (14%)
Query: 36 PSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV 95
P E S+ L E F F KF K+Y +E RF VFK NL+R R
Sbjct: 112 PRLSELSDQEL---ESLFQEFGIKFEKSYENDDEKAMRFEVFKRNLKRIDERNSKSLGVK 168
Query: 96 HGVTKFSDLTPSEF-----------------RRQFLGLNRRLRLPADAQKAPILP----- 133
+ VT ++DLT EF R + + + Q P
Sbjct: 169 YDVTMWTDLTHEEFKGYQNYGKISDEAKEVARSKAMSTKDASDMYESCQSCTRFPELEQY 228
Query: 134 -TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDC 192
T DLPT+FDWRD+GAVT VK+Q CGSCW+FS TG LEGA +LS L SLSEQQLV C
Sbjct: 229 ITGDLPTEFDWRDYGAVTPVKNQAYCGSCWTFSTTGCLEGAWYLSGHPLESLSEQQLVAC 288
Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT--------GTDGGSCKF 244
D S + GCNGG + + +YI K GG+ E YPY G S
Sbjct: 289 DT--------SYNQGCNGGWPSISMDYISKNGGIVPESIYPYRKVFMNGHLGDPVCSDVV 340
Query: 245 DKSKIAAAVSNFSVISSD---EDQMAANLVKHGPLAVGINAVWMQTYIGGVSCPYICGKY 301
+ AA ++ ++ D E+ MA L+ +GPL+V ++A+ M Y G+ C
Sbjct: 341 KEGNYAATLAIEVALAEDSMTEEAMARWLILNGPLSVALDAMGMDYYSEGIDMGEYCEPL 400
Query: 302 -LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGRNVCGVDSMVSS 360
+DH VLIVGYG YWIIKNSW WGE GYY++ G N CG+ V++
Sbjct: 401 EIDHAVLIVGYGEEDGV-------KYWIIKNSWKYLWGERGYYRLVRGVNACGIADDVTT 453
Query: 361 VAAIHTT 367
+ T
Sbjct: 454 IIVADAT 460
>gi|165969032|ref|YP_001650932.1| peptidase [Orgyia leucostigma NPV]
gi|164663528|gb|ABY65748.1| peptidase [Orgyia leucostigma NPV]
Length = 328
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 113/329 (34%), Positives = 176/329 (53%), Gaps = 33/329 (10%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F + + K Y E R+ +FK NL + L+ TAV+ + KFSDL+
Sbjct: 22 LLKAPDYFESFVANYQKNYNDDLEKSKRYTIFKDNLEEINVKNRLNDTAVYRINKFSDLS 81
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
+E ++ GLN P++ K +L P P +FDWR VT +K+QG+CG+
Sbjct: 82 KTEIISKYTGLN----APSETTNFCKTIVLDQPPGKGPLNFDWRQQNKVTSIKNQGSCGA 137
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ ++E + + ++LSEQQL+DCD+ D GC GGL+++AFE +
Sbjct: 138 CWAFATLASIESQYAIRNDRHINLSEQQLIDCDY---------VDMGCYGGLLHTAFEQM 188
Query: 221 LKAGGVEREKDYPYTGTDGGSCKF----DKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
++ GGV++E +YPY G + C+ D S + + + E+++ L GP+
Sbjct: 189 IQMGGVKQEHEYPYAGVN-KQCELNDITDDSFVVRIKGCYRYVVVREEKLKDLLRAVGPI 247
Query: 277 AVGINAVWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGE 335
+ I+A + Y GV C Y L+H VL+VGYG PYW KN+WG
Sbjct: 248 PIAIDASGIVNYYKGVIN--YCENYGLNHAVLLVGYGVDNGV-------PYWTFKNTWGV 298
Query: 336 NWGENGYYKICMGRNVCGVDSMVSSVAAI 364
+WGENGY+++ N CG+ + ++S A I
Sbjct: 299 DWGENGYFRLRQNINACGMANELASSAVI 327
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 136/368 (36%), Positives = 191/368 (51%), Gaps = 51/368 (13%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+LL+L +V+A+ AV+ D ++R+ ++ FK + K Y ++ E
Sbjct: 3 ILLVLCAVVAAGTAVSFFD-LVRE------------------EWNTFKLEHKKQYDSETE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPA- 124
+R +++ N + AK Q V K+SD+ EF G N+ ++
Sbjct: 44 EKFRMKIYAENKHKVAKHNQRYQKGLVSYRLKTNKYSDMLHHEFVNTMNGFNKTVKHNKG 103
Query: 125 ------DAQKAPIL-PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
D + A + P N P DWR HGAVT VKDQG CGSCWSFS TGALEG HF
Sbjct: 104 LYAKGNDIRGATFVSPANVAAPPTVDWRQHGAVTPVKDQGKCGSCWSFSTTGALEGQHFR 163
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
+G LVSLSEQ L+DC S ++GCNGGLM++AF+YI G++ EK YPY
Sbjct: 164 KSGFLVSLSEQNLIDC-------SSAYGNNGCNGGLMDNAFKYIKDNDGIDTEKTYPYEA 216
Query: 237 TDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVS 293
D C+++ K+ A V + + DE ++ L GP++V I+A Q Y GV
Sbjct: 217 VD-DKCRYNPKNSGAEDVGFVDIPAGDEHKLMLALATVGPVSVAIDASQESFQLYSDGVY 275
Query: 294 CPYIC-GKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NV 351
C + LDHGVL+VGYG+ YW++KNSWG +WG+ GY K+ R N
Sbjct: 276 YDENCSSENLDHGVLVVGYGTDEDG------GDYWLVKNSWGPSWGDEGYIKMARNRDNH 329
Query: 352 CGVDSMVS 359
CG+ S S
Sbjct: 330 CGIASSAS 337
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 127/355 (35%), Positives = 181/355 (50%), Gaps = 32/355 (9%)
Query: 29 AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ 88
A I Q+ + GE++ + + F + K KTY ++EE + R ++F N ++
Sbjct: 44 AKINQLKAALGEKATKEVGSLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHN 103
Query: 89 LLDPTAVH----GVTKFSDLTPSEFRRQFLGLN---RRLRLPADA---QKAPILPTNDLP 138
H G+ +DLT EF++ LG N R R P DA + A + P P
Sbjct: 104 AEYENGEHTHFVGLNHLADLTKDEFKK-MLGYNAALRASRAPVDASTWEYADVTP----P 158
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
+ DW GAVT VK+Q CGSCW+FS TGA+EG + + TG+L+SLSE++L+ C
Sbjct: 159 EEIDWVASGAVTPVKNQKQCGSCWAFSTTGAVEGVNAIKTGKLISLSEEELISC------ 212
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
S + + GCNGGLM++ FE+I+ G++ E + Y + F + A A+ F
Sbjct: 213 --STNGNMGCNGGLMDNGFEWIVNNRGIDTEDGWEYVAKEEKCGFFRRHHRAVAIDGFKD 270
Query: 259 ISSDEDQMAANLVKHGPLAVGINA--VWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGF 316
+ S+++ V P++V I A Q Y GGV CG LDHGVL+VGYG
Sbjct: 271 VPSNDEDSLMKAVSQQPVSVAIEADHQSFQLYAGGVYSAKDCGTELDHGVLLVGYGVD-- 328
Query: 317 APIRFKEKPYWIIKNSWGENWGENGYYKICMG----RNVCGVDSMVSSVAAIHTT 367
P K K +W IKNSWG WGE+GY +I G CGV S + TT
Sbjct: 329 -PKSTKHKHFWKIKNSWGPAWGEDGYIRIAKGGSGVEGQCGVAMQPSYPTKLGTT 382
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 117/307 (38%), Positives = 163/307 (53%), Gaps = 32/307 (10%)
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLP 123
+EH RF +FK N++ D G+ KF+DL+ EF+ + ++ LR
Sbjct: 61 DEHARRFEIFKENVKHIDSVNKKDGPYKLGLNKFADLSNEEFKAMHMTTKMEKHKSLRGD 120
Query: 124 ADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
+ + N LP DWR GAVT VK+QG CGSCW+FS ++EG +++ TG+L
Sbjct: 121 RGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCWAFSTIASVEGINYIKTGKL 180
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDC E ++GCNGGLM++AF+YI+ GG+ E +YPYT + G
Sbjct: 181 VSLSEQQLVDCSKE---------NAGCNGGLMDNAFQYIIDNGGIVTEDEYPYT-AEAGE 230
Query: 242 C---KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--MQTYIGGVSCPY 296
C K + IA + F + ++ + V H P+++ I A Q Y GV
Sbjct: 231 CSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIEASGHDFQFYSTGVFTGK 290
Query: 297 ICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMG----RNVC 352
CG LDHGV++VGYG S P YWI++NSWG WGE GY ++ G C
Sbjct: 291 -CGTELDHGVVVVGYGKS---PEGIN---YWIVRNSWGPEWGEQGYIRMQRGIEATEGKC 343
Query: 353 GVDSMVS 359
G+ S
Sbjct: 344 GISMQAS 350
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 170/319 (53%), Gaps = 32/319 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLL--DPTAVHGVTKFSDLTPSE 108
F FK ++ + YAT +E YR V+ N+ A Q + T + + +F D+T E
Sbjct: 22 FHQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEE 81
Query: 109 FRRQFLGLNRRLRLPA-DAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
GL LPA +++ +L D LP + DWR GAVT VKDQ ACGSCW+FS
Sbjct: 82 INAVMNGL-----LPASESRGVAVLGGRDDTLPAEVDWRTKGAVTPVKDQKACGSCWAFS 136
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL G+LVSLSEQ LVDC + D GC GGLM+ AF YI GG
Sbjct: 137 ATGSLEGQHFLKDGKLVSLSEQNLVDC-------STKQGDHGCGGGLMDFAFTYIKDNGG 189
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAVGINA-- 282
++ E YPY TD G C+++ + A V+ + + D ED + + GP++V I+A
Sbjct: 190 IDTEASYPYEATD-GKCQYNPANSGATVTGYVDVEHDSEDALQKAVATIGPISVAIDASR 248
Query: 283 VWMQTYIGGVSCPYICGKY-LDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENG 341
Y GV C LDHGVL VGYG+ YW++KNSW WG +G
Sbjct: 249 STFHFYHKGVYYDKECSSTSLDHGVLAVGYGTQ-------DGTDYWLVKNSWNITWGNHG 301
Query: 342 YYKICMGR-NVCGVDSMVS 359
+ ++ R N CG+ + S
Sbjct: 302 FIEMSRNRNNNCGIATQAS 320
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 168/314 (53%), Gaps = 29/314 (9%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
++ L+ + H + +K + YA +E R+ VFK+N+ R + + T V +
Sbjct: 29 DNELIMQKRHIE-WMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQ 87
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPI------LPTNDLPTDFDWRDHGAVTGVKD 154
F+DLT EFR + G L + +Q + + LP DWR GAVT +K+
Sbjct: 88 FADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSSGALPISVDWRTKGAVTPIKN 147
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG+CG CW+FSA A+EGA + G+L+SLSEQQLVDCD + D GC GGLM+
Sbjct: 148 QGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD---------TNDFGCEGGLMD 198
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
+AFE+I+ GG+ E +YPY G D +C K+ A +++ + + +++Q V H
Sbjct: 199 TAFEHIMATGGLTTESNYPYKGED-ATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAH 257
Query: 274 GPLAVGIN--AVWMQTYIGGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKN 331
P++VGI Q Y GV C YLDH V +GYG S YWIIKN
Sbjct: 258 QPVSVGIEGGGFDFQFYSSGVFTGE-CTTYLDHAVTAIGYGQST------NGSKYWIIKN 310
Query: 332 SWGENWGENGYYKI 345
SWG WGE+GY +I
Sbjct: 311 SWGTKWGESGYMRI 324
>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
Length = 344
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 125/346 (36%), Positives = 174/346 (50%), Gaps = 31/346 (8%)
Query: 30 MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
++ V + + SE NA F+ + K+Y T EE R+ +FKAN+ ++
Sbjct: 10 LLVSVATAKQQFSELQYRNA---FTDWMITHQKSY-TSEEFGARYNIFKANMDYVQQWNS 65
Query: 90 LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
V G+ F+D+T E+R +LG Q+ + T+ + DWR GAV
Sbjct: 66 KGSETVLGLNNFADITNEEYRNTYLGTKFDASSLIGTQEEKVFTTSSAASK-DWRSEGAV 124
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T VK+QG CG CWSFS TG+ EGAHF S GELVSLSEQ L+DC E +SGC+
Sbjct: 125 TPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE---------NSGCD 175
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM AFEYI+ G++ E YPY + G C++ A +S++ +++ + +
Sbjct: 176 GGLMTYAFEYIINNNGIDTESSYPYKA-ENGKCEYKSENSGATLSSYKTVTAGSESSLES 234
Query: 270 LVKHGPLAVGINAVW--MQTYIGGVSC-PYICGKYLDHGVLIVGY------------GSS 314
V P++V I+A Q Y G+ P + LDHGVL VGY G S
Sbjct: 235 AVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQS 294
Query: 315 GFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
YWI+KNSWG +WG GY + R N CG+ S S
Sbjct: 295 SGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSAS 340
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 127/321 (39%), Positives = 173/321 (53%), Gaps = 37/321 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + S K Y T EE +RF VFK NL+ + GV +F+DLT EF+
Sbjct: 48 FEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNM 107
Query: 113 FLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+LGL +R + P + ++ DLP DWR GAVT VK+QG+CGSCW+FS
Sbjct: 108 YLGLKVESSRTRQSPEEFTYKDVV---DLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVA 164
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EG + + G L SLSEQ+L+DCD ++GC+GGLM+ AF +I+ +GG+ +
Sbjct: 165 AVEGINKIVGGNLTSLSEQELIDCDR--------PYNNGCHGGLMDYAFSFIVSSGGLHK 216
Query: 229 EKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAVGINAVW--M 285
E+DYPY + +C K ++ +S + + + + + H PL+V I A
Sbjct: 217 EEDYPYLEVE-STCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDF 275
Query: 286 QTYIGGV-SCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYK 344
Q Y GGV P CG LDHGV VGYGSS K Y I+KNSWG WGE GY
Sbjct: 276 QFYSGGVFDGP--CGTQLDHGVTAVGYGSS-------KGVDYIIVKNSWGPKWGEKGY-- 324
Query: 345 ICMGRN------VCGVDSMVS 359
I M RN +CG++ M S
Sbjct: 325 IRMKRNTGKPAGLCGINKMAS 345
>gi|66377984|gb|AAY45869.1| cathepsin L-like cysteine proteinase [Globodera pallida]
Length = 379
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 116/273 (42%), Positives = 155/273 (56%), Gaps = 26/273 (9%)
Query: 97 GVTKFSDLTPSEFR-----RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTG 151
G +DL SE++ R+ LG N LR A API DLP DWRD G VT
Sbjct: 119 GENHIADLPFSEYKKLNGYRRLLGDN--LRRNASTFLAPI-NIGDLPESVDWRDKGWVTE 175
Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
VK+QG CGSCW+FS+TGALE H TG+L+SLSEQ L+DC + + GCNGG
Sbjct: 176 VKNQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYG-------NMGCNGG 228
Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANL 270
+M++AF+YI GV++E DYPY G C F ++ + A + F + DE+++ +
Sbjct: 229 IMDNAFQYIKDNNGVDKELDYPYKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAV 288
Query: 271 VKHGPLAVGINA--VWMQTYIGGVSCPYICG-KYLDHGVLIVGYGSSGFAPIRFKEKPYW 327
GP +V I+A Q Y GV C + LDHGVL+VGYG+ ++ YW
Sbjct: 289 ATQGPASVAIDAGHRSFQLYTHGVYFEKECSPENLDHGVLVVGYGTDA------QQGDYW 342
Query: 328 IIKNSWGENWGENGYYKICMGR-NVCGVDSMVS 359
I+KNSWG +WGE GY ++ R N CG+ S S
Sbjct: 343 IVKNSWGAHWGEQGYIRMARNRKNNCGIASHAS 375
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 119/302 (39%), Positives = 165/302 (54%), Gaps = 30/302 (9%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLG----- 115
K Y E + RF +FK NL + D G+ KF+DLT EFR +LG
Sbjct: 62 KNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGLNKFADLTNEEFRSVYLGRKKSS 121
Query: 116 ----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
L + + + ++LP DWR +GAV VKDQG CGSCW+FS A+E
Sbjct: 122 SSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAVAKVKDQGQCGSCWAFSTIAAVE 181
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TGEL+SLSEQ+LVDCD S +SGC+GGLM+ A+E+I+ GG++ + D
Sbjct: 182 GINQIVTGELLSLSEQELVDCD--------TSYNSGCDGGLMDYAYEFIINNGGIDTDAD 233
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAVGINA--VWMQTYI 289
YPYT DG ++ K+ + +F + ++++ V H P++V I A Q Y
Sbjct: 234 YPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQPVSVAIEAGGSTFQFYQ 293
Query: 290 GGVSCPYICGKYLDHGVLIVGYGSSGFAPIRFKEKPYWIIKNSWGENWGENGYYKICMGR 349
GV CG LDHGV+ VGYGS K YWI++NSWG +WGE+GY + M R
Sbjct: 294 SGVFTG-KCGADLDHGVVAVGYGSD-------DGKDYWIVRNSWGADWGESGYIR--MER 343
Query: 350 NV 351
N+
Sbjct: 344 NL 345
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.412
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,032,940,503
Number of Sequences: 23463169
Number of extensions: 261856853
Number of successful extensions: 598755
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6604
Number of HSP's successfully gapped in prelim test: 836
Number of HSP's that attempted gapping in prelim test: 569410
Number of HSP's gapped (non-prelim): 8929
length of query: 369
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 225
effective length of database: 8,980,499,031
effective search space: 2020612281975
effective search space used: 2020612281975
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)