BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022276
(300 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
Length = 368
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 217/298 (72%), Positives = 255/298 (85%), Gaps = 9/298 (3%)
Query: 3 RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
R ++S L+ LLS +AS + ++ DD +IRQVVP DG+Q DHLLNAEHHF+ FK+KF
Sbjct: 4 RCLISFLVYALLSFTIASTTSPDELDDPLIRQVVP-DGDQ--DHLLNAEHHFTTFKAKFG 60
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
KTYATQEEHDYRF++FKANLRRA++ Q++DPTAVHGVT FSDLTP EFRRQ+LGL RRLR
Sbjct: 61 KTYATQEEHDYRFKLFKANLRRARKHQMMDPTAVHGVTMFSDLTPREFRRQYLGL-RRLR 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
LPADA +APILPTNDLPTDFDWRDHGAVT VK+QG+CGSCWSFSA GALEGAHFL+TGEL
Sbjct: 120 LPADAHEAPILPTNDLPTDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLATGEL 179
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE G+CDSGCNGGLM +AFEY LKAGG+ERE+DYPYTG D G
Sbjct: 180 VSLSEQQLVDCDHECDPEEYGACDSGCNGGLMTTAFEYTLKAGGLEREEDYPYTGNDRGP 239
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
CKFD++KI A+VSNFSV+S DEDQ+AANLVKHGPLA + ++ + +++ VS P
Sbjct: 240 CKFDRNKIVASVSNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQ----TYMGGVSCP 293
>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 213/263 (80%), Positives = 236/263 (89%), Gaps = 5/263 (1%)
Query: 16 SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
S +AS V+ ND DD +IRQVV SDGE D LLNAEHHF+ FKSKF KTYATQEEHDYRF
Sbjct: 17 SAVASTVSSNDLDDPLIRQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRF 72
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
VFKANLRRAK+ Q++DPTA HG+TKFSDLTP EFRRQFLGL R LRLP DA KAPILPT
Sbjct: 73 GVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKEFRRQFLGLKRWLRLPTDANKAPILPT 132
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
DLPTD+DWRDHGAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 133 TDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDH 192
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDPEE G+CDSGC+GGLMN+AFEY LKAGG+ERE+DYPYTGTDGG+CKFDKSK+ A+VS
Sbjct: 193 ECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVS 252
Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
NFSV+S DEDQ+AANLVKHGPL+
Sbjct: 253 NFSVVSIDEDQIAANLVKHGPLS 275
>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
Length = 367
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 213/263 (80%), Positives = 235/263 (89%), Gaps = 5/263 (1%)
Query: 16 SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
S +AS V+ ND DD +IRQVV SDGE D LLNAEHHF+ FKSKF KTYATQEEHDYRF
Sbjct: 17 SAVASTVSSNDLDDPLIRQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRF 72
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
VFKANLRRAK+ Q++DPTA HG+TKFSDLTP EFRRQFLGL R LRLP DA KAPILPT
Sbjct: 73 GVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKEFRRQFLGLKRWLRLPTDANKAPILPT 132
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
DLPTD+DWRDHGAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 133 TDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDH 192
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDPEE G+CDSGC+GGLMN+AFEY LKAGG+ERE DYPYTGTDGG+CKFDKSK+ A+VS
Sbjct: 193 ECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREADYPYTGTDGGTCKFDKSKVVASVS 252
Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
NFSV+S DEDQ+AANLVKHGPL+
Sbjct: 253 NFSVVSIDEDQIAANLVKHGPLS 275
>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 367
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 213/263 (80%), Positives = 235/263 (89%), Gaps = 5/263 (1%)
Query: 16 SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
S +AS V+ D DD +I QVV SDGE D LLNAEHHF+ FKSKF KTYATQEEHDYRF
Sbjct: 17 SAVASTVSSTDLDDPLIIQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRF 72
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
VFKANLRRAK+ Q++DPTA HGVTKFSDLTP EFRRQFLGL RRLRLP DA KAPILPT
Sbjct: 73 GVFKANLRRAKKHQMIDPTAAHGVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKAPILPT 132
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
DLPTD+DWRDHGAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 133 TDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDH 192
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDPEE G+CDSGC+GGLMN+AFEY LKAGG+ERE+DYPYTGTDGG+CKFDKSK+ A+VS
Sbjct: 193 ECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVS 252
Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
NFSV+S DEDQ+AANLVKHGPL+
Sbjct: 253 NFSVVSIDEDQIAANLVKHGPLS 275
>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
Length = 367
Score = 442 bits (1138), Expect = e-122, Method: Compositional matrix adjust.
Identities = 208/251 (82%), Positives = 228/251 (90%), Gaps = 4/251 (1%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +I QVV SDGE D LLNAEHHF+ FKSKF KTYATQEEHDYRF VFKANLRRAK+
Sbjct: 29 DDPLIIQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK 84
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q++DPTA HGVTKFSDLTP EFRRQFLGL RRLRLP DA KAPILPT DLPTD+DWRDH
Sbjct: 85 HQMIDPTAAHGVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKAPILPTTDLPTDYDWRDH 144
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDHECDPEE G+CDS
Sbjct: 145 GAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDS 204
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GC+GGLMN+AFEY LKAGG+ERE+DYPYTGTDGG+CKFDKSK+ A+VSNFSV+S DEDQ+
Sbjct: 205 GCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQI 264
Query: 267 AANLVKHGPLA 277
AANLVKHGPL+
Sbjct: 265 AANLVKHGPLS 275
>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 430 bits (1105), Expect = e-118, Method: Compositional matrix adjust.
Identities = 206/301 (68%), Positives = 252/301 (83%), Gaps = 11/301 (3%)
Query: 1 MERLILSSLLLL--LLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
MER SL++ L SS+L +A + DD +IRQVVP ED+LL+A+HHF+ FK+
Sbjct: 1 MERSCFLSLIVFAFLSSSILFTATSDELDDPLIRQVVP----DVEDYLLSAQHHFTAFKA 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YATQEEHDYRF+VFKANLRRA++ QL+DP+AVHGVTKFSDLTP EFRRQ+LGL +
Sbjct: 57 KFGKNYATQEEHDYRFKVFKANLRRAQKHQLMDPSAVHGVTKFSDLTPREFRRQYLGL-K 115
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LRLPADA +APILPT+ +P DFDWRDHGAVT VK+QG+CGSCWSFSA GALEGAHFL+T
Sbjct: 116 KLRLPADAHEAPILPTDGIPEDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLAT 175
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GELVSLSEQQLVDCDHECDP E G+CDSGCNGGLM +AFEYILKAGG+ERE+DYPYTG+D
Sbjct: 176 GELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLMTNAFEYILKAGGLEREEDYPYTGSD 235
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
G CKF+++KIAA+V+NFSV+S DEDQ+AANLV++GPLA + ++ + +++ VS
Sbjct: 236 RGPCKFERAKIAASVNNFSVVSVDEDQIAANLVQNGPLAVGINAVFMQ----TYIGGVSC 291
Query: 299 P 299
P
Sbjct: 292 P 292
>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 366
Score = 429 bits (1104), Expect = e-118, Method: Compositional matrix adjust.
Identities = 211/300 (70%), Positives = 249/300 (83%), Gaps = 9/300 (3%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDD-AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSK 59
M L + LLL S+ +A+ ++D+D +IRQVVP + + HLLNAEHHFS FK+K
Sbjct: 1 MANLSILFFGLLLFSAAVATVERIDDEDNLLIRQVVP---DAEDHHLLNAEHHFSAFKTK 57
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
F+KTYATQEEHD+RFR+FK NL RAK Q LDP+AVHGVT+FSDLTPSEFR QFLGL +
Sbjct: 58 FAKTYATQEEHDHRFRIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPSEFRGQFLGL-KP 116
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
LRLP+DAQKAPILPT+DLPTDFDWRDHGAVTGVK+QG+CGSCWSFSA GALEGAHFLSTG
Sbjct: 117 LRLPSDAQKAPILPTSDLPTDFDWRDHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLSTG 176
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
LVSLSEQQLVDCDHECDPEE G+CDSGCNGGLM +AFEY LKAGG+ RE+DYPYTG D
Sbjct: 177 GLVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLKAGGLMREEDYPYTGRDR 236
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
G CKFDKSKIAA+V+NFSV+S DE+Q+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 237 GPCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVGINAVFMQ----TYIGGVSCP 292
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 206/285 (72%), Positives = 244/285 (85%), Gaps = 9/285 (3%)
Query: 16 SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
+ +A+A ++D DD +IRQVVP + + HLLNAEHHFS FK+KF KTYATQEEHD+RF
Sbjct: 16 ATVAAAERIDDEDDLLIRQVVP---DAEDHHLLNAEHHFSAFKTKFGKTYATQEEHDHRF 72
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
R+FK NL RAK Q LDP+AVHGVT+FSDLTP+EFRRQFLGL + LRLP+DAQKAPILPT
Sbjct: 73 RIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPAEFRRQFLGL-KPLRLPSDAQKAPILPT 131
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
NDLPTDFDWR+HGAVTGVK+QG+CGSCWSFSA GALEGAHFLSTGELVSLSEQQLVDCDH
Sbjct: 132 NDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDH 191
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDPEE G+CDSGCNGGLM +AFEY L+AGG+ REKDYPYTG D G CKFDKSK+AA+V+
Sbjct: 192 ECDPEERGACDSGCNGGLMTTAFEYTLQAGGLMREKDYPYTGRDRGPCKFDKSKVAASVA 251
Query: 255 NFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
NFSV+S DE+Q+AANLV++GPLA + ++ + +++ VS P
Sbjct: 252 NFSVVSLDEEQIAANLVQNGPLAVGINAVFMQ----TYIGGVSCP 292
>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
Length = 369
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 204/284 (71%), Positives = 242/284 (85%), Gaps = 14/284 (4%)
Query: 22 VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
V D+D +IRQVV SDGE +D LLNA+HHF+LFKSK+ K+YATQEEHDYR VFKANL
Sbjct: 19 VVRADEDPLIRQVV-SDGE--DDALLNADHHFTLFKSKYGKSYATQEEHDYRLSVFKANL 75
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTN 135
RRAKR QLLDP+AVHGVTKFSDLTP EFRR FLG+ R+L+LPADA A ILPT+
Sbjct: 76 RRAKRHQLLDPSAVHGVTKFSDLTPKEFRRTFLGIRKSSSGKRKLKLPADAHAAEILPTS 135
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
DLP+DFDWRD+GAVTGVKDQG+CGSCWSFS TGALEGA+FL+TGELVSLSEQQLVDCDH
Sbjct: 136 DLPSDFDWRDYGAVTGVKDQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHL 195
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
CDPEE+G+CDSGCNGGLM +A+EY+L++GG+E+EKDYPYTG D G+CKFDKSKIAAAV+N
Sbjct: 196 CDPEEAGACDSGCNGGLMTTAYEYVLQSGGLEKEKDYPYTGKD-GTCKFDKSKIAAAVAN 254
Query: 256 FSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
FSV+S DEDQ+AANLVKHGPL+ + ++ + +++ VS P
Sbjct: 255 FSVVSLDEDQIAANLVKHGPLSVGINAVFMQ----TYIGGVSCP 294
>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 420 bits (1079), Expect = e-115, Method: Compositional matrix adjust.
Identities = 203/277 (73%), Positives = 236/277 (85%), Gaps = 8/277 (2%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
MERL L SLL +L +SA+A +D+D +IRQVV E + HLLNAEHHFSLFKSKF
Sbjct: 1 MERLFLLSLLAFVL---FSSAIAFSDEDPLIRQVV---SETDDSHLLNAEHHFSLFKSKF 54
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA++EEHD+RF+VFKANLRRA+R QLLDP+A HG+TKFSDLTPSEFRR +LGL++
Sbjct: 55 GKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 113
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ +A+KAPILPT+DLP DFDWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 114 KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 173
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPE+ +CD+GC GGLM +AFEY LKAGG++ EKDYPYTG D G
Sbjct: 174 LVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKD-G 232
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
C FDKSKIAAAV+NFSVI DEDQ+AANLVKHGPLA
Sbjct: 233 KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLA 269
>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 205/297 (69%), Positives = 240/297 (80%), Gaps = 15/297 (5%)
Query: 8 SLLLLLLSSVLASAVAVNDDDA-----MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
S LL L+ ++L SA + +IRQVVP D LL+AEH F LFK+KF K
Sbjct: 7 SALLFLIPTLLFSAAVSDISSDESDDLLIRQVVPEG-----DDLLSAEHQFGLFKAKFGK 61
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
TY+T EEHDYRF VF+ANLRRA+R QLLDP+AVHGVT+FSDLTP EFRR +LGL + LRL
Sbjct: 62 TYSTVEEHDYRFSVFEANLRRARRHQLLDPSAVHGVTRFSDLTPDEFRRDYLGL-KPLRL 120
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
PADAQKAPILPTNDLPTDFDWRDHGAVT VKDQG+CGSCWSFSA GALEGAHFL+TG L+
Sbjct: 121 PADAQKAPILPTNDLPTDFDWRDHGAVTPVKDQGSCGSCWSFSAIGALEGAHFLTTGNLI 180
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
S+SEQQLVDCDHECDPEE G+CD GCNGGLM SAFEYILKAGGVERE+ YPY G+D GSC
Sbjct: 181 SMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVEREETYPYIGSDRGSC 240
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
KF+KS+I A+VSNFSV+S DEDQ+AAN+VK+GPLA + ++ + +++ VS P
Sbjct: 241 KFNKSQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQ----TYMKGVSCP 293
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 204/273 (74%), Positives = 236/273 (86%), Gaps = 8/273 (2%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +IRQVVP +GE EDHLLNAEHHFS FKSKF KTYAT+EEHD+RF VFK+N+RRA+
Sbjct: 27 DDILIRQVVP-EGE-VEDHLLNAEHHFSTFKSKFGKTYATKEEHDHRFGVFKSNMRRARL 84
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
LDP+AVHGVTKFSDLTP+EF R+FLGL + LRLPA AQKAPILPTN+LP DFDWRD
Sbjct: 85 HAQLDPSAVHGVTKFSDLTPAEFHRKFLGL-KPLRLPAHAQKAPILPTNNLPKDFDWRDK 143
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH CDPEE GSCDS
Sbjct: 144 GAVTNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDS 203
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMN+AFEY++ +GGV+REKDYPYTG D G+CKFDKSKIAA+VSN+SVIS DE+Q+
Sbjct: 204 GCNGGLMNNAFEYLIGSGGVQREKDYPYTGRD-GTCKFDKSKIAASVSNYSVISLDEEQI 262
Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
AANLVK+GPLA + ++ + +++ VS P
Sbjct: 263 AANLVKNGPLAVAINAVYMQ----TYVGGVSCP 291
>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 417 bits (1072), Expect = e-114, Method: Compositional matrix adjust.
Identities = 202/277 (72%), Positives = 235/277 (84%), Gaps = 8/277 (2%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
MERL L SLL +L +SA+A +D+D +IRQVV E + HLLNAEHHFSLFKSKF
Sbjct: 1 MERLFLLSLLAFVL---FSSAIAFSDEDPLIRQVV---SETDDSHLLNAEHHFSLFKSKF 54
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA++EEHD+RF+VFKAN RRA+R QLLDP+A HG+TKFSDLTPSEFRR +LGL++
Sbjct: 55 GKIYASEEEHDHRFKVFKANRRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 113
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ +A+KAPILPT+DLP DFDWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 114 KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 173
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPE+ +CD+GC GGLM +AFEY LKAGG++ EKDYPYTG D G
Sbjct: 174 LVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKD-G 232
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
C FDKSKIAAAV+NFSVI DEDQ+AANLVKHGPLA
Sbjct: 233 KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLA 269
>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
Length = 377
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 201/279 (72%), Positives = 238/279 (85%), Gaps = 9/279 (3%)
Query: 25 NDDDAMIRQVVPSDGE---QSEDHLLNAEHH-FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
+DDD +IRQVVP G+ E++LL A+HH FS+FK +F K+YA+QEEHDYRF+VFKAN
Sbjct: 30 SDDDIIIRQVVPELGDVEGSEEENLLTADHHHFSIFKRRFGKSYASQEEHDYRFKVFKAN 89
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
LRRA+R Q LDP+A HGVT+FSDLTP+EFR +LGL R L+LP DAQKAPILPTNDLP D
Sbjct: 90 LRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGL-RPLKLPHDAQKAPILPTNDLPED 148
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
FDWRDHGAVT VK+QG+CGSCWSFS TGALEGA+FL+TG LVSLSEQQLV+CDHECDPEE
Sbjct: 149 FDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEE 208
Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
GSCDSGCNGGLMN+AFEY LKAGG+ +E+DYPYTGTD GSCKFDK+KIAA+VSNFSVIS
Sbjct: 209 MGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRGSCKFDKTKIAASVSNFSVIS 268
Query: 261 SDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 269 LDEDQIAANLVKNGPLAVAINAVFMQ----TYVGGVSCP 303
>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 377
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 204/301 (67%), Positives = 250/301 (83%), Gaps = 15/301 (4%)
Query: 10 LLLLLSSVLASAVAV------NDDDAMIRQVVP----SDGEQSEDHLLNAEHHFSLFKSK 59
L+++LS + ASA+ +D D +IRQVV ++G +D LL A+HHFS+FK K
Sbjct: 7 LIVVLSLLAASAIGSEVISGESDGDFIIRQVVDDGGVNEGSNGDDLLLGADHHFSVFKQK 66
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-NR 118
F K+YA++EEHD+RFRVFKANL+RA+R Q LDP+A HGVT+FSDLTPSEFRR FLGL +R
Sbjct: 67 FGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRSR 126
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
RL LPADA KAPILPT+ LPTDFDWRD GAV+ VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 127 RLGLPADANKAPILPTDGLPTDFDWRDKGAVSEVKNQGSCGSCWSFSATGALEGANFLAT 186
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK+GG+ +E+DYPYTGTD
Sbjct: 187 GKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQDYPYTGTD 246
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
G+CKFDKSKIAA+V+NFSV+S DE+Q+AANLVK+GPLA + ++ + +++ VS
Sbjct: 247 RGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQ----TYIKGVSC 302
Query: 299 P 299
P
Sbjct: 303 P 303
>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 366
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 197/281 (70%), Positives = 233/281 (82%), Gaps = 5/281 (1%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R L L LL ++ L A + DD +IRQVV G+ LLNA+HHF++FK +F K
Sbjct: 4 RFSLLFLCTLLATTYLVFAAEDDGDDILIRQVVGDGGD-----LLNADHHFTVFKRRFGK 58
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
YA+ EEHDYR VFKAN+RRAK+ Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 59 VYASDEEHDYRLSVFKANMRRAKQHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKF 118
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSCWSFS TGALEGA+FL+TG+LV
Sbjct: 119 PADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLV 178
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D C
Sbjct: 179 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQVC 238
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA + ++
Sbjct: 239 RFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAV 279
>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
Length = 367
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 200/277 (72%), Positives = 233/277 (84%), Gaps = 10/277 (3%)
Query: 27 DDAMIRQVVP----SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR 82
DD +IRQVVP E+ EDHLLNAEHHF+ FK+KF K YAT+EEHD RF VFK+NLR
Sbjct: 23 DDILIRQVVPDAVGEAAEKEEDHLLNAEHHFASFKAKFGKKYATKEEHDRRFGVFKSNLR 82
Query: 83 RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFD 142
RA+ LDP+AVHGVTKFSDLTP+EFRRQFLG + LRLPA+AQKAPILPT DLP DFD
Sbjct: 83 RARLHAKLDPSAVHGVTKFSDLTPAEFRRQFLGF-KPLRLPANAQKAPILPTKDLPKDFD 141
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WRD GAVT VKDQGACGSCWSFS TGALEGAH+L+TGELVSLSEQQLVDCDH CDPEE G
Sbjct: 142 WRDKGAVTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYG 201
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
+CDSGCNGGLMN+AFEYIL++GGV++EKDYPYTG D G+CKFDK+K+AA VSN+SV+S D
Sbjct: 202 ACDSGCNGGLMNNAFEYILQSGGVQKEKDYPYTGRD-GTCKFDKTKVAATVSNYSVVSLD 260
Query: 263 EDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
EDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 261 EDQIAANLVKNGPLAVGINAVFMQ----TYIGGVSCP 293
>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
Length = 377
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 201/279 (72%), Positives = 237/279 (84%), Gaps = 9/279 (3%)
Query: 25 NDDDAMIRQVVPSDGE---QSEDHLLNAEHH-FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
+DDD +IRQVVP G+ E++LL A+HH FS+FK +F K+YA+QEEHDYRF+VFKAN
Sbjct: 30 SDDDIIIRQVVPELGDVEGGEEENLLTADHHHFSIFKRRFGKSYASQEEHDYRFKVFKAN 89
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
LRRA+R Q LDP+A HGVT+FSDLTP+EFR +LGL R L+LP DAQKAPILPTNDLP D
Sbjct: 90 LRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGL-RPLKLPHDAQKAPILPTNDLPED 148
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
FDWRDHGAVT VK+QG+CGSCWSFS TGALEGA+FL+TG LVSLSEQQLV+CDHECDPEE
Sbjct: 149 FDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEE 208
Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
GSCDSGCNGGLMN+AFEY LKAGG+ +E+DYPYTGTD GSCKFDK+KIAA+VSNFSVIS
Sbjct: 209 MGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRGSCKFDKTKIAASVSNFSVIS 268
Query: 261 SDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DEDQ+AANLVK GPLA + ++ + +++ VS P
Sbjct: 269 LDEDQIAANLVKIGPLAVAINAVFMQ----TYVGGVSCP 303
>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 208/273 (76%), Positives = 235/273 (86%), Gaps = 8/273 (2%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +IRQVV + EDHLLNAEHHF+ FKSKF K YATQEEHDYRF VFKANL RAK+
Sbjct: 29 DDPLIRQVV----SEGEDHLLNAEHHFTTFKSKFGKNYATQEEHDYRFSVFKANLLRAKK 84
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q++DPTA HGVTKFSDLTP EFRRQ LGL RRLRLP DA KAPILPT DLPTDFDWRDH
Sbjct: 85 HQIMDPTAAHGVTKFSDLTPKEFRRQLLGLKRRLRLPTDANKAPILPTGDLPTDFDWRDH 144
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG+CGSCWSFSATGALEGAH+L+TGELVSLSEQQLVDCDHECDPEE G+CDS
Sbjct: 145 GAVTSVKDQGSCGSCWSFSATGALEGAHYLATGELVSLSEQQLVDCDHECDPEEYGACDS 204
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GC+GGLMN+AFEY LKAGG+EREKDYPYTG D G+CKF+KSK+AA+VSNFSV+S DEDQ+
Sbjct: 205 GCSGGLMNNAFEYALKAGGLEREKDYPYTGNDRGACKFEKSKVAASVSNFSVVSLDEDQI 264
Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
AANLVKHGPL+ + ++ + +++ VS P
Sbjct: 265 AANLVKHGPLSVAINAVFMQ----TYIGGVSCP 293
>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 203/298 (68%), Positives = 242/298 (81%), Gaps = 9/298 (3%)
Query: 3 RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
R L L LL ++ L A +D DD +IRQVV DG+ LLNA+HHF++FK +F
Sbjct: 4 RFSLLFLCTLLATTSLVFAAEDDDGDDVLIRQVV-GDGDGD---LLNADHHFTVFKRRFG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K YA+ EEHDYR VFKAN+RRAKR Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 60 KAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLK 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSCWSFS TGALEGA+FL+TG+L
Sbjct: 120 FPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKL 179
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D
Sbjct: 180 VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQV 239
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
C+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 240 CRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 293
>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
Length = 365
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 197/257 (76%), Positives = 222/257 (86%), Gaps = 7/257 (2%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +IRQVV + D LL+AEHHF+ FK++F KTYAT EEHDYRF +FKANLRRAKR
Sbjct: 31 DDLLIRQVV-----SNSDDLLSAEHHFAAFKARFRKTYATAEEHDYRFSIFKANLRRAKR 85
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
QLLDP+AVHGVT+FSDLTP+EFR+ +LGL + LR P D Q+APILPTNDLPTDFDWRDH
Sbjct: 86 NQLLDPSAVHGVTRFSDLTPAEFRQNYLGL-KPLRFPIDTQQAPILPTNDLPTDFDWRDH 144
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG CGSCWSFS TGALEGAHFL+TG LVSLSEQQLVDCDHECDPEE G+CD
Sbjct: 145 GAVTAVKDQGECGSCWSFSTTGALEGAHFLATGNLVSLSEQQLVDCDHECDPEEYGACDR 204
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMN+AFEYILKAGGV R +DYPYTGTD G CKFDK+KIAA+VSNFS +S DEDQ+
Sbjct: 205 GCNGGLMNTAFEYILKAGGVVRGEDYPYTGTD-GHCKFDKTKIAASVSNFSTVSIDEDQI 263
Query: 267 AANLVKHGPLAGNVASI 283
AANLVK+GPLA + +I
Sbjct: 264 AANLVKNGPLAVGINAI 280
>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 203/298 (68%), Positives = 242/298 (81%), Gaps = 9/298 (3%)
Query: 3 RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
R L L LL ++ L A +D DD +IRQVV DG+ LLNA+HHF++FK +F
Sbjct: 4 RFSLLFLCTLLATTSLVFAAEDDDGDDILIRQVV-GDGDGD---LLNADHHFTVFKRRFG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K YA+ EEHDYR VFKAN+RRAKR Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 60 KAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLK 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSCWSFS TGALEGA+FL+TG+L
Sbjct: 120 FPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKL 179
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D
Sbjct: 180 VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQV 239
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
C+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 240 CRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 293
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 412 bits (1060), Expect = e-113, Method: Compositional matrix adjust.
Identities = 201/270 (74%), Positives = 234/270 (86%), Gaps = 8/270 (2%)
Query: 30 MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
+IRQVVP +GE EDHLLNAEHHFS FK+KF KTYAT+EEHD+RF VFK+N+RRA+
Sbjct: 30 LIRQVVP-EGE-VEDHLLNAEHHFSTFKAKFGKTYATKEEHDHRFGVFKSNMRRARLHAQ 87
Query: 90 LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
LDP+AVHGVTKFSDLTP+EF R+FLGL + LRLPA AQKAPILPTN+LP DFDWRD GAV
Sbjct: 88 LDPSAVHGVTKFSDLTPAEFHRKFLGL-KPLRLPAHAQKAPILPTNNLPKDFDWRDKGAV 146
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T VKDQG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH CDPEE GSCDSGCN
Sbjct: 147 TNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCN 206
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMN+AFEY++ +GGV+REKDYPYTG D G+CKFDKSKIAA+VSN+SVIS DE+Q+AAN
Sbjct: 207 GGLMNNAFEYLIGSGGVQREKDYPYTGRD-GTCKFDKSKIAASVSNYSVISLDEEQIAAN 265
Query: 270 LVKHGPLAGNVASIELPHISFSFLFTVSSP 299
LVK+GPLA + ++ + +++ VS P
Sbjct: 266 LVKNGPLAVAINAVYMQ----TYVGGVSCP 291
>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
Length = 368
Score = 412 bits (1059), Expect = e-113, Method: Compositional matrix adjust.
Identities = 195/282 (69%), Positives = 236/282 (83%), Gaps = 12/282 (4%)
Query: 10 LLLLLSSVLASA--VAVN------DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
L+ +LS +L ++ +AVN DDD +IRQVV + + H+LNAEHHF+LFK +F
Sbjct: 7 LVFVLSILLTTSFLLAVNGEIKGGDDDILIRQVVGDE----DHHMLNAEHHFTLFKKRFG 62
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
KTYA+ EEH YRF VFKANLRRA R Q LDP+AVHGVT+FSD+TP EF ++FLG+NRRLR
Sbjct: 63 KTYASDEEHHYRFSVFKANLRRAMRHQKLDPSAVHGVTQFSDMTPDEFSQKFLGVNRRLR 122
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
P+DA KAPILPT DLP+DFDWR+HGAVT VK+QG+CGSCWSFS TGALEGA+FL+TG+L
Sbjct: 123 FPSDANKAPILPTEDLPSDFDWREHGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKL 182
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE SCDSGC+GGLMNSAFEY LKAGG+ RE+DYPYTGTD +
Sbjct: 183 VSLSEQQLVDCDHECDPEEKDSCDSGCSGGLMNSAFEYTLKAGGLMREEDYPYTGTDKAT 242
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
CKFD +K+AA V+NFSV+S DE+Q+AANLVK+GPLA + ++
Sbjct: 243 CKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAV 284
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 198/271 (73%), Positives = 234/271 (86%), Gaps = 7/271 (2%)
Query: 30 MIRQVVPSDGE-QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ 88
+IRQVVP GE + ED+LLNAEHHF+ FK+KF+KTYAT+EEHD+RF VFK+NLRRA+
Sbjct: 32 LIRQVVPDVGEAEEEDNLLNAEHHFASFKAKFAKTYATKEEHDHRFGVFKSNLRRARLHA 91
Query: 89 LLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGA 148
LDP+AVHGVTKFSDLTP+EFRRQFLGL + LR PA AQKAPILPT DLP DFDWRD GA
Sbjct: 92 KLDPSAVHGVTKFSDLTPAEFRRQFLGL-KPLRFPAHAQKAPILPTKDLPKDFDWRDKGA 150
Query: 149 VTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGC 208
VT VKDQGACGSCWSFS TGALEGAH+L+TGELVSLSEQQLVDCDH CDPEE G+CDSGC
Sbjct: 151 VTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGC 210
Query: 209 NGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAA 268
NGGLMN+AFEYIL++GGV++EKDYPYTG D G+CKFDK+K+AA VSN+SV+S DE+Q+AA
Sbjct: 211 NGGLMNNAFEYILQSGGVQKEKDYPYTGRD-GTCKFDKTKVAATVSNYSVVSLDEEQIAA 269
Query: 269 NLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
NLVK+GPLA + ++ + +++ VS P
Sbjct: 270 NLVKNGPLAVAINAVFMQ----TYVGGVSCP 296
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 196/251 (78%), Positives = 223/251 (88%), Gaps = 5/251 (1%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +IRQVV + EDHLLNAEHHF+ FKSKFSK+YAT+EEHDYRF VFKANL +AK
Sbjct: 21 DDFLIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKL 76
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q LDPTA HG+TKFSDLT SEFRRQFLGLN+RLRLPA AQKAPILPT +LP DFDWR+
Sbjct: 77 HQKLDPTAEHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTTNLPEDFDWREK 136
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLSEQQLVDCDH CDPEE+GSCDS
Sbjct: 137 GAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDS 196
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFDKSK+ A+VSNFSV+S DE+Q+
Sbjct: 197 GCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD-GSCKFDKSKVVASVSNFSVVSLDEEQI 255
Query: 267 AANLVKHGPLA 277
AANLVK+GPLA
Sbjct: 256 AANLVKNGPLA 266
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 201/294 (68%), Positives = 240/294 (81%), Gaps = 15/294 (5%)
Query: 12 LLLSSVLASAVAV------NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+ + VL +AVA N DD +IRQVV + EDHLLNAEHHF+ FKSKFSK+Y+
Sbjct: 5 FIFAIVLFAAVATSSTDNTNTDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYS 60
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T+EEHDYRF VFK+NL +AK Q LDPTA HG+TKFSDLT SEFRRQFLGL +RLRLPA
Sbjct: 61 TKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAH 120
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
AQKAPILPT +LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLS
Sbjct: 121 AQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLS 180
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDH CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFD
Sbjct: 181 EQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD-GSCKFD 239
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
KSK+ A+VSNFSV+S DE+Q+AANLVK+GPLA + + + +++ VS P
Sbjct: 240 KSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQ----TYMSGVSCP 289
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 201/294 (68%), Positives = 240/294 (81%), Gaps = 15/294 (5%)
Query: 12 LLLSSVLASAVAV------NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+ + VL +AVA N DD +IRQVV + EDHLLNAEHHF+ FKSKFSK+Y+
Sbjct: 5 FIFAIVLFAAVATSSTDDTNTDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYS 60
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T+EEHDYRF VFK+NL +AK Q LDPTA HG+TKFSDLT SEFRRQFLGL +RLRLPA
Sbjct: 61 TKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAH 120
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
AQKAPILPT +LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLS
Sbjct: 121 AQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLS 180
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDH CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFD
Sbjct: 181 EQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD-GSCKFD 239
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
KSK+ A+VSNFSV+S DE+Q+AANLVK+GPLA + + + +++ VS P
Sbjct: 240 KSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQ----TYMSGVSCP 289
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 203/281 (72%), Positives = 236/281 (83%), Gaps = 9/281 (3%)
Query: 21 AVAVNDDDAMIRQVVPSDGEQ-SEDHLLNAE-HHFSLFKSKFSKTYATQEEHDYRFRVFK 78
A +N DD +IR+VV DG+ S +LL+AE HHFSLFKSKF K+Y +QEEHDYRF VFK
Sbjct: 21 AETLNGDDPLIREVV--DGQDASSSNLLSAEQHHFSLFKSKFKKSYGSQEEHDYRFSVFK 78
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
ANLRRA R Q LDPTA HGVT+FSDLTP+EFR+Q LGL RRLRLP DA +APILPT+DLP
Sbjct: 79 ANLRRAARHQELDPTASHGVTQFSDLTPAEFRKQVLGL-RRLRLPKDANEAPILPTSDLP 137
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DFDWRD GAV +K+QG+CGSCWSFSATGALEGAHFL+TGELVSLSEQQLVDCDHECDP
Sbjct: 138 EDFDWRDKGAVGPIKNQGSCGSCWSFSATGALEGAHFLATGELVSLSEQQLVDCDHECDP 197
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
EE GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD +CKFDK+K+AA V+NFSV
Sbjct: 198 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRDACKFDKNKVAARVANFSV 257
Query: 259 ISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
+S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 258 VSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 294
>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 195/254 (76%), Positives = 222/254 (87%), Gaps = 5/254 (1%)
Query: 24 VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
N DD +IRQVV + +EDH+LNAEHHF+ FKSKFSK YAT+EEHDYRF VFK+NL +
Sbjct: 26 TNSDDLLIRQVV----DTAEDHILNAEHHFTSFKSKFSKNYATKEEHDYRFGVFKSNLIK 81
Query: 84 AKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDW 143
AK Q LDP+A HG+TKFSDLT SEFRRQFLGLN+RLRLPA AQKAPILPTN+LP DFDW
Sbjct: 82 AKLHQKLDPSAQHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTNNLPEDFDW 141
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
R+ GAVT VKDQG+CGSCW+FS TGALEGA++L+TG+L SLSEQQLVDCDH CDPEE GS
Sbjct: 142 REKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGS 201
Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDE 263
CDSGCNGGLMN+AFEYIL++GGV EKDY YTG D GSCKFDKSK+ A+VSNFSV+S DE
Sbjct: 202 CDSGCNGGLMNNAFEYILQSGGVVSEKDYAYTGRD-GSCKFDKSKVVASVSNFSVVSLDE 260
Query: 264 DQMAANLVKHGPLA 277
DQ+AANLVK+GPLA
Sbjct: 261 DQIAANLVKNGPLA 274
>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
Length = 364
Score = 410 bits (1053), Expect = e-112, Method: Compositional matrix adjust.
Identities = 202/270 (74%), Positives = 232/270 (85%), Gaps = 8/270 (2%)
Query: 30 MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
+IRQVVP +GE EDHLLNAEHHFS FK+KF KTYAT+EEHD+RF VFK+NLRRA+
Sbjct: 29 LIRQVVP-EGE-VEDHLLNAEHHFSNFKAKFGKTYATKEEHDHRFGVFKSNLRRARLHAQ 86
Query: 90 LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
LDP+AVHGVTKFSDLT +EF+RQFLGL + L LPA+AQKAPILPTN+LP DFDWRD GAV
Sbjct: 87 LDPSAVHGVTKFSDLTAAEFQRQFLGL-KPLGLPANAQKAPILPTNNLPKDFDWRDKGAV 145
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T VKDQGACGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH CDPEE G+CDSGCN
Sbjct: 146 TNVKDQGACGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCN 205
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLMN+AFEYIL AGGV+RE+DYPY G D SCKFDKSKIAA+V+N+SVIS DEDQ+AAN
Sbjct: 206 GGLMNNAFEYILGAGGVQREEDYPYAGRD-SSCKFDKSKIAASVANYSVISLDEDQIAAN 264
Query: 270 LVKHGPLAGNVASIELPHISFSFLFTVSSP 299
LVK+GPLA + ++ + +++ VS P
Sbjct: 265 LVKNGPLAVGINAVYMQ----TYIGGVSCP 290
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 410 bits (1053), Expect = e-112, Method: Compositional matrix adjust.
Identities = 197/280 (70%), Positives = 228/280 (81%), Gaps = 5/280 (1%)
Query: 20 SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
SA N DD++IRQVV E S + L +HHFSLFK KF K+Y +QEEHDYRF VFK+
Sbjct: 20 SAETFNGDDSLIRQVVEGQDESSSNLLTAEQHHFSLFKRKFKKSYLSQEEHDYRFSVFKS 79
Query: 80 NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
NLRRA R Q LDPTA HGVT+FSDLT +EFR+Q LGL R+LRLP DA APILPTNDLP
Sbjct: 80 NLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGL-RKLRLPKDANTAPILPTNDLPE 138
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWR+ GAV VK+QG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDPE
Sbjct: 139 DFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPE 198
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D G+CKFDK+K+AA V+NFSV+
Sbjct: 199 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRGACKFDKNKVAAGVANFSVV 258
Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 259 SLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 294
>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 365
Score = 410 bits (1053), Expect = e-112, Method: Compositional matrix adjust.
Identities = 200/277 (72%), Positives = 233/277 (84%), Gaps = 6/277 (2%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+RL L SL L +SA+A D+D +IRQVV S+ E + HLLNAEHHFSLFKSKF
Sbjct: 1 MDRLFLLSLPRFAL---FSSAIAFPDEDPLIRQVV-SETETDDSHLLNAEHHFSLFKSKF 56
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA++EEHD+RF+VFKANLRRA+ QLLDP+A HG+TKFSDLTPSEFRR +LGL++
Sbjct: 57 GKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 115
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ +A+KAPILPT+DLP D+DWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 116 KPKVNAEKAPILPTSDLPADYDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 175
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECD E+ SCD+GC GGLM +AFEY LKAGG++ EKDYPYTG D G
Sbjct: 176 LVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKD-G 234
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
C FDKSKIAAAV+NFSVI DEDQ+AANLVKHGPLA
Sbjct: 235 KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLA 271
>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
Length = 363
Score = 409 bits (1052), Expect = e-112, Method: Compositional matrix adjust.
Identities = 195/280 (69%), Positives = 233/280 (83%), Gaps = 6/280 (2%)
Query: 21 AVAVNDDDAMIRQVVPSDGEQSE-DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
AV + D +IRQVV +D + E D LL+ EHHF LFK+KF +TY T+EEH+YR VFK+
Sbjct: 17 AVTADSSDPLIRQVVQNDETEIESDPLLDPEHHFKLFKNKFGRTYDTEEEHEYRLTVFKS 76
Query: 80 NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
NLRRAKR Q+LDPTA HGVTKFSDLTPSEFR+++LGL +L+LPADA KAPILPT++LP
Sbjct: 77 NLRRAKRHQVLDPTAKHGVTKFSDLTPSEFRKKYLGLKSKLKLPADANKAPILPTSNLPQ 136
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWRD GAVT VK+QG+CGSCWSFS TGALEG+HFL TGELVSLSEQQLVDCDHECDP
Sbjct: 137 DFDWRDKGAVTPVKNQGSCGSCWSFSTTGALEGSHFLQTGELVSLSEQQLVDCDHECDPA 196
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E SCDSGCNGGLMN+AFEYILKAGG+++E DYPYTG D G+CKFDKSKIAA+V+NFSV+
Sbjct: 197 EYNSCDSGCNGGLMNNAFEYILKAGGLQKEADYPYTGRD-GTCKFDKSKIAASVANFSVV 255
Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
S+DEDQ+AANLV +GPLA + + + +++ VS P
Sbjct: 256 STDEDQIAANLVTNGPLAIGINAAWMQ----TYIGQVSCP 291
>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust.
Identities = 199/277 (71%), Positives = 232/277 (83%), Gaps = 8/277 (2%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
MERL L SLL +L +SA+A +D+D +IRQVV E + HLLNAEHHFSLFKSKF
Sbjct: 1 MERLFLLSLLAFVL---FSSAIAFSDEDPLIRQVV---SETDDSHLLNAEHHFSLFKSKF 54
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA++EEHD+RF+VFKANLRRA+ QLLDP+A HG+TKFSDLTPSEFRR +LGL++
Sbjct: 55 GKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 113
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ +A+KAPILPT+DLP DFDWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 114 KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 173
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPE+ +CD+GC GG +AFEY LKAGG++ EKDYPYTG D G
Sbjct: 174 LVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYTLKAGGLQLEKDYPYTGKD-G 232
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
C FDKSKI AAV+NFSVI DEDQ+AANLVKHGPLA
Sbjct: 233 KCHFDKSKICAAVTNFSVIGLDEDQIAANLVKHGPLA 269
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 196/262 (74%), Positives = 228/262 (87%), Gaps = 7/262 (2%)
Query: 18 LASAVA--VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
+A+AV N+DD +IRQVV + EDHLLNAEHHF+ FKSKFSK+YAT+EEHDYRF
Sbjct: 15 VATAVTDDTNNDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFG 70
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
VFK+NL +AK Q DPTA HG+TKFSDLT SEFRRQFLGL +RLRLPA AQKAPILPT
Sbjct: 71 VFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPTT 130
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
+LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLSEQQLVDCDH
Sbjct: 131 NLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHV 190
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFDKSK+ A+VSN
Sbjct: 191 CDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGRD-GSCKFDKSKVVASVSN 249
Query: 256 FSVISSDEDQMAANLVKHGPLA 277
FSV++ DEDQ+AANLVK+GPLA
Sbjct: 250 FSVVTLDEDQIAANLVKNGPLA 271
>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
Length = 373
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 202/297 (68%), Positives = 245/297 (82%), Gaps = 8/297 (2%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSED-HLLNAEHHFSLFKSKFSK 62
++SS+L + S+V A + + +D +IRQV E S + +LL AEHHFSLFK KF K
Sbjct: 10 FVISSILFV--SAVTAETLTTDGEDPLIRQVTDGQDESSANPNLLGAEHHFSLFKKKFKK 67
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
TYA+QEEHDYRF++FK+NLRRA+R Q LDPTA HGVT+FSDLT SEFRRQFLGL RRLRL
Sbjct: 68 TYASQEEHDYRFKIFKSNLRRAERHQKLDPTATHGVTQFSDLTHSEFRRQFLGL-RRLRL 126
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P DA +AP+LPTNDLP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGA++L+TG+LV
Sbjct: 127 PKDANEAPMLPTNDLPADFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGKLV 186
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDP E G+CDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD G+C
Sbjct: 187 SLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGAC 246
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 247 QFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 299
>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
Length = 355
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 197/278 (70%), Positives = 235/278 (84%), Gaps = 7/278 (2%)
Query: 22 VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
VA +D+D +IRQVV S+ E + HLLNAEHHFSLFKSKF K YA++EEHD+RF+VFKANL
Sbjct: 19 VAFSDEDPLIRQVV-SETETDDSHLLNAEHHFSLFKSKFGKIYASEEEHDHRFKVFKANL 77
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
RRA+R QLLDP+A HG+TKFSDLTPSEFRR +LGL++ + +A+KAPILPT+DLP D+
Sbjct: 78 RRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP-KPKLNAEKAPILPTSDLPADY 136
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQQLVDCDHECDPE+
Sbjct: 137 DWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQ 196
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCD+GC+GGLM +AFEY LKAGG++REKDYPYTG G C FDKSKIAAAV+NFSVI
Sbjct: 197 DSCDAGCSGGLMTTAFEYTLKAGGLQREKDYPYTGKX-GKCHFDKSKIAAAVTNFSVIGL 255
Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DEDQ+AANLVKHGPLA + + + +++ VS P
Sbjct: 256 DEDQIAANLVKHGPLAVGINAAWMQ----TYVGGVSCP 289
>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
Length = 366
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 194/273 (71%), Positives = 230/273 (84%), Gaps = 8/273 (2%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD +IRQVV DG+ LLNA+HHF++FK +F K YA+ EEHDYR VFKAN+RRAKR
Sbjct: 27 DDILIRQVV-GDGDGD---LLNADHHFAVFKRRFGKAYASDEEHDYRLSVFKANMRRAKR 82
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+ PADA+ APILPT++LP+DFDWRD
Sbjct: 83 HQQLDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDR 142
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VK+QG CGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDHECDPEE+GSCDS
Sbjct: 143 GAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDS 202
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMNSAFEY LKAGG+ RE+DYPYTG D C+FDK+KIAA V+NFSV+S DEDQ+
Sbjct: 203 GCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQI 262
Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
AANLVK+GPLA + ++ + +++ VS P
Sbjct: 263 AANLVKNGPLAVAINAVFMQ----TYIGGVSCP 291
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 194/258 (75%), Positives = 217/258 (84%), Gaps = 1/258 (0%)
Query: 20 SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
SA N DD++IRQVV E S + L +HH SLFK KF K+Y +QEEHDYRF VFK+
Sbjct: 26 SAETFNGDDSLIRQVVEGQDESSPNLLTAEQHHLSLFKRKFKKSYLSQEEHDYRFSVFKS 85
Query: 80 NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
NLRRA R Q LDPTA HGVT+FSDLT +EFR+Q LGL R+LRLP DA KAPILPTNDLP
Sbjct: 86 NLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGL-RKLRLPKDANKAPILPTNDLPE 144
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWR+ GAV VK+QG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDPE
Sbjct: 145 DFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPE 204
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D G+CKFDK K+AA V+NFSV+
Sbjct: 205 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRGACKFDKDKVAAGVANFSVV 264
Query: 260 SSDEDQMAANLVKHGPLA 277
S DEDQ+AANLVK+GPLA
Sbjct: 265 SLDEDQIAANLVKNGPLA 282
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 196/280 (70%), Positives = 227/280 (81%), Gaps = 5/280 (1%)
Query: 20 SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
SA N DD++IRQVV E S + L +HHFSLFK KF K+Y +QEEHDYRF VFK+
Sbjct: 20 SAETFNGDDSLIRQVVEGQDESSSNLLTAEQHHFSLFKRKFKKSYLSQEEHDYRFSVFKS 79
Query: 80 NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
NLRRA R Q LDPTA HGVT+FSDLT +EFR+Q LGL R+LRLP DA APILPTNDLP
Sbjct: 80 NLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGL-RKLRLPKDANTAPILPTNDLPE 138
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWR+ GAV VK+QG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDPE
Sbjct: 139 DFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPE 198
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D G+CKFDK+K+AA V+NFS +
Sbjct: 199 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRGACKFDKNKVAAGVANFSAV 258
Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 259 SLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 294
>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 363
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 196/277 (70%), Positives = 229/277 (82%), Gaps = 6/277 (2%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M L L++ S A++ DD+ +I QVV G + L AEHHF FK +F
Sbjct: 1 MNNPTLIIFFLVIFSVFFAASADGGDDEPLIMQVVEGSGVR-----LGAEHHFLDFKRRF 55
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YA+QEEH+YRF VFKAN+RRA+R Q LDP+A HGVT+FSDLT SEFR + LGL R +
Sbjct: 56 GKAYASQEEHNYRFEVFKANMRRARRHQSLDPSAAHGVTRFSDLTASEFRNKVLGL-RGV 114
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
RLP++A KAPILPT++LP+DFDWRDHGAVT VK+QG+CGSCWSFS TGALEGAHFLSTGE
Sbjct: 115 RLPSNANKAPILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEGAHFLSTGE 174
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEYILK+GGV RE+DYPY+GTD G
Sbjct: 175 LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYILKSGGVMREEDYPYSGTDRG 234
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+CKFDK+KIAA+V+NFSVIS DEDQ+AANLVK+GPLA
Sbjct: 235 NCKFDKAKIAASVANFSVISLDEDQIAANLVKNGPLA 271
>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
Length = 373
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 208/297 (70%), Positives = 245/297 (82%), Gaps = 8/297 (2%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
LI ++LL + L S + S IRQVVP E++++HLLNAEHHFSLFKSK+ KT
Sbjct: 9 LIAATLLAVSLGSAVISGEVNYGFVNPIRQVVP---EENDEHLLNAEHHFSLFKSKYEKT 65
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR-LRL 122
YATQEEHD+RFRVFKANLRRA+R QLLDP+AVHGVT+FSDLTP EFRR+FLGL RR RL
Sbjct: 66 YATQEEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRL 125
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P D Q APILPT+DLPT+FDWR+ GAVT VK+QG CGSCWSFSA GALEGAHFL+T ELV
Sbjct: 126 PTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELV 185
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDP ++ SCDSGC+GGLMN+AFEY LKAGG+ +E+DYPYTG D +C
Sbjct: 186 SLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDNTAC 245
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
KFDKSKIAA+VSNFSV+SSDEDQ+AANLVKHGPLA + ++ + +++ VS P
Sbjct: 246 KFDKSKIAASVSNFSVVSSDEDQIAANLVKHGPLAIAINAMWMQ----TYIGGVSCP 298
>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
Length = 367
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 200/299 (66%), Positives = 241/299 (80%), Gaps = 9/299 (3%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+RL L SLL+ + S +SA A +D+D +IRQV S+ + + +HLLNAEHHFSLFKSKF
Sbjct: 1 MDRLFLLSLLVFTIFS--SSAFAFSDEDPLIRQVT-SESDDNNNHLLNAEHHFSLFKSKF 57
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K YATQEEHD+R +VFKANLRRA+R QLLDPTA HG+TKFSDLTPSEFRR +LGL++
Sbjct: 58 GKIYATQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITKFSDLTPSEFRRTYLGLHKP- 116
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ KAPILPT+DLP DFDWR+ GAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 117 KPKLSTTKAPILPTSDLPEDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 176
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECD E+ CD+GC GGLM +AFEY LKAGG++REKDYPYTG + G
Sbjct: 177 LVSLSEQQLVDCDHECDAEQKSECDAGCGGGLMTTAFEYTLKAGGLQREKDYPYTGRN-G 235
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
C FDKSKIAA+V+N+SV+ DEDQ+AANLVKHGPLA + S + +++ VS P
Sbjct: 236 QCHFDKSKIAASVTNYSVVGLDEDQIAANLVKHGPLAVGINSAWMQ----TYIGGVSCP 290
>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
Length = 366
Score = 406 bits (1043), Expect = e-111, Method: Compositional matrix adjust.
Identities = 197/297 (66%), Positives = 238/297 (80%), Gaps = 9/297 (3%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R L L LL ++ L A + DD +IRQVV G+ LLNA+HHF++FK +F K
Sbjct: 4 RFSLLFLCTLLATTYLVFAAEDDGDDILIRQVVGDGGD-----LLNADHHFTVFKRRFGK 58
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
YA+ EEHDYR FKAN+RRAK+ Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 59 VYASDEEHDYRLSEFKANMRRAKQHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKF 118
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSC SFS TGALEGA+FL+TG+LV
Sbjct: 119 PADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCCSFSTTGALEGANFLATGKLV 178
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+D+PYTG D C
Sbjct: 179 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDHPYTGNDLQVC 238
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 239 RFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 291
>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
Length = 363
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 195/278 (70%), Positives = 231/278 (83%), Gaps = 9/278 (3%)
Query: 22 VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
+A +DDD +IRQVV E ++H+LNAEHHFSLFKSK+ K YA+QEEHD+R +VFKANL
Sbjct: 19 IAFSDDDPLIRQVV---SETDDNHMLNAEHHFSLFKSKYGKIYASQEEHDHRLKVFKANL 75
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
RRA+R QLLDPTA HG+T+FSDLTPSEFRR +LGL++ R +AQKAPILPT+DLP DF
Sbjct: 76 RRARRHQLLDPTAEHGITQFSDLTPSEFRRTYLGLHKP-RPKLNAQKAPILPTSDLPEDF 134
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+ GAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQQLVDCDHECD EE
Sbjct: 135 DWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEK 194
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
CD+GCNGGLM +AFEY LKAGG++REKDYPYTG D G C FDKSKIAA+V+NFSVI
Sbjct: 195 SECDAGCNGGLMTTAFEYTLKAGGLQREKDYPYTGRD-GKCHFDKSKIAASVANFSVIGL 253
Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DEDQ+AANLVKHGPLA + + + +++ VS P
Sbjct: 254 DEDQIAANLVKHGPLAVGINAAWMQ----TYMRGVSCP 287
>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
Length = 371
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/307 (65%), Positives = 246/307 (80%), Gaps = 19/307 (6%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+R L SLL+ L+ A+ V D+D +IRQVV SDGE +D LLNA+HHF+LFKSK+
Sbjct: 1 MDRFSLPSLLIHALT---AACVVRADEDPLIRQVV-SDGE--DDALLNADHHFTLFKSKY 54
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K+YATQEEHDYR VFKANLRRAKR Q+LDP+AVHGVTKFSDLTP EFRR +LG+ +
Sbjct: 55 GKSYATQEEHDYRLSVFKANLRRAKRHQMLDPSAVHGVTKFSDLTPKEFRRTYLGIRKSS 114
Query: 121 RL--------PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
PADA A ILPT+DLP DF+WRD+GAVTGVKDQG CGSCWSFS TG LEG
Sbjct: 115 SSKQKLKLKLPADAHAAEILPTSDLPFDFEWRDYGAVTGVKDQGLCGSCWSFSTTGTLEG 174
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+FL+TGEL+SL+EQ+LVDCDH CDP+++G+CD+GCNGGLM +A+EY+L++GG+E+EKDY
Sbjct: 175 TNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGGLMTTAYEYVLQSGGLEKEKDY 234
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
PYTG D G+CKFDKSKIAAAV+NFSV+S DEDQ+AANLVKHGPL+ + SI + ++
Sbjct: 235 PYTGRD-GTCKFDKSKIAAAVANFSVVSLDEDQIAANLVKHGPLSVGINSIFMQ----TY 289
Query: 293 LFTVSSP 299
+ VS P
Sbjct: 290 IGGVSCP 296
>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 368
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 199/298 (66%), Positives = 238/298 (79%), Gaps = 9/298 (3%)
Query: 3 RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
R L L LL ++ L A +D DD +IRQVV DG+ LLNA+HHF++FK +F
Sbjct: 4 RFSLLFLCTLLATTSLVFAAEDDDGDDILIRQVV-GDGDGD---LLNADHHFTVFKRRFG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K YA+ EEHDYR VFKAN+RRAKR Q LDP AVHGVT+FSD TP+EFRR+FLGLNRRL+
Sbjct: 60 KAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDSTPTEFRRKFLGLNRRLK 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
PADA+ APILPT++LP+DFDWRD GAVT VK+QG CG CWSFS TGALEGA+FL+TG+L
Sbjct: 120 FPADAKTAPILPTDELPSDFDWRDRGAVTPVKNQGTCGLCWSFSTTGALEGANFLATGKL 179
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE+GSCD GCNGGLMNSAFEY LKAGG+ RE+DYPYTG D
Sbjct: 180 VSLSEQQLVDCDHECDPEEAGSCDFGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQV 239
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
C+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 240 CRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 293
>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 371
Score = 402 bits (1033), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/292 (68%), Positives = 234/292 (80%), Gaps = 14/292 (4%)
Query: 1 MERLILSSLLL-LLLSSVLASAV-------AVNDD-DAMIRQVVPSDGEQSEDHLLNAEH 51
MER L +LLS+ +A V AV+D+ D +IRQVV ++D L AE
Sbjct: 1 MERFNAIPLFFAILLSATVAYGVSSDQINSAVSDEEDILIRQVVSG----ADDRPLTAEQ 56
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
HF FK KF KTY T EEHDYRFRVFKANLR+AKR Q LDP AVHGVT+FSDLT SEFR
Sbjct: 57 HFQDFKLKFGKTYTTDEEHDYRFRVFKANLRKAKRHQKLDPDAVHGVTRFSDLTESEFRE 116
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
F+GLNR LRLPADA +APILPT++L +DFDWRD GAVT VKDQG+CGSCWSFSA GALE
Sbjct: 117 NFVGLNR-LRLPADAHQAPILPTDNLASDFDWRDQGAVTPVKDQGSCGSCWSFSAVGALE 175
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
GA+FLSTG+L+SLSEQQLVDCDHECDPEE+G+CD+GCNGGLM SAFEYI+KAGG+ERE+D
Sbjct: 176 GANFLSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLMTSAFEYIVKAGGLEREED 235
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
YPYTGTD GSCKF KIAA+ +NFSVIS+D DQ+AANLVK+GPLA + ++
Sbjct: 236 YPYTGTDRGSCKFQNGKIAASAANFSVISNDADQIAANLVKNGPLAIGINAV 287
>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 365
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 196/272 (72%), Positives = 229/272 (84%), Gaps = 9/272 (3%)
Query: 9 LLLLLLSSVLASAVAVND---DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
LLL+ S V A+ A +D ++ +I QVV DG D L AEHHF FK +F K Y
Sbjct: 8 LLLVAFSLVFAAVSASSDGGNEEPLIMQVV--DGG---DVRLGAEHHFLEFKRRFGKAYD 62
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+++EHDYR++VFKAN+RRA+R Q LDP+A HGVT+FSDLTPSEFR + LGL R +RLP D
Sbjct: 63 SEDEHDYRYKVFKANMRRARRHQSLDPSAAHGVTRFSDLTPSEFRNKVLGL-RGVRLPLD 121
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A KAPILPT++LP+DFDWRDHGAVT VK+QG+CGSCWSFS TGALEGAHFLSTGELVSLS
Sbjct: 122 ANKAPILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLS 181
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYILK+GGV RE+DYPY+G D G+CKFD
Sbjct: 182 EQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYILKSGGVMREEDYPYSGADSGTCKFD 241
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K+KIAA+V+NFSV+S DEDQ+AANLVK+GPLA
Sbjct: 242 KTKIAASVANFSVVSLDEDQIAANLVKNGPLA 273
>gi|357473427|ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula]
gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 191/277 (68%), Positives = 229/277 (82%), Gaps = 6/277 (2%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+ L ++L + SV A + +D +IRQVV +G + L AEHHF+LFK KF
Sbjct: 1 MDHRTLLLFVVLFIFSVSAFSTPDEGEDPIIRQVVDEEGVR-----LGAEHHFNLFKHKF 55
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y++++EHDYRF++FK+NL RAKR QL+DP+AVHGVT+FSDLTP EFR+ LGL R +
Sbjct: 56 GKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKSVLGL-RGV 114
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
LP DA APILPT++LP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGAHFLSTG+
Sbjct: 115 GLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGAHFLSTGK 174
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQQLVDCDHECDPE+ GSCD+GCNGGLMNSAFEYILK+GGV RE+DYPY+GTD G
Sbjct: 175 LVSLSEQQLVDCDHECDPEQPGSCDAGCNGGLMNSAFEYILKSGGVMREEDYPYSGTDRG 234
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
SCKFDK KIAA+V+NFSV+S DEDQ+AANLVK+GPLA
Sbjct: 235 SCKFDKKKIAASVANFSVVSLDEDQIAANLVKNGPLA 271
>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
Length = 365
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 193/281 (68%), Positives = 233/281 (82%), Gaps = 11/281 (3%)
Query: 19 ASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFK 78
AS + + +D +I+Q+V DG DH L+A+HHF LFK +F K+YATQE+HDYRF VFK
Sbjct: 22 ASGKSSDGEDLVIQQIV--DG----DHPLSADHHFRLFKRRFGKSYATQEDHDYRFSVFK 75
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
NLRRA+ Q LDP+AVHGVT+FSDLTP+EFRR LGL +RLR PADA KAPILPT DLP
Sbjct: 76 TNLRRARHHQRLDPSAVHGVTQFSDLTPAEFRRNHLGL-KRLRFPADANKAPILPTEDLP 134
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DFDWRDHGAV VK+QG+CGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDHECDP
Sbjct: 135 ADFDWRDHGAVASVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 194
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
EE GSCDSGCNGGLMNSA EY LKAGG+ RE+DYPY+GTD G+CKFD++KIAA+V+NFSV
Sbjct: 195 EEPGSCDSGCNGGLMNSALEYTLKAGGLMREEDYPYSGTDRGTCKFDETKIAASVANFSV 254
Query: 259 ISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
+S DE+Q+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 255 VSLDENQIAANLVKNGPLAVAINAVFMQ----TYVGGVSCP 291
>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
Length = 371
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 192/257 (74%), Positives = 221/257 (85%), Gaps = 6/257 (2%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +I QVV SDG D LLNAE+ F+ FK+KF KTYAT EEHD+RF VFKANLRRAKR
Sbjct: 35 EDLLIHQVV-SDG----DDLLNAEYQFAEFKTKFGKTYATAEEHDHRFNVFKANLRRAKR 89
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
QLLDP+A HGVT+FSDLTP EFR+ +LGL +RL+LPADAQKAPILPT DLPTDFDWRDH
Sbjct: 90 HQLLDPSAEHGVTQFSDLTPREFRQNYLGL-KRLQLPADAQKAPILPTKDLPTDFDWRDH 148
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VKDQG CGSCWSFS GALEGAHFL+TG LVSLS QQL+DCD ECDPEE +CD
Sbjct: 149 GAVTAVKDQGYCGSCWSFSTIGALEGAHFLATGNLVSLSTQQLLDCDTECDPEEYDACDD 208
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMN+AFEYILKAGGV +E+DYPYTGTD G C+F+K+KIAA+V+NFSV+S DEDQ+
Sbjct: 209 GCNGGLMNNAFEYILKAGGVAQEEDYPYTGTDRGLCRFNKTKIAASVANFSVVSLDEDQI 268
Query: 267 AANLVKHGPLAGNVASI 283
AANLVK+GPLA + ++
Sbjct: 269 AANLVKNGPLAVGINAV 285
>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 197/279 (70%), Positives = 228/279 (81%), Gaps = 6/279 (2%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
M+RL L S+ +L V S+ VND DD +IRQVV +E +L +E HFSLFKS
Sbjct: 1 MDRLKLCFSVFVLFFLIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKS 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSEFRKKHLGVRA 116
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LP DA KAPILPT +LP DFDWRD GAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD 236
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275
>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 361
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 190/269 (70%), Positives = 227/269 (84%), Gaps = 5/269 (1%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
LL L ++ +SA+A +DDD +IRQVV + ++H+LNAEHHFSLFK+KF K YA+QE
Sbjct: 4 LLSFLAFALFSSAIAFSDDDPLIRQVVSGN---DDNHMLNAEHHFSLFKAKFGKIYASQE 60
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
EHD+R +VFKANL RAKR QLLDP+A HG+T+FSDLTPSEFRR +LGLN+ R +A+K
Sbjct: 61 EHDHRLKVFKANLHRAKRHQLLDPSAEHGITQFSDLTPSEFRRTYLGLNKP-RPNLNAEK 119
Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
APILPT DLP+DFDWR+ GAVT VK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQQ
Sbjct: 120 APILPTKDLPSDFDWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQ 179
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
LVDCDHECDP E CD+GCNGGLM +AFEY LKAGG++ EKDYPYTG + G C FDKS+
Sbjct: 180 LVDCDHECDPVEKNDCDAGCNGGLMTTAFEYTLKAGGLQLEKDYPYTGRN-GKCHFDKSR 238
Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
IAA+VSNFSV+ DEDQ+AANL+KHGPLA
Sbjct: 239 IAASVSNFSVVGLDEDQIAANLLKHGPLA 267
>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
Length = 368
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 190/257 (73%), Positives = 215/257 (83%), Gaps = 6/257 (2%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
D+ MIRQV E D LNAE HF FK++F KTYAT EEHDYRF VFKANLRRAKR
Sbjct: 31 DNLMIRQV-----ESHVDDFLNAERHFEKFKARFQKTYATPEEHDYRFNVFKANLRRAKR 85
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
QLLDP+AVHGVT+FSDLTP+EFRR +LGLN LR PADAQ+APILPT++LPTDFDWR++
Sbjct: 86 HQLLDPSAVHGVTQFSDLTPAEFRRDYLGLNP-LRFPADAQQAPILPTDNLPTDFDWREN 144
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VK+QG CGSCWSFS GALEGAHFL+TG L SLSEQQLVDCD ECDPEE +CD
Sbjct: 145 GAVTPVKNQGNCGSCWSFSTIGALEGAHFLATGNLESLSEQQLVDCDRECDPEEYDACDD 204
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMN+AFEYILK GGVEREKDYPYTG D CKF++SKI A+VSNFSV+S DEDQ+
Sbjct: 205 GCNGGLMNNAFEYILKTGGVEREKDYPYTGRDRSPCKFNESKIVASVSNFSVVSIDEDQI 264
Query: 267 AANLVKHGPLAGNVASI 283
AANLVK+GPLA + ++
Sbjct: 265 AANLVKNGPLAVGINAV 281
>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
Length = 373
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/280 (69%), Positives = 228/280 (81%), Gaps = 9/280 (3%)
Query: 3 RLILSSLLLLLL-----SSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFK 57
+L S +LL+L S ++A + + DD +IRQVV DG +E +L++E HFSLFK
Sbjct: 5 KLSFSVFVLLILFVSVSSGIVAETSSSDGDDLVIRQVV--DG--AEPKVLSSEDHFSLFK 60
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
KF K YA+ EEHDYR VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 61 RKFGKVYASSEEHDYRLSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSEFRKKHLGVR 120
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+LP DA KAPILPT +LP DFDWRD GAVT VK+QG+CGSCWSFSATGALEGA+FL+
Sbjct: 121 GGFKLPKDANKAPILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLA 180
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TG+LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ RE+DYPYTG
Sbjct: 181 TGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGK 240
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DG +CK DKSKI A+VSNFSVIS DEDQ+AANLVK+GPLA
Sbjct: 241 DGPTCKLDKSKIVASVSNFSVISIDEDQIAANLVKNGPLA 280
>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 368
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/278 (70%), Positives = 231/278 (83%), Gaps = 4/278 (1%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSK 59
M+RL LS S+ LL V AS+ DD +I+QVV DG +E ++L++E HFSLFK K
Sbjct: 1 MDRLKLSLSVFALLFIVVSASSDGNEGDDLVIKQVV--DG-GAEPNVLSSEDHFSLFKKK 57
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
F K YA++EEHDYRF VFK+NLRRA+R Q LDP+A HGVT+FSDLT SEF+R+ LG+
Sbjct: 58 FGKVYASREEHDYRFSVFKSNLRRARRHQKLDPSARHGVTQFSDLTRSEFKRKHLGVKGG 117
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+LP DA KAPILPT +LP +FDWR+ GAVT VK+QG+CGSCWSFSATGALEGA+FL+TG
Sbjct: 118 FKLPKDANKAPILPTENLPEEFDWRERGAVTPVKNQGSCGSCWSFSATGALEGANFLATG 177
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ RE+DYPYTG DG
Sbjct: 178 KLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGKDG 237
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 238 ATCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275
>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 373
Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust.
Identities = 203/297 (68%), Positives = 242/297 (81%), Gaps = 8/297 (2%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
LI ++LL L S + S + IRQVVP E++++ LLNAEHHF+LFKSK+ KT
Sbjct: 9 LIAATLLAGSLGSTVISGEVTDGFVNPIRQVVP---EENDEQLLNAEHHFTLFKSKYEKT 65
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR-LRL 122
YATQ EHD+RFRVFKANLRRA+R QLLDP+AVHGVT+FSDLTP EFRR+FLGL RR RL
Sbjct: 66 YATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRL 125
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P D Q APILPT+DLPT+FDWR+ GAVT VK+QG CGSCWSFSA GALEGAHFL+T ELV
Sbjct: 126 PTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELV 185
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDP ++ SCDSGC+GGLMN+AFEY LKAGG+ +E+DYPYTG D +C
Sbjct: 186 SLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDHTAC 245
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
KFDKSKI A+VSNFSV+SSDEDQ+AANLV+HGPLA + ++ + +++ VS P
Sbjct: 246 KFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQ----TYIGGVSCP 298
>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
Length = 360
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 190/272 (69%), Positives = 224/272 (82%), Gaps = 12/272 (4%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D MI QVV +G L AEHHF FK +F K YAT+EEH YRF VFK+N+ RA+R
Sbjct: 27 DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EFR LGL R + LP+DA APILPT++LP DFDWR+HG
Sbjct: 80 QLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWREHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH+CDPEE+GSCDSG
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCDSG 198
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLMNSAFEYIL GGV RE+DYPY+GT+GG+CKFDK+KIAA+V+NFSV+S DEDQ+A
Sbjct: 199 CNGGLMNSAFEYILNNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIA 258
Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
ANLVK+GPLA + ++ + +++ VS P
Sbjct: 259 ANLVKNGPLAVAINAVYMQ----TYVGGVSCP 286
>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
Precursor
gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
Length = 368
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 196/279 (70%), Positives = 227/279 (81%), Gaps = 6/279 (2%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
M+RL L S+ +L V S+ VND DD +IRQVV +E +L +E HFSLFK
Sbjct: 1 MDRLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDHECDPEE+ SCDSGCNGGLMNSAFEY LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD 236
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275
>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
Length = 364
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/275 (70%), Positives = 227/275 (82%), Gaps = 9/275 (3%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
+DD+ +IRQVV E ++HLLNAEHHFS FK+KFSKTYAT+EEHDYRF VFK+NL RA
Sbjct: 25 DDDNILIRQVV----EDGDEHLLNAEHHFSAFKTKFSKTYATKEEHDYRFGVFKSNLLRA 80
Query: 85 KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWR 144
K Q LDP+A+HGVTKFSDLTPSEFR QFLGL + L LP+DA APILPT++LP DFDWR
Sbjct: 81 KSHQELDPSAIHGVTKFSDLTPSEFRSQFLGL-KPLSLPSDAHNAPILPTDNLPKDFDWR 139
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
DHGAVT VK+QG GSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDP+ + +C
Sbjct: 140 DHGAVTNVKNQGTGGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPDLNDAC 199
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDED 264
DSGCNGGLM +AF Y KAGG+ RE+DY YTG D G CKFDKSKIAA+VSNFSV+S DED
Sbjct: 200 DSGCNGGLMTTAFGYTKKAGGLVREEDYLYTGRDRGPCKFDKSKIAASVSNFSVVSLDED 259
Query: 265 QMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
Q+AANLVK+GPL+ + ++ + +++ VS P
Sbjct: 260 QIAANLVKNGPLSVGINAVYMQ----TYIGGVSCP 290
>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
Length = 373
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 191/297 (64%), Positives = 239/297 (80%), Gaps = 9/297 (3%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG-EQSEDHLLNAEHHFSLFKSKFSK 62
+I S ++ ++ + SA + D +I QV +DG E +E LL AEHH+SLFK +F K
Sbjct: 11 VIFSFFIVGVICTETFSAEGF-EVDPLIEQV--TDGHEGAEPQLLTAEHHYSLFKKRFKK 67
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
+Y +Q+EHDYRF++F+ NLRRA R Q LDP+A HGVT+FSDLTP EFR+ +LGL RRLRL
Sbjct: 68 SYGSQKEHDYRFKIFQVNLRRAARHQNLDPSATHGVTQFSDLTPGEFRKAYLGL-RRLRL 126
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P DA +APILPT++LP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGA+FL+TG+LV
Sbjct: 127 PKDATEAPILPTDNLPQDFDWREKGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKLV 186
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD G+C
Sbjct: 187 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTC 246
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
KFD +K+AA V+NFSV+S DEDQ+AANL K+GPLA + ++ + +++ VS P
Sbjct: 247 KFDNTKVAAKVANFSVVSLDEDQIAANLFKNGPLAVAINAVFMQ----TYIGGVSCP 299
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 191/272 (70%), Positives = 228/272 (83%), Gaps = 11/272 (4%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D +IRQV +DG+ H+LNAEHHF+ FK+KF K+YATQEEHDYRF VF+ANLRRAK
Sbjct: 24 DPLIRQV--TDGDH---HMLNAEHHFTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKLH 78
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
LDP+A HGVTKFSDLTP EF+RQ+LGL + LRLP+ A KAPILPT+DLP +FDWRD G
Sbjct: 79 AKLDPSAEHGVTKFSDLTPEEFKRQYLGL-KPLRLPSTANKAPILPTSDLPENFDWRDKG 137
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CGSCW+FS TGALEGAH+LSTGELVSLSEQQLVDCDH CDPEE G+CD+G
Sbjct: 138 AVTPVKNQGSCGSCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAG 197
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLMN+AF+YIL+AGGV+ EKDYPY+G D +CKFDKSK+AA V+NFSV+S DEDQ+A
Sbjct: 198 CNGGLMNNAFDYILQAGGVQTEKDYPYSGRD-ETCKFDKSKVAATVANFSVVSLDEDQIA 256
Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
ANLVKHGPLA + +I + +++ VS P
Sbjct: 257 ANLVKHGPLAVGINAIFMQ----TYIGGVSCP 284
>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
Length = 368
Score = 393 bits (1009), Expect = e-107, Method: Compositional matrix adjust.
Identities = 195/279 (69%), Positives = 227/279 (81%), Gaps = 6/279 (2%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
M+RL L S+ +L V S+ VND DD +IRQVV +E +L +E HFSLFK
Sbjct: 1 MDRLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDHECDPEE+ SCDSGCNGGLMNSAFE+ LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEHTLKTGGLMKEEDYPYTGKD 236
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275
>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 392 bits (1007), Expect = e-107, Method: Compositional matrix adjust.
Identities = 188/275 (68%), Positives = 223/275 (81%), Gaps = 8/275 (2%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R++ S LL V S D+D +IRQVV +++E +L++E HF+LFK KF K
Sbjct: 5 RVLFSVSLLF----VFVSVSICGDEDLLIRQVV----DEAEPKVLSSEDHFTLFKKKFGK 56
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
Y + EEH YRF VFKANLRRA R Q +DP+A HGVT+FSDLT SEFRR+ LG+ +L
Sbjct: 57 DYGSIEEHYYRFSVFKANLRRAMRHQKMDPSARHGVTQFSDLTGSEFRRKHLGVTGGFKL 116
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P DA +APILPT++LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LV
Sbjct: 117 PKDANQAPILPTHNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLV 176
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ RE+DYPYTGTDGGSC
Sbjct: 177 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGTDGGSC 236
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K D+SKI A+VSNFSV+S +EDQ+AANLVK+GPLA
Sbjct: 237 KLDRSKIVASVSNFSVVSINEDQIAANLVKNGPLA 271
>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 387
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 189/299 (63%), Positives = 237/299 (79%), Gaps = 9/299 (3%)
Query: 5 ILSSLLLLLLSS--VLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
+++++ L SS +++ +D D +IRQVV +DG+ + H L AEHHFSLFK +F K
Sbjct: 10 VITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNH-HALGAEHHFSLFKRRFGK 68
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-NRRLR 121
+YAT+EEHD RF++FKAN+RRA+R Q DP+A+HGVT+FSDLTP EFR+ FLGL RLR
Sbjct: 69 SYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVTQFSDLTPFEFRKAFLGLRGHRLR 128
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
LP D APILPT +LP DFDWR HG VT VK+QG+CGSCWSFS TGALEGA+FL+TGEL
Sbjct: 129 LPVDTNAAPILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGANFLATGEL 188
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDCDHECDPEE +CDSGCNGGLMNSAFEY LKAGG+ +E+DYPY G D +
Sbjct: 189 VSLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYPYAGIDRNT 248
Query: 242 CKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
C FDKSKIAA+++NFSV++S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 249 CNFDKSKIAASIANFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQ----TYIGGVSCP 303
>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
Length = 361
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 186/272 (68%), Positives = 218/272 (80%), Gaps = 4/272 (1%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L L + L V S D+D +IRQVV +++E +L++E HF+LFK KF K Y
Sbjct: 5 LRVLFSVSLIFVFVSVSVCGDEDVLIRQVV----DETEPKVLSSEDHFTLFKKKFGKVYG 60
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+ LG+ +LP D
Sbjct: 61 SIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKGGFKLPKD 120
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LVSLS
Sbjct: 121 ANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLS 180
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK GG+ REKDYPYTGTDGGSCK D
Sbjct: 181 EQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLD 240
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+SKI A+VSNFSV+S +EDQ+AANL+K+GPLA
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLA 272
>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
Length = 362
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 197/263 (74%), Positives = 229/263 (87%), Gaps = 6/263 (2%)
Query: 16 SVLASAVAV-NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
SV+A+A N+DD +IRQV + +D LLNAEHHF+ FKSKFSK+YAT+EEHDYRF
Sbjct: 13 SVVATATKDDNNDDFLIRQVT----DHEDDQLLNAEHHFTTFKSKFSKSYATKEEHDYRF 68
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
VFK+NL++AK Q LDP+A HGVTKFSDLT SEFRRQFLGL +RLRLPA AQKAPILPT
Sbjct: 69 GVFKSNLKKAKLHQKLDPSAEHGVTKFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPT 128
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
N+LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGA++L+TG+LVSLSEQQLVDCDH
Sbjct: 129 NNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLVSLSEQQLVDCDH 188
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
CDP+E SCDSGCNGGLMN+AFEY+L++GGV RE+DY YTG D GSCKFDKSKIAA+VS
Sbjct: 189 VCDPDEYNSCDSGCNGGLMNNAFEYLLQSGGVVREQDYSYTGRD-GSCKFDKSKIAASVS 247
Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
NFSV+S DEDQ+AANLVK+GPLA
Sbjct: 248 NFSVVSVDEDQIAANLVKNGPLA 270
>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
Length = 374
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 202/302 (66%), Positives = 242/302 (80%), Gaps = 12/302 (3%)
Query: 5 ILSSLLLLLLSSVL-----ASAVAVND-DDAMIRQVVP-SDGEQSEDHLLNAEHHFSLFK 57
+LS +LLL SS L AS V+ ++ DD +IRQVV +D ++D LLNAEHHFS FK
Sbjct: 3 LLSRFVLLLFSSSLVFAATASTVSSDESDDLLIRQVVAGADDHDNDDLLLNAEHHFSSFK 62
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
+F K Y + +EHD RF VFKANLRRAKR Q+LDP+AVHGVT+F DLTP+EFRR +LGL
Sbjct: 63 KRFGKAYTSCDEHDRRFGVFKANLRRAKRNQILDPSAVHGVTQFFDLTPAEFRRTYLGL- 121
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+RLRLPAD +APILPTNDLP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+
Sbjct: 122 KRLRLPADTHEAPILPTNDLPADFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLA 181
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TG+LVSLSEQQLVDCDH CD E+ SCDSGCNGGLM SAFEY LKAGG+ERE+DYPYTGT
Sbjct: 182 TGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLMTSAFEYTLKAGGLEREEDYPYTGT 241
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVS 297
D CKFDK+KIA + SNFSV+S DE+Q+AANLV +GPLA + ++ + +++ VS
Sbjct: 242 DHSKCKFDKTKIAVSASNFSVVSLDENQIAANLVTNGPLAIGINAMFMQ----TYIGGVS 297
Query: 298 SP 299
P
Sbjct: 298 CP 299
>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
Length = 335
Score = 389 bits (999), Expect = e-106, Method: Compositional matrix adjust.
Identities = 192/272 (70%), Positives = 229/272 (84%), Gaps = 11/272 (4%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D +IRQVV + +EDH+LNAEHHFS FKSKFSKTYAT+EEHDYRF VFK+N+RRAK
Sbjct: 1 DLLIRQVV----DDNEDHVLNAEHHFSTFKSKFSKTYATKEEHDYRFGVFKSNVRRAKLH 56
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
LDP+AVHGVTKFSDLTPSEFRRQFLGL + LRLP AQKAPILPT+DLP DFDWRD G
Sbjct: 57 AKLDPSAVHGVTKFSDLTPSEFRRQFLGL-KPLRLPEHAQKAPILPTHDLPEDFDWRDKG 115
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CGSCW+FS TGALEG+HFL+TGELVSLS+QQLVDCDH CDPE+ G+CDSG
Sbjct: 116 AVTHVKNQGSCGSCWAFSTTGALEGSHFLATGELVSLSDQQLVDCDHVCDPEQYGACDSG 175
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLMN+AFEYIL++GGV+RE+DYPYTG D G D++ AA+VSNFSV+S DEDQ++
Sbjct: 176 CNGGLMNNAFEYILESGGVQREEDYPYTGRDRGPA-IDEAN-AASVSNFSVVSLDEDQIS 233
Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
ANLVK+GPLA + ++ + +++ VS P
Sbjct: 234 ANLVKNGPLAIGINAVFMQ----TYIGGVSCP 261
>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
Length = 359
Score = 389 bits (999), Expect = e-106, Method: Compositional matrix adjust.
Identities = 189/273 (69%), Positives = 224/273 (82%), Gaps = 13/273 (4%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D MI QVV +G L AEHHF FK +F K YAT+EEH YRF VFK+N+ RA+R
Sbjct: 27 DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EF+ LGL R + LP+DA APILPT++LP DFDWR+HG
Sbjct: 80 QLLDPSAVHGVTQFSDLTPMEFQHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWREHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE-CDPEESGSCDS 206
AVT VK+QG+CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH+ CDPEE+GSCDS
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQQCDPEEAGSCDS 198
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLMNSAFEYIL GGV RE+DYPY+GT+GG+CKFDK+KIAA+V+NFSV+S DEDQ+
Sbjct: 199 GCNGGLMNSAFEYILNNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQI 258
Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
AANLVK+GPLA + ++ + +++ VS P
Sbjct: 259 AANLVKNGPLAVAINAVYMQ----TYVGGVSCP 287
>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/272 (68%), Positives = 217/272 (79%), Gaps = 4/272 (1%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L L + L V S D+D +IRQVV +++E +L++E HF+LFK KF K Y
Sbjct: 5 LRVLFSVSLIFVFVSVSVCGDEDVLIRQVV----DETEPKVLSSEDHFTLFKKKFGKVYG 60
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+ LG+ +LP D
Sbjct: 61 SIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKGGFKLPKD 120
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LVSLS
Sbjct: 121 ANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLS 180
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDHECDPEE GSCDSGCNG LMNSAFEY LK GG+ REKDYPYTGTDGGSCK D
Sbjct: 181 EQQLVDCDHECDPEEEGSCDSGCNGRLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLD 240
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+SKI A+VSNFSV+S +EDQ+AANL+K+GPLA
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLA 272
>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
Length = 361
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 188/273 (68%), Positives = 222/273 (81%), Gaps = 13/273 (4%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D MI QVV +G L AEHHF FK +F K YAT+EEH YRF VFK+N+ RA+R
Sbjct: 27 DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EFR LGL R + LP+DA APILPT++LP DFDWR+HG
Sbjct: 80 QLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWREHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE-CDPEESGSCDS 206
AVT VK+QG+CGSCWSFSATGALEGAHFLSTG+LVSLSEQQLVDCDHE CDPEE+GSCDS
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHEQCDPEEAGSCDS 198
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GC GGLMNSAFEYIL GGV RE+DYPY+GT GG+CKFD++KIAA+V+NFSV+S DEDQ+
Sbjct: 199 GCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQI 258
Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
AANLVK+GPLA + ++ + +++ VS P
Sbjct: 259 AANLVKNGPLAVAINAVYMQ----TYVGGVSCP 287
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/257 (70%), Positives = 211/257 (82%), Gaps = 4/257 (1%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +IRQVV D +Q LL+AE HFS F S++ K+YA + EH YRF VFK+NLRRA+R
Sbjct: 23 EDPVIRQVVSDDQQQ----LLSAEAHFSSFLSRYGKSYADEAEHAYRFSVFKSNLRRARR 78
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
Q LDPTAVHGVT+F+DLTPSEFRR +LGL RR R APILPTN+LP DFDWRDH
Sbjct: 79 HQRLDPTAVHGVTRFADLTPSEFRRTYLGLRRRPRTAGSTHDAPILPTNELPADFDWRDH 138
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT VK+QG+CGSCWSFSA GALEGA++LSTG LVSLSEQQLVDCDHECD E SCD
Sbjct: 139 GAVTPVKNQGSCGSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQ 198
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM +AFEYILK+GG+ERE DYPYTGTD G+CKF+K+KI+A SNFSV+S DEDQ+
Sbjct: 199 GCNGGLMTTAFEYILKSGGLEREADYPYTGTDRGTCKFNKAKISAVASNFSVVSIDEDQI 258
Query: 267 AANLVKHGPLAGNVASI 283
AANLVKHGPLA + ++
Sbjct: 259 AANLVKHGPLAVGINAV 275
>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
Length = 360
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 184/272 (67%), Positives = 221/272 (81%), Gaps = 12/272 (4%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D +IRQVV +G L AEHHF FK +F K Y ++EEH YRF VFK+N+ RA+R
Sbjct: 27 DPLIRQVVDGEG-------LGAEHHFLEFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EFR LGL R + LP+DA APIL T++LP DFDWR+HG
Sbjct: 80 QLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPSDADSAPILRTDNLPKDFDWREHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CG+CWSFSATGALEGAHFLSTG+LVSLSEQQLVDCDHECDPEE+GSCDSG
Sbjct: 139 AVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSG 198
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
C GGLMNSAFEYIL GGV RE+DYPY+GT GG+CKFD++KIAA+V+NFSV+S DEDQ+A
Sbjct: 199 CKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIA 258
Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
ANLVK+GPLA + ++ + +++ VS P
Sbjct: 259 ANLVKNGPLAVAINAVYMQ----TYVGGVSCP 286
>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
[Cucumis sativus]
Length = 381
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 182/299 (60%), Positives = 230/299 (76%), Gaps = 15/299 (5%)
Query: 5 ILSSLLLLLLSS--VLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
+++++ L SS +++ +D D +IRQVV +DG+ + H L AEHHFSLFK +F K
Sbjct: 10 VITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNH-HALGAEHHFSLFKRRFGK 68
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-NRRLR 121
+YAT+EEHD RF++FKAN+RRA+R Q DP+A+HGVT+FSDLTP EFR+ FLGL RLR
Sbjct: 69 SYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVTQFSDLTPFEFRKAFLGLRGHRLR 128
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
LP D APILPT +LP DFDWR HG VT VK+QG+CGSCWSFS TGALEGA+FL
Sbjct: 129 LPVDTNAAPILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGANFL----- 183
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
LSEQQLVDCDHECDPEE +CDSGCNGGLMNSAFEY LKAGG+ +E+DYPY G D +
Sbjct: 184 -XLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYPYAGIDRNT 242
Query: 242 CKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
C FDKSKIAA++++FSV++S DEDQ+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 243 CNFDKSKIAASIASFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQ----TYIGGVSCP 297
>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
Length = 313
Score = 366 bits (939), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 169/224 (75%), Positives = 192/224 (85%)
Query: 54 SLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
+LFK KF K Y + EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+
Sbjct: 1 ALFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKH 60
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
LG+ +LP DA +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGA
Sbjct: 61 LGVKGGFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGA 120
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
HFL+TG+LVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK GG+ REKDYP
Sbjct: 121 HFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYP 180
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YTGTDGGSCK D+SKI A+VSNFSV+S +EDQ+AANL+K+GPLA
Sbjct: 181 YTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLA 224
>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
Length = 358
Score = 364 bits (934), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 188/272 (69%), Positives = 222/272 (81%), Gaps = 12/272 (4%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D MI QVV +G L AEHHF FK +F K YAT+EEH YRF VFK+N+ RA+R
Sbjct: 27 DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
QLLDP+AVHGVT+FSDLTP EF+ LGL R + LP+DA APILPT++LP DFDWR HG
Sbjct: 80 QLLDPSAVHGVTQFSDLTPMEFQHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWRGHG 138
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AVT VK+QG+CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH+CDPEE+GSC SG
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCGSG 198
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLMNSAFEYIL GGV RE+DYPY+GT+GG+CKFDK+KIAA+V+NFSV+S DEDQ+A
Sbjct: 199 CNGGLMNSAFEYILNNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIA 258
Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
ANLVK+GPLA + ++ + +++ VS P
Sbjct: 259 ANLVKNGPLAVAINAVYMQ----TYVGGVSCP 286
>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 377
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 176/289 (60%), Positives = 214/289 (74%), Gaps = 12/289 (4%)
Query: 16 SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
S A+ D+D +IRQVV G +D+ L HF+ F +F KTY EEH +R
Sbjct: 18 SPAAATATAGDEDPLIRQVV--GGADGDDNDLELSSHFTSFVQRFGKTYKDAEEHAHRLS 75
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAP 130
VFKANLRRA+R QLLDP+A HG+TKFSDLTP+EFRR FLGL R + A AP
Sbjct: 76 VFKANLRRARRHQLLDPSAEHGITKFSDLTPAEFRRTFLGLKTSRRSFLREIGGSAHDAP 135
Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
+LPT+ LP DFDWRDHGAV VK+QG+CGSCWSFSA+GALEGA++L+TG++ LSEQQ V
Sbjct: 136 VLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMEVLSEQQFV 195
Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
DCDHECDPEE SCD+GCNGGLM SAF Y+LK+GG+EREKDYPYTG D G+CKFDKSKI
Sbjct: 196 DCDHECDPEEPDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGRD-GTCKFDKSKIV 254
Query: 251 AAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
A+V NFSV+S DE+Q+AANLVKHGPLA + + + +++ VS P
Sbjct: 255 ASVQNFSVVSVDEEQIAANLVKHGPLAIGINAAYMQ----TYIGGVSCP 299
>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 376
Score = 353 bits (905), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 178/304 (58%), Positives = 222/304 (73%), Gaps = 14/304 (4%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
+ RL + +LLLS V A + V +D +I QVV G++ + LNAE HF+ F +F
Sbjct: 4 LRRLPIVVAAVLLLSGVAALSSPV--EDPLIEQVV--GGDEKNELELNAEAHFASFVQRF 59
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
+K+Y +EH +R VF ANLRRA+R Q LDP+AVHGVTKFSDLTP EFR +FLGL +
Sbjct: 60 NKSYRDADEHAHRLSVFTANLRRARRHQRLDPSAVHGVTKFSDLTPDEFRDRFLGLRKYR 119
Query: 121 R-----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
R L A AP LPT+ LPT+FDWR+HGAV VKDQG+CGSCWSFS +GALEGAH+
Sbjct: 120 RSFLKGLSGSAHDAPALPTDGLPTEFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHY 179
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
L+TG+L LSEQQ+VDCDHECDP E +CD+GCNGGLM +AF Y+ KAGG+E EKDYPYT
Sbjct: 180 LATGKLEVLSEQQMVDCDHECDPSEPRACDAGCNGGLMTTAFSYLAKAGGLETEKDYPYT 239
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
G GG+CKFDKSKIAA V NFS ++ DEDQ+AANLVKHGPLA + ++ + +++
Sbjct: 240 GR-GGACKFDKSKIAAQVKNFSTVAVDEDQIAANLVKHGPLAIGINAVFMQ----TYIGG 294
Query: 296 VSSP 299
VS P
Sbjct: 295 VSCP 298
>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
Length = 371
Score = 352 bits (903), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 174/278 (62%), Positives = 208/278 (74%), Gaps = 12/278 (4%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +IRQVVP G D LNAE HF F +F K+Y +EH YR VFKANLRRA+R
Sbjct: 24 EDPLIRQVVP--GGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKANLRRARR 81
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
QLLDP+A HGVTKFSDLTP+EFRR +LGL + R L A +AP+LPT+ LP DF
Sbjct: 82 HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDF 141
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDHGAV VK+QG+CGSCWSFSA+GALEGAH+L+TG+L LSEQQ VDCDHECD E
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEP 201
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+D G CKFDKSKI A+V NFSV+S
Sbjct: 202 DSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSD-GKCKFDKSKIVASVQNFSVVSV 260
Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DE Q++ANL+KHGPLA + + + +++ VS P
Sbjct: 261 DEAQISANLIKHGPLAIGINAAYMQ----TYIGGVSCP 294
>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
Group]
gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
Length = 373
Score = 349 bits (896), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 173/290 (59%), Positives = 215/290 (74%), Gaps = 12/290 (4%)
Query: 15 SSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
S +A+A +++ +IRQVV G + LNAE HF+ F +F K+Y +EH YR
Sbjct: 14 SPAVAAASVPGEEEPLIRQVV--GGGDDNELELNAERHFASFVQRFGKSYRDADEHAYRL 71
Query: 75 RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKA 129
VFKANLRRA+R QLLDP+A HGVTKFSDLTP+EFRR +LGL R L A +A
Sbjct: 72 SVFKANLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRAYLGLRTSRRAFLRGLGGSAHEA 131
Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
P+LPT+ LP DFDWRDHGAV VK+QG+CGSCWSFSA+GALEGA++L+TG++ LSEQQ+
Sbjct: 132 PVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMDVLSEQQM 191
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCDHECD E SCD+GCNGGLM +AF Y+LK+GG+E EKDYPYTG D G+CKFDKSKI
Sbjct: 192 VDCDHECDSSEPDSCDAGCNGGLMTNAFSYLLKSGGLESEKDYPYTGRD-GTCKFDKSKI 250
Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
+V NFSV+S DEDQ+AANLVKHGPLA + + + +++ VS P
Sbjct: 251 VTSVQNFSVVSVDEDQIAANLVKHGPLAIGINAAYMQ----TYIGGVSCP 296
>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
Length = 371
Score = 349 bits (896), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 173/278 (62%), Positives = 207/278 (74%), Gaps = 12/278 (4%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +IRQVVP G D LNAE HF F +F K+Y +EH YR VFK NLRRA+R
Sbjct: 24 EDPLIRQVVP--GGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARR 81
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
QLLDP+A HGVTKFSDLTP+EFRR +LGL + R L A +AP+LPT+ LP DF
Sbjct: 82 HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDF 141
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDHGAV VK+QG+CGSCWSFSA+GALEGAH+L+TG+L LSEQQ VDCDHECD E
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEP 201
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+D G CKFDKSKI A+V NFSV+S
Sbjct: 202 DSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSD-GKCKFDKSKIVASVQNFSVVSV 260
Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DE Q++ANL+KHGPLA + + + +++ VS P
Sbjct: 261 DEAQISANLIKHGPLAIGINAAYMQ----TYIGGVSCP 294
>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
Length = 371
Score = 349 bits (895), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 172/278 (61%), Positives = 207/278 (74%), Gaps = 12/278 (4%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +IRQVVP G + LNAE HF F +F K+Y EEH YR +FKANLRRA+R
Sbjct: 24 EDPLIRQVVP--GGDDNELELNAESHFLSFVQRFGKSYKDAEEHAYRLSIFKANLRRARR 81
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
QLLDP+A HGVTKFSDLTP+EFRR +LGL + R L A +AP+LPT+ LP DF
Sbjct: 82 HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGKSANEAPVLPTDGLPDDF 141
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDHGAVT VK+QG+CGSCWSFS +GALEGAH+L+TG+L LSEQQ+VDCDH CD E
Sbjct: 142 DWRDHGAVTPVKNQGSCGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHVCDTSEP 201
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+D CKFDKSKI A+V NFSV+S
Sbjct: 202 DSCDSGCNGGLMTNAFSYLQKAGGLESEKDYPYTGSD-DKCKFDKSKIVASVQNFSVVSV 260
Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DE Q+AANL+KHGPLA + + + +++ VS P
Sbjct: 261 DEGQIAANLIKHGPLAIGINAAYMQ----TYIGGVSCP 294
>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
vulgare]
gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 377
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 170/285 (59%), Positives = 212/285 (74%), Gaps = 12/285 (4%)
Query: 20 SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
+ A D++ +IRQVV G D+ L + F F +F KTY EEH +R VFKA
Sbjct: 22 ATAAAGDEEPLIRQVV--GGADPLDNDLELDSQFVGFVQRFGKTYRDAEEHAHRLSVFKA 79
Query: 80 NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPT 134
NLRRA+R QLLDP+A HGVTKFSDLTP+EFRR +LGL R + A AP+LPT
Sbjct: 80 NLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLKTTRRSFLREMAGSAHDAPVLPT 139
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
+ LP DFDWRDHGAV VK+QG+CGSCWSFSA+GALEGA++L++G++ LSEQQLVDCDH
Sbjct: 140 DGLPEDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLASGKMEVLSEQQLVDCDH 199
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
ECDP E SCD+GCNGGLM SAF Y+LK+GG+EREKDYPYTG D G+CKFDKSKIAA+V
Sbjct: 200 ECDPSEPDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGKD-GTCKFDKSKIAASVQ 258
Query: 255 NFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
N+SV++ DE+Q+AANLVK+GPLA + + + +++ VS P
Sbjct: 259 NYSVVAVDEEQIAANLVKYGPLAIGINAAYMQ----TYIGGVSCP 299
>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
Length = 319
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 167/230 (72%), Positives = 195/230 (84%), Gaps = 4/230 (1%)
Query: 55 LFKSKF-SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
L + KF + YAT+EEHD+RF VFK+NLRRA P VHGVTKFSDLTP+EFRRQF
Sbjct: 7 LSRPKFRPRPYATKEEHDHRFGVFKSNLRRASCTPSSTPR-VHGVTKFSDLTPAEFRRQF 65
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
LGL + +R PA AQKAPILPT DLP DFDWRD GAVT VKDQG CGSCWSFS TGALEGA
Sbjct: 66 LGL-KAVRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDQGGCGSCWSFSTTGALEGA 124
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
++L+TGELVSLSEQQLVDCDH CDPEE G+CDSGCNGGLMN+AFEYIL++GGV++EKDYP
Sbjct: 125 YYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKEKDYP 184
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
YTG D G+CKFDK+K+AA VSN+SV+ DE+Q+AANLVK+GPLA + ++
Sbjct: 185 YTGRD-GTCKFDKTKVAATVSNYSVVCLDEEQIAANLVKNGPLAVAINAV 233
>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 381
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 166/278 (59%), Positives = 208/278 (74%), Gaps = 12/278 (4%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +I QVV D E + LNAE HF+ F +F K+Y +EH++R VF+ANLRRA+R
Sbjct: 34 EDPLIEQVVGGDAENELE--LNAEAHFASFVRRFGKSYRDADEHEHRLSVFRANLRRARR 91
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
Q LDP+AVHG+TKFSDLTP EFR +FLGL + R + A AP LPT+ LPT+F
Sbjct: 92 HQRLDPSAVHGITKFSDLTPDEFRERFLGLRKSRRSFLKGISGSAHDAPALPTDGLPTEF 151
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+HGAV VKDQG+CGSCWSFS +GALEGA++L+TG+L LSEQQLVDCDHECDP E
Sbjct: 152 DWREHGAVGPVKDQGSCGSCWSFSTSGALEGANYLATGKLEVLSEQQLVDCDHECDPSEP 211
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
+CD+GCNGGLM +AF Y+ KAGG+E EKDYPYTG + +CKFDKSKIAA V NFS ++
Sbjct: 212 RACDAGCNGGLMTTAFSYLAKAGGLETEKDYPYTGRN-SACKFDKSKIAAQVKNFSTVAI 270
Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DEDQ+AANLVKHGPLA + ++ + +++ VS P
Sbjct: 271 DEDQIAANLVKHGPLAIGINAVFMQ----TYIGGVSCP 304
>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
Length = 367
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 160/278 (57%), Positives = 208/278 (74%), Gaps = 11/278 (3%)
Query: 28 DAMIRQVVPSDGEQSEDHL------LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
D+ IR+V + ++S L L+ E HF F ++F K YAT E + +R +VF+ANL
Sbjct: 27 DSGIREVTDTARDESNGRLDAAKALLDVETHFKSFIARFGKAYATAEAYAHRLKVFEANL 86
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
RA Q LDP+AVHG+T+FSDLT EF++QFLGL RL +A KAP+LPTNDLP DF
Sbjct: 87 VRAVSHQALDPSAVHGITQFSDLTEEEFKQQFLGLRVPSRL-REANKAPVLPTNDLPEDF 145
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+HGAVT VK+QGACGSCW+FS TGA+EGAHFL TG+L+SLSEQQLVDCDH CDP +
Sbjct: 146 DWREHGAVTEVKNQGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDK 205
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCD+GCNGGLM +A++Y++K+GG+E E DYPYTG G C+F+ +KI A+V+NFS +S
Sbjct: 206 VSCDAGCNGGLMTNAYDYVMKSGGLETETDYPYTGNSNGKCQFNANKIVASVANFSTVSL 265
Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DEDQ+AANLVKHGPLA + ++ + +++ VS P
Sbjct: 266 DEDQIAANLVKHGPLAIGINAVFMQ----TYIGGVSCP 299
>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
Length = 377
Score = 337 bits (863), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 167/279 (59%), Positives = 208/279 (74%), Gaps = 12/279 (4%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
D++ +IRQVV G D+ L + F +F KTY EEH +R VFKANLRRA+
Sbjct: 28 DEEPLIRQVV--GGADPLDNDLELDSQLLGFVQRFGKTYRDAEEHAHRLSVFKANLRRAR 85
Query: 86 RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTD 140
R Q+LDP+A HGVTKFSDLTP+EFRR FLGL R + A AP+LPT+ LP D
Sbjct: 86 RHQMLDPSAEHGVTKFSDLTPAEFRRTFLGLKTTRRSFLREMAGSAHDAPVLPTDGLPED 145
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
FDWRDHGAV VK+QG+C SCWSFSA+GALEGA++L+TG++ LSEQQLVDCDHECDP E
Sbjct: 146 FDWRDHGAVGPVKNQGSCWSCWSFSASGALEGANYLATGKMEVLSEQQLVDCDHECDPAE 205
Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
SCD+GCNGGLM SAF Y+LK+GG+EREKDYPYTG D G+CKF+KSKIAA+V NFSV++
Sbjct: 206 PDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGKD-GTCKFEKSKIAASVQNFSVVA 264
Query: 261 SDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DE+Q+AANLV++GPLA + + + +++ VS P
Sbjct: 265 VDEEQIAANLVEYGPLAIGINAAYMQ----TYIGGVSCP 299
>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
Length = 330
Score = 337 bits (863), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 158/265 (59%), Positives = 204/265 (76%), Gaps = 7/265 (2%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
V +G++S LL+ E HF F ++F K YAT E + +R +VF+ANL RA Q LDP+A
Sbjct: 5 VVDNGDRSA--LLDVETHFKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDPSA 62
Query: 95 VHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
VHG+T+FSDLT EF++QFLGL RL +A KAP+LPTNDLP DFDWR+HGAVT VK+
Sbjct: 63 VHGITQFSDLTEEEFKQQFLGLRVPSRL-REANKAPVLPTNDLPEDFDWREHGAVTEVKN 121
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QGACGSCW+FS TGA+EGAHFL TG+L+SLSEQQLVDCDH CDP + SCD+GCNGGLM
Sbjct: 122 QGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMT 181
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
+A++Y++K+GG+E E DYPYTG G C+F+ +KI A+V+NFS +S DEDQ+AANLVKHG
Sbjct: 182 NAYDYVMKSGGLETETDYPYTGNSNGKCQFNANKIVASVANFSTVSLDEDQIAANLVKHG 241
Query: 275 PLAGNVASIELPHISFSFLFTVSSP 299
PLA + ++ + +++ VS P
Sbjct: 242 PLAIGINAVFMQ----TYIGGVSCP 262
>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
Length = 381
Score = 337 bits (863), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 164/256 (64%), Positives = 196/256 (76%), Gaps = 6/256 (2%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
++D +I QVV G + ED L+AE HF+ F+ +F +TY E YR VF ANLRRA
Sbjct: 32 GEEDPLIEQVV--GGGEEEDAQLDAEAHFASFERRFGRTYRDAGERAYRMSVFAANLRRA 89
Query: 85 KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPTDF 141
+R Q LDPTA HGVTKFSDLTP EFR +FLGL R + + +APILPT+ LP DF
Sbjct: 90 RRHQRLDPTATHGVTKFSDLTPGEFRDRFLGLRRPSLEGLVGGEPHEAPILPTDGLPDDF 149
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+HGAV VKDQG+CGSCWSFS +GALEGAHFL+TG+L LSEQQ+VDCDHECD ES
Sbjct: 150 DWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASES 209
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
+CDSGCNGGLM +AF Y++K+GG++ EKDYPY G + +CKFDKSKI A V NFSVIS
Sbjct: 210 RACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRE-NTCKFDKSKIVAQVKNFSVISV 268
Query: 262 DEDQMAANLVKHGPLA 277
+EDQ+AANLVKHGPLA
Sbjct: 269 NEDQIAANLVKHGPLA 284
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 336 bits (862), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 175/280 (62%), Positives = 209/280 (74%), Gaps = 14/280 (5%)
Query: 7 SSLLL--LLLSSVLASAVAVNDDDAMIRQV---VPSDGE--QSEDHLLNAEHHFSLFKSK 59
S+LL + SV+ + A DD +IRQV V SD + + L NAE HF F +
Sbjct: 4 STLLFSAFCIFSVIFLSSATKPDDDLIRQVTDEVVSDPQILDARSALFNAEVHFRHFIRR 63
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ K Y+ EEH++RF VFK+NL RA Q LDP A HGVTKFSDLT EFR Q+LGL
Sbjct: 64 YGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGL--- 120
Query: 120 LRLPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
R P DA APILPTNDLP DFDWR+ GAVT VK+QG+CGSCW+FS TGALEGA+FL
Sbjct: 121 -RAPPLRDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGELVSLSEQQLVDCDHECDP ++ SCDSGCNGGLM SA++Y LK+GG+E+E+DYPYTG
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
D G+C F+K+KI A VSNFSV+S DE Q+AANLVK+GPL+
Sbjct: 240 D-GTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLS 278
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 336 bits (861), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 175/280 (62%), Positives = 209/280 (74%), Gaps = 14/280 (5%)
Query: 7 SSLLL--LLLSSVLASAVAVNDDDAMIRQV---VPSDGE--QSEDHLLNAEHHFSLFKSK 59
S+LL + SV+ + A DD +IRQV V SD + + L NAE HF F +
Sbjct: 4 STLLFSAFCIFSVIFLSSATRPDDDLIRQVTDEVVSDPQILDARSALFNAEVHFRHFIRR 63
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ K Y+ EEH++RF VFK+NL RA Q LDP A HGVTKFSDLT EFR Q+LGL
Sbjct: 64 YGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGL--- 120
Query: 120 LRLPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
R P DA APILPTNDLP DFDWR+ GAVT VK+QG+CGSCW+FS TGALEGA+FL
Sbjct: 121 -RAPPLRDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGELVSLSEQQLVDCDHECDP ++ SCDSGCNGGLM SA++Y LK+GG+E+E+DYPYTG
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
D G+C F+K+KI A VSNFSV+S DE Q+AANLVK+GPL+
Sbjct: 240 D-GTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLS 278
>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
Length = 366
Score = 333 bits (854), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 174/280 (62%), Positives = 208/280 (74%), Gaps = 14/280 (5%)
Query: 7 SSLLL--LLLSSVLASAVAVNDDDAMIRQV---VPSDGE--QSEDHLLNAEHHFSLFKSK 59
S+LL + SV+ + A DD +IRQV V SD + + L NAE HF F +
Sbjct: 4 STLLFSAFCIFSVIFLSSATRPDDDLIRQVTDEVVSDPQILDARSALFNAEVHFRHFIRR 63
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ K Y+ EEH++RF VFK+NL RA Q LDP A HGVTKFSDLT FR Q+LGL
Sbjct: 64 YGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEGFRHQYLGL--- 120
Query: 120 LRLPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
R P DA APILPTNDLP DFDWR+ GAVT VK+QG+CGSCW+FS TGALEGA+FL
Sbjct: 121 -RAPPLRDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGELVSLSEQQLVDCDHECDP ++ SCDSGCNGGLM SA++Y LK+GG+E+E+DYPYTG
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
D G+C F+K+KI A VSNFSV+S DE Q+AANLVK+GPL+
Sbjct: 240 D-GTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLS 278
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 333 bits (854), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 171/283 (60%), Positives = 207/283 (73%), Gaps = 11/283 (3%)
Query: 5 ILSSLLLLLLSSVLASA-VAVNDDDAM----IRQVVPSDGEQSEDHL----LNAEHHFSL 55
ILS LL L+ ++ A A +D +A+ IR+V DGE D L LNAE HF+
Sbjct: 18 ILSLALLFLVPTITAHVHEASSDLNAVLPNPIREVTDMDGEGVIDDLRRGLLNAEAHFAH 77
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F KF+K Y+ EEH RF +FK NL +A R Q LD A+HG+ KFSDLT EF Q+LG
Sbjct: 78 FVKKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDRDAIHGINKFSDLTEEEFHEQYLG 137
Query: 116 LNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
L R L Q APILPT+DLP DFDWR+ GAVT VK+QGACGSCW+FS TGA+EGA+
Sbjct: 138 LTTPPRSLSQRTQPAPILPTDDLPPDFDWRELGAVTPVKNQGACGSCWTFSTTGAMEGAN 197
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F+ TG+L+SLSEQQLVDCDHECD E CDSGCNGGLM +A++Y LKAGG++RE+DYPY
Sbjct: 198 FMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQYALKAGGLQREEDYPY 257
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
TG D GSCKFD +K+AA V+NFS +S DEDQ+AANLVK+GPLA
Sbjct: 258 TGID-GSCKFDNTKVAAMVANFSTVSIDEDQIAANLVKNGPLA 299
>gi|388519111|gb|AFK47617.1| unknown [Medicago truncatula]
Length = 241
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 155/202 (76%), Positives = 177/202 (87%), Gaps = 4/202 (1%)
Query: 24 VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
N DD +IRQVV + +EDH+LNAEHHF+ FKSKFSK YAT+EEHDYRF VFK+NL +
Sbjct: 26 TNSDDLLIRQVV----DTAEDHILNAEHHFTSFKSKFSKNYATKEEHDYRFGVFKSNLIK 81
Query: 84 AKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDW 143
AK Q LDP+A HG+TKFSDLT SEFRRQFLGLN+RLRLPA AQKAPILPTN+LP DFDW
Sbjct: 82 AKLHQKLDPSAQHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTNNLPEDFDW 141
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
R+ GAVT VKDQG+CGSCW+FS TGALEGA++L+TG+L SLSEQQLVDCDH CDPEE GS
Sbjct: 142 REKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGS 201
Query: 204 CDSGCNGGLMNSAFEYILKAGG 225
CDSGCNGGLMN+AFEYIL++GG
Sbjct: 202 CDSGCNGGLMNNAFEYILQSGG 223
>gi|353441136|gb|AEQ94152.1| drought-inducible cysteine proteinase [Elaeis guineensis]
Length = 252
Score = 330 bits (845), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 167/241 (69%), Positives = 195/241 (80%), Gaps = 7/241 (2%)
Query: 11 LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHL-LNAEHHFSLFKSKFSKTYATQEE 69
+ L +SV +S + +DD +I QVVP E ED L LNAE HFS F +F K+YA ++E
Sbjct: 15 VALSASVASSWPSYAEDDPLIVQVVP---ESDEDELRLNAEAHFSSFLRRFGKSYADEKE 71
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP---ADA 126
H YRF VFKANLRRA+R Q +DPTAVHG+TKFSDLTP+EFRR +LGL RL A +
Sbjct: 72 HAYRFSVFKANLRRARRHQKMDPTAVHGITKFSDLTPAEFRRTYLGLRGGRRLRRALASS 131
Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
+APILPTN+LPTDFDWRDHGAVTGVKDQG+CGSCWSFSA+GALEGA+FL+TG+L SLSE
Sbjct: 132 HEAPILPTNNLPTDFDWRDHGAVTGVKDQGSCGSCWSFSASGALEGANFLATGQLESLSE 191
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
QQLVDCDHECD E SCDSGCNGGLM +AFEY+LK+GG+E EKDYPYTGTD G CKFD+
Sbjct: 192 QQLVDCDHECDSSEPDSCDSGCNGGLMTTAFEYLLKSGGLELEKDYPYTGTDRGRCKFDE 251
Query: 247 S 247
S
Sbjct: 252 S 252
>gi|40806502|gb|AAR92156.1| putative cysteine protease 3 [Iris x hollandica]
Length = 292
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 146/220 (66%), Positives = 180/220 (81%), Gaps = 5/220 (2%)
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR-RLRLPADAQKAPILPTNDLPT 139
+RRA+R Q LDPTAVHGVT+FSDLTP EF+R +LGL + + L A +AP+LPTNDLP
Sbjct: 1 MRRARRHQQLDPTAVHGVTQFSDLTPGEFKRTYLGLRKGKKHLVGSAHEAPLLPTNDLPE 60
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWRD GAVTGVK+QG+CGSCWSFS +GALEGA+FL+TG+L +LSEQQ+VDCDHECD E
Sbjct: 61 DFDWRDKGAVTGVKNQGSCGSCWSFSTSGALEGANFLATGKLETLSEQQMVDCDHECDAE 120
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
E CD GCNGGLMN+AF+Y+ K GG+E EKDYPYTGTD G+CKFD+SKI A+V NFSV+
Sbjct: 121 EPDDCDQGCNGGLMNTAFQYLQKVGGLESEKDYPYTGTDRGTCKFDESKIKASVHNFSVV 180
Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
S DE+Q+AANLVKHGPLA + ++ + +++ VS P
Sbjct: 181 SIDEEQIAANLVKHGPLAIAINAVFMQ----TYIGGVSCP 216
>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
Length = 364
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 155/256 (60%), Positives = 187/256 (73%), Gaps = 23/256 (8%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
++D +I QVV G + ED L+AE HF+ F+ +F +TY RRA
Sbjct: 32 GEEDPLIDQVV--GGGEEEDAQLDAEAHFASFERRFGRTYP--------------GPRRA 75
Query: 85 KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPTDF 141
+R LDPTA HGVTKFSDLTP EFR +FLGL R + + +APILPT+ LP DF
Sbjct: 76 RR---LDPTATHGVTKFSDLTPGEFRDRFLGLRRPSLEGLVGGEPHEAPILPTDGLPDDF 132
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+HGAV VKDQG+CGSCWSFS +GALEGAHFL+TG+L LSEQQ+VDCDHECD ES
Sbjct: 133 DWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASES 192
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
+CDSGCNGGLM +AF Y++K+GG++ EKDYPY G + +CKFDKSKI A V NFSVIS
Sbjct: 193 RACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRE-NTCKFDKSKIVAQVKNFSVISV 251
Query: 262 DEDQMAANLVKHGPLA 277
+EDQ+AANLVKHGPLA
Sbjct: 252 NEDQIAANLVKHGPLA 267
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 300 bits (768), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 150/262 (57%), Positives = 189/262 (72%), Gaps = 4/262 (1%)
Query: 18 LASAVAVNDDDAMIRQVVPSDG--EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
L +++ + D + V DG EQ LL AE F F +F K Y T EE+++RF+
Sbjct: 19 LVASLPLRDVIQQVTDGVRVDGSVEQFAHALLGAEKQFESFIKEFGKVYHTVEEYEHRFK 78
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
VFK+NL RA + Q LDPTA HGVT FSDLT EF Q+LGL R L + A A LPT
Sbjct: 79 VFKSNLLRALKHQALDPTASHGVTMFSDLTEEEFATQYLGLKRPSAL-STAPTAEPLPTG 137
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
DLP FDWR+ GAV VK+QG+CGSCW+FS TGA+EGAHFL+TG+L+SLSEQQLVDCDH+
Sbjct: 138 DLPPSFDWREKGAVGPVKNQGSCGSCWAFSTTGAVEGAHFLATGKLLSLSEQQLVDCDHQ 197
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
CDPEE+ +CD+GC GGLM +A++Y+ +AGG+E E DYPY G D G C+F+ +K+AA VSN
Sbjct: 198 CDPEEAQACDAGCGGGLMTNAYKYVEEAGGLELESDYPYKGRD-GKCQFNPNKVAAKVSN 256
Query: 256 FSVISSDEDQMAANLVKHGPLA 277
F+ I DEDQ+AA L+K GPLA
Sbjct: 257 FTNIPIDEDQVAAYLIKSGPLA 278
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 152/279 (54%), Positives = 192/279 (68%), Gaps = 11/279 (3%)
Query: 4 LILSSLLLLLLSSVLAS-----AVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
L+L +++L + AS + DDA+ V EQ L+ AE F F
Sbjct: 6 LLLVGIVVLGFAGFAASLPTGDTIREVTDDALSNGSV----EQFAHALIGAEKRFESFMK 61
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
F K Y + EE+++RF VFK+NL +A + Q LDPTA HGVT FSDLT EF ++LGL R
Sbjct: 62 DFGKVYHSVEEYEHRFGVFKSNLLKALKHQALDPTASHGVTMFSDLTEEEFTSKYLGLKR 121
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
L + A +AP LPT DLP +FDWR+ GAV VKDQG CGSCW+FS TGA+EGAHFL++
Sbjct: 122 PSVL-SSAPQAPPLPTEDLPPNFDWREKGAVGPVKDQGGCGSCWAFSTTGAVEGAHFLNS 180
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDH+CD EE+ +CD+GCNGG M +A++Y+ AGG+E E DYPY G D
Sbjct: 181 GKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFMTNAYQYVEAAGGLELESDYPYEGRD 240
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G CKFD +K+A VSNF+ I DEDQ+AA L+K GPLA
Sbjct: 241 -GKCKFDSNKVAVKVSNFTNIPVDEDQVAAYLIKSGPLA 278
>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 367
Score = 290 bits (742), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 143/300 (47%), Positives = 202/300 (67%), Gaps = 10/300 (3%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL--NAEHHFSLFKSKFS 61
++ +L L+ +L V + +D IRQV +D + +LL + E F LF S +
Sbjct: 1 MVAKALAQLITCIILFCHVVASVEDLTIRQVT-ADNRRIRPNLLGTHTESKFRLFMSDYG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--RR 119
K Y+T+EE+ +R +F N+ +A Q++DP+AVHGVT+FSDLT EF+R + G+
Sbjct: 60 KNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGG 119
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
R +AP++ + LP DFDWR+ G VT VK+QGACGSCW+FS TGA EGAHF+STG
Sbjct: 120 SRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTG 179
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+L+SLSEQQLVDCD CDP++ +CD+GC GGLM +A+EY+++AGG+E E+ YPYTG
Sbjct: 180 KLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKR- 238
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
G CKFD K+A V NF+ I DE+Q+AANLV+HGPLA + ++ + +++ VS P
Sbjct: 239 GHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQ----TYIGGVSCP 294
>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 143/301 (47%), Positives = 201/301 (66%), Gaps = 11/301 (3%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL--NAEHHFSLFKSKFS 61
++ +L L+ + V + +D IRQV +D + +LL + E F +F S +
Sbjct: 1 MVAKALAQLITCIIFFCHVVASVEDLTIRQVT-ADERRVRPNLLGTHTESKFRVFMSDYG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--RR 119
K Y+T+EE+ +R +F N+ +A Q++DPTAVHGVT+FSDLT EF+R + G+
Sbjct: 60 KNYSTREEYIHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGG 119
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
R A +AP++ + LP DFDWR+ G VT VK+QGACGSCW+FS TGA EGAHF+STG
Sbjct: 120 SRGHAVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTG 179
Query: 180 ELVSLSEQQLVDCDHE-CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
+L+SLSEQQLVDCD CDP++ +CD+GC GGLM +A+EY+++AGG+E E+ YPYTG
Sbjct: 180 KLLSLSEQQLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKR 239
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
G CKFD K+A V NF+ I DEDQ+AANLV+ GPLA + ++ + +++ VS
Sbjct: 240 -GHCKFDPEKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQ----TYIGGVSC 294
Query: 299 P 299
P
Sbjct: 295 P 295
>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
Length = 348
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 135/198 (68%), Positives = 157/198 (79%), Gaps = 4/198 (2%)
Query: 83 RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPT 139
R R LDPTA HGVTKFSDLTP EFR + LGL R + + +APILPT+ LP
Sbjct: 55 RELRAARLDPTATHGVTKFSDLTPGEFRDRLLGLRRPSLEGLVGGEPHEAPILPTDGLPD 114
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DFDWR+HGAV VKDQG+CGSCWSFS +GALEGAHFL+TG+L LSEQQ+VDCDHECD
Sbjct: 115 DFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDAS 174
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
ES +CDSGCNGGLM +AF Y++K+GG++ EKDYPY G + +CKFDKSKI A V NFSVI
Sbjct: 175 ESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRE-NTCKFDKSKIVAQVKNFSVI 233
Query: 260 SSDEDQMAANLVKHGPLA 277
S +EDQ+AANLVKHGPLA
Sbjct: 234 SVNEDQIAANLVKHGPLA 251
>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
Length = 403
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 137/282 (48%), Positives = 194/282 (68%), Gaps = 12/282 (4%)
Query: 4 LILSS-LLLLLLSSVLASAVAVND----DDAMIRQVVPSDGEQSEDHLLN--AEHHFSLF 56
L+L+ + LL++S+ ++ ++ +++ + I QV + + +HLLN ++ F F
Sbjct: 37 LVLAGCMFLLVISTQISFSLGLDNGRVSEGGFIAQVTE---KFNREHLLNLRSKTLFDKF 93
Query: 57 KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
+ K Y+T EE+ R R+F+ NL +A Q LDPTAVHG+T FSDLT EF ++ GL
Sbjct: 94 IVEHGKVYSTIEEYVRRLRIFEKNLLKAAENQALDPTAVHGITPFSDLTEYEFESRYTGL 153
Query: 117 -NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
R L + Q A ILP +DLP +FDWR+ GAVT VK QG CGSCW+FS TG +EGA+F
Sbjct: 154 LGVRQGLVNEKQTAEILPVDDLPANFDWREKGAVTEVKTQGNCGSCWAFSTTGVVEGANF 213
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
L+TG+L++LSEQQL+DCDH+CDP + +CD+GC+GGLM +A+ Y+++AGG+E K+YPYT
Sbjct: 214 LATGKLLNLSEQQLIDCDHKCDPLNTKACDNGCHGGLMTNAYNYLMEAGGIEEAKNYPYT 273
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G G CKF+ A NF+ ++ DE Q+AANLVKHGPLA
Sbjct: 274 GVQ-GDCKFNPDLAAVKAINFTTVNLDEKQIAANLVKHGPLA 314
>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 142/301 (47%), Positives = 195/301 (64%), Gaps = 12/301 (3%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L+ ++ L LL S + SA A+ D +RQV +DGE + +E F +F K+ K+
Sbjct: 42 LLACAISLALLISAIPSATALRRDPEFLRQV--TDGEIFNNLPAGSERKFVMFMEKYGKS 99
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-- 121
Y T++E+ +RF +F NL RA Q LDPTAVHGVT+FSDL+ EF R F+G+
Sbjct: 100 YPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFMGVRGGAGGE 159
Query: 122 -LPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
LP Q + LP FDWRD GAVT VK QG CGSCW+FS GA+EGA+F++T
Sbjct: 160 GLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAFSTCGAVEGANFIAT 219
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCDH CDP + +C++GCNGGLM +A++Y++++GG+E E YPYTG
Sbjct: 220 GNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYTGRS 279
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
G C F KIA VSNF+ I DE+Q+AA+LV+ GPLA + ++ + +++ VS
Sbjct: 280 -GQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQ----TYIGGVSC 334
Query: 299 P 299
P
Sbjct: 335 P 335
>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 142/301 (47%), Positives = 195/301 (64%), Gaps = 12/301 (3%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L+ ++ L LL S + SA A+ D +RQV +DGE + +E F +F K+ K+
Sbjct: 42 LLACAISLALLISAIPSATALRRDPEFLRQV--TDGEIFNNLPAGSERKFVMFMEKYGKS 99
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-- 121
Y T++E+ +RF +F NL RA Q LDPTAVHGVT+FSDL+ EF R F+G+
Sbjct: 100 YPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFMGVRGGAGGE 159
Query: 122 -LPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
LP Q + LP FDWRD GAVT VK QG CGSCW+FS GA+EGA+F++T
Sbjct: 160 GLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAFSTCGAVEGANFIAT 219
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCDH CDP + +C++GCNGGLM +A++Y++++GG+E E YPYTG
Sbjct: 220 GNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYTGRS 279
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
G C F KIA VSNF+ I DE+Q+AA+LV+ GPLA + ++ + +++ VS
Sbjct: 280 -GQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQ----TYIGGVSC 334
Query: 299 P 299
P
Sbjct: 335 P 335
>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
Length = 363
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 140/300 (46%), Positives = 198/300 (66%), Gaps = 14/300 (4%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL--NAEHHFSLFKSKFS 61
++ +L L+ +L V + +D IRQV +D + +LL + E F LF S +
Sbjct: 1 MVAKALAQLITCIILFCHVVASVEDLTIRQVT-ADNRRIRPNLLGTHTESKFRLFMSDYG 59
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--RR 119
K Y+T+EE+ +R +F N+ +A Q++DP+AVHGVT+FSDLT EF+R + G+
Sbjct: 60 KNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGG 119
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
R +AP++ + LP DFDWR+ G VT VK+QGACGSCW+FS TGA EGAHF+STG
Sbjct: 120 SRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTG 179
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+L+SLSEQQLVDCD + +CD+GC GGLM +A+EY+++AGG+E E+ YPYTG
Sbjct: 180 KLLSLSEQQLVDCDQ----ADKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKR- 234
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
G CKFD K+A V NF+ I DE+Q+AANLV+HGPLA + ++ + +++ VS P
Sbjct: 235 GHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQ----TYIGGVSCP 290
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 141/298 (47%), Positives = 196/298 (65%), Gaps = 12/298 (4%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ L+ + L L + L++A + R++ D E LL E F +F + ++
Sbjct: 10 MCLARVSLFLCALTLSAAHGSTTVQDIARKLKLGDNE-----LLRTEKKFKVFMENYGRS 64
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
Y+T+EE+ R +F N+ RA Q LDPTAVHGVT+FSDLT EF + + G+N
Sbjct: 65 YSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSDLTEDEFEKLYTGVNGGFPSS 124
Query: 124 ADAQK--APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
+A AP L + LP +FDWR+ GAVT VK QG CGSCW+FS TG++EGA+FL+TG+L
Sbjct: 125 NNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKL 184
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQL+DCD++CD E SCD+GCNGGLM +A+ Y+L++GG+E E YPYTG + G
Sbjct: 185 VSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-ERGE 243
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
CKFD KIA ++NF+ I +DE+Q+AA LVK+GPLA V +I + +++ VS P
Sbjct: 244 CKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQ----TYIGGVSCP 297
>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 377
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 137/293 (46%), Positives = 191/293 (65%), Gaps = 14/293 (4%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
SL+L L+ A V+D + + ++ LL E F++F + K Y+T+
Sbjct: 16 SLVLFALTLSSARQTTVHD--------IAKKLKLQDNQLLRTEKKFNVFMENYGKKYSTR 67
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
EE+ R +F N+ RA Q LDPTA+HGVT+FSDLT EF+R + G+N +
Sbjct: 68 EEYLQRLEIFAGNMLRAPENQALDPTAIHGVTQFSDLTEDEFQRHYTGVNGGFPWNNGVR 127
Query: 128 K-APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
AP L + LP DFDWR+ GAVT VK QG CGSCW+FS TG++EGA+F++TG+L++LSE
Sbjct: 128 DVAPPLKVDGLPEDFDWREKGAVTEVKMQGKCGSCWAFSTTGSIEGANFIATGKLLNLSE 187
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
QQLVDCD +CD ES +CD+GC GGLM +A++Y+L++GG+E E YPYTG G CKFD
Sbjct: 188 QQLVDCDSQCDITESTTCDNGCMGGLMTNAYKYLLQSGGLEEESSYPYTGAK-GECKFDP 246
Query: 247 SKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
K+A ++NF+ I DE+Q+AA LVKHGPLA + +I + +++ VS P
Sbjct: 247 GKVAVRITNFTNIPVDENQIAAYLVKHGPLAVGLNAIFMQ----TYIGGVSCP 295
>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
Length = 379
Score = 270 bits (690), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 138/278 (49%), Positives = 182/278 (65%), Gaps = 15/278 (5%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L L LSS L + D V E ++ LL E F LF +SK Y+T E
Sbjct: 19 LCALTLSSSLHHETLIQD--------VARKLELKDNDLLTTEKKFKLFMKDYSKKYSTTE 70
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP---AD 125
E+ R +F N+ +A Q LDPTA+HGVT+FSDL+ EF R + G + P A
Sbjct: 71 EYLLRLGIFAKNMVKAAEHQALDPTAIHGVTQFSDLSEEEFERFYTGF--KGGFPSSNAA 128
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
AP L P +FDWR+ GAVTG+K QG CGSCW+F+ TG++EGA+FL+TG+LVSLS
Sbjct: 129 GGVAPPLDVKGFPENFDWREKGAVTGIKTQGKCGSCWAFTTTGSIEGANFLATGKLVSLS 188
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCD++CD ++ SCD+GCNGGLM +A++Y+++AGG+E E YPYTG G CKFD
Sbjct: 189 EQQLVDCDNKCDITKT-SCDNGCNGGLMTTAYDYLMEAGGLEEETSYPYTGAQ-GECKFD 246
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
+K+A VSNF+ I +DE+Q+AA LV HGPLA V ++
Sbjct: 247 PNKVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAV 284
>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
Length = 327
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 173/255 (67%), Gaps = 5/255 (1%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
+LL E F +F + +K YAT+EE+ +RF +F NL RA Q LDPTA+HGVT F DL
Sbjct: 6 NLLGTEEKFKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVEHQALDPTAIHGVTPFMDL 65
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
T EF R + G+ +P + + + LP FDWR+ GAVT VK QG+CGSCW+F
Sbjct: 66 TEEEFERMYAGVLGGGTVPVEKGSVSFMDASGLPDSFDWREKGAVTDVKIQGSCGSCWAF 125
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG++EGA+F++TG+L++LSEQQLVDCD CD + SCD GC GGLM +A+ Y+++AG
Sbjct: 126 STTGSVEGANFIATGKLLNLSEQQLVDCDRVCDKTDKASCDDGCGGGLMTNAYRYLIEAG 185
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
G++ E YPYTG G CKFD KIA V+NF+ I+ DE+Q+AANLV HGPLA + +I
Sbjct: 186 GLQEESSYPYTGKS-GECKFDPEKIAVKVANFTSIAVDENQIAANLVHHGPLAIGLNAIF 244
Query: 285 LPHISFSFLFTVSSP 299
+ +++ VS P
Sbjct: 245 MQ----TYIGGVSCP 255
>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
[Glycine max]
Length = 374
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/294 (46%), Positives = 188/294 (63%), Gaps = 13/294 (4%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L+ + L L + L+SA + R++ D E LL E F +F + ++Y+
Sbjct: 12 LARVSLFLFALTLSSAHESTTVHDIARKLKVGDNE-----LLRTEKKFKVFMENYGRSYS 66
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T+EE+ R +F N+ RA Q LDPTAVHGVT+FSDLT EF + + G
Sbjct: 67 TREEYLRRLGIFSQNMLRAAEHQALDPTAVHGVTQFSDLTEVEFEKLYTGXPST---NTA 123
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
AP L LP +FDWR+ GAVT VK QG CGSCW+FS TG++EGA+FL+TG+LVSLS
Sbjct: 124 GGVAPPLEVEGLPENFDWREKGAVTEVKIQGRCGSCWAFSTTGSIEGANFLATGKLVSLS 183
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQL+DCD++C+ E SCD+GCNGGLM +A+ Y+L++GG+E E YPYTG + G CKFD
Sbjct: 184 EQQLLDCDNKCEITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-ERGECKFD 242
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
KI ++NF+ I DE+Q+AA LVK+GPLA V +I + +++ VS P
Sbjct: 243 PEKITVRITNFTNIPVDENQIAAYLVKNGPLAMGVNAIFMQ----TYIGGVSCP 292
>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
Length = 375
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 128/257 (49%), Positives = 173/257 (67%), Gaps = 6/257 (2%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSD 103
D +L E F +F K+ K Y+++EE+ +R +F N+ RA Q LDPTA+HGVT FSD
Sbjct: 52 DGVLGTEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPTALHGVTPFSD 111
Query: 104 LTPSEFRRQFLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
L+ EF R F G+ R + A+ A L + LP FDWR+ GAVT VK QG CGSCW
Sbjct: 112 LSEEEFERMFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCW 171
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TGA+EGAHF+ST +L++LSEQQLVDCDH CD + +CDSGC GGLM +A++Y+++
Sbjct: 172 AFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIE 231
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVAS 282
AGG+E E YPYTG G CKF ++A V NF+ + +E+Q+AANLV HGPLA + +
Sbjct: 232 AGGLEEESSYPYTGKH-GECKFKPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNA 290
Query: 283 IELPHISFSFLFTVSSP 299
I + +++ VS P
Sbjct: 291 IFMQ----TYIGGVSCP 303
>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
Length = 397
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 141/322 (43%), Positives = 193/322 (59%), Gaps = 31/322 (9%)
Query: 4 LILSSLLLLLLSSVLASAVAVN-------DDDAMIRQVVPSD------GEQSEDHLL--- 47
++ +L + LLS L S+ D MIRQV + G S +H L
Sbjct: 9 MLTCTLAITLLSCALISSTTFQHEIQYRVQDPLMIRQVTDNHHHRHHPGRSSANHRLLGT 68
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPS 107
E HF F ++ KTY+T EE+ +R +F NL +A Q +DP+A+HGVT+FSDLT
Sbjct: 69 TTEVHFKSFVEEYEKTYSTHEEYVHRLGIFAKNLIKAAEHQAMDPSAIHGVTQFSDLTEE 128
Query: 108 EFRRQFLGLNRRLRLPADAQKAP----------ILPTNDLPTDFDWRDHGAVTGVKDQGA 157
EF ++GL + Q ++ +DLP FDWR+ GAVT VK QG
Sbjct: 129 EFEATYMGLKGGAGVGGTTQLGKDDGDESAAEVMMDVSDLPESFDWREKGAVTEVKTQGR 188
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TGA+EGA+F++TG+L+SLSEQQLVDCDH CD +E CD GC+GGLM +AF
Sbjct: 189 CGSCWAFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDLKEKDDCDDGCSGGLMTTAF 248
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y+++AGG+E E YPYTG G CKF+ K+A V NF+ I DE Q+AAN+V +GPLA
Sbjct: 249 NYLIEAGGIEEEVTYPYTGKR-GECKFNPEKVAVKVRNFAKIPEDESQIAANVVHNGPLA 307
Query: 278 GNVASIELPHISFSFLFTVSSP 299
+ ++ + +++ VS P
Sbjct: 308 IGLNAVFMQ----TYIGGVSCP 325
>gi|24417396|gb|AAN60308.1| unknown [Arabidopsis thaliana]
Length = 193
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 132/197 (67%), Positives = 155/197 (78%), Gaps = 6/197 (3%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
M+RL L S+ +L VL S+ VND DD +IRQVV +E +L +E HFSLFK
Sbjct: 1 MDRLKLYFSVFVLSFFIVLVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 179 GELVSLSEQQLVDCDHE 195
G+LVSLSEQQLVDCDH+
Sbjct: 177 GKLVSLSEQQLVDCDHQ 193
>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
Length = 321
Score = 260 bits (665), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 121/232 (52%), Positives = 160/232 (68%), Gaps = 2/232 (0%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
+ E F +F K+ K Y+++EE+ +R +F N+ RA Q LDP A+HGVT FSDL+
Sbjct: 1 MGGEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSE 60
Query: 107 SEFRRQFLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G+ R + A+ A L + LP FDWR+ GAVT VK QG CGSCW+FS
Sbjct: 61 EEFERMFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFS 120
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGA+EGAHF+ST +L++LSEQQLVDCDH CD + +CDSGC GGLM +A++Y+++AGG
Sbjct: 121 TTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGG 180
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+E E YPYTG G CKF ++A V NF+ + BE+Q+AANLV HGPLA
Sbjct: 181 LEEESSYPYTGKH-GECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLA 231
>gi|255585361|ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
gi|223526784|gb|EEF29008.1| cysteine protease, putative [Ricinus communis]
Length = 381
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 172/253 (67%), Gaps = 6/253 (2%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPS 107
N E +F +F K+ K Y T+EE+ +R VF NL RA Q+LDPTAVHG+T F DLT
Sbjct: 62 NTEENFKMFMIKYDKEYDTREEYMHRLGVFAKNLIRAAEHQVLDPTAVHGITPFMDLTEE 121
Query: 108 EFRRQFLGLNRRLRLPADAQKAP-ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF R + G+ + A+ A L T LP+ FDWR GAVT VK QGACGSCW+FS
Sbjct: 122 EFERMYTGVVGGGAVGAEGVTATSFLETAGLPSSFDWRKKGAVTDVKMQGACGSCWAFST 181
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGA+EGA+F++TG+L++LSEQQLVDCD CD +E +CD GC GGLM +A+ Y+++AGG+
Sbjct: 182 TGAIEGANFIATGKLLNLSEQQLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIEAGGL 241
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
E E YPYTG G CKFD+ KIA V NF+ I DE+Q+AA+LV HGPLA + ++ +
Sbjct: 242 EDEISYPYTGKP-GKCKFDEKKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQ 300
Query: 287 HISFSFLFTVSSP 299
+++ VS P
Sbjct: 301 ----TYIGGVSCP 309
>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
Length = 257
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 122/172 (70%), Positives = 143/172 (83%), Gaps = 5/172 (2%)
Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
KAPILPT+DLP DFDWR+ GAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQ
Sbjct: 15 KAPILPTSDLPDDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQ 74
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
QLVDCDHECD E+ CD+GC GGLM +AFEY LKAGG++REKDYPYTG D G C FDKS
Sbjct: 75 QLVDCDHECDAEQQNECDAGCGGGLMTTAFEYTLKAGGLQREKDYPYTGRD-GKCHFDKS 133
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
KIAA+V+NFSV+ DEDQ+AANLVKHGPLA + + + +++ VS P
Sbjct: 134 KIAASVANFSVVGLDEDQIAANLVKHGPLAVGINAAWMQ----TYVGGVSCP 181
>gi|53748485|emb|CAH59428.1| cysteine protease 2 [Plantago major]
Length = 245
Score = 257 bits (656), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 122/177 (68%), Positives = 149/177 (84%), Gaps = 6/177 (3%)
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
AD KAP LPT++LP +FDWR+ GAVT VK+QG+CGSCWSFS TGALEGA++L+TGEL+S
Sbjct: 1 ADENKAPKLPTSNLPEEFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGELIS 60
Query: 184 LSEQQLVDCDHECDPEESG-SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
LSEQQLVDCDHECDPEE SCD+GCNGGLMN+AFEY LKAGG+++EKDYPYTG D G+C
Sbjct: 61 LSEQQLVDCDHECDPEEGADSCDAGCNGGLMNNAFEYALKAGGLQKEKDYPYTGKD-GTC 119
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
KFDK+KIAA+V NFSV+S DEDQ+AANLVK+GPLA + + + +++ VS P
Sbjct: 120 KFDKTKIAASVHNFSVVSIDEDQIAANLVKYGPLAVGINAAWMQ----TYIGGVSCP 172
>gi|357473429|ref|XP_003606999.1| Cysteine proteinase [Medicago truncatula]
gi|355508054|gb|AES89196.1| Cysteine proteinase [Medicago truncatula]
Length = 210
Score = 256 bits (654), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 124/196 (63%), Positives = 152/196 (77%), Gaps = 6/196 (3%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+ L ++L + SV A + +D +IRQVV +G + L AEHHF+LFK KF
Sbjct: 1 MDHRTLLLFVVLFIFSVSAFSTPDEGEDPIIRQVVDEEGVR-----LGAEHHFNLFKHKF 55
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y++++EHDYRF++FK+NL RAKR QL+DP+AVHGVT+FSDLTP EFR+ LGL R +
Sbjct: 56 GKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKSVLGL-RGV 114
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
LP DA APILPT++LP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGAHFLSTG+
Sbjct: 115 GLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGAHFLSTGK 174
Query: 181 LVSLSEQQLVDCDHEC 196
LVSLSEQQLVDCDHE
Sbjct: 175 LVSLSEQQLVDCDHEV 190
>gi|357473651|ref|XP_003607110.1| Cysteine proteinase [Medicago truncatula]
gi|355508165|gb|AES89307.1| Cysteine proteinase [Medicago truncatula]
Length = 331
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 136/284 (47%), Positives = 177/284 (62%), Gaps = 42/284 (14%)
Query: 2 ERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
+ +L S+L L S LA ++ + +D +I+QVV G AE+ F+ FK +F
Sbjct: 6 QTFMLFSVLFLFFSVDLAFSMPKDREDPIIQQVVDKGG---------AEYQFNEFKQRFG 56
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K Y++++EHDYRF VFK+NL RAKR ++DP+A HGVT+FSDLTP EFR LGL + +
Sbjct: 57 KVYSSKDEHDYRFNVFKSNLHRAKRHGIMDPSATHGVTRFSDLTPREFRNSILGL-KGVG 115
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
LP A+ APIL T +LP DFDWR+ GAVT V++QG CGS WSFS GALEGAHFLS+GEL
Sbjct: 116 LPRHAKAAPILSTENLPRDFDWREKGAVTPVRNQGFCGSSWSFSTIGALEGAHFLSSGEL 175
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQ VDCDH EYI K GG+ R +DY Y T+
Sbjct: 176 VSLSEQHHVDCDH-----------------------EYIQKYGGLMRVEDYTYYKTNTAR 212
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ +NFS IS D++Q+ ANLVKHGPLA + ++ +
Sbjct: 213 ---------SVAANFSSISVDDNQITANLVKHGPLAAAINAVYM 247
>gi|5679322|gb|AAD46920.1|AF167986_1 putative cysteine proteinase GmPM33 [Glycine max]
Length = 363
Score = 253 bits (646), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 135/300 (45%), Positives = 187/300 (62%), Gaps = 33/300 (11%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ L+ + L L + L++A + R++ D E LL E F +F + ++
Sbjct: 10 MCLARVSLFLCALTLSAAHGSTTVQDIARKLKLGDNE-----LLRTEKKFKVFMENYGRS 64
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
Y+T+EE+ R +F N+ RA Q LDPTAVHGVT+FS LP
Sbjct: 65 YSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFS-------------------LP 105
Query: 124 ----ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
A AP L + LP +FDWR+ GAVT VK QG CGSCW+FS TG++EGA+FL+TG
Sbjct: 106 VSNNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATG 165
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+LVSLS+QQL+DCD++CD E SCD+GCNGGLM +A+ Y+L++GG+E E YPYTG +
Sbjct: 166 KLVSLSDQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-ER 224
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
G CKFD KIA ++NF+ I +DE+Q+AA LVK+GPLA V +I + +++ VS P
Sbjct: 225 GECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQ----TYIGGVSCP 280
>gi|357473731|ref|XP_003607150.1| Cysteine proteinase [Medicago truncatula]
gi|355508205|gb|AES89347.1| Cysteine proteinase [Medicago truncatula]
Length = 326
Score = 252 bits (643), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 138/275 (50%), Positives = 177/275 (64%), Gaps = 44/275 (16%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
L+L S+L L S LA + + +D +I+QVV G AEH F+ FK +F K
Sbjct: 7 LMLFSVLFLFFSVDLAFSTPNDREDPIIQQVVDKGG---------AEHQFNEFKQRFGKV 57
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
Y++++EHDYRF VFK+NL RAKR ++DP+A HGVT+FSDLTP EFR LGL + + LP
Sbjct: 58 YSSKDEHDYRFNVFKSNLHRAKRHVIMDPSATHGVTRFSDLTPREFRNSILGL-KGVGLP 116
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
A+ APIL + +LP DFDWR+ GAVT V++QG CGS WSFS GALEGA+FLSTGELVS
Sbjct: 117 RHAKAAPILSSENLPRDFDWREKGAVTPVRNQGFCGSSWSFSTIGALEGANFLSTGELVS 176
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LS+QQ VDCDH EYI K+GG+ R +DY Y
Sbjct: 177 LSDQQHVDCDH-----------------------EYIKKSGGLMRVEDYTYY-------- 205
Query: 244 FDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLA 277
K+ IA +V +NFS + D+DQ+AANL+K+GPLA
Sbjct: 206 --KTNIARSVAANFSSVLVDDDQIAANLLKYGPLA 238
>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
Length = 343
Score = 247 bits (631), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 191/302 (63%), Gaps = 30/302 (9%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L+ +L+ LL V+ + + D IRQV +D + +D E HF F KF K Y
Sbjct: 5 LAIILVGLLILVVCCSSSNRLDIGKIRQV--TDNLEVKD----VEGHFKHFMQKFGKVYG 58
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T EE+ +R +VF+ANL + DPTA+HG+T F+DLTP E R FLG R+
Sbjct: 59 TTEEYVHRLKVFQANLAHVMSLKKQDPTAIHGITSFADLTPEELSR-FLGF-RKAYSNRV 116
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+AP+LPT++LP FDWR+HGAVT VK QG CGSCW+FS TG +EGA+FL TG+L+SLS
Sbjct: 117 VNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANFLKTGKLISLS 176
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD------G 239
E+QL+DCD++ D+GC GG M SA+EY+ KA G+E E+DYPY
Sbjct: 177 EEQLIDCDYK---------DNGCEGGDMLSAYEYV-KARGLEAEEDYPYEELGYRHKPVR 226
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL-PHISFSFLFTVSS 298
G C++ SK+ A ++N+S +S DEDQ+AANLVK+GPL SI L ++ F++ V+
Sbjct: 227 GPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPL-----SIALRGNVLFTYEGGVAC 281
Query: 299 PK 300
P+
Sbjct: 282 PR 283
>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
Length = 343
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/302 (45%), Positives = 191/302 (63%), Gaps = 30/302 (9%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L+ +L+ LL V+ + + D IRQV +D + +D E HF F KF K Y
Sbjct: 5 LAIILVGLLILVICCSSSNRLDIGKIRQV--TDNLEVDD----VEGHFKHFMQKFGKVYG 58
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T EE+ +R +VF+ANL + DPTA+HG+T F+DLTP E R FLG R+
Sbjct: 59 TTEEYVHRLKVFQANLVHVMSLKKQDPTAIHGITSFADLTPEELSR-FLGF-RKAYSNRV 116
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+AP+LPT++LP FDWR+HGAVT VK QG CGSCW+FS TG +EGA+FL TG+L+SLS
Sbjct: 117 VNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANFLKTGKLISLS 176
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD------G 239
E+QL+DCD++ D+GC GG M SA+EY+ KA G+E ++DYPY
Sbjct: 177 EEQLIDCDYK---------DNGCEGGDMLSAYEYV-KARGLEADEDYPYEELGYRHKPVR 226
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL-PHISFSFLFTVSS 298
G C++ SK+ A ++N+S +S DEDQ+AANLVK+GPL SI L ++ F++ V+
Sbjct: 227 GPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPL-----SIALRGNVLFTYEGGVAC 281
Query: 299 PK 300
P+
Sbjct: 282 PR 283
>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
Length = 709
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 137/264 (51%), Positives = 170/264 (64%), Gaps = 21/264 (7%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
IRQV +DG LL E F+ F + + Y+ EE+ R RVF ANL RA Q L
Sbjct: 29 IRQV--TDGGYWPPGLL-PEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQAL 85
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLGLN--------RRLRLP-ADAQKAPILPTNDLPTDF 141
DPTA HGVT FSDLT EF + GL RR RLP A A + LP+ F
Sbjct: 86 DPTARHGVTPFSDLTREEFEARLTGLATDVGDDDVRRRRLPMPSAAPATEEEVSGLPSSF 145
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRD GAVTGVK QGACGSCW+FS TGA+EGA+FL+TG L+ LSEQQLVDCDH CD E+
Sbjct: 146 DWRDRGAVTGVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKK 205
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS- 260
CDSGC GGLM +A+ Y++ +GG+ + YPYTG G+C+FD +++A V+NF+V++
Sbjct: 206 TECDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ-GACRFDANRVAVRVANFTVVAP 264
Query: 261 ------SDED-QMAANLVKHGPLA 277
+D D QM A LV+HGPLA
Sbjct: 265 AAGPGGNDGDAQMRAALVRHGPLA 288
>gi|1619905|gb|AAB16997.1| thiol protease isoform A, partial [Glycine max]
Length = 318
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 119/180 (66%), Positives = 143/180 (79%), Gaps = 7/180 (3%)
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+R PA AQKAPILPT DLP DFDWRD GAVT VKD G CGSCWSFS TGALE + +L+TG
Sbjct: 71 VRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDLGGCGSCWSFSTTGALEVSFYLATG 130
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
ELVSLSEQQLVDCDH CDPEE G+CDSGCNGGLMN+AFE IL++GGV++EKD PYTG D
Sbjct: 131 ELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE-ILQSGGVQKEKDIPYTGRD- 188
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
G+CKFDK+K+ AA +S DE+Q+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 189 GTCKFDKTKV-AATDLIKRVSLDEEQIAANLVKNGPLAVAINAVFMQ----TYVGGVSCP 243
>gi|242045644|ref|XP_002460693.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
gi|241924070|gb|EER97214.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
Length = 373
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 132/269 (49%), Positives = 168/269 (62%), Gaps = 21/269 (7%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHL----LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
+ DD IRQV +DG +S L E F+ F + + Y+ EE+ R RVF AN
Sbjct: 19 STDDGFIRQV--TDGRRSRAGAGALGLLPEAQFAAFVRRHGRRYSGPEEYARRLRVFAAN 76
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK-----APILP-- 133
L RA Q LDPTA HGVT FSDLT EF + G+ R D Q+ AP P
Sbjct: 77 LARAAAHQALDPTARHGVTPFSDLTREEFEARLTGV--RAGAGGDVQRLVMSGAPAAPPA 134
Query: 134 ----TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
+ LP FDWRD GAVTGVK QGACGSCW+FS TGA+EGA+FL+TG+L+ LSEQQL
Sbjct: 135 SQEEVSRLPASFDWRDKGAVTGVKMQGACGSCWAFSTTGAVEGANFLATGKLLELSEQQL 194
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCDH C C++GC GGLM +A+ Y++K+GG+ ++ YPYTG G C+FD +K
Sbjct: 195 VDCDHTCSAVAQNECNNGCAGGLMTNAYAYLMKSGGLMEQRAYPYTGAP-GPCRFDPAKA 253
Query: 250 AAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
A V+NF+ + + DE Q+ A LV+ GPLA
Sbjct: 254 AVRVANFTAVPAGDEAQIRAALVRRGPLA 282
>gi|222637029|gb|EEE67161.1| hypothetical protein OsJ_24244 [Oryza sativa Japonica Group]
Length = 309
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 134/261 (51%), Positives = 168/261 (64%), Gaps = 19/261 (7%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
IRQV +DG LL E F+ F + + Y+ EE+ R RVF ANL RA Q L
Sbjct: 29 IRQV--TDGGYWPPGLL-PEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQAL 85
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLGLN-------RRLRLPADAQKAPILPTNDLPTDFDW 143
DPTA HGVT FSDLT EF + GL RR +P+ A A + LP FDW
Sbjct: 86 DPTARHGVTPFSDLTREEFEARLTGLAADVGDDVRRRPMPS-AAPATEEEVSGLPASFDW 144
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
RD GAVT VK QGACGSCW+FS TGA+EGA+FL+TG L+ LSEQQLVDCDH CD E+
Sbjct: 145 RDRGAVTDVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTE 204
Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS--- 260
CDSGC GGLM +A+ Y++ +GG+ + YPYTG G+C+FD +++A V+NF+V++
Sbjct: 205 CDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ-GTCRFDANRVAVRVANFTVVAPPG 263
Query: 261 -SDED---QMAANLVKHGPLA 277
+D D QM A LV+HGPLA
Sbjct: 264 GNDGDGDAQMRAALVRHGPLA 284
>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
Length = 381
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 128/262 (48%), Positives = 160/262 (61%), Gaps = 16/262 (6%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
DD IRQV L E F+ F + + Y+ +E+ R RVF ANL RA
Sbjct: 34 DDKFIRQVTTQGTRAGAGPGLLPEAQFAAFVRRHGRRYSGPKEYARRLRVFAANLARAAA 93
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK----APILP------TND 136
Q LDPTA HGVT FSDLT EF + GL R D Q+ P P
Sbjct: 94 HQALDPTARHGVTPFSDLTREEFEARLTGL----RAGGDVQRLMSGVPAAPPASKEEVAR 149
Query: 137 LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC 196
LP FDWRD GAVTGVK QGACGSCW+FS TGA+EGA+FL+TGELV LSEQQLVDCDH C
Sbjct: 150 LPASFDWRDKGAVTGVKTQGACGSCWAFSTTGAVEGANFLATGELVDLSEQQLVDCDHTC 209
Query: 197 DPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF 256
C++GC GGLM +A+ Y++++GG+ + YPYTG G C+FD +++A V+NF
Sbjct: 210 SAVAQNECNNGCAGGLMTNAYSYLMESGGLMEQSAYPYTGA-AGPCRFDPTQVAVRVANF 268
Query: 257 SVI-SSDEDQMAANLVKHGPLA 277
+ + + DE Q+ A LV+ GPLA
Sbjct: 269 TAVPAGDEAQIRAALVRRGPLA 290
>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 376
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 134/261 (51%), Positives = 168/261 (64%), Gaps = 19/261 (7%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
IRQV +DG LL E F+ F + + Y+ EE+ R RVF ANL RA Q L
Sbjct: 29 IRQV--TDGGYWPPGLL-PEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQAL 85
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLGLN-------RRLRLPADAQKAPILPTNDLPTDFDW 143
DPTA HGVT FSDLT EF + GL RR +P+ A A + LP FDW
Sbjct: 86 DPTARHGVTPFSDLTREEFEARLTGLAADVGDDVRRRPMPS-AAPATEEEVSGLPASFDW 144
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
RD GAVT VK QGACGSCW+FS TGA+EGA+FL+TG L+ LSEQQLVDCDH CD E+
Sbjct: 145 RDRGAVTDVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTE 204
Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS--- 260
CDSGC GGLM +A+ Y++ +GG+ + YPYTG G+C+FD +++A V+NF+V++
Sbjct: 205 CDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ-GTCRFDANRVAVRVANFTVVAPPG 263
Query: 261 -SDED---QMAANLVKHGPLA 277
+D D QM A LV+HGPLA
Sbjct: 264 GNDGDGDAQMRAALVRHGPLA 284
>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
distachyon]
Length = 373
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 134/268 (50%), Positives = 173/268 (64%), Gaps = 14/268 (5%)
Query: 19 ASAVAVNDDDAMIRQVV----PSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT-QEEHDYR 73
A+A A DD +IRQV P+ LL E F+ F + K Y+ EE+ R
Sbjct: 19 AAAGASGDD--VIRQVTDNGAPAARRPPSPGLL-PEAKFAAFVRRHGKEYSGGAEEYARR 75
Query: 74 FRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL---RLPADAQKAP 130
RVF ANL RA Q LDP A HGVT FSDLTP EF+ + GL ++ +PA A +A
Sbjct: 76 LRVFAANLARAAAHQALDPGARHGVTPFSDLTPEEFQARLTGLQQQGTNNNMPA-AARAT 134
Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
LP FDWR GAVT VK QG CGSCW+FS TGA+EGAHF++TG+L++LSEQQLV
Sbjct: 135 AEELATLPASFDWRAKGAVTEVKMQGMCGSCWAFSTTGAVEGAHFVATGKLLNLSEQQLV 194
Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
DCDH CD CDSGC+GGLM +A+ Y+++AGG+ + YPYTG G+C+FD +K+A
Sbjct: 195 DCDHTCDAVAKNECDSGCSGGLMTNAYTYLIRAGGLMEQAAYPYTGAQ-GTCRFDANKVA 253
Query: 251 AAVSNFSVISS-DEDQMAANLVKHGPLA 277
V++F+ + DEDQ+ A+LV+ GPLA
Sbjct: 254 VRVTSFTAVPPDDEDQIRASLVRAGPLA 281
>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
Length = 384
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 105/144 (72%), Positives = 123/144 (85%), Gaps = 1/144 (0%)
Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
T+ LP DFDWR+HGAV VKDQG+CGSCWSFS +GALEGAHFL+TG+L LSEQQ+VDCD
Sbjct: 145 TDGLPDDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCD 204
Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
HECD ES +CDSGCNGGLM +AF Y++K+GG++ EKDYPY G + +CKFDKSKI A V
Sbjct: 205 HECDASESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRE-NTCKFDKSKIVAQV 263
Query: 254 SNFSVISSDEDQMAANLVKHGPLA 277
NFSVIS +EDQ+AANLVKHGPLA
Sbjct: 264 KNFSVISVNEDQIAANLVKHGPLA 287
>gi|194352748|emb|CAQ00102.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 126/254 (49%), Positives = 157/254 (61%), Gaps = 8/254 (3%)
Query: 30 MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
+IRQV S LL E F+ F + K Y+ EE+ R RVF AN+ RA Q
Sbjct: 28 VIRQVTDSGHGAGHPGLL-PEAQFAAFVRRHGKEYSGPEEYARRLRVFAANVARAAAHQA 86
Query: 90 LDPTAVHGVTKFSDLTPSEFRRQFLGL---NRRLRLPADAQKAPILPTND---LPTDFDW 143
LDP A HGVT FSDLT EF + GL LR A + LP FDW
Sbjct: 87 LDPGARHGVTPFSDLTREEFEARLTGLVGAGDVLRSARRMPAAAPATEEEVAALPASFDW 146
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
RD GAVT VK QG CGSCW+FS TGA+EGA+F++TG+L+ LSEQQLVDCDH CD
Sbjct: 147 RDKGAVTDVKMQGVCGSCWAFSTTGAVEGANFVATGKLLDLSEQQLVDCDHTCDAVAKTE 206
Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDE 263
C+SGC+GGLM +A+ Y++ +GG+ + YPYTG G C+FD+ K+A V+NF+ + DE
Sbjct: 207 CNSGCSGGLMTNAYRYLMSSGGLMEQAAYPYTGAQ-GPCRFDRGKVAVRVANFTAVPLDE 265
Query: 264 DQMAANLVKHGPLA 277
DQM A LV+ GPLA
Sbjct: 266 DQMRAALVRGGPLA 279
>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
Length = 293
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 117/203 (57%), Positives = 145/203 (71%), Gaps = 7/203 (3%)
Query: 81 LRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRRQFLG---LNRRLRLPADAQKAPI--LPT 134
L RA +Q D +A HGVT+FSDLTP EF ++LG L+ R A+ I LPT
Sbjct: 3 LIRAATQQANDRGSAKHGVTRFSDLTPEEFAERYLGHVKLSSEHREKVRARGGVIEDLPT 62
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
LP +FDWR GAV+ VKDQG CGSCW+FS TGA+EGAHF+STG+LV LSEQQL+DCD
Sbjct: 63 KHLPAEFDWRFKGAVSRVKDQGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLLDCDV 122
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
CDP+ +CDSGCNGGL ++A EYI++ GG++ EK YPY G + G CK D+ + A +
Sbjct: 123 GCDPDVPNACDSGCNGGLPSNAMEYIVEHGGIDTEKSYPYVG-EKGECKADEGTLGATLK 181
Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
NFS +SSDE QMAA LVKHGPL+
Sbjct: 182 NFSYVSSDEKQMAAALVKHGPLS 204
>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
Length = 347
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 117/235 (49%), Positives = 159/235 (67%), Gaps = 15/235 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F F K++K Y T EEH+ R+++FKAN+ +++ + G+TKFSDLTP EF+R
Sbjct: 33 FIKFSRKYAKVYGT-EEHNNRYQIFKANVEKSRYYNHVGKRENFGITKFSDLTPEEFKRM 91
Query: 113 FLGLNRRLRLPADAQKAPILPTNDL---------PTDFDWRDHGAVTGVKDQGACGSCWS 163
FL + P +A+K P + + PT FDWR HGAVT VK+QGACGSCW+
Sbjct: 92 FL---MKTYTPEEAKKILAAPQHAVLSEKEVQTAPTSFDWRQHGAVTRVKNQGACGSCWT 148
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYILK 222
FS TG +EG + G+LVSLSEQQLVDCDH C + +CDSGCNGGLM SAF+Y++K
Sbjct: 149 FSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQYVIK 208
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG++ E YPY G D +C+F+KS +AA +S+++ ISSDE+QMAA L +GP++
Sbjct: 209 NGGLDTEDSYPYEGVD-DTCRFNKSNVAATISSWTSISSDENQMAAWLAANGPIS 262
>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
Length = 356
Score = 226 bits (576), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 125/289 (43%), Positives = 174/289 (60%), Gaps = 30/289 (10%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M +LIL ++LL+ S +LA A +A+ +SE L F+ F+ K
Sbjct: 1 MNKLIL--VVLLVASFILAIEAAKGPFNAL---------PESEMQQL-----FTQFRRKH 44
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG----- 115
K Y T++ D R+++FK N+ RA+ L GVT+FSDLTP EF+ FL
Sbjct: 45 VKLYGTKQVQDRRYQIFKQNVERARFENYLTERDNMGVTRFSDLTPDEFKSMFLMKSYTP 104
Query: 116 ------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
L+ + PA+A K + +D P +FDWR+H AVT VKDQG CGSCW+FS TG
Sbjct: 105 KQARELLSGMRQYPANA-KLTMKQVSDAPKEFDWREHNAVTPVKDQGNCGSCWTFSTTGN 163
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEES-GSCDSGCNGGLMNSAFEYILKAGGVER 228
+EG + TG+L+SLSEQQLVDCDH C E +C++GCNGGLM S+FE+I+K GG+
Sbjct: 164 VEGMYAAKTGKLISLSEQQLVDCDHNCVVWEGEKTCNAGCNGGLMWSSFEHIIKTGGLVT 223
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY D C+F+ S +SN++ +SS+ED+MAA L +GP+A
Sbjct: 224 EESYPYEAVD-NRCRFNVSNAVVKISNWTFVSSNEDEMAAWLANNGPIA 271
>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 329
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 120/243 (49%), Positives = 153/243 (62%), Gaps = 21/243 (8%)
Query: 50 EHHFSLFKSKFSKTYATQ-EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSE 108
E F F + KTYA+ +E+ R +F N+ RAK D A +G T F+DLT E
Sbjct: 5 ERDFDAFVLEHGKTYASDAKEYAKRLEIFAENMARAKEMSARD-GAEYGATPFADLTEDE 63
Query: 109 FRRQFLGLNRRLRLPADAQKA------------PILPTNDLPTDFDWRDHGAVTGVKDQG 156
F L +R P DA + P LPT ++P +FDWR GAVT VK+QG
Sbjct: 64 FASSLL-----MREPIDAARVERLKRHESSRVLPHLPTENIPLNFDWRALGAVTPVKNQG 118
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCWSFSATGA+EGAHF+ +G LVSLSEQQLVDCDH CDP+ +CDSGC+GGL +A
Sbjct: 119 MCGSCWSFSATGAVEGAHFVKSGALVSLSEQQLVDCDHTCDPDSGTACDSGCDGGLPANA 178
Query: 217 FEYILKAGGVEREKDYPYTGTDG-GSCKF-DKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
Y++K GG++ E YPY G G G CK + AA ++N+S +S+DE Q+AA LVKHG
Sbjct: 179 MAYVVKRGGLDAEAAYPYLGARGDGRCKSKEDGPPAATITNYSFVSADESQIAAALVKHG 238
Query: 275 PLA 277
PL+
Sbjct: 239 PLS 241
>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
Length = 350
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 110/235 (46%), Positives = 151/235 (64%), Gaps = 15/235 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F F K +K Y E+H R+++FK+N+ +A+ + GV+KF DLTP EF+R
Sbjct: 36 FVKFSKKHAKLYGA-EDHGKRYQIFKSNVEKARYYNHVGKRETFGVSKFMDLTPEEFKRM 94
Query: 113 FLGLNRRLRLPADAQKAPILP---------TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
FL + P +A+K P D PT +DWR GAVT VK+QGACGSCW+
Sbjct: 95 FL---MKTYTPEEARKILAAPKEAVVTAQQVKDTPTSWDWRQKGAVTPVKNQGACGSCWT 151
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYILK 222
FS TG +EG H + TG+LVSLSEQQLVDCDH C + +CD+GCNGGLM SAF+Y++K
Sbjct: 152 FSTTGNVEGIHQIKTGKLVSLSEQQLVDCDHNCVTYQGQQACDAGCNGGLMWSAFQYVIK 211
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+ E YPY G D +C+F+KS +A +++++ I SDE +MAA L +GP++
Sbjct: 212 TGGLVTEDSYPYEGVD-DTCRFNKSNVAVTINSWTSIPSDEGKMAAWLAANGPIS 265
>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
Length = 465
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 118/244 (48%), Positives = 151/244 (61%), Gaps = 14/244 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANL-------RRAKRRQLLDPTAVHGVTKFS 102
E F F+ K++K Y T E+ RF FK+NL R A R+ + GV +F+
Sbjct: 25 ETQFRQFQIKYNKQY-TSSEYAERFATFKSNLKVIDEKNRDAASRK---SSVRFGVNEFA 80
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DL+ SEFR +L + +R P +A A LP DLPT FDWR GAVTGVK+QG CGSCW
Sbjct: 81 DLSQSEFRATYLNSVQAVRDP-NAAVAADLPVEDLPTAFDWRTKGAVTGVKNQGQCGSCW 139
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYIL 221
SFS TG +EG FL+ L LSEQ LVDCDHEC + CD GCNGGL +A+ YI+
Sbjct: 140 SFSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHECMEYLGDNVCDQGCNGGLQPNAYTYII 199
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
K GG++ E YPY G D G+C F + I A +SN++ +SS+E QMAA LV +GPLA
Sbjct: 200 KNGGIDTEASYPYQGVD-GTCSFKAANIGAKISNWTYVSSNETQMAAYLVANGPLAIAAD 258
Query: 282 SIEL 285
++E
Sbjct: 259 AVEW 262
>gi|353441042|gb|AEQ94105.1| putative drought-inducible cysteine proteinase [Elaeis guineensis]
Length = 187
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 115/175 (65%), Positives = 138/175 (78%), Gaps = 7/175 (4%)
Query: 11 LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHL-LNAEHHFSLFKSKFSKTYATQEE 69
+ L +SV +S + +DD +I QVVP E ED L LNAE HFS F +F K+YA ++E
Sbjct: 15 VALSASVASSWPSYAEDDPLIVQVVP---ESDEDELRLNAEAHFSSFLRRFGKSYADEKE 71
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP---ADA 126
H YRF VFKANLRRA+R Q +DPTAVHG+TKFSDLTP+EFRR +LGL RL A +
Sbjct: 72 HAYRFSVFKANLRRARRHQKMDPTAVHGITKFSDLTPAEFRRTYLGLRGGRRLRRALASS 131
Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
+APILPTN+LPTDFDWRDHGAVTGVKDQG+CGSCWSFSA+GALEGA+FL+TG+L
Sbjct: 132 HEAPILPTNNLPTDFDWRDHGAVTGVKDQGSCGSCWSFSASGALEGANFLATGQL 186
>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
Length = 353
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 104/236 (44%), Positives = 152/236 (64%), Gaps = 14/236 (5%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
HF F KF + Y EE++YR +VF+ N+ ++R + + +G+TKFSDLT EFR+
Sbjct: 36 HFLDFTRKFQRFYKGPEEYEYRLKVFRENIETSRRMNIREGNNNYGITKFSDLTSDEFRK 95
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL---------PTDFDWRDHGAVTGVKDQGACGSCW 162
+L + P + QK + +N + P +DWR+HGA+TGVKDQG CGSCW
Sbjct: 96 FYL---MEKKTPKEIQKMMRMDSNKMVSNSYAKPAPDHYDWRNHGAITGVKDQGQCGSCW 152
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP-EESGSCDSGCNGGLMNSAFEYIL 221
+FSA G++EG++ + +LVS SEQQLVDCD+ C E SCD GCNGGL SA++Y++
Sbjct: 153 AFSAIGSIEGSYAIKHKQLVSFSEQQLVDCDNNCVTFENQQSCDDGCNGGLQWSAYQYLM 212
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
KAGGV EKDYPY + C+ + A +SN++++S++E +MA L ++GP+A
Sbjct: 213 KAGGVVTEKDYPYYA-ERYKCEVKPANFVAKLSNWTMLSTNETEMANWLAENGPIA 267
>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
Length = 343
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/239 (46%), Positives = 145/239 (60%), Gaps = 10/239 (4%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFS 102
L + F F+ KF+K Y + EE+ RF +FK+NL + + L+ GV KF+
Sbjct: 23 LEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACG 159
DL+ EF+ +L N+ D A L N +PT FDWR GAVT VK+QG CG
Sbjct: 82 DLSSDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG 140
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFE 218
SCWSFS TG +EG HF+S +LVSLSEQ LVDCDHEC + E +CD GCNGGL +A+
Sbjct: 141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYN 200
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YI+K GG++ E YPYT G C F+ + I A +SNF++I +E MA +V GPLA
Sbjct: 201 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLA 259
>gi|255088003|ref|XP_002505924.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226521195|gb|ACO67182.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 291
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 110/199 (55%), Positives = 138/199 (69%), Gaps = 6/199 (3%)
Query: 84 AKRRQLLD-PTAVHGVTKFSDLTPSEFRRQFLGLNRR----LRLPADAQKAPILPTNDLP 138
A RQ D +AVHGVT+FSDLTP+EF FLG + + P P +DLP
Sbjct: 4 AAERQAQDRGSAVHGVTQFSDLTPTEFASTFLGTKLANEDVAAIRSGMTTLPDYPAHDLP 63
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
+FDWR+ GAVT VK+QGACGSCW+FSATGA+EGA+FL TGELVSLSEQQLVDCDH CDP
Sbjct: 64 LEFDWRERGAVTPVKNQGACGSCWTFSATGAVEGANFLKTGELVSLSEQQLVDCDHTCDP 123
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
+CD GCNGGL +A Y+ K G++ E +YPY G DG AA+VS+F++
Sbjct: 124 SAPRNCDYGCNGGLPLNAMRYVQKH-GLDTESNYPYKGVDGKCASARHGPAAASVSSFNL 182
Query: 259 ISSDEDQMAANLVKHGPLA 277
+S++E Q+AA L+KHGPL+
Sbjct: 183 VSTNETQIAAALLKHGPLS 201
>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
Length = 343
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 110/236 (46%), Positives = 144/236 (61%), Gaps = 10/236 (4%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLT 105
+ F F+ KF+K Y + EE+ RF +FK+NL + + L+ GV KF+DL+
Sbjct: 26 QSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
EF+ +L N+ D A L N +PT FDWR GAVT VK+QG CGSCW
Sbjct: 85 SDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCW 143
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYIL 221
SFS TG +EG HF+S +LVSLSEQ LVDCDHEC + E +CD GCNGGL +A+ YI+
Sbjct: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K GG++ E YPYT G C F+ + I A +SNF++I +E MA +V GPLA
Sbjct: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLA 259
>gi|412992445|emb|CCO18425.1| unknown [Bathycoccus prasinos]
Length = 500
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 106/229 (46%), Positives = 149/229 (65%), Gaps = 21/229 (9%)
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLD----PTAVHGVTKFSDLTPSEFRRQFLGL----- 116
T+EE++ R +F+ N +RA R++ D +A HGVTKF DL+ EFR Q+LGL
Sbjct: 188 TEEEYEKRMEIFQENWKRAIEREIDDRKGGGSAKHGVTKFFDLSEEEFREQYLGLLSTST 247
Query: 117 --------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R+ ++ A +++ LP +DWR GAVT VKDQG CGSCW+FS TG
Sbjct: 248 SSSASKDAFRKHQMEAPSEE----DLEKLPQYYDWRARGAVTPVKDQGQCGSCWTFSTTG 303
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EGA+F+ TG+LVSLSEQQL+DCD C P+ +CDSGCNGGL ++A EYI++ GG++
Sbjct: 304 AIEGANFIKTGKLVSLSEQQLLDCDVGCAPDIPNACDSGCNGGLPSNAMEYIVEHGGLDT 363
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
EK YPY +C+ + K+ A +SN++ + +E MA LVK+GPL+
Sbjct: 364 EKSYPYKAYKEDTCRAKEGKLGATISNYTFVGKNETHMAHALVKYGPLS 412
>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 272
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 107/205 (52%), Positives = 135/205 (65%), Gaps = 11/205 (5%)
Query: 101 FSDLTPSEFRRQFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
FSDLT EF ++LG R R + LP LP +FDWR GAVT VKD
Sbjct: 2 FSDLTAEEFAARYLGHVRLSSEEREKRKARGGETLETLPVEHLPEEFDWRFKGAVTRVKD 61
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG CGSCW+FS TGA+EGAHF+STG+LV LSEQQLVDCD CDP+ +CDSGCNGGL +
Sbjct: 62 QGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLVDCDVGCDPDVPNACDSGCNGGLPS 121
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
+A EYI++ GG++ EK YPY G + G CK K K+ A + NFS +S DE QMAA LVK+G
Sbjct: 122 NAMEYIVEHGGIDTEKSYPYVG-EKGECKAKKGKLGATLKNFSFVSDDEKQMAAALVKYG 180
Query: 275 PLAGNVASIELPHISFSFLFTVSSP 299
PL+ + + + S++ V+ P
Sbjct: 181 PLSIGINAAWMQ----SYIGGVACP 201
>gi|330792958|ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
Length = 346
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 149/237 (62%), Gaps = 12/237 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLT 105
+ F F+ K++K Y++ E+ +F FKANL + ++ +L GV +F+DL+
Sbjct: 26 QTQFVAFQQKYNKVYSS-NEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNEFADLS 84
Query: 106 PSEFRRQFLGLNRRLRLP-ADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSC 161
+EFR+ +L N ++ P A AP+L L PT FDWR GAVTGVK+QG CGSC
Sbjct: 85 AAEFRKYYL--NAQVAKPDASLPMAPLLTEEVLETIPTAFDWRTKGAVTGVKNQGQCGSC 142
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYI 220
WSFS TG +EG +L+ LV LSEQ LVDCDH+C + + SCD+GC+GGL +A+ Y+
Sbjct: 143 WSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQPNAYRYV 202
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++ GG++ E YPY G SCKF +AA +SNF++I +E QMA L HGPLA
Sbjct: 203 IENGGLDSENSYPYLAVTGDSCKFKSGNVAAKISNFTMIPQNETQMAGYLATHGPLA 259
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 205 bits (522), Expect = 2e-50, Method: Composition-based stats.
Identities = 108/236 (45%), Positives = 143/236 (60%), Gaps = 13/236 (5%)
Query: 45 HLLNAEHHFSLFKSKFSKTYAT-QEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
H L AEH F F S + Y + + RF +FK N+R+ + TA +GVT+F+
Sbjct: 2363 HHLQAEHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERGTATYGVTRFA 2422
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
DLT EF + +G+ LR P Q + ++P P FDWRDHGAVTGVKDQG+CGSC
Sbjct: 2423 DLTYEEFSTKHMGMKASLRDPNQVQFRKAVIPNVTAPDSFDWRDHGAVTGVKDQGSCGSC 2482
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG +EG + TG+LVSLSEQ+LVDCD D GCNGGL ++A+ I
Sbjct: 2483 WAFSVTGNIEGQWKMKTGDLVSLSEQELVDCD---------KLDQGCNGGLPDNAYRAIE 2533
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ GG+E E DYPY G+D C F+K+ +S I+S+E MA LVKHGP++
Sbjct: 2534 QLGGLESEDDYPYEGSD-DKCSFNKTLARVQISGAVNITSNETDMAKWLVKHGPIS 2588
>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
Length = 715
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 105/226 (46%), Positives = 148/226 (65%), Gaps = 12/226 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F++ F + Y +++E RF++F N+R+AK+ Q ++ TAV+GVTKF+D++ SEF+
Sbjct: 418 FQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFK- 476
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
Q++G +KA I N LP FDWR+HGAVT VK+QG+CGSCW+FS TG +E
Sbjct: 477 QYVGKVWDQNANKGMKKAKIPEMNSLPNSFDWREHGAVTEVKNQGSCGSCWAFSTTGNIE 536
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G +S +LVSLSEQ+LVDCD D GCNGGL + A++ I++ GG+E E D
Sbjct: 537 GQWAISKKKLVSLSEQELVDCD---------KVDEGCNGGLPSQAYKEIIRLGGLETETD 587
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y Y G + C DKSKI ++ ISS+E +MAA LVK+GP++
Sbjct: 588 YKYRGHN-EKCSMDKSKIRVKINGSVSISSNETEMAAWLVKNGPIS 632
>gi|5777611|emb|CAB53397.1| cysteine protease [Medicago sativa]
Length = 209
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 99/140 (70%), Positives = 114/140 (81%), Gaps = 10/140 (7%)
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGS W+FS TGALEGA++L+TG+LVSLSEQQLVDCDH CDPEE SCDSGCNGGLMN+AF
Sbjct: 1 CGSGWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPEERNSCDSGCNGGLMNNAF 60
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
EYIL++GGV EKDY YTG D GSCKFDKSKI A+VSNFSV+S DEDQ+AANLVK+GPLA
Sbjct: 61 EYILQSGGVVSEKDYAYTGRD-GSCKFDKSKIVASVSNFSVVSLDEDQIAANLVKNGPLA 119
Query: 278 GNV---------ASIELPHI 288
+ + + PHI
Sbjct: 120 VAINAAWMQTYMSGVSCPHI 139
>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
Length = 352
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 106/248 (42%), Positives = 152/248 (61%), Gaps = 26/248 (10%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLT 105
E F F++K++K Y+ EE+ +F FK+NL K+ + GV KF+DL+
Sbjct: 24 ESQFIAFQNKYNKIYSA-EEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLS 82
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILP--TNDL----PTDFDWRDHGA---------VT 150
EF++ +L ++ RL D P+LP ++D+ P FDWR+ G VT
Sbjct: 83 KEEFKKYYLS-SKEARLTDDL---PMLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVT 138
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCN 209
VK+QG CGSCWSFS TG +EG H+LSTG LV LSEQ LVDCDH C E C++GC+
Sbjct: 139 AVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCD 198
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGL +A+ YI+K GG++ E YPYT D G CKF+ +++ A +S+F+++ +E Q+A+
Sbjct: 199 GGLQPNAYNYIIKNGGIQTEATYPYTAVD-GECKFNSAQVGAKISSFTMVPQNETQIASY 257
Query: 270 LVKHGPLA 277
L +GPLA
Sbjct: 258 LFNNGPLA 265
>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
Length = 586
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 101/239 (42%), Positives = 146/239 (61%), Gaps = 11/239 (4%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
D L + F F +K Y + EE RFR+F AN+++ K Q + +A++G T+F+
Sbjct: 271 DDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFA 330
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DLT +EF++++LGL+ + A I + +P +FDWR+H VT VK+QGACGSCW
Sbjct: 331 DLTKNEFKKKYLGLDSSMTSKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCW 390
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA +EG + L + EL+SLSEQ+L+DCD+ D+GC GGLM AFE +
Sbjct: 391 AFSAIANIEGQYALKSKELLSLSEQELIDCDN---------LDNGCGGGLMTQAFEAVEN 441
Query: 223 AGGVEREKDYPYTG-TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
GG+E E DYPY G D C+ KS + ++S +S+DE+ +A LVKHGPL+ V
Sbjct: 442 LGGLETESDYPYEGHADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGV 500
>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
Length = 586
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 101/239 (42%), Positives = 146/239 (61%), Gaps = 11/239 (4%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
D L + F F +K Y + EE RFR+F AN+++ K Q + +A++G T+F+
Sbjct: 271 DDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFA 330
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DLT +EF++++LGL+ + A I + +P +FDWR+H VT VK+QGACGSCW
Sbjct: 331 DLTKNEFKKKYLGLDSSMTSKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCW 390
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA +EG + L + EL+SLSEQ+L+DCD+ D+GC GGLM AFE +
Sbjct: 391 AFSAIANIEGQYALKSKELLSLSEQELIDCDN---------LDNGCGGGLMTQAFEAVEN 441
Query: 223 AGGVEREKDYPYTG-TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
GG+E E DYPY G D C+ KS + ++S +S+DE+ +A LVKHGPL+ V
Sbjct: 442 LGGLETESDYPYEGHADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGV 500
>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
Length = 474
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 104/227 (45%), Positives = 144/227 (63%), Gaps = 11/227 (4%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F ++++TY++QEE D R RVF NL+ A++ Q LD TA +GVTKFSDLT EFR
Sbjct: 175 QFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVTKFSDLTEEEFR 234
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L + + K +P P +DWR+HGAV+ VK+QG CGSCW+FS TG +
Sbjct: 235 TLYLNPLLSQQNLQQSMKPAAMPRGPAPPSWDWREHGAVSPVKNQGMCGSCWAFSVTGNI 294
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG F TG+LVSLSEQ+LVDCD + D C GGL ++A+E I K GG+E E
Sbjct: 295 EGQWFAKTGKLVSLSEQELVDCD---------TVDQACGGGLPSNAYEAIEKLGGLETET 345
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY YTG SC F K+ A +++ +S+DE+++AA L ++GP++
Sbjct: 346 DYSYTGKK-QSCDFTTDKVIAYINSSVELSTDENEIAAWLAENGPVS 391
>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
Length = 475
Score = 199 bits (505), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 106/239 (44%), Positives = 150/239 (62%), Gaps = 12/239 (5%)
Query: 40 EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGV 98
E++ED + F F ++++TY++QE+ D R R+F NL+ A++ Q LD TA +GV
Sbjct: 165 EETED-FVELLGQFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGTAEYGV 223
Query: 99 TKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGAC 158
TKFSDLT EFR +L + + K +P P +DWR+HGAV+ VK+QG C
Sbjct: 224 TKFSDLTEEEFRTLYLNPLLSQQKLQRSMKPAAMPHGPAPPSWDWREHGAVSPVKNQGMC 283
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+FS TG +EG F+ TG+LVSLSEQ+LVDCD + D C GGL ++A+E
Sbjct: 284 GSCWAFSVTGNIEGQWFVKTGKLVSLSEQELVDCD---------TADQACGGGLPSNAYE 334
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I K GGVE E DY YTG SC F K+ A +++ +S DE+++AA L ++GP++
Sbjct: 335 AIEKLGGVETETDYSYTGKK-QSCDFTTDKVTAYINSSVELSKDENEIAAWLAENGPVS 392
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/257 (42%), Positives = 151/257 (58%), Gaps = 16/257 (6%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLT 105
L E F F K+ K Y +EE + RF++FK NL + Q + T +GVT+F+DLT
Sbjct: 725 LKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVTQFTDLT 784
Query: 106 PSEFRRQFLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+EF+ + LGL L+ D +P +LP+D+DWR H VT VKDQG+CGSCW+F
Sbjct: 785 KAEFKARHLGLKPTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG + + GEL+SLSEQ+LVDCD DSGCNGGL ++A+ I + G
Sbjct: 845 SVTGNIEGQYAIKHGELLSLSEQELVDCD---------KLDSGCNGGLPDTAYRAIEELG 895
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA----GNV 280
G+E E DYPY D C F+K+K+ + + I+S+E QMA LVK+GP++ N
Sbjct: 896 GLELESDYPYDAED-EKCHFNKNKVKVNIVSGLNITSNETQMAQWLVKNGPMSIGINANA 954
Query: 281 ASIELPHISFSFLFTVS 297
+ +S F F S
Sbjct: 955 MQFYMGGVSHPFKFLCS 971
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/238 (45%), Positives = 144/238 (60%), Gaps = 15/238 (6%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ-LLD---PTAVHGVTK 100
H L+ + + FK + +K+Y E RF +F+ +LR+ + D T GVTK
Sbjct: 15 HALSDKEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTK 74
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
F+DLT EF LG++R + + P DLP+ FDWR+ GAVT VKDQG+CGS
Sbjct: 75 FADLTEKEFS-DMLGISRSTKSSRPRVIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCGS 133
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CWSFS TG +EGA+FL TG+LVSLSEQ LVDC E C GC+GG M+ A EYI
Sbjct: 134 CWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKE-------DC-YGCSGGYMDKALEYI 185
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
AGG+ E DYPY G D C+FD SK+AA +SNF+ I +DED + ++ GP++
Sbjct: 186 ETAGGIMSENDYPYEGID-DKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPIS 242
>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
pulchellus]
Length = 475
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 108/228 (47%), Positives = 152/228 (66%), Gaps = 13/228 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FS+F ++KTY +EEH+ RF +FK NL+R A +L + TA +G+T+FSDL+PSEF R
Sbjct: 166 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 225
Query: 112 QFLGLNRRL-RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+LGL + L A+ + + P N+ LP FDWR GAVT VK+QG CGSCW+FS TG
Sbjct: 226 HYLGLKKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSVTGN 285
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG FLS +L+SLSEQ+LVDCDH D GC GG M A + +++ GG+E E
Sbjct: 286 VEGQWFLSRSKLLSLSEQELVDCDHG---------DHGCKGGYMGQAMKAVIEMGGLETE 336
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+YPY G D G+C+F+K++ A V +F + +E ++A L+KHGP++
Sbjct: 337 SEYPYKGVD-GTCEFNKTESKARVQSFVGLPQNETELAYWLMKHGPVS 383
>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
Length = 463
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 110/259 (42%), Positives = 155/259 (59%), Gaps = 18/259 (6%)
Query: 22 VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
V + D + +Q VPS + ED +L F F + ++K Y+ QEE R ++F NL
Sbjct: 137 VELTDTETSQKQNVPSS--ELEDEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNL 194
Query: 82 RRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLPADAQKAPILPTNDLP 138
++A+ Q +D TA +GVTK+SDLT EFR +L L+ + P K I+P P
Sbjct: 195 KKAQMIQEMDQGTAEYGVTKYSDLTEDEFRSLYLNPLLSSK---PLYQMKKAIVPNMSAP 251
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
+DWRDHGAVT VK+QG CGSCW+FS G +EG FL G LVSLSEQ+LVDCD
Sbjct: 252 DQWDWRDHGAVTEVKNQGMCGSCWAFSVIGNIEGQWFLKKGSLVSLSEQELVDCD----- 306
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
D C GGL ++A+E I K GG+E E++Y Y G +C F SK++A +++
Sbjct: 307 ----GVDHACAGGLPSNAYEAIEKLGGIETEQEYSYEGHK-NTCSFSTSKVSAYINSSVE 361
Query: 259 ISSDEDQMAANLVKHGPLA 277
I DE+++AA L ++GP++
Sbjct: 362 IPKDENEIAAWLAQNGPIS 380
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 110/236 (46%), Positives = 144/236 (61%), Gaps = 16/236 (6%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFS 102
LN + + FK K +K+Y + E RFR+F+ NLR+ + + + T GVTKF+
Sbjct: 17 LNDKEEWVQFKVKNNKSYKSYVEEQTRFRIFQENLRKIENHNEKYNNGESTFKFGVTKFT 76
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DLT EF L L++ R + P DLP+ FDWRD GAVT VKDQG CGSCW
Sbjct: 77 DLTEKEFL-DLLVLSKNARPNRTHATHLLAPLRDLPSAFDWRDKGAVTEVKDQGMCGSCW 135
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG++E AHFL TG LVSLSEQ LVDC + +C GC GG M+ A EYI K
Sbjct: 136 TFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKD-------TC-YGCGGGWMDKALEYIEK 187
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
GG+ EKDYPY G D +C+FD SK+AA +SNF+ I +DE+ + + GP++
Sbjct: 188 -GGIMSEKDYPYEGVD-DNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPIS 241
>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
Length = 803
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 140/217 (64%), Gaps = 11/217 (5%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
++Y T EE RFR+F+AN+++A Q + TA +GVT FSD++ EF++ +LGL +R
Sbjct: 509 RSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYGVTIFSDISSKEFKKHYLGLKKRT 568
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
Q+ +P LP ++DWR++ AVT VK+QG CGSCW+FS TG +EG + + TG
Sbjct: 569 PDIKFKQEMAQIPNITLPEEYDWRNYNAVTPVKNQGMCGSCWAFSVTGNIEGQYAIKTGN 628
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQ+LVDCD D GC GGL +A+ I + GG+E E DYPY+G D
Sbjct: 629 LVSLSEQELVDCDKY---------DDGCEGGLFETAYHAIEELGGLELESDYPYSGRD-N 678
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+C F+ S++ ++++ IS+DE MA LV +GP++
Sbjct: 679 TCHFNSSEVRVSITSSVNISNDETDMAKWLVANGPIS 715
>gi|296085959|emb|CBI31400.3| unnamed protein product [Vitis vinifera]
Length = 257
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 93/174 (53%), Positives = 124/174 (71%), Gaps = 5/174 (2%)
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A+ A L + LP FDWR+ GAVT VK QG CGSCW+FS TGA+EGAHF+ST +L++LS
Sbjct: 6 AETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLLTLS 65
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDH CD + +CDSGC GGLM +A++Y+++AGG+E E YPYTG G CKF
Sbjct: 66 EQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEEESSYPYTGKH-GECKFK 124
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
++A V NF+ + +E+Q+AANLV HGPLA + +I + +++ VS P
Sbjct: 125 PDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQ----TYIGGVSCP 174
>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 107/234 (45%), Positives = 143/234 (61%), Gaps = 25/234 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F K++K Y++Q+E D R +F NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 176 QFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFR 235
Query: 111 RQFLG-------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+L L+R ++ PA K P P +DWRDHGAV+ VK+QG CGSCW+
Sbjct: 236 STYLNPLLSQWTLHRPMK-PASPAKGPA------PASWDWRDHGAVSSVKNQGMCGSCWA 288
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +EG FL G LVSLSEQ+LVDCD D CNGGL ++A+E I K
Sbjct: 289 FSVTGNIEGQWFLKNGTLVSLSEQELVDCD---------GLDQACNGGLPSNAYEAIEKL 339
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E DY Y G SC F K+AA +++ +S DE ++AA L ++GP++
Sbjct: 340 GGLETETDYSYIGKK-QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVS 392
>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 107/234 (45%), Positives = 143/234 (61%), Gaps = 25/234 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F K++K Y++Q+E D R +F NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 176 QFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFR 235
Query: 111 RQFLG-------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+L L+R ++ PA K P P +DWRDHGAV+ VK+QG CGSCW+
Sbjct: 236 STYLNPLLSQWTLHRPMK-PASPAKGPA------PASWDWRDHGAVSSVKNQGMCGSCWA 288
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +EG FL G LVSLSEQ+LVDCD D CNGGL ++A+E I K
Sbjct: 289 FSVTGNIEGQWFLKNGTLVSLSEQELVDCD---------GLDQACNGGLPSNAYEAIEKL 339
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E DY Y G SC F K+AA +++ +S DE ++AA L ++GP++
Sbjct: 340 GGLETETDYSYIGKK-QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVS 392
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 105/233 (45%), Positives = 141/233 (60%), Gaps = 18/233 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSE 108
E F F KF KTY + +E RF++FK NL+ + Q + TA +GVT F+DLTP E
Sbjct: 576 ETLFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFADLTPKE 635
Query: 109 FRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
F+ ++LGL L+ + P+ +P LP FDWRDH VT VKDQG CGSCW+F
Sbjct: 636 FKARYLGLRPELK---HENEIPLPEAEIPDVSLPLKFDWRDHSVVTPVKDQGQCGSCWAF 692
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG + + +L+SLSEQ+LVDCD S D GCNGG M +A++ I + G
Sbjct: 693 SVTGNVEGQYAIKHNQLLSLSEQELVDCD---------SLDEGCNGGDMENAYKAIERLG 743
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G+E E DYPY D C F ++K V + I+SDE +MA LVK+GP++
Sbjct: 744 GLELESDYPYDAKD-EKCHFLQNKAKVQVVSAVNITSDEKRMAQWLVKNGPIS 795
>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
rotundata]
Length = 884
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 105/242 (43%), Positives = 150/242 (61%), Gaps = 19/242 (7%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHG 97
E +D LL F F ++KTY + +E R++VF+ NL+ ++ R+ TAV+G
Sbjct: 570 AEDYKDELL-----FEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQGTAVYG 624
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADA--QKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
VT F+DLTP EF+ ++LGL L D Q+A ++P DLP FDWR++ AVT VKDQ
Sbjct: 625 VTMFADLTPEEFKTKYLGLKTNLNQENDIPLQEA-VIPDIDLPPKFDWREYNAVTPVKDQ 683
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FSA G +EG + + +L+SLSEQ+LVDCD+ D GC GG M +
Sbjct: 684 GQCGSCWAFSAIGNIEGQYAIKHKKLLSLSEQELVDCDN---------LDDGCGGGYMIN 734
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
A++ + K GG+E E DYPY + C F K+K V++ I++DE +MA LVK+GP
Sbjct: 735 AYKTVEKLGGLELETDYPYDARN-EKCHFLKNKAKVQVASALNITNDEKKMAQWLVKNGP 793
Query: 276 LA 277
++
Sbjct: 794 IS 795
>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
Length = 476
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 105/232 (45%), Positives = 144/232 (62%), Gaps = 23/232 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +K++K Y++QEE D R ++FK NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 178 FKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSDLTEEEFRL 237
Query: 112 QFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
+L RR PA ++P P +DWRDHGAV+ VK+QG CGSCW+FS
Sbjct: 238 TYLNPLLSQWTLRRPMKPASPARSPA------PASWDWRDHGAVSPVKNQGLCGSCWAFS 291
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG +EG FL G+L+SLSEQ+LVDCD D C GGL ++A+E I GG
Sbjct: 292 VTGNIEGQWFLKHGKLLSLSEQELVDCD---------GLDHACRGGLPSNAYEAIEGLGG 342
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+E E DY Y+G C F K+AA +++ + SDE++MAA L ++GP++
Sbjct: 343 LEAENDYTYSGHK-QKCSFATEKVAAYINSSVELPSDENEMAAWLAENGPVS 393
>gi|356519401|ref|XP_003528361.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
[Glycine max]
Length = 205
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 102/179 (56%), Positives = 120/179 (67%), Gaps = 14/179 (7%)
Query: 21 AVAVNDDDAMIRQVVP-----SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
V DD +IRQVVP + ++ EDHLLN EHHF+ FK+KF K Y T+EEH+ RF
Sbjct: 17 VVTSTTDDILIRQVVPDAVSEATEKEDEDHLLNEEHHFTSFKAKFGKKYVTKEEHNRRFG 76
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
VFK+NL RA+ LDP+ VH +TK SDLT +EFRR L A+ KAP
Sbjct: 77 VFKSNLHRARLHAKLDPSVVHNITKLSDLTSTEFRRX-FLSLXLLCFLANTHKAP----- 130
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
DFDW D GA+T VKDQGACG CWSFS T +LEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 131 ---KDFDWXDKGAITNVKDQGACGLCWSFSTTRSLEGAHYLATGELGSLSEQQLVDCDH 186
>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
Length = 325
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 98/231 (42%), Positives = 142/231 (61%), Gaps = 16/231 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK + K YA +++ RF +FK NL RA++ Q+ + TA +GVT+FSDLTP
Sbjct: 27 NARELYEQFKRDYGKAYANEDDQK-RFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF ++LGL R+ + + P DWR+ GAV +++QG+CGSCW+FS
Sbjct: 86 EEFEAKYLGL----RIDEQVDRVQLNDLQTAPASVDWREKGAVGPIENQGSCGSCWAFSV 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG FL TG LVSLS+QQLVDCD + D+GC GG ++ I + GG+
Sbjct: 142 VGNIEGQWFLKTGYLVSLSKQQLVDCD---------TVDNGCYGGYPPYTYKEIKRMGGL 192
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E + DYPYTG G C+ D+SK+ A + + V+ +DE++ AA L +HGP++
Sbjct: 193 ELQSDYPYTGW-GHGCRLDRSKLFAKIDDSIVLEADEEKQAAWLAEHGPMS 242
>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
Length = 475
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 104/234 (44%), Positives = 142/234 (60%), Gaps = 25/234 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F +K++K Y++QEE D R R+F NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 176 QFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEYGVTKFSDLTEEEFR 235
Query: 111 RQFLG-------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+L L++ ++ PA K P P +DWRDHGAV+ VK+QG CGSCW+
Sbjct: 236 STYLNPLLSQWTLHQPMK-PATPAKGPS------PDSWDWRDHGAVSPVKNQGMCGSCWA 288
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS G +EG FL G L+SLSEQ+LVDCD D C GGL ++A+E I K
Sbjct: 289 FSVIGNIEGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAYEAIEKL 339
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E DY YTG C F K+AA +++ + DE ++AA L ++GP++
Sbjct: 340 GGLETESDYSYTGHK-QRCDFTTGKVAAYINSSVELPKDEKEIAAWLAENGPVS 392
>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 115/298 (38%), Positives = 157/298 (52%), Gaps = 39/298 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCDH D GCNGG + I K GG+E DYPYTG D
Sbjct: 157 GDLLALSEQQLVDCDH---------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD 207
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTV 296
G C ++SK A V++ +V+ E A L + GPL+ + ++ L +F +
Sbjct: 208 -GICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264
>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
Length = 276
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 137/207 (66%), Gaps = 17/207 (8%)
Query: 74 FRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFL--GLNRRLRLPADAQKAP 130
++F++N+R+A + Q +D TA +G T FSDL+ EFR+Q + G + L DA+
Sbjct: 1 MKIFESNMRKAAKMQKMDSGTAQYGPTIFSDLSEEEFRKQKMMPGWGKPLYEMKDAE--- 57
Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
+P D+P DWRD G VT VK+QG+CGSCW+FS TG +EG + + TG+LVSLSEQ+LV
Sbjct: 58 -IPLGDIPESVDWRDKGVVTPVKNQGSCGSCWAFSTTGNIEGQYAIKTGKLVSLSEQELV 116
Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
DCD + D GC GGL ++A++ I K GG+E E DYPY G D CKF+K+++
Sbjct: 117 DCD---------TIDKGCEGGLPSNAYKQIEKLGGLESESDYPYKGAD-SKCKFNKAEVK 166
Query: 251 AAVSNFSVISSDEDQMAANLVKHGPLA 277
+++ VIS DE ++AA L K+GP++
Sbjct: 167 VTINSSVVISKDEKEIAAWLAKNGPIS 193
>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
Length = 361
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 108/246 (43%), Positives = 152/246 (61%), Gaps = 31/246 (12%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FS+F ++KTY +EEH+ RF +FK NL+R A +L + TA +G+T+FSDL+PSEF R
Sbjct: 34 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 93
Query: 112 QFLGLNRRL-RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWS------ 163
+LGL + L A+ + + P N+ LP FDWR GAVT VK+QG CGSCW+
Sbjct: 94 HYLGLKKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSXXTE 153
Query: 164 ------------FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
FS TG +EG FLS +L+SLSEQ+LVDCDH D GC GG
Sbjct: 154 VKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELVDCDHG---------DHGCKGG 204
Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLV 271
M A + +++ GG+E E +YPY G D G+C+F+K++ A V +F + +E ++A L+
Sbjct: 205 YMGQAMKAVIEMGGLETESEYPYKGVD-GTCEFNKTESKARVQSFVGLPQNETELAYWLM 263
Query: 272 KHGPLA 277
KHGP++
Sbjct: 264 KHGPVS 269
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 98/239 (41%), Positives = 146/239 (61%), Gaps = 16/239 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K+YA ++ RF +FK NL RA+ QL + TA +GVT+FSDLTP
Sbjct: 27 SARELYEQFKRDYGKSYANDDDEK-RFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF +FL R ++ + P DWR+ GAV V+DQG+CGSCW+FS
Sbjct: 86 EEFAAKFLSS----RFDDQVERVQLNDLKAAPESVDWRELGAVAPVEDQGSCGSCWAFSV 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG FL TG+LVSLS+QQLVDCD + DSGC+GG + + I++ GG+
Sbjct: 142 AGNVEGQWFLKTGQLVSLSKQQLVDCDVQ---------DSGCDGGYPPTTYGEIIRMGGL 192
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
E ++DYPY G + CK D+SK+ A +++ V+ ++E + AA + +HGP++ + ++ L
Sbjct: 193 EAQRDYPYVGRE-QPCKLDESKLLAKINSSIVLEANEKKQAAYIAEHGPMSSGINAVTL 250
>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
Length = 328
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 115/298 (38%), Positives = 155/298 (52%), Gaps = 39/298 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCDH D GCNGG + I K GG+E DYPYTG D
Sbjct: 157 GDLLALSEQQLVDCDH---------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD 207
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTV 296
G C ++SK A V+ +V+ E A L + GPL+ + ++ L +F +
Sbjct: 208 -GICYMNQSKFVAYVNESTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264
>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
Length = 326
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 111/279 (39%), Positives = 148/279 (53%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY + ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GDLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A ++ +++ E A L GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245
>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
Length = 322
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 105/249 (42%), Positives = 148/249 (59%), Gaps = 19/249 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 22 SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTP 80
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L R + D Q + PT P DWR+ GAVT V++QG+CGSCW+F
Sbjct: 81 EEFAAKYL----RAAVNND-QVERVRPTGLKAAPERMDWREKGAVTAVENQGSCGSCWAF 135
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SA G +EG F+ TG+LVSLS+QQLVDCD + GCNGG S++ I G
Sbjct: 136 SAAGNVEGQWFIKTGQLVSLSKQQLVDCDRVAE---------GCNGGWPVSSYLEIKHMG 186
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
G+E E DYPY G + +C +K K+ A + + V+ + E++ AA L +HGPL+ + ++
Sbjct: 187 GLESESDYPYVGAE-QTCALNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAVA 245
Query: 285 LPHISFSFL 293
L H L
Sbjct: 246 LQHYQSGVL 254
>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
Length = 326
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 111/279 (39%), Positives = 148/279 (53%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY + ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GDLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A ++ +++ E A L GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245
>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
Length = 459
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/238 (42%), Positives = 141/238 (59%), Gaps = 23/238 (9%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPS 107
A + F F + K Y ++ + RFRVFK NL+ + Q + TAV+G+T+FSDLTP
Sbjct: 153 AWNQFVDFMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPE 212
Query: 108 EFRRQFLGL--------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
EF++ +L NR + L A+ + LP FDWRDHGAVT VK+QG CG
Sbjct: 213 EFKKIYLPYIWDEPIVPNRMVDLTAEG----VHLNETLPESFDWRDHGAVTDVKNQGFCG 268
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG +EG FL+ +LVSLSEQ+LVDCD D GC GGL + A++
Sbjct: 269 SCWAFSTTGNIEGQWFLAKKKLVSLSEQELVDCD---------KVDDGCEGGLPSQAYKE 319
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I++ GG+E E YPY G G C ++++ A +++ + DE+ M A LVK GP++
Sbjct: 320 IMRMGGLETESAYPYDGR-GEECHINRTEFAVYINDSVELPHDEESMKAWLVKKGPIS 376
>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
Length = 427
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/236 (42%), Positives = 139/236 (58%), Gaps = 15/236 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N F F+ KF K+Y++ R+ +FK NL + + Q L+ TA +G+TKFSDL+
Sbjct: 122 NTSRLFEEFQRKFRKSYSSDTAK--RYALFKYNLLKMQLIQRLEKGTANYGITKFSDLSA 179
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EFR + RR + + I PT LP FDWR +GAVT VKDQG CGSCW+F
Sbjct: 180 EEFRHSLANMKRR-KSKGSQMETAIFPTTIQSLPPSFDWRANGAVTEVKDQGMCGSCWAF 238
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
+ TG +EG F T +L+SLSEQQL+DCD + D CNGGL A++ I+K G
Sbjct: 239 ATTGNIEGQWFRKTNKLISLSEQQLLDCDTK---------DEACNGGLPEWAYDEIVKMG 289
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
G+ EKDYPY SC + I+A ++ + + SDE ++AA LV++GP++ V
Sbjct: 290 GLMSEKDYPYEAMKEQSCHLRRPNISAYINGSATLPSDEAKLAAWLVQNGPISVGV 345
>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 114/298 (38%), Positives = 156/298 (52%), Gaps = 39/298 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
P D+ D FDWR+HGAV V DQG CGSCW+FS G +EG F T
Sbjct: 97 FDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQQLVDCDH + GCNGG + I K GG+E DYPYTG D
Sbjct: 157 GDLLALSEQQLVDCDH---------LEKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD 207
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTV 296
G C ++SK A V++ +V+ E A L + GPL+ + ++ L +F +
Sbjct: 208 -GICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264
>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
Length = 308
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 100/228 (43%), Positives = 141/228 (61%), Gaps = 18/228 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFL 114
F ++++TY+ ++E RFR++K NLR AK Q + TA++G T+FSDLT +EFR+ +
Sbjct: 10 FIGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLTQAEFRK--I 67
Query: 115 GLNRRLRLPADAQKAPI-----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
L + P K + ND+P FDWR+ AVT VK+QG+CGSCW+FS TG
Sbjct: 68 MLPYKWETPKVPNKMANFKEFGIAQNDIPESFDWREKNAVTEVKNQGSCGSCWAFSVTGN 127
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EGA + T +LVSLSEQ+LVDCD D GCNGGL ++A+ I++ GG+E E
Sbjct: 128 IEGAWAIKTSKLVSLSEQELVDCD---------IIDQGCNGGLPSNAYREIIRMGGLEAE 178
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DYPY G G C K IA +++ + DE++MAA LV GP++
Sbjct: 179 SDYPYDGR-GEKCHLMKKDIAVYINDSLQLPHDEEKMAAWLVAKGPIS 225
>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
Length = 353
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 105/265 (39%), Positives = 153/265 (57%), Gaps = 22/265 (8%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKS------KFSKTYATQEEHDYRFRVFKANLRRA 84
+ Q+ P+ S+D A HH +FK+ +++K+Y +E +YR++VF N+ RA
Sbjct: 30 MMQLQPATRRFSQD---TATHHDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARA 86
Query: 85 KRRQLLD-PTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDW 143
Q D T +G TK SDLT E + F + + + +KA I N LP FDW
Sbjct: 87 MLFQKHDNATGRYGFTKLSDLTDQEVK-SFYAMKKWPQQLYPTKKANIPQLNSLPQSFDW 145
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
R GAVT VKDQ CG+CW+F+ TG +EG +L+ G+L SLSEQ+LVDCD
Sbjct: 146 RSKGAVTAVKDQKRCGACWAFATTGNIEGQWYLNKGKLYSLSEQELVDCD---------K 196
Query: 204 CDSGCNGGLMNSAFEYIL-KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
D GC GGL +A+ I+ + GG+E EKDYPY + G CK +KS+ +++ +S++
Sbjct: 197 IDEGCKGGLPLNAYHSIMNRLGGLETEKDYPYVAKN-GKCKLNKSEEVVYINSSVKVSTN 255
Query: 263 EDQMAANLVKHGPLAGNVASIELPH 287
E +AA LV HGP+A + S+ + H
Sbjct: 256 ETDLAAWLVAHGPVAIGINSVNMLH 280
>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
str. Neff]
Length = 330
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 101/238 (42%), Positives = 140/238 (58%), Gaps = 13/238 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKF 101
+E + AE F F +++ K+YA+ EE R R+F+ NL R + A +GV KF
Sbjct: 21 AEAGTMTAEQQFRQFAAQYGKSYAS-EEFGERLRIFRDNLDRIDALNSANTGARYGVNKF 79
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
+DLTP EF+ +L R A A + T LP+ FDWRD GAVT KDQG CG
Sbjct: 80 ADLTPKEFKATYLKGARSAGQKKAAATAKLDMTGPLPSQFDWRDKGAVTPTKDQGQCG-- 137
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS T A+E FLS +LVSL+ QQ+VDCD G+ D GC+GG +A+EY++
Sbjct: 138 WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQ-------GNGDYGCDGGDPPTAYEYVI 190
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS--DEDQMAANLVKHGPLA 277
KAGG++ E+ YPYT D G C F S + A +SN++ I++ +E +M L GPL+
Sbjct: 191 KAGGLDTEESYPYTAED-GQCAFKPSAVGAKISNWTYITTTKNETEMQYGLASRGPLS 247
>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 111/256 (43%), Positives = 152/256 (59%), Gaps = 27/256 (10%)
Query: 33 QVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLD 91
+ P D + SED +A F F + K Y+ QE H RF+ F NL+R K +
Sbjct: 33 KTTPEDFDVSED---DARKQFENFLLEHPKMYSEQESHS-RFQTFWENLKRIKFHNHIEQ 88
Query: 92 PTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA----------DAQKAPILPTNDLPTDF 141
+A +GVT+F+DL+ EFRR +LGL L++P ++K T D F
Sbjct: 89 GSAKYGVTEFADLSDFEFRRHYLGLKPELKIPNRKKYERKSRNSSKKLKFAKTVD--ETF 146
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DW + GAVT VK+QG CGSCW+FS TG +EGA F +TG+LVSLSEQ+LVDCD +
Sbjct: 147 DWVEKGAVTEVKNQGMCGSCWAFSTTGNIEGAWFKATGDLVSLSEQELVDCDQK------ 200
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
DSGCNGGLM+ AFE +++ GG+E E+ YPY G +C F+KS + +F I
Sbjct: 201 ---DSGCNGGLMDQAFEEVIRIGGLETEQQYPYDGVQ-ETCNFEKSLSKVQIDDFMDIGE 256
Query: 262 DEDQMAANLVKHGPLA 277
DE+++A L +HGPL+
Sbjct: 257 DEEEIAEALEEHGPLS 272
>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
Length = 887
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/265 (39%), Positives = 151/265 (56%), Gaps = 27/265 (10%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL------RRAKRRQLLDPTAVHGVTK 100
+ +E F+ F +++TY+T EE + R R+F+ NL R+ +R TA + V
Sbjct: 576 VRSEQLFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERG-----TAHYDVNM 630
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F+D++P EFR ++LGL LR D + +P +LP FDWR+ VT VKDQG CG
Sbjct: 631 FADMSPEEFRSRYLGLRPDLRSENDIPLREAEIPDVELPPKFDWREKSVVTPVKDQGMCG 690
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG +EG + + G L+SLSEQ+LVDCD D GCNGGL ++A+
Sbjct: 691 SCWAFSVTGNIEGQYAIKHGRLLSLSEQELVDCD---------DLDEGCNGGLPDNAYRA 741
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA-- 277
I K GG+E E DYPY + C F K+ +++ I+S+E QMA LV++GP++
Sbjct: 742 IEKLGGLELESDYPYEA-ENEKCHFKKNLAKVQLASAVNITSNETQMAQWLVQNGPISIG 800
Query: 278 --GNVASIELPHISFSFLFTVSSPK 300
N + +S F F + +PK
Sbjct: 801 INANAMQFYVGGVSHPFKF-LCNPK 824
>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
Length = 327
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 103/249 (41%), Positives = 143/249 (57%), Gaps = 19/249 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 27 SARELYEQFKRGYGKVYANEDDQK-RFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L D Q + PT P DWR GAVT V++QG+CGSCW+F
Sbjct: 86 EEFAAKYLSAPVN-----DDQVKRMRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 140
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S G +EG F+ TG+LVSLS+QQLVDCD GCNGG S++ I+ G
Sbjct: 141 STAGNVEGQWFIKTGQLVSLSKQQLVDCDRAA---------QGCNGGWPASSYLEIMYMG 191
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
G+E E DYPY G + +C +K K+ A + + V+ +E+ AA L +HGPL+ + ++
Sbjct: 192 GLESESDYPYVGVE-QTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVA 250
Query: 285 LPHISFSFL 293
L H L
Sbjct: 251 LQHYQSGVL 259
>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
Length = 537
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 102/237 (43%), Positives = 139/237 (58%), Gaps = 14/237 (5%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQE-EHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
H + AE F F + + Y E RF +FK N+++ + T V+ VT+F+
Sbjct: 223 HHVQAEQLFFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERGTGVYAVTRFT 282
Query: 103 DLTPSEFRRQFLGLNRRLRLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
DLT EF+ ++LGLN L+ P ++A I + LP FDWR GAVT VKDQGACGS
Sbjct: 283 DLTYEEFKSKYLGLNPNLKKPNQIPMRQAEIPKVHQLPASFDWRPLGAVTEVKDQGACGS 342
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG L TG+L+SLSEQ+LVDCD D GC+GG M++A+ I
Sbjct: 343 CWAFSVTGNIEGQWKLKTGKLLSLSEQELVDCD---------KMDDGCDGGYMDNAYRAI 393
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ GG+E E++YPY D C F+KS +S ISS+E MA LV +GP++
Sbjct: 394 EQLGGLETEEEYPYEAED-DKCSFNKSLSKVQISGAVNISSNETNMAKWLVHNGPIS 449
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 97/241 (40%), Positives = 149/241 (61%), Gaps = 21/241 (8%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTK 100
SED + AE F+ F + +++TY++ E + RF++F+ NL + R+ T ++GV
Sbjct: 461 SED--MKAERLFNNFMTTYNRTYSSLE-RNLRFKIFRENLNFIEELRETEQGTGIYGVNM 517
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQG 156
F+D++ EFR ++LGL L+ + P+ +P DLP+ FDWR G VT VK+QG
Sbjct: 518 FADMSQKEFRTRYLGLRPDLQ---SENEIPLPKAEIPDIDLPSSFDWRQKGVVTPVKNQG 574
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FS TG +EG + + G+L+SLSEQ+LVDCDH D GCNGGL ++A
Sbjct: 575 QCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDCDH---------LDEGCNGGLPDNA 625
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
+ I + GG+E E DYPY + C F ++ + +++ I+S+E Q+A LV++GP+
Sbjct: 626 YRAIEQLGGLELESDYPYEA-ENEKCHFKQNLVKVELASAVNITSNETQIAQWLVQNGPI 684
Query: 277 A 277
A
Sbjct: 685 A 685
>gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum]
Length = 347
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 103/241 (42%), Positives = 139/241 (57%), Gaps = 20/241 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV-------HGVTKFS 102
E F F+ K++K Y + E +F FK NL R L+ A GV +F+
Sbjct: 24 EIQFRDFQVKYNKVYGSHE-FSQKFVTFKDNLNRIDT---LNANAAASGSDTKFGVNEFA 79
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACG 159
DL+ EFR+ ++ +P+DAQ A L P+ FDWR GAVT VK+QG CG
Sbjct: 80 DLSVQEFRKFYMNA-VPASVPSDAQVAGDYSDETLASIPSSFDWRTKGAVTPVKNQGQCG 138
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC---DPEESGSCDSGCNGGLMNSA 216
SCWSFS TG +EG FL+ L LSEQ LVDCDH C D ++ SCD GCNGGL +A
Sbjct: 139 SCWSFSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHHCMTYDGQQ--SCDDGCNGGLQPNA 196
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
F+YI+ GG++ E YPY C+F S I A +SN+ ++S++E Q+AA L +GP+
Sbjct: 197 FQYIIGNGGIDTETSYPYLAVAQDKCQFKASNIGAKISNWQMLSTNETQIAAYLALNGPV 256
Query: 277 A 277
+
Sbjct: 257 S 257
>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
Length = 326
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 104/242 (42%), Positives = 140/242 (57%), Gaps = 19/242 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK K+ KTY+ ++ + RFR+FK NL RAKR Q ++ TA +GVT+FSDLT
Sbjct: 27 DARALYEEFKLKYKKTYSNDDD-ELRFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTS 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWS 163
EF+ ++L R+R P D+ D FDWRDHGAV V DQG CGSCW+
Sbjct: 86 EEFKTRYL----RMRFDEPIVNEDPTPQEDVTMDNSNFDWRDHGAVGPVLDQGDCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS G +EG F TG+L+ LSEQQL+DCDH D GC+GG + I +
Sbjct: 142 FSVIGNVEGQWFRKTGDLLGLSEQQLIDCDHS---------DQGCDGGYPPQTYSAIEEM 192
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG+E DYPYTG D G C D+SK A V+ + + E A +L + GPL+ + ++
Sbjct: 193 GGLELRSDYPYTGKD-GICYMDQSKFVAYVNGSTRLPWCEKTQAKSLKEIGPLSSGLNAV 251
Query: 284 EL 285
L
Sbjct: 252 LL 253
>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
Length = 337
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 99/265 (37%), Positives = 148/265 (55%), Gaps = 29/265 (10%)
Query: 30 MIRQVVPSDGEQSEDHLL----NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
MI ++ Q E HL +A+H+F F ++K YA + +YRF++F NL
Sbjct: 5 MIFTILLVASSQIEGHLKFDIHDAQHYFETFIVNYNKQYADTKTKNYRFKIFVQNLEYIN 64
Query: 86 RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK------------APILP 133
+ L+ +A++ + KFSDL+ +E ++ GL R P++ K AP
Sbjct: 65 EKNKLNDSAIYNINKFSDLSKNELLTKYTGLTSRK--PSNMVKSTSNFCNVIHLDAPPDA 122
Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
++LP +FDWR + +T VKDQGACGSCW+ +A G LE + + L++LSEQQL+DCD
Sbjct: 123 RDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD 182
Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
S + C+GGLM++AFE ++ AGG+ E DYPY GT G CK D K A +V
Sbjct: 183 ---------SANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTK-GICKIDNKKFALSV 232
Query: 254 SNFS-VISSDEDQMAANLVKHGPLA 277
S+ I +E+ + L+ GP+A
Sbjct: 233 SSCKRYIFQNEENLKKELITTGPIA 257
>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
Length = 326
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 111/279 (39%), Positives = 146/279 (52%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A V+ +++ E A L GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245
>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
Length = 325
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/233 (42%), Positives = 137/233 (58%), Gaps = 20/233 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
NA + FK + K YA ++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 27 NARELYEQFKRDYGKVYANDDDQK-RFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L P + Q + PT P DWR+ GAV V++QG+CGSCW+F
Sbjct: 86 EEFAAKYL------SRPMNDQVERVRPTGLKAAPERMDWREWGAVGPVENQGSCGSCWAF 139
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S G +EG FL TG+LVSLS+QQLVDCD D GC GG +A+ I++ G
Sbjct: 140 SVAGNVEGQWFLKTGQLVSLSKQQLVDCD---------VMDYGCGGGWPTNAYMEIMRMG 190
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G+E + DYPY G C +K K+ A + + V+ + E++ AA L +HGPL+
Sbjct: 191 GLELQSDYPYVGVQ-QQCYLNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLS 242
>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
Length = 326
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 111/279 (39%), Positives = 146/279 (52%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A V+ +++ E A L GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245
>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 110/279 (39%), Positives = 146/279 (52%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A ++ +++ E A L GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245
>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 596
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 139/227 (61%), Gaps = 12/227 (5%)
Query: 53 FSLFKSKFSKTYATQ-EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFR 110
F +F K+ +TY++ +E++ RF +FK N + + ++ TAV+G+TKF D++ E+
Sbjct: 169 FDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEYH 228
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
R R +P + L T ++P DWR HGAVT VK+QG+CGSCW+FS TG +
Sbjct: 229 RTLAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTTGNV 288
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL +L+SLSEQ+LVDCD + DSGC GGL ++A++ I K GG+E EK
Sbjct: 289 EGQWFLKHKKLISLSEQELVDCD---------TLDSGCGGGLPSNAYKSIEKLGGLEPEK 339
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DYPY G +G C +S V+N + DE ++AA L ++GP++
Sbjct: 340 DYPYVG-EGEKCAIKQSDFKVFVNNSVALPKDEVKLAAWLAQNGPIS 385
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 70/115 (60%), Gaps = 9/115 (7%)
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
E+ R R +P + L T ++P DWR HGAVT VK+QG+CGSCW+FS T
Sbjct: 446 EYHRTLAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTT 505
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
G +EG FL +L+SLSEQ+LVDCD + DSGC GGL ++A++ I K
Sbjct: 506 GNVEGQWFLKHKKLISLSEQELVDCD---------TLDSGCGGGLPSNAYKSIEK 551
>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
Length = 322
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 102/250 (40%), Positives = 144/250 (57%), Gaps = 21/250 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 22 SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTP 80
Query: 107 SEFRRQFLGLNRRLRLPADA-QKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF ++L P + Q + PT P DWR GAVT V++QG+CGSCW+
Sbjct: 81 EEFAAKYLSA------PVNNDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWA 134
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS G +EG F+ TG+LVSLS+QQLVDCD GCNGG S++ I+
Sbjct: 135 FSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAA---------QGCNGGWPASSYLEIMYM 185
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG+E E DYPY G + +C +K K+ A + + V+ +E+ AA L +HGPL+ + ++
Sbjct: 186 GGLESESDYPYVGVE-QTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAV 244
Query: 284 ELPHISFSFL 293
L + L
Sbjct: 245 ALQYYQSGVL 254
>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 111/279 (39%), Positives = 145/279 (51%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFETRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A V+ +++ E A L GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245
>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 441
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 99/233 (42%), Positives = 131/233 (56%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DWR GAVT VKDQG+CGSCWSFSA G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD + D+GC GGLM++AFE+I+K +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
EK YPY G + CK K+ A ++ I DED +A L +GP+A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVA 254
>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
Length = 478
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 102/258 (39%), Positives = 148/258 (57%), Gaps = 17/258 (6%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
+DD ++++ + + D+++ + F F + K Y + E RFRVFK N +
Sbjct: 150 HDDSVTVQELRKAKIIKPRDYVI--WNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVI 207
Query: 85 KRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA----QKAPILPTNDLPT 139
+ Q + TAV+G TKFSD+T EF+ L +P D ++ + DLP
Sbjct: 208 RELQKNEQGTAVYGFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKEGVTISEEDLPD 267
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
FDWR+HGAVT VK+QG+CGSCW+FS TG +EGA FL+ +LVSLSEQ+LVDCD
Sbjct: 268 SFDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD------ 321
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
S D GCNGGL ++A++ I++ GG+E E YPY G G +C + IA ++ +
Sbjct: 322 ---SVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGR-GETCHLVRKDIAVYINGSVEL 377
Query: 260 SSDEDQMAANLVKHGPLA 277
DE +M LV GP++
Sbjct: 378 PHDEVEMQKWLVTKGPIS 395
>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
Length = 344
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 102/281 (36%), Positives = 151/281 (53%), Gaps = 17/281 (6%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
++L V AS N DA+I V + + + +L A +F F++K+ K YA E
Sbjct: 4 IILFFVFVFASGGFDNGVDAIIDYVTAAPQFKLQYNLERAPQYFETFQTKYKKVYADDNE 63
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
DYR+++FK NL + + +AV+ + KF+DLT +E +F GL +R PA
Sbjct: 64 RDYRYKIFKTNLEIINLKNQQNDSAVYNINKFADLTKNEVIAKFTGLG--IRSPALKNSC 121
Query: 130 -PIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
P++ P+ FDWR +T VKDQG CGSCW+FS LE + + E V LS
Sbjct: 122 EPVIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAGLESQYAIKYNEHVDLS 181
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCD + D GC GGL+++A+E I+ GG+E E+DYPY G C+
Sbjct: 182 EQQLVDCD---------TIDMGCAGGLLHTAYEEIMAMGGLEYEEDYPYRSVQ-GPCRLQ 231
Query: 246 KSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIEL 285
K +V N + + ED++ L + GP+A V +++L
Sbjct: 232 SDKFEVSVDNCYRYVLYSEDKLKDVLHEMGPIAVAVDAVDL 272
>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
Length = 478
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 102/258 (39%), Positives = 148/258 (57%), Gaps = 17/258 (6%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
+DD ++++ + + D+++ + F F + K Y + E RFRVFK N +
Sbjct: 150 HDDSVTVQELRKAKIIKPRDYVV--WNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVI 207
Query: 85 KRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA----QKAPILPTNDLPT 139
+ Q + TAV+G TKFSD+T EF+ L +P D ++ + DLP
Sbjct: 208 RELQKNEQGTAVYGFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKEGVTISEEDLPD 267
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
FDWR+HGAVT VK+QG+CGSCW+FS TG +EGA FL+ +LVSLSEQ+LVDCD
Sbjct: 268 SFDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD------ 321
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
S D GCNGGL ++A++ I++ GG+E E YPY G G +C + IA ++ +
Sbjct: 322 ---SVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGR-GETCHLVRKDIAVYINGSVEL 377
Query: 260 SSDEDQMAANLVKHGPLA 277
DE +M LV GP++
Sbjct: 378 PHDEVEMQKWLVTKGPIS 395
>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
Length = 367
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 98/230 (42%), Positives = 129/230 (56%), Gaps = 14/230 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EFR +
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93
Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ R + + +P P DWR GAVT VKDQG CGSCWSFSA G +E
Sbjct: 94 YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGNIE 153
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
G + L SLSEQ LV CD + D+GC GGLM++AFE+I+K +G V E
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKVYTE 204
Query: 230 KDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K YPY G + CK K+ A ++ I DED +A L +GP+A
Sbjct: 205 KSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVA 254
>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
Length = 326
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 110/279 (39%), Positives = 145/279 (51%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + F K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFTLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A V+ +++ E A L GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245
>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 110/288 (38%), Positives = 152/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 110/288 (38%), Positives = 152/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 110/288 (38%), Positives = 152/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 110/288 (38%), Positives = 152/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAIAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 105/258 (40%), Positives = 141/258 (54%), Gaps = 24/258 (9%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKF 101
L+ E H +K + K YA + E +R ++F N + AK QL V G+ K+
Sbjct: 23 LIKEEWH--TYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKY 80
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN------DLPTDFDWRDHGAVTGVKDQ 155
+D+ EF+ G N LR + + T +P DWR+HGAVTGVKDQ
Sbjct: 81 ADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQ 140
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FS+TGALEG HF G LVSLSEQ LVDC + ++GCNGGLM++
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDN 193
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHG 274
AF YI GG++ EK YPY G D SC F+K+ I A + F I DE++M + G
Sbjct: 194 AFRYIKDNGGIDTEKSYPYEGID-DSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMG 252
Query: 275 PLAGNVASIELPHISFSF 292
P++ +I+ H SF
Sbjct: 253 PVS---VAIDASHESFQL 267
>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
Length = 322
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/249 (40%), Positives = 145/249 (58%), Gaps = 19/249 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 22 SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLVRAQKLQLRDQGTARYGVTQFSDLTP 80
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF ++L L +D Q + PT P DWR GAVT V++QG CGSCW+F
Sbjct: 81 EEFAAKYLSP----PLNSD-QVERVQPTGLKAAPERMDWRAKGAVTPVENQGECGSCWAF 135
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S G +EG F+ TG+LVSLS+QQLVDCD + GCNGG +S++ I+ G
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQQLVDCDMAAE---------GCNGGWPSSSYLEIMDMG 186
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
G+E E DYPY G + +C +K K+ A + + V+ + E++ L +HGPL+ + ++
Sbjct: 187 GLESENDYPYVGVE-QTCALNKEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLLNAVA 245
Query: 285 LPHISFSFL 293
L H L
Sbjct: 246 LQHYQSGIL 254
>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
Length = 337
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 97/265 (36%), Positives = 148/265 (55%), Gaps = 29/265 (10%)
Query: 30 MIRQVVPSDGEQSEDHLL----NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
MI ++ Q E HL +A+H+F F ++K Y + +YRF++FK NL
Sbjct: 5 MIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNLEDIN 64
Query: 86 RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK------------APILP 133
+ L+ +A++ + KFSDL+ +E ++ GL + P++ + AP
Sbjct: 65 EKNKLNDSAIYNINKFSDLSKNELLTKYTGLTSKK--PSNMVRSTSNFCNVIHLDAPPDV 122
Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
++LP +FDWR + +T VKDQGACGSCW+ +A G LE + + L++LSEQQL+DCD
Sbjct: 123 HDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD 182
Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
S + C+GGLM++AFE ++ AGG+ E DYPY GT G CK D K A +V
Sbjct: 183 ---------SANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTK-GVCKIDNKKFALSV 232
Query: 254 SNFS-VISSDEDQMAANLVKHGPLA 277
S+ I +E+ + L+ GP+A
Sbjct: 233 SSCKRYIFQNEENLKKELITMGPIA 257
>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/279 (39%), Positives = 145/279 (51%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A ++ +++ E A L GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245
>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/279 (39%), Positives = 145/279 (51%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRET 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LS QQLVDCD+ D GC+GG + I K GG+E DYPYTG
Sbjct: 157 GHLLALSGQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A V+ +++ E A L GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245
>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
Length = 459
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 100/233 (42%), Positives = 139/233 (59%), Gaps = 26/233 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + +++TY T+EE +R +F N+ RA+ Q LD TA +GVTKFSDLT EFR
Sbjct: 162 FKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEIQALDRGTAQYGVTKFSDLTEEEFRT 221
Query: 112 QFL------GLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+L GL +++RL P +D P ++DWR+ GAVT VK+QG CGSCW+F
Sbjct: 222 FYLNPLLKEGLGKKMRLAK--------PVDDPAPPEWDWRNKGAVTKVKNQGMCGSCWAF 273
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG FL G+L+SLSEQ+LVDCD + D C GGL ++A+ I G
Sbjct: 274 SVTGNVEGQWFLKQGDLLSLSEQELVDCD---------TLDKACMGGLPSNAYSAIKTLG 324
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G+E E DY Y G +C F K+ +++ +S DE ++AA L K GP++
Sbjct: 325 GLETEDDYSYHG-HLQTCSFTAEKVKVYINDSVELSKDEQKLAAWLAKKGPIS 376
>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
kowalevskii]
Length = 352
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 115/304 (37%), Positives = 160/304 (52%), Gaps = 25/304 (8%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD----GEQSEDHLLNAEHHFSLFKSK 59
+ + +L+ + LS+V + A+ I V D + + F F
Sbjct: 1 MAILTLIAVFLSTVALGSQAIGPRTITINNVPMIDEIERNTNESGSVDKTQDLFQDFMKT 60
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
+ K Y T+EEH R+++F+ NL +A+R +Q T +GVTKF DL+ EFR+ +L
Sbjct: 61 YDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSEEEFRKYYLTPVW 120
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRD--HGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
R P +KA I P P FDWRD AVT VK+QG CGSCW+FS TG +EG +
Sbjct: 121 RGSDP-HMKKAEI-PKGTPPAAFDWRDADKNAVTKVKNQGTCGSCWAFSTTGNIEGQWKI 178
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G LVSLSEQ+LVDCD D GCNGGL ++A++ I++ GG+ E DYPYTG
Sbjct: 179 KKGTLVSLSEQELVDCD---------KLDQGCNGGLPSNAYQEIMRFGGIMSEDDYPYTG 229
Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLF-T 295
D CK + + ++ IS DE MA+ L +GP+ SI + + F F
Sbjct: 230 RD-QDCKLNATLNKVYINGSMNISKDEGDMASWLAANGPI-----SIGINANAMQFYFGG 283
Query: 296 VSSP 299
VS P
Sbjct: 284 VSHP 287
>gi|241602000|ref|XP_002405373.1| cathepsin-like protease, putative [Ixodes scapularis]
gi|215502535|gb|EEC12029.1| cathepsin-like protease, putative [Ixodes scapularis]
Length = 273
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 106/250 (42%), Positives = 140/250 (56%), Gaps = 22/250 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ + K+Y++Q E +R V+ N L+ AK + V + KFSDL
Sbjct: 27 EWETFKANYGKSYSSQAEEQFRMTVYMNNKLKVAKHNEQYAEGKVSYQLAMNKFSDLLHE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND----LPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R G RR+R P + P N P DWR GAVT VK+Q CGSCW+
Sbjct: 87 EFVRSRNGF-RRIR-PVKQASTYMEPANIEDVCFPQTVDWRKKGAVTPVKNQEQCGSCWA 144
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATG+LEG HFL TG+LVSLSEQ LVDC + + GC+GG+M+ AF YI
Sbjct: 145 FSATGSLEGQHFLRTGKLVSLSEQNLVDCSDDFG-------NLGCSGGVMDDAFRYIKAN 197
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
GG++ EK YPYTG D G C FDKS + A + F V + DE Q+ + GP++ +
Sbjct: 198 GGIDTEKSYPYTGED-GQCVFDKSNVGATDTGFVDVQTGDETQLMKAVASVGPIS---VA 253
Query: 283 IELPHISFSF 292
I+ H+SF F
Sbjct: 254 IDASHLSFQF 263
>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
Length = 325
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/231 (41%), Positives = 136/231 (58%), Gaps = 16/231 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK + K YA +++ RF +FK NL RA++ Q+ + TA +GVT+FSDLTP
Sbjct: 27 NARELYEQFKRDYGKAYANEDDQK-RFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTP 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF +LG R+ + + P DWR GAV V+DQG+CGSCW+FS
Sbjct: 86 EEFAAMYLGS----RIDERVDRVQLNDLQTAPASVDWRKKGAVGPVEDQGSCGSCWAFSV 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
T +EG FL TG LVSLS+QQLVDCD D GC+GG ++ I + GG+
Sbjct: 142 TANVEGQWFLKTGRLVSLSKQQLVDCDR---------LDHGCSGGYPPYTYKEIKRMGGL 192
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E + YPYT +C+ D+SK+ A + + V+ +DE++ AA L +HGP++
Sbjct: 193 ELQSAYPYTSWK-QACRIDRSKLVAKIDDSIVLETDEEKQAAWLAEHGPMS 242
>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
Length = 434
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 93/236 (39%), Positives = 141/236 (59%), Gaps = 18/236 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
++LL + F L KF+K Y ++EE RFR+F+AN+++ + TA +G+T+FS
Sbjct: 128 EYLLQSFKDFVL---KFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQYGITEFS 184
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DL+ +EF+ +LGL ++ P +P LP +FDWR + AVT VK+QG+CGSCW
Sbjct: 185 DLSVTEFK-NYLGLKKK---PESKLPTAEIPDVKLPDNFDWRHYNAVTPVKNQGSCGSCW 240
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + EL+SLSEQ+L+DCD D+GCNGG M +E I+K
Sbjct: 241 AFSVTGNIEGLWAIKKHELLSLSEQELIDCD---------KIDNGCNGGYMPETYEAIMK 291
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
GG+E E DYPY + C +K++I ++ ++ E +A L K+GP++
Sbjct: 292 LGGLETETDYPYEA-ENEKCNLNKTEIKVKINGAVNLTKSELDIAKWLYKNGPVSA 346
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 109/252 (43%), Positives = 147/252 (58%), Gaps = 20/252 (7%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFS 102
+N F FK+KF+K Y + EE RF VF N+ R VH V +F+
Sbjct: 24 VNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFA 83
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
DLT E+R+ +L L + Q+ + N DWR GAVT +K+QG CGSCW
Sbjct: 84 DLTNEEYRQLYLRPYPTELLGRERQEVWLDGPN--AGSVDWRQKGAVTPIKNQGQCGSCW 141
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYIL 221
SFS TG++EGAH ++TG LVSLSEQQLVDC SGS + GCNGGLM++AF+YI+
Sbjct: 142 SFSTTGSVEGAHAIATGNLVSLSEQQLVDC--------SGSFGNQGCNGGLMDNAFKYII 193
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNV 280
GG++ E+DYPYT DG K +SK A ++S + V ++EDQ+AA V+ GP++
Sbjct: 194 SNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAA-AVEKGPVS--- 249
Query: 281 ASIELPHISFSF 292
+IE SF
Sbjct: 250 VAIEADQQSFQM 261
>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
Length = 471
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/240 (40%), Positives = 141/240 (58%), Gaps = 13/240 (5%)
Query: 40 EQSEDHLLN-AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
++S H LN EH F+ F+ KF + Y T E RFR+FK NL+ + + +A +G
Sbjct: 152 KKSNHHNLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYG 211
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
+T+F+D+T E++ Q GL +R A + +P DLP +FDWR+ GA++ VK+QG
Sbjct: 212 ITEFADMTSPEYK-QRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGN 270
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TG +EG H + TG L SEQ+L+DCD + DS CNGGL ++A+
Sbjct: 271 CGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCD---------TSDSACNGGLPDNAY 321
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E I K GG+E E DYPY C F+ +KI V + +E +A L+ +GP++
Sbjct: 322 EAIEKIGGLELESDYPYHARK-DQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPIS 380
>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 109/256 (42%), Positives = 150/256 (58%), Gaps = 27/256 (10%)
Query: 33 QVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLD 91
+ P D + SED +A F F + K Y+ QE H RF+ F NL+R K +
Sbjct: 33 KTTPEDFDVSED---DARKQFENFLLEHPKMYSEQESHS-RFQTFWENLKRIKFHNHIEQ 88
Query: 92 PTAVHGVTKFSDLTPSEFRRQFLGLNRRLR----------LPADAQKAPILPTNDLPTDF 141
+A +GVT+F+DL+ EFRR +LGL L+ ++K T D F
Sbjct: 89 GSAKYGVTEFTDLSDFEFRRHYLGLKPELKNLNRKKYERKSRNSSKKLKFAKTAD--ETF 146
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DW + GAVT VK+QG CGSCW+FS TG +EGA F +TG+L+SLSEQ+LVDCD +
Sbjct: 147 DWVEKGAVTEVKNQGMCGSCWAFSTTGNIEGAWFKATGDLISLSEQELVDCDQK------ 200
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
DSGCNGGLM+ AFE +++ GG+E E+ YPY G +C F+KS + +F I
Sbjct: 201 ---DSGCNGGLMDQAFEEVIRIGGLETEQQYPYDGVQ-ETCNFEKSLSKVQIDDFMDIGE 256
Query: 262 DEDQMAANLVKHGPLA 277
DE+++A L +HGPL+
Sbjct: 257 DEEEIAEALEEHGPLS 272
>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
Length = 471
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/240 (40%), Positives = 141/240 (58%), Gaps = 13/240 (5%)
Query: 40 EQSEDHLLN-AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
++S H LN EH F+ F+ KF + Y T E RFR+FK NL+ + + +A +G
Sbjct: 152 KKSNHHNLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYG 211
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
+T+F+D+T E++ Q GL +R A + +P DLP +FDWR+ GA++ VK+QG
Sbjct: 212 ITEFADMTSPEYK-QRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGN 270
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TG +EG H + TG L SEQ+L+DCD + DS CNGGL ++A+
Sbjct: 271 CGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCD---------TSDSACNGGLPDNAY 321
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E I K GG+E E DYPY C F+ +KI V + +E +A L+ +GP++
Sbjct: 322 EAIEKIGGLELESDYPYHARK-DQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPIS 380
>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
Length = 458
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 101/239 (42%), Positives = 142/239 (59%), Gaps = 18/239 (7%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
ED ++ F F +++TY T+EE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 151 EDFVMQVASIFKEFVITYNRTYETKEEAQWRMSVFINNMMRAQKIQALDRGTARYGVTKF 210
Query: 102 SDLTPSEFRRQFLG-LNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGAC 158
SDLT EFR +L L + LR +++ P+ + P ++DWR+ GAVT VKDQG C
Sbjct: 211 SDLTEEEFRTIYLNPLLKELR----SKRMPLAMSVSGPAPPEWDWRNKGAVTKVKDQGMC 266
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+FS TG +EG FL G+L+SLSEQ+LVDCD D C GGL ++A+
Sbjct: 267 GSCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCDK---------LDKACLGGLPSNAYS 317
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I GG+E E DY Y G +C F K +++ +S +E ++AA L K+GP++
Sbjct: 318 AIKTLGGLETEDDYGYNG-HLQTCNFSAEKAKVYINDSVELSQNEQKLAAWLAKNGPIS 375
>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
Length = 473
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 134/227 (59%), Gaps = 11/227 (4%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F K+ K Y++QEE + R ++F+ NL+ A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 174 QFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEYGVTKFSDLTEEEFR 233
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L K P +DWRDHGAV+ VK+QG CGSCW+FS TG +
Sbjct: 234 STYLNPLLSQWTLHRGMKPAPPAKTPAPDSWDWRDHGAVSPVKNQGMCGSCWAFSVTGNI 293
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL G L+SLSEQ+LVDCD D C GGL ++A+E I K GG+E E
Sbjct: 294 EGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAYEAIEKLGGLESET 344
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY YTG C F K+AA +++ + DE ++AA L ++GP++
Sbjct: 345 DYSYTGHK-QKCDFTNRKVAAYINSSVELPKDEREIAAWLAENGPIS 390
>gi|6649577|gb|AAF21462.1|U69121_1 cysteine proteinase PWCP2 [Paragonimus westermani]
Length = 260
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 140/244 (57%), Gaps = 21/244 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 1 SARELYEQFKRXYGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTP 59
Query: 107 SEFRRQFLGLNRRLRLPADA-QKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF ++L P + Q + PT P DWR GAVT V++QG+CGSCW+
Sbjct: 60 EEFAAKYLSA------PVNNDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWA 113
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS G +EG F+ TG+LVSLS+QQLVDCD D GCNGG S++ I+
Sbjct: 114 FSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD---------GCNGGWPASSYLEIMHM 164
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG+E + DYPY G C +K ++ A + + + ED AA L +HGPL+ + +I
Sbjct: 165 GGLESQDDYPYAGVK-EQCFMEKERLLAKIDDSIALXPSEDDNAAYLAEHGPLSTLLNAI 223
Query: 284 ELPH 287
L +
Sbjct: 224 TLQY 227
>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 452
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 96/230 (41%), Positives = 129/230 (56%), Gaps = 14/230 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EFR +
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93
Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ R + + +P P DWR GAVT VKDQG+CGSCWSFSA G +E
Sbjct: 94 YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIGNIE 153
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
G + L SLSEQ LV CD + D+GC GG M++AFE+I+K +G V E
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDSK---------DNGCGGGFMDNAFEWIVKENSGKVYTE 204
Query: 230 KDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K YPY G + CK ++ A ++ I DED +A L +GP+A
Sbjct: 205 KSYPYVSGGGEEPPCKPRGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 254
>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 447
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 97/233 (41%), Positives = 129/233 (55%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 23 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 82
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DWR GAVT VKDQG+CGSCWSFSA G
Sbjct: 83 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 142
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD + D+GC GG M++AFE+I+K +G V
Sbjct: 143 NIEGQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKV 193
Query: 227 EREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
EK YPY DG C ++ A ++ I DED +A L +GP+A
Sbjct: 194 YTEKSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 246
>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
Length = 473
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 97/227 (42%), Positives = 140/227 (61%), Gaps = 13/227 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY++QEE + R R+F+ N++ A+ Q L+ +A +G+TKFSDLT EFR
Sbjct: 175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDEFRM 234
Query: 112 QFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L + + L + + A T +DWRDHGAV+ VK+QG CGSCW+FS TG +
Sbjct: 235 MYLNPMLSQWSLKKEMKPAIPASAPAPDT-WDWRDHGAVSPVKNQGMCGSCWAFSVTGNI 293
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG F TG+L+SLSEQ+LVDCD D C GGL ++A+E I GG+E E
Sbjct: 294 EGQWFKKTGQLLSLSEQELVDCD---------KLDQACGGGLPSNAYEAIENLGGLETET 344
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY YTG SC F K+AA +++ + DE ++AA L ++GP++
Sbjct: 345 DYSYTGHK-QSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVS 390
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 101/236 (42%), Positives = 132/236 (55%), Gaps = 19/236 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y ++ E +R ++F N + AK QL V G+ K++D+ E
Sbjct: 27 WQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADMLHHE 86
Query: 109 FRRQFLGLNRRLRLPADAQKA-----PILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCW 162
F+ G N +R AQ+ I P N +P DWR HGAVT VKDQG CGSCW
Sbjct: 87 FKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHCGSCW 146
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
SFS+TG+LEG HF G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 147 SFSSTGSLEGQHFRKAGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKD 199
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
GGV+ EK YPY G D SC F+K+ + A + F I DE+ M + GP+A
Sbjct: 200 NGGVDTEKSYPYEGID-DSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVA 254
>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
Length = 473
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 97/227 (42%), Positives = 140/227 (61%), Gaps = 13/227 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY++QEE + R R+F+ N++ A+ Q L+ +A +G+TKFSDLT EFR
Sbjct: 175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDEFRM 234
Query: 112 QFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L + + L + + A T +DWRDHGAV+ VK+QG CGSCW+FS TG +
Sbjct: 235 MYLNPMLSQWSLKKEMKPAIPASAPAPDT-WDWRDHGAVSPVKNQGMCGSCWAFSVTGNI 293
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG F TG+L+SLSEQ+LVDCD D C GGL ++A+E I GG+E E
Sbjct: 294 EGQWFKKTGQLLSLSEQELVDCD---------KLDQACGGGLPSNAYEAIENLGGLETET 344
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY YTG SC F K+AA +++ + DE ++AA L ++GP++
Sbjct: 345 DYSYTGHK-QSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVS 390
>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 454
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 98/233 (42%), Positives = 129/233 (55%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DW GAVT VKDQG CGSCWSFSA G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWGRKGAVTPVKDQGTCGSCWSFSAIG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG + L SLSEQ LV CD + D+GC GGLM++AFE+I+K +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
EK YPY G + CK K+ A ++ I DED +A L +GP+A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVA 254
>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
Length = 321
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 140/244 (57%), Gaps = 21/244 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
+A + FK + K YA +++ RF +FK NL RA++ QL D TA +GVT+FSDLTP
Sbjct: 22 SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTP 80
Query: 107 SEFRRQFLGLNRRLRLPADA-QKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF ++L P + Q + PT P DWR GAVT V++QG+CGSCW+
Sbjct: 81 EEFAAKYLSA------PVNNDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWA 134
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS G +EG F+ TG+LVSLS+QQLVDCD D GCNGG S++ I+
Sbjct: 135 FSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD---------GCNGGWPASSYLEIMHM 185
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG+E + DYPY G C +K ++ A + + + ED AA L +HGPL+ + +I
Sbjct: 186 GGLESQDDYPYAGVK-EQCFMEKERLLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAI 244
Query: 284 ELPH 287
L +
Sbjct: 245 TLQY 248
>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
Length = 344
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 101/283 (35%), Positives = 153/283 (54%), Gaps = 21/283 (7%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
++L V+AS N +A+I V + + + +L A +F F++K+ K YA E
Sbjct: 4 IILFFVFVVASGGLDNGVNAVIDYVAAAPHFKLQYNLERAPQYFETFQTKYKKVYADDNE 63
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR---LRLPADA 126
DYR+++FK NL + + +AV+ + KF+DLT +E +F GL + L+ D
Sbjct: 64 RDYRYKIFKTNLEIINLKNQQNDSAVYNINKFADLTKNEVIAKFTGLGVKSPNLKNFCD- 122
Query: 127 QKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
P++ P+ FDWR +T VKDQG CGSCW+FS LE + + E +
Sbjct: 123 ---PLIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAGLESQYAIKYNEHID 179
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQQLVDCD + D GC GGL+++A+E I+ GGVE E+DYPY G C+
Sbjct: 180 LSEQQLVDCD---------TIDMGCAGGLLHTAYEEIMSMGGVEYEEDYPYRSVQ-GPCR 229
Query: 244 FDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ K +V N + I ED++ L + GP+A V +++L
Sbjct: 230 IENDKFQVSVDNCYRYILYSEDKLKDVLHEMGPIAVAVDAVDL 272
>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
cathepsin; Flags: Precursor
gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
gi|225484|prf||1304284A cathepsin,prestalk
Length = 376
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 99/247 (40%), Positives = 140/247 (56%), Gaps = 16/247 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F+ + KF++ Y++ E + R+ +FK+N+ D V G+ F+D+T E+R+
Sbjct: 36 FTEWTLKFNRQYSSSEFSN-RYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRK 94
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+LG +L DL P DWR AVT +KDQG CGSCWSFS TG
Sbjct: 95 TYLGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTG 154
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+ EGAH L T +LVSLSEQ LVDC PEE + GC+GGLMN+AF+YI+K G++
Sbjct: 155 STEGAHALKTKKLVSLSEQNLVDCS---GPEE----NFGCDGGLMNNAFDYIIKNKGIDT 207
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
E YPYT G +C F+KS I A + + I++ + N +HGP++ +I+ H
Sbjct: 208 ESSYPYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVS---VAIDASHN 264
Query: 289 SFSFLFT 295
SF L+T
Sbjct: 265 SFQ-LYT 270
>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
Length = 451
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 100/231 (43%), Positives = 134/231 (58%), Gaps = 22/231 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + ++K+YA E R +F NL A++ Q LD +A +GVTKFSDLT EFR
Sbjct: 154 FKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRGSAEYGVTKFSDLTEEEFRT 213
Query: 112 QFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L L R P A + P P +DWRDHGAVTGVK+QGACGSCW+FS
Sbjct: 214 SYLNPLLSSLPGRALRPGPATRGPA------PASWDWRDHGAVTGVKNQGACGSCWAFSV 267
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG FL G L++LSEQ+LVDCD + D C GGL ++A+ I K GG+
Sbjct: 268 TGNVEGQWFLRRGALLALSEQELVDCD---------TLDQACGGGLPSNAYTAIEKLGGL 318
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E EKDY Y G C F K +++ +S DE+++A L ++GP++
Sbjct: 319 ETEKDYSYEGRK-ERCSFSPDKARVYINSSVDLSRDEEELATWLAENGPVS 368
>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 176 bits (447), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 109/288 (37%), Positives = 151/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VK QG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|340053969|emb|CCC48263.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 259
Score = 176 bits (447), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 96/230 (41%), Positives = 127/230 (55%), Gaps = 14/230 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EFR +
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93
Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ R + + +P P DWR GAVT VKDQG CGSCWSFSA G +E
Sbjct: 94 YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGNIE 153
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
G + L SLSEQ LV CD + D+GC GG M++AFE+I+K +G V E
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKVYTE 204
Query: 230 KDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K YPY DG C ++ A ++ I DED +A L +GP+A
Sbjct: 205 KSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 254
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 106/255 (41%), Positives = 138/255 (54%), Gaps = 23/255 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y + E +R ++F N + AK QL V G+ K++D+ E
Sbjct: 28 WQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADMLHHE 87
Query: 109 FRRQFLGLNRRLRLPADAQKAP------ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
F G N L A A I P + LP DWR+ GAVTGVKDQG CGSC
Sbjct: 88 FHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGHCGSC 147
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF TG L+SLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 148 WAFSSTGALEGQHFRKTGTLISLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 200
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY G D SC F+K I A F+ I DE ++A + GP++
Sbjct: 201 DNGGIDTEKSYPYEGID-DSCHFNKGTIGATDRGFTDIPQGDEKKLAQAVATIGPVS--- 256
Query: 281 ASIELPHISFSFLFT 295
+I+ H SF F T
Sbjct: 257 VAIDASHESFQFYST 271
>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
partial [Trypanosoma vivax Y486]
Length = 323
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/230 (41%), Positives = 127/230 (55%), Gaps = 14/230 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EFR +
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93
Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ R + + +P P DWR GAVT VKDQG CGSCWSFSA G +E
Sbjct: 94 YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGRCGSCWSFSAIGNIE 153
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
G + L SLSEQ LV CD + D+GC GG M++AFE+I+K +G V E
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKVYTE 204
Query: 230 KDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K YPY DG C ++ A ++ I DED +A L +GP+A
Sbjct: 205 KSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 254
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 107/279 (38%), Positives = 154/279 (55%), Gaps = 32/279 (11%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+I S+L+LL++ A+A R DG L ++ F + +K K+
Sbjct: 1 MIASTLILLVVVGATPFAIA--------RPAALEDGRA-----LEIKNMFEDWAAKHGKS 47
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR- 121
Y++ E R +F L ++ + T G+ KFSDLT +EFR +G +R R
Sbjct: 48 YSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRY 107
Query: 122 ---LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
LPA+ + + + LPT DWR GAVT +KDQG CGSCW+FSA ++E AHFL+T
Sbjct: 108 QDRLPAEDEDVDV---SSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLAT 164
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
ELVSLSEQQL+DCD + D+GC+GGLM +AF++++K GGV E YPYTG+
Sbjct: 165 KELVSLSEQQLMDCD---------TVDAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSV 215
Query: 239 GGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPL 276
GSC +K+K A ++ F V++ D V P+
Sbjct: 216 -GSCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPV 253
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 102/250 (40%), Positives = 145/250 (58%), Gaps = 20/250 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH---GVTKFSDLT 105
AE H++ FKS K+Y +E R +F+ NL + ++ + GV +F+D+T
Sbjct: 24 AEPHWNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADA--QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+EF LGL R ++ D+ + + + DLP + DW G VT VK+QG CGSCW+
Sbjct: 84 NTEFSNMLLGLGGRNKIAGDSVFESSHV---QDLPAEVDWTQKGYVTEVKNQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG+LEG F TG+LVSLSEQ LVDC + + GCNGGLM+ AF YI K
Sbjct: 141 FSTTGSLEGQVFKKTGKLVSLSEQNLVDCS-------TSEGNQGCNGGLMDQAFTYIKKN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
GG++ E YPYTG+D G+C+F ++K+ A VS F V S DE+ + + GP++ +
Sbjct: 194 GGIDTEAAYPYTGSD-GTCRFLENKVGATVSGFVDVKSGDENALKEAVATVGPIS---VA 249
Query: 283 IELPHISFSF 292
I+ I F F
Sbjct: 250 IDASSIFFQF 259
>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 109/279 (39%), Positives = 144/279 (51%), Gaps = 39/279 (13%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
RL + +L+ + S LA V D NA + FK K+ K
Sbjct: 2 RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
TY+ ++ + RF +FK NL RAKR Q ++ TA +GVT+FSDLT EF+ ++L R+R
Sbjct: 42 TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96
Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P D+ D FDWR+HGAV V DQG CGSCW+FS G + G F T
Sbjct: 97 FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L++LSEQ LVDCD+ D GC+GG I K GG+E DYPYTG
Sbjct: 157 GHLLALSEQPLVDCDY---------LDGGCDGGYPPQTNTAIQKMGGLELASDYPYTGV- 206
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG C DKSK A ++ +++ E A L GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 107/282 (37%), Positives = 153/282 (54%), Gaps = 34/282 (12%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
+I S+L+LL++ A+A R DG L ++ F + +K K
Sbjct: 4 NMIASTLILLVVVGATPFAIA--------RPAALEDGRA-----LEIKNMFEDWAAKHGK 50
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+Y++ E R +F L ++ + T G+ KFSDLT +EFR +G +R R
Sbjct: 51 SYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPR 110
Query: 122 ----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
LPA+ + + + LPT DWR GAVT +KDQG CGSCW+FSA ++E AHFL+
Sbjct: 111 YQDRLPAEDEDVDV---SSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLA 167
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
T ELVSLSEQQL+DCD + D+GC+GGLM +AF++++K GGV E YPYTG+
Sbjct: 168 TKELVSLSEQQLMDCD---------TVDAGCDGGLMETAFKFVVKNGGVTTEASYPYTGS 218
Query: 238 DGGSCKFDKSKI---AAAVSNFSVISSDEDQMAANLVKHGPL 276
GSC +K I A ++ F V++ D V P+
Sbjct: 219 V-GSCNANKVAIINKVAEITGFKVVTEDSADALMKAVSKTPV 259
>gi|302794759|ref|XP_002979143.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
gi|300152911|gb|EFJ19551.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
Length = 227
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 87/156 (55%), Positives = 113/156 (72%), Gaps = 17/156 (10%)
Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
AP+LPT++LP FDWR+HGA+T VK+QG+CGSCW+FS+TGA+EGAHFL + EL+SL E+Q
Sbjct: 1 APLLPTDNLPKSFDWREHGAMTPVKNQGSCGSCWTFSSTGAVEGAHFLKSRELISLREEQ 60
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS------- 241
LVDCD D GC GG M +A+EYI KA G+E E+DYPY +
Sbjct: 61 LVDCDRM---------DGGCKGGDMLNAYEYI-KAKGLEAEEDYPYQEENYKEYMFPHHR 110
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
C F SK+AA ++N+S +S DEDQ+AANLVK+GPL+
Sbjct: 111 CHFRPSKVAATIANYSTVSEDEDQIAANLVKNGPLS 146
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 100/260 (38%), Positives = 147/260 (56%), Gaps = 17/260 (6%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLT 105
+ +E F F + +++TYAT+EE + R +F+ NL + R+ T +GV +F+D++
Sbjct: 721 MRSERLFENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQYGVNQFADVS 780
Query: 106 PSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF +LGL LR + + +P +LP FDWR GAVT VK+QG CGSCW+F
Sbjct: 781 TEEFHAFYLGLRPDLRTENNIPLRQAEIPDIELPNSFDWRQKGAVTPVKNQGMCGSCWAF 840
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG + + +L+SLSEQ+LVDCD D GCNGGL ++A+ I K G
Sbjct: 841 SVTGNVEGQYAIKHNKLLSLSEQELVDCD---------DLDEGCNGGLPDNAYRAIEKLG 891
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA----GNV 280
G+E E DYPY + C F K+ V + I+S+E Q+A LV +GP++ N
Sbjct: 892 GLELESDYPYEA-ENERCHFKKNMAKVQVGSAVNITSNETQIAQWLVANGPISIGINANA 950
Query: 281 ASIELPHISFSFLFTVSSPK 300
+ +S F F + +PK
Sbjct: 951 MQFYMGGVSHPFKF-LCNPK 969
>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
Length = 410
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/242 (40%), Positives = 139/242 (57%), Gaps = 24/242 (9%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D L F F + +++TY T+EE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 103 QDFYLRMASLFKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQALDRGTAQYGVTKF 162
Query: 102 SDLTPSEFRRQFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
SDLT EFR +L L +++RL + P ++DWR GAVT VK+Q
Sbjct: 163 SDLTEEEFRTMYLNPLLKEELGKKMRLVK-------FVGDPAPPEWDWRKKGAVTKVKNQ 215
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FS TG +EG FL G+L+SLSEQ+LVDCD D C GGL ++
Sbjct: 216 GMCGSCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCD---------KVDKACMGGLPSN 266
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
A+ I GG+E E DY Y+G +C F K +++ +S +E ++AA L K+GP
Sbjct: 267 AYSAIKTLGGLETEDDYSYSG-HLQTCSFSAQKAKVYINDSVELSHNEQELAAWLAKNGP 325
Query: 276 LA 277
++
Sbjct: 326 IS 327
>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
Length = 467
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 100/237 (42%), Positives = 129/237 (54%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK ++ + Y + E +R VF+ NL AK +P A GVT FSDLT EFR
Sbjct: 37 QFADFKQRYGRVYKSAAEEAFRLSVFRKNLLDAKLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F +R R+P D + D P DWRD GAVT VKDQG CGSCW+F
Sbjct: 97 RHHSGAAHFAAGRKRARVPVD------VGVGDAPAAVDWRDRGAVTPVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK-- 222
SA G +EG FL+ L SLSEQ LV CD + DSGC+GGLMNSAFE+I++
Sbjct: 151 SAIGNVEGQWFLAGNALTSLSEQMLVSCD---------TMDSGCDGGLMNSAFEWIVEHH 201
Query: 223 AGGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E+ Y Y DG + C+ + A ++ + DE +MA L +GPLA
Sbjct: 202 NGTVYTEESYRYASGDGIAQPCRTSGRTVGAVITGHVKLPPDEAKMATWLAANGPLA 258
>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
[Strongylocentrotus purpuratus]
Length = 453
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 95/246 (38%), Positives = 137/246 (55%), Gaps = 23/246 (9%)
Query: 53 FSLFKSKFSKTYATQE---EHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSE 108
F F F + Y + E++YR+ VF N+ + Q TA +G TKF+D+T +E
Sbjct: 156 FDKFLMTFKREYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQGTAKYGPTKFADMTEAE 215
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
FR+ G ++ + +K +P +P ++DWR HGAVT VK+QG CGSCW+FSA G
Sbjct: 216 FRKLQSGPLKKTGI----KKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIG 271
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+EG + GEL+SLSEQ+LVDCD D GC GG M+ A+E I+K GG
Sbjct: 272 NMEGQWQIKKGELISLSEQELVDCD---------KVDGGCEGGEMSDAYEAIIKLGGAMS 322
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
E+ YPY G + CKF+ + + ++ + IS +E +MA L HGP+ SI + +
Sbjct: 323 EEKYPYRG-ENEKCKFNMTDVRVKINGYVNISKNETEMAGWLAAHGPI-----SIGINAL 376
Query: 289 SFSFLF 294
F F
Sbjct: 377 MMQFYF 382
>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 108/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 107/246 (43%), Positives = 144/246 (58%), Gaps = 20/246 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
F FK+ F K Y + EE RF +F NL R +H GV +F+DLT E
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEE 79
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+R+ +L L + Q+ + N DWR GAVT +K+QG CGSCWSFS TG
Sbjct: 80 YRQLYLRPYPTELLGRERQEVWLDGPN--AGSVDWRQKGAVTPIKNQGQCGSCWSFSTTG 137
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVE 227
++EGAH ++TG LVSLSEQQLVDC SGS + GCNGGLM++AF+YI+ GG++
Sbjct: 138 SVEGAHAIATGNLVSLSEQQLVDC--------SGSFGNQGCNGGLMDNAFKYIISNGGLD 189
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELP 286
E+DYPYT DG K +SK A ++S + V ++EDQ+AA V+ GP++ +IE
Sbjct: 190 TEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAA-AVEKGPVS---VAIEAD 245
Query: 287 HISFSF 292
SF
Sbjct: 246 QQSFQM 251
>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
Length = 450
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 108/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 450
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 108/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 451
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 108/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 103/248 (41%), Positives = 148/248 (59%), Gaps = 26/248 (10%)
Query: 21 AVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
V N D +IR +P+D +D LL + F+ + K K Y+ EE +RF V+K N
Sbjct: 21 GVVANGD--VIR--MPTD--VGKDQLLAGQ--FAAWAHKHGKVYSAAEERAHRFLVWKDN 72
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTND 136
L +R + + G+TKF+DLT EFRRQ+ G +RRL+ +A + ++
Sbjct: 73 LEYIQRHSEKNLSYWLGLTKFADLTNEEFRRQYTGTRIDRSRRLKKGRNATGSFRYANSE 132
Query: 137 LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC 196
P DWR+ GAVT VKDQG+CGSCW+FSA G++EG + + TG+ +SLS Q+LVDCD +
Sbjct: 133 APKSIDWREKGAVTSVKDQGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKK- 191
Query: 197 DPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF 256
+ GCNGGLM+ AF+++++ GG++ EKDYPY G DG + D +K+ A V
Sbjct: 192 -------YNQGCNGGLMDYAFDFVIQNGGIDTEKDYPYQGYDG---RCDVNKMNARV--- 238
Query: 257 SVISSDED 264
I S ED
Sbjct: 239 VTIDSYED 246
>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 389
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 95/230 (41%), Positives = 126/230 (54%), Gaps = 14/230 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EFR +
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93
Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ R + + +P P DWR GAVT VKDQG CGSCWSFSA G +E
Sbjct: 94 YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGNIE 153
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
G + L SLSEQ LV CD + D+GC GG M++AFE+I+K +G V
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKVYTG 204
Query: 230 KDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K YPY DG C ++ A ++ I DED +A L +GP+A
Sbjct: 205 KSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 254
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 131/235 (55%), Gaps = 21/235 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK K+ KTY ++ +YRF VFK NL RA + Q ++ TA +GVT+F DLT
Sbjct: 302 NARQLYEEFKQKYKKTYVNDDD-EYRFSVFKENLLRAHQLQTMEQGTAEYGVTQFFDLTS 360
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPIL-PTNDLPTD---FDWRDHGAVTGVKDQGACGSCW 162
EF+ Q+LG D Q + P+ + D FDWRDHGAV V DQG CGSCW
Sbjct: 361 QEFQIQYLGFKYE-----DMQDTEEMSPSTRVVMDEDSFDWRDHGAVGPVLDQGKCGSCW 415
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS G +EG FL TGEL+SLSEQQL+DCD + D GCNGG + ++K
Sbjct: 416 AFSTIGNIEGQWFLKTGELLSLSEQQLIDCD---------NVDEGCNGGYPPKTYGAVIK 466
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E DYPY C D+ K+ +++ V +E A L GPL+
Sbjct: 467 MGGLELNSDYPYKAL-AEKCHMDRQKLKVYINDSVVFPRNEHLQAEALKLMGPLS 520
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 66/139 (47%), Positives = 83/139 (59%), Gaps = 12/139 (8%)
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
+FDWR HGAV V +QG CGSCW+FSA G +EG FL +GEL+ LS QQ++DCDH
Sbjct: 42 NFDWRQHGAVGPVWNQGPCGSCWAFSAVGNIEGQWFLKSGELLHLSVQQVLDCDH----- 96
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
D GCNGG + + + GG++ + DY Y G C D+SK A V N SVI
Sbjct: 97 ----VDHGCNGGYPPQVYRQVNQMGGLQLDADYSYKAAV-GKCHTDRSKFRAYV-NSSVI 150
Query: 260 SSDEDQMAANLVKH-GPLA 277
S +Q AN +K GPLA
Sbjct: 151 LSQNEQFQANKLKTIGPLA 169
>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
Length = 475
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 103/263 (39%), Positives = 151/263 (57%), Gaps = 19/263 (7%)
Query: 22 VAVNDDDAM-IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
+ + DDD++ ++++ + + D+++ + F F + K Y+ + E RFR FK N
Sbjct: 142 IQLTDDDSITVQELRKAKIIRPRDYVI--WNSFLDFIDRHEKRYSNKREVLKRFRTFKKN 199
Query: 81 LRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI-LPT 134
+ + Q + TAV+G TKFSD+T EF++ L + AD +K I +
Sbjct: 200 AKAIRELQKNEQGTAVYGFTKFSDMTTMEFKQTMLPYQWEQPVYPMDQADFEKEGITISE 259
Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
DLP FDWRD GAVT VK+QG CGSCW+FS TG +EGA FL+ +LVSLSEQ+LVDCD
Sbjct: 260 EDLPESFDWRDKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFLAKNKLVSLSEQELVDCD- 318
Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
D GCNGGL ++A++ I++ GG+E E YPY G G +C + IA ++
Sbjct: 319 --------GVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGK-GETCHLVRKDIAVYIN 369
Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
+ DE +M LV GP++
Sbjct: 370 GSIELPHDEVEMQKWLVTKGPIS 392
>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 107/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+R+R K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRVR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 104/297 (35%), Positives = 162/297 (54%), Gaps = 18/297 (6%)
Query: 1 MERLILSSLLLLLLSSVLASA-VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSK 59
M+ + +++L L S++++ +++ + DA S D +NA + L K
Sbjct: 1 MKLIPMATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVK-- 58
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--- 116
KTY E D RF++FK NLR D T G+ KF+DLT E+R + G+
Sbjct: 59 HGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTI 118
Query: 117 -NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+++ + + + LP DWR+ GAVT VKDQG+CGSCW+FS TG++EG +
Sbjct: 119 DDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNK 178
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ TG+L+S+SEQ+LV+CD S + GCNGGLM+ AFE+I+K GG++ E+DYPYT
Sbjct: 179 IVTGDLISVSEQELVNCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYT 230
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
G DG K K+ + ++ + +++ V + P+A +IE F F
Sbjct: 231 GKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVA---VAIEAGGRDFQF 284
>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 290
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 96/242 (39%), Positives = 142/242 (58%), Gaps = 18/242 (7%)
Query: 62 KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y E + RF++FK NL+ + + D T G+T+F+DLT EFR +L +++
Sbjct: 53 KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110
Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
D+ K + D LP + DWR +GAV VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGEL+SLSEQ+LVDCD G ++GC+GG+MN AFE+I+K GG+E ++DYPY
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
D G C DK+ V+ + + D+++ V H P++ +IE +F +
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS---VAIEASSQAFQLYKS 280
Query: 296 VS 297
V+
Sbjct: 281 VN 282
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 173 bits (439), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 106/295 (35%), Positives = 157/295 (53%), Gaps = 26/295 (8%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD--------GEQSEDHLLNAEHHFSL 55
L+ +++ LL+ +S L DDD + P + E E+H NA F
Sbjct: 67 LVAAAVSLLVFASFLIQWQG--DDDRGVFPPSPVEDHKTPVNIWEWKEEHFQNA---FGS 121
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F++ + K+YAT+EE R+ +FK NL + + F DL+ EFRR++LG
Sbjct: 122 FRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGYSYSLKMNHFGDLSREEFRRKYLG 181
Query: 116 LNRRLRLPAD----AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
N+ L ++ A + + +D+P+ DWR+ G VT VKDQ CGSCW+FSATGALE
Sbjct: 182 YNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSATGALE 241
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
GAH TGEL+SLSEQ+LVDC + GC+GG MN AF+Y++ +GG+ E+
Sbjct: 242 GAHCAKTGELLSLSEQELVDCS-------LAEGNQGCSGGEMNDAFQYVVDSGGLCSEEG 294
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
YPY D G CK K+ +S F + + + H P++ + + +LP
Sbjct: 295 YPYLARD-GECKRACKKV-VTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLP 347
>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 96/236 (40%), Positives = 133/236 (56%), Gaps = 20/236 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97
Query: 110 RRQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
R +L + L+ P +K + T P DWR GAVT VKDQ CGSCW+FS
Sbjct: 98 RATYLNGAKYYAAALKRP---RKVVTVSTGKAPPAIDWRKKGAVTPVKDQRKCGSCWAFS 154
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
A G +EG ++ EL SLSEQ LV CD+ D GC GGLM+ A ++I+ +
Sbjct: 155 AIGNIEGQWKVAGHELTSLSEQMLVSCDNM---------DDGCQGGLMDRALKWIVSSNK 205
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E+ YPY TDG +KS + A +S + DE+ +A L K+GP+A
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIA 261
>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
Length = 366
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 91/229 (39%), Positives = 138/229 (60%), Gaps = 15/229 (6%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEF 109
+F F +F+K Y T++ ++ +FK+N+ AKR Q + TA++G T F+D+TP EF
Sbjct: 64 ENFKQFMVEFNKWYETEKLTAEKYNIFKSNMVIAKRLQEEEQGTAIYGPTIFADMTPEEF 123
Query: 110 RRQFLGLN-RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R+ L N ++ P ++ +P +++ DWR AVT VKDQG CGSCW+F
Sbjct: 124 RKTHLNFNPNNVKKP---KRMANIPKSNISERMDWRKFNAVTSVKDQGNCGSCWAFCTVA 180
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+EGA + T +L+SLSEQQLVDCD D GC GGL +A+ I++ GG+E+
Sbjct: 181 NIEGAWAVKTAQLISLSEQQLVDCDR---------LDDGCEGGLPVNAYLEIIRLGGLEK 231
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+DY YT G CKF+ +K A +++ V+ DED +A + ++GP+A
Sbjct: 232 EEDYKYTAR-SGKCKFNHTKSAVYINDTVVLPEDEDAIARYVSENGPVA 279
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 99/248 (39%), Positives = 141/248 (56%), Gaps = 20/248 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSE 108
++ +K++ K Y + EE R +++ NL + + L T G+ +F+DL E
Sbjct: 28 WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKNEE 87
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
F G R A+ + LP+N+ LP DWR G VT VKDQG CGSCW+FS
Sbjct: 88 FVAMMTGF-RVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAFS 146
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG+LEG HF +TG+LVSLSEQ LVDC + E GC+GGLM+ AF+YI+KAGG
Sbjct: 147 TTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNE-------GCDGGLMDQAFQYIIKAGG 199
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
++ E+ YPY D G C F K+ I A V+ ++ ++SD + V H GP++ +I+
Sbjct: 200 IDTEESYPYKAVD-GECHFKKANIGATVTGYTDVTSDSETALQKAVAHIGPIS---VAID 255
Query: 285 LPHISFSF 292
H+SF
Sbjct: 256 ASHMSFQL 263
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 134/230 (58%), Gaps = 15/230 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F KF + Y++ EE RFR++ N+ AK+ Q + TA++G TKFSD+T EF++
Sbjct: 159 FMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQK 218
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
L R+ ++ + L +LP+ FDWR G VT VKDQG+CGSCW+FS T
Sbjct: 219 IMLPSIWWDRVESNGITFNLNDFNLSIYNLPSKFDWRTEGVVTPVKDQGSCGSCWAFSVT 278
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G +E + TG+L+SLSEQ+L+DCD D GCNGGL +AF I + GG+E
Sbjct: 279 GNIESLWAIKTGKLISLSEQELIDCD---------VIDKGCNGGLPINAFREIKRMGGLE 329
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY + G+C +++IA ++ + I +E M A + + GPL+
Sbjct: 330 PEDQYPYEAKN-GTCHLVRAQIAVSIDDAVEIPRNETVMKAWIAQRGPLS 378
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 105/252 (41%), Positives = 139/252 (55%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLL---DPTAVHGVTKFSDLTPSE 108
+ FK + KTY + E +R ++F N + AK Q + T V K++D+ E
Sbjct: 27 WHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADMLHHE 86
Query: 109 FRRQFLGLN----RRLRL--PADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
FR G N + LR P+ I P + LP DWR+ GAVT VKDQG CGSC
Sbjct: 87 FRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQGHCGSC 146
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF TG LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 147 WAFSSTGALEGQHFRKTGTLVSLSEQNLVDC-------SAKYGNNGCNGGLMDNAFRYIK 199
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY G D SC F+K + A F+ I +E +MA + GP++
Sbjct: 200 DNGGIDTEKSYPYEGID-DSCHFNKDSVGATDRGFADIPQGNEKKMAEAVATIGPVS--- 255
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 256 VAIDASHESFQF 267
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 102/250 (40%), Positives = 135/250 (54%), Gaps = 21/250 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K + ++ E +R ++F N + AK QL V G+ K+SD+ E
Sbjct: 27 WQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDMLYHE 86
Query: 109 FRRQFLGLNRRLRLPADAQK----APILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
F+ G N +R AQ I P N +P DWR HGAVT VKDQG CGSCW+
Sbjct: 87 FKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCGSCWA 146
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS+T ALEG HF G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 147 FSSTAALEGQHFRKAGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDN 199
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
GG++ EK YPY G D SC F KS + A + F I DE+ + + GP++ +
Sbjct: 200 GGIDTEKSYPYEGID-DSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVS---VA 255
Query: 283 IELPHISFSF 292
I+ H SF
Sbjct: 256 IDASHESFQL 265
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 109/280 (38%), Positives = 145/280 (51%), Gaps = 40/280 (14%)
Query: 5 ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTY 64
+L+ + L ++SVLA VAV D L N EH FK F KTY
Sbjct: 140 VLTIEMRLYIASVLALVVAVGAD------------------LTNFEH----FKEHFGKTY 177
Query: 65 ATQEEHDYRFRVFKANLRRAKR---RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+EH R +F+ NL ++ + G+T+F+D++ +EFR+ +LGL
Sbjct: 178 EG-DEHALRQGIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAEFRQTYLGLRMNAS 236
Query: 122 LPADA---QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
A Q+ + DLP DWRD GAV+ VKDQG CGSCW+FS +GA+EG HFL
Sbjct: 237 TIAKLRKLQREVVADDRDLPEAVDWRDKGAVSPVKDQGQCGSCWAFSTSGAIEGQHFLKN 296
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GEL+SLSEQQ+VDC D GCNGG A EY+ GG+E E YPY G
Sbjct: 297 GELLSLSEQQMVDCSW---------LDFGCNGGQPMLAMEYVRFNGGLELETAYPYKGV- 346
Query: 239 GGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLA 277
GGSC DK AA ++ F + E + + K GP++
Sbjct: 347 GGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPIS 386
>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
Length = 881
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 107/260 (41%), Positives = 151/260 (58%), Gaps = 17/260 (6%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLT 105
+ E F F KF+KT+++ E RF++FK NL+ K Q + TA +GVT F+DLT
Sbjct: 570 IKYETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVTMFADLT 629
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
P EF+ ++LG L+ + A I ++ LP FDWRD+ AVT VKDQG CGSCW+F
Sbjct: 630 PKEFKTRYLGFRPELKQENEIPLAKIEVSDIFLPPKFDWRDYNAVTPVKDQGLCGSCWAF 689
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG +EG + + +L+SLSEQ+L+DCD + D GCNGG M +A++ I K G
Sbjct: 690 SVTGNVEGQYAIKYKKLLSLSEQELLDCD---------TLDEGCNGGYMENAYKAIEKLG 740
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA----GNV 280
G+E E DYPY G + C F K V I+S+E +MA L+K+GP++ N
Sbjct: 741 GLELESDYPYDGRN-EKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANA 799
Query: 281 ASIELPHISFSFLFTVSSPK 300
+ +S F F + +PK
Sbjct: 800 MQFYIGGVSHPFHF-LCNPK 818
>gi|56755191|gb|AAW25775.1| SJCHGC00511 protein [Schistosoma japonicum]
Length = 454
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/242 (39%), Positives = 145/242 (59%), Gaps = 19/242 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N ++ FK + K Y + +++ RF +FK+NL +A+ Q+L+ +AV+GVT +SDLT
Sbjct: 152 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 210
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L R A +++ I P D+P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 211 DEFSRTHLTAPWR----ASSKRNTISPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWA 266
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD S D GCNGGL ++A+E I++
Sbjct: 267 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRM 317
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG+ E +YPY + C + +AA +++ ++ DE ++A L H ++ + ++
Sbjct: 318 GGLMLEDNYPYDAKN-EKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNAL 376
Query: 284 EL 285
L
Sbjct: 377 LL 378
>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/236 (40%), Positives = 133/236 (56%), Gaps = 20/236 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97
Query: 110 RRQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
R +L + L+ P +K + T P DWR GAVT VKDQG CGSCW+FS
Sbjct: 98 RATYLNGAKYYAAALKRP---RKVVNVSTGKAPPAIDWRKKGAVTPVKDQGKCGSCWAFS 154
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
A G +EG ++ EL SLSEQ LV CD+ D GC GG ++ A ++I+ +
Sbjct: 155 AIGNIEGQWKVAGHELTSLSEQMLVSCDNM---------DYGCRGGFLDRALKWIVSSNK 205
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E+ YPY TDG +KS + A +S + DE+ +A L K+GP+A
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIA 261
>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
Length = 477
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 100/259 (38%), Positives = 150/259 (57%), Gaps = 18/259 (6%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
+DD ++++ + + D+++ + F F + K Y+ + E RFR FK N +
Sbjct: 148 HDDSITVQELRKAKIIRPRDYVI--WNSFLDFIDRHEKRYSNKREVLKRFRTFKKNAKVI 205
Query: 85 KRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI-LPTNDLP 138
+ Q + +AV+G TKFSD+T EF++ L + AD +K + + +DLP
Sbjct: 206 RELQKNEQGSAVYGFTKFSDMTTMEFKQTMLPYQWEQPVYPMAEADFEKEGVTISEDDLP 265
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWRDHGAVT VK+QG CGSCW+FS TG +EGA +L+ +LVSLSEQ+LVDCD
Sbjct: 266 DSFDWRDHGAVTQVKNQGNCGSCWAFSTTGNVEGAWYLAKKKLVSLSEQELVDCD----- 320
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
S D GCNGGL ++A++ I++ GG+E E YPY G G +C + IA ++
Sbjct: 321 ----SVDQGCNGGLPSNAYKEIMRMGGLEPEDAYPYDGK-GETCHIVRKDIAVYINGSVE 375
Query: 259 ISSDEDQMAANLVKHGPLA 277
+ DE ++ LV GP++
Sbjct: 376 LPHDEVKIQKWLVTKGPIS 394
>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/236 (40%), Positives = 131/236 (55%), Gaps = 20/236 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P E
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEL 97
Query: 110 RRQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
R +L + L+ P +K + T P DWR GAVT VKDQ CGSCW+FS
Sbjct: 98 RATYLNGAKYYAAALKRP---RKVVNVSTGKAPPAVDWRKKGAVTPVKDQRKCGSCWAFS 154
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
ATG +EG ++ EL SLSEQ LV CD+ D GC GGLM+ A ++I+ +
Sbjct: 155 ATGNIEGQWKVAGHELTSLSEQMLVSCDNM---------DDGCQGGLMDRALKWIVSSNK 205
Query: 224 GGVEREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E+ YPY TDG C + A +S + DE+ +A L K+GP+A
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNMSGKVVGAKISGHINLPKDENAIAEWLAKNGPVA 261
>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
Length = 444
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/233 (41%), Positives = 128/233 (54%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG CGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD D GC GGLM+ AF++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTN---------DFGCEGGLMDDAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G DKS + A + + + DE+ +A L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVA 261
>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
Length = 1834
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 141/259 (54%), Gaps = 35/259 (13%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
DDDA +R++ F F+ + YA+ EH+ RF +F+ NL + +
Sbjct: 1515 DDDAHVRRM------------------FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIE 1556
Query: 86 RRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD------AQKAPILPTNDLP 138
+ + TA +GVTKF+D+T +E+R GL A+ A + + DLP
Sbjct: 1557 QLNKFERGTAKYGVTKFADMTVAEYR-AHTGLVVPKHDRANHVGNRVASEEDVAGVGDLP 1615
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWRDHGAVT VK+QG+CGSCW+FSA G +EG H + T +L S SEQ+L+DCD
Sbjct: 1616 RSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCD----- 1670
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
D+GC GG M+ AF+ I + GG+E E DYPY SC F++S V
Sbjct: 1671 ----KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVD 1726
Query: 259 ISSDEDQMAANLVKHGPLA 277
+ +E +A L+K+GP+A
Sbjct: 1727 MPKNETYIAKYLIKNGPIA 1745
>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
Length = 953
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 141/259 (54%), Gaps = 35/259 (13%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
DDDA +R++ F F+ + YA+ EH+ RF +F+ NL + +
Sbjct: 634 DDDAHVRRM------------------FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIE 675
Query: 86 RRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD------AQKAPILPTNDLP 138
+ + TA +GVTKF+D+T +E+R GL A+ A + + DLP
Sbjct: 676 QLNKFERGTAKYGVTKFADMTVAEYRAH-TGLVVPKHDRANHVGNRVASEEDVAGVGDLP 734
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWRDHGAVT VK+QG+CGSCW+FSA G +EG H + T +L S SEQ+L+DCD
Sbjct: 735 RSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCD----- 789
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
D+GC GG M+ AF+ I + GG+E E DYPY SC F++S V
Sbjct: 790 ----KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVD 845
Query: 259 ISSDEDQMAANLVKHGPLA 277
+ +E +A L+K+GP+A
Sbjct: 846 MPKNETYIAKYLIKNGPIA 864
>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
Length = 325
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/247 (38%), Positives = 140/247 (56%), Gaps = 16/247 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
NA + FK + K YA +++ RF +FK NL RA++ Q + TA +GVT+FSDLT
Sbjct: 27 NARELYEQFKRDYGKAYANEDDQK-RFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDLTN 85
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF +LG R+ + + P DWR+ GAV V+ QG+CGSCW+FS
Sbjct: 86 EEFAAMYLGS----RIDERVDRVQLNDLQTAPASVDWREKGAVGPVEHQGSCGSCWAFSV 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
T +EG FL TG LVSLS+QQLVDCD D GC+GG ++ I + GG+
Sbjct: 142 TANVEGQWFLKTGRLVSLSKQQLVDCDR---------LDHGCSGGYPPYTYKEIKRMGGL 192
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
E + YPYTG + +C+ D+SK+ A + + V+ +E++ AA L +HGP++ + + L
Sbjct: 193 ELQSAYPYTGWE-QACRLDRSKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQ 251
Query: 287 HISFSFL 293
+ L
Sbjct: 252 FYRYGIL 258
>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
Length = 1810
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 141/259 (54%), Gaps = 35/259 (13%)
Query: 26 DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
DDDA +R++ F F+ + YA+ EH+ RF +F+ NL + +
Sbjct: 1491 DDDAHVRRM------------------FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIE 1532
Query: 86 RRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD------AQKAPILPTNDLP 138
+ + TA +GVTKF+D+T +E+R GL A+ A + + DLP
Sbjct: 1533 QLNKFERGTAKYGVTKFADMTVAEYR-AHTGLVVPKHDRANHVGNRVASEEDVAGVGDLP 1591
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
FDWRDHGAVT VK+QG+CGSCW+FSA G +EG H + T +L S SEQ+L+DCD
Sbjct: 1592 RSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCD----- 1646
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
D+GC GG M+ AF+ I + GG+E E DYPY SC F++S V
Sbjct: 1647 ----KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVD 1702
Query: 259 ISSDEDQMAANLVKHGPLA 277
+ +E +A L+K+GP+A
Sbjct: 1703 MPKNETYIAKYLIKNGPIA 1721
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 14/225 (6%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K+Y E D RF +FK NL+ L+ T G+T+F+DLT E+R +FLG
Sbjct: 61 KHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKI 120
Query: 117 --NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
NRR++ ++ P + LP DWR GAV GVKDQ +CGSCW+FSA A+EG
Sbjct: 121 DPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEG 180
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+ + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E DY
Sbjct: 181 INKIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIISNGGIDSEDDY 232
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
PY DG + K+ + ++ + + ++ V + P+A
Sbjct: 233 PYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIA 277
>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/233 (39%), Positives = 130/233 (55%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG CGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CDP E C GG M++AF +I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQTLVS----CDPTE-----YACEGGFMDNAFRWIISSNKGKV 208
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY+ G + +C + A +S++ + DE+ +A L K+GP++
Sbjct: 209 FTEQSYPYSSGGRNVPACNMSGKVVGANISDYVDLPQDENAIAEWLAKNGPVS 261
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/249 (38%), Positives = 144/249 (57%), Gaps = 23/249 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRR 111
++ + +K SKTY E + RF +FK NLR + + T G+T+F+DLT E+R
Sbjct: 48 YNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRA 107
Query: 112 QFLGLN----RRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+FLG RRL + +Q+ + LP DWR GAV+ +KDQG+CGSCW+FS
Sbjct: 108 KFLGTKSDPKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFST 167
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
A+EG + + TGEL+SLSEQ+LVDCD S ++GCNGGLM++AF++I+ GG+
Sbjct: 168 IAAVEGVNKIVTGELISLSEQELVDCDR--------SYNAGCNGGLMDNAFQFIINNGGI 219
Query: 227 EREKDYPYTGTDGGSCKFDKSKI---AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
+ +KDYPY DG K D +K+ A + F + + ++ V H P++ +I
Sbjct: 220 DTDKDYPYQAVDG---KCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVS---VAI 273
Query: 284 ELPHISFSF 292
E ++ F
Sbjct: 274 EASGMALQF 282
>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 380
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 96/233 (41%), Positives = 128/233 (54%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG CGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD D GC GGLM+ AF++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTN---------DFGCEGGLMDDAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G DKS + A + + + DE+ +A L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPACDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVA 261
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 92/239 (38%), Positives = 134/239 (56%), Gaps = 15/239 (6%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--L 116
K K+Y E + RF++FK NLR T G+ +F+DLT E+R +LG
Sbjct: 52 KHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYRSMYLGART 111
Query: 117 NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
R RL + +P LP DWR+ GAV GVKDQG+CGSCW+FS A+EG +
Sbjct: 112 GSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIAAVEGIN 171
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ E+DYPY
Sbjct: 172 QIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPY 223
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFL 293
DG ++ K+ + ++ + + +Q V + P++ +IE ++F F
Sbjct: 224 NARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVS---VAIEASGMAFQFY 279
>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 93/233 (39%), Positives = 127/233 (54%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG CGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG--GV 226
+EG ++ L SLSEQ LV CD E D GC GGLM++AF++I+ + V
Sbjct: 158 NIEGQWKVTGHNLTSLSEQMLVSCDTE---------DLGCAGGLMDNAFKWIVSSNRHNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G C+ + A + + + DE+ +A L K+GP+A
Sbjct: 209 FTEESYPYASKGGNVPPCRMSGKVVGAKIRDHVDLPKDENAIAEWLAKNGPVA 261
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 92/222 (41%), Positives = 134/222 (60%), Gaps = 15/222 (6%)
Query: 62 KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y E + RF++FK NL+ + + D T G+T+F+DLT EFR +L +++
Sbjct: 53 KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110
Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
D+ K + D LP + DWR +GAV VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGEL+SLSEQ+LVDCD G ++GC+GG+MN AFE+I+K GG+E ++DYPY
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLA 277
D G C DK+ V+ + + D+++ V H P++
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS 265
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 92/222 (41%), Positives = 134/222 (60%), Gaps = 15/222 (6%)
Query: 62 KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y E + RF++FK NL+ + + D T G+T+F+DLT EFR +L +++
Sbjct: 53 KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110
Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
D+ K + D LP + DWR +GAV VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGEL+SLSEQ+LVDCD G ++GC+GG+MN AFE+I+K GG+E ++DYPY
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLA 277
D G C DK+ V+ + + D+++ V H P++
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS 265
>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
Length = 1165
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 100/253 (39%), Positives = 141/253 (55%), Gaps = 29/253 (11%)
Query: 37 SDGE----QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP 92
SDGE + EDH A H F FK K S+ Y + EH+ RFR+FK NL + ++ +
Sbjct: 841 SDGEGHYSKGEDH---ARHLFEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQ 897
Query: 93 -TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ-------KAPILPTNDLPTDFDWR 144
TA +G+T F+D+T +E+R Q GL +P D KA I +LP FDWR
Sbjct: 898 GTAKYGITHFADMTSAEYR-QRTGL----VIPRDEDRNHVGNPKAEIDENMELPESFDWR 952
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
+ GAV+ VK+QG CGSCW+FS G +EG H + T L SEQ+L+DCD +
Sbjct: 953 ELGAVSPVKNQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCD---------AV 1003
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDED 264
DS C GG M+ A++ I K GG+E E +YPY +C F+ +++ V + +E
Sbjct: 1004 DSACQGGYMDDAYKAIEKIGGLELESEYPYLAKKQKTCHFNSTEVHVRVKGAVDLPKNET 1063
Query: 265 QMAANLVKHGPLA 277
MA LV +GP++
Sbjct: 1064 AMAQYLVANGPIS 1076
>gi|52546916|gb|AAU81591.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 190
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 82/120 (68%), Positives = 98/120 (81%), Gaps = 4/120 (3%)
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
ELVSLSEQQLVDCDHECDPEE SCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD
Sbjct: 3 ELVSLSEQQLVDCDHECDPEEKDSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 62
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
CKFD +K+AA V+NFSV+S DE+Q+AANLVK+GPLA + ++ + +++ VS P
Sbjct: 63 AKCKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQ----TYVGGVSCP 118
>gi|21218381|gb|AAM44058.1|AF510740_1 cathepsin L1 [Schistosoma japonicum]
Length = 317
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 95/234 (40%), Positives = 140/234 (59%), Gaps = 19/234 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
N ++ FK + K Y + +++ RF +FK+NL +A+ Q+L+ +AV+GVT +SDLT
Sbjct: 15 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 73
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L R A +++ I P D+P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 74 DEFSRTHLTAPWR----ASSKRNTIPPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWA 129
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD S D GCNGGL ++A+E I++
Sbjct: 130 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRM 180
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+ E +YPY + C +AA +++ ++ DE ++A L H ++
Sbjct: 181 GGLMLEDNYPYDAKN-EKCHLKVGNVAAYINSSVNLTQDESELAIWLYHHSAIS 233
>gi|238683695|gb|ACR54126.1| cathepsin L [Palaemonetes varians]
Length = 248
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 102/247 (41%), Positives = 134/247 (54%), Gaps = 20/247 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDL 104
A + FK K Y+ +E YR +F+ NLR + R + T + +F D+
Sbjct: 15 ASESWDSFKLTHGKAYSNAKEELYRKTIFENNLRFVEEHNARFHNGEVTFNVAMNRFGDM 74
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
T EF Q GL + Q P D DWR GAVTGVKDQG CGSCWSF
Sbjct: 75 TTEEFVAQMTGLTKLEDTVG--QVFAHFPDAPRAADVDWRSKGAVTGVKDQGQCGSCWSF 132
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SATGALEGAHF+ TG L SLSEQQLVDC E +SGCNGG++ A++Y+ G
Sbjct: 133 SATGALEGAHFIKTGSLPSLSEQQLVDCSTE---------NSGCNGGVVQWAYDYLKSCG 183
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASI 283
G + E YPY D +C+FD SK+AA V ++ I +DE A+ + GP++ +
Sbjct: 184 GSQTESSYPYEAAD-RTCRFDSSKVAATVRGYTNIPYADEQTQASAVHDKGPVS---VCV 239
Query: 284 ELPHISF 290
+ H+SF
Sbjct: 240 DAGHLSF 246
>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
Length = 477
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 101/268 (37%), Positives = 148/268 (55%), Gaps = 36/268 (13%)
Query: 25 NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
+DD ++++ + + D+++ + F F + K Y + E RFRVFK N +
Sbjct: 148 HDDSITVQELRKAKIIRPRDYVI--WNSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVI 205
Query: 85 KRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-------- 135
+ Q + TAV+G TKFSD+T EF++ + LP ++ P+ P
Sbjct: 206 RELQKNEQGTAVYGFTKFSDMTTMEFKK--------IMLPYQWEQ-PVYPMEQANFEKHD 256
Query: 136 ------DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
DLP FDWR+ GAVT VK+QG CGSCW+FS TG +EGA F++ +LVSLSEQ+L
Sbjct: 257 VTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQEL 316
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCD S D GCNGGL ++A++ I++ GG+E E YPY G G +C + I
Sbjct: 317 VDCD---------SMDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGR-GETCHLVRKDI 366
Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLA 277
A ++ + DE +M LV GP++
Sbjct: 367 AVYINGSVELPHDEVEMQKWLVTKGPIS 394
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 98/249 (39%), Positives = 134/249 (53%), Gaps = 19/249 (7%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTP 106
H+ L+K +K Y+ EEH R ++ NL++ + L VH G+ K++D+T
Sbjct: 26 QHWKLWKEANNKRYSDAEEH-VRRATWEGNLQKVQEHNLQADLGVHTYWLGMNKYADMTV 84
Query: 107 SEFRRQFLGLNRRLR--LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+EF + G N +R D LP DWRD G VT VKDQG CGSCW+F
Sbjct: 85 TEFVKVMNGYNATMRGQRTQDRHTFSFNSKIALPDTVDWRDKGYVTDVKDQGQCGSCWAF 144
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TGALEG HF TG+LVSLSEQ LVDC + + GCNGGLM+ AFEYI +
Sbjct: 145 STTGALEGQHFKQTGKLVSLSEQNLVDCSGK-------QGNMGCNGGLMDQAFEYIKENN 197
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASI 283
G++ E YPY D C+F + + A + F+ I+S DE + + GP++ +I
Sbjct: 198 GIDTEDSYPYEAVD-NQCRFKAANVGATDTGFTDITSKDESALQQAVATVGPIS---VAI 253
Query: 284 ELPHISFSF 292
+ H SF
Sbjct: 254 DAGHTSFQL 262
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 107/252 (42%), Positives = 137/252 (54%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y E +R ++F N + AK Q V V K++DL E
Sbjct: 29 WHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADLLHHE 88
Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
FR+ G N ++LR D+ K I P + LP DWR GAVT VKDQG CGSC
Sbjct: 89 FRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 148
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY D SC F+K I A F+ I DE +MA + GP+A
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVA--- 257
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 258 VAIDASHESFQF 269
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 106/252 (42%), Positives = 137/252 (54%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y E +R ++F N + AK Q V V K++DL E
Sbjct: 29 WHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 88
Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
FR+ G N ++LR D+ K I P + LP DWR GAVT VKDQG CGSC
Sbjct: 89 FRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGHCGSC 148
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY D SC F+K I A F+ I DE +MA + GP++
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 257
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 258 VAIDASHESFQF 269
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 88/226 (38%), Positives = 129/226 (57%), Gaps = 8/226 (3%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F + K K Y++ EEH +R+ V+K NL +R + + G+TKF+D+T EFRR
Sbjct: 45 QFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNRSYWLGLTKFADITNDEFRR 104
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
Q+ G + + ++ P DWR GAVT VKDQG+CGSCW+FSA G++E
Sbjct: 105 QYTGTRIDRSKRSKRKTGFRYADSEAPESVDWRKKGAVTTVKDQGSCGSCWAFSAIGSVE 164
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TGE VSLSEQ+LVDCD E + GCNGGLM+ AF++IL+ GG++ E D
Sbjct: 165 GINAIRTGEAVSLSEQELVDCDLE--------YNQGCNGGLMDYAFDFILENGGIDTEND 216
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY G DG K+ + + + ++++ V P++
Sbjct: 217 YPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVS 262
>gi|343412631|emb|CCD21595.1| hypothetical protein, conserved in T. vivax [Trypanosoma vivax
Y486]
Length = 257
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 96/233 (41%), Positives = 125/233 (53%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F+ FK K+ ++Y T E +R RVF+ N+RR++ +P A GVT FSDLTP EF
Sbjct: 11 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 70
Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R ++ R + + +P P DWR GAVT VKDQG+CGSCWSFSA G
Sbjct: 71 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 130
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG + L SLSEQ LV CD + D GC GG M++AF I+K G
Sbjct: 131 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DKGCGGGFMDNAFYSIVKENIGKE 181
Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
EK YPY G + CK K+ A ++ I DED +A L +GP+A
Sbjct: 182 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVA 234
>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
Length = 358
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 110/280 (39%), Positives = 152/280 (54%), Gaps = 22/280 (7%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSL 55
+ ILSS++L++L + A+A D+ IR V SDG E+S +L H F+
Sbjct: 4 KTILSSVVLVVLFAASAAANIGFDESNPIRMV--SDGLREVEESVSQILGQSRHVLSFAR 61
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F ++ K Y EE RF +FK NL + + GV +F+DLT EF+R LG
Sbjct: 62 FTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLG 121
Query: 116 LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+ A + + + LP DWR+ G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 122 AAQNC--SATLKGSHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYH 179
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ EK YPYT
Sbjct: 180 QAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYT 232
Query: 236 GTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
G D +CKF + V N ++ + DE + A LV+
Sbjct: 233 GKD-ETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVR 271
>gi|226468424|emb|CAX69889.1| Temporarily Assigned Gene name [Schistosoma japonicum]
Length = 454
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 95/242 (39%), Positives = 144/242 (59%), Gaps = 19/242 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N ++ FK + K Y + +++ RF +FK+NL +A+ Q+L+ +AV+GVT +SDLT
Sbjct: 152 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 210
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L R A +++ I P D+P +FDWR GAVT VK+QG CGSCW+
Sbjct: 211 DEFSRTHLTAPWR----ASSKRNTISPRREVGDIPNNFDWRKKGAVTEVKNQGMCGSCWA 266
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD + D GCNGGL ++A+E I++
Sbjct: 267 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------NLDDGCNGGLPSNAYESIIRM 317
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG+ E +YPY + C + +AA +++ ++ DE ++A L H ++ + ++
Sbjct: 318 GGLMLEDNYPYDAKN-EKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNAL 376
Query: 284 EL 285
L
Sbjct: 377 LL 378
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 170 bits (430), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 98/228 (42%), Positives = 136/228 (59%), Gaps = 19/228 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRR 111
FKS +SK+Y ++ R F+ANL + +H GV +F+DLT EF
Sbjct: 1 FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
++ +P + P + + DWR GAVT +K+QG CGSCWSFS TG+ E
Sbjct: 61 LYVPSKFNRTMPYNTVYLPATSEDSV----DWRTKGAVTPIKNQGQCGSCWSFSTTGSTE 116
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVEREK 230
GAH ++TG LVSLSEQQLVDC SGS + GCNGGLM+ AF+YI+ G++ E+
Sbjct: 117 GAHAIATGNLVSLSEQQLVDC--------SGSFGNQGCNGGLMDDAFKYIISNKGLDTEE 168
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLA 277
DYPYT DG K ++K AA +S++S V ++EDQ+AA + K GP++
Sbjct: 169 DYPYTAQDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAK-GPVS 215
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 103/268 (38%), Positives = 148/268 (55%), Gaps = 36/268 (13%)
Query: 46 LLNAEHH--FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
L+N ++ ++ FK K +K+Y T++E RF+VF +N + ++ + H +
Sbjct: 34 LINHPYYPVWTNFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLN 93
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPAD---AQKAPI--------LPTN-DLPTDFDWRDHG 147
KF+D+T +EFR++ G +LPA A+ P+ +P N +P DWR G
Sbjct: 94 KFADMTNAEFRQRMNGF----KLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEG 149
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
VT VKDQG+CGSCW+FSATG+LEG H+ TG+LVSLSEQ LVDCD D D G
Sbjct: 150 YVTKVKDQGSCGSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGD-------DEG 202
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQM 266
CNGG M+ AF+Y+ G++ E YPY G D G C+F + A + F I +E +
Sbjct: 203 CNGGYMDGAFQYVETNKGIDTEASYPYKGRD-GRCRFKSEDVGATDTGFVDIPEGNETLL 261
Query: 267 AANLVKHGPLAGNVASIELPHISFSFLF 294
A + GP+ S+ + SF F F
Sbjct: 262 EAAIATVGPV-----SVAIDAASFKFQF 284
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 106/252 (42%), Positives = 137/252 (54%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y E +R ++F N + AK Q V V K++DL E
Sbjct: 29 WHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 88
Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
FR+ G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSC
Sbjct: 89 FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 148
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY D SC F+K I A F+ I DE +MA + GP+A
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVA--- 257
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 258 VAIDASHESFQF 269
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 98/230 (42%), Positives = 131/230 (56%), Gaps = 17/230 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV----HGVTKFSDLTPSE 108
F FK K KTY Q E RF +FK NLR ++ +L + G+ +F+D+T E
Sbjct: 25 FQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQEE 84
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
FR FL L+ + P +L +P DWR G VTGVKDQG CGSCW+FS TG
Sbjct: 85 FR-AFLTLSSSKK-PHFNTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTG 142
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+ E A++ G+LVSLSEQQLVDC S ++GCNGG ++ F Y+ K+ G+E
Sbjct: 143 STEAAYYRKAGKLVSLSEQQLVDC--------STDINAGCNGGYLDETFTYV-KSKGLEA 193
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
E YPY GTD GSCK+ SK+ VS S+ S DE+ + + GP++
Sbjct: 194 ESTYPYKGTD-GSCKYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVS 242
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 103/280 (36%), Positives = 148/280 (52%), Gaps = 21/280 (7%)
Query: 19 ASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFK 78
AS ++ D DA P + +DH ++ F F+ +K YAT+EE R+ +FK
Sbjct: 61 ASPSSITDGDAKY----PEKIWEWKDHHFQSQ--FYQFQRDHNKFYATEEERLKRYAIFK 114
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR-RLRLPADAQKAPI--LPTN 135
NL + + V + KF DLT EFR+++LG + LR P + + N
Sbjct: 115 NNLTYIHNHNMQGYSYVLKMNKFGDLTLEEFRQRYLGYKKPDLRTPPREVDTTLESVEDN 174
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
D+PT DWR G VT VKDQG CGSCW+FSATGA+EG + TG+LV+LS+QQLVDC
Sbjct: 175 DIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRF 234
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
+ GC+GG M AFEY+++ GG+ ++YPY D G CK + A ++
Sbjct: 235 LG-------NQGCDGGRMEEAFEYVVENGGICSGENYPYMRKD-GVCKSSQCTSVATITG 286
Query: 256 F-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLF 294
+ SV E M L P++ +I+ +F F +
Sbjct: 287 YRSVPRRSEKSMKTALALRSPVS---VAIQANQAAFQFYY 323
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 169 bits (429), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 111/283 (39%), Positives = 156/283 (55%), Gaps = 28/283 (9%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSL 55
+ IL S++L++L + A+A D+ IR V SDG E+S +L H F+
Sbjct: 4 KTILPSVVLVILIAASAAADIGFDESNPIRMV--SDGLREIEESVVQILGQSRHVLSFAR 61
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F ++ K Y EE RF +FK NL R +++L + GV +F+DLT EF+R
Sbjct: 62 FTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNKKRL---SYKLGVNQFADLTWQEFQRN 118
Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
LG + A + + L LP DWR+ G V+ VKDQG CGSCW+FS TGALE
Sbjct: 119 KLGAAQNC--SATLKGSHKLTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEA 176
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
A+ + G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ Y
Sbjct: 177 AYHQAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAY 229
Query: 233 PYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
PYTG D G+CK+ + V N ++ + DE + A LV+
Sbjct: 230 PYTGKD-GTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLVR 271
>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
Length = 605
Score = 169 bits (429), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 94/238 (39%), Positives = 140/238 (58%), Gaps = 19/238 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP---TAVHGVTKFS 102
L +H F +F+ K+ + YA EH R R+F+ NLR + +L D +A +G+T+F+
Sbjct: 292 LNKVDHLFHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQ--ELNDNEQGSAKYGITEFA 349
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGS 160
D+T SE+ Q GL +R K ++P +LP +FDWR+ AVT VK+QG+CGS
Sbjct: 350 DMTSSEYT-QRAGLWQRSANKPTGGKPAVVPAYKGELPKEFDWREKNAVTQVKNQGSCGS 408
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG + + TGEL SEQ+L+DCD S DS CNGGLM++A++ I
Sbjct: 409 CWAFSVTGNIEGLYAIKTGELREFSEQELLDCD---------STDSACNGGLMDNAYKAI 459
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
GG+E E +YPY C F+K+ V++F + +E M L+ +GP++
Sbjct: 460 KDIGGLEYESEYPYLAKK-KQCHFNKTLSHVQVADFVDLPKGNETAMQEWLLANGPIS 516
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 100/241 (41%), Positives = 137/241 (56%), Gaps = 17/241 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + K + Y+ +E D R++ FK N+ + + V G+TKF+DLT E+++
Sbjct: 33 FIGWMRKHDRAYSHEEFTD-RYQAFKENMDFIHKWNSQESDTVLGLTKFADLTNEEYKKH 91
Query: 113 FLGL--NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+LG+ N + L A AQK P DWR+ GAV+ VKDQG CGSCWSFS TGA+
Sbjct: 92 YLGIKVNVKKNLNA-AQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWSFSTTGAV 150
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EGAH + +G +VSLSEQ LVDC + + GC GGLM +AFEYI+ GG+ E
Sbjct: 151 EGAHQIKSGNMVSLSEQNLVDCSGQYG-------NQGCEGGLMVNAFEYIIDNGGIATES 203
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHIS 289
YPYT G CKF KS A + + I +ED + A L K P++ +I+ H+S
Sbjct: 204 SYPYTAAQ-GRCKFTKSMNGANIIGYKEIPQGEEDSLTAALAKQ-PVS---VAIDASHMS 258
Query: 290 F 290
F
Sbjct: 259 F 259
>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
Length = 347
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 105/289 (36%), Positives = 153/289 (52%), Gaps = 52/289 (17%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
+++LI++ LLL+ L+S S ++ E F F+ K+
Sbjct: 2 IKKLIVAILLLVALASARTSNLSF------------------------EETQFREFQLKY 37
Query: 61 SKTYATQEEHDY--RFRVFKANLRRAK------RRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+K Y E H++ + FK +L+R + +R +D GV KF+DL+ EF
Sbjct: 38 NKHY---ESHEFAQKLATFKNSLKRIQELNDMAKRAKVDTE--FGVNKFADLSKEEFANY 92
Query: 113 FLGLNRRLRLPADAQK-APILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L N+ D++ AP ++LPT FDWR GAVT VKDQG CGSCWSFS TG
Sbjct: 93 YL--NKGGMESTDSETYAPDYSDKEISNLPTSFDWRTQGAVTPVKDQGQCGSCWSFSTTG 150
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+EG FL+ +L LSEQ LVDC + D GCNGGLM A++YI++ G++
Sbjct: 151 NVEGQWFLAGNDLTGLSEQNLVDCSTKND---------GCNGGLMPLAYDYIVENNGIDT 201
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +C+F+ + I A + + +SS+E QM NLV +GPL+
Sbjct: 202 EASYPYLAIQQKNCQFNPANIGAKIDGYYNVSSNETQMQINLVNNGPLS 250
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 103/253 (40%), Positives = 140/253 (55%), Gaps = 20/253 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H++L+K SK Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 25 DEHWNLWKDWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHTYSLGMNHFGDMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G +L+ + + + N L P DWRD G VT VKDQG CGSCW+
Sbjct: 84 HEEFRQIMNGY--KLKSQRKLRGSLFMEPNFLEAPRSVDWRDKGYVTPVKDQGQCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG HF TG LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 142 FSTTGAMEGQHFRKTGTLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
GG++ E+ YPY GTD G C +D S +A + F V S E + + GP++ +
Sbjct: 195 GGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSGSERALMKAVASVGPVS---VA 251
Query: 283 IELPHISFSFLFT 295
I+ H SF F +
Sbjct: 252 IDAGHESFQFYHS 264
>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
Length = 1785
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 136/234 (58%), Gaps = 22/234 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F FK + YA+ EH+ R+ +F+ NL + + + T +GVTKF+D+T +E+R
Sbjct: 1477 QFEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERGTGKYGVTKFADMTTAEYR 1536
Query: 111 RQFLGLNRRLRLP---ADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+ L +P ++ + PI + LPT FDWRDHGAVTGVK+QG CGSCW+
Sbjct: 1537 -----AHTGLIVPKQHSNHIRNPIATVSTERTSLPTSFDWRDHGAVTGVKNQGNCGSCWA 1591
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSA G +EG H + T +L + SEQ+L+DCD + D+GCNGG M+ AF+ I K
Sbjct: 1592 FSAIGNIEGLHQIKTKKLEAYSEQELIDCD---------TVDNGCNGGYMDDAFKAIEKL 1642
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E +YPY +C F+K+ V + +E +A L+++GP+A
Sbjct: 1643 GGLELEDEYPYQAKAQKTCHFNKTLSHVRVKGAVDMPKNETFIAQYLIENGPIA 1696
>gi|300121328|emb|CBK21708.2| unnamed protein product [Blastocystis hominis]
Length = 318
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 100/229 (43%), Positives = 130/229 (56%), Gaps = 13/229 (5%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
+L N+E F+ + SK+ KTYA EE YR RVF NL + K + GV KF+D+
Sbjct: 16 NLRNSE--FTSYMSKYGKTYAAPEEARYRLRVFNDNLLKIKEHNAKNLPWTLGVNKFADV 73
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ EF +F G + + Q + D+P DWR+ GAVT VK+QG CGSCW+F
Sbjct: 74 SAEEFAYKFCGCAKDPKTRGTRQTTLV---GDVPARVDWREQGAVTPVKNQGMCGSCWAF 130
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG EGA+FL TG LVSLSEQQLVDC DPE + GC+GG SA +Y+ K
Sbjct: 131 STTGTTEGAYFLKTGNLVSLSEQQLVDCAR--DPEYE---NFGCSGGWPWSAVDYVTKH- 184
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAA-AVSNFSVISSDEDQMAANLVK 272
G+ E+DYPY G D CK K+A +V + DED +A + K
Sbjct: 185 GLCTEEDYPYKGVD-AECKESSCKVAVQSVDKVQLPVGDEDSLAVAVSK 232
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/239 (40%), Positives = 146/239 (61%), Gaps = 18/239 (7%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIR--QVVPSDG-EQSEDHLLNAEHHFSLFKSKFSKTY 64
S+L L +V+++A A +D ++I Q P+ G +SED + + F + K K+Y
Sbjct: 5 SILFTFLFAVVSAAAAAAEDMSIITYDQQHPAKGLVRSEDEV---KEMFESWLVKHGKSY 61
Query: 65 ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLGLNR---RL 120
+E D RF++F+ NL+ + L+ + G+ +F+D+T E+R +LG R R
Sbjct: 62 NAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEEYRTGYLGAKRDASRN 121
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ + + + + + LP DWR+ GAVTGVKDQG+CGSCW+FS A+EG + L+TG
Sbjct: 122 MVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAFSTIAAVEGVNQLATGN 181
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L+SLSEQ+LVDCD + + GCNGG M AF++I+K GG++ E+DYPYTG DG
Sbjct: 182 LISLSEQELVDCDRK--------INQGCNGGDMGYAFQFIIKNGGIDSEEDYPYTGKDG 232
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 97/274 (35%), Positives = 152/274 (55%), Gaps = 20/274 (7%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L L ++SAV ++ + V + G +SE +++ + L K +++ + E
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
D RF +FK NLR + + G+T+F+DLT E+R ++LG RR L
Sbjct: 69 KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+A+ ++LP DWR GAV VKDQG CGSCW+FS GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K+ + ++ + + ++ V H P++
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPIS 269
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/247 (38%), Positives = 137/247 (55%), Gaps = 15/247 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F+ +K+ ++ YA+ +E R ++ +NL + G+ +F DL EF
Sbjct: 21 FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80
Query: 112 QFLGLN-RRLRLPADAQKAPILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
++LG+ + + LP LP DWR G VT VK+QG CGSCWSFS TG+
Sbjct: 81 KYLGVRFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG H TG LVSLSEQ LVDC + E GCNGGLM+ AFEYI+K GG++ E
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNE-------GCNGGLMDDAFEYIIKNGGIDTE 193
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHI 288
YPYT T G+CKF+ + I A V+++ +I+ E + + GP++ +I+ HI
Sbjct: 194 ASYPYTATT-GTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVS---VAIDASHI 249
Query: 289 SFSFLFT 295
+F F FT
Sbjct: 250 NFQFYFT 256
>gi|228244|prf||1801240B Cys protease 2
Length = 323
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 132/253 (52%), Gaps = 17/253 (6%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK K+ + Y EE YR +F+ N + K+ + + T + KF
Sbjct: 13 LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
D+T EF G R P P T T+ DWR GAVT VKDQG CGSC
Sbjct: 73 GDMTLEEFNAVMKGNIPRRSAPVSV-FYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSC 131
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG+LEG HFL TG L+SL+EQQLVDC P+ GCNGG MN AF+YI
Sbjct: 132 WAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQ-------GCNGGWMNDAFDYIK 184
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNV 280
G++ E YPY D GSC+FD + +AA S + I+S + V+ GP++
Sbjct: 185 ANNGIDTEASYPYEARD-GSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPIS--- 240
Query: 281 ASIELPHISFSFL 293
+I+ H SF F
Sbjct: 241 VTIDAAHSSFQFY 253
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 97/274 (35%), Positives = 152/274 (55%), Gaps = 20/274 (7%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L L ++SAV ++ + V + G +SE +++ + L K +++ + E
Sbjct: 10 ILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
D RF +FK NLR + + G+T+F+DLT E+R ++LG RR L
Sbjct: 69 KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+A+ ++LP DWR GAV VKDQG CGSCW+FS GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K+ + ++ + + ++ V H P++
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPIS 269
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 136/242 (56%), Gaps = 16/242 (6%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
+SE+ ++ + + +K K Y E + RF +FK NL+ + T G+ +
Sbjct: 37 RSEEEVMGM---YQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNR 93
Query: 101 FSDLTPSEFRRQFLGL----NRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
F+DLT E+R +LG RR +L + + ++P LP DWR+ GAV VKDQ
Sbjct: 94 FADLTNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQ 153
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
+CGSCW+FS A+EG + + TGEL+SLSEQ+LVDCD E D GCNGGLM+
Sbjct: 154 RSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTE--------YDMGCNGGLMDY 205
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
AF++I+K GG++ EKDYPYTG DG KS ++ + + +++ V H P
Sbjct: 206 AFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQP 265
Query: 276 LA 277
++
Sbjct: 266 VS 267
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 91/227 (40%), Positives = 133/227 (58%), Gaps = 15/227 (6%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFL 114
F KF + Y++ E RF+ + NL ++ Q + TA++GVT+FSD++P EF++ L
Sbjct: 173 FIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQKTML 232
Query: 115 GLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
R+ ++ + + L N+LP FDWR G VT VK+QG+CGSCW+FS TG +
Sbjct: 233 PSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPVKNQGSCGSCWAFSVTGNI 292
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + TG+L+SLSEQ+L+DCD D GCNGGL +AF I + GG+E E
Sbjct: 293 EGLWAIKTGKLISLSEQELIDCDR---------IDKGCNGGLPINAFREIQRMGGLEPED 343
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY + G+C +S IA + + I +E M A +V+ GPL+
Sbjct: 344 QYPYKARN-GTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLS 389
>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
mellifera]
Length = 881
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 105/257 (40%), Positives = 148/257 (57%), Gaps = 17/257 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSE 108
E F F KF+KT+++ E RF++FK NL+ Q + TA +GVT F+DLTP E
Sbjct: 573 EMLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVTMFADLTPKE 632
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
F+ ++LG L+ + A I ++ LP FDWRD+ VT VKDQG CGSCW+FS T
Sbjct: 633 FKTRYLGFRPELKQENEIPLAKIEVSDIFLPLKFDWRDYNVVTPVKDQGLCGSCWAFSVT 692
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G +EG + + +L+SLSEQ+L+DCD + D GCNGG M +A++ I K GG+E
Sbjct: 693 GNVEGQYAIKYKKLLSLSEQELLDCD---------TLDEGCNGGYMENAYKAIEKLGGLE 743
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA----GNVASI 283
E DYPY G + C F K V I+S+E +MA L+K+GP++ N
Sbjct: 744 LESDYPYDGRN-EKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQF 802
Query: 284 ELPHISFSFLFTVSSPK 300
+ +S F F + +PK
Sbjct: 803 YIGGVSHPFHF-LCNPK 818
>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
Length = 442
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/236 (40%), Positives = 130/236 (55%), Gaps = 20/236 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 33 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 92
Query: 110 RRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
R + + A A K P + T P DWR GAVT VKDQGACGSCW+FS
Sbjct: 93 RATY---HNGAEYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFS 149
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
A G +EG ++ EL SLSEQ LV CD + D GC GGLM+ + ++I+ +
Sbjct: 150 AIGNIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNK 200
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V + YPY G +KS + A +S + DE+ +A L K+GP+A
Sbjct: 201 GNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVA 256
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/245 (38%), Positives = 136/245 (55%), Gaps = 16/245 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E + ++K +K Y+ + E + R+ ++K N+ R + + F D+T +EF
Sbjct: 24 ESSWYVWKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEF 83
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + GL L ++P++ P DWR G VT VK+QG CGSCW+FS+TG
Sbjct: 84 RAKMNGL---LLHKHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTG 140
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
ALEG HF TG LVSLSEQ LVDC + ++GCNGGLM++AF YI GG++
Sbjct: 141 ALEGQHFKKTGRLVSLSEQNLVDCSTDYG-------NNGCNGGLMDNAFSYIKANGGIDT 193
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPH 287
E YPY G D G+C++ KS I A + F I DED + + GP++ +I+ H
Sbjct: 194 ETGYPYEGQD-GTCRYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVS---VAIDASH 249
Query: 288 ISFSF 292
+SF F
Sbjct: 250 MSFQF 254
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 97/274 (35%), Positives = 152/274 (55%), Gaps = 20/274 (7%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L L ++SAV ++ + V + G +SE +++ + L K +++ + E
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
D RF +FK NLR + + G+T+F+DLT E+R ++LG RR L
Sbjct: 69 KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+A+ ++LP DWR GAV VKDQG CGSCW+FS GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K+ + ++ + + ++ V H P++
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPIS 269
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/252 (41%), Positives = 138/252 (54%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y + E +R ++F N + AK Q V V K++DL E
Sbjct: 59 WHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 118
Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
FR+ G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSC
Sbjct: 119 FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 178
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 179 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDNAFRYIK 231
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY D SC F+K + A F+ I DE +MA + GP++
Sbjct: 232 DNGGIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 287
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 288 VAIDASHESFQF 299
>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 96/236 (40%), Positives = 130/236 (55%), Gaps = 20/236 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
R + + A A K P + T P DWR GAVT VKDQGACGSCW+FS
Sbjct: 98 RATY---HNGAEYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFS 154
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
A G +EG ++ EL SLSEQ LV CD + D GC GGLM+ + ++I+ +
Sbjct: 155 AIGNIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNK 205
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V + YPY G +KS + A +S + DE+ +A L K+GP+A
Sbjct: 206 GNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVA 261
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/252 (41%), Positives = 138/252 (54%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y + E +R ++F N + AK Q V V K++DL E
Sbjct: 63 WHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 122
Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
FR+ G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSC
Sbjct: 123 FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 182
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 183 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 235
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY D SC F+K + A F+ I DE +MA + GP++
Sbjct: 236 DNGGIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 291
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 292 VAIDASHESFQF 303
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 105/252 (41%), Positives = 137/252 (54%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y E +R ++F N + AK Q V V K++DL E
Sbjct: 29 WHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 88
Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
FR+ G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSC
Sbjct: 89 FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 148
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY D SC F+K I A F+ I DE +MA + GP++
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 257
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 258 VAIDASHESFQF 269
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/252 (41%), Positives = 138/252 (54%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y + E +R ++F N + AK Q V V K++DL E
Sbjct: 29 WHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 88
Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
FR+ G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSC
Sbjct: 89 FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 148
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY D SC F+K + A F+ I DE +MA + GP++
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 257
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 258 VAIDASHESFQF 269
>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/233 (39%), Positives = 129/233 (55%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R +L G +K + T P DWR GAVT VKDQG+CGSCW+F+ATG
Sbjct: 98 RATYLNGAKYYAAALERPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAATG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + + C GG + AF++I+ + G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TTEDNCRGGFADRAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY TDG C + A +S + DE+ +A L ++GP+A
Sbjct: 209 FTEESYPYASTDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVA 261
>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/233 (40%), Positives = 128/233 (54%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQGACGSCW+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGACGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC GGLM+ + ++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ YPY G +KS + A +S + DE+ +A L K+GP+A
Sbjct: 209 FTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVA 261
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 96/245 (39%), Positives = 143/245 (58%), Gaps = 19/245 (7%)
Query: 38 DGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVH 96
+G+++E L N+ F F KF + Y++ E RF+ + NL ++ Q + TA++
Sbjct: 124 EGKKTE-MLWNS---FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIY 179
Query: 97 GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGV 152
GVT+FSD++P EF++ L R+ ++ + + L N+LP FDWR G VT V
Sbjct: 180 GVTQFSDMSPEEFQKTMLPSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPV 239
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
K+QG+CGSCW+FS TG +EG + TG+L+SLSEQ+L+DCD D GCNGGL
Sbjct: 240 KNQGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDR---------IDKGCNGGL 290
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
+AF I + GG+E E YPY + G+C +S IA + + I +E M A +V+
Sbjct: 291 PINAFREIQRMGGLEPEDQYPYKARN-GTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQ 349
Query: 273 HGPLA 277
GPL+
Sbjct: 350 RGPLS 354
>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 323
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 132/253 (52%), Gaps = 17/253 (6%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK K+ + Y EE YR +F+ N + K+ + + T + KF
Sbjct: 13 LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
D+T EF G R P P T T+ DWR GAVT VKDQG CGSC
Sbjct: 73 GDMTLEEFNAVMKGNIPRRSAPVSV-FYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSC 131
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG+LEG HFL TG L+SL+EQQLVDC P+ GCNGG MN AF+YI
Sbjct: 132 WAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQ-------GCNGGWMNDAFDYIK 184
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNV 280
G++ E YPY D GSC+FD + +AA S + I+S + V+ GP++
Sbjct: 185 ANNGIDTEAAYPYEARD-GSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPIS--- 240
Query: 281 ASIELPHISFSFL 293
+I+ H SF F
Sbjct: 241 VTIDAAHSSFQFY 253
>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
Length = 322
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 133/229 (58%), Gaps = 17/229 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV----HGVTKFSDLTPSE 108
F FK + K+Y Q E RF +F+AN+ ++ L + + +F+DLT E
Sbjct: 26 FETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQFTDLTQEE 85
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
F+ +LGL+ + L Q L ++PT DWR G VTGVK+QG+CGSCWSF+ TG
Sbjct: 86 FKA-YLGLHVKPVLNNTIQYE--LKGLEVPTSVDWRSAGQVTGVKNQGSCGSCWSFALTG 142
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+ EGA++ +LVSLSEQQLVDC S S + GCNGG +++ F YI + G++
Sbjct: 143 STEGAYYRKHKQLVSLSEQQLVDC--------STSINYGCNGGFLDATFPYIEQY-GLQT 193
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPYTG D GSCK+D SK+ +SN+ + E ++ + GP+A
Sbjct: 194 ESSYPYTGVD-GSCKYDSSKVVTKISNYVSLHGSESKVLEPVGSIGPVA 241
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/224 (40%), Positives = 134/224 (59%), Gaps = 15/224 (6%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K Y E D RF++FK NLR ++ + T G+ +F+DLT E+R ++LG
Sbjct: 46 KHGKLYNALGEKDKRFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYRARYLGTKI 105
Query: 117 --NRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
NRRL R P++ + T LP DWR GAV VKDQ +CGSCW+FSA GA+EG
Sbjct: 106 DPNRRLGRTPSNRYAPRVGET--LPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGI 163
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TG+L+SLSEQ+LVDCD + GCNGGLM+ AFE+I+K GG++ E+DYP
Sbjct: 164 NKIVTGDLISLSEQELVDCDT--------GYNMGCNGGLMDYAFEFIIKNGGIDSEEDYP 215
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y G DG ++ K+ ++ + +++ ++ V + P++
Sbjct: 216 YKGVDGRCDEYRKNAKVVSIDGYEDVNTYDELALKKAVANQPVS 259
>gi|14041143|emb|CAA71554.1| cathepsin [Geodia cydonium]
Length = 322
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 97/244 (39%), Positives = 133/244 (54%), Gaps = 18/244 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ +K K++K Y++QEE R RV+ +NL+ + + +F+DL P EF
Sbjct: 19 WEQWKLKYNKQYSSQEEDYLRQRVWLSNLKFVEEFDSEREGYTVAMNEFADLDPREFVSH 78
Query: 113 FLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+ GL RR P + P D LPT DWR G VTGVK+QG CGSCW+FSATG+
Sbjct: 79 YNGLRRR---PHTSSGEPCTLGEDVSALPTTVDWRTKGYVTGVKNQGQCGSCWAFSATGS 135
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LEG HF +TG+LVSLSEQ LVDC S + GCNGGL + AF+Y++K GG++ E
Sbjct: 136 LEGQHFNATGKLVSLSEQNLVDC-------SSAEGNEGCNGGLPDDAFKYVIKNGGIDTE 188
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHI 288
YPY D C + + I + S++ I S E Q+ GP+ I+ H+
Sbjct: 189 ASYPYVARD-EKCHYSSANIGSTCSSYVDIESKSEAQLQVASATVGPIP---VGIDASHL 244
Query: 289 SFSF 292
F
Sbjct: 245 GFQL 248
>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 134/256 (52%), Gaps = 17/256 (6%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-- 99
++ H + + + +KS + K YA EE D+R V++ N++ +R HG T
Sbjct: 18 AQKHDESLDEQWYQWKSLYKKPYAANEE-DWRRAVWEKNMKMIERHNQEYSQGKHGFTMT 76
Query: 100 --KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
F D+T EFR+ G + R+ P+ +P DW G VT VKDQG
Sbjct: 77 MNAFGDMTNEEFRQVMNGFQNQKRIQGKLLYEPVF--GHIPKSVDWTQKGYVTPVKDQGQ 134
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF
Sbjct: 135 CGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRR-------EGNEGCNGGLMDNAF 187
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+YI GG++ E+ YPYT D C+++ AA + F I E + + GP++
Sbjct: 188 QYIKDNGGLDSEESYPYTAMDKQDCRYNPKYSAANDTGFVDIPPQEKALMKAVATVGPIS 247
Query: 278 GNVASIELPHISFSFL 293
+++ H SF F
Sbjct: 248 ---VAVDAGHESFQFY 260
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 105/255 (41%), Positives = 146/255 (57%), Gaps = 31/255 (12%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLT 105
+ H+ LFK + +KTY Q++ R +F+AN+++ LL + G+ F+D+T
Sbjct: 23 DEHWELFKRQHNKTY-LQKQDVGRRAIFEANIKKINAHNLLYDLGRSSYRLGLNGFADMT 81
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND-----LPTDFDWRDHGAVTGVKDQGACGS 160
P EF + R R A+ + L D +P DWR G VT VK+QG CGS
Sbjct: 82 PDEFEKY-----RGTRFEANEARVSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGS 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TGALEG HF +G+LVSLSEQ LVDC + ++GCNGGLM++AF +I
Sbjct: 137 CWAFSTTGALEGQHFRRSGDLVSLSEQMLVDC-------SAVYGNAGCNGGLMDNAFRFI 189
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQM--AANLVKHGPLA 277
AGG+E EK YPYTG D G+C FD I A ++ F V S DE+ + AA +V GP++
Sbjct: 190 KDAGGLETEKSYPYTGKD-GTCHFDARGIGAKLTGFVDVPSRDEEALKEAAGVV--GPVS 246
Query: 278 GNVASIELPHISFSF 292
+I+ +F F
Sbjct: 247 ---VAIDASGQNFQF 258
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 151/285 (52%), Gaps = 16/285 (5%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAE--HHFSLFKS 58
M+ LS + L++ +++S D I + ++S N E + +
Sbjct: 1 MDSNTLSPAMKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLV 60
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K+Y E D RF +FK NL+ L+ T G+T+F+DLT E+R +FLG
Sbjct: 61 KHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKI 120
Query: 117 --NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
NRR++ ++ P + LP DWR GAV GVKDQ +CGSCW+FSA A+EG
Sbjct: 121 DPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEG 180
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+ + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E DY
Sbjct: 181 INKIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIISNGGIDSEDDY 232
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
PY DG + K+ + ++ + + ++ V + P+A
Sbjct: 233 PYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIA 277
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 151/285 (52%), Gaps = 16/285 (5%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAE--HHFSLFKS 58
M+ LS + L++ +++S D I + ++S N E + +
Sbjct: 1 MDSNTLSPAMKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLV 60
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K+Y E D RF +FK NL+ L+ T G+T+F+DLT E+R +FLG
Sbjct: 61 KHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKI 120
Query: 117 --NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
NRR++ ++ P + LP DWR GAV GVKDQ +CGSCW+FSA A+EG
Sbjct: 121 DPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEG 180
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+ + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E DY
Sbjct: 181 INKIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIISNGGIDSEDDY 232
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
PY DG + K+ + ++ + + ++ V + P+A
Sbjct: 233 PYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIA 277
>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1454
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 95/245 (38%), Positives = 141/245 (57%), Gaps = 24/245 (9%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVT 99
+SEDH + H F FK++ ++TY + EH+ RFR+FK NL + ++ + TA +G+T
Sbjct: 1137 KSEDH---SRHLFDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGIT 1193
Query: 100 KFSDLTPSEFR-RQFLGLNRR------LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
F+D+T +E+R R L + R +R P A I +LP FDWR+ GAV+ V
Sbjct: 1194 HFADMTSAEYRARTGLVVPREGDEVNHIRNPM----AEIDEHMELPDAFDWRELGAVSEV 1249
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
K+QG CGSCW+FS G +EG H + T +L SEQ+L+DCD + DS CNGG
Sbjct: 1250 KNQGNCGSCWAFSVVGNIEGLHQVKTKKLEEYSEQELLDCD---------TVDSACNGGF 1300
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
M+ A++ I K GG+E E +YPY +C F+K+ V + +E +A LV
Sbjct: 1301 MDDAYKAIEKIGGLELESEYPYLAKKQKTCHFNKTMAHVRVKGAVDLPKNETAIAQFLVA 1360
Query: 273 HGPLA 277
+GP++
Sbjct: 1361 NGPVS 1365
>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 108/280 (38%), Positives = 152/280 (54%), Gaps = 22/280 (7%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSL 55
+ +LSS++L++L + A+A D+ IR V SDG E++ +L H F+
Sbjct: 4 KTVLSSVVLVILIAASAAADIGFDELNPIRMV--SDGLREVEETVSQILGQSRHVLTFAR 61
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
F ++ K Y EE RF +FK NL + + GV +F+DLT EF+R LG
Sbjct: 62 FTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLG 121
Query: 116 LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+ A + + L LP DWR+ G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 122 AAQNC--SATLKGSHKLTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYH 179
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPY
Sbjct: 180 QAFGKGISLSEQQLVDCAGAYN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAYPYI 232
Query: 236 GTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
G D G+CKF + V N ++ + DE + A LV+
Sbjct: 233 GKD-GTCKFSAENVGVQVLDSVNITLGAEDELKHAVGLVR 271
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 125/232 (53%), Gaps = 16/232 (6%)
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
EEH RF +FK N++ D G+ KF+DL+ EF+ ++G LR + Q
Sbjct: 62 EEHAERFEIFKENVKYIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMDLRGDREVQ 121
Query: 128 KAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ N LP DWR GAV VK+QG CGSCW+FS ++EG ++++TG LVSLS
Sbjct: 122 SGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLS 181
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT--GTDGGSCK 243
EQQLVDC E +SGCNGGLM++AF+YI+ GG+ E +YPYT T+ S K
Sbjct: 182 EQQLVDCSTE---------NSGCNGGLMDTAFQYIINNGGIVTEDNYPYTAEATECSSTK 232
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
+ + F + ++ +Q V H P++ +IE F F T
Sbjct: 233 INSQTTRVVIDGFEDVPANNEQALKEAVAHQPVS---VAIEASGQDFQFYST 281
>gi|402584107|gb|EJW78049.1| hypothetical protein WUBG_11042, partial [Wuchereria bancrofti]
Length = 213
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 90/222 (40%), Positives = 133/222 (59%), Gaps = 22/222 (9%)
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNR 118
+++ Y +++E RFR++K NLR AK Q + TA++G T +SD+T EFR+ L
Sbjct: 1 YNRKYRSKKEFLKRFRIYKRNLRLAKLIQNKEEGTAIYGETPYSDMTQEEFRKIMLPY-- 58
Query: 119 RLRLPADAQKAPIL-------PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ P + K ++ +++P FDWRD G VT VK+QG+CGSCW+FS TG +E
Sbjct: 59 --KWPLNENKKQMIDLAEYGITDDEIPESFDWRDKGVVTEVKNQGSCGSCWAFSVTGNIE 116
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
GA + G+L+SLSEQ+LVDCD D GC GGL +A++ I++ GG+E EKD
Sbjct: 117 GAWAIKKGKLISLSEQELVDCD---------VIDQGCKGGLPLNAYKEIIRMGGLESEKD 167
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
YPY G G C + IA +++ + +DE ++AA L K
Sbjct: 168 YPYDGY-GEKCHLVRRDIAVYINDSVQLPADEFKIAAWLTKK 208
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 99/283 (34%), Positives = 157/283 (55%), Gaps = 28/283 (9%)
Query: 5 ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD----GEQSEDHLLNAEHHFSLFKSKF 60
I S L ++ S LAS ++ D +P+D E++E H++ H+ + K
Sbjct: 10 IAISFLFMVFSLSLASMSIIDYD-------LPADPLQSTERTEAHMMKMYEHWLV---KH 59
Query: 61 SKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LN 117
K Y E + RF +FK NLR ++ + T G+TKF+DLT E+R +LG +
Sbjct: 60 GKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKME 119
Query: 118 RRLRLPADAQKAPILPT---NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
++ +L + + + +DLP+ DWR+ GAVT VKDQG CGSCW+FS G++EG +
Sbjct: 120 KKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGIN 179
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ TG+L+SLSEQ+LVDCD + + GCNGGLM+ AFE+I+K GG++ E DYPY
Sbjct: 180 QIVTGDLISLSEQELVDCDK--------AYNQGCNGGLMDYAFEFIIKNGGIDSEADYPY 231
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+D K+ + + + ++++ V + P++
Sbjct: 232 RASDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVS 274
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 102/248 (41%), Positives = 137/248 (55%), Gaps = 20/248 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+KS SK Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 HWELWKSWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGTHSYRLGMNHFGDMTHE 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G R+ A+ + L N L P DWRD+G VT VKDQG CGSCW+FS
Sbjct: 86 EFRQLMNGYKRKAE--TKARGSLFLEPNFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFS 143
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ G
Sbjct: 144 TTGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQG 196
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
++ E YPY GTD C +D + + + F I S +++ V GP++ +I+
Sbjct: 197 LDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVS---VAID 253
Query: 285 LPHISFSF 292
H SF F
Sbjct: 254 AGHESFQF 261
>gi|256077197|ref|XP_002574894.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230780|emb|CCD77197.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 419
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 140/242 (57%), Gaps = 17/242 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
N + + FK K+ K Y E+ + RF +FK+N+ +A+ Q+ + +A++GVT +SDLT
Sbjct: 115 NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTT 173
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L +P+ P N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 174 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 231
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD D GCNGGL ++A+E I+K
Sbjct: 232 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 282
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG+ E +YPY + C +A +++ ++ DE ++AA L + ++ + ++
Sbjct: 283 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 341
Query: 284 EL 285
L
Sbjct: 342 LL 343
>gi|407838603|gb|EKG00105.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
C1, cathepsin L-like, putative, partial [Trypanosoma
cruzi]
Length = 326
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 97/236 (41%), Positives = 123/236 (52%), Gaps = 26/236 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ FK K + Y + E +R VF+ANL A+ +P A GVT FSDLT EFR +
Sbjct: 71 FAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRSR 130
Query: 113 -------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
F R R+P D + P DWR GAVT VKDQG CGSCW+FS
Sbjct: 131 YHNGAAHFAAAQERARVPVDVEVV------GAPAAKDWRARGAVTAVKDQGQCGSCWAFS 184
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
A G +E FL+ L +LSEQ LV CD DSGC GGLMN+AFE+I++
Sbjct: 185 AIGNVECQWFLAGHPLTNLSEQMLVSCD---------KTDSGCGGGLMNNAFEWIVQENN 235
Query: 224 GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 236 GAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 291
>gi|256077193|ref|XP_002574892.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230781|emb|CCD77198.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 457
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 140/242 (57%), Gaps = 17/242 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
N + + FK K+ K Y E+ + RF +FK+N+ +A+ Q+ + +A++GVT +SDLT
Sbjct: 153 NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTT 211
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L +P+ P N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 212 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 269
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD D GCNGGL ++A+E I+K
Sbjct: 270 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 320
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG+ E +YPY + C +A +++ ++ DE ++AA L + ++ + ++
Sbjct: 321 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 379
Query: 284 EL 285
L
Sbjct: 380 LL 381
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 112/299 (37%), Positives = 153/299 (51%), Gaps = 42/299 (14%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+ + L+L L +++A A AV+ + + Q E H EH K Y
Sbjct: 1 MRTALILPLLALVAVAQAVSYAEVI----------QEEWHTFKLEHR---------KNYQ 41
Query: 66 TQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLN---- 117
+ E +R ++F N + AK QL AV V K++D+ EF G N
Sbjct: 42 DETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLH 101
Query: 118 RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
++LR ++ K + + LP DWR GAVT VKDQG CGSCW+FS+TGALEG H
Sbjct: 102 KQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQH 161
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ +G LVSLSEQ LVDC + ++GCNGGLM++AF YI GG++ EK YPY
Sbjct: 162 YRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 214
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
D SC F+K I A F I +E +MA + GP+A +I+ H SF F
Sbjct: 215 EAID-DSCHFNKGSIGATDRGFVDIPQGNEKKMAEAVATIGPVA---VAIDASHESFQF 269
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 95/242 (39%), Positives = 137/242 (56%), Gaps = 18/242 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ + + K+YA EE YR+ V++ N + + + + KF DLT +EF +
Sbjct: 30 FADWMQEHQKSYA-NEEFVYRWNVWRENYLYIEAHNHQNKSFHLAMNKFGDLTNAEFNKL 88
Query: 113 FLGLNRRLRLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
F GL+ + AD Q++ I P LP DFDWR GAVT VK+QG CGSCWSFS TG+
Sbjct: 89 FKGLS----ITADQAKQESDIAPAPGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTTGST 144
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EGA+FL G L SLSEQ LVDC + + GCNGGLM+ AFEYI++ G++ E+
Sbjct: 145 EGANFLKHGRLTSLSEQNLVDC-------STSYGNHGCNGGLMDYAFEYIIRNKGIDTEE 197
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
YPY + G+C+++K + +++ + S + N V P + +I+ H SF
Sbjct: 198 SYPYHASQ-GTCRYNKQHSGGELVSYTNVPSGNEGALLNAVATQPTS---VAIDASHSSF 253
Query: 291 SF 292
F
Sbjct: 254 QF 255
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 100/243 (41%), Positives = 134/243 (55%), Gaps = 23/243 (9%)
Query: 62 KTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLN 117
K Y + E +R ++F N + AK QL V V K++D+ EFR+ G N
Sbjct: 114 KNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMNGFN 173
Query: 118 ----RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+ LR ++ K + + LP DWRD GAVTGVKDQG CGSCW+FS+TGAL
Sbjct: 174 YTLHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGAL 233
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG H+ +G LVSLSEQ LVDC + ++GCNGGLM++AF YI GG++ EK
Sbjct: 234 EGQHYRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNGGIDTEK 286
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHIS 289
YPY D SC F+K I A F I +E ++A + GP++ +I+ H S
Sbjct: 287 SYPYEALD-DSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVS---VAIDASHES 342
Query: 290 FSF 292
F F
Sbjct: 343 FQF 345
>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
Length = 364
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 94/240 (39%), Positives = 132/240 (55%), Gaps = 23/240 (9%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLT 105
A +HF+ F + K Y + E RF +FK NL + Q D TA++G+ +F+DL+
Sbjct: 58 FGAWNHFTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLS 117
Query: 106 PSEFRRQFLG--------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
P EF++ L NR + L A+ + P LP FDWR+HGAVT VK +G
Sbjct: 118 PEEFKKTHLPHTWKQPDHPNRIVDLAAEG----VDPKEPLPESFDWREHGAVTKVKTEGH 173
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
C +CW+FS TG +EG FL+ +LVSLS QQL+DCD D GCNGG A+
Sbjct: 174 CAACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDCD---------VVDEGCNGGFPLDAY 224
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ I++ GG+E E YPY C+ S IA ++ + DE++M A LVK GP++
Sbjct: 225 KEIVRMGGLEPEDKYPYE-AKAEQCRLVPSDIAVYINGSVELPHDEEKMRAWLVKKGPIS 283
>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 456
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 139/242 (57%), Gaps = 18/242 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
N + + FK K+ K Y E + RF +FK+N+ +A+ Q+ + +A++GVT +SDLT
Sbjct: 153 NVDEKYVQFKLKYRKQY--HETDEIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTT 210
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L +P+ P N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 211 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 268
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD D GCNGGL ++A+E I+K
Sbjct: 269 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 319
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG+ E +YPY + C +A +++ ++ DE ++AA L + ++ + ++
Sbjct: 320 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 378
Query: 284 EL 285
L
Sbjct: 379 LL 380
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 112/299 (37%), Positives = 153/299 (51%), Gaps = 42/299 (14%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+ + L+L L +++A A AV+ + + Q E H EH K Y
Sbjct: 1 MRTALILPLLALVAVAQAVSYAEVI----------QEEWHTFKLEHR---------KNYQ 41
Query: 66 TQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLN---- 117
+ E +R ++F N + AK QL AV V K++D+ EF G N
Sbjct: 42 DETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLH 101
Query: 118 RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
++LR ++ K + + LP DWR GAVT VKDQG CGSCW+FS+TGALEG H
Sbjct: 102 KQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQH 161
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ +G LVSLSEQ LVDC + ++GCNGGLM++AF YI GG++ EK YPY
Sbjct: 162 YRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 214
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
D SC F+K I A F I +E +MA + GP+A +I+ H SF F
Sbjct: 215 EAID-DSCHFNKGTIGATDRGFVDIPQGNEKKMAEAVATIGPVA---VAIDASHESFQF 269
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 97/257 (37%), Positives = 143/257 (55%), Gaps = 22/257 (8%)
Query: 44 DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
+HL N + LF+S + SK Y + EE +RF VF+ NL +R + G+ +
Sbjct: 39 EHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNE 98
Query: 101 FSDLTPSEFRRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
F+DLT EF+ ++LGL + R R P+ + + DLP DWR GAV VKDQG
Sbjct: 99 FADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQG 156
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FS A+EG + ++TG L SLSEQ+L+DCD + +SGCNGGLM+ A
Sbjct: 157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCD--------TTFNSGCNGGLMDYA 208
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGP 275
F+YI+ GG+ +E DYPY + G C+ K + +S + + ++D+ + H P
Sbjct: 209 FQYIISTGGLHKEDDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQP 267
Query: 276 LAGNVASIELPHISFSF 292
++ +IE F F
Sbjct: 268 VS---VAIEASGRDFQF 281
>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
Length = 567
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 97/231 (41%), Positives = 130/231 (56%), Gaps = 22/231 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + ++K+YA E R +F NL A + Q LD +A +GVTKFSDLT EFR
Sbjct: 270 FKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQELDQGSAQYGVTKFSDLTEEEFRM 329
Query: 112 QFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L L R PA + P P +DWRDHGA+T K+QG CGSCW+FS
Sbjct: 330 FYLNPLLSSLPGRALRPAPRARGPA------PASWDWRDHGALTAAKNQGMCGSCWAFSV 383
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG FL G L++LSEQ+LVDCD + D C GGL ++A+ I GG+
Sbjct: 384 TGNVEGQWFLRRGALLTLSEQELVDCD---------TLDQACGGGLPSNAYTAIETLGGL 434
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E EKDY Y G C F K A +++ +S DE ++AA L ++GP++
Sbjct: 435 ETEKDYSYEGRK-ERCSFSPDKARAYINSSVDLSRDEQEIAAWLAENGPVS 484
>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 363
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 95/249 (38%), Positives = 135/249 (54%), Gaps = 14/249 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT V+D+ C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVTVSTGKAPDAVDWRKKGAVTPVRDERLCDSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ L+ CD D GC GGLM+ AF++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLLSCDTRED---------GCGGGLMDRAFQWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
E+ YPY TDG + +KS + A +S++ + DE+ +A L K+GP+A V +
Sbjct: 209 FTEQSYPYASTDGDVPRCNKSGKVVGAKISDYVDLPQDENAIAEWLAKNGPVAIAVEATS 268
Query: 285 LPHISFSFL 293
L + L
Sbjct: 269 LQRYTGGVL 277
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 94/282 (33%), Positives = 146/282 (51%), Gaps = 30/282 (10%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M +++ +LLLL + A+A+++ + SE+ +++ + + K
Sbjct: 1 MPSMLIPTLLLLSFTFSHATAMSIIN--------------YSENEVMDMYEEWLV---KH 43
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--- 117
K Y +E + RF+VFK NL + + T G+ KF+D+T E+R +LG
Sbjct: 44 RKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNEEYRAMYLGTRTDA 103
Query: 118 --RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
R ++ + + LP DWR GAV +KDQG CGSCW+FS A+EG +
Sbjct: 104 KRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINN 163
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ TGE VSLSEQ+LVDCD E D GCNGGLM+ AF++I++ GG++ E+DYPY
Sbjct: 164 IVTGEFVSLSEQELVDCDRE--------YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQ 215
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G DG + K + + + S+ + V H P++
Sbjct: 216 GIDGTCDQTKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVS 257
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 94/270 (34%), Positives = 151/270 (55%), Gaps = 10/270 (3%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
++L L +ASAV ++ + V + G +S+ +++ + L K ++ +
Sbjct: 2 VILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAW-LVKHGKAQNQNSLV 60
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA-DAQ 127
E D RF +FK NLR + + G+T+F+DLT E+R ++LG + +Q
Sbjct: 61 EKDRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSQ 120
Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ ++LP DWR GAV VKDQG+CGSCW+FS GA+EG + + TG+L++LSEQ
Sbjct: 121 RYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 180
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
+LVDCD S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG + K+
Sbjct: 181 ELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKN 232
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ ++ + + ++ V H P++
Sbjct: 233 AKVVTIDSYEDVPTYSEESLKKAVAHQPVS 262
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 104/243 (42%), Positives = 134/243 (55%), Gaps = 21/243 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FK+ F K Y T EE RF +F+ L R ++ + + GV +FSD++ E+ R
Sbjct: 57 FKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEYLR 116
Query: 112 QFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
GL R R + + + L DWRD G VT VK+QG CGSCWSFS TG+
Sbjct: 117 HN-GLRRGNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGSCWSFSTTGS 175
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVER 228
LEG HF TG+L+SLSEQQLVDC SG+ + GCNGGLM++AFEYI GG+E
Sbjct: 176 LEGQHFRQTGKLISLSEQQLVDC--------SGTFGNEGCNGGLMDNAFEYIKSIGGLEG 227
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPH 287
E DYPYT G C KS A + + V S DED + L GP++ +I+ H
Sbjct: 228 EDDYPYTAKQ-GKCHLKKSLFKANDTGCTDVESGDEDALKDALASVGPIS---VAIDASH 283
Query: 288 ISF 290
SF
Sbjct: 284 ASF 286
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 94/282 (33%), Positives = 146/282 (51%), Gaps = 30/282 (10%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M +++ +LLLL + A+A+++ + SE+ +++ + + K
Sbjct: 1 MPSMLIPTLLLLSFTFSHATAMSIIN--------------YSENEVMDMYEEWLV---KH 43
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--- 117
K Y +E + RF+VFK NL + + T G+ KF+D+T E+R +LG
Sbjct: 44 RKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNKEYRAMYLGTRTDA 103
Query: 118 --RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
R ++ + + LP DWR GAV +KDQG CGSCW+FS A+EG +
Sbjct: 104 KRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINN 163
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ TGE VSLSEQ+LVDCD E D GCNGGLM+ AF++I++ GG++ E+DYPY
Sbjct: 164 IVTGEFVSLSEQELVDCDRE--------YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQ 215
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G DG + K + + + S+ + V H P++
Sbjct: 216 GIDGTCDETKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVS 257
>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 337
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 100/280 (35%), Positives = 149/280 (53%), Gaps = 47/280 (16%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+L LL S +A + +DD+ M F+++ K+ KTY+T
Sbjct: 9 ALFFLLASFTVALPFSPSDDEVMAES-------------------FNMWMKKYEKTYSTM 49
Query: 68 EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL-------GLNRR 119
EE++ R RV+ +N ++ + P + + +FSDLT +EF++ +L N
Sbjct: 50 EEYNERLRVYTSNYYYIEQLNKEHGPHTEYELNQFSDLTFAEFKKIYLTEPQHCSATNGN 109
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ P +A+ P DWR+ +T VKDQG CGSCW+FS TG LE H + TG
Sbjct: 110 FQKPVNARD---------PVAVDWREKNVITPVKDQGKCGSCWTFSTTGCLEAHHAIKTG 160
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
+L+SLSEQQLVDC +G+ ++ GCNGGL + AFEYI GG+E E +Y YT D
Sbjct: 161 QLISLSEQQLVDC--------AGAFNNHGCNGGLPSQAFEYIKYNGGIESESNYNYTAKD 212
Query: 239 GGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLA 277
G C+F+ S +AA VS+ I+ D E + + GP++
Sbjct: 213 -GVCRFNSSLVAATVSDVVNITKDAEGDIGTAVANVGPVS 251
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 97/245 (39%), Positives = 136/245 (55%), Gaps = 22/245 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K++ K Y + EE R +++ NL R + L T G+ +F+DL EF
Sbjct: 31 WKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFADLQNKEFVA 90
Query: 112 QFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
G R A+ + LP N+ LP DWR G VT VKDQG CGSCW+FSATG
Sbjct: 91 MMTGF-RVNGTSKAAKGSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATG 149
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+LEG HF TG+LVSLSEQ LVDC + + GCNGGLM+ AF+YI+ AGG++
Sbjct: 150 SLEGQHFKKTGKLVSLSEQNLVDCSDK---------NYGCNGGLMDRAFQYIIDAGGIDT 200
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELPH 287
E+ YPY D G+C F + + A V+ ++ ++S ++ V H GP++ +I+ H
Sbjct: 201 EESYPYIAMD-GNCHFKTANVGATVTGYTDVTSGSEKALQKAVAHIGPIS---VAIDASH 256
Query: 288 ISFSF 292
SF
Sbjct: 257 FSFQL 261
>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
gi|1094710|prf||2106314A cathepsin L
Length = 319
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 91/234 (38%), Positives = 136/234 (58%), Gaps = 17/234 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTP 106
N + + FK K+ K Y E+ + RF +FK+N+ +A+ Q+ + +A++GVT +SDLT
Sbjct: 15 NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTT 73
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L +P+ P N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 74 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 131
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD D GCNGGL ++A+E I+K
Sbjct: 132 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 182
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+ E +YPY + C +A +++ ++ DE ++AA L + ++
Sbjct: 183 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTIS 235
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 114/293 (38%), Positives = 156/293 (53%), Gaps = 22/293 (7%)
Query: 1 MERLILSSLLLLLLSSVL-ASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH--- 52
M R+ +S LL+L++ V ASA + D I+QVV SDG E S ++ H
Sbjct: 1 MARVSPASFLLILIACVAGASAGSSFADQNPIKQVV-SDGLRELEASVLQVIGQTRHSLA 59
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ F ++ K+Y T EE RF +F +L+ + + GV +F+DLT EFR+
Sbjct: 60 FARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNKKGLSYTLGVNEFADLTWEEFRKH 119
Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
LG + A + L LP DWR+ G VT VK+QG CGSCW+FS TGALE
Sbjct: 120 RLGAAQNC--SATLKGNHKLTNGLLPLKKDWREVGIVTPVKNQGHCGSCWTFSTTGALEA 177
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
A+ + G+ + LSEQQLVDC + + GCNGGL + AFEYI GG++ E+ Y
Sbjct: 178 AYVQAFGKAIFLSEQQLVDCARAYN-------NFGCNGGLPSQAFEYIKANGGLDTEEAY 230
Query: 233 PYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAGNVAS 282
PYTG D G CKF I V N ++ + DE + A V+ +A V S
Sbjct: 231 PYTGVD-GVCKFSSENIGVQVLDSVNITLGAEDELKDAVAFVRPVSVAFEVVS 282
>gi|343477207|emb|CCD11901.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 93/233 (39%), Positives = 124/233 (53%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC GG + AF++IL + G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTD---------DFGCRGGFSDPAFKWILWSNKGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G +CK + A +SN + DED + L + GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCKMSGKVVGAKISNRLYLPEDEDMITEWLARKGPVA 261
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 101/243 (41%), Positives = 135/243 (55%), Gaps = 17/243 (6%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVH---GVTKFSDLTPSEFRR 111
+K K+ K+Y + E R RV+++NL+ ++ +L D + G+ ++DL EF
Sbjct: 22 WKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEFMA 81
Query: 112 -QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+ G + + + Q L LP+ DWR+ G VT VKDQG CGSCW+FSATG+L
Sbjct: 82 LKGSGGLLQAKDKSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWTFSATGSL 141
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG HF TG L+SLSEQQLVDC + GCNGGLM SA++YI GGVE E
Sbjct: 142 EGQHFAKTGNLLSLSEQQLVDCAGRYG-------NYGCNGGLMESAYDYIKGVGGVELES 194
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELPHIS 289
YPYT D G CKFD+SK+ A + VI DE + + GP+A SI+ S
Sbjct: 195 AYPYTARD-GRCKFDRSKVVATCKGYVVIPVGDEQALMQAVGTIGPVA---VSIDASGYS 250
Query: 290 FSF 292
F
Sbjct: 251 FQL 253
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/248 (34%), Positives = 142/248 (57%), Gaps = 20/248 (8%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-- 96
GE+S+D + + +K++ +++Y +E + R +F+ NLR + +
Sbjct: 36 GERSDDEV---HRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSF 92
Query: 97 --GVTKFSDLTPSEFRRQFLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
G+T+F+DLT E+R +LG+ RR + + ++DLP DWRD GAV
Sbjct: 93 RLGLTRFADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAV 152
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
VKDQG+CGSCW+FS A+EG + + TG+L+SLSEQ+LVDCD + GCN
Sbjct: 153 VDVKDQGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDT--------YYNQGCN 204
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AFE+I+ GG++ ++DYPYTG DG ++ K+ + ++ + ++++
Sbjct: 205 GGLMDYAFEFIISNGGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQK 264
Query: 270 LVKHGPLA 277
V + P++
Sbjct: 265 AVANQPVS 272
>gi|1136312|gb|AAB41118.1| cruzipain [Trypanosoma cruzi]
Length = 383
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 96/237 (40%), Positives = 123/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ANL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTAFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/257 (37%), Positives = 143/257 (55%), Gaps = 22/257 (8%)
Query: 44 DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
+HL N + LF+S + SK Y + EE +RF VF+ NL +R + G+ +
Sbjct: 39 EHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNE 98
Query: 101 FSDLTPSEFRRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
F+DLT EF+ ++LGL + R R P+ + + DLP DWR GAV VKDQG
Sbjct: 99 FADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQG 156
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FS A+EG + ++TG L SLSEQ+L+DCD + +SGCNGGLM+ A
Sbjct: 157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDT--------TFNSGCNGGLMDYA 208
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGP 275
F+YI+ GG+ +E DYPY + G C+ K + +S + + ++D+ + H P
Sbjct: 209 FQYIISTGGLHKEDDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQP 267
Query: 276 LAGNVASIELPHISFSF 292
++ +IE F F
Sbjct: 268 VS---VAIEASGRDFQF 281
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 104/254 (40%), Positives = 133/254 (52%), Gaps = 24/254 (9%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK KF + Y EE YR VF NL+ K+ + + T + +F
Sbjct: 13 LAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQF 72
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP--TDFDWRDHGAVTGVKDQGACG 159
SDLT EF G LR P A T+ P T+ DWR G VT VKDQG CG
Sbjct: 73 SDLTNDEFNSMMKGYKTSLR-PKPV--AVFTSTDAAPETTEVDWRTKGCVTHVKDQGQCG 129
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC--DSGCNGGLMNSAF 217
SCW+FSATG+LEG HFL GELVSL+EQQLVDC +G + GCNGG +N AF
Sbjct: 130 SCWAFSATGSLEGQHFLKYGELVSLAEQQLVDC--------AGGIYYNQGCNGGWVNQAF 181
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
+YI GG++ E YPY D +C+F+ + +AA S F S+ E GP+
Sbjct: 182 KYIKANGGIDTESSYPYEARD-NTCRFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPI 240
Query: 277 AGNVASIELPHISF 290
+ +I+ H SF
Sbjct: 241 S---VAIDAAHRSF 251
>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
Length = 617
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/238 (38%), Positives = 136/238 (57%), Gaps = 16/238 (6%)
Query: 45 HLLNA-EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
H LN EH F F+ K+ + YA EH R R+F+ NLR + + +A +G+T+F+
Sbjct: 302 HTLNKIEHLFHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERGSAKYGITQFA 361
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGS 160
D+T +E++ GL +R A ++P ++P +FDWR AVT VK+QG CGS
Sbjct: 362 DMTSTEYKLH-AGLWQRSEDKPTGGAAAVVPPYAGEMPKEFDWRQKKAVTHVKNQGQCGS 420
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG + + TGEL SEQ+L+DCD S DS CNGGLM++A++ I
Sbjct: 421 CWAFSVTGNIEGLYAIKTGELEEFSEQELLDCD---------STDSACNGGLMDNAYKAI 471
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
GG+E E +YPY C F+++ +S F + +E M L+ +GP++
Sbjct: 472 KDIGGLEYESEYPYAAKK-MQCHFNRTMSHVQLSGFVDLPKGNETAMQEWLLSNGPIS 528
>gi|29841177|gb|AAP06190.1| similar to GenBank Accession Number U07345 preprocathepsin L in
Schistosoma mansoni [Schistosoma japonicum]
Length = 356
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 134/220 (60%), Gaps = 19/220 (8%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
N ++ FK + K Y + +++ RF +FK+NL +A+ Q+L+ +AV+GVT +SDLT
Sbjct: 152 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 210
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L R A +++ I P D+P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 211 DEFSRTHLTAPWR----ASSKRNTISPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWA 266
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD S D GCNGGL ++A+E I++
Sbjct: 267 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRM 317
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDE 263
GG+ E +YPY + C + +AA +++ ++ DE
Sbjct: 318 GGLMLEDNYPYDAKN-EKCHLKVANVAAYINSSVNLTQDE 356
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 89/244 (36%), Positives = 139/244 (56%), Gaps = 14/244 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
++ F + +F K Y E RF +FK+N+ + V G+ +DLT E+
Sbjct: 178 KNEFENWIDRFEKKYDVSE-FKKRFSIFKSNMDFVHSWNSKNSQTVLGLNHLADLTNLEY 236
Query: 110 RRQFLGLNRR--LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
R+ +LG +++ L P + + + + DWR GAV+ +KDQG CGSCWSFS T
Sbjct: 237 RQFYLGTHKKAVLGTPGNHEVSNLQSVFGDSATVDWRQKGAVSPIKDQGQCGSCWSFSTT 296
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G++EGAH + +G +V LSEQ LVDC + + GCNGGLM+ AFEYI+ G++
Sbjct: 297 GSVEGAHQIKSGNMVELSEQNLVDC-------STSEGNMGCNGGLMDYAFEYIITNNGID 349
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELP 286
E YPYT + G +CK++K+ A +S++ I++ + A+ VK+ GP++ +I+
Sbjct: 350 TESSYPYTASSGTTCKYNKANSGATISSYKNITAGSESDLADAVKNAGPVS---VAIDAS 406
Query: 287 HISF 290
H SF
Sbjct: 407 HNSF 410
>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
Length = 615
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 88/242 (36%), Positives = 140/242 (57%), Gaps = 15/242 (6%)
Query: 40 EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGV 98
+ S L A+H F F+ +F + Y + E R R+F+ NL+ ++ + + +A +G+
Sbjct: 296 KHSHRALDKADHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGI 355
Query: 99 TKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQG 156
T+F+D+T SE++ + GL +R A ++P +LP +FDWR AVT VK+QG
Sbjct: 356 TEFADMTSSEYKER-TGLWQRNEAKATGGSVAVVPAYHGELPKEFDWRQKNAVTQVKNQG 414
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
+CGSCW+FS TG +EG H + TG+L SEQ+L+DCD + DS CNGGLM++A
Sbjct: 415 SCGSCWAFSVTGNIEGLHAVKTGDLKEFSEQELLDCD---------TTDSACNGGLMDNA 465
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGP 275
++ I GG+E E +YPY C F+++ V+ F + +E M L+ +GP
Sbjct: 466 YKAIKDIGGLEYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGP 524
Query: 276 LA 277
++
Sbjct: 525 IS 526
>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
Length = 283
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 124/208 (59%), Gaps = 16/208 (7%)
Query: 73 RFRVFKANLRRAKR---RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
RF++F+ N+++ +L D A +GVT+FSDL EFRR +L L D +A
Sbjct: 2 RFKIFRENMKKINTLNDNELGD--AEYGVTQFSDLAEEEFRRYYLTPKWDLSHRPDLVRA 59
Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
I P D P FDWRDH AVT VK+QG CGSCW+FS T +EG + +LVSLSEQ+L
Sbjct: 60 KI-PDVDPPASFDWRDHNAVTPVKNQGMCGSCWAFSTTENIEGQWAIHRNKLVSLSEQEL 118
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCD D GC GGL +A+E I++ GG+E EK YPY D CKF +
Sbjct: 119 VDCD---------KLDDGCEGGLPVNAYEEIIRLGGLESEKKYPYDAED-EKCKFTVGDV 168
Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLA 277
A +++ ISS+E MAA L K+GP++
Sbjct: 169 AVYINSSVNISSNEADMAAWLYKNGPIS 196
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/281 (34%), Positives = 149/281 (53%), Gaps = 22/281 (7%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M +L + L +S+ +V +D + S +HL + + LF+S
Sbjct: 1 MALSVLKTSFLTFFASLFVCSVLAHDFSIV---------GYSPEHLTSVDKLVELFESWI 51
Query: 61 S---KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
S K Y + EE +RF VFK NL+ +R + G+ +F+DL+ EF+ +FLGL
Sbjct: 52 SGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKFLGLY 111
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
++ DLP DWR GAVT VK+QG+CGSCW+FS A+EG + +
Sbjct: 112 PEFPRKKSSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV 171
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G L SLSEQQL+DCD S ++GCNGGLM+ AFE+I+ GG+ +E+DYPY
Sbjct: 172 AGNLTSLSEQQLIDCDT--------SFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYL-M 222
Query: 238 DGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ G+C + ++ +S + + +++Q + H PL+
Sbjct: 223 EEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLS 263
>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
Length = 620
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 89/236 (37%), Positives = 135/236 (57%), Gaps = 15/236 (6%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
L EH F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+D+
Sbjct: 307 LDKVEHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADM 366
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E++ + GL +R A ++P + +LP +FDWR AVTGVK+QG CGSCW
Sbjct: 367 TSTEYKER-TGLWQRDEAKATGGSPAVVPAYSGELPKEFDWRSKNAVTGVKNQGQCGSCW 425
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + L GEL SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 426 AFSVTGNIEGLYALKYGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 476
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
GG+E E +YPY C F+K+ V +F + +E M LV +GP++
Sbjct: 477 IGGLEYEAEYPYEAKK-KQCHFNKTMSHVQVKDFVDLPKGNETAMQEWLVSNGPIS 531
>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
Length = 610
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/245 (36%), Positives = 139/245 (56%), Gaps = 16/245 (6%)
Query: 39 GEQSEDH--LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAV 95
G + +H L EH F F+ KF + Y E R R+F+ NLR ++ + +A
Sbjct: 287 GHKKHNHHSLDKVEHLFHKFQIKFERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAK 346
Query: 96 HGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA--PILPTNDLPTDFDWRDHGAVTGVK 153
+G+T+F+D+T +E++ + R P QKA P P +LP +FDWR GAV+ VK
Sbjct: 347 YGITEFADMTSTEYKERTGLWQRTEGQPTGGQKAVVPSYPGGELPKEFDWRQKGAVSSVK 406
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+QG+CGSCW+FS G +EG + + TG+L SEQ+L+DCD + DS CNGGL
Sbjct: 407 NQGSCGSCWAFSTIGNIEGLNAVKTGQLKEFSEQELLDCD---------TKDSACNGGLP 457
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
++A++ I + GG+E E +YPY C F+K+ V+ F + ++E M L+
Sbjct: 458 DNAYKAIQEIGGLEYESEYPYKARK-EQCHFNKTLAHVQVTGFVDLPKNNETAMQEWLIA 516
Query: 273 HGPLA 277
+GP++
Sbjct: 517 NGPIS 521
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 130/228 (57%), Gaps = 11/228 (4%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ ++ + K Y E + RF +FK NLR +D + G+ +F+DLT E++
Sbjct: 51 YEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKVGLNRFADLTNEEYKAM 110
Query: 113 FLG--LNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
FLG + R+ R L +Q+ +DLP + DWR+ GAV VKDQG CGSCW+FS GA
Sbjct: 111 FLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTVGA 170
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TGEL+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E
Sbjct: 171 VEGINQIVTGELISLSEQELVDCDK--------SYNQGCNGGLMDYAFEFIINNGGIDTE 222
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+DYPY +D K+ + + + +++ V H P++
Sbjct: 223 EDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVS 270
>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
Length = 496
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/236 (36%), Positives = 139/236 (58%), Gaps = 15/236 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
F F F K Y +++E R+ +FK N++ + Q + TAV+GVT F+DLTP EFR
Sbjct: 195 QFKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQGTAVYGVTFFADLTPEEFR 254
Query: 111 RQFLGLN-RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+ +L +R +LP Q+ +P + +DWR+H AVT VK+QG CGSCW+F+
Sbjct: 255 KFYLSPQWKRDQLP---QRKASIPKGKIEDRWDWREHNAVTEVKNQGMCGSCWAFATIAN 311
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + GELVSLSEQ+LVDCD + D GC+GG ++A++ I++ GG+ E
Sbjct: 312 VEGVWAVKKGELVSLSEQELVDCD---------TLDQGCSGGYPSNAYKEIIRLGGLTTE 362
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
+Y Y G + G+C+F +++ + DE ++AA + ++GP+A + + +
Sbjct: 363 TNYSYDG-NQGTCRFKTQNAKVYINDSVSLPEDETEIAAYIRENGPVAVGINAFAM 417
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/234 (40%), Positives = 136/234 (58%), Gaps = 15/234 (6%)
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
SK+Y + EE +R+ V++ N + + + T+ + KF DLT +EF + F GL
Sbjct: 38 SKSY-SNEEFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKFGDLTNAEFNKLFKGLAFDY 96
Query: 121 RLPADAQKA-PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
A+ A +P L DFDWR GAVT VK+QG CGSCWSFS TG+ EGA+FL TG
Sbjct: 97 SFHANKAAAEKAVPAPGLSADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTG 156
Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
L SLSEQ L+DC SGS ++GCNGGLM+ AFEYI+ G++ E YPY T
Sbjct: 157 RLTSLSEQNLIDC--------SGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPYQ-TA 207
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+C+++ + ++++++ +SS ++ N V P + +I+ H SF F
Sbjct: 208 QYTCQYNPANSGGSLTSYTDVSSGDENALLNAVATEPTS---VAIDASHNSFQF 258
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 99/242 (40%), Positives = 133/242 (54%), Gaps = 27/242 (11%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP---TAVH----GVT 99
LN E F +K F K+Y+ E R V++AN + L+D +H G+
Sbjct: 26 LNME--FEAWKRTFGKSYSDAVEEINRRAVWEAN------KMLVDAHNGAGIHSYTLGMN 77
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQG 156
F+DLT EF+R +LG L P + +PT + LP DWR G VT VKDQG
Sbjct: 78 IFADLTHEEFKRFYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQG 137
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCWSFS TG++EG H TG+LVSLSEQ LVDC + GCNGGLM+ A
Sbjct: 138 QCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSK-------AQGNQGCNGGLMDDA 190
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GP 275
F+YI+ G++ E YPYT D G+CKF+ + + A +S+F I+ + N V GP
Sbjct: 191 FQYIITNKGIDTEASYPYTAKD-GTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGP 249
Query: 276 LA 277
++
Sbjct: 250 VS 251
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 100/251 (39%), Positives = 135/251 (53%), Gaps = 19/251 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN---LRRAKRRQLLDPTAVHGVTKFSDL 104
N H+ FK++ +K Y + E R +F+ N + ++ D G+ F DL
Sbjct: 76 NLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFD--FYLGMNHFGDL 133
Query: 105 TPSEFRRQFLGLNRRLRLPADAQK--APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T E+R ++LG R P+ A + D+P DWRD G VT VK+QG CGSCW
Sbjct: 134 TNKEYRERYLGYRRPENTPSKASYIFSRAEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCW 193
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA G+LEG HF STG+LVSLSEQ LVDC PE +SGCNGG M+ AFEY+
Sbjct: 194 AFSAVGSLEGQHFKSTGKLVSLSEQNLVDCS---TPE----GNSGCNGGWMDQAFEYVKD 246
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVA 281
G++ E YPY GTD GSC F I A + F V DE+ + + GP++
Sbjct: 247 NHGIDTEDSYPYVGTD-GSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVS---V 302
Query: 282 SIELPHISFSF 292
+I+ + F F
Sbjct: 303 AIDASSMLFQF 313
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 99/279 (35%), Positives = 154/279 (55%), Gaps = 20/279 (7%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ ++++LLL ++SA+ D + + +S++ L++ + + K K
Sbjct: 36 MAMATILLLFTVFAVSSAL---DMSIISYDNAHAATSRSDEELMSMYEQWLV---KHGKV 89
Query: 64 YATQEEHDYRFRVFKANLRRAK-RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NR 118
Y E + RF++FK NLR D T G+ +F+DLT E+R ++LG NR
Sbjct: 90 YNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNR 149
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
RL + AP + + LP DWR GAV VKDQG CGSCW+FSA GA+EG + + T
Sbjct: 150 RLGKTPSNRYAPRV-GDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVT 208
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GEL+SLSEQ+LVDCD + GCNGGLM+ AFE+I+ GG++ E+DYPY G D
Sbjct: 209 GELISLSEQELVDCDT--------GYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVD 260
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G + K+ ++ ++ + + ++ V + P++
Sbjct: 261 GRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVS 299
>gi|13507095|gb|AAK28439.1| cysteine protease 3 precursor [Clonorchis sinensis]
Length = 320
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/241 (41%), Positives = 142/241 (58%), Gaps = 13/241 (5%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 26 NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V DQG CGSCW+FSA
Sbjct: 85 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF IL GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCD---------GVDEGCNGGTPQQAFRQILGMGGL 194
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GPL+ + ++ L
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQ 253
Query: 287 H 287
H
Sbjct: 254 H 254
>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
Length = 335
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 139/249 (55%), Gaps = 18/249 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++H++L+K+ K+YA +EE +R +++ NLR + L H G+ +F D+T
Sbjct: 26 DNHWNLWKNWHKKSYAPKEE-GWRRVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + ++ AP + P DWR G VT VKDQG CGSCW+FS
Sbjct: 85 NEEFRQLMNGYKNQKKIRGSTFLAP--NNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG H+ +TG+++SLSEQ LVDC + GCNGGLM+ AF+Y+ GG
Sbjct: 143 TTGALEGQHYRNTGKMISLSEQNLVDC-------SRAQGNQGCNGGLMDQAFQYVKDNGG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
++ E YPYT D C +D + +A + F ++S+ ++ N V GP++ +++
Sbjct: 196 IDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSESEKDLMNAVASVGPVS---VAVD 252
Query: 285 LPHISFSFL 293
H SF F
Sbjct: 253 AGHQSFQFY 261
>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 344
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/239 (35%), Positives = 129/239 (53%), Gaps = 23/239 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FKSKF+K Y + EH F +K + + Q+ +P A G TKFSD++P EF +
Sbjct: 33 FEEFKSKFNKYYHNEHEHHSSFHNYKTSREHIVKHQMENPNAKFGHTKFSDMSPEEFENK 92
Query: 113 FLGLN---------RRLRLPADAQKAPI-----LPTNDLPTDFDWRDHGAVTGVKDQGAC 158
L + + ++L A+ K + + +DLP FDWRD G +T K Q C
Sbjct: 93 MLNFDFSLFKKAKSQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQNTC 152
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+F+ TG +E + L GEL+ SEQ L+DCD + + GC GGLM A++
Sbjct: 153 GSCWTFATTGVIESQYALKYGELLHFSEQMLLDCD---------NINQGCRGGLMTDAYQ 203
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++ ++GG++ Y C FDK+K+ A V ++ I +E+ + LVK+GP+A
Sbjct: 204 FLQQSGGIQTADTYGDYKNKKDICNFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVA 262
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/246 (36%), Positives = 140/246 (56%), Gaps = 18/246 (7%)
Query: 54 SLFKSKF---SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEF 109
+LF+S K+Y E + RF++FK NLR + L++ G+ KF+DLT E+
Sbjct: 43 TLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEY 102
Query: 110 RRQFLGLNR---RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
R ++ G+ R ++ A + + L LP DWR+ GAV VKDQG+CGSCW+FS
Sbjct: 103 RSKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFST 162
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
A+EG + ++TG+L++LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG+
Sbjct: 163 ISAVEGINQIATGKLITLSEQELVDCDR--------SYNEGCNGGLMDYAFEFIINNGGI 214
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
+ + DYPYTG DG ++ K+ + ++ + + ++ + P++ +IE
Sbjct: 215 DTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPIS---VAIEAS 271
Query: 287 HISFSF 292
F F
Sbjct: 272 GRDFQF 277
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/246 (40%), Positives = 138/246 (56%), Gaps = 35/246 (14%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQ--SEDHLLNAEHHFSLFKSKF 60
R + SL+LL++ A+ D +V +G Q S+D +L+ H +
Sbjct: 6 RALGLSLVLLVI------AIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWL---ETH 56
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG---LN 117
S+ Y + E +RF++FK N + G+ KFSDLT EFR Q+LG +N
Sbjct: 57 SRVYRSLSEKHHRFQIFKENFLYIHAHNKQQKSYWLGLNKFSDLTHQEFRAQYLGTKPVN 116
Query: 118 RRLR----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
R+ + + D + P + DWR GAVT VKDQGACGSCW+FSA G++EG
Sbjct: 117 RQRKEANFMYEDVEAEPKV---------DWRLKGAVTDVKDQGACGSCWAFSAVGSVEGV 167
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TGELVSLSEQ+LVDCD + + GCNGGLM+ AFE+I+K GG++ EKDYP
Sbjct: 168 NAIKTGELVSLSEQELVDCDRK--------QNQGCNGGLMDYAFEFIIKNGGIDTEKDYP 219
Query: 234 YTGTDG 239
Y DG
Sbjct: 220 YKARDG 225
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/298 (37%), Positives = 155/298 (52%), Gaps = 32/298 (10%)
Query: 11 LLLLSSVLASA--VAVNDDDAMIRQVVPSDGEQSEDHLLNA---------EHHFSLFKSK 59
+L + SVLA A V + + + + H+L A E + FK
Sbjct: 3 VLWIVSVLAVARGATVQTGNVQWFDLEAAQKHPEQLHILKAKAGINYQPYEQAWKEFKIL 62
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSEFRRQFLG 115
KTY EE RF +F+ N+++ + L + GV +FSDL EF + + G
Sbjct: 63 HDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEFVK-YNG 121
Query: 116 LNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
L ++ L D + L N+L P DWR G VT VK+QG CGSCWSFS TG+LEG
Sbjct: 122 L-KKTSLK-DGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSLEGQ 179
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
HF +G+LVSLSE QLVDC E GCNGGLM++AF+YI GG+E E+DYP
Sbjct: 180 HFRKSGKLVSLSESQLVDCSQSFGNE-------GCNGGLMDNAFKYIKSVGGLESEEDYP 232
Query: 234 YTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
Y G+CKFD +K+AA + V S E + + + GP++ +I+ H SF
Sbjct: 233 YKPKQ-GTCKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVS---VAIDASHSSF 286
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 126/219 (57%), Gaps = 8/219 (3%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
K K + E D RF +FK NLR + + G+TKF+DLT E+R +LG
Sbjct: 48 KHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRL 107
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ + + + + +P DWR GAV VKDQG+CGSCW+FS GA+EG + + T
Sbjct: 108 KRKATKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVT 167
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ E+DYPY G D
Sbjct: 168 GDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVD 219
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G + K+ + ++ + ++ ++ + H P++
Sbjct: 220 GRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPIS 258
>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 524
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/233 (38%), Positives = 127/233 (54%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFR+FK ++ RAK +P A GVT+FSD++P EF
Sbjct: 117 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 176
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R +L G +K + T P DWR GAVT VKDQG+CGSCW+F+A G
Sbjct: 177 RATYLNGAKYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAAIG 236
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + + C GG + AF++I+ + G V
Sbjct: 237 NIEGQWKIAGHELTSLSEQMLVSCD---------TTEDNCGGGFADRAFKWIVSSNKGNV 287
Query: 227 EREKDYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY DG C + A +S + DE+ +A L ++GP+A
Sbjct: 288 FTERSYPYASIDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVA 340
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/258 (37%), Positives = 140/258 (54%), Gaps = 21/258 (8%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
+S+D +++ + + K K Y E RF +FK NLR + T G+TK
Sbjct: 19 RSDDEVMSI---YKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQNRTYKVGLTK 75
Query: 101 FSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQ 155
F+DLT E+R FLG RRL + + D LP DWR GAV +KDQ
Sbjct: 76 FADLTNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQ 135
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G+CGSCW+FS A+EG + + TGEL+SLSEQ+LVDCD ++GCNGGLM+
Sbjct: 136 GSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDR--------FYNAGCNGGLMDY 187
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHG 274
AF++I+ GG++ EKDYPY G D +C DK K A ++ F + +++ V H
Sbjct: 188 AFQFIINNGGLDTEKDYPYLGND-DTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQ 246
Query: 275 PLAGNVASIELPHISFSF 292
P++ +IE ++ F
Sbjct: 247 PVS---VAIEASGMALQF 261
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 96/254 (37%), Positives = 135/254 (53%), Gaps = 20/254 (7%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFS 102
L+ E + FK K K Y+ +EE+ R +F+ NL+ + T H GV +F+
Sbjct: 18 LSFESQWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTIESHNQEADTGKHSYWLGVNQFA 76
Query: 103 DLTPSEFRRQFLG---LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
D+T +E+ Q +G + L +P + DWRD G VT +KDQG CG
Sbjct: 77 DMTHAEYLNQVIGGCLITSNLTKTGSRATYRYMPNMQVNDTVDWRDKGLVTDIKDQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG+LEG H +TG LVSLSEQ LVDC + + GC GG M+ F+Y
Sbjct: 137 SCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQ-------EGNKGCEGGDMDQGFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAG 278
I++ G++ E+ YPY + CKFD S I A +S+F+ V S DED + GP++
Sbjct: 190 IIQNKGIDTEQCYPYKAKN-HRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPIS- 247
Query: 279 NVASIELPHISFSF 292
I+ H SF F
Sbjct: 248 --VGIDASHQSFQF 259
>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 359
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 106/279 (37%), Positives = 151/279 (54%), Gaps = 20/279 (7%)
Query: 3 RLIL-SSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLF 56
R IL S++LL+L+++ A ++ D+ IR V + E+S +L H F+ F
Sbjct: 5 RTILPSAVLLILIAASTAESIGF-DESNPIRMVSDRLREVEESVVQILGQSRHVISFARF 63
Query: 57 KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
++ K Y EE RF +FK NL + + GV +F+D+T EF+R LG
Sbjct: 64 AHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADMTWQEFQRTKLGA 123
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
+ A + L LP DWR+ G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 124 AQNC--SATLKGTHKLTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQ 181
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
+ G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPYTG
Sbjct: 182 AFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 234
Query: 237 TDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
D G+CK+ + V N ++ + DE + A LV+
Sbjct: 235 ED-GTCKYSAENVGVEVLDSVNITLGAEDELKHAVGLVR 272
>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 333
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 96/254 (37%), Positives = 134/254 (52%), Gaps = 19/254 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D L+A+ ++ ++S + K YA EE D+R V++ N++ +R HG T
Sbjct: 22 DQSLDAQ--WNQWRSTYKKVYAVNEE-DWRRAVWEKNMKMIERHNQEYSQGKHGFTMAMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D T EFR+ G + P+ +PT DW G VT VKDQG CG
Sbjct: 79 AFGDKTNEEFRQLMNGFQSQKHKKGKLFYEPVF--GHIPTSVDWTQKGYVTPVKDQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF+Y
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWR-------EGNEGCNGGLMDNAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
+ GG++ E+ YPYT TD C+++ AA + F I E + + GP++
Sbjct: 190 VKDNGGLDSEESYPYTATDTQDCRYNPKYSAANDTGFVDIPPQEKALMKAVATVGPIS-- 247
Query: 280 VASIELPHISFSFL 293
+I+ +SF F
Sbjct: 248 -VAIDAGQVSFQFY 260
>gi|328868405|gb|EGG16783.1| cysteine protease 4 [Dictyostelium fasciculatum]
Length = 454
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/240 (37%), Positives = 128/240 (53%), Gaps = 13/240 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ + K + Y++ E R+ VFK N+ + V G+ F+D++ E++R
Sbjct: 30 FTSWMQKQGRVYSSHE-FGARYNVFKKNMDYVQEWNSKGSETVLGLNVFADISNEEYQRI 88
Query: 113 FLG--LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+LG ++ RL A A DWR GAVT +K+QG CGSCWSFS TG+
Sbjct: 89 YLGTKVDGTARLAAAASTTMDRIYEVQAATVDWRQQGAVTAIKNQGQCGSCWSFSTTGST 148
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EGAHFLST LVSLSEQ L+DC + + GCNGGLM AF YI+K GG++ E
Sbjct: 149 EGAHFLSTKNLVSLSEQNLIDCS-------TAEGNQGCNGGLMTQAFTYIIKNGGIDTEA 201
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
YPY G C ++ + AA +S ++ ++S + A P++ +I+ H SF
Sbjct: 202 SYPYKAVQGKKCLYNTANKAATISKYTEVTSGSEAALATAANAAPIS---VAIDASHNSF 258
>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
Length = 599
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/236 (38%), Positives = 136/236 (57%), Gaps = 15/236 (6%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDL 104
L +H F F+ K+ + YA EH R R+F+ +L+ + + +A +G+T+F+D+
Sbjct: 286 LNKVDHLFHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQELNANEQGSAKYGITEFADM 345
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E+ Q GL +R A ++P +LP +FDWR AVT VK+QG CGSCW
Sbjct: 346 TSTEYA-QRAGLWQRSEGKPTGGAAAVVPAYAGELPKEFDWRQKNAVTHVKNQGQCGSCW 404
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EGA+ + TG+L SEQ+L+DCD S DS CNGGLM++A++ I
Sbjct: 405 AFSVTGNIEGAYAIKTGDLQEFSEQELLDCD---------SKDSACNGGLMDNAYKAIKD 455
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
GG+E E +YPY G C F+++ VS F + +E M L+ +GP++
Sbjct: 456 IGGLEYESEYPYEGKK-KQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTNGPIS 510
>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
Length = 344
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 128/234 (54%), Gaps = 16/234 (6%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K+Y T EE R+ +FKAN+ ++ V G+ F+D+T E+R +LG
Sbjct: 39 KSY-TSEEFGARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYRNTYLGTKFDAS 97
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
Q+ + T+ + DWR GAVT VK+QG CG CWSFS TG+ EGAHF S GEL
Sbjct: 98 SLIGTQEEKVFTTSSAASK-DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGEL 156
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQ L+DC E +SGC+GGLM AFEYI+ G++ E YPY + G
Sbjct: 157 VSLSEQNLIDCSTE---------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKA-ENGK 206
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
C++ A +S++ +++ + + V P++ +I+ H SF L+T
Sbjct: 207 CEYKSENSGATLSSYKTVTAGSESSLESAVNVNPVS---VAIDASHQSFQ-LYT 256
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/248 (36%), Positives = 137/248 (55%), Gaps = 16/248 (6%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ + + +TY E + RF VF+ NLR
Sbjct: 31 IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LG+ R + + N DLP DWR GAV
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
VKDQG+CGSCW+FS A+EG + + TG+++SLSEQ+LVDCD S + GCN
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AFE+I+ GG++ E+DYPY GTDG K+ + ++ + ++ ++
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQK 259
Query: 270 LVKHGPLA 277
V + P++
Sbjct: 260 AVANQPIS 267
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 102/253 (40%), Positives = 142/253 (56%), Gaps = 24/253 (9%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDL 104
A ++ L+K K+Y EEH +R ++F ++ + R L T G+ KF+D+
Sbjct: 15 ASANWDLYKKVHGKSYGHDEEH-FRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDM 73
Query: 105 TPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
T EFR F GL + R QK L LPT DWR+ G VT VK+QG CGS
Sbjct: 74 TSEEFR-NFKGLKFDATKTKRNGTRFQKE--LLGEALPTQVDWREKGYVTPVKNQGQCGS 130
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG+LEG HF +TG+LVSLSEQ LVDC ++GCNGGLM++ F YI
Sbjct: 131 CWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRV-------EGNNGCNGGLMDNGFTYI 183
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGN 279
+ GG++ E+ YPYTG D G C F+++ + A V F V DE + A + GP++
Sbjct: 184 QQNGGIDTEESYPYTGKD-GDCAFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVS-- 240
Query: 280 VASIELPHISFSF 292
+I+ + SF +
Sbjct: 241 -VAIDASNDSFQY 252
>gi|281207557|gb|EFA81740.1| hypothetical protein PPL_05734 [Polysphondylium pallidum PN500]
Length = 387
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 90/232 (38%), Positives = 130/232 (56%), Gaps = 20/232 (8%)
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T +E ++R+ VFK NL + + V G+ F+DLT +E++R +LG +
Sbjct: 46 TTQEFNHRYGVFKKNLNFVNQWNAKGSSTVLGMNVFADLTNAEYQRIYLGSKIDTSSMMN 105
Query: 126 AQKAPIL----PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
A A + L DWR GAVT +K+Q CGSCWSFS TG++EGAH ++TG L
Sbjct: 106 ANAARLFDRTYNVKALSPTVDWRQKGAVTHIKNQQQCGSCWSFSTTGSIEGAHEIATGNL 165
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQ L+DC + + GCNGGLM +AFEY++K GG++ E YPY+ T
Sbjct: 166 VSLSEQNLIDC-------STAEGNQGCNGGLMTNAFEYVIKNGGIDTEASYPYSATGPNK 218
Query: 242 CKFDKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
C+++ + A +S N +V S AAN+ GP++ +I+ H SF
Sbjct: 219 CRYNPANSGATISSYVNVTVGSETALMAAANI---GPVS---VAIDASHNSF 264
>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
Length = 320
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 99/232 (42%), Positives = 133/232 (57%), Gaps = 21/232 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANL-----RRAKRRQLLDPTAVHGVTKFSDLTPS 107
F FK K +KTY T E R+ +F+A L ++ Q L+ T GV KFSD T
Sbjct: 23 FQAFKLKQNKTYKTPVEETTRYGIFQAKLLEIEEHNSRFEQGLE-TYKKGVNKFSDWTQD 81
Query: 108 EFRRQFLGLNRRLRLPADAQKA-PILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF +LGL+ + PA K P + T +P DWR G VTGVK+QG CGSCW+FS
Sbjct: 82 EFN-AYLGLHPK---PAKLGKGIPYVKTGVSVPASVDWRTEGYVTGVKNQGDCGSCWAFS 137
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG++EGA F STG+LVSLSEQQLVDC + G+ + GC+GG + F YI + G
Sbjct: 138 LTGSVEGALFKSTGKLVSLSEQQLVDCTY-------GTVNFGCDGGYLEETFPYIQET-G 189
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+E E YPY D G+CKFD SK+ ++++ DE+ + GP++
Sbjct: 190 LEAEASYPYKARD-GTCKFDASKVVTKINDYVYWYGDEEALLEATATIGPIS 240
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/246 (38%), Positives = 137/246 (55%), Gaps = 22/246 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSE 108
++ +K++ K Y + EE R +++ NL + + L T G+ +F+DL E
Sbjct: 28 WNEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQFTDLQNEE 87
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
F G R A+ + LP N+ LP DWR G VT VKDQG CGSCW+FS
Sbjct: 88 FVAMMTGF-RVSGTSKAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWAFS 146
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG++EG HF +TG+LVSLSEQ LVDC D+GC+GG M+ AF+YI+ AGG
Sbjct: 147 TTGSVEGQHFKATGKLVSLSEQNLVDCSGR---------DAGCDGGFMDRAFQYIIDAGG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
++ E YPY D G C F K+ + A V+ ++ ++S ++ V H GP++ +I+
Sbjct: 198 IDTEASYPYKAVD-GKCHFKKANVGATVTGYTDVTSGSEKALQKAVAHVGPIS---VAID 253
Query: 285 LPHISF 290
H+SF
Sbjct: 254 ASHMSF 259
>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
Length = 357
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 131/231 (56%), Gaps = 26/231 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR-----RAKRRQLLDPTA-VHGVTKFSDLTP 106
F L++ + Y +E RF +F +NL AKR P+ + G+ F+D +P
Sbjct: 52 FQLWRKEHGLVYKDLKEMAKRFEIFLSNLNYIIEFNAKRS---SPSGYLLGLNNFADWSP 108
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
SEF+ +L L +P D+ P+L P DWR+ AVT +K+QG+CGSCW+
Sbjct: 109 SEFQEIYL---HSLDMPTDSAPKLNGPLLSC-IAPASLDWRNKVAVTAIKNQGSCGSCWA 164
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSA GA+EG H ++TGEL+SLSEQ+LV+CD GCNGG +N AF++++
Sbjct: 165 FSAAGAIEGIHAITTGELISLSEQELVNCDR---------VSKGCNGGWVNKAFDWVISN 215
Query: 224 GGVEREKDYPYTGTDGGSCKFDKS-KIAAAVSNFSVISSDEDQMAANLVKH 273
GG+ E +YPYTG DGG+C DK I A + + + ++ + ++VK
Sbjct: 216 GGITLEAEYPYTGKDGGNCNSDKQVPIKATIDGYEQVEQSDNGLLCSIVKQ 266
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 97/231 (41%), Positives = 123/231 (53%), Gaps = 23/231 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FK+KF K YA EE +R VF L+ +R + T + FSDLT E
Sbjct: 23 FKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEVLA 82
Query: 112 QFLGLNRRLR----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
G+ RR LP A PT + D DWR+ GAVT VKDQG CGSCW+FSA
Sbjct: 83 TKTGMTRRRHPLSVLPKSA------PTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAV 136
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
ALEGAHFL TG+LVSLSEQ LVDC S + GCNGG A++YI+ G++
Sbjct: 137 AALEGAHFLKTGDLVSLSEQNLVDC-------SSSYGNQGCNGGWPYQAYQYIIANRGID 189
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
E YPY D +C++D I A VS++ S DE + + GP++
Sbjct: 190 TESSYPYKAID-DNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVS 239
>gi|321460289|gb|EFX71333.1| hypothetical protein DAPPUDRAFT_189155 [Daphnia pulex]
Length = 266
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/185 (44%), Positives = 116/185 (62%), Gaps = 10/185 (5%)
Query: 93 TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
TAV+G T FSD + +E++ G N LR + +P DLP +FDWR+H VT V
Sbjct: 3 TAVYGDTPFSDWSAAEYKAHLAGFNPSLRQSNARLRQAAIPEIDLPDEFDWRNHSVVTPV 62
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
KDQG+CGSCW+FS TG +EG + + G+L+SLSEQ+LVDCD DSGCNGGL
Sbjct: 63 KDQGSCGSCWAFSVTGNVEGIYAVRNGDLLSLSEQELVDCD---------KLDSGCNGGL 113
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
+A++ I GG+E E DYPY G + CKF+ + V+ IS++E +MA L++
Sbjct: 114 PENAYKAIHDIGGLETESDYPYNGHE-NKCKFNSNITRVQVTGGVEISTNETEMAQWLIQ 172
Query: 273 HGPLA 277
+GP++
Sbjct: 173 NGPIS 177
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 93/248 (37%), Positives = 137/248 (55%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H++LFK+ F K Y+T EE R ++AN+ ++ L +H G+ ++DLT +
Sbjct: 27 HWALFKTTFGKQYSTAEEITRRL-AWEANVAIIRQHNLEHDLGLHTYTLGLNNYADLTNA 85
Query: 108 EFRRQFLGLNRRLRLPADA-QKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + GL A ++ + P +LPT DWR G VT +KDQG CGSCW+FS
Sbjct: 86 EFNQVMNGLRVNASQTKSANRRTYVAPVGVELPTSVDWRTKGYVTPIKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
+TG+LEG HF TG+LVSLSEQ L DC + + GCNGGLM+ AF YI + G
Sbjct: 146 STGSLEGQHFAKTGQLVSLSEQNLTDCSQK-------QGNMGCNGGLMDQAFTYIKENNG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY D C F + + A + ++ I+ DE+ + + + GP++ +I+
Sbjct: 199 IDTESSYPYKAVD-EKCHFKAADVGATDTGYTDIAQQDENALQSAIATVGPIS---VAID 254
Query: 285 LPHISFSF 292
H SF
Sbjct: 255 ASHSSFQL 262
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 133/230 (57%), Gaps = 13/230 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
++ + +K K Y E + RF +FK NL+ + + G+ +F+DLT E+R
Sbjct: 47 YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSENRSYKVGLNRFADLTNEEYRSM 106
Query: 113 FLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
FLG R ++ + +++ + ++ LP DWR+ GAV +KDQG+CGSCW+FS
Sbjct: 107 FLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWAFSTV 166
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
A+EG + ++TGE++ LSEQ+LVDCD + D+GCNGGLM+ AFE+I+ GG++
Sbjct: 167 AAVEGVNQIATGEMIQLSEQELVDCDR--------TYDAGCNGGLMDYAFEFIINNGGID 218
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+DYPY G DG K+ +++++ + ++ V H P++
Sbjct: 219 TEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVS 268
>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 96/234 (41%), Positives = 135/234 (57%), Gaps = 23/234 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F F ++K Y+++E ++ R +FK NLRR + D A HG+T+F+DLT EF
Sbjct: 30 FKKFTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKNDE-AQHGITQFADLTHEEFADM 88
Query: 113 FLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+LG +LR ++Q L + PT DW GAVT VK+QG+CGSCW+FS TG++
Sbjct: 89 YLGYKPQLR---NSQAKVSLSSTPFTAPTAIDWTTKGAVTPVKNQGSCGSCWAFSTTGSI 145
Query: 171 EGAHFLSTGE-LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
EG + L + L S SEQQLVDCD + D GCNGGLM++AF Y L++ +E E
Sbjct: 146 EGQYVLQLKQNLTSFSEQQLVDCDTK--------EDQGCNGGLMDNAFTY-LESAKLETE 196
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNF------SVISSDEDQMAANLVKHGPLA 277
YPYT D GSCK+++S V++F ++ E+ M L GPL+
Sbjct: 197 SAYPYTAVD-GSCKYNQSLGVVGVASFVDIEQGKTVADTENTMGVALDNIGPLS 249
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 139/238 (58%), Gaps = 26/238 (10%)
Query: 53 FSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSE 108
L+KS + K Y E + RF +FK NLR + T G+ KF+DLT E
Sbjct: 43 MGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQE 102
Query: 109 FRRQFLGLN----RRL---RLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
+R +FLG RRL ++P+ A +A ++LP +WRDHGAV+ VKDQG+CG
Sbjct: 103 YRAKFLGTRTDPRRRLMKSKIPSSRYAHRA----GDNLPDSVNWRDHGAVSRVKDQGSCG 158
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA A+EG + + +GEL+SLSEQ+LVDCD S D+GCNGGLM+ AF++
Sbjct: 159 SCWAFSAIAAVEGINKIVSGELISLSEQELVDCDR--------SYDAGCNGGLMDYAFQF 210
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I+ GG++ EKDYPY G + K+ ++ + + ++E+ + V H P++
Sbjct: 211 IIDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVS 267
>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTAFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 106/295 (35%), Positives = 157/295 (53%), Gaps = 32/295 (10%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSK 62
S L+ + S++L SA+A D I P + L + E LF+S + SK
Sbjct: 11 FSLLVAISASALLCSALA---RDFSIVGYTP-------EQLTSTEKLLELFESWMSEHSK 60
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---- 118
Y + EE +RF VF+ NL +R + G+ +F+DLT EF+ ++LGL +
Sbjct: 61 VYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFS 120
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R R P+ + + DLP DWR GAV VKDQG CGSCW+FS A+EG + ++T
Sbjct: 121 RKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITT 178
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L SLSEQ+L+DCD + +SGCNGGLM+ AF+YI+ GG+ +E DYPY +
Sbjct: 179 GNLSSLSEQELIDCDT--------TFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYL-ME 229
Query: 239 GGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
G C+ K + +S + + ++D+ + H P++ +IE F F
Sbjct: 230 EGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVS---VAIEASGRDFQF 281
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 127/208 (61%), Gaps = 25/208 (12%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL-----RRAKRRQLLDPTAVH 96
SE+ +L F +K K K Y EE + RF FK NL R AKR+ V
Sbjct: 41 SEERVLEI---FQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHV- 96
Query: 97 GVTKFSDLTPSEFRRQFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTG 151
G+ KF+D++ EFR+ +L +N+ + L + ++ + + D P+ DWR++G VT
Sbjct: 97 GLNKFADMSNEEFRKAYLSKVKKPINKGITLSRNMRRK--VQSCDAPSSLDWRNYGVVTA 154
Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
VKDQG+CGSCW+FS+TGA+EG + L TG+L+SLSEQ+LV+CD + + GC GG
Sbjct: 155 VKDQGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECD---------TSNYGCEGG 205
Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDG 239
M+ AFE+++ GG++ E DYPYTG DG
Sbjct: 206 YMDYAFEWVINNGGIDSESDYPYTGVDG 233
>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 96/240 (40%), Positives = 124/240 (51%), Gaps = 26/240 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A V
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVGV 261
>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|343475823|emb|CCD12886.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 93/233 (39%), Positives = 123/233 (52%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD D GC GL + AF++IL + G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DLGCELGLKDPAFQWILWSNKGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G +C + A +SN + DED +A L + GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCDMSGKVVGAKISNMRYLPLDEDTIAEWLARKGPVA 261
>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 136/249 (54%), Gaps = 18/249 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++H++L+K+ K+YA +EE +R +++ NLR + L H G+ +F D+T
Sbjct: 26 DNHWNLWKNWHKKSYAPKEE-GWRRVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + ++ AP + P DWR G VT VKDQG CGSCW+FS
Sbjct: 85 NEEFRQLMNGYKNQKKIRGSTFLAP--NNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG H+ +TG+++SLSEQ LVDC + GCNGGLM+ AF+Y+ GG
Sbjct: 143 TTGALEGQHYRNTGKMISLSEQNLVDC-------SRAQGNQGCNGGLMDQAFQYVKDNGG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPYT D C +D + +A + F V S E + + GP++ +++
Sbjct: 196 IDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSGSEKDLMNAVASVGPVS---VAVD 252
Query: 285 LPHISFSFL 293
H SF F
Sbjct: 253 AGHQSFQFY 261
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 147/278 (52%), Gaps = 18/278 (6%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQ---SEDHLLNAEHHFSLFKSK 59
+L+ S+ ++L L+ ++ S+ AM ++ D S + + K
Sbjct: 2 KLLNSATVILFLTMIVVSS-------AMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVK 54
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
K + E D RF +FK NLR + + G+TKF+DLT E+R +LG +
Sbjct: 55 HGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLK 114
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ + + + + +P DWR GAV VKDQG+CGSCW+FS GA+EG + + TG
Sbjct: 115 RKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTG 174
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
+L++LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E+DYPY G DG
Sbjct: 175 DLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDG 226
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ K+ + + + ++ ++ + H P++
Sbjct: 227 RCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPIS 264
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 100/275 (36%), Positives = 149/275 (54%), Gaps = 29/275 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+S++ A ++ + + +TY E + R++VF+ NLR
Sbjct: 31 IVSYGERSDEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 87
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG R +L A A DLP DWR
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAAD---NEDLPESVDWRAK 144
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG+CGSCW+FS A+EG + + TG+L+SLSEQ+LVDCD S +
Sbjct: 145 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 196
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ EKDYPY GTDG K+ + ++ + +++++
Sbjct: 197 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 256
Query: 267 AANLVKHGPLAGNVASIELPHISF----SFLFTVS 297
V + P++ +IE +F S +FT S
Sbjct: 257 LQKAVANQPVS---VAIEAAGTAFQLYSSGIFTGS 288
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 131/253 (51%), Gaps = 19/253 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN--LRRAKRRQLLDPTAVHGV--TKF 101
L A + FK+++ + Y +E YR RVF+ N L A ++ + V +F
Sbjct: 5 LATASPSWEHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQF 64
Query: 102 SDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
D+T EF G + R P A P + D DWR GAVT VKDQG CGS
Sbjct: 65 GDMTNEEFNAVMKGYKKGSRGEPTTVFTAEGRP---MAADVDWRTKGAVTPVKDQGQCGS 121
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FSATG+LEG HFL ELVSLSEQ+LVDC E + GC GG M SAF+YI
Sbjct: 122 CWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYG-------NDGCGGGWMTSAFDYI 174
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
GG++ E YPY D SC+FD + I A + F + E+ + + GP++
Sbjct: 175 KDNGGIDTESSYPYEAQD-RSCRFDANSIGATCTGFVEVQHTEEALHEAVSDIGPIS--- 230
Query: 281 ASIELPHISFSFL 293
+I+ H SF F
Sbjct: 231 VAIDASHFSFQFY 243
>gi|281211531|gb|EFA85693.1| cysteine protease [Polysphondylium pallidum PN500]
Length = 366
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 92/244 (37%), Positives = 135/244 (55%), Gaps = 18/244 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ + KF + Y+ E ++ FK+N+ + V + +D +P E+++
Sbjct: 27 FTDWTHKFQRLYSNNEFLK-KYHTFKSNMDYVHSWNAKNSDTVLELNHLADHSPEEYKKF 85
Query: 113 FLGLNRRLRLPADAQKAPI---LPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
+LG R + + Q I L T D DWR GAV+ +KDQG CGSCWSFS T
Sbjct: 86 YLGT-RVKHIHFNVQGTHINTQLSTVFEDSGATVDWRKKGAVSPIKDQGQCGSCWSFSTT 144
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G++EGAH + TG +V LSEQ LVDC S + GCNGGLMN+AF+YI+ G++
Sbjct: 145 GSVEGAHQIKTGNMVELSEQNLVDC-------SSAEGNMGCNGGLMNNAFDYIISNHGID 197
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELP 286
E+ YPYT G CKF+K+ + A +S++ I+ + AN VK GP++ +I+
Sbjct: 198 TEQSYPYTANTGSVCKFNKTNVGATISSYKSITPGSETDLANAVKTAGPVS---VAIDAS 254
Query: 287 HISF 290
H SF
Sbjct: 255 HRSF 258
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 90/245 (36%), Positives = 133/245 (54%), Gaps = 16/245 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ + +K K Y E RF +FK NLR + T G+TKF+DLT E+R
Sbjct: 4 YKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEYRAM 63
Query: 113 FLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
FLG R ++ + +++ + LP DWR GAV +KDQG+CGSCW+FS
Sbjct: 64 FLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAFSTV 123
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
A+EG + + TGEL+SLSEQ+LVDCD + ++GCNGGLM+ AF++I+ GG++
Sbjct: 124 AAVEGINQIVTGELISLSEQELVDCDR--------TYNAGCNGGLMDYAFQFIINNGGLD 175
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPH 287
EKDYPY G D K A ++ F + +++ V H P++ +IE
Sbjct: 176 TEKDYPYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVS---VAIEASG 232
Query: 288 ISFSF 292
++ F
Sbjct: 233 MALQF 237
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 137/238 (57%), Gaps = 26/238 (10%)
Query: 53 FSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSE 108
L+KS + K Y E + RF +FK NLR + T G+ KF+DLT E
Sbjct: 42 MGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQE 101
Query: 109 FRRQFLGLN----RRL---RLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
+R +FLG RRL ++P+ A +A ++LP DWRDHGAV+ VKDQG+CG
Sbjct: 102 YRAKFLGTRTDPRRRLMKSKIPSSRYAHRA----GDNLPDSVDWRDHGAVSPVKDQGSCG 157
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS +EG + + +GELVSLSEQ+LVDCD S D+GCNGGLM+ AF++
Sbjct: 158 SCWAFSTIATVEGINKIVSGELVSLSEQELVDCDR--------SYDAGCNGGLMDYAFQF 209
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I+ GG++ EKDYPY G + K+ ++ + + ++E+ + V H P++
Sbjct: 210 IMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVS 266
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 100/252 (39%), Positives = 135/252 (53%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y + E +R ++F N + AK Q V V K++D+ E
Sbjct: 27 WQTFKLEHRKNYVDETEERFRLKIFNENKHKIAKHNQRYASGEVSFKMAVNKYADMLHHE 86
Query: 109 FRRQFLGLN----RRLRL--PADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
F G N ++LR P+ I P + +P DWR GAVT VKDQG CGSC
Sbjct: 87 FHTTMNGFNYTLHKQLRASDPSFVGVTFISPEHVKIPKSVDWRSKGAVTEVKDQGHCGSC 146
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF G L+SLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 147 WAFSSTGALEGQHFRKAGTLISLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 199
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY G D SC F+K+ I A + + DE +MA + GP++
Sbjct: 200 DNGGIDTEKSYPYEGID-DSCHFNKATIGATDRGSVDIPQGDEKKMAEAVATIGPVS--- 255
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 256 VAIDASHESFQF 267
>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 134/252 (53%), Gaps = 20/252 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
E F +K KF ++Y T E R +++ N + +L + G+T+F+D+
Sbjct: 24 EMEFHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMD 83
Query: 106 PSEFRRQF-LGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSC 161
E++ LG R A + + + LPT DWRD G VTGVKDQ CGSC
Sbjct: 84 NEEYKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSC 143
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FSATG+LEG +F TG+LVSLSEQQLVDC + + GCNGGLM+ AF+YI
Sbjct: 144 WAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYG-------NMGCNGGLMDYAFKYIQ 196
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNV 280
+ GG++ EK YPY D G C+F + A + + V DED + + GP++
Sbjct: 197 ENGGIDTEKSYPYEAED-GQCRFKPENVGAKCTGYVDVTVGDEDALKEAVATIGPVS--- 252
Query: 281 ASIELPHISFSF 292
I+ H SF
Sbjct: 253 VGIDASHSSFQL 264
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 138/242 (57%), Gaps = 17/242 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLDPTAVHGVT 99
++E+ L++ + + K K Y E + RF++FK NLR D T G+
Sbjct: 50 RTEEELMSMYEQWLV---KHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLN 106
Query: 100 KFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
+F+DLT E+R ++LG NRRL + AP + + LP DWR GAV VKDQ
Sbjct: 107 RFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRV-GDKLPDSVDWRKEGAVPPVKDQ 165
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FSA GA+EG + + TGEL+SLSEQ+LVDCD + GCNGGLM+
Sbjct: 166 GGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDT--------GYNQGCNGGLMDY 217
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
AFE+I+ GG++ ++DYPY G DG + K+ ++ ++ + + ++ V + P
Sbjct: 218 AFEFIINNGGIDSDEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQP 277
Query: 276 LA 277
++
Sbjct: 278 VS 279
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 102/250 (40%), Positives = 132/250 (52%), Gaps = 22/250 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ K+Y + E RF++F N L A+ + V G+ +F DL P
Sbjct: 26 QWEAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R F G R A + P N LP DWR+ GAVT VK+QG CGSCW+
Sbjct: 86 EFARMFNGY--RGARTAGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG+LEG HFL TG LVSLSEQ LVDC E G + GC GGLM++AF+YI
Sbjct: 144 FSTTGSLEGQHFLKTGVLVSLSEQNLVDC-----SETFG--NHGCEGGLMDNAFQYIKAN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
GG++ EK YPY D G C+F K + A + F I ED + + GP++ +
Sbjct: 197 GGIDTEKSYPYEAED-GECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVS---VA 252
Query: 283 IELPHISFSF 292
I+ H SF
Sbjct: 253 IDASHSSFQL 262
>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
Length = 316
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 99/239 (41%), Positives = 142/239 (59%), Gaps = 13/239 (5%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 15 NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 73
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V DQG CGSCW+FSA
Sbjct: 74 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 132
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ IL GG+
Sbjct: 133 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 183
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GPL+ + ++ L
Sbjct: 184 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 241
>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
Length = 327
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 99/239 (41%), Positives = 142/239 (59%), Gaps = 13/239 (5%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 26 NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V DQG CGSCW+FSA
Sbjct: 85 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ IL GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 194
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GPL+ + ++ L
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 252
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 112/295 (37%), Positives = 152/295 (51%), Gaps = 43/295 (14%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
L L+L++V+ S AV+ D + Q +S FK + SK Y ++ E
Sbjct: 3 LFLILAAVVISCQAVSFYDLVQEQ-------------------WSSFKMQHSKNYDSETE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
+R ++F N + AK +L V G+ K++D+ EF G N+ +
Sbjct: 44 ERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILK 103
Query: 123 PADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+D A I P N LP DWRD GAVT VKDQG CGSCWSFSATG+LEG HF TG
Sbjct: 104 GSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTG 163
Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
+LVSLSEQ LVDC SG ++GCNGGLM++AF YI GG++ EK YPY D
Sbjct: 164 KLVSLSEQNLVDC--------SGRYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAED 215
Query: 239 GGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
C + A F I ++ED + A + GP++ +I+ H +F
Sbjct: 216 -EKCHYKAQNSGATDKGFVDIEEANEDDLKAAVATVGPVS---IAIDASHETFQL 266
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 92/245 (37%), Positives = 129/245 (52%), Gaps = 27/245 (11%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--------GVTKFSDLTPS 107
+K + K Y + E R +++AN R+ +D H G+ +F+DL S
Sbjct: 25 WKKEHGKVYNSDREELTRHIIWQAN------RKYVDEHNAHAEKFGFTVGMNQFADLESS 78
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EF R + G N + + K DLPT DWR G VT +K+QG CGSCW+FSA
Sbjct: 79 EFGRLYNGYNNKPSMKKAQSKVFSTKVGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAV 138
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
LEG HF +TG LVSLSEQ LVDC + + GCNGGLM++AF+Y++K GG++
Sbjct: 139 AGLEGQHFNATGTLVSLSEQNLVDCS-------TAEGNQGCNGGLMDNAFQYVIKNGGID 191
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI--SSDEDQMAANLVKHGPLAGNVASIEL 285
E YPY D CKF+ + + + S FS I E + + GP++ +I+
Sbjct: 192 TEASYPYKAVD-QKCKFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPIS---VAIDA 247
Query: 286 PHISF 290
H SF
Sbjct: 248 SHTSF 252
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 101/258 (39%), Positives = 135/258 (52%), Gaps = 20/258 (7%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---G 97
S +L E + FKS+ +K Y++ E RF++F N L AK V
Sbjct: 18 SSQEILRTE--WEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLA 75
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQ 155
+ KF DL P EF + G + P ND LPT DWR GAVT VK+Q
Sbjct: 76 MNKFGDLLPHEFAKMVNGYRGKQNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQ 135
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FS TG+LEG HF TG+LVSLSEQ LVDC + + GCNGGLM++
Sbjct: 136 GQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFG-------NQGCNGGLMDN 188
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
F+YI GG++ E+ +PYT D G CKF K+ + A + F + ED + + G
Sbjct: 189 GFQYIKANGGIDTEESHPYTAQD-GDCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVG 247
Query: 275 PLAGNVASIELPHISFSF 292
P++ +I+ H SF
Sbjct: 248 PVS---VAIDASHGSFQL 262
>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 330
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 99/261 (37%), Positives = 145/261 (55%), Gaps = 24/261 (9%)
Query: 39 GEQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV 95
G S + L + +H+ F+ + + + Y E D R+ FK NL + + V
Sbjct: 12 GIASANRLFSEQHYQNQFTNWMVRLDRAYDVFEFQD-RYNAFKNNLDLIHKWNSQGHSTV 70
Query: 96 HGVTKFSDLTPSEFRRQFLGLNRRL-RLPADAQKAPILPTNDL----PTDFDWRDHGAVT 150
GV +DL+ E+R +LG+ RLP Q+A + N + DWR GAV
Sbjct: 71 LGVNHLADLSNEEYRNLYLGVKVDASRLP---QQAASIKLNKVFAPVAASLDWRSSGAVG 127
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
VKDQG CGSCWSFS TG++EGA+ ++TG SLSEQQL+DC + E GCNG
Sbjct: 128 RVKDQGQCGSCWSFSTTGSIEGANQIATGNFASLSEQQLMDCSRDYGNE-------GCNG 180
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAAN 269
GLM++A +Y++ GG++ E+ YPYT +D +CKF+ + I A +S++ V E +AA
Sbjct: 181 GLMDAAMKYVIAQGGLDTEESYPYTMSDSYTCKFNPANIGAKISSYIDVQRGSETDLAAK 240
Query: 270 LVKHGPLAGNVASIELPHISF 290
L K GP++ +I+ H SF
Sbjct: 241 LNK-GPVS---VAIDASHSSF 257
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 100/282 (35%), Positives = 142/282 (50%), Gaps = 43/282 (15%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+ IL+SLL++ +S+ L + DG HF FK K
Sbjct: 1 MKSFILASLLVVAVSATL----------------LKEDGV-----------HFQSFKLKH 33
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGL 116
KTY Q E RF +F+ NLR+ + +H G+ KF+D+T +EF+ L
Sbjct: 34 GKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFK-AMLAT 92
Query: 117 NRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+ + A K L +P DWR VT +KDQ CGSCWSF+ G+ EGA+
Sbjct: 93 QVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYA 152
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
LSTG+L SEQQLVDC + + GC+GG ++ F YI + G+E E DYPYT
Sbjct: 153 LSTGKLTRFSEQQLVDC--------TTDLNYGCDGGYLDDTFPYI-QTNGLELESDYPYT 203
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G D GSC +D SK+ VS++ + ++E + + GP+A
Sbjct: 204 GYD-GSCSYDSSKVVTKVSSYVSVPANEQALLEAVGTAGPVA 244
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 91/248 (36%), Positives = 137/248 (55%), Gaps = 16/248 (6%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ + + +TY E + RF VF+ NLR
Sbjct: 31 IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LG+ R + + N DLP DWR GAV
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
VKDQG+CGSCW+FS A+EG + + TG+++SLSEQ+LVDCD S + GCN
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AFE+I+ GG++ E+DYPY GTDG K+ + ++ + ++ ++
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQK 259
Query: 270 LVKHGPLA 277
V + P++
Sbjct: 260 AVANQPIS 267
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 99/250 (39%), Positives = 136/250 (54%), Gaps = 20/250 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+KS +K Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 25 DEHWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G R+ + + + N L P DWRD+G VT VKDQG CGSCW+
Sbjct: 84 HEEFRQIMYGYKRKSE--RKFKGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 142 FSTTGAMEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
G++ E YPY GTD C +D +A + F + S E + + GP++ +
Sbjct: 195 QGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVS---VA 251
Query: 283 IELPHISFSF 292
I+ H SF F
Sbjct: 252 IDAGHESFQF 261
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 82/219 (37%), Positives = 125/219 (57%), Gaps = 8/219 (3%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
K K + E D RF +FK NLR + + G+TKF+DLT E+R +LG
Sbjct: 48 KHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRL 107
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ + + + + + +P DWR GAV VKDQG+CGSCW+FS GA+EG + + T
Sbjct: 108 KRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVT 167
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L++LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E+DYPY G D
Sbjct: 168 GDLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVD 219
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G + K+ + + + ++ ++ + H P++
Sbjct: 220 GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPIS 258
>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
Length = 327
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 99/239 (41%), Positives = 142/239 (59%), Gaps = 13/239 (5%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 26 NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V DQG CGSCW+FSA
Sbjct: 85 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ IL GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCD---------GVDEGCNGGTPQQAFKQILGMGGL 194
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GPL+ + ++ L
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 252
>gi|19747207|gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]
Length = 500
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 70 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 129
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 130 RYHNGAVHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 183
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 184 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 234
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 235 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 291
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 93/244 (38%), Positives = 128/244 (52%), Gaps = 18/244 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSE 108
+S +K+ K Y EE +R V+K N++ ++ H T F D+T E
Sbjct: 29 WSQWKATHGKLYGMDEE-GWRREVWKKNMKMIRQHNWEHSQGKHSFTVAMNGFGDMTNEE 87
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
F++ GL + +AP+ +P+ DWR+ G VT VKDQG CGSCW+FSATG
Sbjct: 88 FKQVMNGLQMQKHKKGKMFQAPLFAK--IPSSVDWREKGYVTPVKDQGPCGSCWAFSATG 145
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
ALEG F TG+LVSLSEQ LVDC + GCNGGLMN+AF+Y+ GG++
Sbjct: 146 ALEGQMFRKTGKLVSLSEQNLVDCSQ-------AEGNEGCNGGLMNNAFQYVKDNGGLDS 198
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
E+ YPY D SCK+ AA + F I E + + GP++ I+ H
Sbjct: 199 EESYPYHAQD-ESCKYKPQDSAANDTGFFDIPQQEKALMVAVATKGPIS---VGIDASHF 254
Query: 289 SFSF 292
+F F
Sbjct: 255 TFQF 258
>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
castaneum]
Length = 1726
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 106/271 (39%), Positives = 149/271 (54%), Gaps = 29/271 (10%)
Query: 44 DHLL---NAEHHFSLFKS--KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
D+LL + E+H SLF K ++E+ YRF VF NL + + + TA +G
Sbjct: 1408 DNLLGCDDREYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYG 1467
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVK 153
+T+F+D+T EF R LGL LR + + P +P +LP +FDWR VT VK
Sbjct: 1468 ITRFADMTQKEFSRS-LGLRTDLR---NENETPFAQAKIPNIELPKEFDWRKKNVVTEVK 1523
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+Q CGSCW+FS TG +EG + L G+L+ SEQ+LVDCD + D GCNGGLM
Sbjct: 1524 NQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD---------DQGCNGGLM 1574
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
++A+ I K GG+E E+DYPY D C F+++ V+ IS +E MA LV +
Sbjct: 1575 DTAYRSIEKIGGLETEQDYPYDAED-EKCHFNRTLARVQVTGALNISHNETDMAKWLVAN 1633
Query: 274 GPLA----GNVASIELPHISFSFLFTVSSPK 300
GP++ N + +S F F + SPK
Sbjct: 1634 GPISIAINANAMQFYMGGVSHPFKF-LCSPK 1663
>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
Length = 615
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 137/242 (56%), Gaps = 15/242 (6%)
Query: 40 EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGV 98
+ S L +H F F+ +F + Y + E R R+F+ NL+ + + +A +G+
Sbjct: 296 KHSHRGLDKVDHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGI 355
Query: 99 TKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQG 156
T+F+DLT SE++ + GL +R A A ++P +LP +FDWR AVT VK+QG
Sbjct: 356 TEFADLTSSEYKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKNAVTPVKNQG 414
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
+CGSCW+FS TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A
Sbjct: 415 SCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNA 465
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGP 275
++ I GG+E E +YPY C F+++ V+ F + +E M L+ GP
Sbjct: 466 YKAIKDIGGLEYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTKGP 524
Query: 276 LA 277
++
Sbjct: 525 IS 526
>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
Length = 1761
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 106/271 (39%), Positives = 149/271 (54%), Gaps = 29/271 (10%)
Query: 44 DHLL---NAEHHFSLFKS--KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
D+LL + E+H SLF K ++E+ YRF VF NL + + + TA +G
Sbjct: 1443 DNLLGCDDREYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYG 1502
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVK 153
+T+F+D+T EF R LGL LR + + P +P +LP +FDWR VT VK
Sbjct: 1503 ITRFADMTQKEFSRS-LGLRTDLR---NENETPFAQAKIPNIELPKEFDWRKKNVVTEVK 1558
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+Q CGSCW+FS TG +EG + L G+L+ SEQ+LVDCD + D GCNGGLM
Sbjct: 1559 NQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD---------DQGCNGGLM 1609
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
++A+ I K GG+E E+DYPY D C F+++ V+ IS +E MA LV +
Sbjct: 1610 DTAYRSIEKIGGLETEQDYPYDAED-EKCHFNRTLARVQVTGALNISHNETDMAKWLVAN 1668
Query: 274 GPLA----GNVASIELPHISFSFLFTVSSPK 300
GP++ N + +S F F + SPK
Sbjct: 1669 GPISIAINANAMQFYMGGVSHPFKF-LCSPK 1698
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 90/248 (36%), Positives = 137/248 (55%), Gaps = 16/248 (6%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ + + +TY E + RF VF+ NLR
Sbjct: 31 IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LG+ R + + N DLP DWR GAV
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG+CGSCW+FS A+EG + + TG+++SLSEQ+LVDCD S + GCN
Sbjct: 148 AEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AFE+I+ GG++ E+DYPY GTDG K+ + ++ + ++ ++
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQK 259
Query: 270 LVKHGPLA 277
V + P++
Sbjct: 260 AVANQPIS 267
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 100/275 (36%), Positives = 148/275 (53%), Gaps = 29/275 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+S + A ++ + + +TY E + R++VF+ NLR
Sbjct: 26 IVSYGERSXEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 82
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG R +L A A DLP DWR
Sbjct: 83 VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAAD---NEDLPESVDWRAK 139
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG+CGSCW+FS A+EG + + TG+L+SLSEQ+LVDCD S +
Sbjct: 140 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 191
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ EKDYPY GTDG K+ + ++ + +++++
Sbjct: 192 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 251
Query: 267 AANLVKHGPLAGNVASIELPHISF----SFLFTVS 297
V + P++ +IE +F S +FT S
Sbjct: 252 LQKAVANQPVS---VAIEAAGTAFQLYSSGIFTGS 283
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/261 (37%), Positives = 138/261 (52%), Gaps = 22/261 (8%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAV---HG 97
S +L AE +S FK+K K+Y ++ E +R +++ N + AK + V
Sbjct: 18 SYQEVLGAE--WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMA 75
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVK 153
+ +F D+ EF G R + + P N LP DWR GAVT VK
Sbjct: 76 MNEFGDMLHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVK 135
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+QG CGSCW+FSATG+LEG HF +G +VSLSEQ LVDC + ++GC GGLM
Sbjct: 136 NQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFG-------NNGCEGGLM 188
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
++AF+YI G++ EK YPY GTD G+C F KS + A S F + E Q+ +
Sbjct: 189 DNAFKYIRANKGIDTEKSYPYNGTD-GTCHFKKSTVGATDSGFVDIKEGSETQLKKAVAT 247
Query: 273 HGPLAGNVASIELPHISFSFL 293
GP++ +I+ H SF F
Sbjct: 248 VGPIS---VAIDASHESFQFY 265
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 95/231 (41%), Positives = 128/231 (55%), Gaps = 14/231 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+ FK+KF ++Y +EE R VF N++ T GV +F+DLT EF +
Sbjct: 18 QWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEEFSK 77
Query: 112 QFLGLNRRLRLPADAQKAP--ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
++G + + DA + LPT DW GAVT VK+QG CGSCWSFS TG+
Sbjct: 78 TYMGFKKPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTTGS 137
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LEGA+ +STG+LVSLSEQQ VDC + GCNGGLM+SAF+Y +A + E
Sbjct: 138 LEGANEISTGKLVSLSEQQFVDCAGTYG-------NQGCNGGLMDSAFKYA-EANALCTE 189
Query: 230 KDYPYTGTDGGSCKFDKSKIAAA---VSNFSVISSDEDQMAANLVKHGPLA 277
+ YPY GTD GSC+ A VS + +SSD +Q + V P++
Sbjct: 190 QSYPYKGTD-GSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVS 239
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 136/251 (54%), Gaps = 18/251 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA----VHGVTKF 101
+L+AE + FK + +K Y EE R +F N + K L T GV +F
Sbjct: 34 VLDAEVAWHKFKLEHNKVYVGIEEESLRKTIFATNYKFIKDHNALHATGEKSFTVGVNEF 93
Query: 102 SDLTPSEFRRQFLGLN-RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
+D+T EF + GL R+ +P + LP + DWR G V+ VK+QG+CGS
Sbjct: 94 ADMTVHEFAQMMNGLKPDSTRVSGSTYLSPNIDA-PLPVEVDWRTKGLVSEVKNQGSCGS 152
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG+LEG H TG +V LSEQ LVDC + + GCNGGLM +AF+YI
Sbjct: 153 CWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDC-------STSYGNDGCNGGLMTNAFKYI 205
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGN 279
G++ E+ YPY G D G CKF K+K+ A V+ F I + +E ++ L GP++
Sbjct: 206 KDNKGIDTEEAYPYAGRD-GDCKFKKNKVGATVTGFVEIPAGNEKKLQEALATVGPVS-- 262
Query: 280 VASIELPHISF 290
+I+ H SF
Sbjct: 263 -VAIDANHQSF 272
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 96/244 (39%), Positives = 138/244 (56%), Gaps = 19/244 (7%)
Query: 56 FKSKFSKTYA-TQEEH-DYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
+ S+ + YA QE+H + RF VFK N+ R + T + +F+DLT EFR +
Sbjct: 40 WMSQHGRVYADEQEDHKNKRFNVFKENVERIEEFND-GKTFKLAINQFADLTNEEFRASY 98
Query: 114 LGLNRRLRLPADAQK-APILPTN---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
G + L + K P N LP DWR GAVT VK+QG CG CW+FSA A
Sbjct: 99 NGFKGPMVLSSQITKPTPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAA 158
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG +STG+L+SLSEQ+LVDCD + D GC GGLM++AFE+I+ GG+ E
Sbjct: 159 IEGITQISTGKLISLSEQELVDCD-------TKGIDHGCEGGLMDTAFEFIINNGGLTTE 211
Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
+YPY G D G+C F+K+ IA +++ + + ++++Q V H P++ +IE
Sbjct: 212 SNYPYKGED-GTCNFNKTNPIAVSITGYEDVPANDEQALMKAVAHQPVS---VAIEAGGS 267
Query: 289 SFSF 292
F F
Sbjct: 268 DFQF 271
>gi|118157|sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName:
Full=Major cysteine proteinase; Flags: Precursor
gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi]
gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi]
Length = 467
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 12/235 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + + KTY ++EE R ++FK N + L+ + T + F+DLT EF+
Sbjct: 32 FDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91
Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LGL+ A K L N +P DWR GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 92 SRLGLSVSASSLIMASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 151
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TG+L+SLSEQ+L+DCD S ++GCNGGLM+ AFE+++K G++ EK
Sbjct: 152 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
DYPY D G+CK DK K + +++ + S++++ V P++ + E
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSE 257
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 98/276 (35%), Positives = 147/276 (53%), Gaps = 23/276 (8%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQS----EDHLLNAEHHFSLFKSKFSKTY 64
+LL +S L+SA D I S G +S +D ++ + + K K Y
Sbjct: 2 FMLLFFASTLSSA-----SDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLV---KHGKAY 53
Query: 65 ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---NRRLR 121
+ E + RF VFK NLR + T G+ +F+DLT E+R +LG RR +
Sbjct: 54 NSLGEKERRFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSMYLGALSGIRRNK 113
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
L + + + LP DWR GAV GVKDQG+CGSCW+FSA A+EG + + TG+L
Sbjct: 114 LRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDL 173
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
+SLSEQ+LVDCD+ S + GCNGGLM+ FE+I+ GG++ E+DYPY DG
Sbjct: 174 ISLSEQELVDCDN--------SYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRC 225
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ K+ ++ ++ + + + V + P++
Sbjct: 226 DTYRKNARVVSIDSYEDVPVNNEAALQKAVANQPVS 261
>gi|71663163|ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAVHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/250 (37%), Positives = 136/250 (54%), Gaps = 14/250 (5%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L+ + + +K KTY T EE D R ++ NL K+ + + + F+DLT
Sbjct: 21 LSQDRQWHAWKDFHGKTY-TGEEEDLRRAIWNDNLEIVKKHNAENHSYKLDMNHFADLTV 79
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+EF+++F+G + P L LP + DWRD G VT VK+QG CGSCW+FS+
Sbjct: 80 TEFKQRFMGYRAASNSTGGSTFLP-LSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSS 138
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG+LEG HF TG+LVSLSEQ LVDC + ++GC GGLM+ AF+YI G+
Sbjct: 139 TGSLEGQHFRKTGKLVSLSEQNLVDCSKKYG-------NNGCEGGLMDYAFKYIKNNDGI 191
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIEL 285
+ E+ YPYT D G C F + A V+ ++ V E + + + GP++ +I+
Sbjct: 192 DTEQSYPYTARD-GQCHFKPGSVGATVTGYTDVQRGSEGDLQSAVATVGPIS---VAIDA 247
Query: 286 PHISFSFLFT 295
H SF T
Sbjct: 248 GHSSFQLYKT 257
>gi|11464864|gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 12/235 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + K KTY ++EE R ++FK N + L+ + T + F+DLT EF+
Sbjct: 30 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 89
Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LGL+ A K L + +P DWR GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 90 SRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 149
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TG+L+SLSEQ+L+DCD S ++GCNGGLM+ AFE+++K G++ EK
Sbjct: 150 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 201
Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
DYPY D G+CK DK K + +++ + S++++ V P++ + E
Sbjct: 202 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 255
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/256 (34%), Positives = 143/256 (55%), Gaps = 17/256 (6%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVHGVTK 100
S D ++ A + L K K+Y E + RF++FK N L ++ D + G+ +
Sbjct: 35 STDDVIMAAYESWLVK--HGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNR 92
Query: 101 FSDLTPSEFRRQFLGL---NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
F+DLT E+R ++ G+ + R ++ +Q+ L LP DWR+HGAV VKDQG
Sbjct: 93 FADLTNEEYRSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQ 152
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS A+EG + ++TG+L++LSEQ+LVDCD S + GCNGGLM+ AF
Sbjct: 153 CGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDR--------SYNEGCNGGLMDDAF 204
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++I+ GG++ + DYPYTG DG ++ K+ + ++ + +++ + P++
Sbjct: 205 QFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPIS 264
Query: 278 GNVASIELPHISFSFL 293
+IE F F
Sbjct: 265 ---VAIEASGRDFQFY 277
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 101/248 (40%), Positives = 128/248 (51%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
++ EK YPY D G C+F K + A + + I + ED + + GP++ +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPIS---VAID 253
Query: 285 LPHISFSF 292
H SF
Sbjct: 254 ASHSSFQL 261
>gi|11464866|gb|AAG35358.1|AF314930_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVGLPQDEAQIAAWLAVNGPVA 258
>gi|290999038|ref|XP_002682087.1| predicted protein [Naegleria gruberi]
gi|284095713|gb|EFC49343.1| predicted protein [Naegleria gruberi]
Length = 349
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/276 (35%), Positives = 136/276 (49%), Gaps = 56/276 (20%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-------- 90
G E LN +F FK + K YAT+EEH R+++F N+ + ++
Sbjct: 4 GAYDEKEALN---YFQHFKKLYLKRYATEEEHHRRWKIFYDNINLVNQLNIMHKPNEIAG 60
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK----APILP-----TNDLPTDF 141
P A +G+T+F D++P+EF R L LP QK P P + LP F
Sbjct: 61 KPVAQYGITQFMDMSPNEFARVKL-------LPPTKQKDINHTPTAPKEKYQIDALPESF 113
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR+HGAVT VKDQ +CGSCW+FS +EGA+FL+ L S QQLVDCD
Sbjct: 114 DWREHGAVTAVKDQASCGSCWAFSTVENIEGAYFLAGHNLTKFSPQQLVDCD-------- 165
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG--------------------S 241
+ + GC GG A +YI K GG+ E YPY G +
Sbjct: 166 -NLNCGCFGGFPFIAMQYIQKRGGLATESSYPYCIPPLGNCFPCNTNKTYCPSGEYCNRT 224
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
C ++ A V+ + +S +ED +AA LVK+GPL+
Sbjct: 225 CSVQNYQLVAKVAGYENVSQNEDDIAAYLVKNGPLS 260
>gi|118350314|ref|XP_001008438.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89290205|gb|EAR88193.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 389
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 152/284 (53%), Gaps = 34/284 (11%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSD 103
+L + FS FK++ K Y EE RF +F+ NL ++ Q+ + TA +G+T+FSD
Sbjct: 32 NLTQVKQLFSKFKAEHKKFYNFLEEQR-RFEIFRQNLDIISELNQVEEGTAEYGITQFSD 90
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+T EF+ Q L + R ++ + D PT +DWRDHGAVT VK+QG G+CW
Sbjct: 91 MTTEEFKSQILIPSTYARNFTGSRYHGFQKISQDAPTSYDWRDHGAVTPVKNQGTVGTCW 150
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG FL+ LVSLSE+Q+VDCD +P +G D G GG AF+Y++
Sbjct: 151 TFSTTGNIEGQWFLAGNPLVSLSEEQIVDCDGSQEP-STGHADCGVFGGWPYLAFDYVIN 209
Query: 223 AGGVEREKDYPYTGTDGGS--------------------------CKFDKSKIAAAVSNF 256
AGG+ E+ YPY +GG C+ + IAA + ++
Sbjct: 210 AGGLPSEETYPYCVGNGGCYPCPAPGYNETLCGPAVPYCNATAYPCRQGQVPIAAKIEDW 269
Query: 257 SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSPK 300
+S DED + L + GPL+ +++ ++ F + +S+PK
Sbjct: 270 KALSKDEDSIKQQLFEIGPLS---VALDASYLQF-YKKGISAPK 309
>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
Length = 359
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 105/279 (37%), Positives = 149/279 (53%), Gaps = 20/279 (7%)
Query: 3 RLILSSLLLLLLSSV-LASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLF 56
R IL S+ LL+L +V A ++ + + IR V + E+S +L H F+ F
Sbjct: 5 RTILPSVALLILIAVSTAESIGFYESNP-IRMVFDRLLEVEESVVQILGQTRHVLSFARF 63
Query: 57 KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
++ K Y EE RF +FK NL + + GV +F+D+T EF+R LG
Sbjct: 64 THRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFTDMTWQEFQRTKLGA 123
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
+ A + L LP DWR+ G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 124 AQNC--SATLKGTHKLTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQ 181
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
+ G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPYTG
Sbjct: 182 AFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 234
Query: 237 TDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
D G+CK+ + V N ++ + DE + A L++
Sbjct: 235 ED-GTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLLR 272
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 98/248 (39%), Positives = 135/248 (54%), Gaps = 24/248 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDP--TAVHGVTKFSDLTPSE 108
F FK ++ + YAT +E YR V+ N+ A Q + T + + +F D+T E
Sbjct: 22 FHQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEE 81
Query: 109 FRRQFLGLNRRLRLPA-DAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
GL LPA +++ +L D LP + DWR GAVT VKDQ ACGSCW+FS
Sbjct: 82 INAVMNGL-----LPASESRGVAVLGGRDDTLPAEVDWRTKGAVTPVKDQKACGSCWAFS 136
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL G+LVSLSEQ LVDC + D GC GGLM+ AF YI GG
Sbjct: 137 ATGSLEGQHFLKDGKLVSLSEQNLVDC-------STKQGDHGCGGGLMDFAFTYIKDNGG 189
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAGNVASIE 284
++ E YPY TD G C+++ + A V+ + + D ED + + GP++ +I+
Sbjct: 190 IDTEASYPYEATD-GKCQYNPANSGATVTGYVDVEHDSEDALQKAVATIGPIS---VAID 245
Query: 285 LPHISFSF 292
+F F
Sbjct: 246 ASRSTFHF 253
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 101/248 (40%), Positives = 129/248 (52%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI + G
Sbjct: 145 ATGSLEGRHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKENDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
++ EK YPY D G C+F K + A + + I + ED + + GP++ +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPIS---VAID 253
Query: 285 LPHISFSF 292
H SF
Sbjct: 254 ASHSSFQL 261
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 105/292 (35%), Positives = 158/292 (54%), Gaps = 23/292 (7%)
Query: 11 LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEH 70
L L ++ L+ +VA + D +++ P D E S D L+ F + S F K Y T EE
Sbjct: 14 LALSAATLSLSVAASHDYSIV-GYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEK 68
Query: 71 DYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
RF VFK NL+ + G+ +F+DL+ EF++ +LGL + + +
Sbjct: 69 LLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYA 128
Query: 131 ILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
D+ P DWR GAV VK+QG+CGSCW+FS A+EG + + TG L +LSEQ
Sbjct: 129 EFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQ 188
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--D 245
+L+DCD + ++GCNGGLM+ AFEYI+K GG+ +E+DYPY+ + G+C+ D
Sbjct: 189 ELIDCDT--------TYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKD 239
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVS 297
+S+ + V ++DE + L H PL+ +I+ F F VS
Sbjct: 240 ESETVTIDGHQDVPTNDEKSLLKALA-HQPLS---VAIDASGREFQFYSGVS 287
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 156/278 (56%), Gaps = 26/278 (9%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTY 64
S+LL+L+ S L+SA D ++I +++ H + +L++S + K+Y
Sbjct: 11 SILLMLIFSTLSSA----SDMSIISY------DETHIHRRTDDEVSALYESWLIEHGKSY 60
Query: 65 ATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RL 120
E D RF++FK NLR ++ + + + G+TKF+DLT E+R +LG R
Sbjct: 61 NALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRK 120
Query: 121 RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+L + + D LP DWR+ G + GVKDQG+CGSCW+FSA A+E + + TG
Sbjct: 121 KLSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTG 180
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L+SLSEQ+LVDCD S + GC+GGLM+ AFE+++K GG++ E+DYPY +G
Sbjct: 181 NLISLSEQELVDCDR--------SYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNG 232
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++ K+ + ++ + + ++ V H P++
Sbjct: 233 VCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVS 270
>gi|71406896|ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 426
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 12/235 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + K KTY ++EE R ++FK N + L+ + T + F+DLT EF+
Sbjct: 32 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91
Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LGL+ A K L + +P DWR GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 92 SRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 151
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TG+L+SLSEQ+L+DCD S ++GCNGGLM+ AFE+++K G++ EK
Sbjct: 152 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
DYPY D G+CK DK K + +++ + S++++ V P++ + E
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 12/235 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + K KTY ++EE R ++FK N + L+ + T + F+DLT EF+
Sbjct: 32 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91
Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LGL+ A K L + +P DWR GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 92 SRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 151
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TG+L+SLSEQ+L+DCD S ++GCNGGLM+ AFE+++K G++ EK
Sbjct: 152 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
DYPY D G+CK DK K + +++ + S++++ V P++ + E
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 100/253 (39%), Positives = 138/253 (54%), Gaps = 21/253 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLT 105
+ + FK +K Y ++ E +R ++F N AK +L V G+ K++D+
Sbjct: 24 QEQWGAFKMTHNKQYQSETEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83
Query: 106 PSEFRRQFLGLNRR---LRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGS 160
EF + G NR LR LP + LP DWRD GAVT VKDQG CGS
Sbjct: 84 HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CWSFSATG+LEG HF +G+LVSLSEQ LVDC E+ G ++GCNGGLM++AF YI
Sbjct: 144 CWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRYI 196
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
GG++ E+ YPY D C + K+K A + S +ED++ + + GP++
Sbjct: 197 KANGGIDTEQAYPYKAED-EKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVS-- 253
Query: 280 VASIELPHISFSF 292
+I+ H SF
Sbjct: 254 -VAIDASHQSFQL 265
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 95/284 (33%), Positives = 150/284 (52%), Gaps = 22/284 (7%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTYATQE 68
L+LS+ L A+ D +++ S +HL + + LF+S K SKTY + E
Sbjct: 11 LILSATLFITYAIAHDFSIVGY--------SPEHLASMDKTIELFESWMSKHSKTYRSIE 62
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
E +RF +F NL+ + G+ +F+DL+ EF+ ++LGL ++
Sbjct: 63 EKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRSSRG 122
Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
DLP DWR GAVT VK+QG+CGSCW+FS A+EG + + TG L SLSEQ+
Sbjct: 123 FSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 182
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
L+DCD S ++GC GGLM+ AF+YI+ G+ +E+DYPY +G + +
Sbjct: 183 LIDCDR--------SFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQF 234
Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+S + + ++++Q + H P++ +IE +F F
Sbjct: 235 EVVTISGYEDVPANDEQSLLKALSHQPVS---VAIEASSRNFQF 275
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 99/248 (39%), Positives = 138/248 (55%), Gaps = 22/248 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F +K+ +YAT E R +++ANL ++ + V KF+DLT EF +
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81
Query: 113 FLGLNRRLRLPA-DAQKA----PILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+LGL R A +A K+ LP LP DWR G VT +KDQG CGSCWSFS
Sbjct: 82 YLGL----RFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFST 137
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG++EG H TG+LVSLSEQ LVDC S ++GCNGGLM+ AF+YI+ G+
Sbjct: 138 TGSVEGQHARKTGQLVSLSEQNLVDC-------SSAQGNAGCNGGLMDQAFQYIISNNGI 190
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
+ E YPYT D G+C+F+ + + A V+++ I+S + N V GP++ +I+
Sbjct: 191 DTESSYPYTAQD-GTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPIS---VAIDA 246
Query: 286 PHISFSFL 293
SF F
Sbjct: 247 SQPSFQFY 254
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 99/286 (34%), Positives = 147/286 (51%), Gaps = 25/286 (8%)
Query: 1 MERLILSSLLLLL----LSSVLASAVAVNDDDAMIRQVVP-SDGEQSEDHLLNAEHHFSL 55
M L LS ++LLL +S + ++ D++ I V SD E E +
Sbjct: 1 MGFLKLSPMILLLAMIGVSYAIDMSIISYDENHHISTVSSRSDAE--------VERIYEA 52
Query: 56 FKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+ + K Q E D RF +FK NLR + + G+T+F+DLT E+R
Sbjct: 53 WMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTKNLSYKLGLTRFADLTNDEYRS 112
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+LG R+ + + + LP DWR GAV VKDQG+CGSCW+FS GA+E
Sbjct: 113 MYLGAKPVKRVLKTSDRYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVE 172
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ E D
Sbjct: 173 GINKIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEAD 224
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY DG + K+ + ++ + + + + H P++
Sbjct: 225 YPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPIS 270
>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
Length = 615
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 135/232 (58%), Gaps = 15/232 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSE 108
+H F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+D+T SE
Sbjct: 306 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 365
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
++ + GL +R A A ++P +LP +FDWR AVT VK+QG+CGSCW+FS
Sbjct: 366 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 424
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I GG+
Sbjct: 425 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 475
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
E E +YPY C F+++ V+ F + +E M L+ +GP++
Sbjct: 476 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPIS 526
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 99/250 (39%), Positives = 136/250 (54%), Gaps = 20/250 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+KS +K Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 25 DEHWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G R+ + + + N L P DWRD+G VT VKDQG CGSCW+
Sbjct: 84 HEEFRQIMNGYKRKSE--RKFKGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 142 FSTTGAMEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
G++ E YPY GTD C +D +A + F + S E + + GP++ +
Sbjct: 195 QGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVS---VA 251
Query: 283 IELPHISFSF 292
I+ H SF F
Sbjct: 252 IDAGHESFQF 261
>gi|8468607|gb|AAF75547.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 121/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EF
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFWS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ I DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVEIPQDEAQIAAWLAVNGPVA 258
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/296 (37%), Positives = 155/296 (52%), Gaps = 44/296 (14%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L+LL + +A+A AV+ + ++V + ++ FK + K Y ++ E
Sbjct: 3 ILILLMAFVAAANAVS-----LYELVKEE--------------WNAFKLQHRKNYDSETE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
R +++ N + AK Q D V K++DL EF + G NR + L
Sbjct: 44 ERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGFNRTDSKKSL 103
Query: 123 PADAQKAPIL---PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ P+ P N ++PT DWR GAVT VKDQG CGSCWSFSATGALEG HF T
Sbjct: 104 KGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKT 163
Query: 179 GELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G+LVSLSEQ LVDC SG ++GCNGG+M+ AF+YI GG++ EK YPY
Sbjct: 164 GKLVSLSEQNLVDC--------SGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAI 215
Query: 238 DGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
D +C F+ + A + I DE+ + L GP++ +I+ H SF F
Sbjct: 216 D-DTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVS---IAIDASHESFQF 267
>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
Precursor
gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
Length = 614
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 135/232 (58%), Gaps = 15/232 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSE 108
+H F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+D+T SE
Sbjct: 305 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 364
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
++ + GL +R A A ++P +LP +FDWR AVT VK+QG+CGSCW+FS
Sbjct: 365 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 423
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I GG+
Sbjct: 424 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 474
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
E E +YPY C F+++ V+ F + +E M L+ +GP++
Sbjct: 475 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPIS 525
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/248 (39%), Positives = 138/248 (55%), Gaps = 20/248 (8%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M I S L LL+ SVL ++++ V ++ ++E A + + +
Sbjct: 1 MATSIKSITLALLIFSVLLISLSLG-------SVTATETTRNEAE---ARRMYERWLVEN 50
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFL-GLNR 118
K Y E + RF +FK NL+ + + + T G+T+F+DLT EFR +L
Sbjct: 51 RKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKME 110
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R R+P +K + LP DWR GAV VKDQG+CGSCW+FSA GA+EG + + T
Sbjct: 111 RTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKT 170
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GEL+SLSEQ+LVDCD S + GC GGLM+ AF++I++ GG++ E+DYPY TD
Sbjct: 171 GELISLSEQELVDCDT--------SYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATD 222
Query: 239 GGSCKFDK 246
C DK
Sbjct: 223 VNVCNSDK 230
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/261 (41%), Positives = 142/261 (54%), Gaps = 30/261 (11%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRR---QLLDPTAVHGVTKF 101
L+N E + FK + K Y + E R +++ N L+ A+ +L T + K+
Sbjct: 23 LVNQE--WINFKMEHKKCYKHEAEERLRMKIYMKNKLQIAQHNCDYELKKVTYRLKINKY 80
Query: 102 SDLTPSEFRRQFLGLNRRL-------RLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVK 153
D+ EF+ G NR + RLP A A I P N +LP DWR GAVT VK
Sbjct: 81 GDMLNHEFKNMLNGYNRTINHTLRNERLPVGA--AFIEPCNVELPKMVDWRKCGAVTEVK 138
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGL 212
DQG CGSCW+FSATG+LEG HF TG LVSLSEQ L+DC SGS ++GCNGGL
Sbjct: 139 DQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDC--------SGSYGNNGCNGGL 190
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLV 271
M+ AF YI G++ EK YPY G D C++DK A+ F I DE ++ A +
Sbjct: 191 MDQAFSYIKDNKGLDTEKTYPYEGED-DKCRYDKRSSGASDVGFVDIPVGDEQKLKAAVA 249
Query: 272 KHGPLAGNVASIELPHISFSF 292
GP++ +I+ H SF F
Sbjct: 250 TVGPVS---VAIDASHQSFQF 267
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/287 (33%), Positives = 155/287 (54%), Gaps = 18/287 (6%)
Query: 7 SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
S L L S L +++AV D +++ S+ +S D L+ F + S+ K Y +
Sbjct: 6 SKALFLACSFCLFASLAVAGDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYQS 60
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA 126
EE +RF +FK NL+ R + G+ +F+DL+ EF+ ++LGL ++
Sbjct: 61 IEEKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRES 120
Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
+ +LP DWR GAVT VK+QG+CGSCW+FS A+EG + + TG L SLSE
Sbjct: 121 PEEFTYKDFELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSE 180
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
Q+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E+DYPY + G+C+ K
Sbjct: 181 QELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGTCEMTK 231
Query: 247 SKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+ +S + + + +Q + + PL+ +IE F F
Sbjct: 232 EETEVVTISGYHDVPQNNEQSLLKALVNQPLS---VAIEASGRDFQF 275
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/201 (43%), Positives = 124/201 (61%), Gaps = 18/201 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFR 110
++ + K K+Y E + RF++FK NLR DP + G+ +F+DLT E+R
Sbjct: 49 YNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNA-DPDRSYELGLNRFADLTNEEYR 107
Query: 111 RQFLGLNRRLRLPADAQK-----APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
++LG R P ++ AP+ +LP DWR+ GAV VKDQG+CGSCW+FS
Sbjct: 108 AKYLGTKSRESRPKLSKGPSDRYAPV-EGEELPDSIDWREKGAVAAVKDQGSCGSCWAFS 166
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
A GA+EG + ++TGEL++LSEQ+LVDCD S + GC GGLM+ AF +I+K GG
Sbjct: 167 AIGAVEGINQITTGELITLSEQELVDCDR--------SYNEGCEGGLMDYAFNFIIKNGG 218
Query: 226 VEREKDYPYTGTDGGSCKFDK 246
++ + DYPYTG D G+C +K
Sbjct: 219 IDSDLDYPYTGRD-GTCNQNK 238
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 102/253 (40%), Positives = 137/253 (54%), Gaps = 25/253 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
++ FK + K Y ++ E R +++ N + AK Q D V K++DL E
Sbjct: 27 WNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEE 86
Query: 109 FRRQFLGLNR---RLRLPADAQKAPIL---PTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
F + G NR + L + P+ P N ++PT DWR GAVT VKDQG CGSC
Sbjct: 87 FVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSC 146
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYI 220
WSFSATGALEG HF TG+LVSLSEQ LVDC SG ++GCNGG+M+ AF+YI
Sbjct: 147 WSFSATGALEGQHFRKTGKLVSLSEQNLVDC--------SGKYGNNGCNGGMMDYAFQYI 198
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGN 279
GG++ EK YPY D +C F+ + A + I DE+ + L GP++
Sbjct: 199 KDNGGIDTEKSYPYEAID-DTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVS-- 255
Query: 280 VASIELPHISFSF 292
+I+ H SF F
Sbjct: 256 -IAIDASHESFQF 267
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/186 (44%), Positives = 115/186 (61%), Gaps = 11/186 (5%)
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
EE D RF +FK NLR + + G+T+F+DLT E+R +LG + R+ +
Sbjct: 68 EEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNEEYRSIYLGAKSKKRVLKTSD 127
Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + +P DWR GAV VKDQG+CGSCW+FS GA+EG + + TG+L+SLSEQ
Sbjct: 128 RYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQ 187
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
+LVDCD S + GCNGGLM+ AFE+I+K GG++ E+DYPY DG + D++
Sbjct: 188 ELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADG---RCDQT 236
Query: 248 KIAAAV 253
+ A V
Sbjct: 237 RKNAKV 242
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/288 (39%), Positives = 148/288 (51%), Gaps = 20/288 (6%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
RL S LLL+LS +A +V DD IR V + E +L H F+ F
Sbjct: 4 RLFFVSSLLLVLSCAVAGSVF--DDSNPIRMVSDRLRELELEVVRVLGQVPHALRFARFA 61
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y T EE RF +F +L K + GV +F+D T EFR+ LG
Sbjct: 62 HRYGKKYETAEEMKLRFGIFLESLELIKSTNKQGLSYKLGVNQFADWTWEEFRKHRLGAA 121
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + + L LP DWR G V+ VKDQG CGSCW+FS TGALE A+ +
Sbjct: 122 QNC--SATTKGSHKLTDTALPESKDWRKDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQA 179
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G+ +SLSEQQLVDC G + GCNGGL + AFEYI GG++ E+ YPYTG
Sbjct: 180 HGKGISLSEQQLVDCGR-------GFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGV 232
Query: 238 DGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAGNVAS 282
D GSCKF + V N ++ + DE + A V+ +A V S
Sbjct: 233 D-GSCKFVPENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVS 279
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 101/248 (40%), Positives = 128/248 (51%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G +R R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HRGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
++ EK YPY D G C+F K + A + + I + E + + GP++ +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253
Query: 285 LPHISFSF 292
H SF
Sbjct: 254 ASHSSFQL 261
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 98/256 (38%), Positives = 133/256 (51%), Gaps = 17/256 (6%)
Query: 29 AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRR 87
A++ +V + + +L E + FKS KTY + E RF++F N L AK
Sbjct: 5 ALLCAIVAAATAATSQEILRTE--WEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHN 62
Query: 88 QLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFD 142
V G+ +F+DL P EF + G + + P ND LP D
Sbjct: 63 VKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQGKRLAGRGSTYLPPANLNDSSLPKTVD 122
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR GAVT VKDQG CGSCW+FS+TG+LEG HFL TG+LVSLSEQ LVDC S
Sbjct: 123 WRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDC-------SSA 175
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISS 261
+ GCNGGLM+++F YI GG++ E YPY D G C++ K + A + F +
Sbjct: 176 YGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEAED-GDCRYKKEDVGATDTGFVDIKEG 234
Query: 262 DEDQMAANLVKHGPLA 277
E + + GP++
Sbjct: 235 SEKDLQKAVATVGPVS 250
>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
Length = 344
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 126/234 (53%), Gaps = 16/234 (6%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K+Y T EE R+ +F AN+ ++ V G+ F+D+T E+R +LG
Sbjct: 39 KSY-TSEEFGARYNIFTANMDYVQQWNSKGSETVLGLNNFADITNEEYRNTYLGTKFDAS 97
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
Q+ + TN DWR GAVT VK+QG CG CWSFS TG+ EGAHF S GEL
Sbjct: 98 SLIGTQEEKV-HTNSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGEL 156
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQ L+DC E +SGC+GGLM AFEYI+ G++ E YPY + G
Sbjct: 157 VSLSEQNLIDCSTE---------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKA-ENGK 206
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
C++ A +S++ +++ + + V P++ +I+ H SF L+T
Sbjct: 207 CEYKSENSGATLSSYKTVTAGSESSLESAVNVNPVS---VAIDASHQSFQ-LYT 256
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/209 (39%), Positives = 119/209 (56%), Gaps = 8/209 (3%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
E D RF +FK NLR + + G+T+F+DLT E+R +LG R+ + +
Sbjct: 70 EKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLGAKPTKRVLKTSDR 129
Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
+ LP DWR GAV VKDQG+CGSCW+FS GA+EG + + TG+L+SLSEQ+
Sbjct: 130 YQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQE 189
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
LVDCD S + GCNGGLM+ AFE+I+K GG++ E DYPY DG + K+
Sbjct: 190 LVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQNRKNA 241
Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ ++ + + + + H P++
Sbjct: 242 KVVTIDSYEDVPENSEASLKKALAHQPIS 270
>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
Length = 327
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 141/239 (58%), Gaps = 13/239 (5%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
NA + FK K+ K+Y+ ++ +YRFRVFK NL R K+ Q ++ TA +GVT+FSDLT
Sbjct: 26 NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EF+ ++L ++ +P D + P + + +FDWR+HGAV V D+G CGSCW+FSA
Sbjct: 85 QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDKGDCGSCWAFSA 143
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ IL GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 194
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ + DYPY G + G C+ SK+ ++ ++ DE A L + GP + + ++ L
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPFSSALNALSL 252
>gi|225579644|gb|ACN93991.1| cathepsin L [Dicentrarchus labrax]
Length = 316
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 98/248 (39%), Positives = 136/248 (54%), Gaps = 20/248 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+KS +K Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 23 HWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHTYRLGMNHFGDMTHE 81
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G +L+ + + N L P DWRD+G VT VKDQG CGSCW+FS
Sbjct: 82 EFRQLMNGY--KLKAARKFSGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWAFS 139
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF +G+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ G
Sbjct: 140 TTGALEGQHFRKSGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQG 192
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY GTD C +D + +A + F + S E + + GP++ +I+
Sbjct: 193 LDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDIPSGKEHALMKAVAAVGPVS---VAID 249
Query: 285 LPHISFSF 292
H SF F
Sbjct: 250 AGHESFQF 257
>gi|8468605|gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 94/237 (39%), Positives = 121/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC GGLMN+AF +I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFGWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTENSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/300 (38%), Positives = 151/300 (50%), Gaps = 47/300 (15%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
L LLL S LA+A AV+ I +V + ++ FK + K Y ++ E
Sbjct: 3 LFLLLVSFLAAANAVS-----IFNLVKEE--------------WNAFKLQHRKKYDSESE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRL----R 121
R +++ N + AK Q D V K++DL EF G NR +
Sbjct: 44 ERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSK 103
Query: 122 LPADAQ----KAPIL---PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
L Q + PI P N D+PT DWR+ GAVT VKDQG CGSCWSFSATGALEG
Sbjct: 104 LLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQ 163
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
HF TG+LVSLSEQ LVDC + ++GCNGGLM++AF+Y+ G++ EK YP
Sbjct: 164 HFRKTGKLVSLSEQNLVDCS-------TKYGNNGCNGGLMDNAFQYVKDNKGIDTEKAYP 216
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
Y D C ++ I A F I DE + L GP++ +I+ H SF F
Sbjct: 217 YEAID-DECHYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVS---VAIDASHESFQF 272
>gi|111226635|ref|XP_641720.2| cysteine proteinase [Dictyostelium discoideum AX4]
gi|38372247|sp|Q94504.1|CYSP7_DICDI RecName: Full=Cysteine proteinase 7; AltName: Full=Proteinase 1;
Flags: Precursor
gi|1644502|gb|AAC47482.1| cysteine proteinase [Dictyostelium discoideum]
gi|90970688|gb|EAL67742.2| cysteine proteinase [Dictyostelium discoideum AX4]
Length = 460
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 122/217 (56%), Gaps = 22/217 (10%)
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EE + R+ +FKAN+ V G+ F+D++ E+R +LG P D
Sbjct: 42 SSEEFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEYRATYLGT------PFD 95
Query: 126 AQKAPILPTN---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE-- 180
A + ++ D DWR GAVT +K+QG CG CWSFS TGA EGA +L+ G+
Sbjct: 96 ASSLEMTESDKIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKN 155
Query: 181 LVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
LVSLSEQ L+DC SGS ++GC GGLM AFEYI+ G++ E YPYT DG
Sbjct: 156 LVSLSEQNLIDC--------SGSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDG 207
Query: 240 GSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGP 275
CKF+ +AA +S++ +V S E +AA V GP
Sbjct: 208 KKCKFNPKNVAAQLSSYVNVTSGSESDLAAK-VTQGP 243
>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
Length = 475
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 135/232 (58%), Gaps = 15/232 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSE 108
+H F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+D+T SE
Sbjct: 166 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 225
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
++ + GL +R A A ++P +LP +FDWR AVT VK+QG+CGSCW+FS
Sbjct: 226 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 284
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I GG+
Sbjct: 285 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 335
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
E E +YPY C F+++ V+ F + +E M L+ +GP++
Sbjct: 336 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPIS 386
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 86/241 (35%), Positives = 132/241 (54%), Gaps = 13/241 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + S+ K Y + EE RF +FK NL+ + G+ +F+DL+ EF++Q
Sbjct: 8 FESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVSNYWLGLNEFADLSHHEFKKQ 67
Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
+LGL ++ + DLP DWR GAVT +K+QG+CGSCW+FS A+EG
Sbjct: 68 YLGLKVDFSTRRESSEEFTYRDVDLPKSVDWRKKGAVTNIKNQGSCGSCWAFSTVAAVEG 127
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
+ + TG L SLSEQ+L+DCD + +SGCNGGLM+ AF +I++ GG+ +E DY
Sbjct: 128 INQIVTGNLTSLSEQELIDCDR--------TYNSGCNGGLMDYAFSFIVENGGLHKEDDY 179
Query: 233 PYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFS 291
PY + G+C+ K + +S + + + +Q + + PL+ +IE F
Sbjct: 180 PYI-MEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQ 235
Query: 292 F 292
F
Sbjct: 236 F 236
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 94/270 (34%), Positives = 140/270 (51%), Gaps = 12/270 (4%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L+LL V A + A + Q + D + A + L K K Y E
Sbjct: 1 MLMLLFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKH--GKNYNALGE 58
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--NRRLRLPADAQ 127
+ RF +FK NL + + T G+ +F+DLT EFR +LG + RLP +
Sbjct: 59 KEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD 118
Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + LP DWR GAV VKDQG CGSCW+FS A+EG + + TG+L++LSEQ
Sbjct: 119 RYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQ 178
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
+LVDCD S + GCNGGLM+ AFE+I+ GG++ E DYPY G DG + K+
Sbjct: 179 ELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKN 230
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++ ++ + +++ V + P++
Sbjct: 231 AKVVSIDSYEDVPENDETALKKAVANQPVS 260
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 88/225 (39%), Positives = 123/225 (54%), Gaps = 12/225 (5%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
+KS K+Y+ E R +++ NL + KR D + + DLT EFR +LG
Sbjct: 30 WKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYLG 89
Query: 116 LNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
+ + P+N +P+ DW G VTGVK+QG CGSCW+FS TG++EG H
Sbjct: 90 VRAHHNSTKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQH 149
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYP 233
F TG LVSLSEQ L+DC SGS ++GC GGLM++AF YI GG++ E YP
Sbjct: 150 FRKTGSLVSLSEQNLIDC--------SGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYP 201
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQ-MAANLVKHGPLA 277
Y G GSC F S + A V+ + I +Q + + + GP++
Sbjct: 202 YLGQQ-GSCHFSSSHVGARVTGYQDIPQGSEQALQSAVATVGPVS 245
>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
Length = 361
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 107/278 (38%), Positives = 150/278 (53%), Gaps = 20/278 (7%)
Query: 5 ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFK 57
++SS++LLL + ASA A + DD+ ++V SDG E S ++ H F+ F
Sbjct: 7 LVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFA 66
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y + EE RF F NL + + G+ KF+D + EF+R LG
Sbjct: 67 RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNKFADWSWEEFQRHRLGAA 126
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + L + LP DWR+ G V+ VKDQG CGSCW+FS TG+LE A+ +
Sbjct: 127 QNC--SATTKGNHKLTADVLPETKDWRESGIVSPVKDQGHCGSCWTFSTTGSLEAAYHQA 184
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPYTG
Sbjct: 185 FGKGISLSEQQLVDCAQAFN-------NQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 237
Query: 238 DGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
D G CKF + V N ++ + DE Q A LV+
Sbjct: 238 D-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 274
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 89/244 (36%), Positives = 135/244 (55%), Gaps = 20/244 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQF 113
+ ++ + YA E + R+ VFK N+ R +R + T V +F+DLT EFR +
Sbjct: 41 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 100
Query: 114 LGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
G L + + + ++ LP DWR GAVT +KDQG CGSCW+FSA A
Sbjct: 101 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 160
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + G+L+SLSEQ+LVDCD + D GC GGLM++AF Y + GG+ E
Sbjct: 161 IEGVAQIKKGKLISLSEQELVDCD---------TNDGGCMGGLMDTAFNYTITIGGLTSE 211
Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
+YPY T+ G+C F+K+K IA ++ F + +++++ V H P++ +A + I
Sbjct: 212 SNYPYKSTN-GTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGD---I 267
Query: 289 SFSF 292
F F
Sbjct: 268 GFQF 271
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 128/229 (55%), Gaps = 12/229 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ + K K+Y E + RF +FK NLR + ++ T G+ +F+DLT E+R +
Sbjct: 54 YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGLNRFADLTNEEYRSR 113
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+LG R LR + + DLP DWR+ GAV VKDQG CGSCW+FS
Sbjct: 114 YLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIA 173
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EG + ++TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++
Sbjct: 174 AVEGINQIATGDLISLSEQELVDCDK--------SYNQGCNGGLMDYAFEFIINNGGIDS 225
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+DYPY D K+ ++ + + ++++ V + P++
Sbjct: 226 EEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVS 274
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 100/248 (40%), Positives = 135/248 (54%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+KS SK Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 HWDLWKSWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHPYRLGMNHFGDMTHE 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G +R + + + + N L P DWRD G VT VKDQG CGSCW+FS
Sbjct: 86 EFRQIMNGYKQR-KTERKFKGSLFMEPNFLEAPRALDWRDKGYVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ G
Sbjct: 145 TTGALEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY GTD C +D + +A + F V S E + + GP++ +I+
Sbjct: 198 LDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDVPSGKERALMKAVAAVGPVS---VAID 254
Query: 285 LPHISFSF 292
H SF F
Sbjct: 255 AGHESFQF 262
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 94/250 (37%), Positives = 135/250 (54%), Gaps = 13/250 (5%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
I ++ SD Q D + A + L K Y E + RF +FK NLR
Sbjct: 42 IPEIPHSDAHQRPDEEVAALYESWLVH--HGKAYNAIGEKERRFEIFKDNLRFIDEHNRE 99
Query: 91 DPTAVHGVTKFSDLTPSEFRRQFLG--LNRRLRL-PADAQKAPILPTNDLPTDFDWRDHG 147
T G+T+F+DLT E+R +FLG +R+ RL A + + +DLP D DWR G
Sbjct: 100 SRTYKVGLTRFADLTNEEYRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKG 159
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AV VKDQG CGSCW+FS+ A+EG + + TGEL+ LSEQ+LVDCD S + G
Sbjct: 160 AVATVKDQGQCGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDK--------SFNMG 211
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLM+ AF++I+ GG++ E+DYPY G D K+ + + + +++
Sbjct: 212 CNGGLMDYAFQFIIGNGGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSL 271
Query: 268 ANLVKHGPLA 277
V + P++
Sbjct: 272 KKAVANQPVS 281
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 92/293 (31%), Positives = 153/293 (52%), Gaps = 21/293 (7%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M IL++ + +LL +A + P +Q + + F + +
Sbjct: 1 MTSTILTTTIFILLMLCNTCVIASESE-------CPPTHKQKSSDVEAMKKRFDGWVKRH 53
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
+ Y +E + RF +++AN++ + + + KF+DLT EF+ ++GL+ RL
Sbjct: 54 GRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLSTRL 113
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
R + DLP DWR GAVT + DQG CG CW+F+A A+EG + + +G+
Sbjct: 114 RSHNTGFRYD--EHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGK 171
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
L+SLSEQ+L+DCD + S + GC GGLM +A+ +I++ GG+ E+DYPY G D G
Sbjct: 172 LISLSEQELIDCDVK-------SGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVD-G 223
Query: 241 SCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+CK +K+ AA++S + + +D + H P++ +I+ SF F
Sbjct: 224 TCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVS---VAIDAGGYSFQF 273
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 98/252 (38%), Positives = 136/252 (53%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
++ FK + K YA E +R ++F N AK Q V + K++D+ E
Sbjct: 29 WNTFKLEHRKNYADSTEETFRMKIFNENKHHIAKHNQRYATGEVSYKLALNKYADMLHHE 88
Query: 109 FRRQFLGLN----RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSC 161
FR G N ++LR ++ + + LPT DWR GAVT VKDQG CGSC
Sbjct: 89 FRETMNGFNYTLHKQLRSTDESFTGVTFISPEHVKLPTAVDWRTKGAVTEVKDQGHCGSC 148
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGA+EG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF Y+
Sbjct: 149 WAFSSTGAIEGQHFRKSGTLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYVK 201
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK Y Y G D SC FDK+ I A F+ I +E ++A + GP++
Sbjct: 202 DNGGIDTEKSYAYEGID-DSCHFDKNSIGATDRGFADIPQGNEKKLAQAVATIGPVS--- 257
Query: 281 ASIELPHISFSF 292
+I+ SF F
Sbjct: 258 VAIDASQQSFQF 269
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 83/228 (36%), Positives = 128/228 (56%), Gaps = 11/228 (4%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ + + K Y E + RF +FK NLR + + G+ +F+DLT E+R
Sbjct: 47 YEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSM 106
Query: 113 FLGLNRRLRLPADAQKA---PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
FLG N ++ + + K+ + LP DWR+ GAV+ VKDQG CGSCW+FS A
Sbjct: 107 FLGGNMEMKERSASTKSDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISA 166
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TGEL+SLSEQ+LVDCD S + GCNGGLM+ F++I+ GG++ E
Sbjct: 167 VEGINQIVTGELISLSEQELVDCDK--------SYNMGCNGGLMDYGFQFIINNGGIDTE 218
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+DYPY DG +F K+ +++ + + D++ V + P++
Sbjct: 219 EDYPYRAVDGTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVS 266
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 115/192 (59%), Gaps = 15/192 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR-----RAKRRQLLDPTAVHGVTKFSDLTPS 107
F L+K K K Y EE + R FK NL+ KR+ L+ G+ KF+DL+
Sbjct: 50 FKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKV--GLNKFADLSNE 107
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EFR +L ++ + +K L T D P+ DWR+ G VT VKDQG CGSCWSFS T
Sbjct: 108 EFREMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDCGSCWSFSTT 167
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GA+E + + TG+L+SLSEQ+LVDCD + + GC GG M+SAF++++ GG++
Sbjct: 168 GAIEAINAIVTGDLISLSEQELVDCDT--------TNNYGCEGGDMDSAFQWVIGNGGID 219
Query: 228 REKDYPYTGTDG 239
E DYPYTG DG
Sbjct: 220 TEADYPYTGVDG 231
>gi|358339045|dbj|GAA32724.2| cathepsin F, partial [Clonorchis sinensis]
Length = 271
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 93/221 (42%), Positives = 124/221 (56%), Gaps = 18/221 (8%)
Query: 80 NLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
L AKR Q ++ TA +GVT+FSDLT EF+ ++L R+R + P D+
Sbjct: 1 QLAAAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMRFDGPIVSEDLTPEEDVT 56
Query: 139 TD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
D FDWR+HGAV V DQG CGSCW+FS G +EG F TG+L++LSEQQLVDCDH
Sbjct: 57 MDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDH- 115
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
D GCNGG + I K GG+E DYPYTG D G C ++SK A V++
Sbjct: 116 --------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD-GICYMNQSKFVAYVND 166
Query: 256 FSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTV 296
+V+ E A L + GPL+ + ++ L +F +
Sbjct: 167 STVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 207
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 100/253 (39%), Positives = 137/253 (54%), Gaps = 21/253 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLT 105
+ + FK +K Y + E +R ++F N AK +L V G+ K++D+
Sbjct: 24 QEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83
Query: 106 PSEFRRQFLGLNRR---LRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGS 160
EF + G NR LR LP + LP DWRD GAVT VKDQG CGS
Sbjct: 84 HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CWSFSATG+LEG HF +G+LVSLSEQ LVDC E+ G ++GCNGGLM++AF YI
Sbjct: 144 CWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRYI 196
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
GG++ E+ YPY D C + K+K A + S +ED++ + + GP++
Sbjct: 197 KANGGIDTEQAYPYKAED-EKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVS-- 253
Query: 280 VASIELPHISFSF 292
+I+ H SF
Sbjct: 254 -VAIDASHQSFQL 265
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 98/250 (39%), Positives = 133/250 (53%), Gaps = 24/250 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +K SK Y EE +R +++ NL++ + L +H G+ F D+T
Sbjct: 28 HWDQWKKWHSKKYHATEE-GWRRVIWEKNLKKIEMHNLEHSMGIHTYRLGMNHFGDMTHE 86
Query: 108 EFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G +RR R + I ++P DWR+ G VT VKDQG CGSCW+
Sbjct: 87 EFRQVMNGFKHKKDRRFRGSLFMEPNFI----EVPNKLDWREKGYVTPVKDQGECGSCWA 142
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGALEG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+
Sbjct: 143 FSTTGALEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDQ 195
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
G++ E+ YPY GTD C FD AA + F + S E + + GP++ +
Sbjct: 196 NGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALMKAIAAVGPVS---VA 252
Query: 283 IELPHISFSF 292
I+ H SF F
Sbjct: 253 IDAGHESFQF 262
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 101/242 (41%), Positives = 136/242 (56%), Gaps = 19/242 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FK + + Y EE + RF +FK NL+ K+ L + G+ +F+D+ EFR
Sbjct: 45 FKKQHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR- 103
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+ GL R + Q + L L P + DWR G VT VK+QG CGSCWSFS TG+
Sbjct: 104 MYNGLRRDYNYSREVQCSNHLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGS 163
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LEG HF +G+LVSLSEQQLVDC + E GCNGGLM+ AFEYI+ GG+E E
Sbjct: 164 LEGQHFHKSGKLVSLSEQQLVDCSGKFGNE-------GCNGGLMDQAFEYIITNGGIETE 216
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
++YPY C F KS++AA S V S DE + ++ + GP++ +I+ H
Sbjct: 217 EEYPYDARQ-ERCHFKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVS---IAIDASHQ 272
Query: 289 SF 290
SF
Sbjct: 273 SF 274
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 104/299 (34%), Positives = 152/299 (50%), Gaps = 22/299 (7%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R + + + L L++ V + + N+ ++ + + + L+ AE +S FK+ K
Sbjct: 2 RPLEALIRLFLVTHVPLNGIWKNEGFVVLGCLFVTAAAITHQELVGAE--WSAFKALHGK 59
Query: 63 TYATQEEHDYRFRVFKANLRRAKR--RQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLNR 118
Y ++ E YR +++ N + R + + A + + +F DL EF G R
Sbjct: 60 EYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKR 119
Query: 119 RLRLPADAQKAPILPT----NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
R I P LP DWR GAVT VK+QG CGSCW+FS TG+LEG H
Sbjct: 120 NYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQH 179
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F TG +VSLSEQ LVDC + ++GC GGLM++AF+YI GG++ E YPY
Sbjct: 180 FRKTGRMVSLSEQNLVDCSGKFG-------NNGCEGGLMDNAFKYIKANGGIDTELSYPY 232
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELPHISFSF 292
GTD G C F+KS + A + F I +Q+ V GP++ +I+ H SF F
Sbjct: 233 NGTD-GICHFEKSDVGATDTGFVDIPEGNEQLLKKAVATVGPVS---VAIDASHESFQF 287
>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
Length = 358
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 105/282 (37%), Positives = 148/282 (52%), Gaps = 21/282 (7%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL----NAEH--HFS 54
M R S L++L+ AS+ + DD+ IR VV + E +L ++ H F+
Sbjct: 1 MARTSFSLLIILIACVAGASSASTFDDENPIRTVVSDALREFETSILSVLGDSRHALSFA 60
Query: 55 LFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
F ++ K Y T EE RF +F NL+ + + GV F+D T EFRR L
Sbjct: 61 RFAHRYGKRYETAEETKLRFAIFSENLKLIRSHNKKGLSYTLGVNHFADWTWEEFRRHRL 120
Query: 115 GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
G + A + L LP DWR G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 121 GAAQNC--SATTKGNHKLTEEALPEMKDWRVSGIVSPVKDQGHCGSCWTFSTTGALEAAY 178
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYP 233
+ G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEY+ GG++ E+ YP
Sbjct: 179 KQAFGKGISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAYP 230
Query: 234 YTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
YTG + G CKF + V N ++ + DE + A V+
Sbjct: 231 YTGKN-GECKFSSENVGVQVLDSVNITLGAEDELKHAVAFVR 271
>gi|312377879|gb|EFR24605.1| hypothetical protein AND_10691 [Anopheles darlingi]
Length = 375
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 100/259 (38%), Positives = 136/259 (52%), Gaps = 24/259 (9%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
+SE+HL + FS FK K KTYA+ EH++R VF+ NLR + V
Sbjct: 64 RSEEHLHDE---FSRFKGKHQKTYASDREHEHRLNVFRQNLRFIHSHNRANRGFTVAVNH 120
Query: 101 FSDLTPSEFR--RQFLGLNRRLRLPADAQKAPILPT---NDLPTDFDWRDHGAVTGVKDQ 155
+D T E + R F R + Q P P +DLP +DWR GAVT VKDQ
Sbjct: 121 LADRTEDEMKSLRGF----RSSNVYNGGQAFPYKPAAHMDDLPDSWDWRISGAVTPVKDQ 176
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
CGSCWSF G +EGA+F T +LV S+Q LVDC G ++GC+GG
Sbjct: 177 SVCGSCWSFGTIGHIEGAYFRKTQKLVRFSQQALVDC-------SWGYGNNGCDGGEDFR 229
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
A+++I++ GGV E +Y Y G D G C+ + + A ++ + +V S D D L KHG
Sbjct: 230 AYQWIMQVGGVPMEDEYEYLGQD-GYCRVENVTLYAPITGWVNVTSGDPDAFKVALFKHG 288
Query: 275 PLAGNVASIELPHISFSFL 293
PL+ +I+ H SFSF
Sbjct: 289 PLS---IAIDAGHKSFSFY 304
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 96/248 (38%), Positives = 137/248 (55%), Gaps = 20/248 (8%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M I S L LL+ S+L ++++ V +D ++E A + + +
Sbjct: 1 MATPIKSITLALLIFSMLLISLSLG-------SVTAADTTRNEAE---ARRMYEQWLVEN 50
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFL-GLNR 118
K Y E + RF +F NL+ + + + T G+T+F+DLT EFR +L
Sbjct: 51 RKNYNGLGEKETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKME 110
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R R+P ++ + LP DWR GAV VKDQG CGSCW+FSA GA+EG + + T
Sbjct: 111 RTRVPVKGERYLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKT 170
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GEL+SLSEQ+LVDCD S + GC GGLM+ AF++I++ GG++ E+DYPYT TD
Sbjct: 171 GELISLSEQELVDCDT--------SYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATD 222
Query: 239 GGSCKFDK 246
C DK
Sbjct: 223 DNICNSDK 230
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 98/285 (34%), Positives = 149/285 (52%), Gaps = 22/285 (7%)
Query: 2 ERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD--GEQSEDHLLNAEHHFSLFKSK 59
+ + +++L+LLL +A + A+ +V PS G + + + ++
Sbjct: 9 KHITMTTLMLLLCVIAIADCIC---QAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQ 65
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA-VHGVTKFSDLTPSEFRRQFLGLNR 118
+ + Y E +RF+VFKAN R V G +F+DLT EF + GL +
Sbjct: 66 YRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRK 125
Query: 119 RLRLPADAQKAPI------LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
+P+ A++ P D DWR GAVT VK+QG CG CW+FSA GA+EG
Sbjct: 126 PAAVPSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEG 185
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
++TG LVSLSEQQ++DCD E G + GCNGG M++AF+Y++ GGV E Y
Sbjct: 186 LIMITTGNLVSLSEQQILDCD-----ESDG--NQGCNGGYMDNAFQYVVNNGGVTTEDAY 238
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
PY+ G+C+ + AA +S F + S ++ AN V + P++
Sbjct: 239 PYSAVQ-GTCQ--NVQPAATISGFQDLPSGDENALANAVANQPVS 280
>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 92/245 (37%), Positives = 128/245 (52%), Gaps = 17/245 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPS 107
++ ++S + K YA EE D+R V++ N++ +R HG T F D+T
Sbjct: 28 QWNQWRSTYKKPYAVNEE-DWRRAVWEKNVKMIERHNQEYSQGKHGFTMAMNAFGDMTNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EFR+ G + P+ +PT DW G VT VK+QG CGSCW+FSAT
Sbjct: 87 EFRQVMNGFQNQKHKKGKLFYEPVF--GHIPTSVDWTQKGYVTPVKNQGQCGSCWAFSAT 144
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF+Y+ GG++
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSRR-------EGNEGCNGGLMDNAFQYVQDNGGLD 197
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPH 287
E+ YPY TD +C + AA + F I E + + GP++ +I+ H
Sbjct: 198 SEESYPYLATDTHTCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAIDAGH 254
Query: 288 ISFSF 292
SF F
Sbjct: 255 ESFQF 259
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 99/292 (33%), Positives = 156/292 (53%), Gaps = 26/292 (8%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
S L+L S L +++A D +++ S+ +S D L+ F + SK K Y
Sbjct: 5 FSKALVLACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSKHGKIYQ 59
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLR 121
+ EE RF +FK NL+ R + G+ +F+DL+ EF+ ++LGL +RR
Sbjct: 60 SIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRE 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
P + + +LP DWR GAV VK+QG+CGSCW+FS A+EG + + TG L
Sbjct: 120 SPEEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 175
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
SLSEQ+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E+DYPY + G+
Sbjct: 176 TSLSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGT 226
Query: 242 CKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
C+ K + +S + + + +Q + + PL+ +IE F F
Sbjct: 227 CEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQF 275
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 104/297 (35%), Positives = 151/297 (50%), Gaps = 50/297 (16%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
+LLL+L +V++ A A V+P + E + ++K + K Y T+
Sbjct: 1 MLLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R +F+ N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96
Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
K P+L + ND LP DWR+ V+ VKDQG CGSCW+FS TG+LEG H
Sbjct: 97 ---KKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206
Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
TD CKFD S + A + + V SS+E + + GP++ +I+ H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVS---VAIDAGHESFQF 260
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 97/248 (39%), Positives = 133/248 (53%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +KS K+Y +EE +R V++ +LR + L H G+ F D+
Sbjct: 28 HWEQWKSWHGKSYEQKEE-TWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + Q + L N ++P DWRD G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQLMNGYKYK-QTHKKLQGSHFLEPNFQEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF TG+LVSLSEQ LV+C PE + GCNGGLM+ AF+Y+ GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECS---KPE----GNEGCNGGLMDQAFQYVKDNGG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY GTD C ++ AA + F + S E + + GP++ +I+
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVS---VAID 255
Query: 285 LPHISFSF 292
H SF F
Sbjct: 256 AGHTSFQF 263
>gi|15593249|gb|AAL02221.1|AF410881_1 cysteine protease CP10 precursor [Frankliniella occidentalis]
Length = 334
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 98/261 (37%), Positives = 135/261 (51%), Gaps = 23/261 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
+PSD + + H+ FK+ +KTYA E YR +VFK N +R AK L
Sbjct: 18 IPSD--------MEIQAHWESFKATHAKTYANTVEEAYRAKVFKENAIRIAKHNDLFASG 69
Query: 94 AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
V G ++++D+ E + G L+ + + DWR GAVT
Sbjct: 70 EVTFKVGYSQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVT 129
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
+KDQG CGSCWSFSATG+LEG FL LVSLSEQ LVDC + E GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
GLM+SAFEY+ GG++ E+ YPYT DG SC + + A + + V + E +
Sbjct: 183 GLMDSAFEYVESNGGIDTEESYPYTAVDGDSCLYKAANNAGVNTGYKDVQAKSESALRDA 242
Query: 270 LVKHGPLAGNVASIELPHISF 290
+ K GP++ +I+ + SF
Sbjct: 243 VEKAGPVS---VAIDASNWSF 260
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 100/254 (39%), Positives = 137/254 (53%), Gaps = 21/254 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDL 104
+ + FK +K Y + E +R ++F N AK +L V G+ K++D+
Sbjct: 23 VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADM 82
Query: 105 TPSEFRRQFLGLNRR---LRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACG 159
EF + G NR LR LP + LP DWRD GAVT VKDQG CG
Sbjct: 83 LHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCG 142
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCWSFSATG+LEG HF +G+LVSLSEQ LVDC E+ G ++GCNGGLM++AF Y
Sbjct: 143 SCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRY 195
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
I GG++ E+ YPY D C + K+K A + S +ED++ + + GP++
Sbjct: 196 IKANGGIDTEQAYPYKAED-EKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVS- 253
Query: 279 NVASIELPHISFSF 292
+I+ H SF
Sbjct: 254 --VAIDASHQSFQL 265
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 84/221 (38%), Positives = 123/221 (55%), Gaps = 10/221 (4%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K Y E + RF +FK NL + + T G+ +F+DLT EFR +LG
Sbjct: 57 KHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRT 116
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
+ RLP + + + LP DWR GAV VKDQG CGSCW+FS A+EG + +
Sbjct: 117 GHKKRLPKTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKI 176
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+L++LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E DYPY G
Sbjct: 177 VTGDLIALSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLG 228
Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DG + K+ ++ ++ + +++ V + P++
Sbjct: 229 RDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVS 269
>gi|261328618|emb|CBH11596.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
gambiense DAL972]
Length = 404
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 91/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ------- 112
+ K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR +
Sbjct: 2 YGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASY 61
Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
F +RLR K + T P DWR+ GAVT +KDQG CGSCW+F + G +EG
Sbjct: 62 FAAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPMKDQGQCGSCWAFYSIGNIEG 115
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREK 230
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 116 QWQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEA 166
Query: 231 DYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 167 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 215
>gi|328869030|gb|EGG17408.1| cysteine protease [Dictyostelium fasciculatum]
Length = 379
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 97/247 (39%), Positives = 143/247 (57%), Gaps = 29/247 (11%)
Query: 59 KFSKTYATQEEHDY--RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG- 115
+F K+Y E D+ RF VFK N+ V + +F+D+T E+RR +LG
Sbjct: 45 RFEKSY---ESFDFLQRFAVFKTNMDYVHEWNSKKLPTVLELNQFADITNQEYRRLYLGT 101
Query: 116 -LNRR--LRLPADAQKA----PILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFS 165
+N R L P + + + +D + DWR GAV+ +K+QG CGSCWSFS
Sbjct: 102 RINARHLLGTPGTHEMSNNFGKVFGDDDSDSSGATVDWRAKGAVSPIKNQGQCGSCWSFS 161
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAG 224
TG++EGAH++STG++V LSEQ LVDC SGS + GC GGLMN AF+YI+K
Sbjct: 162 TTGSVEGAHYISTGKMVPLSEQNLVDC--------SGSEGNMGCQGGLMNLAFDYIIKNE 213
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASI 283
G++ E YPY+ G C F+K+ + A +S++ I+S ++ A+ VK+ GP++ +I
Sbjct: 214 GIDTEDSYPYSAETGKKCLFNKTNVGATISSYKNITSGDESNLADAVKNAGPVS---VAI 270
Query: 284 ELPHISF 290
+ H SF
Sbjct: 271 DASHNSF 277
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 156/301 (51%), Gaps = 34/301 (11%)
Query: 11 LLLLSSVLA--SAVAVNDDDAMIRQVVPSDGEQSEDHLLNA---------EHHFSLFKSK 59
+L + SVLA S V +++ + + + H+L A E + FK
Sbjct: 26 VLWIVSVLAVVSGANVQNENVQWFDLESAQKHPEQLHILKAQTGINYQPYEQAWKEFKIL 85
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSEFRRQFLG 115
K+Y EE RF +F+ N+ R ++ L + GV +F+DL +EF F G
Sbjct: 86 HDKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKSYYLGVNQFTDLEYAEFV-NFNG 144
Query: 116 LNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
L ++ + + + L N++ P DWR G VT VK+QGACGSCW+FSATG+LEG
Sbjct: 145 L--KMTNLNNTKCSSHLSANNIVVPDSVDWRSKGYVTKVKNQGACGSCWAFSATGSLEGQ 202
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDY 232
+F G+LV LSE QLVDC SGS + GCNGG M +AF+Y+ GG+E E DY
Sbjct: 203 YFRKNGKLVPLSESQLVDC--------SGSFGNEGCNGGFMENAFKYVKSVGGIESESDY 254
Query: 233 PYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIELPHISFS 291
PY +C FDK+K+ A VS V S E + + + GP++ +I+ H SF
Sbjct: 255 PYKARQ-RTCAFDKTKVIATVSGCVDVESGSESSLKEVVSEVGPVS---VAIDAGHSSFQ 310
Query: 292 F 292
Sbjct: 311 L 311
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 87/241 (36%), Positives = 131/241 (54%), Gaps = 17/241 (7%)
Query: 60 FSKTYATQEEHDYRFR------VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
F+K + +YRF +++ N+ R + + + + +F DLT +EF R F
Sbjct: 30 FAKWMRENTKSNYRFVYSNEEFIYRWNVWRDEEHNRQNKSYFLAMNQFGDLTNAEFNRLF 89
Query: 114 LGLNRRLRLPADAQKA-PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
GL A A P P +P++FDWR GAVT VK+QG CGSCWSFS TG+ EG
Sbjct: 90 KGLAFDYSKHAKIHTAAPEAPATGIPSEFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEG 149
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
A+FL TG LVSLSEQ L+DC ++GCNGGLM+ AFEYI+ G++ E Y
Sbjct: 150 ANFLKTGRLVSLSEQNLIDCSVSYG-------NNGCNGGLMDYAFEYIINNRGIDTEASY 202
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
PY +C+++ + +++ ++ ++S ++ N P++ +I+ H SF F
Sbjct: 203 PYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKEPVS---VAIDASHNSFQF 259
Query: 293 L 293
Sbjct: 260 Y 260
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 97/248 (39%), Positives = 133/248 (53%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +KS K+Y +EE +R V++ +LR + L H G+ F D+
Sbjct: 28 HWEQWKSWHGKSYEQKEE-TWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + Q + L N ++P DWRD G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQLMNGYKYK-QTHKKLQGSHFLEPNFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF TG+LVSLSEQ LV+C PE + GCNGGLM+ AF+Y+ GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECS---KPE----GNEGCNGGLMDQAFQYVKDNGG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY GTD C ++ AA + F + S E + + GP++ +I+
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVS---VAID 255
Query: 285 LPHISFSF 292
H SF F
Sbjct: 256 AGHTSFQF 263
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 120/202 (59%), Gaps = 12/202 (5%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ + K K Y E+ +RF V+K NL + + + T G+TKF+DLT EFRR
Sbjct: 53 QFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSET-NRTYSLGLTKFADLTNEEFRR 111
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ G A + ++ P DWR +GAVT VKDQG+CGSCW+FSA G++E
Sbjct: 112 MYTGTRIDRSRRAKRRTGFRYADSEAPESVDWRKNGAVTSVKDQGSCGSCWAFSAVGSVE 171
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + GE VSLSEQ+LVDCD E + GCNGGLM+ AF++I++ GG++ EKD
Sbjct: 172 GINAIRNGEAVSLSEQELVDCDLE--------YNQGCNGGLMDYAFDFIIQNGGIDTEKD 223
Query: 232 YPYTGTDGGSCKFDKSKIAAAV 253
YPY G DG + D SK A V
Sbjct: 224 YPYKGFDG---RCDNSKKNAHV 242
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 91/255 (35%), Positives = 138/255 (54%), Gaps = 22/255 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
+PSDG+ D + + + + ++ KT + D RF +FK NLR
Sbjct: 33 LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEN 90
Query: 91 DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
+ A + G+TKF+DLT E+R+ +LG RR+ + + N ++P D
Sbjct: 91 NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR GAV +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
S + GCNGGLM+ AF++I+K GG+ EKDYPY G G F K+ ++ + + +
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262
Query: 263 EDQMAANLVKHGPLA 277
++ + + P++
Sbjct: 263 DETALKKAISYQPVS 277
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 97/248 (39%), Positives = 133/248 (53%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +KS K+Y +EE +R V++ +LR + L H G+ F D+
Sbjct: 28 HWEQWKSWHGKSYEQKEE-TWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + Q + L N ++P DWRD G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQLMNGYKYK-QTHKKLQGSHFLEPNFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF TG+LVSLSEQ LV+C PE + GCNGGLM+ AF+Y+ GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECS---KPE----GNEGCNGGLMDQAFQYVKDNGG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY GTD C ++ AA + F + S E + + GP++ +I+
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVS---VAID 255
Query: 285 LPHISFSF 292
H SF F
Sbjct: 256 AGHTSFQF 263
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 97/248 (39%), Positives = 133/248 (53%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +KS K+Y +EE +R V++ +LR + L H G+ F D+
Sbjct: 28 HWEQWKSWHGKSYEQKEE-TWRRMVWEEHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + Q + L N ++P DWRD G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQLMNGYKYK-QTHKKLQGSHFLEPNFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG HF TG+LVSLSEQ LV+C PE + GCNGGLM+ AF+Y+ GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECS---KPE----GNEGCNGGLMDQAFQYVKDNGG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY GTD C ++ AA + F + S E + + GP++ +I+
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVS---VAID 255
Query: 285 LPHISFSF 292
H SF F
Sbjct: 256 AGHTSFQF 263
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 100/248 (40%), Positives = 127/248 (51%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNGYHGS-RKSGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 TTGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAGNVASIE 284
++ EK YPY D G C+F K + A + + I + ED + + GP++ +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGCEDDLKKAVATVGPIS---VAID 253
Query: 285 LPHISFSF 292
H SF
Sbjct: 254 ASHSSFQL 261
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 79/193 (40%), Positives = 114/193 (59%), Gaps = 15/193 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F L+K + + Y EE RF +FK NL+ R G+ KF+D++ EF+ +
Sbjct: 46 FHLWKERHKRVYKHAEETAKRFEIFKENLKYVIERNSKGHRHTLGMNKFADMSNEEFKEK 105
Query: 113 FLGLNRRLR------LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L ++ L Q+ + + P+ DWR G VTG+KDQG CGSCW+FS+
Sbjct: 106 YLSKIKKPINKKNNYLRRSMQQKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSS 165
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGA+EG + + TG+L+SLSEQ+LVDCD + + GC GG M+ AFE+++ GG+
Sbjct: 166 TGAMEGINAIVTGDLISLSEQELVDCD---------TTNYGCEGGYMDYAFEWVISNGGI 216
Query: 227 EREKDYPYTGTDG 239
+ E DYPYTGTDG
Sbjct: 217 DSESDYPYTGTDG 229
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 89/244 (36%), Positives = 135/244 (55%), Gaps = 20/244 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQF 113
+ ++ + YA E + R+ VFK N+ R +R + T V +F+DLT EFR +
Sbjct: 35 WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 94
Query: 114 LGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
G L + + + ++ LP DWR GAVT +KDQG CGSCW+FSA A
Sbjct: 95 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 154
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + G+L+SLSEQ+LVDCD + D GC GGLM++AF Y + GG+ E
Sbjct: 155 IEGVAQIKKGKLISLSEQELVDCD---------TNDGGCMGGLMDTAFNYTITIGGLTSE 205
Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
+YPY T+ G+C F+K+K IA ++ F + +++++ V H P++ +A + I
Sbjct: 206 SNYPYKSTN-GTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGD---I 261
Query: 289 SFSF 292
F F
Sbjct: 262 GFQF 265
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 99/249 (39%), Positives = 134/249 (53%), Gaps = 21/249 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ K+Y ++ E R+++F N L AK V G+ +F DL P
Sbjct: 6 QWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPH 65
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 66 EFAKMFNGYHGE-RKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFS 124
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAG 224
ATG+LEG HFL +G+LVSLSEQ L+DC SGS + GC GGLM++AF+YI
Sbjct: 125 ATGSLEGQHFLKSGKLVSLSEQNLIDC--------SGSFGNEGCGGGLMDNAFKYIKAND 176
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASI 283
G++ E+ YPY D G C+F K + A + F + ED + + GP++ +I
Sbjct: 177 GIDTEESYPYEAMD-GDCRFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPIS---VAI 232
Query: 284 ELPHISFSF 292
+ H SF
Sbjct: 233 DASHSSFQL 241
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 99/248 (39%), Positives = 130/248 (52%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVF-KANLRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F +++L A+ V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
++ EK YPY D G C+F K + A + + I + ED + + GP++ +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPIS---VAID 253
Query: 285 LPHISFSF 292
H SF
Sbjct: 254 ASHSSFQL 261
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 91/255 (35%), Positives = 138/255 (54%), Gaps = 22/255 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
+PSDG+ D + + + + ++ KT + D RF +FK NLR
Sbjct: 33 LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNED 90
Query: 91 DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
+ A + G+TKF+DLT E+R+ +LG RR+ + + N ++P D
Sbjct: 91 NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR GAV +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
S + GCNGGLM+ AF++I+K GG+ EKDYPY G G F K+ ++ + + +
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262
Query: 263 EDQMAANLVKHGPLA 277
++ + + P++
Sbjct: 263 DETALKKAISYQPVS 277
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 91/251 (36%), Positives = 139/251 (55%), Gaps = 18/251 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSD 103
N + ++ +K+ K Y ++E +R V++ N++ + H + F D
Sbjct: 24 NLDARWTRWKAANGKLY-NKDEEVWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
LT EF++ GL +++ P + +LP + P+ DWR+ G VT VKDQG CGSCW+
Sbjct: 83 LTNEEFKQVMNGL--KIQNPREGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC ++GCNGGLM++AF Y+
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AEGNAGCNGGLMDNAFRYVKDN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG++ E+ YPY D G CK+ + AA + F+ I DE+ + ++ GP++ +I
Sbjct: 194 GGLDSEESYPYLAQD-GRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPIS---VAI 249
Query: 284 ELPHISFSFLF 294
+ +F F +
Sbjct: 250 DASLDTFRFYY 260
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 91/251 (36%), Positives = 139/251 (55%), Gaps = 18/251 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSD 103
N + ++ +K+ K Y ++E +R V++ N++ + H + F D
Sbjct: 24 NLDARWTRWKAANGKLY-NKDEEVWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
LT EF++ GL +++ P + +LP + P+ DWR+ G VT VKDQG CGSCW+
Sbjct: 83 LTNEEFKQVMNGL--KIQNPREGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC ++GCNGGLM++AF Y+
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AEGNAGCNGGLMDNAFRYVKDN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG++ E+ YPY D G CK+ + AA + F+ I DE+ + ++ GP++ +I
Sbjct: 194 GGLDSEESYPYLAQD-GRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPIS---VAI 249
Query: 284 ELPHISFSFLF 294
+ +F F +
Sbjct: 250 DASLDTFRFYY 260
>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
Length = 335
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 102/273 (37%), Positives = 142/273 (52%), Gaps = 23/273 (8%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+S++ L L VL + A I V+P G E +F ++ K K Y+
Sbjct: 1 MSAMKLFLGLCVLVHVCS-----AFIPLVLPIPGLY--------EDYFKEWQEKHGKVYS 47
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
T+EE R +VF N+ + V +++D+T EF+ Q+L +
Sbjct: 48 TEEESQSRLKVFMKNVIYIDNHNKQGHSYELEVNEYADMTLDEFKDQYLMEPQHCSATHS 107
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ P D P DWR GAVT VK+QG CGSCW+FS TG LE HFL TG+LVSLS
Sbjct: 108 LKSDPP-KYRDPPKAIDWRSKGAVTPVKNQGQCGSCWTFSTTGCLESHHFLKTGQLVSLS 166
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDC + ++GCNGGL + AFEYI GG++ E+ YPY D C F
Sbjct: 167 EQQLVDCAQAFN-------NNGCNGGLPSQAFEYIHYNGGLDSEESYPYRAHD-EKCHFV 218
Query: 246 KSKIAAAVSN-FSVISSDEDQMAANLVKHGPLA 277
S+++A VSN ++ S DE Q+ + GP++
Sbjct: 219 PSEVSATVSNVVNITSKDEMQLYNAVGTVGPVS 251
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 91/254 (35%), Positives = 137/254 (53%), Gaps = 22/254 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
+PSDG+ D + + + + ++ KT + D RF +FK NLR
Sbjct: 33 LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEN 90
Query: 91 DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
+ A + G+TKF+DLT E+R+ +LG RR+ + + N ++P D
Sbjct: 91 NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR GAV +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
S + GCNGGLM+ AF++I+K GG+ EKDYPY G G F K+ ++ + + +
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262
Query: 263 EDQMAANLVKHGPL 276
++ + + P+
Sbjct: 263 DETALKKAISYQPV 276
>gi|443696723|gb|ELT97360.1| hypothetical protein CAPTEDRAFT_147978 [Capitella teleta]
Length = 274
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 94/223 (42%), Positives = 130/223 (58%), Gaps = 19/223 (8%)
Query: 82 RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
RR + ++ D A +G + F+DLT EFR+ +L + + A I P P F
Sbjct: 5 RRIQEKEQGD--ATYGASPFADLTAEEFRKNYLSPVWNVTHDPFLKPASI-PIETPPDAF 61
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDH AVT VK+QG+CGSCW+FS TG +EG + +L+SLSEQ+LVDCD
Sbjct: 62 DWRDHDAVTPVKNQGSCGSCWAFSVTGNVEGQWAIQKKKLLSLSEQELVDCDK------- 114
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
D GCNGGL A++ I++ GG+E EKDYPY G G C F+K+++ ++ ISS
Sbjct: 115 --VDLGCNGGLPLQAYKEIMRIGGLETEKDYPYEGK-GDKCVFEKAEVEVNITGAVNISS 171
Query: 262 DEDQMAANLVKHGPLA----GNVASIELPHIS--FSFLFTVSS 298
+ED M A L K+GP++ N + +S FSFL + SS
Sbjct: 172 NEDDMKAWLWKNGPISIGLNANAMQFYMGGVSHPFSFLCSPSS 214
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 101/294 (34%), Positives = 152/294 (51%), Gaps = 37/294 (12%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+ S++ LL +VLA V S + +L+AE + +FK +K Y
Sbjct: 1 MKSVVALLFLAVLAMGQTV-----------------SFNKILDAE--WFIFKLHHNKVYK 41
Query: 66 TQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+ E YR +++ N R+ ++ +L + T G+ K+ D+ EF G N+ +
Sbjct: 42 SPVEEGYRMKIYMDNKRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVNTLNGFNKSVT 101
Query: 122 LPADAQKAPIL-PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ + + P N LP + DW GAVT VKDQG CGSCW+FS+TGALEG HF STG
Sbjct: 102 AGIETEGVTFISPANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALEGQHFRSTG 161
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
LVSLSEQ L+DC + ++GCNGGLM+ AF+YI G++ EK YPY +
Sbjct: 162 YLVSLSEQNLIDCSGKYG-------NNGCNGGLMDYAFQYIKDNKGLDTEKTYPYE-AEN 213
Query: 240 GSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
C+++ A + I DE+++ A + GP++ +I+ H SF
Sbjct: 214 DRCRYNPRNSGATDKGYVDIPQGDEEKLKAAVATIGPIS---VAIDASHESFQL 264
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 97/250 (38%), Positives = 134/250 (53%), Gaps = 20/250 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+KS SK Y ++E +R V++ NL++ + L H G+ F D+T
Sbjct: 26 DEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKKIEMHNLEHSLGKHSYSLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + R + + L N++ P DWR+ G VT VKDQG CGSCW+
Sbjct: 86 NEEFRQVMNGYKLQQR---KFKGSLFLEPNNMEAPKQVDWREEGYVTPVKDQGQCGSCWA 142
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG F T +LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 143 FSTTGAMEGQMFRKTQKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 195
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
G++ E+ YPY GTD C + AA + F I S E + + GP++ +
Sbjct: 196 SGLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSGKEHALMKAIASVGPVS---VA 252
Query: 283 IELPHISFSF 292
I+ H SF F
Sbjct: 253 IDAGHESFQF 262
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 95/250 (38%), Positives = 138/250 (55%), Gaps = 23/250 (9%)
Query: 43 EDHLLNA------EHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQ-LLDP 92
E H LN+ + SL++S K K Y E + RF +FK N+ R + +
Sbjct: 41 ETHGLNSPPLRTHDQLLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQ 100
Query: 93 TAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLPADAQKAPILPTND---LPTDFDWRDHG 147
+ G+ KF+DLT E+R +L + +R R D ++ D LP DWRD G
Sbjct: 101 SYKLGLNKFADLTNDEYRSLYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRG 160
Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
AV VKDQG CGSCW+FS GA+EG + + TGEL+SLSEQ+LVDCD+ + G
Sbjct: 161 AVAPVKDQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDN--------GYNQG 212
Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
CNGGLM+ AFE+I+K GG++ E DYPY G DG + K+ ++ + + ++++
Sbjct: 213 CNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSL 272
Query: 268 ANLVKHGPLA 277
V H P++
Sbjct: 273 KKAVAHQPVS 282
>gi|328866326|gb|EGG14711.1| hypothetical protein DFA_10969 [Dictyostelium fasciculatum]
Length = 369
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 100/262 (38%), Positives = 146/262 (55%), Gaps = 28/262 (10%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDY--RFRVFKANLRRAKRRQLLDPTAVHGV--TKF 101
L + E + + FK + E H++ RF +FK N+ K D + H +
Sbjct: 33 LFSHEQYTTEFKGWVGQFEKNYESHEFLNRFDIFKKNMDYIKTWN--DKSVDHKLELNTL 90
Query: 102 SDLTPSEFRRQFLG--LNRRLRLP---ADAQ-----KAPILPTNDLPTDFDWRDHGAVTG 151
+DLT E++R +LG +N LR+ AD + K+ D P + DWR GAV+
Sbjct: 91 ADLTDKEYQRLYLGTKVNGALRVGLNHADERDFGHIKSVFSNVKDNP-NVDWRKQGAVSH 149
Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
VK+QG CGSCWSFS+TGA+EGAH + TGE++SLSEQQLVDC ++GCNGG
Sbjct: 150 VKNQGQCGSCWSFSSTGAIEGAHAIKTGEMISLSEQQLVDCSKRYG-------NNGCNGG 202
Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANL 270
LM AF+Y++ AGG+E E+ YPYT TD +C F+ + ++S+ I + +E + L
Sbjct: 203 LMTLAFDYVIDAGGLESEEAYPYTTTDTSACMFNSTNAVTSISDHQNIRAGNEKHLETVL 262
Query: 271 VKHGPLAGNVASIELPHISFSF 292
GP++ +I+ SF F
Sbjct: 263 RNVGPVS---VAIDASPRSFRF 281
>gi|71663165|ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi]
Length = 467
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 94/237 (39%), Positives = 121/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD D GC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DFGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>gi|281207567|gb|EFA81750.1| cysteine protease 4 [Polysphondylium pallidum PN500]
Length = 432
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 86/227 (37%), Positives = 128/227 (56%), Gaps = 14/227 (6%)
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
+E ++R+ VFK N+ ++ + V G+ F+DLT +E++R +LG +
Sbjct: 48 KEFNHRYGVFKKNMDYVQQWNAKGSSTVLGMNIFADLTNAEYQRIYLGTKIDASGLLNVA 107
Query: 128 KAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
A N L DWR GAVT +K+Q CGSCWSFS TG++EGAH +STG LV+
Sbjct: 108 AARAFDRNFNIKALNPTVDWRAKGAVTPIKNQAQCGSCWSFSTTGSVEGAHEISTGNLVA 167
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ L+DC PE + GCNGGLM +A EYI+K GG++ E YPYT T C+
Sbjct: 168 LSEQNLIDCSV---PEG----NQGCNGGLMWAAMEYIIKNGGIDTESSYPYTATGPNKCR 220
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
++ + A +S++ ++S + A+ P++ +I+ H SF
Sbjct: 221 YNSANSGAKISSYVNVTSGSETSLASAANVNPVS---VAIDASHNSF 264
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 129/253 (50%), Gaps = 33/253 (13%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDL 104
+ + FK + K Y ++ E++YR VF NL + L + + DL
Sbjct: 24 VQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDL 83
Query: 105 TPSEFRRQFLGLNR--------------RLRLPADAQK--APILPTN----DLPTDFDWR 144
T EF R + +N L LP D Q LPTN DLPTD DWR
Sbjct: 84 TKDEFMRIYT-VNMPQLPQSENLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWR 142
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
GAVT VK+Q CGSCWSFSATGALE F T +L+SLSEQQLVDC
Sbjct: 143 QKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYG------- 195
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDED 264
+ GC+GG M+ AF YI + GG++ E+ YPYT D G C + AA VS ++ E+
Sbjct: 196 NHGCHGGWMHWAFGYIKENGGIDTEQSYPYTAKD-GRCAYKPGNKAATVSQVIMVPRGEN 254
Query: 265 QMAANLVKHGPLA 277
Q+AA + GP++
Sbjct: 255 QLAAKVSSVGPIS 267
>gi|313224805|emb|CBY20597.1| unnamed protein product [Oikopleura dioica]
Length = 343
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 91/227 (40%), Positives = 128/227 (56%), Gaps = 12/227 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F ++ +FSK Y T EE R + F N Q D T G+ +DLT SEF+
Sbjct: 42 FRQYEVEFSKMYETAEERRIRAQTFSKNFEMITSHNQREDVTWTMGLNFDADLTFSEFQS 101
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
++L +++ A + + + LP +FDWR+HG V+ VK+QG CGSCW+FS TG LE
Sbjct: 102 RYLMVSQDC--SATSTRDLDIDILSLPENFDWREHGGVSPVKNQGHCGSCWTFSTTGCLE 159
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
AH + + +LSEQQLVDC + D + GCNGGL + AFEYI GG+E E+D
Sbjct: 160 SAHLIHHKKAYNLSEQQLVDCAQDFD-------NHGCNGGLPSHAFEYIHYVGGLEEEQD 212
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLA 277
Y Y + G C+FD +K A V F++ +DEDQ+ L P++
Sbjct: 213 YSYHAEE-GLCEFDPTKTAGTVREVFNITETDEDQLTIALAYFNPVS 258
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 93/245 (37%), Positives = 132/245 (53%), Gaps = 24/245 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRR 111
+K++ K+Y +E R ++AN + V G T +F DL SEF+
Sbjct: 25 WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHN--QHAGVFGYTLKMNQFGDLENSEFKS 82
Query: 112 QFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+ G R P + P +P DLP DW G VT VK+QG CGSCWSFSATG
Sbjct: 83 LYNGY-RMSNAPRKGK--PFVPAARVQDLPASVDWSKKGWVTPVKNQGQCGSCWSFSATG 139
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
++EG HF +TG L+SLSEQ LVDC + + GCNGGLM+ AFEY++K G++
Sbjct: 140 SMEGQHFNATGTLMSLSEQNLVDC-------SAAEGNHGCNGGLMDDAFEYVIKNNGIDT 192
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAGNVASIELPH 287
E YPY D +CKF+ + + A +S + ++ D E + + GP++ +I+ H
Sbjct: 193 EASYPYRAVD-STCKFNTADVGATISGYVDVTKDSESDLQVAVATIGPVS---VAIDASH 248
Query: 288 ISFSF 292
ISF F
Sbjct: 249 ISFQF 253
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 94/232 (40%), Positives = 132/232 (56%), Gaps = 22/232 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVH---GVTKFSDLTPSEFRR 111
+K K+ K+Y + E R RV+++NL+ ++ +L D + G+ ++DL +
Sbjct: 22 WKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADL----YNE 77
Query: 112 QFLGLNR-----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+F+ L + + + Q L LP+ DWR+ G VT VKDQG CGSCWSFSA
Sbjct: 78 EFMALKGSSGILQAKDQSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFSA 137
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG+LEG HF TG LVSLSEQQLVDC + GC+GGLM SA++YI AGGV
Sbjct: 138 TGSLEGQHFAKTGTLVSLSEQQLVDCSWSYG-------NYGCSGGLMESAYDYIRDAGGV 190
Query: 227 EREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ E YPYT + G C FD+SK +A + ++ S DE + + GP+A
Sbjct: 191 QLESAYPYTAQN-GRCHFDQSKAVATCTGHVAIPSGDEQSLMQAVGTVGPVA 241
>gi|144228217|gb|ABO93617.1| papain-like cysteine proteinase [Vitis vinifera]
Length = 161
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 75/112 (66%), Positives = 92/112 (82%), Gaps = 4/112 (3%)
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
QLVDCDHECDPEE G+CD GCNGGLM SAFEYILKAGGVERE+ YPY G+D GSCKF+KS
Sbjct: 1 QLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVEREETYPYIGSDRGSCKFNKS 60
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
+I A+VSNFSV+S DEDQ+AAN+VK+GPLA + ++ + +++ VS P
Sbjct: 61 QIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQ----TYMKGVSCP 108
>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
Length = 294
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 102/243 (41%), Positives = 140/243 (57%), Gaps = 37/243 (15%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
R+IL ++LLL+ S + +A+ N D SE+ LL+ F + + K
Sbjct: 5 RMILKLVMLLLVFSSV-TAITYNPRDL------------SENGLLSL---FDRWCNHHGK 48
Query: 63 TYATQEEHDYRFRVFKANL-RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----- 116
TY T ++ RF+VFK NL ++ + T G+ FSDLT EFR Q +GL
Sbjct: 49 TY-TAKQRPLRFQVFKENLFYISEHNSRGNHTFWLGLNAFSDLTSDEFRTQQMGLRGHPP 107
Query: 117 --NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
R R P K+ +L ++P+ DWRD AVTGVKDQGACG CW+FSATGA+EG +
Sbjct: 108 SLKSRRREP----KSGLLELYNIPSSLDWRDKDAVTGVKDQGACGDCWAFSATGAIEGIN 163
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ TG LVSLSEQ+L DCD S +SGC+GGLM+ AF++++ GG++ E DYPY
Sbjct: 164 KIVTGSLVSLSEQELCDCDT--------SYNSGCDGGLMDYAFQWVIVNGGIDTEVDYPY 215
Query: 235 TGT 237
G
Sbjct: 216 KGV 218
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 97/249 (38%), Positives = 138/249 (55%), Gaps = 20/249 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP----TAVHGVTKFSDLT 105
E ++++FK+K +KTY+ E+ R+ +++ NL++ + L T G K++D+T
Sbjct: 19 EANWAIFKAKHNKTYSGDEDIIRRY-IWQTNLQKIEAHNELYAKGLSTYFLGENKYADMT 77
Query: 106 PSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EFRR GL L P D + + LPT DWR G VT VKDQG CGSCW+F
Sbjct: 78 NEEFRRTLSGLRVDKELTPGDFVSG--MFKDSLPTAVDWRKEGYVTEVKDQGQCGSCWAF 135
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TG+LEG HF +T +LVSLSE LVDC + + GCNGGLM++AF+YI
Sbjct: 136 STTGSLEGQHFKATKQLVSLSESNLVDCSKKWG-------NQGCNGGLMDNAFKYIADNK 188
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASI 283
G++ EK YPY D C F K+ + A + I+S ED + + GP++ +I
Sbjct: 189 GIDTEKSYPYKPED-RKCNFKKANVGATDKLYKDITSGSEDALQEAVATIGPIS---VAI 244
Query: 284 ELPHISFSF 292
+ H SF
Sbjct: 245 DASHDSFQL 253
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 109/278 (39%), Positives = 150/278 (53%), Gaps = 26/278 (9%)
Query: 9 LLLLLLSSVLASAVA---VNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKS 58
L L++ + ASA+A D+ IRQVV SDG E + ++ H F+ F
Sbjct: 8 LALVVAGGLFASALAGPATFADENPIRQVV-SDGLHELENAILQVVGKTRHALSFARFAH 66
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
++ K Y + EE RF VF NL+ + + GV +F+DLT EFRR LG +
Sbjct: 67 RYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQ 126
Query: 119 RLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ K + TN LP DWR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 127 NC---SATTKGNLKVTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYSQA 183
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPYTG
Sbjct: 184 FGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGK 236
Query: 238 DGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
+ G CKF + V N ++ + DE + A LV+
Sbjct: 237 N-GLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVR 273
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 94/253 (37%), Positives = 133/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D NA+ H +KS + Y T EE ++R V++ N+R + HG T
Sbjct: 22 DQTFNAQWH--QWKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H+ + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 324
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 134/238 (56%), Gaps = 21/238 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR------RQLLDPTAVHGVTK 100
L+ + + FK FSK+Y E RF +F +NL R + R L T GV K
Sbjct: 17 LSDKEKWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGL--STYEMGVNK 74
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
F+DLTP EF +F L R+ + +++A DLP + DW GAVT VK QG+CGS
Sbjct: 75 FADLTPEEFMERFRPL-RKTKPKFLSEQAKFNFDGDLPAEVDWTKQGAVTEVKSQGSCGS 133
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG++E +F+ TG+L+SLSEQQLVDC +SGC GG M+ A EYI
Sbjct: 134 CWAFSTTGSVESHNFIKTGKLISLSEQQLVDCVKN---------NSGCAGGWMDIALEYI 184
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
+A G+ E DYPY + +C+F+ SK A + ++ I +DE + + GP++
Sbjct: 185 -EADGIMSEDDYPYEERN-TTCRFNNSKAAVQIKSYKAIKKNDEIDLQKAVALEGPVS 240
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 96/255 (37%), Positives = 139/255 (54%), Gaps = 19/255 (7%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D I P D E S D L+ F + S F K Y T EE RF VFK NL+
Sbjct: 30 DYSIVGYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDET 85
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWR 144
+ G+ +F+DL+ EF++ +LGL + + + D+ P DWR
Sbjct: 86 NKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWR 145
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
GAV VK+QG+CGSCW+FS A+EG + + TG L +LSEQ+L+DCD +
Sbjct: 146 KKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT--------TY 197
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--DKSKIAAAVSNFSVISSD 262
++GCNGGLM+ AFEYI+K GG+ +E+DYPY+ + G+C+ D+S+ + V ++D
Sbjct: 198 NNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKDESETVTINGHQDVPTND 256
Query: 263 EDQMAANLVKHGPLA 277
E + L H PL+
Sbjct: 257 EKSLLKALA-HQPLS 270
>gi|343472970|emb|CCD15012.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 382
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 92/233 (39%), Positives = 125/233 (53%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAIDWRKKGAVTPVKDQGQCDSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC GG + AF++I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TNDFGCGGGFSDPAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G DKS + A + + + DE+ +A L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKNGPVA 261
>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 93/271 (34%), Positives = 142/271 (52%), Gaps = 25/271 (9%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ + ++ + ++ TY E + RF F+ NLR +
Sbjct: 28 IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG +R +L A Q A ++LP DWR
Sbjct: 85 VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKK 141
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG CGSCW+FSA A+EG + + TG+++ LSEQ+LVDCD S +
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQ 193
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ E+DYPY D K+ + + + + ++
Sbjct: 194 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKS 253
Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVS 297
V + P++ +IE +F +VS
Sbjct: 254 LQKAVANQPIS---VAIEAGGRAFQLYKSVS 281
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 108/291 (37%), Positives = 143/291 (49%), Gaps = 38/291 (13%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L L LL +++A VA N + + Q + FK+ K+Y +
Sbjct: 2 LRLSLLCAIVAVTVAANSHEILRTQ-------------------WEAFKTTHKKSYESHM 42
Query: 69 EHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
E RF++F N L AK V G+ +F DL EF + F G R R
Sbjct: 43 EELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGY-RGQRTSR 101
Query: 125 DAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
+ P ND LP+ DWR GAVT VKDQG CGSCW+FSATG+LEG HFL GELV
Sbjct: 102 GSTFMPPANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELV 161
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQ LVDC ++GC GGLM++AF+YI G++ E+ YPY D C
Sbjct: 162 SLSEQNLVDCSQSFG-------NNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMD-DKC 213
Query: 243 KFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+F K + A + F I ED + + GP++ +I+ H SF
Sbjct: 214 RFKKEDVGATDTGFVDIEGGSEDDLKKAVATVGPIS---VAIDAGHSSFQL 261
>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 357
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 102/257 (39%), Positives = 137/257 (53%), Gaps = 22/257 (8%)
Query: 26 DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
D+ IR V SDG E+S +L H F+ F ++ K Y EE RF +FK
Sbjct: 27 DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
NL + + GV +F+DLT EF+R LG + A + + + LP
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DWR+ G V+ VKDQG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
+ GCNGGL + AFEYI GG++ EK YPYTG D +CKF + V N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254
Query: 256 FSVISSDEDQMAANLVK 272
++ + DE + A LV+
Sbjct: 255 ITLGAEDELKHAVGLVR 271
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 131/239 (54%), Gaps = 17/239 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
+K +K Y+ E R+ ++K N RR + L + + +F D+T SEF+
Sbjct: 30 WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFK----A 85
Query: 116 LNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
N L + P N + P DWR+ G VT VKDQG CGSCW+FS TG+LEG H
Sbjct: 86 FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQH 145
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F TG+LVSLSEQ LVDC + ++GC+GGLM++AF YI + G++ E YPY
Sbjct: 146 FKKTGKLVSLSEQNLVDC-------STAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPY 198
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
T D G C F KS +AA + F I +E+++ + GP++ +I+ H SF F
Sbjct: 199 TAED-GKCVFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPIS---VAIDASHESFQF 253
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/291 (36%), Positives = 148/291 (50%), Gaps = 38/291 (13%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L LL+L++ L+S ++ DA + + H+ L+KS SK Y +E
Sbjct: 2 LPLLVLTACLSSVLSAPVLDAQLNE------------------HWDLWKSWHSKKYHEKE 43
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLG--LNRRLRL 122
E +R V++ NL++ + L H G+ F D+T EFR+ G L + +
Sbjct: 44 E-GWRRMVWEKNLQKIELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLKTQRKF 102
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
P T P+ DWR+ G VT VKDQG CGSCW+FS TGALEG F TG+LV
Sbjct: 103 TGSLFMEPNFMT--APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLV 160
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQ LVDC PE + GC GGLM+ AF+Y+ G++ E YPYTGTD C
Sbjct: 161 SLSEQNLVDCSR---PE----GNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPC 213
Query: 243 KFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+D +A + F V S E + + GP++ +I+ H SF F
Sbjct: 214 HYDPLYNSANDTGFVDVPSGKEHALMKAVASVGPVS---VAIDAGHESFQF 261
>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
Length = 334
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/253 (37%), Positives = 133/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D NA+ H +KS + Y T EE ++R V++ N+R + HG T
Sbjct: 22 DQTFNAQWH--QWKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H+ + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEYAVANDTGFVDIPQQEKALMKPVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/251 (39%), Positives = 134/251 (53%), Gaps = 20/251 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+K SK Y +EE +R V++ NLR+ + L H G+ F D+T
Sbjct: 25 DQHWQLWKGWHSKNYHEKEE-GWRRLVWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G RR + + + N L P DWRD G VT VKDQG CGSCW+
Sbjct: 84 HEEFRQIMNGYKRREQRKYSG--SLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSCWA 141
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGALEG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+
Sbjct: 142 FSTTGALEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDN 194
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVAS 282
G++ E YPY GTD C+++ A + F I S +++ V GP++ +
Sbjct: 195 QGLDSEDFYPYKGTDDQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVS---VA 251
Query: 283 IELPHISFSFL 293
I+ H SF F
Sbjct: 252 IDAGHESFQFY 262
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 128/229 (55%), Gaps = 19/229 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + +K K+Y++ E R +F L ++ L + T G+ KFSDLT +EFR
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++G + + P + P + + LPT DWR GAVT +KDQG CGSCW+FSA
Sbjct: 62 NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
++E AHFL+T ELVSLSEQQL+DCD + D GC GG AF+++++ GGV
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
E+ YPYTG GSC +K+K+ ++ + ++ D V P+
Sbjct: 170 TEEAYPYTGF-AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPV 216
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 98/261 (37%), Positives = 136/261 (52%), Gaps = 22/261 (8%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAV---HG 97
S +L AE +S FK+K K+Y ++ E +R +++ N + AK + V
Sbjct: 18 SYQEVLGAE--WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMA 75
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVK 153
+ +F D+ EF G R + + P N LP DWR GAVT VK
Sbjct: 76 MNEFGDMLHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVK 135
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+QG CGSCW+FSATG+LEG HF +G +VSLSEQ LV C + ++GC GGLM
Sbjct: 136 NQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFG-------NNGCEGGLM 188
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
+ AF+YI G++ EK YPY GTD G+C F KS + A S F + E Q+ +
Sbjct: 189 DDAFKYIRANKGIDTEKSYPYNGTD-GTCHFKKSTVGATDSGFVDIKEGSETQLKKAVAT 247
Query: 273 HGPLAGNVASIELPHISFSFL 293
GP++ +I+ H SF F
Sbjct: 248 VGPIS---VAIDASHESFQFY 265
>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
Full=Senescence-associated gene product 2; Flags:
Precursor
gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 358
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/257 (39%), Positives = 137/257 (53%), Gaps = 22/257 (8%)
Query: 26 DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
D+ IR V SDG E+S +L H F+ F ++ K Y EE RF +FK
Sbjct: 27 DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
NL + + GV +F+DLT EF+R LG + A + + + LP
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DWR+ G V+ VKDQG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
+ GCNGGL + AFEYI GG++ EK YPYTG D +CKF + V N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254
Query: 256 FSVISSDEDQMAANLVK 272
++ + DE + A LV+
Sbjct: 255 ITLGAEDELKHAVGLVR 271
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 128/229 (55%), Gaps = 19/229 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + +K K+Y++ E R +F L ++ L + T G+ KFSDLT +EFR
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++G + + P + P + + LPT DWR GAVT +KDQG CGSCW+FSA
Sbjct: 62 NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
++E AHFL+T ELVSLSEQQL+DCD + D GC GG AF+++++ GGV
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
E+ YPYTG GSC +K+K+ ++ + ++ D V P+
Sbjct: 170 TEEAYPYTGF-AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPV 216
>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/280 (38%), Positives = 145/280 (51%), Gaps = 40/280 (14%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
LLL+L++V+ S AV+ D + Q +S FK + SK Y ++ E
Sbjct: 3 LLLILAAVVISCQAVSFYDLVQEQ-------------------WSSFKMQHSKNYDSETE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
+R ++F N + AK +L V G+ K++D+ EF G N+ +
Sbjct: 44 ERFRMKIFMENAHKVAKHSKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILK 103
Query: 123 PADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+D A I P N LP DWRD GAVT VKDQG CGSCWSFS +G+LEG HF TG
Sbjct: 104 GSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQHFRKTG 163
Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
+LVSLSEQ LVDC SG ++GCNGGLM++AF YI GG++ E+ YPY D
Sbjct: 164 KLVSLSEQNLVDC--------SGRYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAED 215
Query: 239 GGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
C + A F I +ED + A + GP++
Sbjct: 216 -EKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPIS 254
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 152/289 (52%), Gaps = 20/289 (6%)
Query: 7 SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
S L+L S L ++A D +++ S+ +S D L+ F + S+ K Y T
Sbjct: 6 SKTLVLTCSLCLFLSLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYET 60
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL--RLPA 124
EE RF VFK NL+ R + G+ +F+DL+ EF+ ++LGL L R +
Sbjct: 61 IEEKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVNLSQRRES 120
Query: 125 DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
++ DLP DWR GAVT VK+QG CGSCW+FS A+EG + + TG L SL
Sbjct: 121 SNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSL 180
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
SEQ+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E DYPY + +C+
Sbjct: 181 SEQELIDCDT--------TYNNGCNGGLMDYAFSFIVQNGGLHKEDDYPYI-MEESTCEM 231
Query: 245 DKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
K + ++ + + + +Q + + PL+ +IE F F
Sbjct: 232 KKEETQVVTINGYHDVPQNNEQSLLKALANQPLS---VAIEASSRDFQF 277
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 97/256 (37%), Positives = 137/256 (53%), Gaps = 21/256 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
+LL E H LFK+ K Y +Q E R +++ N + + +L + + + K
Sbjct: 25 NLLADEWH--LFKATHKKEYPSQLEEKLRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N ++P DWR+ GA+T VKDQG
Sbjct: 83 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 142
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALEG F TG+LVSLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+YI G++ E YPY D G C+++ A F I S +ED++ A + GP+
Sbjct: 196 QYIKDNKGIDTENTYPYEAED-GVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 254
Query: 277 AGNVASIELPHISFSF 292
+ +I+ H SF F
Sbjct: 255 S---VAIDASHESFQF 267
>gi|146335576|gb|ABQ23397.1| cathepsin L [Trypanosoma carassii]
Length = 456
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 91/230 (39%), Positives = 129/230 (56%), Gaps = 13/230 (5%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK++ K+Y + E YR RVF+ +++ A+ +P A GVTKFSDLT EF+
Sbjct: 35 QFAAFKAEHGKSYTSAAEEGYRMRVFEESMKAAQAHAAANPHAKFGVTKFSDLTHEEFKT 94
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ A + P+ T P ++DWR GAVT VKDQG CGSCW+FS TG +E
Sbjct: 95 LYANGAAHFAAAAKRARRPVSVTGTAPDEWDWRKKGAVTPVKDQGHCGSCWTFSTTGNIE 154
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVERE 229
G ++ EL +LSEQ LV CD D GC+GGLM++AFE+I+ G V E
Sbjct: 155 GQWAVAGNELTNLSEQMLVSCDAR---------DYGCSGGLMDNAFEWIVNQNDGFVFTE 205
Query: 230 KDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ YPY G + C K+ A + + +DE++MAA L +GP++
Sbjct: 206 ESYPYASGSGDAPLCDVGGRKVGATIKGHVGLPNDEEKMAAWLAANGPIS 255
>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
Length = 473
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 136/227 (59%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + +++TY T+EE +R VF N+ RA++ Q LD TA +G+TKFSDLT EFR
Sbjct: 176 FKNFVTTYNRTYETKEETKWRMSVFANNMIRAQKLQALDQGTAQYGITKFSDLTEEEFRT 235
Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR P + P +P D+DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 236 IYL--NPLLREDPGQKMRLGKAPKGPVPPDWDWRTKGAVTKVKDQGMCGSCWAFSVTGNV 293
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GG+ ++A+ I GG+E E+
Sbjct: 294 EGQWFLNRGTLLSLSEQELLDCD---------KVDKACMGGVPSNAYSAIKTLGGLETEE 344
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G +C F K +++ +S +E ++AA L K+GP++
Sbjct: 345 DYSYHG-HLQACSFSAEKAKVYINDSVELSQNEYKLAAWLAKNGPIS 390
>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
Length = 440
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/233 (39%), Positives = 123/233 (52%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSA G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAIDWRKKGAVTPVKDQGQCHSSWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD D GC GG + AF++I+ + G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DFGCGGGFSDPAFKWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G DKS + A + + + DE+ +A L K GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKKGPVA 261
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/249 (37%), Positives = 138/249 (55%), Gaps = 24/249 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSE 108
++ +K++ K Y + EE R +++ NL + + L T G+ +F+DL E
Sbjct: 28 WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLQNEE 87
Query: 109 FRRQFLGLNRRLRLPADAQK-APILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
F G R+ + A K + LP+N+ LP DWR G VT VKDQG CGSCW+F
Sbjct: 88 FVAMMTGF--RVNGTSKAAKGSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQCGSCWAF 145
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SATG+LEG F TG+LVSLSEQ LVDC + + GC+GG M+ AF+YI+ AG
Sbjct: 146 SATGSLEGQQFKKTGKLVSLSEQNLVDCSYR---------NYGCHGGFMDRAFQYIIDAG 196
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASI 283
G++ E Y Y D G+C F K+ + A V+ ++ ++S ++ V H GP++ +I
Sbjct: 197 GIDTEATYSYRAVD-GNCHFKKANVGATVTGYTDVTSGSEKALQKAVAHIGPIS---VAI 252
Query: 284 ELPHISFSF 292
+ H F F
Sbjct: 253 DASHKFFKF 261
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 120/209 (57%), Gaps = 16/209 (7%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K+Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GGLM+ AF++I+ GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222
>gi|330842502|ref|XP_003293216.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
gi|325076482|gb|EGC30264.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
Length = 376
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 93/244 (38%), Positives = 134/244 (54%), Gaps = 17/244 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ + K K Y QE R+ +FK N+ V G+ F+DLT E+++
Sbjct: 34 FTEWTIKHGKQYENQE-FGRRYGIFKDNMDYVHDWNSKGSETVLGLNIFADLTNLEYQKY 92
Query: 113 FLG--LNRRLRLPADAQK-APILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
+LG +N L D + I ++D PT DW GAVT +KDQG CGSCWSFS T
Sbjct: 93 YLGTHVNSLLHRGYDGRALEEIFGSDDGRNPTSVDWNKKGAVTPIKDQGQCGSCWSFSTT 152
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G++EGAH + TG+LVSLSEQ LVDC + GC+GGLM++AF YI++ G++
Sbjct: 153 GSVEGAHQIKTGKLVSLSEQNLVDC-------SGAEGNLGCDGGLMDNAFIYIIQNKGID 205
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELP 286
E YPY G C F + I A +S + ++ + E Q+ + K+GP++ +I+
Sbjct: 206 TESSYPYKAQSGTKCLFKPTSIGATLSGYVNITAGSESQLETAVAKNGPVS---VAIDAS 262
Query: 287 HISF 290
H SF
Sbjct: 263 HNSF 266
>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 361
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/257 (39%), Positives = 137/257 (53%), Gaps = 22/257 (8%)
Query: 26 DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
D+ IR V SDG E+S +L H F+ F ++ K Y EE RF +FK
Sbjct: 27 DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
NL + + GV +F+DLT EF+R LG + A + + + LP
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DWR+ G V+ VKDQG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
+ GCNGGL + AFEYI GG++ EK YPYTG D +CKF + V N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254
Query: 256 FSVISSDEDQMAANLVK 272
++ + DE + A LV+
Sbjct: 255 ITLGAEDELKHAVGLVR 271
>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
Length = 489
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 99/226 (43%), Positives = 132/226 (58%), Gaps = 13/226 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 193 FRNFVITYNRTYESKEEAQWRLSVFVHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+L N LR P K + P ++DWR GAVT VKDQG CGSCW+FS TG +E
Sbjct: 253 TYL--NPLLREPGKKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVE 310
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G FL+ G L+SLSEQ+L+DCD D C GGL +SA+ I GG+E E D
Sbjct: 311 GQWFLNQGTLLSLSEQELLDCDK---------IDKACMGGLPSSAYSAIKNLGGLETEDD 361
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y Y G +C F K +++ +S +E ++AA L K GP++
Sbjct: 362 YSYRG-HMQACNFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 406
>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
Length = 490
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 99/242 (40%), Positives = 138/242 (57%), Gaps = 24/242 (9%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY T+EE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 183 QDFSVKMASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKF 242
Query: 102 SDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
SDLT EFR +L R++RL P P ++DWR GAVT VKDQ
Sbjct: 243 SDLTEEEFRTIYLNPLLQEEPGRKMRLAKSVSSLP-------PPEWDWRKKGAVTKVKDQ 295
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD D GC GGL ++
Sbjct: 296 GMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDK---------VDKGCMGGLPSN 346
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
A+ I GG+E E+DY Y G +C F+ K +++ +S +E ++AA L + GP
Sbjct: 347 AYSAIKTLGGLETEEDYSYRG-HLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGP 405
Query: 276 LA 277
++
Sbjct: 406 IS 407
>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 150/297 (50%), Gaps = 50/297 (16%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
++LL+L +V++ A A V+P + E + ++K + K Y T+
Sbjct: 1 MMLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R +F+ N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96
Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
K P+L + ND LP DWR+ V+ VKDQG CGSCW+FS TG+LEG H
Sbjct: 97 ---KKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYITANGGLDTEESYPYTA 206
Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
TD CKFD S + A + + V S +E + + GP++ +I+ H SF F
Sbjct: 207 TDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260
>gi|375073980|gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei]
Length = 467
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 93/231 (40%), Positives = 122/231 (52%), Gaps = 14/231 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYKSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 QFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
++ G A+ + +P DWR GAVT VKDQG CGSCW+FSA G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGVPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVER 228
E FL+ L +LSEQ LV CD DSGC+GGLMN AFE+I++ G V
Sbjct: 157 ESQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNDAFEWIVQENDGAVYT 207
Query: 229 EKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 208 EESYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAANGPVA 258
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/284 (33%), Positives = 148/284 (52%), Gaps = 22/284 (7%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTYATQE 68
L+LS+ L A D +++ S +HL + + LF+S K SK Y + E
Sbjct: 11 LILSATLFITYATAHDFSIVGY--------SPEHLASMDKTIELFESWMSKHSKAYRSIE 62
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
E +RF +F NL+ + G+ +F+DL+ EF+ ++LGL ++
Sbjct: 63 EKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRSSRG 122
Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
DLP DWR GAVT VK+QG+CGSCW+FS A+EG + + TG L SLSEQ+
Sbjct: 123 FSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 182
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
L+DCD S ++GC GGLM+ AF+YI+ G+ +E+DYPY +G + +
Sbjct: 183 LIDCDR--------SFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQF 234
Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+S + + ++++Q + H P++ +IE +F F
Sbjct: 235 EVVTISGYEDVPANDEQSLLKALSHQPVS---VAIEASSRNFQF 275
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/248 (40%), Positives = 127/248 (51%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
++ EK YPY D G C+F K + A + + I + E + + GP++ +I+
Sbjct: 198 IDTEKSYPYKAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253
Query: 285 LPHISFSF 292
H SF
Sbjct: 254 ASHSSFQL 261
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 120/209 (57%), Gaps = 16/209 (7%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K+Y E + R+ F+ NLR
Sbjct: 26 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 82
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 83 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 142
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 143 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 194
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GGLM+ AF++I+ GG++ E DYPY G D
Sbjct: 195 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 223
>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
Length = 460
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/237 (42%), Positives = 137/237 (57%), Gaps = 14/237 (5%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY +QEE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 153 QDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKF 212
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT-DFDWRDHGAVTGVKDQGACGS 160
SDLT EFR +L N L+ P P D+P +DWR+ GAVT VKDQG CGS
Sbjct: 213 SDLTEEEFRTIYL--NPLLKDAPGRNMRPAQPVTDVPPPQWDWRNKGAVTNVKDQGMCGS 270
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 271 CWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKT---------DKACLGGLPSNAYSAI 321
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E DY Y G +C F K +++ +S +E ++AA L K+GP++
Sbjct: 322 RTLGGLETEDDYSYRGRL-QTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVS 377
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/248 (40%), Positives = 127/248 (51%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
++ EK YPY D G C+F K + A + + I + E + + GP++ +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253
Query: 285 LPHISFSF 292
H SF
Sbjct: 254 ASHSSFQL 261
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 97/282 (34%), Positives = 142/282 (50%), Gaps = 43/282 (15%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M+ IL+SLL++ +S+ L + +D A HF FK K
Sbjct: 1 MKSFILASLLVVAVSATL-----LKEDGA----------------------HFQSFKLKH 33
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGL 116
KTY Q E RF +F+ NLR+ + +H G+ KF+D+T +EF+ L
Sbjct: 34 GKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFK-AMLAT 92
Query: 117 NRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
+ + A K L +P DWR VT +KDQ CGSCW+F+ G+ EGA+
Sbjct: 93 QVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYA 152
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
LSTG+L SEQQLVDC + + GC+GG ++ F YI + G+E E DYPYT
Sbjct: 153 LSTGKLTRFSEQQLVDC--------TTDLNYGCDGGYLDDTFPYI-QTNGLELESDYPYT 203
Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G D G C ++ SK+ VS++ + ++E + + GP+A
Sbjct: 204 GYD-GYCSYESSKVVTKVSSYVSVPANEQALLEAVGTAGPVA 244
>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
Length = 597
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 139/238 (58%), Gaps = 14/238 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
S+D + F F + +++TY T+EE +R VF +N+ RA++ Q LD TA +GVTK
Sbjct: 289 SQDFSVKMASIFKNFVTTYNRTYQTKEEAQWRLSVFASNMVRAQKIQALDHGTAQYGVTK 348
Query: 101 FSDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
FSDLT EFR +L N LR +P + P ++DWR +GAVT VKDQG CG
Sbjct: 349 FSDLTEEEFRTIYL--NPLLREVPGKKMHLAKSIGDPAPPEWDWRKNGAVTKVKDQGMCG 406
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+
Sbjct: 407 SCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYSA 457
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I GG+E E DY Y G +C F K +++ +S +E ++AA L K GP++
Sbjct: 458 IKNLGGLETEDDYSYQG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPIS 514
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 97/255 (38%), Positives = 136/255 (53%), Gaps = 28/255 (10%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++H+ +K+ K Y +EE +R V++ NL++ + L H G+ +F D+T
Sbjct: 26 DNHWEQWKNWHGKKYHEKEE-GWRRMVWEKNLQKIELHNLEHSMGTHTYRLGMNRFGDMT 84
Query: 106 PSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACG 159
EFR+ G RR R + + N ++P DWR+ G VT VKDQG CG
Sbjct: 85 HEEFRQVMNGYKHKKERRFR------GSLFMEPNFLEVPNSLDWREKGYVTPVKDQGECG 138
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 139 SCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQY 191
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAG 278
I G++ E+ YPY GTD C +D AA + F + S E + + GP++
Sbjct: 192 IKDQNGLDSEESYPYVGTDDQPCHYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVS- 250
Query: 279 NVASIELPHISFSFL 293
+I+ H SF F
Sbjct: 251 --VAIDAGHESFQFY 263
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 41/285 (14%)
Query: 11 LLLLSSVLASAVAV-NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
LLLL LA + +DD+ IR +K +K Y+ E
Sbjct: 7 LLLLGVTLAYIIERPTEDDSWIR-----------------------WKMAHNKAYSHDGE 43
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
R+ ++K N RR + L + + +F D+T +EF+ N L +
Sbjct: 44 ETVRYTIWKDNERRIREHNLQGGDFLLEMNQFGDMTNNEFKD----FNGYLSHKHVSGST 99
Query: 130 PILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
+ P + + P DWR+ G VT VKDQG CGSCW+FS TG+LEG +F TG+LVSLSEQ
Sbjct: 100 FLTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQN 159
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
LVDC + ++GCNGGLM++AF YI + G++ E YPYT D G C F K
Sbjct: 160 LVDC-------STAYGNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAKD-GKCAFTKPN 211
Query: 249 IAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+AA + F I S DE+++ + GP++ +I+ H SF F
Sbjct: 212 VAATDTGFVDIPSGDENKLKEAVASVGPIS---VAIDASHFSFQF 253
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 100/248 (40%), Positives = 127/248 (51%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ KTY + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
++ EK YPY D G C+F K + A + + I + E + + GP++ +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253
Query: 285 LPHISFSF 292
H SF
Sbjct: 254 ASHSSFQL 261
>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 150/297 (50%), Gaps = 50/297 (16%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
++LL+L +V++ A A V+P + E + ++K + K Y T+
Sbjct: 1 MMLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R +F+ N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96
Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
K P+L + ND LP DWR+ V+ VKDQG CGSCW+FS TG+LEG H
Sbjct: 97 ---KKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSS 153
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206
Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
TD CKFD S + A + + V S +E + + GP++ +I+ H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 147/265 (55%), Gaps = 23/265 (8%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEE-HDYRFRVFKANLRRAKRRQLLDPTAVH-GVT 99
S D L+ E ++ + +KF K A+ D+RF FK N R + + G+
Sbjct: 4 SSDSDLSGE--YASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLN 61
Query: 100 KFSDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
+FSDLT EFR++FLGL + L++P D+ DLP DWR HGAVT K
Sbjct: 62 QFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPK 121
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
DQG+CG CW+F+ TGA+EG + + TG+LVSLSEQ+L+DCD + D GC+GGLM
Sbjct: 122 DQGSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKK--------ADKGCDGGLM 173
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVK 272
+A+++I++ GG++ E DYPY ++ C K + A+ + I ++Q V
Sbjct: 174 ENAYQFIVENGGLDTETDYPYHASE-SHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVA 232
Query: 273 HGPLAGNV--ASIELPHISFSFLFT 295
P++ + AS + H + S +FT
Sbjct: 233 KQPVSVAIEGASKDFQHYA-SGVFT 256
>gi|330794859|ref|XP_003285494.1| hypothetical protein DICPUDRAFT_149375 [Dictyostelium purpureum]
gi|325084585|gb|EGC38010.1| hypothetical protein DICPUDRAFT_149375 [Dictyostelium purpureum]
Length = 421
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 131/249 (52%), Gaps = 23/249 (9%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L + F+ + + + YA+ EE R+ +FKAN+ + + G+ F+D+T
Sbjct: 24 LQYRNAFTNWMIQNQRHYAS-EEFATRYNIFKANMDYVQEWNSKGSETILGLNAFADITN 82
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDL----PTDFDWRDHGAVTGVKDQGACGSCW 162
E+R +LG P DA T + DWR GAVT +K+Q CG CW
Sbjct: 83 QEYRANYLGT------PFDASSIVGTETEKIFAAPAATVDWRTKGAVTPIKNQQQCGGCW 136
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYIL 221
SFS TG+ EGAH +STG LVSLSEQ L+DC SGS + GCNGGLM AFEYI+
Sbjct: 137 SFSTTGSTEGAHQISTGNLVSLSEQNLIDC--------SGSYGNDGCNGGLMTLAFEYII 188
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
G++ E YPYT G CKF + I A +S+++ ++S + + P++
Sbjct: 189 NNKGIDTESSYPYTAETGTVCKFKTANIGATLSSYNNVTSGSESSLESAANVNPVS---V 245
Query: 282 SIELPHISF 290
+I+ H SF
Sbjct: 246 AIDASHNSF 254
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/255 (37%), Positives = 139/255 (54%), Gaps = 19/255 (7%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D I P D E S D L+ F + S F K Y T EE RF VFK NL+
Sbjct: 30 DYSIVGYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDET 85
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWR 144
+ G+ +F+DL+ EF++ +LGL + + + D+ P DWR
Sbjct: 86 NKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWR 145
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
GAV VK+QG+CGSCW+FS A+EG + + TG L +LSEQ+L+DCD +
Sbjct: 146 KKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT--------TY 197
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--DKSKIAAAVSNFSVISSD 262
++GCNGGLM+ AFEYI+K GG+ +E+DYPY+ + G+C+ D+S+ + V ++D
Sbjct: 198 NNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKDESETVTINGHQDVPTND 256
Query: 263 EDQMAANLVKHGPLA 277
E + L H PL+
Sbjct: 257 EKSLLKALA-HQPLS 270
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/248 (38%), Positives = 132/248 (53%), Gaps = 20/248 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H++L+KS +K Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 29 HWNLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMTNE 87
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + L N L P DWRD G VT VKDQG CGSCW+FS
Sbjct: 88 EFRQLMNGYKHKAERKVKG--SLFLEPNFLEAPRSLDWRDKGYVTPVKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATGALEG F TG++V LSEQ LV+C PE + GCNGGLM+ AF+Y+ G
Sbjct: 146 ATGALEGQQFRKTGKMVQLSEQNLVECSR---PE----GNEGCNGGLMDQAFQYVKDNQG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E+ YPY GTD C +D A + F + S E + + GP++ +I+
Sbjct: 199 LDSEESYPYLGTDDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAVTAVGPIS---VAID 255
Query: 285 LPHISFSF 292
H SF F
Sbjct: 256 AGHESFQF 263
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/241 (38%), Positives = 136/241 (56%), Gaps = 15/241 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTP 106
N F ++ ++ K+Y++ EE YR VF N LD ++ + ++DLT
Sbjct: 24 NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83
Query: 107 SEFRRQFLGLNRRLR--LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EF+ LG + LR P Q+ P LP D+P DWR GAVT VKDQG+CG+CWSF
Sbjct: 84 HEFKVSRLGFSPALRNFRPVLPQE-PSLP-RDVPDSLDWRKKGAVTAVKDQGSCGACWSF 141
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SATGA+EG + + TG L+SLSEQ+L+DCD S +SGC GGLM+ A+++++
Sbjct: 142 SATGAMEGINQIMTGSLISLSEQELIDCDR--------SYNSGCGGGLMDYAYQFVISNH 193
Query: 225 GVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
G++ E DYPY D GSC+ DK + + ++ I S+++ V P++ +
Sbjct: 194 GIDTENDYPYQARD-GSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGS 252
Query: 284 E 284
E
Sbjct: 253 E 253
>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 357
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 106/279 (37%), Positives = 152/279 (54%), Gaps = 20/279 (7%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
+L LSS +LL+L + AS D+ I+ V + + E + +L H FS F
Sbjct: 4 KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y + EE RF VFK NL + + + +F+DLT EF+R LG
Sbjct: 64 HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + + + +P DWR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233
Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVK 272
DGG CKF I V N ++ + DE + A LV+
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR 271
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 119/209 (56%), Gaps = 16/209 (7%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GGLM+ AF++I+ GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 93/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTK 100
++ A F+ FKS++ K Y + YR +V+K N + + R + + T +
Sbjct: 15 YIAEAASEFTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVTYKMALNH 74
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKA-PILPTND--LPTDFDWRDHGAVTGVKDQGA 157
+D+ P EF FLG NR LR + P D + + DWR GA++ VKDQG
Sbjct: 75 LADMHPREFMATFLGFNRSLRATNKVPEGIPFRHNKDAVIQKEVDWRQKGAISPVKDQGH 134
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALE FL G VSLSEQ L+DC ++GC GGLM AF
Sbjct: 135 CGSCWAFSSTGALEAHTFLKKGRRVSLSEQNLIDCSLNYG-------NNGCEGGLMEQAF 187
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+Y+ G++ E+ YPY G D C+F K+ + A + F I S DE + + GPL
Sbjct: 188 QYVRDNDGIDTEEAYPYEGED-SECRFKKNNVGATDAGFVTIPSGDEQALMEAVATQGPL 246
Query: 277 AGNVASIELPHISFSF 292
+ +I+ + SF F
Sbjct: 247 S---IAIDASNPSFQF 259
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 119/209 (56%), Gaps = 16/209 (7%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GGLM+ AF++I+ GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222
>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 150/297 (50%), Gaps = 50/297 (16%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
++LL+L +V++ A A V+P + E + ++K + K Y T+
Sbjct: 1 MMLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R +F+ N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96
Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
K P+L + ND LP DWR+ V+ VKDQG CGSCW+FS TG+LEG H
Sbjct: 97 ---KKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206
Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
TD CKFD S + A + + V S +E + + GP++ +I+ H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260
>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 358
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 106/279 (37%), Positives = 152/279 (54%), Gaps = 20/279 (7%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
+L LSS +LL+L + AS D+ I+ V + + E + +L H FS F
Sbjct: 4 KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y + EE RF VFK NL + + + +F+DLT EF+R LG
Sbjct: 64 HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + + + +P DWR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233
Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVK 272
DGG CKF I V N ++ + DE + A LV+
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR 271
>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
Length = 336
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 95/277 (34%), Positives = 148/277 (53%), Gaps = 20/277 (7%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQV-VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+++ +L ++ +A A +D + + +V P E +L F F +++K Y ++
Sbjct: 1 MIVFVLCAISFTAAAPQNDVSDVEKVRKPVFYSMDEAPIL-----FENFIREYNKKYDSK 55
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
E+ + RF++F NL+R AVHG+ KF+DL+ EF++ + G D
Sbjct: 56 EKEE-RFKIFVNNLKRINDLNHKSTNAVHGINKFTDLSKEEFKKFYTGFKPDKSFLDDNI 114
Query: 128 KAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
K P + ++ P FDWRD G VT VK+QG CGSCW+FS G +E + + G LV LS
Sbjct: 115 KKPSQLSFNITAPPAFDWRDKGVVTRVKNQGTCGSCWAFSTIGNVESVNAIKHGNLVELS 174
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCD S D C+ GL ++A +Y++ G + E+ YPY G +C +D
Sbjct: 175 EQQLVDCD---------SKDEACDSGLPDNAQQYLVSHGAIS-EQSYPYKGY-AANCTYD 223
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVAS 282
S++ +SNF + E QMA L PL+ +A+
Sbjct: 224 SSQVVVRLSNFEKVVLSECQMAEKLYSTAPLSIVIAA 260
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 97/247 (39%), Positives = 132/247 (53%), Gaps = 17/247 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+KS SK Y +EE +R V++ NL+ + L H G+ +F D+T
Sbjct: 43 HWQLWKSWHSKDYHEREE-SWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAE 101
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EFR+ G + + P+ + P DWR+ G VT VKDQG CGSCW+FS
Sbjct: 102 EFRQLMNGYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFST 161
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ GG+
Sbjct: 162 TGALEGQHFRKTGKLVSLSEQNLVDCSR---PEG----NQGCNGGLMDQAFQYVQDNGGI 214
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIEL 285
+ E+ YPYT D C++ AA + F I E + + GP++ +I+
Sbjct: 215 DSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVS---VAIDA 271
Query: 286 PHISFSF 292
H SF F
Sbjct: 272 GHSSFQF 278
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/274 (37%), Positives = 142/274 (51%), Gaps = 26/274 (9%)
Query: 33 QVVPSDGEQSEDHLL-------NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
QV+P E S + L + H+ L+KS K Y +EE +R V++ NL+ +
Sbjct: 107 QVIPVTKENSTETLHCRWQVDPELDGHWQLWKSWHRKDYHEREE-GWRRVVWEKNLKMIE 165
Query: 86 RRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PT 139
L H G+ +F D+T EFR+ G + + + + L N L P
Sbjct: 166 IHNLDHALGKHSYKLGMNQFGDMTTEEFRQLMNGYVHK-KSERKYRGSQFLEPNFLEAPR 224
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DWR+ G VT VKDQG CGSCW+FS TGALEG HF TG+LVSLSEQ LVDC PE
Sbjct: 225 SVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSR---PE 281
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
+ GCNGGLM+ AF+Y+ GG++ E+ YPYT D C++ AA + F I
Sbjct: 282 ----GNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDI 337
Query: 260 -SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
E + + GP++ +I+ H SF F
Sbjct: 338 PQGHERALMKAVAAVGPVS---VAIDAGHSSFQF 368
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 98/254 (38%), Positives = 134/254 (52%), Gaps = 23/254 (9%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDL 104
+ + FK + K Y + E +R ++F N + AK +L + V + K++D+
Sbjct: 23 VQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHKVAKXNKLYEMGLVSYKLKINKYADM 82
Query: 105 TPSEFRRQFLGLNRRLRLP-----ADAQKAP-ILPTN-DLPTDFDWRDHGAVTGVKDQGA 157
EF G NR P D Q A I P N P + DWR+HGAVT VKDQG
Sbjct: 83 LHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAPANVKFPENVDWREHGAVTXVKDQGH 142
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCWSFSATGALEG HF T +LVSLSEQ LVDC + + GCNGGLM++AF
Sbjct: 143 CGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDC-------STKFGNDGCNGGLMDNAF 195
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+Y+ G++ E YPY D C ++ A F I + DE+++ A + GP+
Sbjct: 196 KYVKYNHGIDTEASYPYHADD-EKCHYNPKTSGATDRGFVDIPTGDEEKLMAAVATVGPV 254
Query: 277 AGNVASIELPHISF 290
+ +I+ H SF
Sbjct: 255 S---VAIDASHESF 265
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/219 (39%), Positives = 124/219 (56%), Gaps = 21/219 (9%)
Query: 62 KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y E + R ++FK NL+ + L + T G+T+F+DLT E + F+ +R L
Sbjct: 11 KNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE-PKDFMKADRYL 69
Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
D LP + DWR GAV VKDQG CGSCW+FSA GA+EG + + TGE
Sbjct: 70 YKEGDI----------LPDEIDWRAKGAVVPVKDQGNCGSCWAFSAVGAVEGINQIKTGE 119
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
L+SLS+Q+L+DCD G ++GC GG+MN AFE+I+ GG+E ++DYPYT TD G
Sbjct: 120 LISLSDQELIDCDR-------GFVNAGCEGGVMNYAFEFIINNGGIESDQDYPYTATDLG 172
Query: 241 SCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLA 277
C DK V + ++ ++++ V H P+
Sbjct: 173 VCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVG 211
>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 479
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 97/236 (41%), Positives = 131/236 (55%), Gaps = 17/236 (7%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
A HF FK + K++ + +RF FK N++ A +P A + V+ KF+ LTP
Sbjct: 38 ASAHFMHFKKQHGKSFGEEAVEGHRFNAFKENMQTAVYLNAQNPHAHYDVSGKFAALTPQ 97
Query: 108 EFRRQFLGLNRRLR-LPADAQKAPILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF +Q+L + R L A ++A + + DWR+ GAVT VKDQG CGSCW+FS
Sbjct: 98 EFAKQYLNPDYYTRQLKAHKERAHVYEGVRGGLSAVDWREKGAVTEVKDQGLCGSCWAFS 157
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
A G +EG LS LVSLSEQ LV CD + D GCNGGLM+ A+ +I+K +
Sbjct: 158 AIGNIEGQWALSGNTLVSLSEQMLVSCD---------TVDMGCNGGLMDQAWAWIIKNHS 208
Query: 224 GGVEREKDYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPYT DG SC K+ A +S + DED + A L K+GP++
Sbjct: 209 GAVYTEVSYPYTSGDGSTASC-LSTGKVGARISGQVSLPQDEDAIEAWLEKNGPIS 263
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 99/275 (36%), Positives = 147/275 (53%), Gaps = 29/275 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+S++ A ++ + + +TY E + R++VF+ NLR
Sbjct: 29 IVSYGERSDEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 85
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG R +L A A DLP DWR
Sbjct: 86 VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAAD---NEDLPESVDWRAK 142
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG+ GSCW+FS A+EG + + TG+L+SLSEQ+LVDCD S +
Sbjct: 143 GAVAEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 194
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ EKDYPY GTDG K+ + ++ + +++++
Sbjct: 195 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 254
Query: 267 AANLVKHGPLAGNVASIELPHISF----SFLFTVS 297
V + P++ +IE F S +FT S
Sbjct: 255 LQKAVANQPVS---VAIEAAGTQFQLYSSGIFTGS 286
>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
Length = 336
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 97/248 (39%), Positives = 134/248 (54%), Gaps = 21/248 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +K +K Y +EE +R V++ NL++ + L H + F D+
Sbjct: 28 HWQQWKEWHNKDYHEKEE-GWRRMVWEKNLKKIELHNLEHSLGKHSYRLAMNHFGDMPHE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G ++R + + + N L P+ DWR+ G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQVMNGYKHKVR---KIRGSLFMEPNFLEAPSKLDWREKGYVTPVKDQGQCGSCWAFS 143
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI GG
Sbjct: 144 TTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PEG----NEGCNGGLMDQAFQYIKDNGG 196
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ EK YPY GTD C +D S AA + F + S E + + GP++ +I+
Sbjct: 197 LDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIPSGKEHALMKAVTAVGPVS---VAID 253
Query: 285 LPHISFSF 292
H SF F
Sbjct: 254 AGHESFQF 261
>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 134/258 (51%), Gaps = 38/258 (14%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
F +K +F ++Y + E R ++ +N R ++ + G+T F+D+ E
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 109 FRRQF----LG-----LNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
++RQ LG L RR LRLP A DLP DWR+ G VT VKDQ
Sbjct: 86 YKRQISQGCLGSFNASLPRRGSAYLRLPEGA---------DLPNSVDWREKGYVTDVKDQ 136
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
CGSCW+FS TG+LEG F TG+LVSLSEQQLVDC + E GC GGLM+S
Sbjct: 137 KQCGSCWAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNE-------GCMGGLMDS 189
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
AF YI GG++ E YPY D G C+++ + I A + + V DED + L G
Sbjct: 190 AFRYIQANGGIDTEDSYPYEAED-GQCRYNSANIGATCTGYVDVKQGDEDALKEALATIG 248
Query: 275 PLAGNVASIELPHISFSF 292
P++ +I+ H SF
Sbjct: 249 PVS---VAIDASHSSFQL 263
>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 107/280 (38%), Positives = 145/280 (51%), Gaps = 40/280 (14%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
LLL+L++V+ S AV+ D + Q +S FK + SK Y ++ E
Sbjct: 3 LLLILAAVVISCQAVSFYDLVQEQ-------------------WSSFKMQHSKNYDSETE 43
Query: 70 HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
+R ++F N + AK +L V G+ K++D+ EF G N+ +
Sbjct: 44 ERFRMKIFMENDHKVAKHSKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILK 103
Query: 123 PADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+D A I P N LP DWRD GAVT VKDQG CGSCWSFS +G+LEG HF TG
Sbjct: 104 GSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQHFRKTG 163
Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
+LVSLSEQ LVDC SG ++GCNGGLM++AF YI GG++ E+ YPY D
Sbjct: 164 KLVSLSEQNLVDC--------SGRYGNTGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAED 215
Query: 239 GGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
C + A F I +ED + A + GP++
Sbjct: 216 -EKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPVS 254
>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
Length = 377
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 106/279 (37%), Positives = 152/279 (54%), Gaps = 20/279 (7%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
+L LSS +LL+L + AS D+ I+ V + + E + +L H FS F
Sbjct: 4 KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y + EE RF VFK NL + + + +F+DLT EF+R LG
Sbjct: 64 HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + + + +P DWR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233
Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVK 272
DGG CKF I V N ++ + DE + A LV+
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR 271
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 129/233 (55%), Gaps = 17/233 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQF 113
+ ++ + YA E + R+ VFK N+ +R + T V +F+DLT EFR +
Sbjct: 40 WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 99
Query: 114 LGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
G L + + + ++ LP DWR GAVT +KDQG+CGSCW+FSA A
Sbjct: 100 TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 159
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + G+L+SLSEQ+LVDCD + D GC GG MNSAF Y + GG+ E
Sbjct: 160 IEGVAQIKKGKLISLSEQELVDCD---------TNDDGCMGGYMNSAFNYTMTTGGLTSE 210
Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
+YPY TD G+C +K+K IA ++ F + +++++ V H P++ +A
Sbjct: 211 SNYPYKSTD-GTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIA 262
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 97/290 (33%), Positives = 156/290 (53%), Gaps = 26/290 (8%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+L+L+ S L +++A D +++ S+ +S D L+ F + S+ K Y
Sbjct: 8 ALVLIACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYENI 62
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLP 123
EE RF +FK NL+ R + G+++F+DL+ EF ++LGL +RR P
Sbjct: 63 EEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHREFNNKYLGLKVDYSRRRESP 122
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+ + +LP DWR GAV VK+QG+CGSCW+FS A+EG + + TG L S
Sbjct: 123 EEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 178
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E+DYPY + G+C+
Sbjct: 179 LSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGACE 229
Query: 244 FDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
K + +S + + + +Q + + PL+ +IE F F
Sbjct: 230 MTKEETQVVTISGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQF 276
>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 327
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/245 (39%), Positives = 131/245 (53%), Gaps = 17/245 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
++ +K+ K+YA+ EE R +++ NLR + +H +TKF+DL E
Sbjct: 23 WNEWKNTHGKSYASHEELK-RQLIWEKNLRVVTQHNYEYDEGLHTYTMAMTKFADLENDE 81
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
F +L R+ P+ + PT DWR G VT VK+Q CGSCW+FS TG
Sbjct: 82 FAAMYLPRMRKDSRNGFCSAQPVGGFVENPTSIDWRTRGYVTPVKNQLQCGSCWAFSTTG 141
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+LEG HF T LVSLSEQQL+DC + D GC GG+M+ AF+YI AGGVE
Sbjct: 142 SLEGQHFAKTKNLVSLSEQQLMDCSFK-------EGDEGCGGGIMDYAFDYIFLAGGVES 194
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIELPH 287
E DYPY + C+FD S IAA ++ V S E Q+ + GP++ +I+ H
Sbjct: 195 EADYPYEARN-DHCRFDNSSIAATLTGCVDVTSGSETQLEKAVGSIGPVS---VAIDASH 250
Query: 288 ISFSF 292
ISF
Sbjct: 251 ISFQL 255
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 131/249 (52%), Gaps = 18/249 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSD 103
N + + +K+ + Y T EE +R V++ N++ + HG T F D
Sbjct: 24 NLDTQWYQWKATHRRLYGTNEE-GWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ + + + P+L +LP DWR G VT VK+Q CGSCW+
Sbjct: 83 MTNEEFRQVMVCFRNQKHKNRKVFRGPLL--LNLPKSVDWRKKGYVTPVKNQKQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC H + GCNGG MN+AF+Y+ +
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSHP-------QGNQGCNGGFMNNAFQYVKEN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG++ E YPY D GSCK+ A + F VI + E ++ + GP++ ++
Sbjct: 194 GGLDSEASYPYVAKD-GSCKYKPENSVANDTGFVVIPAHEKELMKAVATVGPIS---VAV 249
Query: 284 ELPHISFSF 292
+ H SF F
Sbjct: 250 DASHSSFQF 258
>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
Length = 338
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 89/241 (36%), Positives = 129/241 (53%), Gaps = 21/241 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
L N+E F F +K+ K YA E RF VFKANL R + +A G+ +SDL+
Sbjct: 30 LSNSEVLFDEFVTKYGKVYANDAERKSRFDVFKANLAIINERNAQEESATFGINFYSDLS 89
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---------LPTDFDWRDHGAVTGVKDQG 156
+E R+ G + L D +K T LP F+WRD AVT VK Q
Sbjct: 90 SNELLRKQTGF--KTALHNDNEKKSKYCTRRVITGPSTRLLPEAFNWRDSDAVTSVKQQR 147
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FSA +E +++ + V LSEQQ+VDCD ++GCNGGLM+ A
Sbjct: 148 DCGSCWAFSAVANIESQYYIKNKQYVDLSEQQIVDCD---------PINNGCNGGLMSWA 198
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
EY++++GGV+ E+DY Y G + G CK + + + S +E+++ LV +GP+
Sbjct: 199 MEYVMRSGGVQLEEDYQYVGNE-GVCKNNSANVVQISGCVSYDLRNEERLRELLVSNGPI 257
Query: 277 A 277
+
Sbjct: 258 S 258
>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 150/297 (50%), Gaps = 50/297 (16%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
++LL+L +V++ A A V+P + E + ++K + K Y T+
Sbjct: 1 MMLLILVAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R +F+ N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96
Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
K P+L + ND LP DWR+ V+ VKDQG CGSCW+FS TG+LEG H
Sbjct: 97 ---KKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206
Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
TD CKFD S + A + + V S +E + + GP++ +I+ H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 127/228 (55%), Gaps = 11/228 (4%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ + +K K+Y E + RF++FK NLR + T G+ +F+DLT E+R
Sbjct: 53 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSM 112
Query: 113 FLGLN---RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+LG +R + + + LP DWR GAV VKDQG+CGSCW+FS A
Sbjct: 113 YLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAA 172
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E
Sbjct: 173 VEGINKIVTGGLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDSE 224
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+DYPY +DG ++ K+ + + + ++++ V + P++
Sbjct: 225 EDYPYKASDGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVS 272
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 94/253 (37%), Positives = 134/253 (52%), Gaps = 23/253 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+HH++L+K + K Y + E R +++ NL+ L +H G+ D+T
Sbjct: 22 DHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMT 81
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
E + L LR+P+ + +N LP DWR+ G VT VK QGACG+CW
Sbjct: 82 SEEV----ISLMSSLRVPSQWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACW 137
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA GALE L TG+LVSLS Q LVD C E+ G + GCNGG M AF+YI+
Sbjct: 138 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTEAFQYIID 191
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVA 281
G++ E YPY TD G C++D AA S ++ + S ED + + GP++
Sbjct: 192 NNGIDSEASYPYKATD-GKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVS---V 247
Query: 282 SIELPHISFSFLF 294
+I+ H SF FL+
Sbjct: 248 AIDARHSSF-FLY 259
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 152/289 (52%), Gaps = 19/289 (6%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
S+ LL +S + + A D +++ D S D L + F + SK K+Y
Sbjct: 6 FSNFFLLFISMAVFAYSAFARDFSIVG--YSPDDLTSMDKLTDL---FESWMSKHGKSYR 60
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EE +RF VF+ NL+ + G+ +F+DL+ EF+R++LGL L D
Sbjct: 61 SFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLKIELPKRRD 120
Query: 126 A-QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
+ ++ DLP DWR GAV VK+QGACGSCW+FS A+EG + + TG L +L
Sbjct: 121 SPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTAL 180
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
SEQ+L+DCD ++GCNGGLM+ AF +I+ GG+ +E+DYPY + G+C
Sbjct: 181 SEQELIDCDK--------PFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV-MEEGTCGE 231
Query: 245 DKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
K ++ +S + + D +Q + + PL+ +IE F F
Sbjct: 232 KKEELEVVTISGYHDVPEDNEQSFLKALANQPLS---VAIEASSRGFQF 277
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 88/234 (37%), Positives = 129/234 (55%), Gaps = 18/234 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA-VHGVTKFSDLTPSEFR 110
+ + +++ + Y E +RF+VFKAN R V G +F+DLT EF
Sbjct: 58 RYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFA 117
Query: 111 RQFLGLNRRLRLPADAQKAPILPTN-------DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+ GL + +P+ A++ P + D DWR GAVT VK+QG CG CW+
Sbjct: 118 AMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWA 177
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSA GA+EG ++TG LVSLSEQQ++DCD E G + GCNGG M++AF+Y++
Sbjct: 178 FSAVGAMEGLIMITTGNLVSLSEQQILDCD-----ESDG--NQGCNGGYMDNAFQYVINN 230
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GGV E YPY+ G+C+ + AA +S F + S ++ AN V + P++
Sbjct: 231 GGVTTEDAYPYSAVQ-GTCQ--NVQPAATISGFQDLPSGDENALANAVANQPVS 281
>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
Length = 627
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/236 (36%), Positives = 132/236 (55%), Gaps = 15/236 (6%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
L +H F F+ +F + Y E R R+F+ NL+ + + +A +G+T+F+D+
Sbjct: 314 LDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADM 373
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E++ + GL +R ++P + P +FDWR AVT VK+QG+CGSCW
Sbjct: 374 TSTEYKER-TGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCW 432
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 433 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 483
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
GG+E E +YPY C F+++ VS F + +E M L+ HGP++
Sbjct: 484 IGGLEYEAEYPYEAKK-QQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPIS 538
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 129/233 (55%), Gaps = 17/233 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQF 113
+ ++ + YA E + R+ VFK N+ +R + T V +F+DLT EFR +
Sbjct: 34 WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 93
Query: 114 LGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
G L + + + ++ LP DWR GAVT +KDQG+CGSCW+FSA A
Sbjct: 94 TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 153
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + G+L+SLSEQ+LVDCD + D GC GG MNSAF Y + GG+ E
Sbjct: 154 IEGVAQIKKGKLISLSEQELVDCD---------TNDDGCMGGYMNSAFNYTMTTGGLTSE 204
Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
+YPY TD G+C +K+K IA ++ F + +++++ V H P++ +A
Sbjct: 205 SNYPYKSTD-GTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIA 256
>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
Length = 442
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/236 (37%), Positives = 133/236 (56%), Gaps = 20/236 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK+ ++ YA+ +E RF +F AN+++A +P A G +F+D++ EF+ +
Sbjct: 25 FRDFKTTHARNYASADEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84
Query: 113 FLGLNRRLRLPADAQKAPILPTND-----LPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
+ A K T + + DWR GAVT VK+QG+CGSCWSFS T
Sbjct: 85 HNAARHYAAVMARPPKNTKTFTEEEINAAVGQKVDWRLKGAVTPVKNQGSCGSCWSFSTT 144
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GG 225
G +EG H ++TG+LVSLSEQ+LV CD + D GC+GGLM++AF ++L A G
Sbjct: 145 GNIEGQHAIATGQLVSLSEQELVSCD---------TVDDGCSGGLMDNAFGWLLSAHNGQ 195
Query: 226 VEREKDYPYTGTDG--GSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ E YPY +G +C F+ + + A +++F I E MAA + K+GPL+
Sbjct: 196 ITTEASYPYVSGNGIVPACTFNSNSNPVGATITSFHDIPKTERDMAAFVFKYGPLS 251
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 97/290 (33%), Positives = 155/290 (53%), Gaps = 26/290 (8%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+L+L+ S L +++A D +++ S+ +S D L+ F + S+ K Y
Sbjct: 8 ALVLIACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYENI 62
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLP 123
EE RF +FK NL+ R + G+ +F+DL+ EF ++LGL +RR P
Sbjct: 63 EEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHREFNNKYLGLKVDYSRRRESP 122
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+ + +LP DWR GAV VK+QG+CGSCW+FS A+EG + + TG L S
Sbjct: 123 EEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 178
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+L+DCD + ++GCNGGLM+ AF +I++ GG+ +E+DYPY + G+C+
Sbjct: 179 LSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGTCE 229
Query: 244 FDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
K + +S + + + +Q + + PL+ +IE F F
Sbjct: 230 MTKEETQVVTISGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQF 276
>gi|332030000|gb|EGI69825.1| Cathepsin L [Acromyrmex echinatior]
Length = 328
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 138/256 (53%), Gaps = 20/256 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVT 99
+ +L+AE + +FK+ K Y + E YR ++F N R+ ++ +L + G+
Sbjct: 22 NKILDAE--WFIFKTHHKKIYKSSVEEGYRMKIFLDNKRKIAEHNRKYELNEVPYKLGMN 79
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL-PTN-DLPTDFDWRDHGAVTGVKDQGA 157
K+ D+ EF G N+ + A + P N +LP + DWR HGAVT VKDQG
Sbjct: 80 KYGDMLHHEFVNTLNGFNKSEKAQKQFMGATFISPANVELPKEVDWRKHGAVTEVKDQGH 139
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TG+LEG HF TG LVSLSEQ L+DC E GCNGGLM++AF
Sbjct: 140 CGSCWAFSTTGSLEGQHFRQTGILVSLSEQNLIDCSGNYGNE-------GCNGGLMDNAF 192
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+Y+ G++ EK YPY + C+++ A + F I +E ++ A + GP+
Sbjct: 193 KYVRDNKGLDTEKSYPYE-AENDKCRYNPRNSGAIDTGFVDIPRGNEHKLKAAVATIGPV 251
Query: 277 AGNVASIELPHISFSF 292
+ +I+ H SF
Sbjct: 252 S---VAIDASHESFQL 264
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/244 (35%), Positives = 134/244 (54%), Gaps = 16/244 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ L+ ++ + Y +E RF VFK N + + G+ +F+DL+ EF+
Sbjct: 42 YELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+LG +RL P +++ DLP DWR+ GAVT VKDQG+CGSCW+FS
Sbjct: 102 YLGAKLDTKKRLSRPP-SRRYQYSDGEDLPESIDWREKGAVTSVKDQGSCGSCWAFSTVA 160
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EG + + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++
Sbjct: 161 AVEGINQIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIINNGGLDS 212
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
E+DYPYT DG + K+ + ++ + ++++ + P++ +IE
Sbjct: 213 EEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPIS---VAIEASGR 269
Query: 289 SFSF 292
F F
Sbjct: 270 EFQF 273
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 133/251 (52%), Gaps = 20/251 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
E H+ L+K+ SK+Y EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKSYHESEE-GWRRMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + + + + N L P DWR+ G VT VKDQG+CGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
G++ E+ YPY GTD C + A + F + S E M + GP++ +
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVS---VA 253
Query: 283 IELPHISFSFL 293
I+ H SF F
Sbjct: 254 IDAGHESFQFY 264
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 99/256 (38%), Positives = 131/256 (51%), Gaps = 27/256 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
++ FK + K Y ++ E R +++ N + AK Q D V K++DL E
Sbjct: 28 WTAFKLQHRKKYDSETEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEE 87
Query: 109 FRRQFLGLNRRLR----------LPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGA 157
F G NR + P + I P N D+PT DWR GAVT VKDQG
Sbjct: 88 FVHTLNGFNRSVSGKGQLLRGELKPIEEPVTWIEPANVDVPTAMDWRTKGAVTQVKDQGH 147
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCWSFSATGALEG HF TG+LVSLSEQ LVDC + ++GCNGG+M+ AF
Sbjct: 148 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYG-------NNGCNGGMMDFAF 200
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+YI G++ EK YPY D C ++ + A F I +E + L GP+
Sbjct: 201 QYIKDNKGIDTEKSYPYEAID-DECHYNPKAVGATDKGFVDIPQGNEKALMKALATVGPV 259
Query: 277 AGNVASIELPHISFSF 292
+ +I+ H SF F
Sbjct: 260 S---VAIDASHESFQF 272
>gi|343474209|emb|CCD14094.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 307
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 88/233 (37%), Positives = 124/233 (53%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+F+A G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPKTVDWRKKGAVTPVKDQGKCDSSWAFAAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC G +++AF++I+ + G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFLDTAFKWIVSSNNGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G +C + A + + I +E+ +A L K GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCNKSGKVVGANIDDHVHILDNENAIAEWLAKKGPVA 261
>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 629
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/236 (36%), Positives = 132/236 (55%), Gaps = 15/236 (6%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
L +H F F+ +F + Y E R R+F+ NL+ + + +A +G+T+F+D+
Sbjct: 316 LDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADM 375
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E++ + GL +R ++P + P +FDWR AVT VK+QG+CGSCW
Sbjct: 376 TSTEYKER-TGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCW 434
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 435 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 485
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
GG+E E +YPY C F+++ VS F + +E M L+ HGP++
Sbjct: 486 IGGLEYEAEYPYEAKK-QQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPIS 540
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 91/246 (36%), Positives = 130/246 (52%), Gaps = 18/246 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+K+ K+Y +EE +R +++ NLR + L H G+ +F D+T
Sbjct: 28 HWHLWKNWHKKSYLPKEE-GWRRVLWEKNLRTIEFHNLDHSLGKHSYRLGMNQFGDMTNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EFR+ G + + AP + P DWR+ G VT VKDQG CGSCW+FS T
Sbjct: 87 EFRQLMNGYKNQKMIKGSTFLAP--NNFEAPKTVDWREKGYVTPVKDQGQCGSCWAFSTT 144
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GALEG H+ G+L+SLSEQ LVDC + GCNGGLM+ AF+Y+ GG++
Sbjct: 145 GALEGQHYRKAGKLISLSEQNLVDC-------SRAQGNQGCNGGLMDQAFQYVKDNGGID 197
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELP 286
E YPYT D C +D + +A + F V S E + + GP++ +++
Sbjct: 198 SEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSGSEKDLMKAVASVGPVS---VAVDAG 254
Query: 287 HISFSF 292
H SF F
Sbjct: 255 HKSFQF 260
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 94/253 (37%), Positives = 134/253 (52%), Gaps = 23/253 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+HH++L+K + K Y + E R +++ NL+ L +H G+ D+T
Sbjct: 34 DHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMT 93
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
E + L LR+P+ + +N LP DWR+ G VT VK QGACG+CW
Sbjct: 94 SEEV----ISLMSSLRVPSQWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACW 149
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA GALE L TG+LVSLS Q LVD C E+ G + GCNGG M AF+YI+
Sbjct: 150 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTEAFQYIID 203
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVA 281
G++ E YPY TD G C++D AA S ++ + S ED + + GP++
Sbjct: 204 NNGIDSEASYPYKATD-GKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVS---V 259
Query: 282 SIELPHISFSFLF 294
+I+ H SF FL+
Sbjct: 260 AIDARHSSF-FLY 271
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 91/228 (39%), Positives = 125/228 (54%), Gaps = 17/228 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVF-----KANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
+K+K+ KTY + E R ++ K A+ Q L + G+ F+D+ EFR
Sbjct: 30 YKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKL-GLNSFADMHNGEFR 88
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+ G R P ++ + LP DWR GAVT +K+QG CGSCW+FS TG+L
Sbjct: 89 KMMNGYRRGT--PRNSVVVHVESNITLPASVDWRTKGAVTPIKNQGQCGSCWAFSTTGSL 146
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG H L G+LVSLSEQ+LVDC + + GC+GGLM+ AF YI K G++ E+
Sbjct: 147 EGQHALKKGKLVSLSEQELVDC-------SAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQ 199
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
YPYTG D G+C F KS +AA V+ F V S E + GP++
Sbjct: 200 SYPYTGED-GTCSFKKSDVAATVTGFVDVTSGSESGLQDASATIGPIS 246
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 127/228 (55%), Gaps = 11/228 (4%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ + +K K+Y E + RF++FK NLR + T G+ +F+DLT E+R
Sbjct: 51 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSM 110
Query: 113 FLGLN---RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+LG +R + + + LP DWR GAV VKDQG+CGSCW+FS A
Sbjct: 111 YLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAA 170
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E
Sbjct: 171 VEGINKIVTGGLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDSE 222
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+DYPY +DG ++ K+ + + + ++++ V + P++
Sbjct: 223 EDYPYKASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVS 270
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/217 (39%), Positives = 119/217 (54%), Gaps = 16/217 (7%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
+ D RF +FK NLR + A + G+T F++LT E+R +LG RR+
Sbjct: 24 QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83
Query: 123 PADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ ND+ P DWR GAV +KDQG CGSCW+FS A+EG + + TGE
Sbjct: 84 AKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQ+LVDCD S + GCNGGLM+ AF++I+K GG+ EKDYPY GT+G
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGK 195
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K+ + + + S ++ V + P++
Sbjct: 196 CNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVS 232
>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
Length = 324
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 133/237 (56%), Gaps = 20/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F +F+K Y ++ E RF++F+ NL + D A + + KFSDL+
Sbjct: 21 LLKAPNYFEEFVLQFNKNYGSEIEKLRRFKIFQHNLNEIINKNQNDSAAKYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+
Sbjct: 81 KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPFEFDWRRLNKVTNVKNQGVCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+A +LE + +L+ LSEQQ++DCD S D+GCNGGL+++AFE +
Sbjct: 137 CWAFAALASLESQFAMKHNQLIDLSEQQMIDCD---------SVDAGCNGGLLHTAFEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
+K GGV+ EKDYPY + +C+ + +K V + + I E+++ L GP+
Sbjct: 188 IKMGGVQLEKDYPYEAAN-NNCRMNSNKFLVKVKDCYRYIIVYEEKLKDLLRSVGPI 243
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 95/278 (34%), Positives = 155/278 (55%), Gaps = 26/278 (9%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTY 64
SLLL+L+ S L+SA D ++I +++ H + + +L++S + K+Y
Sbjct: 11 SLLLMLIFSTLSSA----SDMSIISY------DETHIHHRSDDEVSALYESWLIEHGKSY 60
Query: 65 ATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RL 120
E D RF++FK NL+ ++ + + + G+TKF+DLT E+R +LG R
Sbjct: 61 NALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRR 120
Query: 121 RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+L + + D LP DWRD G + GVKDQG+CGSCW+FSA A+E + + TG
Sbjct: 121 KLSKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTG 180
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L+SLSEQ+LVDCD S + GC+GGLM+ AFE+++ GG++ E+DYPY +
Sbjct: 181 NLISLSEQELVDCDK--------SYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERND 232
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++ K+ + ++ + + ++ V H P++
Sbjct: 233 VCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVS 270
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 93/253 (36%), Positives = 136/253 (53%), Gaps = 23/253 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+HH++L+K +SK Y + E R +++ NL+ L +H G+ D+T
Sbjct: 25 DHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
E + L LR+P+ Q+ +N LP DWR+ G VT VK QG+CG+CW
Sbjct: 85 GEEV----ISLMGSLRVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 140
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA GALE L TG+LVSLS Q LVD C E+ G + GCNGG M +AF+YI+
Sbjct: 141 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTTAFQYIID 194
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
G++ E YPY + G C++D K AA S ++ + ED + + GP++
Sbjct: 195 NNGIDSEASYPYKAMN-GKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVS---V 250
Query: 282 SIELPHISFSFLF 294
+I+ H SF FL+
Sbjct: 251 AIDASHYSF-FLY 262
>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
Length = 365
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 28/254 (11%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDPTAVH 96
+E +F F +++K+Y +E+ YR+ VFK NL + ++ R+ L +A
Sbjct: 51 SEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQF 110
Query: 97 GVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
GV KFSD TP E FL L++ L + + P LP +DWRD VT +
Sbjct: 111 GVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPNIRLPDYYDWRDTNKVTPI 169
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
KDQG CGSCW+F A G +E + + +L+ LSEQQL+DCD D GCNGGL
Sbjct: 170 KDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCNGGL 220
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLV 271
M+ AF+ +L GGVE E DYPY G++ C D KIA + S F DE+++ +
Sbjct: 221 MHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLKELVY 279
Query: 272 KHGPLAGNVASIEL 285
GP+A V ++++
Sbjct: 280 TTGPVAIAVDAMDI 293
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 93/253 (36%), Positives = 136/253 (53%), Gaps = 23/253 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+HH++L+K +SK Y + E R +++ NL+ L +H G+ D+T
Sbjct: 33 DHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 92
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
E + L LR+P+ Q+ +N LP DWR+ G VT VK QG+CG+CW
Sbjct: 93 GEEV----ISLMGSLRVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 148
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA GALE L TG+LVSLS Q LVD C E+ G + GCNGG M +AF+YI+
Sbjct: 149 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTTAFQYIID 202
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
G++ E YPY + G C++D K AA S ++ + ED + + GP++
Sbjct: 203 NNGIDSEASYPYKAMN-GKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVS---V 258
Query: 282 SIELPHISFSFLF 294
+I+ H SF FL+
Sbjct: 259 AIDASHYSF-FLY 270
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 128/229 (55%), Gaps = 19/229 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + +K K+Y++ E R VF L ++ + T G+ KFSDLT +EFR
Sbjct: 2 FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++G + + P + P + + LPT DWR GAVT +KDQG CGSCW+FSA
Sbjct: 62 NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
++E AHFL+T ELVSLSEQQL+DCD + D GC GG + AF+++++ GGV
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPDDAFKFVVENGGVT 169
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
E+ YPYTG GSC +K+K+ ++ + ++ D V P+
Sbjct: 170 TEEAYPYTGF-AGSCNTNKNKV-VEITGYKDVTKDSADALMKAVSKTPV 216
>gi|15593255|gb|AAL02223.1|AF410883_1 cysteine protease CP19 precursor [Frankliniella occidentalis]
Length = 334
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 133/261 (50%), Gaps = 23/261 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
+PSD + + H+ FK+ +KTYA E YR +VFK N +R AK L
Sbjct: 18 IPSD--------MEIQAHWESFKATHAKTYANAVEEAYRAKVFKENAIRIAKHNDLFASG 69
Query: 94 AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
V G +++D+ E + G L+ + + DWR GA T
Sbjct: 70 EVTFKVGYNQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAAT 129
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
+KDQG CGSCWSFSATG+LEG FL LVSLSEQ LVDC + E GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
GLM+SAFEY+ GG++ E+ YPYT DG SC + + A + + V + E +
Sbjct: 183 GLMDSAFEYVKSNGGIDTEESYPYTAVDGDSCLYRAANNAGVNTGYKDVQAKSESALRDA 242
Query: 270 LVKHGPLAGNVASIELPHISF 290
+ K GP++ +I+ + SF
Sbjct: 243 VEKVGPVS---VAIDASNWSF 260
>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
Australia]
Length = 367
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 28/254 (11%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDPTAVH 96
+E +F F +++K+Y +E+ YR+ VFK NL + ++ R+ L +A
Sbjct: 53 SEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQF 112
Query: 97 GVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
GV KFSD TP E FL L++ L + + P LP +DWRD VT +
Sbjct: 113 GVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPNIRLPDYYDWRDTNKVTPI 171
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
KDQG CGSCW+F A G +E + + +L+ LSEQQL+DCD D GCNGGL
Sbjct: 172 KDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCNGGL 222
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLV 271
M+ AF+ +L GGVE E DYPY G++ C D KIA + S F DE+++ +
Sbjct: 223 MHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLKELVY 281
Query: 272 KHGPLAGNVASIEL 285
GP+A V ++++
Sbjct: 282 TTGPVAIAVDAMDI 295
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 87/229 (37%), Positives = 127/229 (55%), Gaps = 19/229 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F + +K K+Y++ E R +F L ++ + T G+ KFSDLT +EFR
Sbjct: 2 FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++G + + P + P + + LPT DWR GAVT +KDQG CGSCW+FSA
Sbjct: 62 NYVG---KFKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
++E AHFL+T ELVSLSEQQL+DCD + D GC GG AF+++++ GGV
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
E+ YPYTG GSC +K+K+ ++ + ++ D V P+
Sbjct: 170 TEEAYPYTGF-AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPV 216
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 128/224 (57%), Gaps = 13/224 (5%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
K K Y E D RF +FK NLR + T G+ +F+DLT E+R ++LG
Sbjct: 10 KHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEYRARYLGTRI 69
Query: 117 --NRR-LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
NRR ++ + + ++LP DWR+ AV VKDQG CGSCW+FS GA+EG
Sbjct: 70 DPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAFSTIGAVEGI 129
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TG+L+SLSEQ+LVDCD S + GCNGGLM+ A+E+I+ GG++ E+DYP
Sbjct: 130 NKIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAYEFIINNGGIDSEEDYP 181
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y DG ++ K+ + ++ + ++++ V + P++
Sbjct: 182 YRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVS 225
>gi|281200606|gb|EFA74824.1| cysteine proteinase 5 precursor [Polysphondylium pallidum PN500]
Length = 307
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 130/229 (56%), Gaps = 16/229 (6%)
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RRLRL 122
T +E RF +FK N+ + + V G+ +D++ E++R +LG + + R
Sbjct: 9 TAQEFGTRFNIFKKNMDFVHKWNAKGSSTVLGLNSMADISNEEYQRVYLGTHIDASQFRQ 68
Query: 123 PADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
A + K + T + + DWR GAVT +K+QG CGSCWSFS TG+ EGAHF+ TG L
Sbjct: 69 QAASHK--LGRTFKVQAANVDWRAKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFIKTGNL 126
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQ L+DC PE + GCNGGLM +AFEYI+K G++ E YPY DG
Sbjct: 127 VSLSEQNLMDCS---KPEG----NQGCNGGLMTAAFEYIIKNNGIDTESSYPYKAEDGKK 179
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
C ++ + AA +S++ +++ + A GP++ +I+ H SF
Sbjct: 180 CLYNPANSAATLSSYVNVTTGSESDLAVKSGLGPVS---VAIDASHNSF 225
>gi|66815893|ref|XP_641963.1| cysteine protease 4 [Dictyostelium discoideum AX4]
gi|166201984|sp|P54639.2|CYSP4_DICDI RecName: Full=Cysteine proteinase 4; Flags: Precursor
gi|60469981|gb|EAL67962.1| cysteine protease 4 [Dictyostelium discoideum AX4]
Length = 442
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 128/234 (54%), Gaps = 13/234 (5%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L + F+ + +TY++ EE + R+++FK+N+ + V G+ F+D+T
Sbjct: 24 LQYRNAFTNWMQAHQRTYSS-EEFNARYQIFKSNMDYVHQWNSKGGETVLGLNVFADITN 82
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
E+R +LG ++ I T PT DWR GAVT +K+QG CG CWSFS
Sbjct: 83 QEYRTTYLGTPFDGSALIGTEEEKIFST-PAPT-VDWRAQGAVTPIKNQGQCGGCWSFST 140
Query: 167 TGALEGAHFLSTG---ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
TG+ EGAHF+++G +LVSLSEQ L+DC ++GC GGLM AFEYI+
Sbjct: 141 TGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYG-------NNGCEGGLMTLAFEYIINN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G++ E YPYT DG CKF S I A + ++ ++S + + + P++
Sbjct: 194 KGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQSASNNAPVS 247
>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
Length = 379
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 100/227 (44%), Positives = 134/227 (59%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 82 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141
Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR P + K + P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 142 IYL--NPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 199
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 200 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 250
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G SC F K +++ V+S +E ++AA L K GP++
Sbjct: 251 DYSYQG-HMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPIS 296
>gi|343472974|emb|CCD15016.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 93/236 (39%), Positives = 124/236 (52%), Gaps = 14/236 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+FSATG
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSATG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC G + AF +I+ + G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTD---------DLGCRDGFPDIAFNWIVSSNKGNV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
E+ YPY G DKS + A + + ++ DED +A L + GP A V
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLARDEDMIAEWLARKGPAAITV 264
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 98/251 (39%), Positives = 133/251 (52%), Gaps = 20/251 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
E H+ L+K+ SK+Y EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKSYHESEE-GWRRMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + + + + N L P DWR+ G VT VKDQG+CGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
G++ E+ YPY GTD C + A + F I S E M + GP++ +
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVS---VA 253
Query: 283 IELPHISFSFL 293
I+ H SF F
Sbjct: 254 IDAGHESFQFY 264
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 95/272 (34%), Positives = 146/272 (53%), Gaps = 20/272 (7%)
Query: 11 LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEH 70
LS LA +++ D + QV E++E L + ++ K+ K Y E
Sbjct: 14 FYFLSVCLAIDMSIIDYNLKHGQVP----ERTEAETLRL---YEMWLVKYGKAYNALGEK 66
Query: 71 DYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLPADAQ 127
+ RF +FK NL+ + + +P+ G+ KF+DL+ E+R +LG ++ + RL +
Sbjct: 67 ERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGPK 126
Query: 128 KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A L +DLP DWR+ GAV VKDQG CGSCW+FS GA+EG + + TG L SLS
Sbjct: 127 SARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLS 186
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQ+LVDCD + GCNGGLM+ AFE+I+K GG++ E+DYPY D
Sbjct: 187 EQELVDCDK--------VYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNR 238
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K+ + + + ++++ V + P++
Sbjct: 239 KNARVVTIDGYEDVPQNDEKSLRKAVANQPVS 270
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 92/239 (38%), Positives = 130/239 (54%), Gaps = 17/239 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
+K +K Y+ E R+ ++K N RR + L + + +F D+T SEF+
Sbjct: 30 WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEFK----A 85
Query: 116 LNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
N L + P N + P DWR+ G VT VKDQG CGSCW+FS TG+LEG H
Sbjct: 86 FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQH 145
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F TG+LVSLSEQ LVDC + ++GCNGGLM++AF YI + G++ E YPY
Sbjct: 146 FKKTGKLVSLSEQNLVDC-------STAYGNNGCNGGLMDNAFTYIKENKGIDSEASYPY 198
Query: 235 TGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
T D G C F K +AA + F + +E+++ + GP++ +I+ H SF F
Sbjct: 199 TAED-GKCVFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPIS---VAIDASHESFQF 253
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 109/287 (37%), Positives = 146/287 (50%), Gaps = 42/287 (14%)
Query: 13 LLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDY 72
+ SS +A+AV V A +V P D+++ F+ FK+K+ K Y E
Sbjct: 1 MKSSCIAAAVLV----AAGHEVPP------PDYMM----MFNNFKTKYGKVYNGINEDAV 46
Query: 73 RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA-PI 131
RF +FKAN+ + T GV +F+DLT E + GL PA P
Sbjct: 47 RFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEELAASYTGLK-----PASLWSGLPR 101
Query: 132 LPTND-----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
L T++ L + DW G VT VK+QG CGSCWSFS TGALEGA LSTG LVSLSE
Sbjct: 102 LSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSE 161
Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
QQ VDCD + DSGCNGG M++AF + K + E YPYT TD G+C
Sbjct: 162 QQFVDCD---------TTDSGCNGGWMDNAFSFA-KKNSICTEGSYPYTATD-GTCNLSG 210
Query: 247 SKIA---AAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
++ V ++ +S+D +Q + V P++ +IE SF
Sbjct: 211 CQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVS---IAIEADQYSF 254
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 96/251 (38%), Positives = 132/251 (52%), Gaps = 22/251 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
F ++ KF +TY++ E R + + N + +L + G+T F+D+
Sbjct: 25 EFHAWRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENE 84
Query: 108 EFRRQF----LGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCW 162
E++R LG + LP LP N DLP DWRD G VT VKDQ CGSCW
Sbjct: 85 EYKRLISQGCLG-SFNASLPRRGSTFFRLPENKDLPAAVDWRDKGYVTDVKDQKQCGSCW 143
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSATG+LEG F TG+LVSLSEQQLVDC + + GC GGLM+ AF YI
Sbjct: 144 AFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYG-------NMGCGGGLMDDAFRYIQA 196
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVA 281
GG++ E+ YPY D G C++ + A + + +SS DED + + GP++
Sbjct: 197 TGGIDTEESYPYEAED-GECRYKPDAVGATCTGYVDVSSGDEDALQEAVATIGPIS---V 252
Query: 282 SIELPHISFSF 292
I+ HISF
Sbjct: 253 GIDASHISFQL 263
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 99/251 (39%), Positives = 133/251 (52%), Gaps = 20/251 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
E H+ L+K+ SK Y EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKHYHESEE-GWRRMVWEKNLKKIEIHNLEHTMGKHSYRLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + + + + N L P DWR+ G VT VKDQG+CGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
G++ E+ YPY GTD C + AA + F I S E M + GP++ +
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVS---VA 253
Query: 283 IELPHISFSFL 293
I+ H SF F
Sbjct: 254 IDAGHESFQFY 264
>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 329
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 134/256 (52%), Gaps = 31/256 (12%)
Query: 47 LNAEHHFSLFKSKFSKTYATQE------EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
L AE H+ +++F+ Q+ E R+ FK NL R ++ G T
Sbjct: 19 LFAEKHY---QNQFTNWMVVQDRQYDAYEFRTRYSAFKDNLDFIHRWNAVNKETELGATV 75
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN------DLPTDFDWRDHGAVTGVKD 154
F+DLT E+R +LG+N DA P + + DWR++GAV VKD
Sbjct: 76 FADLTNEEYRAVYLGMN------VDASNFAAQPATLDQVYQPVRSTLDWRNNGAVGRVKD 129
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG CGSCW+FS TGA+EGAH ++TG VSLSEQQL+DC + GC GGLM+
Sbjct: 130 QGQCGSCWAFSTTGAVEGAHQIATGNFVSLSEQQLMDCSRSYG-------NHGCQGGLMD 182
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
SA YI+K GG+ E+ YPY D +CK++ + A +S +S I + A + G
Sbjct: 183 SAMSYIVKQGGINTEESYPYEMRDSYTCKYNPANNGAKLSGYSNIKRGSEADLAAKLNIG 242
Query: 275 PLAGNVASIELPHISF 290
P+A +++ H SF
Sbjct: 243 PVA---IALDASHSSF 255
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 98/251 (39%), Positives = 133/251 (52%), Gaps = 20/251 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
E H+ L+K+ SK Y EE +R V++ NL++ + L H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKNYHASEE-GWRRMVWEKNLKKIEIHNLEHTMGKHSHRLGMNHFGDMT 85
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EFR+ G + + + + N L P DWR+ G VT VKDQG+CGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGA+EG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI
Sbjct: 144 FSTTGAMEGQPFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
G++ E+ YPY GTD C + AA + F + S E M + GP++ +
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVS---VA 253
Query: 283 IELPHISFSFL 293
I+ H SF F
Sbjct: 254 IDAGHESFQFY 264
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 137/256 (53%), Gaps = 21/256 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
+LL E H LFK+ K Y +Q E +R +++ N + + +L + + + K
Sbjct: 25 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N ++P DWR+ GA+T VKDQG
Sbjct: 83 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 142
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALEG F TG+L+SLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+YI G++ E YPY D C+++ A F I S +ED++ A + GP+
Sbjct: 196 QYIKDNKGIDTENTYPYEAED-DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 254
Query: 277 AGNVASIELPHISFSF 292
+ +I+ H SF F
Sbjct: 255 S---VAIDASHESFQF 267
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 85/228 (37%), Positives = 127/228 (55%), Gaps = 14/228 (6%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RRLRLPAD 125
E + RF+VFK NLR + + G+ +F+DLT E+R +LG +R RL
Sbjct: 70 EKERRFQVFKDNLRFIDEHNSENRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRS 129
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ + + LP DWR GAV VKDQG+CGSCW+FS A+EG + + TG+L+SLS
Sbjct: 130 SNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLS 189
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQ+LVDCD S + GCNGGLM+ AF++I+ GG++ E+DYPY DG +
Sbjct: 190 EQELVDCDR--------SYNEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYR 241
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFL 293
K+ + N+ + ++++ V + P++ +IE F F
Sbjct: 242 KNAKVVTIDNYEDVPVNDEKALQKAVANQPVS---VAIEAGGREFQFY 286
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 99/248 (39%), Positives = 127/248 (51%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
+ FK+ K+Y + E RF++F N L AK V G+ +F DL
Sbjct: 26 QWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R F G + R + P ND LP DWR GAVT VKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFL GELVSLSEQ LVDC ++GC GGLM AF+YI G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
++ EK YPY D G C+F K + A + + I + E + + GP++ +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253
Query: 285 LPHISFSF 292
H SF
Sbjct: 254 ASHSSFQL 261
>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
Length = 337
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 158/290 (54%), Gaps = 35/290 (12%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
++ +L+LLL L SAV + D QVV + + ++ +A +F F S+++K Y+
Sbjct: 1 MNKILILLL---LVSAVLTSHD-----QVVAVTIKPNLYNINSAPLYFEKFISQYNKQYS 52
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+++E YR+ +F+ N+ + + +AV+ + +F+D+T +E +NR L +
Sbjct: 53 SEDEKKYRYNIFRHNIESINAKNSRNDSAVYKINRFADMTKNEV------VNRHTGLASG 106
Query: 126 AQKAPILPT--------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
A T P +FDWR++ VT VKDQG CG+CW+F+ GALE + +
Sbjct: 107 DIGANFCETIVVDGPGQRQRPANFDWRNYNKVTSVKDQGMCGACWAFAGLGALESQYAIK 166
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
L+ L+EQQLVDCD D GC+GGL+++A+E I+ GGVE+E DYPY
Sbjct: 167 YDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMHIGGVEQEYDYPYKAV 217
Query: 238 DGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
C K A V N + + E+++ +L++H GP+A V +++L
Sbjct: 218 R-LPCAVKPHKFAVGVRNCYRYVLLSEERL-EDLLRHVGPIAIAVDAVDL 265
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 19/234 (8%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGV-------TKFSDLTPS 107
+ ++ +TYA EE R +F+AN R D A V +F+DLT
Sbjct: 46 WMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADLTDE 105
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD----FDWRDHGAVTGVKDQGACGSCWS 163
EFR GL R + L D DWR GAVTGVKDQG+CG CW+
Sbjct: 106 EFRAARTGLRRPAAVAGAVGGGFRYENFSLQADAAGSMDWRAMGAVTGVKDQGSCGCCWA 165
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSA A+EG + TG LVSLSEQQLVDCD D D GC GGLM++AF+YI +
Sbjct: 166 FSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGD-------DQGCEGGLMDNAFQYISRQ 218
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+ E YPY+G DGGSC+ +++ AA++ + ++ + V H P++
Sbjct: 219 GGLASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVS 272
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 91/235 (38%), Positives = 130/235 (55%), Gaps = 29/235 (12%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANL--------RRAKRRQLLDPTAVHGVTKFSDL 104
+ + + K Y + E+ RF++FK N+ RR L G+ KF+DL
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSL-------GLNKFADL 90
Query: 105 TPSEFRRQFLGLNRRLRLPADAQK-APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
T SEFR ++G RL+ PA + I D T DWR G VT +KDQG CGSCW+
Sbjct: 91 TNSEFRGLYVG---RLQRPAPFHEVGDIALVADTATSVDWRKKGGVTEIKDQGDCGSCWA 147
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSA A+EG FLSTG LVSLSEQ+LVDCD + + GC+GG+M+ AF+Y+++
Sbjct: 148 FSAVAAVEGLTFLSTGTLVSLSEQELVDCDT--------TVNQGCDGGIMDYAFQYMIRN 199
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+ + +YPY G+C DK K AA ++ F I +++ V + P++
Sbjct: 200 GGITSQSNYPYRALR-GACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVS 253
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 100/295 (33%), Positives = 150/295 (50%), Gaps = 41/295 (13%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L LLL+ ++LA+A A+ S L+N E ++ FK + +K Y
Sbjct: 3 LFLLLIVAILATAQAI-----------------SFFELVNQE--WTTFKMEHNKVYKNDI 43
Query: 69 EHDYRFRVFKANLRRAKRR----QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
E +R ++F N + + ++ + + K+ D+ EF G N+ +
Sbjct: 44 EERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQL 103
Query: 125 DAQKAPI-----LPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+++ PI P N LP DWR+HGAVT VKDQG CGSCWSFSATGALEG HF T
Sbjct: 104 RSERLPIGASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRT 163
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L+ LSEQ L+DC + ++GCNGGLM+ AF+YI G++ E YPY +
Sbjct: 164 GILIPLSEQNLIDCSGKYG-------NNGCNGGLMDQAFQYIKDNKGLDTEVTYPYE-AE 215
Query: 239 GGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
C+++ + A V + +E ++ A + GP++ +I+ H SF F
Sbjct: 216 NDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVS---VAIDASHQSFQF 267
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 90/245 (36%), Positives = 128/245 (52%), Gaps = 14/245 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ +KS K Y Q E D+R VF N++ T + +FSDLT EF +
Sbjct: 25 WEAWKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHNA-KSTFKMAINEFSDLTRKEFVKT 83
Query: 113 FLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ G ++ + + P N ++PT+ DWR G VT +K+QG CGSCW+FS TG+LE
Sbjct: 84 YNGYRLSMKKSTNKPSTFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLE 143
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G HF TG+LVSLSEQ L+DC + + GC GG M+ AFEYI G++ E
Sbjct: 144 GQHFRKTGKLVSLSEQNLIDC-------SAAEGNDGCGGGFMDDAFEYIKLNNGIDTEAS 196
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIELPHISF 290
YPY G D C++ K+ A + + I ED + A + GP++ +I+ H SF
Sbjct: 197 YPYEGRD-DICRYKKTNKGAIDTGYMDIKQYSEDDLKAAVATVGPIS---VAIDASHKSF 252
Query: 291 SFLFT 295
T
Sbjct: 253 HMYHT 257
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 95/254 (37%), Positives = 138/254 (54%), Gaps = 25/254 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+S FK + Y ++ E ++R +++ + AK Q + V G+ K+ D+ E
Sbjct: 27 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 86
Query: 109 FRRQFLGLNR------RLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACG 159
F + G N+ L + + + I P N LP DWR HGAVT +KDQG CG
Sbjct: 87 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 146
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCWSFS TGALEG HF +G LVSLSEQ L+DC E+ G ++GCNGGLM++AF+Y
Sbjct: 147 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-----SEQYG--NNGCNGGLMDNAFKY 199
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
I GG++ E+ YPY G D C+++ K+ A V + DE ++ + GP++
Sbjct: 200 IKDNGGIDTEQTYPYEGVD-DKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVS- 257
Query: 279 NVASIELPHISFSF 292
+I+ H SF
Sbjct: 258 --VAIDASHTSFQL 269
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 97/265 (36%), Positives = 146/265 (55%), Gaps = 23/265 (8%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEE-HDYRFRVFKANLRRAKRRQLLDPTAVH-GVT 99
S D L+ E ++ + +KF K A+ D RF FK N R + + G+
Sbjct: 4 SSDSDLSGE--YASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLN 61
Query: 100 KFSDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
+FSDLT EFR++FLGL + L++P D+ DLP DWR HGAVT K
Sbjct: 62 QFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPK 121
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
DQG+CG CW+F+ TGA+EG + + TG+L+SLSEQ+L+DCD + D GC+GGLM
Sbjct: 122 DQGSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKK--------ADKGCDGGLM 173
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVK 272
+A+++I++ GG++ E DYPY ++ C K + A+ + I ++Q V
Sbjct: 174 ENAYQFIVENGGLDTETDYPYHASE-SHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVA 232
Query: 273 HGPLAGNV--ASIELPHISFSFLFT 295
P++ + AS + H + S +FT
Sbjct: 233 KQPVSVAIEGASKDFQHYA-SGVFT 256
>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 477
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 85/236 (36%), Positives = 132/236 (55%), Gaps = 15/236 (6%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
L +H F F+ +F + Y E R R+F+ NL+ + + +A +G+T+F+D+
Sbjct: 164 LDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADM 223
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T +E++ + GL +R ++P + P +FDWR AVT VK+QG+CGSCW
Sbjct: 224 TSTEYKER-TGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCW 282
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 283 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 333
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
GG+E E +YPY C F+++ VS F + +E M L+ HGP++
Sbjct: 334 IGGLEYEAEYPYEAKK-QQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPIS 388
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 134/253 (52%), Gaps = 23/253 (9%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDL 104
+ + FK K Y ++ E +R ++F N + AK +L V GV K+SD+
Sbjct: 23 VQEQWGAFKVTHKKQYESETEERFRMKIFMENAHKVAKHNKLYAQGLVSFKLGVNKYSDM 82
Query: 105 TPSEFRRQFLGLNRRLRLPA-----DAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGAC 158
EF G NR + P D I P N +LP DWR GAVT VKDQG C
Sbjct: 83 LNHEFVHTLNGYNRS-KTPLRSGELDESITFIPPANVELPKQIDWRKLGAVTPVKDQGQC 141
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCWSFS TG+LEG HF + +LVSLSEQ L+DC E+ G ++GCNGGLM++AF
Sbjct: 142 GSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCS-----EKYG--NNGCNGGLMDNAFR 194
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
YI GG++ E+ YPY D C + A F I S DE+++ A + GP++
Sbjct: 195 YIKDNGGIDTEQSYPYKAED-EKCHYKPRNKGATDRGFVDIESGDEEKLKAAVATVGPIS 253
Query: 278 GNVASIELPHISF 290
+I+ H +F
Sbjct: 254 ---VAIDASHPTF 263
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 102/289 (35%), Positives = 146/289 (50%), Gaps = 22/289 (7%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIR--QVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
+LL LS L+SA D ++I Q + D + A + L K K Y
Sbjct: 12 FVLLFLSFTLSSA----SDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQ--GKVYNA 65
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RRLRLP 123
E + RF+VFK NLR + T G+ F+DLT E+R +LG +R RL
Sbjct: 66 LGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLR 125
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+ + LP DWR GAV VKDQG+CGSCW+FS A+EG + + TG+L+S
Sbjct: 126 KTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLIS 185
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E+DYPY DG
Sbjct: 186 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDT 237
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+ K+ + ++ + + + V + P++ +IE F F
Sbjct: 238 YRKNAKVVTIDDYEDVPVNSETALQKAVANQPVS---VAIEAGGRDFQF 283
>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
Length = 334
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 133/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E+ + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEEALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 88/233 (37%), Positives = 122/233 (52%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+F+ G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD D GC G M++AF++I+ + G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DLGCRAGFMDTAFKWIVSSNNGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G +C + A + + I +E+ +A L K GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCNKSGKVVGANIDDHVHILDNENAIAEWLAKKGPVA 261
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 94/243 (38%), Positives = 129/243 (53%), Gaps = 19/243 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K + K Y TQ+E R ++ NL+ + + T + +F DLT E+R
Sbjct: 25 WKRTYGKEY-TQKEEALRHMIWNVNLKMIQMHNEKYMSGKSTYTQNMNQFGDLTNEEYRE 83
Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
G + + +LP+N P DWR G VT VKDQGACGSCW+FS+TG+L
Sbjct: 84 LMCGYKKSNKTVISKPSTFLLPSNYRAPASIDWRTQGYVTDVKDQGACGSCWAFSSTGSL 143
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG F TG+LV LSEQQLVDC + + GC GG M+ AF YI K G E E
Sbjct: 144 EGQTFKKTGKLVPLSEQQLVDCSGDYG-------NMGCGGGWMDQAFSYI-KDKGEESED 195
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIELPHIS 289
YPYTGTD +C +D SK+ A + ++ I DE+ + + GP++ +I+ H S
Sbjct: 196 GYPYTGTD-DTCVYDASKVVATDTGYTDIPEMDENALQQAVATVGPIS---VAIDATHSS 251
Query: 290 FSF 292
F F
Sbjct: 252 FQF 254
>gi|288548564|gb|ADC52430.1| cathepsin L1 cysteine protease [Pinctada fucata]
Length = 331
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 116/206 (56%), Gaps = 13/206 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ ++++K F+K Y EE R V++ N+ ++ H G +++D+T
Sbjct: 25 DQEWAIYKDMFAKNYVADEERMRRL-VWEDNIDYIEKHNRRADRGEHKFWLGTNEYADMT 83
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF+ G + D +P DLP DWRD G VT VK+QG CGSCWSFS
Sbjct: 84 IDEFKAIMNGFIMQNGTKGDTYMSPS-NIGDLPDKVDWRDKGYVTPVKNQGHCGSCWSFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HF STG+LVSLSEQ L+DC + + GC GGLM+ AFEYI K G
Sbjct: 143 ATGSLEGQHFKSTGKLVSLSEQNLIDCSKK-------EGNHGCKGGLMDFAFEYIQKNDG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAA 251
++ E+ YPYT DG C+F K+ + A
Sbjct: 196 IDTEQSYPYTAKDGIECRFKKADVGA 221
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 107/296 (36%), Positives = 158/296 (53%), Gaps = 30/296 (10%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFK---SKF 60
+ LS LLLL + + VA N D +++ SE+ L + E LF+ +K
Sbjct: 8 MKLSGALLLL---CVGACVARNSDFSIVGY--------SEEDLSSNERLVELFEKWLAKH 56
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR- 119
K YA+ EE +RF VFK NL+ + + G+ +F+DLT EF+ +LGL+
Sbjct: 57 QKAYASFEEKLHRFEVFKDNLKHIDKINREVTSYWLGLNEFADLTHDEFKAAYLGLDAAP 116
Query: 120 -LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R + + + + +DLP DWR GAVT VK+QG CGSCW+FS A+EG + + T
Sbjct: 117 ARRGSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVT 176
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L +LSEQ+L+DC S +SGCNGGLM+ AF YI +GG+ E+ YPY +
Sbjct: 177 GNLTALSEQELIDC--------SVDGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYL-ME 227
Query: 239 GGSCKFDKSKIAAAV--SNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
GSC K + AV S + + ++++Q + H P++ +IE F F
Sbjct: 228 EGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVS---VAIEASGRHFQF 280
>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
Length = 358
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 108/294 (36%), Positives = 152/294 (51%), Gaps = 26/294 (8%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNA------EHHFS 54
M R + L+ L S++LA A D+ +I Q V + E LL HF+
Sbjct: 1 MARFLAFLALVFLSSAILARANHAFDEANLI-QSVTERIDSLETSLLGVLGQTRNALHFA 59
Query: 55 LFKSKFSKTYATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F ++ K Y + EE RF +F NL R RR L P + G+ +++D++ EFR
Sbjct: 60 RFAHRYGKRYQSVEEMKLRFAIFMENLELIRSTNRRGL--PYKL-GINRYADMSWEEFRA 116
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
LG + A + + LP DWR+ G V+ VKDQG+CGSCW+FS TGALE
Sbjct: 117 SRLGAAQNC--SATLKGNHKMTDELLPKTKDWREDGIVSPVKDQGSCGSCWTFSTTGALE 174
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
A+ +TG+ +SLSEQQLVDC + + + GCNGGL + AFEYI GG++ E+
Sbjct: 175 AAYTQATGKGISLSEQQLVDCAYAFN-------NFGCNGGLPSQAFEYIKYNGGLDTEES 227
Query: 232 YPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAGNVAS 282
YPY G + G C F + V N ++ + DE A LV+ +A V S
Sbjct: 228 YPYAGVN-GFCHFKPENVGVKVVESVNITLGAEDELLHAVGLVRPVSIAFEVVS 280
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 97/254 (38%), Positives = 135/254 (53%), Gaps = 25/254 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGV---TKFSDLTPSE 108
+S FK + SK Y ++ E +R +++ N R AK Q + AV K++D+ E
Sbjct: 27 WSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADMLSHE 86
Query: 109 FRRQFLGLNRRLRLPADAQKAP--------ILPTN-DLPTDFDWRDHGAVTGVKDQGACG 159
F G N+ L+ P I P + P DWR GAVT VKDQG CG
Sbjct: 87 FVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVTEVKDQGKCG 146
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TGALEG HF TG LVSLSEQ L+DC + ++GCNGGLM++AF+Y
Sbjct: 147 SCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDC-------SAAYGNNGCNGGLMDNAFKY 199
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
I GG++ EK YPY G D C+++ K+ A V + DE+++ + GP++
Sbjct: 200 IKDNGGIDTEKAYPYEGVD-DKCRYNAKNSGADDVGFVDIPQGDEEKLMQAVATVGPVS- 257
Query: 279 NVASIELPHISFSF 292
+I+ SF F
Sbjct: 258 --VAIDASQESFQF 269
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 86/246 (34%), Positives = 135/246 (54%), Gaps = 21/246 (8%)
Query: 37 SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH 96
SDGE E + L+ +K K Y +E + RF++FK NL+ + T
Sbjct: 27 SDGEVRE--------IYDLWLAKHGKAYNGIDEREKRFQIFKENLKFIDDHNSENRTYKV 78
Query: 97 GVTKFSDLTPSEFRRQFLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTG 151
G+ F+DLT E+R +LG R ++ +++ + + LP DWR GAV
Sbjct: 79 GLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNNLDRLPESMDWRTRGAVAP 138
Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
VK+QG+CGSCW+FS A+EG + + TGEL+SLSEQ+LV CD + +SGCNGG
Sbjct: 139 VKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKK--------YNSGCNGG 190
Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLV 271
LM+ AF++I+ GG++ E+DYPY DG K+ ++ + + +++++ V
Sbjct: 191 LMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPANDEESLKKAV 250
Query: 272 KHGPLA 277
H P++
Sbjct: 251 AHQPVS 256
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 95/248 (38%), Positives = 132/248 (53%), Gaps = 17/248 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+KS +K Y +EE +R V++ NL+ + L H G+ +F D+T
Sbjct: 9 HWQLWKSWHNKDYHEREE-SWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTE 67
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EFR+ G + + P+ + P DWR+ G VT VKDQG CGSCW+FS
Sbjct: 68 EFRQLMNGYAHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFST 127
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ GG+
Sbjct: 128 TGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNQGCNGGLMDQAFQYVQDNGGI 180
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ E+ YPYT D C++ AA + F + E + + GP++ +I+
Sbjct: 181 DSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVS---VAIDA 237
Query: 286 PHISFSFL 293
H SF F
Sbjct: 238 GHSSFQFY 245
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 93/252 (36%), Positives = 138/252 (54%), Gaps = 24/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSE 108
++ FK K Y ++ E +R ++F N + ++ +L + + G+ K+ D+ E
Sbjct: 28 WNTFKVTHRKAYDSKIEESFRMKIFMENWHKIALHNQKYELNEVSYKLGMNKYGDMLHHE 87
Query: 109 FRRQFLGLNRRLRLPADAQKAPI-----LPTN-DLPTDFDWRDHGAVTGVKDQGACGSCW 162
F G N+ + AQ+ PI P N ++P+ DWR HGAVT +KDQG CGSCW
Sbjct: 88 FINTLNGFNKSVSAQLRAQRRPIGSRFIEPANVEIPSSVDWRTHGAVTPIKDQGHCGSCW 147
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYIL 221
SFSATGALEG H+ TG+LVSLSEQ L+DC SG ++GCNGGLM+ AF+YI
Sbjct: 148 SFSATGALEGQHYRITGKLVSLSEQNLIDC--------SGRYGNNGCNGGLMDQAFQYIK 199
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
G++ E YPY + C+++ A S + I +E ++ A + GP++
Sbjct: 200 DNHGLDTEISYPYE-AENDKCRYNPRNNGATDSGYVDIPEGNEKKLKAAVATIGPVS--- 255
Query: 281 ASIELPHISFSF 292
+I+ SF F
Sbjct: 256 VAIDASAESFQF 267
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 100/295 (33%), Positives = 149/295 (50%), Gaps = 41/295 (13%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
L L L+ +VLA+A A+ S L+N E ++ FK + +K Y
Sbjct: 3 LFLFLIVAVLATAQAI-----------------SFFELVNQE--WTTFKMEHNKVYKNDV 43
Query: 69 EHDYRFRVFKANLRRAKRR----QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
E +R ++F N + + ++ + + K+ D+ EF G N+ +
Sbjct: 44 EERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQL 103
Query: 125 DAQKAPIL-----PTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+++ PI P N LP DWR+HGAVT VKDQG CGSCWSFSATGALEG HF T
Sbjct: 104 RSERLPIAASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRT 163
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G L+ LSEQ L+DC + ++GCNGGLM+ AF+YI G++ E YPY +
Sbjct: 164 GILIPLSEQNLIDCSGKYG-------NNGCNGGLMDQAFQYIKDNKGLDTEVTYPYE-AE 215
Query: 239 GGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
C+++ + A V + +E ++ A + GP++ +I+ H SF F
Sbjct: 216 NDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVS---VAIDASHQSFQF 267
>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
Length = 467
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/237 (38%), Positives = 121/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ANL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P D + P DWR+ GAVT VK+QG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAEERARVPVDVEVV------GAPAAKDWREEGAVTAVKNQGICGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
+A G +EG FL+ L LSEQ LV CD+ +SGC GGL + AFE+I++
Sbjct: 151 AAIGNIEGQWFLAGNPLTRLSEQMLVSCDNT---------NSGCGGGLSSKAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY G CK + A ++ + DE Q+AA+ GPL+
Sbjct: 202 NGAVYTEDSYPYHSCIGIKLPCKDSDRTVGATITGHVELPQDEAQIAASGAVKGPLS 258
>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
Length = 367
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 28/254 (11%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDPTAVH 96
+E +F F +++K+Y +E+ YR+ VFK NL + ++ R+ L +A
Sbjct: 53 SEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQF 112
Query: 97 GVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
GV KFSD TP E FL L++ L + + P LP +DWRD VT +
Sbjct: 113 GVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPDIRLPDYYDWRDTNKVTPI 171
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
KDQG CGSCW+F A G +E + + +L+ LSEQQL+DCD D GCNGGL
Sbjct: 172 KDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCNGGL 222
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLV 271
M+ AF+ +L GGVE E DYPY G++ C D KIA + S F DE+++ +
Sbjct: 223 MHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLKELVY 281
Query: 272 KHGPLAGNVASIEL 285
GP+A V ++++
Sbjct: 282 TTGPVAIAVDAMDI 295
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 93/265 (35%), Positives = 145/265 (54%), Gaps = 23/265 (8%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
S L+L S L +++A D +++ S+ +S D L+ F + SK K Y
Sbjct: 5 FSKALVLACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSKHGKIYQ 59
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLR 121
+ EE RF +FK NL+ R + G+ +F+DL+ EF+ ++LGL +RR
Sbjct: 60 SIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRE 119
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
P + + +LP DWR GAV VK+QG+CGSCW+FS A+EG + + TG L
Sbjct: 120 SPEEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 175
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
SLSEQ+L+DCD + +GCNGGLM+ AF +I++ GG+ +E+DYPY + G+
Sbjct: 176 TSLSEQELIDCDR--------TYSNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGT 226
Query: 242 CKFDKSKI-AAAVSNFSVISSDEDQ 265
C+ K + +S + + + +Q
Sbjct: 227 CEMTKEETEVVTISGYHDVPQNNEQ 251
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 150/286 (52%), Gaps = 26/286 (9%)
Query: 1 MERLILSSLLLLLLSS----VLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLF 56
M + I+++LL L SS + S + ++ + + SD ED + N + ++
Sbjct: 1 MAKTIITTLLFALFSSLSYAIDMSIIDYKNNHYARKWTLQSD----EDQVKN---RYEMW 53
Query: 57 KSKFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
++ + Y E + RF +FK NLR + + T G+ +F+DLT E+R +LG
Sbjct: 54 LAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRTMYLG 113
Query: 116 LN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
R ++ +Q+ P +P DWR GAV +K+QG+CGSCW+FS A+
Sbjct: 114 TKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAV 173
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TGE+++LSEQ+LVDCD +SGCNGGLM+ AFE+I+ GG++ EK
Sbjct: 174 EGINQIVTGEMITLSEQELVDCDR--------VQNSGCNGGLMDYAFEFIISNGGMDTEK 225
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
YPY G +G K+ ++ + + +E + V H P+
Sbjct: 226 HYPYRGVEGRCDPVRKNYKVVSIDGYEDVPRNERAL-QKAVAHQPV 270
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 120/203 (59%), Gaps = 16/203 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFR 110
F ++ + K Y EE + RF FK NL+ + + T H G+ KF+DL+ EF+
Sbjct: 43 FQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFADLSNEEFK 102
Query: 111 RQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+ +L ++ R+ A+ + L + D P+ DWR G VT VKDQG CGSCWSFS
Sbjct: 103 QLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCGSCWSFST 162
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGA+EG + + T +L+SLSEQ+LVDCD + + GC GG M+ AFE+++ GG+
Sbjct: 163 TGAIEGINAIVTSDLISLSEQELVDCD---------TTNYGCEGGYMDYAFEWVINNGGI 213
Query: 227 EREKDYPYTGTDGGSCKFDKSKI 249
+ E +YPYTG D G+C K +I
Sbjct: 214 DTEANYPYTGVD-GTCNTAKEEI 235
>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 98/258 (37%), Positives = 134/258 (51%), Gaps = 38/258 (14%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
F +K +F ++Y + E R ++ +N R ++ + G+T F+D+ E
Sbjct: 26 FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85
Query: 109 FRRQF----LG-----LNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
++RQ LG L RR LRLP A DLP DWR+ G VT VKDQ
Sbjct: 86 YKRQISQGCLGSFNASLPRRGSAYLRLPEGA---------DLPNSVDWREKGYVTEVKDQ 136
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
CGSCW+FS TG+LEG F TG+LVSLSEQQLVDC + E GC GGLM+S
Sbjct: 137 KQCGSCWAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNE-------GCMGGLMDS 189
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
AF YI GG++ E YPY D G C+++ + I A + + V DED + + G
Sbjct: 190 AFRYIQANGGIDTEDSYPYEAED-GQCRYNSANIGATCTGYVDVKQGDEDALKEAVATIG 248
Query: 275 PLAGNVASIELPHISFSF 292
P++ +I+ H SF
Sbjct: 249 PVS---VAIDASHSSFQL 263
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 83/217 (38%), Positives = 122/217 (56%), Gaps = 16/217 (7%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
+ D RF +FK NLR + A + G+T F++LT E+R +LG RR+
Sbjct: 24 QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83
Query: 123 PADA--QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ + + + +++P DWR GAV +KDQG CGSCW+FS A+EG + + TGE
Sbjct: 84 AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQ+LVDCD S + GCNGGLM+ AF++I+K GG+ EKDYPY GT+G
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGK 195
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K+ + + + S ++ V + P++
Sbjct: 196 CNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVS 232
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 97/253 (38%), Positives = 134/253 (52%), Gaps = 24/253 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
++ +K + K Y ++ E R +++ N + AK Q + V K++DL E
Sbjct: 27 WNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLHEE 86
Query: 109 FRRQFLGLNR-RLRLPA------DAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGS 160
F + G NR + P D I P N ++P DWR+ GAVT VKDQG CGS
Sbjct: 87 FVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHCGS 146
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CWSFSATGALEG HF TG+LVSLSEQ LVDC + ++GCNGG+M+ AF+YI
Sbjct: 147 CWSFSATGALEGQHFRKTGKLVSLSEQNLVDCS-------TKYGNNGCNGGMMDFAFQYI 199
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGN 279
GG++ EK YPY D +C ++ + A F I DE + + GP++
Sbjct: 200 KDNGGIDTEKAYPYEAID-DTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVS-- 256
Query: 280 VASIELPHISFSF 292
+I+ H SF F
Sbjct: 257 -VAIDASHESFQF 268
>gi|426219849|ref|XP_004004130.1| PREDICTED: cathepsin L1 isoform 1 [Ovis aries]
gi|426219851|ref|XP_004004131.1| PREDICTED: cathepsin L1 isoform 2 [Ovis aries]
Length = 334
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 94/249 (37%), Positives = 130/249 (52%), Gaps = 17/249 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
N + H+ +K+ + Y EE +R V++ N + HG + F D
Sbjct: 24 NLDAHWHQWKATHRRLYGMNEE-GWRRAVWEKNKKIIDLHNQEYSQGKHGFSMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ G + R + P+L D+P DW G VT VK+QG CGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKRKKGKLFREPLLI--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC P+ + GCNGGLM++AF+YI +
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR---PQG----NQGCNGGLMDNAFQYIKEN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG++ E+ YPY TD SC + AA + F I E + + GP++ +I
Sbjct: 194 GGLDSEESYPYLATDTSSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250
Query: 284 ELPHISFSF 292
+ H SF F
Sbjct: 251 DAGHASFQF 259
>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 97/250 (38%), Positives = 131/250 (52%), Gaps = 22/250 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
F ++ KF K+Y + E +R +++ N + +L G+T F+D+ E
Sbjct: 26 FHAWRLKFGKSYDSPSEESHRKQIWLTNRKHVLMHNILADQGFKSYRLGMTYFADMENEE 85
Query: 109 FR----RQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
++ R LG + LP LP DLP DWR+ G VTGVKDQ CGSCW+
Sbjct: 86 YKKLVSRGCLG-SFNASLPRRGSTFLRLPEGIDLPDAVDWREQGYVTGVKDQKQCGSCWA 144
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG HF TG LVSLSEQQLVDC E GCNGG M+SAF YI
Sbjct: 145 FSATGALEGQHFRKTGILVSLSEQQLVDCSGAYGNE-------GCNGGWMDSAFRYIEAN 197
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
GG++ E YPY D C+++ + + A S + V DE+ + + GP++ +
Sbjct: 198 GGIDTEASYPYEAED-WLCRYNPASVGATCSGYVDVNKYDEEALKEAVATIGPVS---VA 253
Query: 283 IELPHISFSF 292
I+ H SF F
Sbjct: 254 IDASHASFQF 263
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 84/232 (36%), Positives = 127/232 (54%), Gaps = 14/232 (6%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-----GVTKFSDLT 105
+H + K K Y E + RF +F+ NL + + G+ KF+DLT
Sbjct: 3 YHLQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLT 62
Query: 106 PSEFRRQFLGLNRRLRLPA-DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EFRR + G+ R + + + + + ++LP DWR GAV+ VKDQG CGSCW+F
Sbjct: 63 NDEFRRIYFGVKRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SA GA+EG + + TG+L++LSEQ+LVDCD S +SGC+GGLM+ AF +I+ G
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDT--------SYNSGCDGGLMDYAFRFIINNG 174
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
G++ +KDYPY TDG K+ + + ++ ++ V H P+
Sbjct: 175 GIDTDKDYPYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPV 226
>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
Length = 336
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 99/293 (33%), Positives = 144/293 (49%), Gaps = 40/293 (13%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
++LL+L +V+ A A V+P + E + ++K + K Y T+
Sbjct: 1 MMLLILGAVITMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R F+ N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFTFEKNTIKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVKVN 99
Query: 125 DAQKAPILPTND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ ND LP DWR+ V+ VKDQG CGSCW+FS TG+LEG H TG+
Sbjct: 100 KPLLGSEVGDNDDNGTLPKSVDWRNSAMVSEVKDQGECGSCWAFSTTGSLEGQHANKTGK 159
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT TD
Sbjct: 160 LVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDK 212
Query: 241 SCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
CKFD S + A + + V S +E + + GP++ +I+ H SF F
Sbjct: 213 PCKFDNSSVGATLIGYKDVKSGNEHALKRAVATVGPIS---VAIDAGHESFQF 262
>gi|38048171|gb|AAR09988.1| similar to Drosophila melanogaster CG12163, partial [Drosophila
yakuba]
Length = 213
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 78/192 (40%), Positives = 119/192 (61%), Gaps = 13/192 (6%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
L A+H F F+ +F + Y + E R R+F+ NL+ ++ + + +A +G+T+F+D+
Sbjct: 30 LDKADHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGITEFADM 89
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
T SE++ + GL +R A ++P +LP +FDWR AVT VK+QG+CGSCW
Sbjct: 90 TSSEYKER-TGLWQRNEAKATGGSVAVVPAYHGELPKEFDWRQKNAVTQVKNQGSCGSCW 148
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TG +EG H + TG+L SEQ+L+DCD + DS CNGGLM++A++ I
Sbjct: 149 AFSVTGNIEGLHAVKTGDLKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 199
Query: 223 AGGVEREKDYPY 234
GG+E E +YPY
Sbjct: 200 IGGLEYEAEYPY 211
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 152/289 (52%), Gaps = 20/289 (6%)
Query: 7 SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
S L+L S L ++A D +++ S+ +S D L+ F + S+ K Y T
Sbjct: 6 SKTLVLTCSLCLFLSLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYET 60
Query: 67 QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL--RLPA 124
EE RF VFK NL+ R + G+ +F+DL+ EF+ ++LGL L R +
Sbjct: 61 IEEKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRES 120
Query: 125 DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
++ DLP DWR GAVT VK+QG CGSCW+FS A+EG + + TG L SL
Sbjct: 121 SNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSL 180
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
SEQ+L+DCD + ++GCNGGLM+ AF +I + GG+ +E+DYPY + +C+
Sbjct: 181 SEQELIDCDT--------TYNNGCNGGLMDYAFSFIGQNGGLHKEEDYPYI-MEESTCEM 231
Query: 245 DKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
K + ++ + + + +Q + + PL+ +IE F F
Sbjct: 232 KKEETQVVTINGYHDVPQNNEQSLLKALANQPLS---VAIEASSRDFQF 277
>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
familiaris]
Length = 490
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 95/226 (42%), Positives = 133/226 (58%), Gaps = 11/226 (4%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + +++TY T+EE ++R VF N+ RA++ Q LD TA +G+TKFSDLT EFR
Sbjct: 192 FKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRT 251
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+L R + A + + P ++DWR GAVT VKDQG CGSCW+FS TG +E
Sbjct: 252 IYLNPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVE 311
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G FL G L+SLSEQ+L+DCD D C GGL ++A+ I+ GG+E E D
Sbjct: 312 GQWFLKEGTLLSLSEQELLDCD---------KVDKACLGGLPSNAYSAIMTLGGLETEDD 362
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y Y G +C F K +++ +S +E ++AA L K GP++
Sbjct: 363 YSYQG-HLQACSFSAKKARVYINDSMELSQNEQKLAAWLAKKGPIS 407
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDYAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
Length = 460
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 101/256 (39%), Positives = 143/256 (55%), Gaps = 16/256 (6%)
Query: 26 DDDAMIRQVVPSDGEQS--EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
D + ++ +P+ S +D + F F +++TY ++EE +R VF +N+ R
Sbjct: 134 DRNETLKSTLPALNRDSLPQDFSVKMASIFKKFVRTYNRTYESKEEAQWRLSVFASNMVR 193
Query: 84 AKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDF 141
A++ Q LD TA +G+TKFSDLT EFR +L N LR + P D P +
Sbjct: 194 AQKIQSLDRGTAQYGITKFSDLTEEEFRTIYL--NPLLRSEPGKKMQLAKPVEDPAPPQW 251
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWR GAVT VKDQG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD
Sbjct: 252 DWRSKGAVTNVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDK------- 304
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
D C GGL ++A+ I GG+E E+DY Y G +C F K +++ +S
Sbjct: 305 --LDKACLGGLPSNAYSAIKNLGGLETEEDYTYQG-HMQACNFSAQKAKVYINDSVELSQ 361
Query: 262 DEDQMAANLVKHGPLA 277
+E ++AA L K GP++
Sbjct: 362 NEQKLAAWLAKRGPIS 377
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLML--KIPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANGTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
Length = 350
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 103/267 (38%), Positives = 150/267 (56%), Gaps = 24/267 (8%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
SLL++L A+A D IR V SD E+ ++ H F+ F +++ K Y
Sbjct: 5 SLLIVLFCVASAAAGFSFHDSNPIRMV--SDVEEQLLQVIGESRHAVSFARFANRYGKRY 62
Query: 65 ATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+ +E RF++F NL R + +R+L + GV F+D T EFR LG +
Sbjct: 63 DSVDEMKLRFKIFSENLELIRSSNKRRL---SYKLGVNHFADWTWEEFRSHRLGAAQNC- 118
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
A + + +LP + DWR G V+GVKDQG+CGSCW+FS TGALE A+ + G+
Sbjct: 119 -SATLKGNHKITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKN 177
Query: 182 VSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
+SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG+E E+ YPYTG++ G
Sbjct: 178 ISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSN-G 228
Query: 241 SCKFDKSKIAAAV-SNFSVISSDEDQM 266
CKF +A V + ++ ED++
Sbjct: 229 LCKFRSEHVAVKVLGSVNITLGAEDEL 255
>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 99/231 (42%), Positives = 127/231 (54%), Gaps = 18/231 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKAN---LRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
+K +KTYAT E R R+F N +R R L T + F+DLT EF
Sbjct: 33 WKVANNKTYATLREEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEEFAE 92
Query: 112 QFLGLNRRLR--LPADAQKAPI-LPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
++L L + + D + PT L P DWR G VT +KDQG CGSCW+FSAT
Sbjct: 93 KYLTLKQTPMEGIWQDMSTQYVERPTRMLVPDSIDWRKKGLVTPIKDQGDCGSCWAFSAT 152
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GALEG TG+L+SLSEQQLVDC + + + GCNGG MN AF Y ++ G E
Sbjct: 153 GALEGQLKRKTGKLISLSEQQLVDC-------STYTGNEGCNGGDMNDAFRYWMR-NGAE 204
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
E DYPYT D G CKF+ SK+ VS F V EDQ+ ++ + GP++
Sbjct: 205 SESDYPYTAMD-GKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVS 254
>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 101/297 (34%), Positives = 149/297 (50%), Gaps = 50/297 (16%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
++LL+L +V++ A A V+P + E + ++K + K Y T+
Sbjct: 1 MMLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R + + N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFILEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96
Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
K P+L + ND LP DWR+ V+ VKDQG CGSCW+FS TG+LEG H
Sbjct: 97 ---KKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206
Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
TD CKFD S + A + + V S +E + + GP++ +I+ H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260
>gi|66816665|ref|XP_642342.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
gi|60470393|gb|EAL68373.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
Length = 337
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 99/272 (36%), Positives = 142/272 (52%), Gaps = 29/272 (10%)
Query: 29 AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ 88
A++ V + E SE +A F+ + K+Y++ E R+ +FK N +
Sbjct: 9 ALLITVATAKQELSESQYRDA---FTDWMISNQKSYSSSE-FITRYNIFKTNFDYIEEWN 64
Query: 89 LLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ-----KAPILPTNDLPTDFDW 143
V G+ K +D+T E+R +LG P DA K IL +N + DW
Sbjct: 65 SKGSETVLGLNKMADITNEEYRSLYLGK------PFDASSLIGTKEEILFSNKFSSTVDW 118
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS---TGELVSLSEQQLVDCDHECDPEE 200
R GAVT VK+Q +C CWSFSATGA EGAH L+ T ELVSLSEQ L+DC
Sbjct: 119 RKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFG--- 175
Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
++GCNGG++ AFEYI+ GG++ EK YP+ GTD G+C++ A +S++ ++
Sbjct: 176 ----NTGCNGGVITYAFEYIISNGGIDTEKSYPFEGTD-GTCRYKSENSGATISSYVNVT 230
Query: 261 SDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+ + V P+A SI+ H SF F
Sbjct: 231 FGSESSLESAVNVNPVA---CSIDASHSSFLF 259
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 97/252 (38%), Positives = 137/252 (54%), Gaps = 27/252 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKF-------SDLT 105
F+LFK K Y + E YR ++F N +R ++ + G F +D+
Sbjct: 27 FTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKH---NSRYKQGKVSFKLKLNHLADML 83
Query: 106 PSEFRRQFLGLNRRLRLPADA-QKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCW 162
E+ +LG N+ + + Q +P L + DWR GAVT VK+QG CGSCW
Sbjct: 84 IHEYSDVYLGFNKSSKANNNKLQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCW 143
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYIL 221
+FS TGALEG +F TG+LVSLSEQ LVDC SGS ++GC GGLM++AF+YI
Sbjct: 144 AFSTTGALEGQNFRKTGKLVSLSEQNLVDC--------SGSYGNNGCEGGLMDNAFQYIK 195
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNV 280
+ G++ EK YPY G D +C+F K+ I A S F + DE+ + + GP++
Sbjct: 196 ENHGIDTEKSYPYEGED-ETCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPIS--- 251
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 252 VAIDASHQSFQF 263
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 129/242 (53%), Gaps = 19/242 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFL 114
+ +K K Y EE RF++FK N+ + + + + G+ +F+DLT EFR +
Sbjct: 42 WMAKHGKVYKDDEEKLRRFQIFKNNVEFIESSNAAGNNSYMLGINRFADLTNEEFRASWN 101
Query: 115 GLNRRLRLPADAQK--APILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
G R P DA + P N LP DWR GAVT +KDQ CGSCW+FSA A
Sbjct: 102 GYKR----PLDASRIVTPFKYENVTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAAT 157
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG H L TG+LVSLSEQ+LVDCD + + D GC GGLM AF++I + GG+ E
Sbjct: 158 EGVHKLRTGKLVSLSEQELVDCDVKGE-------DKGCQGGLMEDAFKFIKRNGGITTEA 210
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
+Y Y G DG ++ A ++ + V+ + + V H P++ SI+ +SF
Sbjct: 211 NYAYRGRDGKCDTKKEASHVAKITGYQVVPENSEAALLKAVAHQPVS---VSIDAGSMSF 267
Query: 291 SF 292
F
Sbjct: 268 QF 269
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 104/296 (35%), Positives = 151/296 (51%), Gaps = 27/296 (9%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMI-------RQVVPSDGEQSEDHLLNAEHHFSLF 56
L+ +++ LL+ +S L DD A+ Q + E E H +A FS F
Sbjct: 65 LVAAAVSLLVFASFLIQWQG-EDDRAVFPPSPVEDHQPPANIWEWKEAHFQDA---FSSF 120
Query: 57 KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
++ ++K+YAT+EE R+ +FK NL + + F DL+ EFRR++LG
Sbjct: 121 QAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLGF 180
Query: 117 NRRLRLPAD-----AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ L + + +LP+ +LP DWR G VT VKDQ CGSCW+FS TGALE
Sbjct: 181 KKSRNLKSHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALE 239
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
GAH TG+LVSLSEQ+L+DC + C+GG MN AF+Y+L +GG+ E
Sbjct: 240 GAHCAKTGKLVSLSEQELMDCSR-------AEGNQSCSGGEMNDAFQYVLDSGGICSEDA 292
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELP 286
YPY D C+ + + F V E M A L K P++ + + ++P
Sbjct: 293 YPYLARD-EECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMP 346
>gi|330793420|ref|XP_003284782.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
gi|325085276|gb|EGC38686.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
Length = 347
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 90/247 (36%), Positives = 135/247 (54%), Gaps = 19/247 (7%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L + F+ + + + YA+ EE R+ +FKAN+ + V G+ F+D+T
Sbjct: 24 LQYRNAFTNWMIQNQRHYAS-EEFAARYNIFKANMDYVQEWNSKGSETVLGLNTFADITN 82
Query: 107 SEFRRQFLG--LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
EFR +LG + + + +K P + DWR GAVT +K+Q CG CWSF
Sbjct: 83 QEFRSIYLGTPFDGSSIINTETEKIFAAPAASI----DWRTKGAVTPIKNQQQCGGCWSF 138
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKA 223
S TG+ EGA ++ G L SLSEQ L+DC SGS ++GCNGGLM AFEYI+
Sbjct: 139 STTGSTEGATAIAKGNLPSLSEQNLIDC--------SGSYGNNGCNGGLMTLAFEYIINN 190
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
G++ E YPYT DG +CK++ + I A +S++S ++S + + GP++ +I
Sbjct: 191 KGIDTESSYPYTAKDGKTCKYNPANIGATLSSYSNVTSGSEPSLESAANIGPVS---VAI 247
Query: 284 ELPHISF 290
+ H SF
Sbjct: 248 DASHNSF 254
>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
Length = 392
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 99/237 (41%), Positives = 136/237 (57%), Gaps = 12/237 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 84 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 143
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 144 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 202
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 203 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 253
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 254 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 309
>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
Length = 548
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 100/227 (44%), Positives = 134/227 (59%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 251 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 310
Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR P + K + P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 311 IYL--NPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 368
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 369 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 419
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G SC F K +++ V+S +E ++AA L K GP++
Sbjct: 420 DYSYQG-HMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPIS 465
>gi|229596051|ref|XP_001013456.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225565626|gb|EAR93211.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 315
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 87/197 (44%), Positives = 116/197 (58%), Gaps = 19/197 (9%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPS 107
N + +S FK+K++K YA + YR +F NL+ + T +G+T+F D+T
Sbjct: 35 NIQALWSAFKTKYNKKYADPDFERYRIEIFTENLKVVESN-----TKNYGITQFMDITRE 89
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EF++ +L L + L A +P ND + DW GAVT VKDQG CGSCWSFS T
Sbjct: 90 EFKQTYLTLKMKNGLKA----SPFAKFNDAGVEIDWTTKGAVTPVKDQGQCGSCWSFSTT 145
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GA+EGA FLST +L SLSEQ LVDC S + GCNGGLM++AF++I + G+
Sbjct: 146 GAVEGALFLSTKKLTSLSEQYLVDC--------SKDGNEGCNGGLMDTAFDFISQH-GIP 196
Query: 228 REKDYPYTGTDGGSCKF 244
E YPY D G+CK
Sbjct: 197 TEAAYPYKAVD-GTCKM 212
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 98/285 (34%), Positives = 151/285 (52%), Gaps = 19/285 (6%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
L+L S L ++A D +++ S+ +S D L+ F + S+ K Y T EE
Sbjct: 9 LVLTCSLCLFLSLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYETIEE 63
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
RF VFK NL+ R + G+ +F+DL+ EF+ ++LGL L ++ +
Sbjct: 64 KLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRESSEE 123
Query: 130 PILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
+ DLP DWR GAVT VK+QG CGSCW+FS A+EG + + TG L SLSEQ+
Sbjct: 124 EFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 183
Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS- 247
L+DCD + ++GCNGGLM+ AF +I+K GG+ +E+DYPY + +C+ K
Sbjct: 184 LIDCDT--------TYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYI-MEESTCEMKKEV 234
Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
++ + + + +Q + + PL+ +IE F F
Sbjct: 235 SEVVTINGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQF 276
>gi|328870624|gb|EGG18997.1| cysteine proteinase [Dictyostelium fasciculatum]
Length = 521
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 92/248 (37%), Positives = 129/248 (52%), Gaps = 27/248 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ + K ++Y + E + RF VFK N+ V +T F+D++ E++R
Sbjct: 31 FTNWMIKNDRSYQSAEFGN-RFNVFKKNMDYVNEWNSKGSETVLDLTIFADISNEEYQRI 89
Query: 113 FLGLN----------RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+LG R+ + + AP+ DWR GAVT +K+QG CGSCW
Sbjct: 90 YLGTKIDATQKLIDAARITMNNNFAAAPVFNAT-----VDWRQKGAVTPIKNQGQCGSCW 144
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
SFS TG+ EGAHFLSTG LVSLSEQ LVDC PE + GCNGGLM+ AF YI+K
Sbjct: 145 SFSTTGSTEGAHFLSTGNLVSLSEQNLVDCS---GPEG----NDGCNGGLMDQAFTYIIK 197
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVAS 282
G++ E YPY G C F+ I A ++ ++ + S + GP++ +
Sbjct: 198 NKGIDTESSYPYKAVQ-GKCAFNPKNIGATLTGYTDVKSGSESDLEAKANTGPVS---VA 253
Query: 283 IELPHISF 290
I+ H SF
Sbjct: 254 IDASHNSF 261
>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
Length = 338
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 99/237 (41%), Positives = 136/237 (57%), Gaps = 12/237 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 30 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 89
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 90 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 148
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 149 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 199
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 200 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 255
>gi|33242884|gb|AAQ01146.1| cathepsin [Petromyzon marinus]
Length = 333
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 130/234 (55%), Gaps = 16/234 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL---DPTAVH-GVTKFSDLTPS 107
+ +KS + K Y +++E +R VF+ NL+R + LL + H G+ K+SDL
Sbjct: 26 QWDTWKSTYGKHYGSEQEDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELH 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAP--ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
E+ + +G LR + AP + ++LP DWR G VT VK+QG CGS W+FS
Sbjct: 86 EYHEKVVGRFWNLRNGTRRRGAPFPLRSMDNLPEQVDWRLKGYVTPVKEQGLCGSSWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HF +TG L SLSEQQLVDC ++GCNGG A +YI+ G
Sbjct: 146 ATGSLEGQHFAATGNLTSLSEQQLVDC-------TKSYYNNGCNGGRSERALQYIIDNNG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI--SSDEDQMAANLVKHGPLA 277
++ E YPY D G C+F + +A S++ + SS+E+ + + GP+A
Sbjct: 199 IDSELSYPYEHAD-GKCRFKPANVATKCSSYQFVEPSSNEEVLRQAVASVGPIA 251
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 90/228 (39%), Positives = 125/228 (54%), Gaps = 20/228 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV----HGVTKF 101
L N F FK K SK+Y+ Q E R +F NLR + L + V +F
Sbjct: 18 LENVGSTFQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAGLVSYNKSVNQF 77
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGS 160
+DLT EF+ +L L+ + L P + T +PT DWR G VTGVKDQG CGS
Sbjct: 78 TDLTIDEFK-AYLTLHSKPTL----NTVPYVRTGLQVPTTLDWRSQGYVTGVKDQGDCGS 132
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS G+ EGA++ STG+LVSLSEQQL+DC + + + GC+GG + F Y+
Sbjct: 133 CWAFSVVGSTEGAYYKSTGKLVSLSEQQLIDC--------TTNVNDGCDGGYLEETFPYV 184
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAA 268
+ G V E YPYTG D G+C+ +S + VS + ++ + D + A
Sbjct: 185 QQTGLVS-ESSYPYTGRD-GNCRISESDVVTKVSKYVLLGGEADLLEA 230
>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
Length = 362
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 110/290 (37%), Positives = 158/290 (54%), Gaps = 33/290 (11%)
Query: 1 MERLILSSLLLLLLSSVLASAV-----AVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH- 52
M RL + + +L+LL +V + + D++ IR V S D E S L+ H
Sbjct: 1 MARLSVVAAVLILLCAVASGEADHHFRSSFDEENPIRLVSDSIRDLESSVLRLIGDTRHA 60
Query: 53 --FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSE 108
F+ F ++ K+Y T +E RF +F NL+ R+ R+ L T V +F+D T E
Sbjct: 61 HSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTLA--VNQFADWTWEE 118
Query: 109 FRRQFLGL--NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
FRR LG N L + + ++ LP DWR+ G V+ +KDQG CGSCW+FS
Sbjct: 119 FRRHRLGAAQNCSATLKGNHKLTDVI----LPETKDWREDGIVSPIKDQGHCGSCWTFST 174
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGG 225
TGALE A+ + G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG
Sbjct: 175 TGALEAAYAQAFGKGISLSEQQLVDC--------AGAFNNFGCHGGLPSQAFEYIKYNGG 226
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
++ E+ YPYTG D G+CKF I V N ++ + DE + A V+
Sbjct: 227 LDTEEAYPYTGLD-GTCKFSSENIGVQVLDSVNITLGAEDELKHAVAFVR 275
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 94/262 (35%), Positives = 144/262 (54%), Gaps = 25/262 (9%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFS---KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV 95
G SED L + + LF+S S K Y + EE +RF +FK NL+ R +
Sbjct: 32 GYSSED-LKSMDKLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSNYW 90
Query: 96 HGVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTG 151
G+ +F+DL+ EF+ ++LGL +RR P + + +LP DWR GAVT
Sbjct: 91 LGLNEFADLSHQEFKNKYLGLKVDYSRRRESPEEFTYKDV----ELPKSVDWRKKGAVTQ 146
Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
VK+QG+CGSCW+FS A+EG + + TG L SLSEQ+L+DCD + ++GCNGG
Sbjct: 147 VKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDR--------TYNNGCNGG 198
Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANL 270
LM+ AF +I++ G+ +E+DYPY + G+C+ K + +S + + + +Q
Sbjct: 199 LMDYAFSFIVENDGLHKEEDYPYI-MEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKA 257
Query: 271 VKHGPLAGNVASIELPHISFSF 292
+ + PL+ +IE F F
Sbjct: 258 LANQPLS---VAIEASGRDFQF 276
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 99/247 (40%), Positives = 129/247 (52%), Gaps = 28/247 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F+ FK+K+ K Y E RF +FKAN+ + T GV +F+DLT EF
Sbjct: 27 FNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEEFAAS 86
Query: 113 FLGLNRRLRLPADAQKA-PILPTND-----LPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+ GL PA P L T++ L + DW G VT VK+QG CGSCWSFS
Sbjct: 87 YTGLK-----PASLWSGLPRLSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFST 141
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEGA LSTG LVSLSEQQ DCD + DSGCNGG M++AF + K +
Sbjct: 142 TGALEGAWALSTGNLVSLSEQQFEDCD---------TTDSGCNGGWMDNAFSFA-KKNSI 191
Query: 227 EREKDYPYTGTDGGSCKFDKSKIA---AAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
E YPYT TD G+C ++ V ++ +S+D +Q + V P++ +I
Sbjct: 192 CTEGSYPYTATD-GTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVS---IAI 247
Query: 284 ELPHISF 290
E SF
Sbjct: 248 EADQYSF 254
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 94/253 (37%), Positives = 135/253 (53%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVT 99
DH LN + + L+K+ K Y EE +R V+K N++ + H +
Sbjct: 22 DHSLNTQ--WELWKAVHRKPYDLNEE-GWRKAVWKKNMKMIELHNQEYSQGKHSFSMAMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F DLT EFR+ G R+ I + +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDLTSEEFRQMMNGFQRQENKKGKVFHETIFAS--IPPSVDWREKGYVTPVKNQGKCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TGALEG F TG+LVSLSEQ LVDC PE + GC+GGLM++AF+Y
Sbjct: 137 SCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSQ---PEG----NRGCHGGLMDNAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
+L GG++ E+ YPYTG G+C ++ AA + F + E+ + + GP++
Sbjct: 190 VLDVGGLDSEESYPYTGLV-GTCNYNPKNSAANETGFVDLPKQENALMKAVATLGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ + SF F
Sbjct: 247 -VAVDASNPSFQF 258
>gi|11359985|pir||T46294 hypothetical protein DKFZp434F0610.1 - human (fragment)
gi|6808322|emb|CAB70900.1| hypothetical protein [Homo sapiens]
Length = 308
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 99/237 (41%), Positives = 136/237 (57%), Gaps = 12/237 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 16 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 75
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 76 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 134
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 135 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYSAI 185
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 186 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 241
>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
Length = 334
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 101/257 (39%), Positives = 141/257 (54%), Gaps = 23/257 (8%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNA---EHH---FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
D+ IRQ+V + E+ +L H F+ F ++ K Y T EE RF VF N
Sbjct: 29 DENPIRQIVSDGLHELENGILQVVGKTRHALLFARFAHRYGKRYETVEEIKQRFEVFLDN 88
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPT 139
L+ + + GV +F+D+T EFRR LG + + K + TN LP
Sbjct: 89 LKMIRSHNKKGLSYKLGVNEFTDITWDEFRRDRLGAAQNC---SATTKGNLKLTNVVLPE 145
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DWR+ G V+ VK+QG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC
Sbjct: 146 TKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYGQAFGKGISLSEQQLVDC------- 198
Query: 200 ESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
+G+ ++ GCNGGL + AFEYI GG++ E+ YPYTG + G CKF + V N
Sbjct: 199 -AGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKN-GLCKFSSENVGVKVIDSVN 256
Query: 256 FSVISSDEDQMAANLVK 272
++ + DE + A LV+
Sbjct: 257 ITLGAEDELKYAVALVR 273
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 93/255 (36%), Positives = 136/255 (53%), Gaps = 25/255 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++ +++FK +++K Y +EE R V+++NL L H G+ ++ D+T
Sbjct: 24 DNEWNIFKKQYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMT 82
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPI-LPTN---DLPTDFDWRDHGAVTGVKDQGACGSC 161
EF + G R+ AP+ +P N DLP DWR G VT +K+QG CGSC
Sbjct: 83 NEEFTKTMNGY----RMRNKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSC 138
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
WSFSATG+LEG F TG+LVSLSEQ LVDC + + GC GGLM+ AF YI
Sbjct: 139 WSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKK-------QGNHGCEGGLMDDAFTYIK 191
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNV 280
G++ E YPY D G C+F + + A + F + + DE+ + + GP++
Sbjct: 192 ANNGIDTEASYPYKARD-GKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGPIS--- 247
Query: 281 ASIELPHISFSFLFT 295
+I+ H+SF T
Sbjct: 248 VAIDASHMSFQLYRT 262
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 91/261 (34%), Positives = 139/261 (53%), Gaps = 24/261 (9%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
+D+ L + + +K + YA +E + R+ VFK N+ R +R + T V +
Sbjct: 29 DDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQ 88
Query: 101 FSDLTPSEFRRQFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
F+DLT EFR + G L+ + + + + + LP DWR GAVT +K+
Sbjct: 89 FADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKN 148
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG CG CW+FSA A+EGA + G+L+SLSEQQLVDCD D GC+GGLM+
Sbjct: 149 QGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDCDTN---------DFGCSGGLMD 199
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
+AFE+I+ GG+ E +YPY G D +CK +K A +++ + + ++++ V H
Sbjct: 200 TAFEHIMATGGLTTESNYPYKGKD-ATCKIKNTKPTATSITGYEDVPVNDEKALMKAVAH 258
Query: 274 GPLAGNVASIELPHISFSFLF 294
P+ SI + F F F
Sbjct: 259 QPV-----SIGIEGGGFDFQF 274
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 85/212 (40%), Positives = 122/212 (57%), Gaps = 22/212 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ + + + ++ +TY E + RF VF+ NLR +
Sbjct: 27 IVSYGERSEEEV---RRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAG 83
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTNDLPTDFDWRDH 146
+H G+ +F+DLT E+R +LG+ R RL Q A +LP DWR+
Sbjct: 84 LHSFRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSGRYQAAD---NEELPESVDWREK 140
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG CGSCW+FSA A+EG + + TG++++LSEQ+LVDCD S +
Sbjct: 141 GAVAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDT--------SYNQ 192
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GCNGGLM+ AFE+I+ GG++ E+DYPY D
Sbjct: 193 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERD 224
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 97/256 (37%), Positives = 136/256 (53%), Gaps = 21/256 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL---DPTAVH-GVTK 100
+LL E H LFK+ K Y +Q E +R +++ N + + +L + H + K
Sbjct: 21 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAMNK 78
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N +P DWR+ GA+T VKDQG
Sbjct: 79 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVTVPESVDWREKGAITPVKDQGQ 138
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALEG F TG+LVSLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 139 CGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 191
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+YI G++ E YPY D C+++ A F I S +ED++ A + GP+
Sbjct: 192 QYIKDNKGIDTENTYPYEAED-DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 250
Query: 277 AGNVASIELPHISFSF 292
+ +I+ H SF F
Sbjct: 251 S---VAIDASHESFQF 263
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
Length = 443
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/248 (35%), Positives = 131/248 (52%), Gaps = 44/248 (17%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
FS FK+ ++ Y + E RF +F AN+++A +P A G +F+D++ EF+ +
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84
Query: 113 F-----------------LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
+ AD QK DWR GAVT VK+Q
Sbjct: 85 HNAARHYAAAKARRAKHTKSFTKEEIKAADGQK------------IDWRLKGAVTSVKNQ 132
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G+CGSCWSFS TG +EG + ++TG LVSLSEQ+LV CD + D+GCNGGLM++
Sbjct: 133 GSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQELVSCD---------TTDNGCNGGLMDN 183
Query: 216 AFEYIL--KAGGVEREKDYPYTGTDG--GSCKF--DKSKIAAAVSNFSVISSDEDQMAAN 269
AF +++ + G + E YPY +G +C + D + A +SNF I+ E+ MAA
Sbjct: 184 AFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAF 243
Query: 270 LVKHGPLA 277
+ +GPL+
Sbjct: 244 VFNYGPLS 251
>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
Length = 334
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|1222695|gb|AAA92019.1| CP4 [Dictyostelium discoideum]
Length = 442
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 84/234 (35%), Positives = 127/234 (54%), Gaps = 13/234 (5%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L + F+ + +TY++ EE + R+++FK+N+ + V G+ F+D+T
Sbjct: 24 LQYRNAFTNWMQAHQRTYSS-EEFNARYQIFKSNMDYVHQWNSKGGETVLGLNVFADITN 82
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
E+R +LG ++ I T PT DWR GAVT +K+QG CG CWSFS
Sbjct: 83 QEYRTTYLGTPFDGSALIGTEEEKIFST-PAPT-VDWRAQGAVTPIKNQGQCGGCWSFST 140
Query: 167 TGALEGAHFLSTG---ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
TG+ EGAHF+++G +LVSLSEQ L+DC ++GC GGLM FEYI+
Sbjct: 141 TGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYG-------NNGCEGGLMTLGFEYIINN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G++ E YPYT DG CKF S I A + ++ ++S + + + P++
Sbjct: 194 KGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQSASNNAPVS 247
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/251 (35%), Positives = 134/251 (53%), Gaps = 22/251 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ + ++ + ++ TY E + RF F+ NLR +
Sbjct: 28 IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG +R +L A Q A ++LP DWR
Sbjct: 85 VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKK 141
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG CGSCW+FSA A+EG + + TG+++ LSEQ+LVDCD S +
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQ 193
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ E+DYPY D K+ + + + + ++
Sbjct: 194 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKS 253
Query: 267 AANLVKHGPLA 277
V + P++
Sbjct: 254 LQKAVANQPIS 264
>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
Length = 335
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/239 (37%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLD-PTAVHGVTKF 101
+L A +F F ++K Y + E + R+ +FK NL AK D PTA +G+ KF
Sbjct: 27 NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYGINKF 86
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACG 159
SDL+ SE +F GL+ R ++ K +L P + P FDWR+ VT +K+QGACG
Sbjct: 87 SDLSKSELIAKFTGLSIPQR-ASNFCKTIVLNQPPDKGPLHFDWREQNKVTSIKNQGACG 145
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
+CW+F+ ++E + LV LSEQQL+DCD S D GCNGGL+++AFE
Sbjct: 146 ACWAFATLASVESQFAMRHNRLVDLSEQQLIDCD---------SVDMGCNGGLLHTAFEE 196
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPL 276
I++ GGV+ E DYP+ G D C D+ + + + V + + +E+++ L GP+
Sbjct: 197 IIRMGGVQAELDYPFVGRD-RRCGVDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPI 254
>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
Length = 443
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 90/242 (37%), Positives = 132/242 (54%), Gaps = 32/242 (13%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK+ ++ YA+ +E RF +F N+++A +P A G +F+D+T EF+ +
Sbjct: 25 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTR 84
Query: 113 F----LGLNRRLRLPADAQ-------KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
+ R P + + KA + DWR GAVT VK+QGACGSC
Sbjct: 85 HNAARHYAAAKARPPKNTKTFTAEEIKAAV------GQQIDWRLKGAVTPVKNQGACGSC 138
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
WSFS TG +EG H ++TG+LV++SEQ+LV CD D GCNGGLM++AF +++
Sbjct: 139 WSFSTTGNIEGQHAIATGQLVAVSEQELVSCD---------PIDDGCNGGLMDNAFGWLI 189
Query: 222 KA--GGVEREKDYPYTGTDG----GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
A G + E +YPY +G S + + A +S F I+ E+ MAA + KHGP
Sbjct: 190 SAHKGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGP 249
Query: 276 LA 277
L+
Sbjct: 250 LS 251
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 96/275 (34%), Positives = 147/275 (53%), Gaps = 29/275 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE++++ A ++ + + +TY + R++VF+ NLR
Sbjct: 29 IVSYGERTDEE---ARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAG 85
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+ +LG R +L A A DLP DWR
Sbjct: 86 VHSFRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGARYHAAD---NEDLPESVDWRAK 142
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG+CG+CW+FS A+EG + + TG+L+SLSEQ+LVDCD S +
Sbjct: 143 GAVAEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 194
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ EKDYPY GTDG K+ + ++ + +++++
Sbjct: 195 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 254
Query: 267 AANLVKHGPLAGNVASIELPHISF----SFLFTVS 297
V + P++ +IE +F S +FT S
Sbjct: 255 LQKAVANQPVS---VAIEAAGTAFQLYSSGIFTGS 286
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/248 (35%), Positives = 134/248 (54%), Gaps = 16/248 (6%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K+Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+E + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ AF++I+ GG++ E DYPY G D K+ + ++ ++ + +
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQK 253
Query: 270 LVKHGPLA 277
V++ P++
Sbjct: 254 AVRNQPVS 261
>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
Length = 301
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 98/240 (40%), Positives = 130/240 (54%), Gaps = 20/240 (8%)
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGL 116
SK Y +EE +R V++ NL++ + L H G+ F D+T EFR+ G
Sbjct: 1 SKKYHEKEE-GWRRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGY 59
Query: 117 NRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
R+ + + + N L P DWRD+G VT VKDQG CGSCW+FS TGALEG H
Sbjct: 60 KRKPQRKFTG--SLFMEPNFLEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQH 117
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI G++ E YPY
Sbjct: 118 FRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNQGLDSEDSYPY 170
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELPHISFSFL 293
GTD C +D +A + F I S +++ V GP++ +I+ H SF F
Sbjct: 171 LGTDDQPCHYDPKYNSANDTGFVDIPSGKERALMKAVAAVGPVS---VAIDAGHESFQFY 227
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/227 (37%), Positives = 125/227 (55%), Gaps = 11/227 (4%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + SK K Y + EE +RF VF+ NL R + G+ +F+DL+ EF+ +
Sbjct: 404 FESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSHEEFKSK 463
Query: 113 FLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+LGL D + + DLP DWR GAVT VK+QGACGSCW+FS A+E
Sbjct: 464 YLGLRAEFPRSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWAFSTVAAVE 523
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TG L +LSEQ+L+DCD + +SGCNGGLM+ AF +I GG+ +E D
Sbjct: 524 GINQIVTGNLTTLSEQELIDCDT--------TFNSGCNGGLMDYAFAFIASNGGLHKEDD 575
Query: 232 YPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY + G+C+ K + +S + + +++ + H PL+
Sbjct: 576 YPYL-MEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLS 621
>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 147/297 (49%), Gaps = 50/297 (16%)
Query: 9 LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
+LLL+L +V++ A A V+P + E + ++K + K Y T+
Sbjct: 1 MLLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
E R +F+ N + + +H T KF D+ EF ++ +G ++
Sbjct: 40 EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96
Query: 125 DAQKAPILPT--------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
K P+L + LP DWR+ V+ VKDQG CG CW+FS TG+LEG H
Sbjct: 97 ---KKPLLGSEVGDSDDNGTLPKSVDWRNSHMVSEVKDQGECGPCWAFSTTGSLEGQHSN 153
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG+LV LSEQQLVDC + + GC GGLM+ AF+YI GG++ E+ YPYT
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIPANGGLDTEESYPYTA 206
Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
TD CKFD S + A + + V S +E + + GP++ +I+ H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260
>gi|407844577|gb|EKG02025.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
C1, cathepsin L-like, putative, partial [Trypanosoma
cruzi]
Length = 308
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 97/256 (37%), Positives = 128/256 (50%), Gaps = 27/256 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ANL A+ +P A GVT FSDLT EFR
Sbjct: 65 QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHANFGVTPFSDLTREEFRS 124
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P D + P DWR+ GAVT VK+QG CGSCW+F
Sbjct: 125 RYQNGAAHFAAAQERARVPVDVEVV------GAPAAKDWREEGAVTAVKNQGMCGSCWAF 178
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--K 222
+A G +EG FL+ L LSEQ LV CD+ +SGC GG AF++I+
Sbjct: 179 AAIGNIEGQWFLAGNPLTRLSEQMLVSCDNT---------NSGCGGGSPFRAFKWIVDRN 229
Query: 223 AGGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
G V E YPY G CK + A +S + I SDE ++AA L GPL+ V
Sbjct: 230 NGAVYTEDSYPYHSCIGIKLPCKDSDRTVGATISGYVTIPSDEKRIAAVLAVKGPLSVAV 289
Query: 281 ASIELPHISFSFLFTV 296
+ H + +FT+
Sbjct: 290 DASSWMHYT-GGVFTI 304
>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/233 (37%), Positives = 123/233 (52%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+F+ G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC G M++AF++I+ G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFMDTAFKWIVSPNDGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G +C + A + + I +E+ +A L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPACNKSGKVVGANIRDHVHILDNENAIAEWLAKNGPVA 261
>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
Length = 443
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/248 (35%), Positives = 131/248 (52%), Gaps = 44/248 (17%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
FS FK+ ++ Y + E RF +F AN+++A +P A G +F+D++ EF+ +
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84
Query: 113 F-----------------LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
+ AD QK DWR GAVT VK+Q
Sbjct: 85 HNAARHYAAAKARRAKHTKSFTKEEIKAADGQK------------IDWRLKGAVTSVKNQ 132
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G+CGSCWSFS TG +EG + ++TG LVSLSEQ+LV CD + D+GCNGGLM++
Sbjct: 133 GSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQELVSCD---------TTDNGCNGGLMDN 183
Query: 216 AFEYIL--KAGGVEREKDYPYTGTDG--GSCKF--DKSKIAAAVSNFSVISSDEDQMAAN 269
AF +++ + G + E YPY +G +C + D + A +SNF I+ E+ MAA
Sbjct: 184 AFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAF 243
Query: 270 LVKHGPLA 277
+ +GPL+
Sbjct: 244 VFNYGPLS 251
>gi|66815417|ref|XP_641725.1| cysteine proteinase [Dictyostelium discoideum AX4]
gi|74844418|sp|Q94503.1|CYSP6_DICDI RecName: Full=Cysteine proteinase 6; Flags: Precursor
gi|1644500|gb|AAC47481.1| cysteine proteinase [Dictyostelium discoideum]
gi|60469754|gb|EAL67741.1| cysteine proteinase [Dictyostelium discoideum AX4]
Length = 434
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 90/235 (38%), Positives = 128/235 (54%), Gaps = 26/235 (11%)
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EE + RF +FKAN+ V G+ F+D+T E+R +LG P D
Sbjct: 42 SSEEFNGRFNIFKANMDYINEWNTKGSETVLGLNVFADITNEEYRATYLGT------PFD 95
Query: 126 AQKAPILPTNDL-----PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG- 179
A + P+ + DWR GAVT +K+QG CG CWSFSATGA EGA +++ G
Sbjct: 96 ASSLEMTPSEKVFGGVQANSVDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGD 155
Query: 180 -ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
+L S+SEQQL+DC SGS ++GC GGLM AFEYI+ GG++ E YP+T
Sbjct: 156 SDLTSVSEQQLIDC--------SGSYGNNGCEGGLMTLAFEYIINNGGIDTESSYPFT-A 206
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+ CK++ S I A +S++ ++S + A V GP + +I+ SF F
Sbjct: 207 NTEKCKYNPSNIGAELSSYVNVTSGSESDLAAKVTQGPTS---VAIDASQPSFQF 258
>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
Length = 484
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/238 (42%), Positives = 137/238 (57%), Gaps = 14/238 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235
Query: 101 FSDLTPSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
FSDLT EFR +L N LR P + K + P ++DWR GAVT VKDQG CG
Sbjct: 236 FSDLTEEEFRTIYL--NTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCG 293
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+
Sbjct: 294 SCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSA 344
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 345 IKNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 401
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/252 (34%), Positives = 139/252 (55%), Gaps = 18/252 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
+S+ L +E H L+ S+ + Y + E RF +FK N++ + + + + G+
Sbjct: 28 RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86
Query: 100 KFSDLTPSEFRRQFLGLN------RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
+F+D+T EF +F GLN +P+ K L +D+P++ DWR+ GAVT VK
Sbjct: 87 EFADITSEEFLAKFTGLNIPNSYLSPSPMPSTEFKINDLSDDDMPSNLDWRESGAVTQVK 146
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+QG CG CW+FSA G+LEGA+ ++TG L+ SEQ+L+DC + + GCNGG M
Sbjct: 147 NQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFM 197
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
+AF++I++ GG+ RE DY Y G +C+ A +SN+ V+ E + + K
Sbjct: 198 TNAFDFIIENGGISRESDYEYLGQQ-YTCRSQGKTAAVQISNYQVVPEGETSLLQAVTKQ 256
Query: 274 GPLAGNVASIEL 285
G AS +L
Sbjct: 257 PVSIGIAASHDL 268
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 91/249 (36%), Positives = 135/249 (54%), Gaps = 19/249 (7%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTP 106
+ + FK+++ K Y + +E YR V++ N + T +F D+T
Sbjct: 20 NEWQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTT 79
Query: 107 SEFRRQFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
E G L+ ++P P++ ++LP DWRD GAVT VKDQ ACGSCW+FS
Sbjct: 80 EEINAAMNGFLSAGKKVPRGTMYQPLV--DELPDTVDWRDKGAVTPVKDQKACGSCWAFS 137
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG HFLSTG+LVSLSEQ LVDC + + GC GGLM++AF YI G
Sbjct: 138 ATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYG-------NFGCGGGLMDNAFRYIKDNNG 190
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIE 284
++ E+ YPY + G C+F+ + A +S++ I ED + + + GP++ +I+
Sbjct: 191 IDTEESYPYEAKN-GPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVS---VAID 246
Query: 285 LPHISFSFL 293
+F F
Sbjct: 247 ASTSTFHFY 255
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 96/257 (37%), Positives = 134/257 (52%), Gaps = 22/257 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKF 101
L+ AE +S FK+ K Y ++ E YR +++ N + A+ + V + ++
Sbjct: 24 LVGAE--WSAFKALHGKEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLAMNEY 81
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT----NDLPTDFDWRDHGAVTGVKDQGA 157
D+ EF G R R I P LP DWR GAVT VK+QG
Sbjct: 82 GDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQ 141
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TG+LEG HF +G++VSLSEQ LVDC + ++GC GGLM++AF
Sbjct: 142 CGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDC-------STAFGNNGCEGGLMDNAF 194
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPL 276
+YI GG++ EK YPY GTD G+C F KS + A + F I + + V GP+
Sbjct: 195 KYIKANGGIDTEKSYPYNGTD-GTCHFKKSDVGATDTGFVDIPEGNEHLLKKAVATVGPI 253
Query: 277 AGNVASIELPHISFSFL 293
+ +I+ H SF F
Sbjct: 254 S---VAIDASHQSFQFY 267
>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
Length = 485
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 99/237 (41%), Positives = 136/237 (57%), Gaps = 12/237 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 236 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 294
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 295 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 345
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 346 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSMELSQNEQKLAAWLAKRGPIS 401
>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/233 (37%), Positives = 123/233 (52%), Gaps = 14/233 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F+ FK K+S++Y E +RFRVFK N+ RAK +P A GVT+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
R + G +K + T P DWR GAVT VKDQG C S W+F+ G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
+EG ++ EL SLSEQ LV CD + D GC G M++AF++I+ G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFMDTAFKWIVSPNDGNV 208
Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E+ YPY G +C + A + + I +E+ +A L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPACNKSGKVVGANIDDHVHILDNENAIAEWLAKNGPVA 261
>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
Length = 350
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/267 (38%), Positives = 150/267 (56%), Gaps = 24/267 (8%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
SLL++L A+A D IR V SD E+ ++ H F+ F +++ K Y
Sbjct: 5 SLLIVLFCVASAAAGFSFHDSNPIRMV--SDVEEQLLQVIGESRHAVSFARFANRYGKRY 62
Query: 65 ATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+ +E RF++F N+ R + +R+L + GV F+D T EFR LG +
Sbjct: 63 DSVDEMKLRFKIFSENIELIRSSNKRRL---SYKLGVNHFADWTWEEFRSHRLGAAQNC- 118
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
A + + +LP + DWR G V+GVKDQG+CGSCW+FS TGALE A+ + G+
Sbjct: 119 -SATLKGNHKITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKN 177
Query: 182 VSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
+SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG+E E+ YPYTG++ G
Sbjct: 178 ISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSN-G 228
Query: 241 SCKFDKSKIAAAV-SNFSVISSDEDQM 266
CKF +A V + ++ ED++
Sbjct: 229 LCKFRSEHVAVKVLGSVNITLGAEDEL 255
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/223 (39%), Positives = 128/223 (57%), Gaps = 13/223 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F + SK Y ++E RF ++++N++ L +F+D+T SEF
Sbjct: 40 KQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEF 99
Query: 110 RRQFLGLNRR-LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+ FLGLN LRL Q+ P ++P DWR GAVT +++QG CG CW+FSA
Sbjct: 100 KAHFLGLNTSSLRLHKK-QRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVA 158
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EG + + TG LVSLSEQQL+DCD G+ + GC+GGLM +AFE+I GG+
Sbjct: 159 AIEGINKIKTGNLVSLSEQQLIDCD-------VGTYNKGCSGGLMETAFEFIKTNGGLAT 211
Query: 229 EKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDED--QMAA 268
E DYPYTG + G+C +KSK + + ++ +E Q+AA
Sbjct: 212 ETDYPYTGIE-GTCDQEKSKNKVVTIQGYQKVAQNEASLQIAA 253
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 136/256 (53%), Gaps = 21/256 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
+LL E H LFK+ K Y +Q E +R +++ N + + +L + + + K
Sbjct: 25 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N ++P DWR GA+T VKDQG
Sbjct: 83 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWRVKGAITPVKDQGQ 142
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS+TGALEG F TG+L+SLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+YI G++ E YPY D C+++ A F I S +ED++ A + GP+
Sbjct: 196 QYIKDNKGIDTENTYPYEAED-NVCRYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGPV 254
Query: 277 AGNVASIELPHISFSF 292
+ +I+ H SF F
Sbjct: 255 S---VAIDASHESFQF 267
>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 139/253 (54%), Gaps = 23/253 (9%)
Query: 31 IRQVVPSDGEQSEDHLLN----AEHHFSL--FKSKFSKTYATQEEHDYRFRVFKANLRRA 84
IRQVV + E+ +L + H S F ++ K Y + EE RF VF NL+
Sbjct: 33 IRQVVSDGLHELENGILQVVGQSRHALSFVRFAHRYGKRYESVEEIKQRFEVFLDNLKMI 92
Query: 85 KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDW 143
+ + GV +F+DLT EFRR LG + + K + TN LP DW
Sbjct: 93 RSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQNC---SATTKGNVKLTNAVLPETKDW 149
Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
R+ G V+ VK+QG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +G+
Sbjct: 150 REDGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDC--------AGA 201
Query: 204 CDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SNFSVI 259
++ GCNGGL + AFEYI GG++ E+ YPYTG + G CKF + V N ++
Sbjct: 202 FNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKN-GLCKFSSENVGVKVIDSVNITLG 260
Query: 260 SSDEDQMAANLVK 272
+ DE + A LV+
Sbjct: 261 AEDELKYAVALVR 273
>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
Length = 491
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 98/243 (40%), Positives = 140/243 (57%), Gaps = 14/243 (5%)
Query: 37 SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAV 95
+ G S+D + F F + +++TY ++EE +R +F N+ RA++ Q LD TA
Sbjct: 178 NKGPLSKDFSMQMLSVFKNFLTTYNRTYESKEETQWRLSIFINNMVRAQKIQALDQGTAR 237
Query: 96 HGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKD 154
+G+TKFSDLT EFR +L N LR + P D P ++DWR+ GAVT VK+
Sbjct: 238 YGITKFSDLTEEEFRTIYL--NPLLREDPGKKMRVAKPVGDPAPPEWDWRNKGAVTNVKN 295
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD D C GGL +
Sbjct: 296 QGMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDK---------MDKACLGGLPS 346
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
+A+ I GG+E E+DY Y G +C F K +++ +S +E ++AA L K G
Sbjct: 347 NAYSAIKNLGGLETEEDYSYQG-QMQACNFSAEKAKVYINDSVELSHNEQKLAAWLAKKG 405
Query: 275 PLA 277
P++
Sbjct: 406 PIS 408
>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
Length = 428
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 90/242 (37%), Positives = 132/242 (54%), Gaps = 32/242 (13%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK+ ++ YA+ +E RF +F N+++A +P A G +F+D+T EF+ +
Sbjct: 10 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTR 69
Query: 113 F----LGLNRRLRLPADAQ-------KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
+ R P + + KA + DWR GAVT VK+QGACGSC
Sbjct: 70 HNAARHYAAAKARPPKNTKTFTAEEIKAAV------GQQIDWRLKGAVTPVKNQGACGSC 123
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
WSFS TG +EG H ++TG+LV++SEQ+LV CD D GCNGGLM++AF +++
Sbjct: 124 WSFSTTGNIEGQHAIATGQLVAVSEQELVSCD---------PIDDGCNGGLMDNAFGWLI 174
Query: 222 KA--GGVEREKDYPYTGTDG----GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
A G + E +YPY +G S + + A +S F I+ E+ MAA + KHGP
Sbjct: 175 SAHKGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGP 234
Query: 276 LA 277
L+
Sbjct: 235 LS 236
>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
Length = 460
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 100/234 (42%), Positives = 133/234 (56%), Gaps = 28/234 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + +++TY +QEE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222
Query: 112 QFL--------GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+L G N RL P T+ P +DWR+ GAVT VKDQG CGSCW+
Sbjct: 223 IYLNPLLKDAPGRNMRLAQPV---------TDVPPPQWDWRNKGAVTDVKDQGMCGSCWA 273
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 274 FSVTGNVEGQWFLKRGTLLSLSEQELLDCDKT---------DKACLGGLPSNAYSAIRTL 324
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E E DY Y G +C F K +++ +S +E ++AA L K GP++
Sbjct: 325 GGLETEDDYSYRG-HLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGPIS 377
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/215 (40%), Positives = 121/215 (56%), Gaps = 11/215 (5%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFL-GLNRR 119
K Y E D RF +F NL+ + + + G+T+F+DLT EFR +L R
Sbjct: 46 KNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADLTNEEFRAIYLRSKMER 105
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
R +++ + LP + DWR GAV VKDQG+CGSCW+FSA GA+EG + + TG
Sbjct: 106 TRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSCWAFSAIGAVEGINQIKTG 165
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
ELVSLSEQ+LVDCD S ++GC GGLM+ AF++I+ GG++ E+DYPYT TD
Sbjct: 166 ELVSLSEQELVDCDT--------SYNNGCGGGLMDYAFQFIISNGGIDTEEDYPYTATDD 217
Query: 240 GSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKH 273
C DK + + + +E+ + L
Sbjct: 218 NICNTDKKNTRVVTIDGYEDVPENENSLKKALANQ 252
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 92/232 (39%), Positives = 127/232 (54%), Gaps = 17/232 (7%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTP 106
+ + L+K+ + K+Y T EE YR ++ N K + HG T F DLT
Sbjct: 25 NEWELWKATYGKSYLTLEEEKYRRDTWEENSLLIKTHNT--DSDKHGYTLEMNSFGDLTS 82
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+EF + G + L + + N +P+ DWRD VT VK+QG CGSCW+FS
Sbjct: 83 AEFSSLYNGYRQNLETSGSVFSSSL--RNAMPSSLDWRDKKVVTDVKNQGKCGSCWAFST 140
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG+LEG H L TG LVSLSEQQL+DC + ++GC+GG M SAF+YI AGG
Sbjct: 141 TGSLEGLHALKTGHLVSLSEQQLMDCSVKYG-------NNGCDGGNMRSAFQYIKDAGGD 193
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
+ E+ YPYT + SC+FD K+ A + I S DE + L + GP++
Sbjct: 194 DTEESYPYTAKN-ESCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPIS 244
>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
Length = 302
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 5 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 64
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 65 IYL--NTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 122
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 123 EGQWFLNQGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYSAIKNLGGLETED 173
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 174 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 219
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRIIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|16076439|emb|CAC94444.1| cysteine proteinase [Betula pendula]
Length = 133
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 72/85 (84%), Positives = 80/85 (94%)
Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
DHECDPEE G+CDSGC+GGLM +AFEY LKAGG+EREKDYPYTGTD GSCKFDKSKIAA+
Sbjct: 1 DHECDPEEYGACDSGCSGGLMTTAFEYTLKAGGLEREKDYPYTGTDRGSCKFDKSKIAAS 60
Query: 253 VSNFSVISSDEDQMAANLVKHGPLA 277
VSNFSV+S DEDQ+AANLVK+GPLA
Sbjct: 61 VSNFSVVSIDEDQIAANLVKNGPLA 85
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 93/253 (36%), Positives = 127/253 (50%), Gaps = 23/253 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+K + K Y + E R +++ NL+ L +H G+ D+T
Sbjct: 36 DRHWDLWKKTYGKQYKEKNEEGVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 95
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
E L LR+P+ Q+ P LP DWRD G VT VK QG+CGSCW
Sbjct: 96 SEEVT----ALMSSLRVPSQWQRNVTYKSNPNQKLPDSVDWRDKGCVTDVKYQGSCGSCW 151
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA GALE L TG+LVSLS Q LVDC + GCNGG M AF+YI+
Sbjct: 152 AFSAVGALEAQVKLKTGKLVSLSAQNLVDC------SVGKYSNRGCNGGFMTEAFQYIID 205
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAGNVA 281
G+E E YPY D G C++D AA S ++ + D ED + + GP++
Sbjct: 206 NNGIESEASYPYKAMD-GKCQYDSKYRAATCSRYTELPEDSEDALKEAVANKGPVS---V 261
Query: 282 SIELPHISFSFLF 294
+I+ H SF FL+
Sbjct: 262 AIDASHPSF-FLY 273
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/281 (37%), Positives = 143/281 (50%), Gaps = 23/281 (8%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ LS+L+L LL+ S+ RQ+ D + D + + H + ++ +T
Sbjct: 1 MSLSTLILALLA---MSSAVAAPRALAARQL-AGDEAITVDAAMVSRHE--KWMAEHGRT 54
Query: 64 YATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---- 118
YA +EE R VF+AN + D T +F+DLT EFR GL R
Sbjct: 55 YANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAARTGLRRPPAA 114
Query: 119 --RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
A + D DWR GAVTGVKDQG+CG CW+FSA A+EG +
Sbjct: 115 AAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLTKI 174
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG LVSLSEQQLVDCD D D GC GGLM++AFEY++ GG+ E YPY G
Sbjct: 175 RTGRLVSLSEQQLVDCDVYGD-------DEGCAGGLMDNAFEYMINRGGLTTESSYPYRG 227
Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
TD GSC+ +S AA++ + + ++ + V H P++
Sbjct: 228 TD-GSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVS 265
>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
Length = 333
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 130/249 (52%), Gaps = 18/249 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSD 103
N + + +K+ + Y+T EE +R V++ N++ + HG T F D
Sbjct: 24 NLDTQWYQWKATHRRLYSTNEE-GWRRAVWEKNMKMIELHNGEYSRGKHGFTMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ + + + P+L DLP DWR G VT VK+Q CGSCW+
Sbjct: 83 MTNEEFRQVMVCFRNQKHKNGKVFRGPLLL--DLPKSVDWRKKGYVTPVKNQKQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC P+ + GCNGG MN AF Y+ +
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR---PQG----NQGCNGGFMNYAFRYVKEN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG++ E YPY D G CK+ A + F VI + E ++ + GP++ ++
Sbjct: 194 GGLDSEASYPYEAKD-GICKYKPENSVANDTGFVVIPTHEKELMKAVATVGPIS---VAV 249
Query: 284 ELPHISFSF 292
+ H SF F
Sbjct: 250 DASHSSFQF 258
>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
Length = 477
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 136/244 (55%), Gaps = 28/244 (11%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
+D + F F + +++TY +QEE +R VF N+ RA++ Q LD TA +GVTKF
Sbjct: 170 QDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKF 229
Query: 102 SDLTPSEFRRQFL--------GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
SDLT EFR +L G N RL P T+ P +DWR+ GAVT VK
Sbjct: 230 SDLTEEEFRTIYLNPLLKDAPGRNMRLAQPV---------TDVPPPQWDWRNKGAVTDVK 280
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
DQG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD D C GGL
Sbjct: 281 DQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKT---------DKACLGGLP 331
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
++A+ I GG+E E DY Y G +C F K +++ +S +E ++AA L K
Sbjct: 332 SNAYSAIRTLGGLETEDDYSYRG-HLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKK 390
Query: 274 GPLA 277
GP++
Sbjct: 391 GPIS 394
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/248 (39%), Positives = 132/248 (53%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +KS SK Y +EE +R +++ NL+ + L H G+ F D+T
Sbjct: 27 HWLSWKSWHSKKYHEKEE-GWRRMIWEKNLKMIELHNLDHSLGKHSYRLGMNHFGDMTNE 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G ++ R + + L N L P DWR+ G VT VKDQG CGSCW+FS
Sbjct: 86 EFRQVMNGF-KQSRSQRKYKGSQFLEPNFLQAPKSVDWREKGYVTPVKDQGQCGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATGALEG HF TG+LVSLSEQ L+DC PE + GCNGGLM+ AF+YI G
Sbjct: 145 ATGALEGQHFRKTGKLVSLSEQNLIDCS---GPE----GNQGCNGGLMDQAFQYIKDNNG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
++ E+ YPY G D C + +A + F I ++ V GP++ +I+
Sbjct: 198 IDSEESYPYIGKDDEDCLYKPEYNSANDTGFVDIPEGRERALMKAVAAVGPIS---VAID 254
Query: 285 LPHISFSF 292
H SF F
Sbjct: 255 ASHTSFQF 262
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/281 (37%), Positives = 143/281 (50%), Gaps = 23/281 (8%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ LS+L+L LL+ S+ RQ+ D + D + + H + ++ +T
Sbjct: 1 MSLSTLILALLA---MSSAVAAPRALAARQL-AGDEAITVDSAMVSRHE--KWMAEHGRT 54
Query: 64 YATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---- 118
YA +EE R VF+AN + D T +F+DLT EFR GL R
Sbjct: 55 YANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAARTGLRRPPAA 114
Query: 119 --RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
A + D DWR GAVTGVKDQG+CG CW+FSA A+EG +
Sbjct: 115 AAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLTKI 174
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG LVSLSEQQLVDCD D D GC GGLM++AFEY++ GG+ E YPY G
Sbjct: 175 RTGRLVSLSEQQLVDCDVYGD-------DEGCAGGLMDNAFEYMINRGGLTTESSYPYRG 227
Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
TD GSC+ +S AA++ + + ++ + V H P++
Sbjct: 228 TD-GSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVS 265
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 89/248 (35%), Positives = 133/248 (53%), Gaps = 15/248 (6%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
+N + L+K K+ KTY + E + R +++ N +D + V +F+DLT
Sbjct: 23 VNDAEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSSFQLEVNEFADLTA 82
Query: 107 SEFRRQFLGLNR-RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF + G + R R + +P DWR G VT VK+Q CGSCW+FS
Sbjct: 83 EEFSSIYNGYGKGRNRENHENTTIYRYTGGAIPDSVDWRTKGLVTPVKNQKQCGSCWAFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG+LEGAH TG+LVSLSEQ LVDCD + D GC GGLM +AF+YI + G
Sbjct: 143 TTGSLEGAHAKKTGKLVSLSEQNLVDCDKK---------DHGCQGGLMTTAFKYIEENKG 193
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVS-NFSVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E+ YPY + G C+F K I A V + S++++D + + + + GP++ +++
Sbjct: 194 IDTEESYPYKAKN-GRCEFKKDDIGATVERHVSILTTDCEALKKAVAEIGPIS---VAMD 249
Query: 285 LPHISFSF 292
H SF
Sbjct: 250 ASHSSFQL 257
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 102/282 (36%), Positives = 146/282 (51%), Gaps = 32/282 (11%)
Query: 28 DAMIRQVVPSDGEQSEDH------LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
DA+++Q + +D +S H L+N E + FK + K Y + E +R ++F N
Sbjct: 7 DAVVQQKLTND--ESRTHAVSFFELVNQE--WMTFKMEHKKVYKSDVEERFRMKIFMDNK 62
Query: 82 RR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAP-----IL 132
+ AK + V + K+ D+ EF G N+ + +++ P I
Sbjct: 63 HKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIE 122
Query: 133 PTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVD 191
P N LP DWR GAVT VKDQG CGSCWSFSATGALEG HF TG LVSLSEQ L+D
Sbjct: 123 PANVVLPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLID 182
Query: 192 CDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAA 251
C + ++GCNGGLM+ AF+YI G++ E YPY + C+++ + A
Sbjct: 183 CSGKYG-------NNGCNGGLMDQAFQYIKDNKGLDTEASYPYE-AENDKCRYNPANSGA 234
Query: 252 A-VSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
V + + DE + A + GP++ +I+ H SF F
Sbjct: 235 IDVGYIDIPTGDEKLLKAAVATIGPVS---VAIDASHQSFQF 273
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 96/248 (38%), Positives = 130/248 (52%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +K+ SK Y QEE +R +++ NL+ + L H G+ F D+T
Sbjct: 28 HWQAWKTWHSKKYHQQEE-GWRRMIWEKNLKMIQLHNLDHSLGKHSYRLGMNHFGDMTNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + + + L N L P DWR+ G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQVMNGY-KHSKTEKKYRGSEFLEPNFLVVPKSVDWREKGYVTPVKDQGQCGSCWAFS 145
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG+LEG HF TG+LVSLSEQ LVDC PE + GCNGGLM+ AFEYI GG
Sbjct: 146 TTGSLEGQHFRKTGKLVSLSEQNLVDCSR---PEG----NQGCNGGLMDQAFEYIADNGG 198
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E+ YPY D C + AA + F V E + + GP++ +I+
Sbjct: 199 IDSEESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHERALMKAVAAVGPVS---VAID 255
Query: 285 LPHISFSF 292
H +F F
Sbjct: 256 ASHSTFQF 263
>gi|323451241|gb|EGB07119.1| hypothetical protein AURANDRAFT_54023 [Aureococcus anophagefferens]
Length = 377
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/229 (39%), Positives = 128/229 (55%), Gaps = 7/229 (3%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGV-TKFSDLTPSEFRR 111
F F +KF KTY T EE +R VF N + G+ +F+D T EF
Sbjct: 65 FMTFMTKFEKTYETVEEWAHRLTVFAQNAKIVLEHDAKAEGFALGLDNQFADWTAEEFA- 123
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+ L+ R + P+ A + PT DWR G V +K+QG+CGSCW+FS ++E
Sbjct: 124 SYQKLHSRPK-PSQAGATHEVSDKAAPTAVDWRTEGVVADIKNQGSCGSCWTFSTVVSIE 182
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVERE 229
GA TG+LV+LSEQ LVDC + + C GC+GGLM++AF+YI+K GG++ E
Sbjct: 183 GAAARKTGKLVTLSEQNLVDCVKKDQIDGGDECCMGCSGGLMDNAFDYIIKNQDGGIDTE 242
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLA 277
Y YTG D G+C FDK+ + A +SN++ V DE +A L GP++
Sbjct: 243 ASYGYTGKD-GTCAFDKANVGATISNWTDVAVGDEVALADALANAGPVS 290
>gi|118401108|ref|XP_001032875.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89287220|gb|EAR85212.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 360
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/224 (40%), Positives = 118/224 (52%), Gaps = 11/224 (4%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F FK K++KTY E YRF VF N R + GV +F+DLT EF
Sbjct: 42 ERAFKNFKVKYAKTYKDDTEEQYRFSVFTNNYVEIYRHNKFLVFSKVGVNQFADLTHEEF 101
Query: 110 RRQFLGLNRRLRLPADAQKA--PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
+ + G D K P LPT++LP FDWRD GA+T VK Q CG CW+FS
Sbjct: 102 KALYTGHKHSKDDDDDDNKNKQPHLPTDNLPASFDWRDKGAITPVKVQNGCGGCWAFSTV 161
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
++EG +FL TG+L SLS QQ++DC C +E SGC GG AF I GG+
Sbjct: 162 QSIEGLYFLKTGKLESLSTQQVIDC---CRIDE-----SGCLGGDPEPAFRCIQNNGGIM 213
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLV 271
E +YPY SCKFD+ K + + + SD+ Q+ A L+
Sbjct: 214 TETEYPYIAKQ-QSCKFDEDKPTFQIGGYIDVPSDQSQVKAALL 256
>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
Length = 360
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/277 (38%), Positives = 146/277 (52%), Gaps = 24/277 (8%)
Query: 9 LLLLLLSSVLASAVA---VNDDDAMIRQVVPSDGEQSEDHLLNA----EHHFS--LFKSK 59
L L++ + ASA+A D+ IRQVV + E+ +L H S F +
Sbjct: 8 LALVVAGGLFASALAGPATFADENPIRQVVSDGLHELENAILQVVGKTRHALSSARFAHR 67
Query: 60 FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
+ K Y + EE RF VF NL+ + + GV +F+DLT EFRR LG +
Sbjct: 68 YGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQN 127
Query: 120 LRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+ K + TN LP WR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 128 C---SATTKGNLKVTNVVLPETKGWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAF 184
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+ YPYTG +
Sbjct: 185 GKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKN 237
Query: 239 GGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
G CKF + V N ++ + DE + A LV+
Sbjct: 238 -GLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVR 273
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/244 (37%), Positives = 143/244 (58%), Gaps = 22/244 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E F F++K+ K Y + E +YR +V N+ ++ + + G+T F+D+T +EF
Sbjct: 24 EKLFQTFEAKYGKNYLSSE-REYRKKVLAYNMDWIEKFNSDEHSFTLGMTPFADMTNTEF 82
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDLPTD-FDWRDHGAVTGVKDQGACGSCWSFSATG 168
L ++ P + ++A +L N++ + DWR+ GAVT VK+QG+CGSCW+FSATG
Sbjct: 83 ATS--KLCGCMKKPLNHKQARVL--NNMAVESIDWREKGAVTPVKNQGSCGSCWAFSATG 138
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
ALEG +F++TG+LVSLSEQQLVDCD E D+GC GG M++AFEY++K G+
Sbjct: 139 ALEGGNFVATGKLVSLSEQQLVDCDTE---------DAGCGGGFMDTAFEYVMKK-GLCT 188
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
E+DYPY D CK D+ +++ + + +++ + P+ S+ +
Sbjct: 189 EEDYPYHAKD-EDCKDDQCTSVISITGYEDVPANDGVALKQALTKAPV-----SVAIQAD 242
Query: 289 SFSF 292
SF F
Sbjct: 243 SFVF 246
>gi|1019667|gb|AAA79287.1| rangelipain, partial [Trypanosoma rangeli]
Length = 263
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 95/236 (40%), Positives = 122/236 (51%), Gaps = 22/236 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK + K Y + E +R VFK NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHASFGVTPFSDLTREEFRS 96
Query: 112 QFLGLNRRLRLPADAQKAPILPT------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
++ + A AQK +P P DWR GAVT +KDQG CGSCW+FS
Sbjct: 97 RY---HNAAAHFAAAQKRVRVPVEVEVEVGGAPAAVDWRARGAVTAIKDQGGCGSCWAFS 153
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KA 223
G +EG L+ L LSEQ LV CD+ D+GC+GGLM+SAF++I+
Sbjct: 154 TIGNIEGQWHLAGNPLTGLSEQMLVSCDN---------ADNGCDGGLMDSAFDWIVGQNN 204
Query: 224 GGVEREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E Y Y G D +C + A +S + DED+MAA L +GPLA
Sbjct: 205 GSVYTEASYSYVSGGGDSQTCNMSSHVVGAVISGHVDLPQDEDKMAAWLAVNGPLA 260
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 143/263 (54%), Gaps = 22/263 (8%)
Query: 40 EQSEDHLLNAEHH----FSLFKSKFSKTYATQ-EEHDYRFRVFKANLRRAKRRQLLDPTA 94
EQ E LL+A+ + F + +++K YA +E + RF V+ NL +
Sbjct: 28 EQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTSH 87
Query: 95 VHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA-QKAPIL----PTNDLPTDFDWRDHGAV 149
+ F+DLT EFR + LG + + R ++ Q +P + N LPT+ DWR GAV
Sbjct: 88 WLHLNAFADLTTDEFRNR-LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAV 146
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T VK+QG CGSCW+F+ TG++EG + + TGEL SLSEQ+LVDCD + D GC+
Sbjct: 147 TEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLSEQELVDCDTD--------EDRGCS 198
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GGLM+ A+++I+K GG++ E DYPYT DG K++ + + I +++
Sbjct: 199 GGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKK 258
Query: 270 LVKHGPLAGNVASIELPHISFSF 292
H P+A +IE SF
Sbjct: 259 AAAHQPIA---VAIEADAKSFQL 278
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 96/253 (37%), Positives = 130/253 (51%), Gaps = 19/253 (7%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDL 104
+ ++ FK + K Y ++ E +R ++F N + + L ++ + K+ DL
Sbjct: 23 VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDL 82
Query: 105 TPSEFRRQFLGLNRRL----RLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACG 159
EF G NR R I P + D+P DWR GAVT VKDQG CG
Sbjct: 83 LHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDIPDTVDWRQEGAVTPVKDQGHCG 142
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCWSFSATGALEG HF T +LVSLSEQ LVDC S ++GCNGGLM++AF Y
Sbjct: 143 SCWSFSATGALEGQHFRQTKKLVSLSEQNLVDC-------SSRFGNNGCNGGLMDNAFRY 195
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I GG++ E YPY G D K++ A + S DED++ A + GP++
Sbjct: 196 IKNNGGIDTEAAYPYMGEDEKFRYSAKNRGATDKGFVDIPSGDEDKLKAAVATVGPIS-- 253
Query: 280 VASIELPHISFSF 292
+I+ H SF
Sbjct: 254 -IAIDASHESFQL 265
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 102/256 (39%), Positives = 137/256 (53%), Gaps = 21/256 (8%)
Query: 27 DDAMIRQVVPSDGEQSED---HLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
D+ IRQVV + E H++ H F+ F ++ K Y + EE RF +F N
Sbjct: 27 DENPIRQVVSDSFHELESGILHVVGQTRHALSFARFARRYGKRYDSVEEIKQRFDIFLDN 86
Query: 81 LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
L + GV +FSDLT EFRR LG + A + L LP
Sbjct: 87 LEMINSHNDKGLSYKLGVNEFSDLTWDEFRRDRLGAAQNC--SATTKGNLKLRDAVLPET 144
Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
DWR+ G V+ VK+QG CGSCW+FS TGALE A+ G+ +SLSEQQLVDC
Sbjct: 145 KDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYTQKFGKGISLSEQQLVDC-------- 196
Query: 201 SGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS---NF 256
+G+ ++ GCNGGL + AFEYI GG+E E+ YPYTG + G CKF + V+ N
Sbjct: 197 AGAFNNFGCNGGLPSQAFEYIKSNGGLETEEAYPYTGKN-GLCKFSSQNVGVKVTDSVNI 255
Query: 257 SVISSDEDQMAANLVK 272
++ + DE + A LV+
Sbjct: 256 TLGAEDELKYAVALVR 271
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/237 (37%), Positives = 132/237 (55%), Gaps = 14/237 (5%)
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
SK K+Y + EE +RF VF+ NL+ + G+ +F+DL+ EF+R++LGL
Sbjct: 2 SKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLK 61
Query: 118 RRLRLPADA-QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
L D+ ++ DLP DWR GAV VK+QGACGSCW+FS A+EG + +
Sbjct: 62 IELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQI 121
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TG L +LSEQ+L+DCD ++GCNGGLM+ AF +I+ GG+ +E+DYPY
Sbjct: 122 VTGNLTALSEQELIDCDK--------PFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV- 172
Query: 237 TDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+ G+C K ++ +S + + D +Q + + PL+ +IE F F
Sbjct: 173 MEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLS---VAIEASSRGFQF 226
>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
Length = 335
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 92/250 (36%), Positives = 133/250 (53%), Gaps = 20/250 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
++H+ +K KTYA +EE +R +++ NL+ + L H G+ +F D+T
Sbjct: 26 DNHWYSWKDWHKKTYAPKEE-GWRRVLWEKNLKMIEFHNLDHSLGKHSYRLGMNQFGDMT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF++ G + + AP + P DWR G VT VKDQG CGSCW+FS
Sbjct: 85 NEEFKQLMNGYKNQKMIRGSTFLAP--NNFEAPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG H+ T +L+SLSEQ LVDC + GCNGGLM+ AF+Y+ GG
Sbjct: 143 TTGALEGQHYRKTSKLISLSEQNLVDC-------SRAQGNEGCNGGLMDQAFQYVKDNGG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS--DEDQMAANLVKHGPLAGNVASI 283
++ E YPYT D C +D + +A + F + S ++D M A + GP++ +I
Sbjct: 196 IDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCEKDLMKA-VASVGPVS---VAI 251
Query: 284 ELPHISFSFL 293
+ H SF F
Sbjct: 252 DAGHQSFQFY 261
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 137/258 (53%), Gaps = 31/258 (12%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M++L+ SL L L+ +V A+ N+ D +SE L N + ++S
Sbjct: 3 MKKLLFISLSLALIFTV-ANTFDFNEHDL-----------ESEKSLWNL---YERWRSHH 47
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---N 117
+ T E+H+ RF VFKAN+ LD + KF D+T EFRR + +
Sbjct: 48 TVTRNLDEKHN-RFNVFKANVMHVHNTNKLDKPYKLKLNKFGDMTNYEFRRIYADSKISH 106
Query: 118 RRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
R+ + + N D+P+ DWR+ GAVTGVKDQG CGSCW+FS A+EG +
Sbjct: 107 HRMFRGMSHENGTFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQ 166
Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
+ T +LVSLSEQQLVDCD E + GCNGGLM AFE+I K G+ E +YPY
Sbjct: 167 IKTQKLVSLSEQQLVDCDTE--------ENEGCNGGLMEYAFEFI-KQNGITTESNYPYA 217
Query: 236 GTDGGSCKFDKSKIAAAV 253
D G+C +K A ++
Sbjct: 218 AKD-GTCDVEKEDKAVSI 234
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 132/258 (51%), Gaps = 24/258 (9%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK KF + Y EE YR VF NL+ K+ + + T + +F
Sbjct: 13 LAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQF 72
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP--TDFDWRDHGAVTGVKDQGACG 159
SD+T +F G + R PA A T+ P T+ DWR GAVT VKDQG CG
Sbjct: 73 SDMTNEKFNAVMKGYKKGPR-PA----AVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCG 127
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFE 218
SCW+FS TG +EG HFL TG LVSLSEQQLVDC GS + GCNGG + A
Sbjct: 128 SCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC-------AGGSYYNQGCNGGWVERAIM 180
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLA 277
Y+ GGV+ E YPY D +C+F+ + I A + + I+ + + GP++
Sbjct: 181 YVRDNGGVDTESSYPYEARD-NTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPIS 239
Query: 278 GNVASIELPHISFSFLFT 295
+I+ H SF +T
Sbjct: 240 ---VAIDASHRSFQSYYT 254
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 150/284 (52%), Gaps = 23/284 (8%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
+ R LS LL++ ++ A +++ D ++ +++D ++ + + K
Sbjct: 3 LHRSSLSLFLLMIFTASSAVDMSIVSYD---QRHADKSSWRTDDEVM---AMYEAWLVKH 56
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---- 116
K Y E + RF +FK NLR + T G+ +F+DLT E+R +LG+
Sbjct: 57 GKAYNALGEKEKRFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGA 116
Query: 117 ---NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
R++ +D A + + LP DWR GAV GVKDQG+CGSCW+FS A+EG
Sbjct: 117 TRVTRKVSRKSDRFAARV--GDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGI 174
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ GG++ E+DYP
Sbjct: 175 NQIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDSEEDYP 226
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y D ++ K+ ++ + + +++ V P++
Sbjct: 227 YRAADQKCDQYRKNANVVSIDGYEDVPENDEAALKKAVAKQPVS 270
>gi|28971813|dbj|BAC65418.1| cathepsin L [Pandalus borealis]
Length = 318
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/243 (39%), Positives = 131/243 (53%), Gaps = 22/243 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-----RQLLDPTAVHGVTKFSDLTPSEFR 110
FK +K Y +E YR +F+ N + + RQ L T + +F D+T EF
Sbjct: 21 FKLTHAKVYTHGKEDLYRRSIFENNQKVVEEHNERFRQGL-VTFDLKMNRFGDMTTEEFV 79
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
Q GLN+ R + P + DWRD GAVT VKDQG CGSCW+FS TGAL
Sbjct: 80 SQMTGLNKVERTVG--KVFAHYPEVERADTVDWRDKGAVTPVKDQGQCGSCWAFSTTGAL 137
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EGAHFL G+LVSLSEQ LVDC E +SGCNGG++ A++YI G++ E
Sbjct: 138 EGAHFLKHGDLVSLSEQNLVDCSTE---------NSGCNGGVVQWAYDYIKSNNGIDTES 188
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELPHIS 289
YPY D +C+FD + + A V+ ++ I +DE A+ + GP++ I+ H S
Sbjct: 189 SYPYEAQD-LTCRFDAAHVGATVTGYADIPYADEVTQASAVHDDGPVS---VCIDAGHNS 244
Query: 290 FSF 292
F
Sbjct: 245 FQL 247
>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
Length = 354
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 126/225 (56%), Gaps = 17/225 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
F+ F S+F K+Y ++EE R+ +F NLR R+ ++ L T V F+D T EF+
Sbjct: 55 FARFVSRFGKSYQSEEEMKERYEIFSQNLRFIRSHNKKRLPYTL--SVNHFADWTWEEFK 112
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
R LG + + L LP DWR G V+ VKDQG+CGSCW+FS TGAL
Sbjct: 113 RHRLGAAQNCSATLNGNHK--LTDAVLPPTKDWRKEGIVSSVKDQGSCGSCWTFSTTGAL 170
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
E A+ + G+ +SLSEQQLVDC + + GC+GGL + AFEYI GG+E E+
Sbjct: 171 EAAYAQAFGKSISLSEQQLVDCAGPFN-------NFGCHGGLPSQAFEYIKYNGGLETEE 223
Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
YPYTG D G CKF +A V N ++ + DE + A V+
Sbjct: 224 AYPYTGKD-GVCKFSAENVAVQVLDSVNITLGAEDELKHAVAFVR 267
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 87/224 (38%), Positives = 130/224 (58%), Gaps = 15/224 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
+ F + SK Y ++E RF ++++N++ L +F+D+T SEF
Sbjct: 40 KQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEF 99
Query: 110 RRQFLGLNRR-LRLPADAQKAPIL-PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
+ FLGLN LRL ++ P+ P ++P DWR GAVT +++QG CG CW+FSA
Sbjct: 100 KAHFLGLNTSSLRL--HKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAV 157
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
A+EG + + TG LVSLSEQQL+DCD G+ + GC+GGLM +AFE+I GG+
Sbjct: 158 AAIEGINKIKTGNLVSLSEQQLIDCD-------VGTYNKGCSGGLMETAFEFIKSNGGLT 210
Query: 228 REKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDED--QMAA 268
E DYPYTG + G+C +K+K + + ++ +E Q+AA
Sbjct: 211 TETDYPYTGIE-GTCDQEKAKNKVVTIQGYQKVAQNEASLQIAA 253
>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 347
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/283 (35%), Positives = 149/283 (52%), Gaps = 29/283 (10%)
Query: 9 LLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+LL LL+S +++ + DD ++ + V E F + K KTYAT
Sbjct: 10 ILLFLLASFTDVSLSFDPLDDFVMSESVQRAAE------------FERWTIKHKKTYATA 57
Query: 68 EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN-RRLRLPAD 125
EE+++R RV+ AN KR + P + +F+DLT +EF+R +L + + R
Sbjct: 58 EEYNWRLRVYTANHYYVKRLNEGHGPATEFELNQFADLTFAEFKRIYLSSSSQHCRATTG 117
Query: 126 AQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
+ P+ N + P DWR +T V+DQG+CGSCW+FSAT L L TG+L+SL
Sbjct: 118 NFQMPVKKNNVEDPVAIDWRKRNVITPVRDQGSCGSCWAFSATSCLSAHLALKTGQLISL 177
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
S+QQL+DC + + GC GGL + AFEYI GG+E E+DYPY + C F
Sbjct: 178 SKQQLLDCSRSFN-------NRGCKGGLPSQAFEYIRYNGGIESERDYPYKDRE-EKCHF 229
Query: 245 DKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAGNVASIE 284
S +AA V+ NF+ ED +A L GP++ + S +
Sbjct: 230 KPSLVAATVTGVVNFT--QGAEDDIAVALANIGPVSIGIHSTK 270
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 129/232 (55%), Gaps = 11/232 (4%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTP 106
A + L+ ++ ++Y EH+ RFRVF NLR A + D G+ +F+DLT
Sbjct: 50 ARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTN 109
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
EFR FLG R A ++ +LP DWR+ GAV VK+QG CGSCW+FSA
Sbjct: 110 EEFRATFLGAKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
+E + L TGE+++LSEQ+LV+C + +SGCNGGLM+ AF++I+K GG+
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVEC-------STNGQNSGCNGGLMDDAFDFIIKNGGI 222
Query: 227 EREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ E DYPY D G C ++ ++ F + ++++ V H P++
Sbjct: 223 DTEDDYPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVS 273
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 131/249 (52%), Gaps = 18/249 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSD 103
N + + +K+ + Y EE +R V++ N+R + HG T + D
Sbjct: 24 NLDTQWYQWKATHKRLYGLNEE-GWRRAVWEKNMRMIELHNGEYSQGKHGFTMGMNAYGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ G + + P+L P DWR+ G VT VK+QG CGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKHKKGKMFRDPLLL--QYPKSVDWREKGYVTPVKNQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+L+SLSEQ LVDC H P+ + GCNGGLM+ AF+Y+
Sbjct: 141 FSATGALEGQMFQKTGKLISLSEQNLVDCSH---PQG----NQGCNGGLMDYAFQYVKDN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
G++ E+ YPY G D G+CK+ A + F I E + + GP++ A+I
Sbjct: 194 SGLDSEESYPYEGMD-GTCKYKPECSVANDTGFVDIPGHEKALLRAVATVGPIS---AAI 249
Query: 284 ELPHISFSF 292
+ H+SF F
Sbjct: 250 DAGHMSFQF 258
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 129/226 (57%), Gaps = 15/226 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + K K+Y T +E R+ VF+ N+ + + G+ +DLT EF++
Sbjct: 32 FQNWMVKHQKSY-TNDEFGSRYSVFQDNMDIVAKWNQKGSNTILGLNVMADLTNEEFKKL 90
Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
+LG + +K ++ + LP DWR +GAVT VK+QG CG C++FS TG++EG
Sbjct: 91 YLGTKANVTY----KKKTLVGVSGLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEG 146
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVEREKD 231
H +++ +LV LSEQQ++DC SGS ++GC+GGLM ++FEYI+ GG++ E
Sbjct: 147 IHEITSQQLVPLSEQQILDC--------SGSEGNNGCDGGLMTNSFEYIIAVGGLDTEAS 198
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPYTG + G CKF+K I A ++ + + S + V P++
Sbjct: 199 YPYTG-EVGKCKFNKKNIGATITGYKNVESGSESDLQTAVAAQPVS 243
>gi|29789900|gb|AAF21457.2|U56958_1 cysteine proteinase [Paragonimus westermani]
Length = 272
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 120/208 (57%), Gaps = 18/208 (8%)
Query: 83 RAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPT 139
RA++ QL D TA +GVT+FSDLTP EF ++L + Q + PT P
Sbjct: 2 RAQKLQLKDQGTARYGVTQFSDLTPEEFAAKYLSAPVN-----NDQVKRVRPTGLKAAPE 56
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DWR GAVT V++QG+CGSCW+FS G +EG F+ TG+LVSLS+QQLVDCD D
Sbjct: 57 RIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD-- 114
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
GCNGG S++ I+ GG+E + DYPY G C +K ++ A + + +
Sbjct: 115 -------GCNGGWPASSYLEIMHMGGLESQDDYPYAGVK-EQCFMEKERLLAKIDDSIAL 166
Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPH 287
ED AA L +HGPL+ + +I L +
Sbjct: 167 XPSEDDNAAYLAEHGPLSTLLNAITLQY 194
>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
Length = 335
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 122/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT +E R F+ NL + Q +P A G+TKF DL+ EF +
Sbjct: 30 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 89
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 90 YLSGATHFAKAKKFASQYYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 149
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +L+T L+SLSEQ+LV CD D GCNGGLM AF+++L + G V
Sbjct: 150 NIESKWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMGQAFDWLLNNRNGAV 200
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G + +S I A + I S+ED MAA L +GP+A
Sbjct: 201 YTGASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIA 254
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 85/212 (40%), Positives = 121/212 (57%), Gaps = 22/212 (10%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ + ++ + ++ TY E + RF F+ NLR +
Sbjct: 28 IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
VH G+ +F+DLT E+R +LG +R +L A Q A ++LP DWR
Sbjct: 85 VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKK 141
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV VKDQG CGSCW+FSA A+EG + + TG+++ LSEQ+LVDCD S +
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQ 193
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GCNGGLM+ AFE+I+ GG++ E+DYPY D
Sbjct: 194 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERD 225
>gi|42516556|gb|AAS17989.1| cysteine proteinase CP2 [Paragonimus westermani]
Length = 272
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 120/208 (57%), Gaps = 18/208 (8%)
Query: 83 RAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPT 139
RA++ QL D TA +GVT+FSDLTP EF ++L + Q + PT P
Sbjct: 2 RAQKLQLKDQGTARYGVTQFSDLTPEEFAAKYLSAPVN-----NDQVKRVRPTGLKAAPE 56
Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
DWR GAVT V++QG+CGSCW+FS G +EG F+ TG+LVSLS+QQLVDCD D
Sbjct: 57 RIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD-- 114
Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
GCNGG S++ I+ GG+E + DYPY G C +K ++ A + + +
Sbjct: 115 -------GCNGGWPASSYLEIMHMGGLESQDDYPYAGVK-EQCFMEKERLLAKIDDSIAL 166
Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPH 287
ED AA L +HGPL+ + +I L +
Sbjct: 167 GPSEDDNAAYLAEHGPLSTLLNAITLQY 194
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 95/253 (37%), Positives = 134/253 (52%), Gaps = 19/253 (7%)
Query: 40 EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT 99
E E H +A FS F++ ++K+YAT+EE R+ +FK NL + +
Sbjct: 106 EWKEAHFQDA---FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMN 162
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPAD-----AQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
F DL+ EFRR++LG + L + + +LP+ +LP DWR G VT VKD
Sbjct: 163 HFGDLSRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKD 221
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
Q CGSCW+FS TGALEGAH TG+LVSLSEQ+L+DC + C+GG MN
Sbjct: 222 QRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSR-------AEGNQSCSGGEMN 274
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKH 273
AF+Y+L +GG+ E YPY D C+ + + F V E M A L K
Sbjct: 275 DAFQYVLDSGGICSEDAYPYLARD-EECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK- 332
Query: 274 GPLAGNVASIELP 286
P++ + + ++P
Sbjct: 333 SPVSIAIEADQMP 345
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 95/252 (37%), Positives = 135/252 (53%), Gaps = 25/252 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
FK + K Y ++ E +R +++ N + AK QL + V G K++D+ EF +
Sbjct: 31 FKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDMLHHEFIQ 90
Query: 112 QFLGLNRRLR-------LPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCW 162
G NR + D + A +P + P DW GAVT VKDQG CGSCW
Sbjct: 91 AMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQGKCGSCW 150
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FS TGALEG HF +G LVSLSEQ L+DC S ++GCNGGLM++AF+YI
Sbjct: 151 AFSTTGALEGQHFRKSGYLVSLSEQNLIDC-------SSTYGNNGCNGGLMDNAFKYIKD 203
Query: 223 AGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
GG++ EK YPY G D C+++ K+ A V + S DE+++ + GP++
Sbjct: 204 NGGIDTEKTYPYEGVD-DKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVS---V 259
Query: 282 SIELPHISFSFL 293
+I+ SF F
Sbjct: 260 AIDASQNSFQFY 271
>gi|1581746|prf||2117247B Cys protease:ISOTYPE=2
Length = 467
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 95/236 (40%), Positives = 122/236 (51%), Gaps = 22/236 (9%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK + K Y + E +R VFK NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHASFGVTPFSDLTREEFRS 96
Query: 112 QFLGLNRRLRLPADAQKAPILPT------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
++ + A AQK +P P DWR GAVT +KDQG CGSCW+FS
Sbjct: 97 RY---HNAAAHFAAAQKRVRVPVEVEVEVGGAPAAVDWRARGAVTAIKDQGGCGSCWAFS 153
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KA 223
G +EG L+ L LSEQ LV CD+ D+GC+GGLM+SAF++I+
Sbjct: 154 TIGNIEGQWHLAGNPLTGLSEQMLVSCDNA---------DNGCDGGLMDSAFDWIVGQNN 204
Query: 224 GGVEREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E Y Y G D +C + A +S + DED+MAA L +GPLA
Sbjct: 205 GSVYTEASYSYVSGGGDSQTCNMSSHVVGAVISGHVDLPQDEDKMAAWLAVNGPLA 260
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 82/207 (39%), Positives = 120/207 (57%), Gaps = 23/207 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH---GVTKFSDLTPSEF 109
F ++ + K Y E + R+R FK NL+ + A+ G+ KF+DL+ EF
Sbjct: 50 FQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFADLSNEEF 109
Query: 110 RRQFLGLNRRLRLPADAQKAPI-------LPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+ +L +++ P + +++ L T D P+ DWR G VT VKDQG CGSCW
Sbjct: 110 KELYLS---KVKKPINIKRSTARDWRQRNLQTCDAPSSLDWRKKGVVTAVKDQGDCGSCW 166
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
SFS TGA+EG + + TG+L+SLSEQ+LVDCD + + GC GG M+ AFE+++
Sbjct: 167 SFSTTGAIEGINAIVTGDLISLSEQELVDCD---------TTNYGCEGGYMDYAFEWVIN 217
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI 249
GG++ E +YPYTG D G+C K +I
Sbjct: 218 NGGIDTEANYPYTGVD-GTCNTTKEEI 243
>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
Length = 379
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R +F N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 82 FRNFVITYNRTYESKEEAQWRLSIFAHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 142 IYL--NPLLREEPGKKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 199
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL +SA+ I GG+E E
Sbjct: 200 EGQWFLNQGTLLSLSEQELLDCDK---------IDKACMGGLPSSAYSAIKNLGGLETED 250
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G +C F K +++ +S +E ++AA L K GP++
Sbjct: 251 DYSYRG-HMQACSFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 296
>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
Length = 323
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/245 (36%), Positives = 137/245 (55%), Gaps = 23/245 (9%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K IL P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVIILDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN---FSVISSDEDQMAANLVKHGPLAG 278
K GGV+ E DYPY D +C+ + +K V + + ++ ++ + LV P+A
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 279 NVASI 283
+ A I
Sbjct: 247 DAADI 251
>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
Length = 334
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 91/253 (35%), Positives = 132/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK++G CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNKGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/248 (35%), Positives = 127/248 (51%), Gaps = 16/248 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ + +FK + +K Y +E YR VF + ++ L VH G+ +++D+
Sbjct: 19 DREWGMFKVRHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMP 78
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF R G + + P P DLP DWR G VT VK+QG CGSCW+FS
Sbjct: 79 NEEFVRVMNGYKMQEQRPKAPTYMPPSNVGDLPATVDWRTKGYVTEVKNQGQCGSCWAFS 138
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
+TG+LEG F +L+SLSEQ LVDC E + GC GGLM+ AF YI G
Sbjct: 139 STGSLEGQTFKKYNKLISLSEQNLVDCSTE-------QGNMGCGGGLMDQAFTYIKVNDG 191
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY G C+F+K+ + A + ++ I S E + + + GP+A +I+
Sbjct: 192 IDTETSYPYEAAS-GKCRFNKANVGANDTGYTDIKSKSESDLQSAVATVGPIA---VAID 247
Query: 285 LPHISFSF 292
H+SF
Sbjct: 248 ASHMSFQL 255
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 92/248 (37%), Positives = 132/248 (53%), Gaps = 20/248 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ +K+ K Y +EE +R +++ NLR+ + L +H G+ F D+
Sbjct: 28 HWEQWKTWHGKNYHEKEE-GWRRMIWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G + + + + N ++P+ DWR+ G VT VKDQG CGSCW+FS
Sbjct: 87 EFRQVMNGYKHKTE--RKFKGSLFMEPNFLEVPSKLDWREKGYVTPVKDQGECGSCWAFS 144
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGA+EG F G+LVSLSEQ LVDC PE + GCNGGLM+ AF+YI G
Sbjct: 145 TTGAMEGQMFRKQGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNNG 197
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E+ YPY GTD C +D AA + F + S E + + GP++ +I+
Sbjct: 198 LDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPSGKEHALMKAVASVGPVS---VAID 254
Query: 285 LPHISFSF 292
H SF F
Sbjct: 255 AGHESFQF 262
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/191 (45%), Positives = 118/191 (61%), Gaps = 18/191 (9%)
Query: 69 EHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN----RRL-RL 122
E D RF +FK NLR ++ D + G+ +F+DLT E+R +LG RR+ +
Sbjct: 66 EKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEYRSTYLGAKTDARRRIAKT 125
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
+D + AP LP DWR+ GAV VKDQG+CGSCW+FS A+EG + + TGEL+
Sbjct: 126 KSDRRYAP-KAGGSLPDSIDWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGELI 184
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ E DYPYTG G
Sbjct: 185 SLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEADYPYTGRYG--- 233
Query: 243 KFDKSKIAAAV 253
+ D+++ A V
Sbjct: 234 RCDQTRKNAKV 244
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 93/249 (37%), Positives = 133/249 (53%), Gaps = 20/249 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
F +K KF ++Y + E +R +++ N + +L + G+T F+D+ E
Sbjct: 26 FHAWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEE 85
Query: 109 FRR---QFLGLNRRLRLPADAQKAPILPT-NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
++R Q + LP LP DLP DWRD G VT VKDQ CGSCW+F
Sbjct: 86 YKRVISQGCLHSFNASLPRRGSTFFRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSCWAF 145
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SATG+LEG HF TG LVSLSEQQLVDC + + GC GGLM+ AF+YI G
Sbjct: 146 SATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYG-------NMGCMGGLMDYAFQYIQANG 198
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASI 283
G++ E+ YPY + G C+++ I A + ++ +S DED + + GP++ I
Sbjct: 199 GIDTEESYPYE-AENGKCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGPIS---VGI 254
Query: 284 ELPHISFSF 292
+ +SF F
Sbjct: 255 DASQMSFQF 263
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 83/200 (41%), Positives = 118/200 (59%), Gaps = 12/200 (6%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
S+D +L+ H + + S+ Y + E RF++FK NL + + G+ K
Sbjct: 43 HSDDGMLDVFHQW---LERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEKSYWLGLNK 99
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF-DWRDHGAVTGVKDQGACG 159
FSDLT EFR +LG+ R + + + + DWR GAV+ VKDQG+CG
Sbjct: 100 FSDLTHDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEMVDWRKKGAVSDVKDQGSCG 159
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA G++EG + + TGEL+SLSEQ+LVDCD + GCNGGLM+ AF++
Sbjct: 160 SCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDR--------GQNQGCNGGLMDYAFDF 211
Query: 220 ILKAGGVEREKDYPYTGTDG 239
I+K GG++ E+DYPY TDG
Sbjct: 212 IIKNGGIDTEEDYPYKATDG 231
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 14/239 (5%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFS 102
+D L+ A H + +++ + Y+ E R VFKAN+ + + +F+
Sbjct: 25 DDWLIAARHE--QWMARYGRVYSDVAEKARRLEVFKANVGFIESVNAGNHKFWLEANQFA 82
Query: 103 DLTPSEFRRQFLGLNRRL---RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
D+T EFR G ++ + A + + +DLP DWR +GAVT VKDQG CG
Sbjct: 83 DITKDEFRAMHKGYKMQVIGSKARATGFRYANVSIDDLPASVDWRANGAVTPVKDQGQCG 142
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
CW+FS ++EG +STG+L+SLSEQ+LVDCD G + GC GGLM++AFE+
Sbjct: 143 CCWAFSTVASMEGIVKVSTGKLISLSEQELVDCD-------VGMQNKGCGGGLMDNAFEF 195
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I+ GG++ E DYPYTG D G+C +K S IAA++ + + ++++ V P++
Sbjct: 196 IVNNGGLDTEADYPYTGAD-GTCNSNKESNIAASIKGYEDVPANDEASLQKAVAAQPVS 253
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 91/253 (35%), Positives = 129/253 (50%), Gaps = 23/253 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+HH+ L+K + K Y + E R +++ NL+ L +H G+ +D+T
Sbjct: 34 DHHWDLWKKTYGKQYTEENEEVTRRFIWEKNLKYVMLHNLEHSMGMHSYDLGMNHLADMT 93
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
E + L LR+P+ Q+ P LP DWRD G VT VK QG+CGSCW
Sbjct: 94 SEEV----MLLMSSLRVPSQWQRNVTFKSNPNQKLPDSMDWRDKGCVTEVKYQGSCGSCW 149
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA GALE L TG+LVSLS Q LVDC + GCNGG M AF+YI+
Sbjct: 150 AFSAVGALEAQLKLKTGKLVSLSVQNLVDCS------TGKYSNKGCNGGFMTEAFQYIID 203
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
G++ E YPY D G C++D AA S + + +E+ + + GP++
Sbjct: 204 NNGIDSEASYPYKAMD-GKCQYDVKNRAATCSKYVELPFGNEEALKEAVANKGPVS---V 259
Query: 282 SIELPHISFSFLF 294
+I+ H SF FL+
Sbjct: 260 AIDASHPSF-FLY 271
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 94/253 (37%), Positives = 131/253 (51%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
+ NA+ H +KS + + Y T EE ++R V++ N++ + HG T
Sbjct: 22 NQTFNAQWH--KWKSTYRRLYGTNEE-EWRRAVWEKNMKMIELHNGEYSEGKHGYTMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ LP DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQLVNGYKHQKHRKGKVFQEPLML--QLPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA GALEG L TG LVSLSEQ LVDC + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQ-------AEGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
+L G++ E+ YPY D G+CK+ AA + + I E + + GP+A
Sbjct: 190 VLNNKGLDSEESYPYEAKD-GTCKYKPEFAAANDTGYVDIPQLEKALMKAVATVGPIA-- 246
Query: 280 VASIELPHISFSF 292
+I+ H SF F
Sbjct: 247 -IAIDASHPSFQF 258
>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
Length = 484
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 137/238 (57%), Gaps = 14/238 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235
Query: 101 FSDLTPSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
FSDLT EFR +L N LR P + K + P ++DWR GAVT VKDQG CG
Sbjct: 236 FSDLTEEEFRTIYL--NTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCG 293
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG ++G FL+ G L+SLSEQ+L+DCD D C GGL ++A+
Sbjct: 294 SCWAFSVTGNVKGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSA 344
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 345 IKNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 401
>gi|258406688|gb|ACV72067.1| putative cysteine protease [Lathyrus sativus]
Length = 350
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 104/275 (37%), Positives = 149/275 (54%), Gaps = 26/275 (9%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
SLL++L A+A D IR V SD E+ ++ H F+ F +++ K Y
Sbjct: 5 SLLIVLFCVTTAAAGFSFHDSNPIRMV--SDAEEQLLQVIGESRHAVSFARFANRYGKLY 62
Query: 65 ATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
+ +E RF++F NL R +R+L + GV F+D T EF+ LG +
Sbjct: 63 DSVDEMKLRFKIFSENLELIRSTNKRRL---SYKLGVNHFADWTWEEFKSHRLGAAQNC- 118
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
A + + +LP + DWR G V+ VKDQG CGSCW+FS TGALE A+ + G+
Sbjct: 119 -SATLKGNHKITDANLPDEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKN 177
Query: 182 VSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
+SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG+E E+ YPYTG++ G
Sbjct: 178 ISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLETEETYPYTGSN-G 228
Query: 241 SCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
CKF +A V N ++ S DE + A +
Sbjct: 229 LCKFTSENVALKVLGSVNITLGSEDELKHAVAFAR 263
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 93/259 (35%), Positives = 139/259 (53%), Gaps = 23/259 (8%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
+D L+ + H + ++ +TYA E + R+ VFK N+ R +R + T V +
Sbjct: 29 DDELIMQKKH-DEWMAEHGRTYADMNEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQ 87
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPI------LPTNDLPTDFDWRDHGAVTGVKD 154
F+DLT EFR + G L + +Q + LP DWR GAVT +K+
Sbjct: 88 FADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFFGALPIAVDWRKKGAVTPIKN 147
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG+CG CW+FSA A+EGA + G+L+SLSEQQLVDCD + D GC+GGLM+
Sbjct: 148 QGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD---------TNDFGCSGGLMD 198
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
+AFE+I+ GG+ E +YPY G D +CK +K AA+++ + + +++ V H
Sbjct: 199 TAFEHIMATGGLTTESNYPYKGED-ANCKIKSTKPSAASITGYEDVPVNDENALMKAVAH 257
Query: 274 GPLAGNVASIELPHISFSF 292
P++ IE F F
Sbjct: 258 QPVS---VGIEGGGFDFQF 273
>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
Length = 517
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 133/227 (58%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 220 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 279
Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR P + K + P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 280 IYL--NSLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 337
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 338 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 388
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 389 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 434
>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
Length = 210
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 84/214 (39%), Positives = 123/214 (57%), Gaps = 21/214 (9%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----- 116
K Y + EE +RF +FK NL+ R + G+ +FSDL+ EF++ +LGL
Sbjct: 6 KIYESIEEKLHRFEIFKENLKHIDERNKIVSNYWLGLNEFSDLSHDEFKKMYLGLKVDHD 65
Query: 117 --NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
N + + D + + DLP DWR GAVT VK+QG CGSCW+FS A+EG +
Sbjct: 66 LLNNKKQSQQDFEYRDFV---DLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGIN 122
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
+ TG L SLSEQ+L+DCD + ++GCNGGLM+ AF++I+ GG+ +E DYPY
Sbjct: 123 QIKTGNLTSLSEQELIDCD--------TTYNNGCNGGLMDYAFQFIISNGGLHKEDDYPY 174
Query: 235 TGTDGGSC--KFDKSKIAAAVSNFSVISSDEDQM 266
+ G+C K D+S++ V ++DE +
Sbjct: 175 L-MEEGTCDEKRDESEVVTIDGYRDVPANDEQSL 207
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/279 (31%), Positives = 145/279 (51%), Gaps = 17/279 (6%)
Query: 7 SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD--GEQSEDHLLNAEHHFSLFKSKFSKTY 64
S +L++L+ L +A D + SD +S+ + N + + K +
Sbjct: 8 SPMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNI 67
Query: 65 ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------R 118
E+ D RF +FK NL+ + T G+ +F+DL+ E+R ++LG
Sbjct: 68 DGSEK-DKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMM 126
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R + + + LP DWR GAV VKDQG+CGSCW+FS A+EG + + T
Sbjct: 127 MARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVT 186
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GELVSLSEQ+LVDCD + ++GC+GGLM AFE+I+ GG++ ++DYPY G D
Sbjct: 187 GELVSLSEQELVDCDR--------TVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVD 238
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G ++ K+ ++ ++ + + ++ V + P++
Sbjct: 239 GKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPIS 277
>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
Length = 352
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 98/247 (39%), Positives = 132/247 (53%), Gaps = 24/247 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K K+ K Y +EE+DY F N+ +L T G+ +DL SE+R+
Sbjct: 48 YKIKYDKHYDPEEENDY-MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRK 106
Query: 112 QFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
L R RL D+ + ++P N +P DWR+H VT VK+QG CGSCW+FSA
Sbjct: 107 --LNGYRHRRLFGDSMRKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSA 164
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF +TG+LVSLSEQ LVDC + + GCNGGLM+ AFEYI G+
Sbjct: 165 TGALEGQHFRATGKLVSLSEQNLVDCS-------TKYGNHGCNGGLMDLAFEYIKDNHGI 217
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ E+ YPY G + C F K I A F + DED + + GP++ +I+
Sbjct: 218 DTEEGYPYVGKE-MRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPIS---IAIDA 273
Query: 286 PHISFSF 292
H SF
Sbjct: 274 GHRSFQL 280
>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 356
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 108/285 (37%), Positives = 153/285 (53%), Gaps = 29/285 (10%)
Query: 1 MERLILSSLLLLLLSSVLASAVA---VNDDDAMIRQVVPSDGEQSEDHLLNAEHH----- 52
M RL SL+L+L++ + A+A+A D IRQVV D + E+ +L
Sbjct: 1 MSRL---SLVLILVAGLFATALAGPATFADKNPIRQVVFPD--ELENGILQVVGQTRSAL 55
Query: 53 -FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ F + K Y + EE RF +F NL+ + + G+ +F+DLT EFR+
Sbjct: 56 SFARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFRK 115
Query: 112 QFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LG ++ + K + TN LP DWR G V+ VK QG CGSCW+FS TGAL
Sbjct: 116 HKLGASQNC---SATTKGNLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGAL 172
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
E A+ + G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+
Sbjct: 173 EAAYAQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKFNGGLDTEE 225
Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
YPYTG + G CKF ++ I V N ++ + E + A LV+
Sbjct: 226 AYPYTGKN-GICKFSQANIGVKVISSVNITLGAEYELKYAVALVR 269
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 90/251 (35%), Positives = 129/251 (51%), Gaps = 11/251 (4%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKF 101
E H L + +K K Y +E RF++FK+N+ + + + + G+ KF
Sbjct: 29 ELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKSYMLGINKF 88
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
+DLT EFR + G R L LP+ DWR GAVT +KDQG CGSC
Sbjct: 89 ADLTNEEFRAFWNGYKRPLGASRKITPFKYENVTALPSSIDWRSKGAVTPIKDQGVCGSC 148
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FSA A EG H L TG+LVSLSEQ+LVDCD + D GC GGLM AF++I
Sbjct: 149 WAFSAVAATEGIHKLRTGKLVSLSEQELVDCDVKGQ-------DKGCQGGLMVDAFKFIK 201
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
+ GG+ E +YPY G DG ++ A ++ + + + + V + P++
Sbjct: 202 RHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAVPKNSEAALLKAVANQPVS---V 258
Query: 282 SIELPHISFSF 292
+I+ +SF F
Sbjct: 259 AIDAGSLSFQF 269
>gi|345493482|ref|XP_001602523.2| PREDICTED: cathepsin L-like [Nasonia vitripennis]
Length = 514
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 93/255 (36%), Positives = 136/255 (53%), Gaps = 24/255 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-----AKRRQLLDPTAVHGVTKFSDLTPS 107
+ FK + K Y + E +R ++F N + AK L P + + K++D+
Sbjct: 28 WKTFKVQHKKGYNSDIEEKFRMKIFMENKHKIAKHNAKYEMGLVPYKLQ-INKYADMLHH 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAP-----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
EF G N+ + + P I P + +LP DWR GAVT +KDQG CGSC
Sbjct: 87 EFVNTLNGFNKTKPGMLQSYQKPVGAKFIAPAHVELPKSVDWRQEGAVTPIKDQGHCGSC 146
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
WSFSATGALEG HF TG+LVSLSEQ L+DC + ++GCNGGLM++AF+YI
Sbjct: 147 WSFSATGALEGQHFRQTGKLVSLSEQNLIDCSGKYG-------NNGCNGGLMDNAFKYIR 199
Query: 222 KAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
G++ E YPY D C+++ ++ A V + DE+++ A + GP++
Sbjct: 200 DNKGLDTESTYPYEAED-DECRYNARNSGAEDVGFVDIPEGDEEKLKAAIATIGPVS--- 255
Query: 281 ASIELPHISFSFLFT 295
+I+ H +F F T
Sbjct: 256 VAIDASHQTFQFYST 270
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 92/155 (59%), Gaps = 16/155 (10%)
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
D+ GAVT VK+QG CGSCW+FSATG+LEG HF G L+SLSEQ LVDC S
Sbjct: 300 DYWKQGAVTPVKNQGNCGSCWAFSATGSLEGQHFRHNGSLISLSEQNLVDC--------S 351
Query: 202 GS-CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVI 259
G + GC+GGLMN+AF Y+ G++ EK YPY D C+++ AA + + ++
Sbjct: 352 GRFGNDGCDGGLMNNAFTYVKVNRGLDSEKSYPYEAED-DRCRYNPKNSAADDAGYVNIP 410
Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLF 294
+ E ++ A + GP+ S+ + S SF+F
Sbjct: 411 TGSESKLQAAVATVGPI-----SVAIDADSDSFMF 440
>gi|441611591|ref|XP_003273955.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Nomascus leucogenys]
Length = 548
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 104/254 (40%), Positives = 144/254 (56%), Gaps = 22/254 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 257 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 316
Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR P + K + P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 317 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 374
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 375 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 425
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL----- 285
DY Y G SC F K +++ +S +E ++AA L K GP++ + + +
Sbjct: 426 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQVRPX 484
Query: 286 PHISFSFLFTVSSP 299
PH S + ++SP
Sbjct: 485 PHCS---AWIINSP 495
>gi|3641698|dbj|BAA33398.1| preprocathepsin L [Bos taurus]
Length = 301
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 126/249 (50%), Gaps = 17/249 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
N + H+ +K+ + Y EE ++R V++ N + HG + F D
Sbjct: 24 NLDAHWHQWKATHRRLYGMNEE-EWRRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ G + P+L D+P DW G VT VK+QG CGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKHKKGKLFHEPLLV--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF+YI
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNQGCNGGLMDNAFQYIKDN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG++ E+ YPY TD SC + AA + F I E + + GP++ +I
Sbjct: 194 GGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250
Query: 284 ELPHISFSF 292
+ H SF F
Sbjct: 251 DAGHTSFQF 259
>gi|109940313|sp|P25975.3|CATL1_BOVIN RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|74354943|gb|AAI02313.1| CTSL2 protein [Bos taurus]
gi|154425700|gb|AAI51426.1| Cathepsin L2 [Bos taurus]
gi|296484466|tpg|DAA26581.1| TPA: cathepsin L2 precursor [Bos taurus]
gi|440898893|gb|ELR50299.1| Cathepsin L1 [Bos grunniens mutus]
Length = 334
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 126/249 (50%), Gaps = 17/249 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
N + H+ +K+ + Y EE ++R V++ N + HG + F D
Sbjct: 24 NLDAHWHQWKATHRRLYGMNEE-EWRRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ G + P+L D+P DW G VT VK+QG CGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKHKKGKLFHEPLLV--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF+YI
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNQGCNGGLMDNAFQYIKDN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG++ E+ YPY TD SC + AA + F I E + + GP++ +I
Sbjct: 194 GGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250
Query: 284 ELPHISFSF 292
+ H SF F
Sbjct: 251 DAGHTSFQF 259
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 95/257 (36%), Positives = 140/257 (54%), Gaps = 24/257 (9%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVT 99
D L+++ H +K++ +TYA E+ +R ++ NL+ + L H G+
Sbjct: 22 DQTLDSQWH--QWKAQHRRTYAANED-GWRRATWEKNLKMIEMHNLEYSAGKHSFQLGMN 78
Query: 100 KFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
KF D+T EF++ G N + R + P+L LP DWR+ G VT VK+QG
Sbjct: 79 KFGDMTTEEFKQVMNGYNSNGSQKRTKGSLYREPLLA--QLPKSVDWREKGYVTPVKNQG 136
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FSATG+LEG F T +LVSLSEQ LVDC + ++GC+GGLM++A
Sbjct: 137 QCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCS-------TSEGNNGCSGGLMDNA 189
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGP 275
FEY+ GG++ E+ YPY G D CK+ A V+ F I S +E + + GP
Sbjct: 190 FEYVKNNGGIDTEQAYPYLGQD-NECKYRAECSGANVTGFVDIPSMNERALMKAVANVGP 248
Query: 276 LAGNVASIELPHISFSF 292
++ +I+ + SF F
Sbjct: 249 IS---VAIDAGNPSFQF 262
>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
Length = 347
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 98/247 (39%), Positives = 132/247 (53%), Gaps = 24/247 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K K+ K Y +EE+DY F N+ +L T G+ +DL SE+R+
Sbjct: 43 YKIKYDKHYDPEEENDY-MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRK 101
Query: 112 QFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
L R RL D+ + ++P N +P DWR+H VT VK+QG CGSCW+FSA
Sbjct: 102 --LNGYRHRRLFGDSMRKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSA 159
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF +TG+LVSLSEQ LVDC + + GCNGGLM+ AFEYI G+
Sbjct: 160 TGALEGQHFRATGKLVSLSEQNLVDC-------STKYGNHGCNGGLMDLAFEYIKDNHGI 212
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ E+ YPY G + C F K I A F + DED + + GP++ +I+
Sbjct: 213 DTEEGYPYVGKE-MRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPIS---IAIDA 268
Query: 286 PHISFSF 292
H SF
Sbjct: 269 GHRSFQL 275
>gi|118360450|ref|XP_001013459.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89295226|gb|EAR93214.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 320
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/200 (44%), Positives = 119/200 (59%), Gaps = 23/200 (11%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTP 106
N + +S FK+ ++K YA + YR VF NL+ ++D + G+TKF DLT
Sbjct: 38 NIKTLWSTFKNSYNKKYADPDFEQYRIEVFTENLK------IIDSNCQNFGITKFMDLTQ 91
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF--DWRDHGAVTGVKDQGACGSCWSF 164
EF++ +L L + + ++ P ND D DW GAVT VKDQG CGSCWSF
Sbjct: 92 EEFKQTYLTLKTKKYI----EEIPETVFNDSNGDIEIDWTMKGAVTPVKDQGKCGSCWSF 147
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
S TGA+EGAHFLS+ ELVSLSEQ L+DC S + + GCNGGLM++AF++I +
Sbjct: 148 STTGAVEGAHFLSSNELVSLSEQYLIDC--------SKNGNEGCNGGLMDTAFDFIAQ-N 198
Query: 225 GVEREKDYPYTGTDGGSCKF 244
G+ E YPY D G+CK
Sbjct: 199 GIPTENAYPYKALD-GTCKM 217
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 94/242 (38%), Positives = 131/242 (54%), Gaps = 19/242 (7%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLGLNRR- 119
K Y E + RF++FK N+ R + + G KFSDLT EFR G R
Sbjct: 51 KVYKDLNEKEVRFQIFKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSH 110
Query: 120 -LRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
+ + K TN D+P DWR GAVT +KDQ CG CW+FSA A+EG H L
Sbjct: 111 PKVMTSSKGKTHFRYTNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQL 170
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
TGEL+ LSEQ+LVDCD E + D GC+GGL+++AF++ILK G+ E +YPY G
Sbjct: 171 KTGELIPLSEQELVDCDVEGE-------DEGCSGGLLDTAFDFILKNKGLTTEVNYPYKG 223
Query: 237 TDGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
D G C KS ++AA ++ + + ++ ++ V + P+ S+ + SF F F
Sbjct: 224 ED-GVCNKKKSALSAAKITGYEDVPANSEKALLQAVANQPV-----SVAIDGSSFDFQFY 277
Query: 296 VS 297
S
Sbjct: 278 SS 279
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 92/245 (37%), Positives = 130/245 (53%), Gaps = 28/245 (11%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR---------RAKRRQLLDPTAVHGVTK 100
E F + ++ K YAT EE R VF N A P+ +
Sbjct: 38 EALFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNA 97
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--------NDLPTDFDWRDHGAVTGV 152
F+DLT EFR LG R+ A A ++P P +P DWR++GAVT V
Sbjct: 98 FADLTHEEFRAARLG---RIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKV 154
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
KDQG+CG+CWSFSATGA+EG + + TG LVSLSEQ+L+DCD S +SGC GGL
Sbjct: 155 KDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDR--------SYNSGCGGGL 206
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
M+ A+++++K GG++ E+DYPY DG K K + +S + S+++ + V
Sbjct: 207 MDYAYKFVVKNGGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVA 266
Query: 273 HGPLA 277
P++
Sbjct: 267 QQPVS 271
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 126/237 (53%), Gaps = 22/237 (9%)
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLP 123
+EH RF +FK N++ D G+ KF+DL+ EF+ + ++ LR
Sbjct: 61 DEHARRFEIFKENVKHIDSVNKKDGPYKLGLNKFADLSNEEFKAMHMTTKMEKHKSLRGD 120
Query: 124 ADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
+ + N LP DWR GAVT VK+QG CGSCW+FS ++EG +++ TG+L
Sbjct: 121 RGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCWAFSTIASVEGINYIKTGKL 180
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQQLVDC E ++GCNGGLM++AF+YI+ GG+ E +YPYT + G
Sbjct: 181 VSLSEQQLVDCSKE---------NAGCNGGLMDNAFQYIIDNGGIVTEDEYPYT-AEAGE 230
Query: 242 C---KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
C K + IA + F + ++ + V H P++ +IE F F T
Sbjct: 231 CSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVS---IAIEASGHDFQFYST 284
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 87/251 (34%), Positives = 134/251 (53%), Gaps = 22/251 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ + ++ + S+ +TY E + RF VF+ NLR +
Sbjct: 26 IVSYGERSEEEV---RRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAG 82
Query: 95 VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
+H G+ +F+DLT E+R +LG +R +L A Q +LP DWR
Sbjct: 83 LHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQADD---NEELPETVDWRKK 139
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAV +KDQG CGSCW+FSA A+EG + + TG+++ LSEQ+LVDCD S +
Sbjct: 140 GAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNE 191
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GCNGGLM+ AFE+I+ GG++ E+DYPY D K+ + + + + ++
Sbjct: 192 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKS 251
Query: 267 AANLVKHGPLA 277
V + P++
Sbjct: 252 LQKAVANQPIS 262
>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
Length = 313
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 95/248 (38%), Positives = 133/248 (53%), Gaps = 25/248 (10%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L E+ F+ F++++ K Y E +R +VF N+ A++ D G T F+D+T
Sbjct: 17 LRYENTFNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTVGATPFADMTN 76
Query: 107 SEFRRQFLG---LNRRLRLPADAQKAPIL-PTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+EF L L ++ PA PI+ P + DWR+ GAVT VK+Q +CGSCW
Sbjct: 77 TEFAVSKLCGCMLKPKMTKPA----TPIMEPAAEA---VDWREKGAVTPVKNQASCGSCW 129
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSATGA+EG +F++ GEL+SLSEQQLVDCDH+ SGC GGLM AFEY K
Sbjct: 130 AFSATGAMEGRNFVANGELISLSEQQLVDCDHQ---------SSGCGGGLMTYAFEY-AK 179
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVAS 282
G+ +E+DYPY D CK DK + + + V GP++ +
Sbjct: 180 KKGMCKEEDYPYHAVD-EDCKDDKCTPVVFPKGYEEVPRFDGAALKQAVSQGPVS---VA 235
Query: 283 IELPHISF 290
+E I F
Sbjct: 236 VEADSIVF 243
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 93/242 (38%), Positives = 130/242 (53%), Gaps = 17/242 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA--VHGVTKFSDLTPSEFRRQF 113
+K++ +K Y+ E R+++++ N + + G+ KF DL EF F
Sbjct: 25 WKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFAEMF 84
Query: 114 LGLNRRLRLPADAQKAPIL-PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
G + R +++ K + P DWR GAVTGVK+QG CGSCW+FS TG+LEG
Sbjct: 85 NGYMMQAR--SNSTKVFVADPNYKADPTVDWRTKGAVTGVKNQGQCGSCWAFSTTGSLEG 142
Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
HFL TG+LVSLSEQ LVDC + E GCNGGLM+ AFEYI K GG++ E Y
Sbjct: 143 QHFLKTGKLVSLSEQNLVDCSGKEGNE-------GCNGGLMDQAFEYIKKNGGIDTEASY 195
Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELPHISFS 291
PY D C+F S + A + + I DE+ + + K GP++ +I+ H SF
Sbjct: 196 PYQAHD-ERCRFKASDVGATCTGYVDIKREDENALMQAVEKIGPVS---VAIDASHSSFQ 251
Query: 292 FL 293
Sbjct: 252 LY 253
>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
Length = 490
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R +F N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 193 FKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 253 IYL--NPLLREEPSNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 310
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 311 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 361
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 362 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 407
>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
Length = 331
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 85/239 (35%), Positives = 129/239 (53%), Gaps = 14/239 (5%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTPSEFR 110
F+ F ++ K+YA+ EE + RF +F NL + + G+TKF+D++ EF+
Sbjct: 33 QFNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEFQ 92
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH-GAVTGVKDQGACGSCWSFSATGA 169
+ L N + P P+ FDWR+ G VT V DQG CGSCW+FSAT
Sbjct: 93 SRVLMSNPPPPPTEKPYRGPKFEGFTAPSTFDWRNKPGVVTPVYDQGQCGSCWAFSATEN 152
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+E L+ +L LS QQ+VDC D GC GG + A++Y++ A G++
Sbjct: 153 IESQWALAGHKLTGLSMQQIVDCSW---------WDDGCGGGFPSYAYDYVIDAPGLDAL 203
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD--EDQMAANLVKHGPLAGNVASIELP 286
+YPYT GGSC F +S++ A +S+++ ++D E QMA L +HGP++ V + P
Sbjct: 204 ANYPYTAV-GGSCAFKESQVVAKISSWTYTTTDSNEHQMANYLAQHGPISVCVDAESWP 261
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 90/251 (35%), Positives = 132/251 (52%), Gaps = 22/251 (8%)
Query: 51 HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEF 109
H + + K K Y E + RF++FK NLR + D + G+ KF+DLT E+
Sbjct: 46 HVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEY 105
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTN--------DLPTDFDWRDHGAVTGVKDQGACGSC 161
R FLG R R P + T+ +LP DWR+ GAVT +KDQG CGSC
Sbjct: 106 RAMFLGT--RTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQCGSC 163
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS GA+EG + + TG L SLSEQ+LVDCD + GCNGGLM+ AFE+I+
Sbjct: 164 WAFSTVGAVEGINQIVTGNLTSLSEQELVDCDR--------GYNMGCNGGLMDYAFEFIV 215
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
+ GG++ E+DYPY D K+ + + + +++++ V + P++
Sbjct: 216 QNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVS---V 272
Query: 282 SIELPHISFSF 292
+IE + F
Sbjct: 273 AIEAGGMEFQL 283
>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
Length = 308
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 91/248 (36%), Positives = 130/248 (52%), Gaps = 20/248 (8%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSDL 104
AE H +KS + Y T EE ++R +++ N+R + HG + F D+
Sbjct: 1 AEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDM 57
Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
T EFR+ G + + P++ +P DWR+ G VT VK+QG CGSCW+F
Sbjct: 58 TNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCWAF 115
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+YI + G
Sbjct: 116 SASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKENG 168
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
G++ E+ YPY D GSCK+ A + F I E + + GP++ +++
Sbjct: 169 GLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS---VAMD 224
Query: 285 LPHISFSF 292
H S F
Sbjct: 225 ASHPSLQF 232
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/247 (36%), Positives = 134/247 (54%), Gaps = 19/247 (7%)
Query: 54 SLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSEF 109
S ++ K Y + E YR ++F N R+ ++ ++ + G+ K+ D+ E
Sbjct: 64 SCHRTHHKKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEMKEVNYKLGMNKYGDMLHHEL 123
Query: 110 RRQFLGLNRRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
G N+ + + + I P N +LP DWR GAVT +KDQG CGSCW+FS+
Sbjct: 124 INTLNGFNKSVTVSEEQLIGATFIEPANVELPKSVDWRKKGAVTAIKDQGQCGSCWAFSS 183
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF +G LVSLSEQ L+DC + ++GCNGGLM+ AF YI + G+
Sbjct: 184 TGALEGQHFRQSGVLVSLSEQNLIDCSGKYG-------NNGCNGGLMDYAFRYIKENKGL 236
Query: 227 EREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ EK YPY + C+++ K+ A+ V + DED++ A + GP++ +I+
Sbjct: 237 DTEKSYPYE-AENDQCRYNPKNSGASDVGFVDIPEGDEDKLKAAVATIGPIS---VAIDA 292
Query: 286 PHISFSF 292
H SF F
Sbjct: 293 SHESFHF 299
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 99/287 (34%), Positives = 153/287 (53%), Gaps = 18/287 (6%)
Query: 11 LLLLSSVLA-SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+LLL +VLA SA+A + A + + ED + + L+ ++ K Y E
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAI--MELYELWLAQHKKAYNGLGE 60
Query: 70 HDYRFRVFKAN-LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLP-AD 125
RF VFK N L + +P+ G+ +F+DL+ EF+ +LG L+ + RL +
Sbjct: 61 KQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNSP 120
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
+ + DLP DWR+ GAVT VKDQG+CGSCW+FS A+EG + + TG L SLS
Sbjct: 121 SPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQ+LVDCD S + GCNGGLM+ AF++I+ GG++ E DYPY DG +
Sbjct: 181 EQELVDCDT--------SYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYR 232
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
K+ + ++ + ++++ + P++ +IE +F F
Sbjct: 233 KNAHVVTIDDYEDVPENDEKSLKKAAANQPIS---VAIEASGRAFQF 276
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 94/251 (37%), Positives = 132/251 (52%), Gaps = 22/251 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL----LDPTAVHGVTKFSDLT 105
E F FKS F + Y + E +R +F+ANL+ R + D T V F+DL+
Sbjct: 30 EAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLS 89
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
EFR F G R L A + + ND LP DW G VT +K+Q CGSCW
Sbjct: 90 NEEFRATFNGYRR---LAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCW 146
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA ++EG H L TG+LVSLSEQ LVDC + D GC+GG M+ AF+Y+++
Sbjct: 147 AFSAVASMEGQHALKTGKLVSLSEQNLVDC-------SAAEGDMGCSGGWMDYAFKYVIQ 199
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVA 281
G++ E YPY D SC+F ++ I A + +F V + DE + + GP++
Sbjct: 200 NRGIDTEASYPYKAID-ESCEFKRNSIGATIHSFVDVKTGDESALQNAVASIGPIS---V 255
Query: 282 SIELPHISFSF 292
+I+ SF F
Sbjct: 256 AIDASQPSFQF 266
>gi|417399134|gb|JAA46597.1| Putative cathepsin l1 [Desmodus rotundus]
Length = 335
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 93/254 (36%), Positives = 133/254 (52%), Gaps = 20/254 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
DH LNAE + +K+ + + Y EE +R V++ N + + HG T
Sbjct: 22 DHSLNAE--WYQWKATYRRLYGADEE-GWRRAVWEKNRKMIELHNREYSQRKHGFTMAMN 78
Query: 100 KFSDLTPSEFRRQFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGAC 158
F D+T EFR+ G L ++ + P+ ++P+ DWR G VT VK+QG C
Sbjct: 79 AFGDMTNEEFRQVMNGFLKQKQHRNGRLFREPLFA--EIPSSVDWRQKGYVTPVKNQGQC 136
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+FSA GALEG F TG+LVSLSEQ LVDC H + GCNGGLM++AF+
Sbjct: 137 GSCWAFSANGALEGQMFRKTGKLVSLSEQNLVDCSHS-------QGNQGCNGGLMDNAFQ 189
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
Y+ G++ E+ YPY G + +C + AA + F I E + + GP++
Sbjct: 190 YVKDNKGLDSEESYPYLGRESNTCNYRPEYSAANDTGFVDIPQHERGLMKAVATVGPIS- 248
Query: 279 NVASIELPHISFSF 292
+I+ H SF F
Sbjct: 249 --VAIDAGHSSFQF 260
>gi|410968392|ref|XP_003990691.1| PREDICTED: cathepsin S, partial [Felis catus]
Length = 310
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 91/253 (35%), Positives = 132/253 (52%), Gaps = 23/253 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+HH++L+K + K Y + E R +++ NL+ L +H G+ D+T
Sbjct: 37 DHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 96
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
E + L LR+P+ Q+ +N LP DWR+ G VT VK QG+CG+CW
Sbjct: 97 SEEV----ISLMGCLRVPSQWQRNVTYKSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 152
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA GALE L TG LVSLS Q LVD C E+ G + GCNGG M AF+YI+
Sbjct: 153 AFSAVGALEAQLKLKTGNLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTEAFQYIID 206
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
G++ E YPY D G C++D AA S ++ + E+ + + GP++
Sbjct: 207 NNGIDSEASYPYKAMD-GKCQYDSKNRAATCSKYTELPFGSEEDLKETVANKGPVS---V 262
Query: 282 SIELPHISFSFLF 294
+I+ H SF FL+
Sbjct: 263 AIDASHSSF-FLY 274
>gi|375073984|gb|AFA34859.1| cathepsin L-like protein [Trypanosoma rangeli]
Length = 467
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 97/237 (40%), Positives = 125/237 (52%), Gaps = 24/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
HF+ FK + K Y + E +R VFK NL A+ +P A GVT FSDLT EFR
Sbjct: 37 HFAAFKQRHGKVYRSAAEEAFRLGVFKENLLLARLHAAANPHASFGVTPFSDLTREEFRS 96
Query: 112 QF-------LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
++ +R R+P + + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNAAAHFAAAQKRARVPVEVEVE----VGGAPAAVDWRARGAVTAVKDQGECGSCWAF 152
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--K 222
S G +EG L+ L SLSEQ LV CD+ D+GC+GGLM++AF++I+
Sbjct: 153 STIGNIEGQWHLAGNPLTSLSEQMLVSCDNA---------DNGCDGGLMDNAFDWIVGKN 203
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E Y Y G S K D S + A +S + DED+MAA L +GPLA
Sbjct: 204 NGTVYTEASYSYVSGGGNSQKCDMSGHVVGAVISGHVDLPKDEDKMAAWLAANGPLA 260
>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
Length = 340
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 95/253 (37%), Positives = 135/253 (53%), Gaps = 24/253 (9%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTK 100
+DH+ F F S+FSK Y ++EE + R + +K+N+ Q + G
Sbjct: 37 QDHI-----DFVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTLGPNH 91
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
+D T E+++ LG R + + P L D+P DWR+ GAV VKDQG CGS
Sbjct: 92 LADYTHDEYKK-MLGYKPRNKTGKEVYSTPNLK--DIPESIDWREKGAVNAVKDQGQCGS 148
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS +LE +F+ TG+L SLSEQQLVDC S + + GCNGG M A +YI
Sbjct: 149 CWAFSTIASLESRYFIETGKLQSLSEQQLVDC--------SKNGNEGCNGGDMGLAMDYI 200
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
AGGVE EKDYPY G D +C F+ SK +A + +++ + A + GP++
Sbjct: 201 ASAGGVETEKDYPYVGKD-QTCAFEASKEVATDKGHINIVPGKFATLQA-AIAEGPVS-- 256
Query: 280 VASIELPHISFSF 292
+IE + F F
Sbjct: 257 -VAIEADSLFFQF 268
>gi|359484377|ref|XP_003633102.1| PREDICTED: thiol protease aleurain-like isoform 2 [Vitis vinifera]
Length = 318
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 112/300 (37%), Positives = 160/300 (53%), Gaps = 42/300 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAV-----AVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH- 52
M RL + + +L+LL +V + + D++ IR V S D E S L+ H
Sbjct: 1 MARLSVVAAVLILLCAVASGEADHHFRSSFDEENPIRLVSDSIRDLESSVLRLIGDTRHA 60
Query: 53 --FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSE 108
F+ F ++ K+Y T +E RF +F NL+ R+ R+ L T V +F+D T E
Sbjct: 61 HSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTLA--VNQFADWTWEE 118
Query: 109 FRRQFLGL--NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
FRR LG N L + + ++ LP DWR+ G V+ +KDQG CGSCW+FS
Sbjct: 119 FRRHRLGAAQNCSATLKGNHKLTDVI----LPETKDWREDGIVSPIKDQGHCGSCWTFST 174
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGG 225
TGALE A+ + G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG
Sbjct: 175 TGALEAAYAQAFGKGISLSEQQLVDC--------AGAFNNFGCHGGLPSQAFEYIKYNGG 226
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD------------EDQMAANLVKH 273
++ E+ YPYTG D G+CKF I V + I+ D ED +A L+K+
Sbjct: 227 LDTEEAYPYTGLD-GTCKFSSENIGVQVLDSVNITLDVNHAVLAVGYGVEDGVAYWLIKN 285
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 82/217 (37%), Positives = 123/217 (56%), Gaps = 16/217 (7%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
+ D RF +FK NLR + A + G+TKF+DLT E+R +LG RR+
Sbjct: 69 DQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAK 128
Query: 123 PADAQK--APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ + + + ++P DWR GAV +KDQG CGSCW+FS A+EG + + TGE
Sbjct: 129 AKNVNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGE 188
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
L+SLSEQ+LVDCD+ S + GCNGGLM+ AF++I+K GG++ EKDYPY G G
Sbjct: 189 LISLSEQELVDCDN--------SYNQGCNGGLMDYAFQFIMKNGGLKTEKDYPYRGFGGK 240
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
F K+ ++ + + + ++ + P++
Sbjct: 241 CNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVS 277
>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
Length = 338
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 98/237 (41%), Positives = 135/237 (56%), Gaps = 12/237 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 30 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 89
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
FSDLT EFR +L R + P + K + P ++DWR GAVT VKDQG CGS
Sbjct: 90 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 148
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I
Sbjct: 149 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 199
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+E DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 200 KNLGGLETVDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 255
>gi|2677828|gb|AAB97142.1| cysteine protease [Prunus armeniaca]
Length = 358
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 106/274 (38%), Positives = 151/274 (55%), Gaps = 27/274 (9%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEH---HFSLF 56
L+LS+ L+L+ S A+A + D+ IR V SDG EQ +L HF+ F
Sbjct: 6 LVLSAALVLVAISCGAAASSF-DESNPIRLV--SDGLRELEQQVVQVLGNSRRALHFARF 62
Query: 57 KSKFSKTYATQEEHDYRFRVFKAN--LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
++ K Y + EE R+ +F N L R+ ++ L T V +F+D + EFRRQ L
Sbjct: 63 AHRYGKKYESVEEMKLRYEIFSENKKLIRSTNKKGLPYTL--AVNRFADWSWEEFRRQRL 120
Query: 115 GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
G + A + + L LP +WR+ G VT VKDQG CGSCW+FS TGALE A+
Sbjct: 121 GAAQNC--SATTKGSHELTDAVLPESKNWREEGIVTPVKDQGHCGSCWTFSTTGALEAAY 178
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E YP
Sbjct: 179 VQAFRKQISLSEQQLVDC--------AGAFNNFGCHGGLPSQAFEYIKYNGGLDTEAAYP 230
Query: 234 YTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQM 266
Y GTD G+CKF + V + ++ DE ++
Sbjct: 231 YVGTD-GACKFSAENVGVQVLDSVNITLGDEQEL 263
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 107/296 (36%), Positives = 152/296 (51%), Gaps = 34/296 (11%)
Query: 6 LSSLLLLLLSSVLA-SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTY 64
++ L LLS VL +VA + Q +P D E L + E +SL++ K+ +
Sbjct: 1 MAKLSYALLSVVLVLGSVA-------LAQSIPFD----EKDLASEESLWSLYE-KWRAHH 48
Query: 65 ATQ---EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---- 116
A ++ D RF VFK N++ Q D T + KF D+T EFR + G
Sbjct: 49 AVSRDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDMTNQEFRSTYAGSKIDH 108
Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
+ LR DA + +DLPT DWR+ GAVTGVKDQG CGSCW+FS A+EG + +
Sbjct: 109 HMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQI 168
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
T ELVSLSEQQLVDCD + +SGCNGGLM+ AF++I GG+ E YPY
Sbjct: 169 KTNELVSLSEQQLVDCDTK---------NSGCNGGLMDYAFDFIKNNGGLSSEDSYPYL- 218
Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+ SC + + + + + + + V + P++ +IE +F F
Sbjct: 219 AEQKSCGSEANSAVVTIDGYQDVPRNNEAALMKAVANQPVS---VAIEASGYAFQF 271
>gi|16076437|emb|CAC94443.1| cysteine proteinase [Betula pendula]
Length = 133
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 72/91 (79%), Positives = 82/91 (90%)
Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
DHECDPEE GSCDSGC+GGLMNSAFEY LKAGG+ RE+DYPYTGTD +CKFDKSKIAA+
Sbjct: 1 DHECDPEEQGSCDSGCSGGLMNSAFEYTLKAGGLMREEDYPYTGTDRSTCKFDKSKIAAS 60
Query: 253 VSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
VSNFSVIS DEDQ+AANLVK+GPLA + ++
Sbjct: 61 VSNFSVISLDEDQIAANLVKNGPLAVAINAV 91
>gi|66823853|ref|XP_645281.1| hypothetical protein DDB_G0272298 [Dictyostelium discoideum AX4]
gi|60473355|gb|EAL71301.1| hypothetical protein DDB_G0272298 [Dictyostelium discoideum AX4]
Length = 305
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 96/256 (37%), Positives = 135/256 (52%), Gaps = 33/256 (12%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
K++K Y +E+ RF +F+ N R + ++SDLT EF +F
Sbjct: 3 KYNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFADKFF--- 59
Query: 118 RRLRLPADAQKAPILPTNDL-------------PTDFDWRDHGAVTGVKDQGACGSCWSF 164
+L + + PI ND+ P FDWRDHGAV VK+QG+C SCWSF
Sbjct: 60 --EKLVPEPRSGPI---NDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSF 114
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SA GALEG +++ GEL+ LSEQ LVDC P+ GC G M+ AF+YI+ +G
Sbjct: 115 SALGALEGHYYIKYGELLDLSEQNLVDCATPFGPK-------GCKTGWMHDAFKYIISSG 167
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNV--A 281
GV E YPYTG D CKF++S+ A VS F +I DE + + +GP+A + +
Sbjct: 168 GVNLESQYPYTGKD-EVCKFNQSEKEAKVSGFVMIPKFDESALMEAIALYGPVAVPIDTS 226
Query: 282 SIELPHISFSFLFTVS 297
+ E H+S ++ S
Sbjct: 227 TKEFQHLSGGIYYSDS 242
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 91/247 (36%), Positives = 132/247 (53%), Gaps = 23/247 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSEFRR 111
FK+ + Y EE R VF+ NL++ + L + G+ +F+D+ EF
Sbjct: 47 FKTVHERNYGETEEMQ-RKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFAS 105
Query: 112 QFLG--LNRRLRLPADAQK---APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
G +N R ++ +P +P + LP + DWR G VT +KDQG CGSCWSFS
Sbjct: 106 VVNGFRMNNRTKVRDHLHSHYISPAIPVS-LPAEVDWRKEGYVTPIKDQGHCGSCWSFST 164
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF TG+LVSLSEQ L+DC + ++GCNGG+M+ AF+YI G
Sbjct: 165 TGALEGQHFRKTGKLVSLSEQNLIDC-------STSYGNNGCNGGVMDYAFQYIKDNDGD 217
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIEL 285
+ E YPY D G C+F K + A + ++ + DE++M + GP++ +I+
Sbjct: 218 DTEDSYPYEAAD-GPCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVS---VAIDA 273
Query: 286 PHISFSF 292
H SF
Sbjct: 274 SHTSFQM 280
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 89/246 (36%), Positives = 131/246 (53%), Gaps = 14/246 (5%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E + +K +K Y T E R +++ NL++ ++ + + DLT EF
Sbjct: 25 EQQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEGHSFTLAMNHLGDLTQDEF 84
Query: 110 RRQFLGLNRRLRLPADAQKAPIL-PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
R + G+ Q + L P++ +P DWR G VT VK+QG CGSCW+FS T
Sbjct: 85 RYFYTGMRSHYSNYTKKQGSAFLAPSHVQVPDTVDWRKEGYVTPVKNQGQCGSCWAFSTT 144
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G+LEG +F TG+LVSLSEQ LVDC + ++GC GGLM+ AF+YI + GG++
Sbjct: 145 GSLEGQNFKKTGKLVSLSEQNLVDC-------STAYGNNGCQGGLMDYAFKYIKENGGID 197
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELP 286
E+ YPY + C+F KS I A + F V DE+ + GP++ +I+
Sbjct: 198 TEESYPYEARN-DRCRFQKSNIGAVDTGFVDVTHGDEEALKTAAGTVGPIS---VAIDAG 253
Query: 287 HISFSF 292
H+SF F
Sbjct: 254 HMSFQF 259
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 93/256 (36%), Positives = 135/256 (52%), Gaps = 21/256 (8%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
+LL E H LFK+ K Y +Q E +R +++ N + + +L + + + K
Sbjct: 21 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILFEKGEKSYQVAMNK 78
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
F DL EFR G + + + A+ P N ++P DWR+ GA+T VKDQG
Sbjct: 79 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 138
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CG CW+FS+TGALEG F TG+LVSL EQ L+DC + E GCNGGLM+ AF
Sbjct: 139 CGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGNE-------GCNGGLMDQAF 191
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
+YI G++ E YPY D C+++ A F + S +ED++ A + GP+
Sbjct: 192 QYIKDNKGIDTENTYPYEAED-DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 250
Query: 277 AGNVASIELPHISFSF 292
+ +I+ H SF F
Sbjct: 251 S---VAIDASHESFQF 263
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 80/224 (35%), Positives = 121/224 (54%), Gaps = 13/224 (5%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--- 115
K K Y E D RF++FK NL + T + G+ KF+D+T E+R +LG
Sbjct: 45 KHQKVYNGLREKDQRFQIFKDNLNFIDEHNAQNYTYIVGLNKFADMTNEEYRDMYLGTRS 104
Query: 116 -LNRR-LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+ RR ++ + + LP DWR GA+T +KDQG+CGSCW+FS +E
Sbjct: 105 DIKRRIMKNKITGHRYAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAI 164
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
+ + TG+LVSLSEQ+LVDCD + + GCNGGLM+ AFE+I+ GG++ ++ YP
Sbjct: 165 NKIVTGKLVSLSEQELVDCDR--------AFNEGCNGGLMDYAFEFIIGNGGIDTDQHYP 216
Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y G +G K ++ + + S+ + V H P++
Sbjct: 217 YKGFEGRCDPTRKKAKIVSIDGYEDVPSNNENALKKAVAHQPVS 260
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 131/253 (51%), Gaps = 23/253 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+HH++L+K + + Y + E R +++ NL+ L +H G+ +D+T
Sbjct: 33 DHHWNLWKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMGMHSYDLGMNHLADMT 92
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
E L LR+P+ Q +N LP DWR+ G VT VK QGACG+CW
Sbjct: 93 SEEVSS----LMSSLRVPSQWQANVTYKSNSNQKLPDSVDWREKGCVTEVKYQGACGACW 148
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA GALE L TG LVSLS Q LVD C E G + GCNGG M AF+YI+
Sbjct: 149 AFSAVGALEAQLKLKTGNLVSLSAQNLVD----CSTERYG--NKGCNGGFMTKAFQYIID 202
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
G++ E YPY D G+C++D AA S ++ + ED + + GP++
Sbjct: 203 NNGIDSEVSYPYKAMD-GNCRYDSKHRAATCSKYTELPFGSEDALKEAVANKGPVS---V 258
Query: 282 SIELPHISFSFLF 294
+I+ H SF FL+
Sbjct: 259 AIDAKHSSF-FLY 270
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 124/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D+GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCD---------DMDNGCSGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262
>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
Length = 381
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 84 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 143
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 144 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 201
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 202 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 252
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G +C F K +++ +S +E ++AA L K GP++
Sbjct: 253 DYSYRG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPIS 298
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 90/247 (36%), Positives = 135/247 (54%), Gaps = 34/247 (13%)
Query: 44 DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLR-----RAKRRQLLDPTAV 95
+ + E F LF++ + + Y QEE RF++F++NLR AKR+ PT
Sbjct: 33 EQFASEEEVFQLFQAWQKEHKREYGNQEEKAKRFQIFQSNLRYINEMNAKRK---SPTTQ 89
Query: 96 H--GVTKFSDLTPSEFRRQFLGLNRRLRLP-------ADAQKAPILPTNDLPTDFDWRDH 146
H G+ KF+D++P EF + +L + + +P QK ++LP DWRD
Sbjct: 90 HRLGLNKFADMSPEEFMKTYL---KEIEMPYSNLESRKKLQKGDDADCDNLPHSVDWRDK 146
Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
GAVT V+DQG C S W+FS TGA+EG + + TG LVSLS QQ+VDCD
Sbjct: 147 GAVTEVRDQGKCQSHWAFSVTGAIEGINKIVTGNLVSLSVQQVVDCD---------PASH 197
Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
GC GG +AF Y+++ GG++ E YPYT + G+CK + +K+ ++ N V+ E+ +
Sbjct: 198 GCAGGFYFNAFGYVIENGGIDTEAHYPYTAQN-GTCKANANKV-VSIDNLLVVVGPEEAL 255
Query: 267 AANLVKH 273
+ K
Sbjct: 256 LCRVSKQ 262
>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 137/245 (55%), Gaps = 23/245 (9%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN---FSVISSDEDQMAANLVKHGPLAG 278
K GGV+ E DYPY D +C+ + +K V + + ++ ++ + LV P+A
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 279 NVASI 283
+ A I
Sbjct: 247 DAADI 251
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 90/259 (34%), Positives = 137/259 (52%), Gaps = 23/259 (8%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
++ L+ + H + +K + YA +E R+ VFK+N+ R + + T V +
Sbjct: 29 DNELIMQKRHIE-WMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQ 87
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPI------LPTNDLPTDFDWRDHGAVTGVKD 154
F+DLT EFR + G L + +Q + + LP DWR GAVT +K+
Sbjct: 88 FADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSSGALPISVDWRTKGAVTPIKN 147
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG+CG CW+FSA A+EGA + G+L+SLSEQQLVDCD + D GC GGLM+
Sbjct: 148 QGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD---------TNDFGCEGGLMD 198
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
+AFE+I+ GG+ E +YPY G D +C K+ A +++ + + +++Q V H
Sbjct: 199 TAFEHIMATGGLTTESNYPYKGED-ATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAH 257
Query: 274 GPLAGNVASIELPHISFSF 292
P++ IE F F
Sbjct: 258 QPVS---VGIEGGGFDFQF 273
>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
Length = 333
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 91/253 (35%), Positives = 130/253 (51%), Gaps = 19/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D LNA+ + +K+ + Y EE +R V++ N++ + HG T
Sbjct: 22 DQSLNAQWY--QWKATHRRLYGMNEE-GWRRAVWEKNMKMIELHNREYSQGKHGFTMAMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P+ ++P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVMNGFQNQKHKKGKMFQEPLFA--EIPKSVDWREKGYVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF Y
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNEGCNGGLMDNAFRY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
+ GG++ E+ YPY G D +C + AA + F + E + + GP++
Sbjct: 190 VKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQREKALMKAVATLGPIS-- 247
Query: 280 VASIELPHISFSF 292
+I+ H SF F
Sbjct: 248 -VAIDAGHQSFQF 259
>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 122/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT +E R F+ NL + Q +P A G+TKF DL+ EF +
Sbjct: 38 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 98 YLSGATHFAKAKKFASQYYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +L+T L+SLSEQ+LV CD D GCNGGLM AF+++L + G V
Sbjct: 158 NIESKWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G + +S I A + I S+ED MAA L +GP+A
Sbjct: 209 YTGASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIA 262
>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
Length = 325
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 85/235 (36%), Positives = 129/235 (54%), Gaps = 14/235 (5%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F + ++K Y +E YR+++FK NL + ++ AV + KFSD++
Sbjct: 20 LLKAPDYFESFVANYNKMYNDTQEKAYRYKIFKHNLEEINIKNQVEDHAVFSINKFSDMS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
SE ++ GL+ + + +A IL P N P +FDWR + AVT V+ QG CGSCW+
Sbjct: 80 KSEIISKYTGLSLPSLMQENFCRAIILDGPPNKAPINFDWRQYNAVTPVRVQGNCGSCWA 139
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS +E + + + +SLS QQLVDCD + + GC GGL+++A E I+ A
Sbjct: 140 FSTLAGIESQYSIKYNKQISLSVQQLVDCD---------TSNMGCAGGLLHTALEQIINA 190
Query: 224 -GGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPL 276
GGV +E+DYPY G D C + A V + I +E+++ L GP+
Sbjct: 191 GGGVLQEEDYPYKGVD-KQCNLPHNNFAVQVLGCYRYIVMNEEKLKDVLRAVGPI 244
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 84/223 (37%), Positives = 127/223 (56%), Gaps = 14/223 (6%)
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG-- 115
K K Y + RF +FK NLR + + ++ + G+ KF+DL+ E++ FLG
Sbjct: 13 KHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLGGR 72
Query: 116 -LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
+ R +D K + ++LP DWR+ GAV VKDQG CGSCW+FS A+EG +
Sbjct: 73 MVRDRKGFESDRFKYGV--GDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGIN 130
Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
++TG+L+SLSEQ+LVDCD + GCNGG M+ AFE+I+K GG++ E DYPY
Sbjct: 131 QIATGDLISLSEQELVDCDK--------GFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPY 182
Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G DG + K+ ++ F + ++++ V H P++
Sbjct: 183 KGVDGQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVS 225
>gi|47199802|emb|CAF88807.1| unnamed protein product [Tetraodon nigroviridis]
Length = 261
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 93/233 (39%), Positives = 122/233 (52%), Gaps = 21/233 (9%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
E +R V++ NL++ + L H G+ F D+T EFR+ G + P
Sbjct: 1 EEGWRRMVWEKNLKKIELHNLEHSMGQHSYRLGMNHFGDMTHEEFRQIMNGYKHK---PQ 57
Query: 125 DAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
+ + + P DWRD G VT VKDQG CGSCW+FS TGALEG HF TG+L
Sbjct: 58 RKFRGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRQTGKL 117
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQ LVDC PE + GCNGGLM+ AF+YI GG++ E YPY TD
Sbjct: 118 VSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNGGLDSEASYPYLATDDQP 170
Query: 242 CKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFL 293
C +D S +A + F V S E + + GP++ +I+ H SF F
Sbjct: 171 CHYDPSNNSANETGFVDVPSGSERALMKAVASVGPVS---VAIDAGHESFQFY 220
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 91/240 (37%), Positives = 131/240 (54%), Gaps = 16/240 (6%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
+ ++ + Y +E + R+ +FK N+ R + D GV KF+DLT EFR
Sbjct: 8 WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMHH 67
Query: 115 GLNRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
G R+ +L + + + L +PT DWR GAVT VKDQG CG CW+FSA A+EG
Sbjct: 68 GYKRQSSKLMSSSFRHENLSA--IPTSMDWRKAGAVTPVKDQGTCGCCWAFSAVAAIEGI 125
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
L TG+L+SLSEQQLVDCD + D GC GGLM++AF++IL+ GG+ E YP
Sbjct: 126 IKLKTGKLISLSEQQLVDCDVK-------GVDQGCGGGLMDNAFQFILRNGGLTSEATYP 178
Query: 234 YTGTDGGSCKFDKS-KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
Y G D G+CK K+ I A ++ + + + + V P++ ++E F F
Sbjct: 179 YQGVD-GTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVS---VAVEGGGYDFQF 234
>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 137/245 (55%), Gaps = 23/245 (9%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN---FSVISSDEDQMAANLVKHGPLAG 278
K GGV+ E DYPY D +C+ + +K V + + ++ ++ + LV P+A
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 279 NVASI 283
+ A I
Sbjct: 247 DAADI 251
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 83/229 (36%), Positives = 129/229 (56%), Gaps = 19/229 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + K K+Y T +E R+ +F+ N+ + + G+ +DLT E++R
Sbjct: 32 FQNWMVKHQKSY-TNDEFGSRYTIFQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRI 90
Query: 113 FLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+LG ++ P I+ D+ P DWR +GAVT VK+QG CG C+SFS TG+
Sbjct: 91 YLGTKTTVKKPN-----LIIGVTDVSKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGS 145
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVER 228
+EG H +++ +LVSLSEQQ++DC SGS ++GC+GGLM ++FEYI+ GG++
Sbjct: 146 VEGIHEITSKQLVSLSEQQILDC--------SGSEGNNGCDGGLMTNSFEYIIAVGGLDT 197
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY G G CKF+K+ I A ++ + + S + V P++
Sbjct: 198 EASYPYEGVV-GKCKFNKANIGATITGYKNVKSGSESDLQTAVAAQPVS 245
>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 137/245 (55%), Gaps = 23/245 (9%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN---FSVISSDEDQMAANLVKHGPLAG 278
K GGV+ E DYPY D +C+ + +K V + + ++ ++ + LV P+A
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246
Query: 279 NVASI 283
+ A I
Sbjct: 247 DAADI 251
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 88/244 (36%), Positives = 130/244 (53%), Gaps = 14/244 (5%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
++ +KS K Y + E R +++ NL++ + + D+T E +
Sbjct: 28 NWKAWKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKHSFKLAMNHLGDMTSLEISQ 87
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPT--DFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
LGL + + + A LP ++ DWR G VT VK+QG CGSCW+FS TGA
Sbjct: 88 TLLGLKLKKHAESQPKGATFLPPANVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGA 147
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LEG HF TG+LVSLSEQ LVDC + ++GC GGLM++AF+YI + GG++ E
Sbjct: 148 LEGQHFRKTGKLVSLSEQNLVDCSGKYG-------NNGCEGGLMDNAFQYIKENGGIDTE 200
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
K YPY D G C ++KS I A + F + + DE+ + L GP++ +I+
Sbjct: 201 KSYPYLAKD-GVCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPIS---IAIDASQS 256
Query: 289 SFSF 292
+F F
Sbjct: 257 TFHF 260
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 93/251 (37%), Positives = 132/251 (52%), Gaps = 22/251 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL----LDPTAVHGVTKFSDLT 105
E F FKS F + Y + E +R +F+ANL+ R + D T V F+DL+
Sbjct: 30 EAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLS 89
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
EFR F G R L A + + ND LP DW G VT +K+Q CGSCW
Sbjct: 90 NEEFRATFNGYRR---LAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCW 146
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA ++EG H L TG+LVSLSEQ LVDC + D GC+GG M+ AF+Y+++
Sbjct: 147 AFSAVASMEGQHALKTGKLVSLSEQNLVDC-------SAAEGDMGCSGGWMDYAFKYVIQ 199
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVA 281
G++ E YPY D SC+F ++ + A + +F V + DE + + GP++
Sbjct: 200 NRGIDTEASYPYKAID-ESCEFKRNSVGATIHSFVDVKTGDESALQNAVASIGPIS---V 255
Query: 282 SIELPHISFSF 292
+I+ SF F
Sbjct: 256 AIDAAQPSFQF 266
>gi|47224192|emb|CAG13112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 327
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 92/231 (39%), Positives = 128/231 (55%), Gaps = 15/231 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E HF + + +K Y+ QE H R ++F N RR ++ + + G+ +FSD+T +EF
Sbjct: 26 EQHFKSWMALHNKAYSVQEFHQ-RLQIFTENKRRIEKHNGGNHSFTMGLNQFSDMTFAEF 84
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHG-AVTGVKDQGACGSCWSFSAT 167
R++FL + A K + TN P DWR G VT VK+QGACGSCW+FS T
Sbjct: 85 RKRFLWSEPQ---NCSATKGSYMKTNSPQPESIDWRTKGNYVTPVKNQGACGSCWTFSTT 141
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
G LE ++TG+LV LSEQQLVDC + + + GCNGGL + AFEYI G+
Sbjct: 142 GCLESVTAINTGKLVPLSEQQLVDCAWDFN-------NHGCNGGLPSQAFEYIKYNKGLM 194
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLA 277
E YPYT + G CK+ AA V N ++ + DE M + H P++
Sbjct: 195 TESGYPYTAFE-GKCKYKPELAAAFVKNVVNITAYDEKGMEDAVATHNPVS 244
>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
Length = 356
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLD-PTAVHGVTKF 101
+L A +F F ++K Y + E + R+ +FK NL AK D PTA + + KF
Sbjct: 48 NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACG 159
SDL+ SE +F GL+ R+ ++ K IL P + P FDWR+ VT +K+QGACG
Sbjct: 108 SDLSKSELIAKFTGLSIPERV-SNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACG 166
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
+CW+F+ ++E + L+ LSEQQL+DCD S D GCNGGL+++AFE
Sbjct: 167 ACWAFATLASVESQFAMRHNRLIDLSEQQLIDCD---------SVDMGCNGGLLHTAFEE 217
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPL 276
I++ GGV+ E DYP+ G + C D+ + + + V + + +E+++ L GP+
Sbjct: 218 IMRMGGVQTELDYPFVGRN-RRCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPI 275
>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 332
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCSGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262
>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
Length = 334
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 91/245 (37%), Positives = 126/245 (51%), Gaps = 17/245 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPS 107
+ +K+ + Y EE +R V++ N++ HG T F D+T
Sbjct: 28 QWYQWKATHRRLYGMNEE-GWRRAVWEKNMKMIDLHNREYSQGQHGFTMAMNAFGDMTNE 86
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
EFR+ G + + P+ ++P DW G VT VK+QG CGSCW+FSAT
Sbjct: 87 EFRQVMNGFRNQKPRKGKVFQEPLFA--EIPKSVDWTLKGYVTPVKNQGQCGSCWAFSAT 144
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GALEG F TG+LVSLSEQ LVDC E GCNGGLM++AF+Y+ + GG++
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSRSQGNE-------GCNGGLMDNAFQYVKENGGLD 197
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPH 287
E+ YPY GTD SCK+ AA + F I E + + GP++ +I+ H
Sbjct: 198 SEESYPYLGTDTDSCKYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAIDAGH 254
Query: 288 ISFSF 292
SF F
Sbjct: 255 QSFQF 259
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 84/228 (36%), Positives = 129/228 (56%), Gaps = 16/228 (7%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEF---R 110
+ S++ K Y +E + RF++FK N+ + D T + G+ +F+DLT EF R
Sbjct: 42 WMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFADLTNEEFIASR 101
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+F G + + K + +P+ DWR GAVT VK+QG CG CW+FSA A
Sbjct: 102 NKFKGHMCSSIMRTTSFKYE--NVSGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAAT 159
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG H LSTG+L+SLSEQ+LVDCD + D GC GGLM+ AF++I++ G+ E
Sbjct: 160 EGIHKLSTGKLISLSEQELVDCD-------TKGVDQGCEGGLMDDAFKFIIQNHGLSTEA 212
Query: 231 DYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY G D G+C +K+ + A ++ + + ++ +Q V + P++
Sbjct: 213 QYPYEGVD-GTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPIS 259
>gi|351724281|ref|NP_001237820.1| cysteine protease-like precursor [Glycine max]
gi|149393486|gb|ABR26679.1| putative cysteine protease [Glycine max]
Length = 355
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 124/225 (55%), Gaps = 17/225 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
F+ F S+F K+Y ++EE R+ +F NLR R+ + L T V F+D T EF+
Sbjct: 55 FARFMSRFGKSYRSEEEMRERYEIFSQNLRFIRSHNKNRLPYTL--SVNHFADWTWEEFK 112
Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
R LG + + L LP DWR G V+ VKDQG+CGSCW+FS TGAL
Sbjct: 113 RHRLGAAQNCSATLNGNHK--LTDAVLPPTKDWRKEGIVSDVKDQGSCGSCWTFSTTGAL 170
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
E A + G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG+E E+
Sbjct: 171 EAACAQAFGKSISLSEQQLVDCAGRFN-------NFGCNGGLPSQAFEYIKYNGGLETEE 223
Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
YPYTG D G CKF +A V N ++ + +E + A V+
Sbjct: 224 AYPYTGKD-GVCKFSAENVAVQVIDSVNITLGAENELKHAVAFVR 267
>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 130/236 (55%), Gaps = 18/236 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F KF+K Y+++ E RF++F+ NL + D TA + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGL----ALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQL+DCD+ D+GCNGGL+++A+E +
Sbjct: 137 CWAFATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
++ GGV+ E DYPY G+DG + + I+ E+++ L GP+
Sbjct: 188 MQMGGVQAENDYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPI 243
>gi|195428245|ref|XP_002062184.1| GK16790 [Drosophila willistoni]
gi|194158269|gb|EDW73170.1| GK16790 [Drosophila willistoni]
Length = 549
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 97/258 (37%), Positives = 137/258 (53%), Gaps = 26/258 (10%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFS 102
+ H+ NA HHF K K Y + +EH++R +F+ NLR + T V +
Sbjct: 238 DSHVDNAFHHF---KRKHGVAYRSDKEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLA 294
Query: 103 DLTPSEF--RRQFLG---LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
D T E RR + N P D K T+D+P+ +DWR +GAVT VKDQ
Sbjct: 295 DKTEEELKARRGYKSSGVYNTGKPFPYDVNKY----TDDIPSQYDWRLYGAVTPVKDQSV 350
Query: 158 CGSCWSFSATGALEGAHFLST-GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCWSF G LEGA FL G LV LS+Q L+DC G ++GC+GG
Sbjct: 351 CGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC-------SWGFGNNGCDGGEDFRV 403
Query: 217 FEYILKAGGVEREKDY-PYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
++++LK+GGV E++Y PY G D G C + + A ++ F +V S+D + L+KHG
Sbjct: 404 YQWMLKSGGVPTEEEYGPYLGQD-GYCHVNNVTLVAPITGFVNVTSNDPNAFKIALLKHG 462
Query: 275 PLAGNVASIELPHISFSF 292
PL+ +I+ +FSF
Sbjct: 463 PLS---VAIDASPKTFSF 477
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 96/250 (38%), Positives = 134/250 (53%), Gaps = 20/250 (8%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H++L+KS SK Y +EE +R V++ NL++ + L H G+ F D+T
Sbjct: 26 DQHWNLWKSWHSKNYHQREE-GWRRLVWEKNLKKIELHNLEHSMGKHSYRLGMNHFGDMT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
EF++ G + + + L N L P DWR+ G VT VKDQG CGSCW+
Sbjct: 85 HEEFKQIMNGYKHKAE--RKFKGSLFLEPNFLEAPRSVDWREKGYVTPVKDQGECGSCWA 142
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGALEG F TG+LVSLS Q LV+C PE + GCNGGLM+ AF+Y+
Sbjct: 143 FSTTGALEGQEFTRTGKLVSLSGQNLVECSR---PE----GNEGCNGGLMDQAFQYVKDN 195
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
G++ E YPY GTD C +D AA + F + S +E + + GP++ +
Sbjct: 196 QGLDSEDSYPYLGTDDQPCHYDPKFSAANDTGFVDIPSGNERALMKAVASVGPVS---VA 252
Query: 283 IELPHISFSF 292
I+ H SF F
Sbjct: 253 IDAGHESFQF 262
>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 366
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCSGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 209 YTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262
>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
Length = 323
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 87/236 (36%), Positives = 133/236 (56%), Gaps = 21/236 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
K GGV+ E DYPY D +C+ + +K V + + I E+++ L GP+
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPI 242
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/247 (35%), Positives = 132/247 (53%), Gaps = 22/247 (8%)
Query: 39 GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-- 96
GE+SE+ + ++ + ++ TY E + RF F+ NLR + VH
Sbjct: 31 GERSEEEV---RRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSF 87
Query: 97 --GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
G+ +F+DLT E+R +LG +R +L A Q A ++LP DWR GAV
Sbjct: 88 RLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKKGAVG 144
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
VKDQG CGSCW+FSA A+EG + + TG+++ LSEQ+LVDCD S + GCNG
Sbjct: 145 AVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQGCNG 196
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANL 270
GLM+ AFE+I+ GG++ E+DYPY D K+ + + + + ++
Sbjct: 197 GLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKA 256
Query: 271 VKHGPLA 277
V + P++
Sbjct: 257 VANQPIS 263
>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
Length = 347
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/247 (39%), Positives = 131/247 (53%), Gaps = 24/247 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+K K+ K Y +EE+DY F N+ +L T G+ +DL SE+R+
Sbjct: 43 YKIKYDKHYDPEEENDY-MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRK 101
Query: 112 QFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
L R RL D+ + ++P N P DWR+H VT VK+QG CGSCW+FSA
Sbjct: 102 --LNGYRHRRLFGDSMRKNGTKFLVPFNVKAPDSVDWREHNLVTPVKNQGMCGSCWAFSA 159
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TGALEG HF +TG+LVSLSEQ LVDC + + GCNGGLM+ AFEYI G+
Sbjct: 160 TGALEGQHFRATGKLVSLSEQNLVDC-------STKYGNHGCNGGLMDLAFEYIKDNHGI 212
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ E+ YPY G + C F K I A F + DED + + GP++ +I+
Sbjct: 213 DTEEGYPYVGKE-MRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPIS---IAIDA 268
Query: 286 PHISFSF 292
H SF
Sbjct: 269 GHRSFQL 275
>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 130/236 (55%), Gaps = 18/236 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F KF+K Y+++ E RF++F+ NL + D TA + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGL----ALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQL+DCD+ D+GCNGGL+++A+E +
Sbjct: 137 CWAFATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
++ GGV+ E DYPY G+DG + + I+ E+++ L GP+
Sbjct: 188 MQMGGVQAENDYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPI 243
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/243 (36%), Positives = 133/243 (54%), Gaps = 22/243 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
+ +++ + Y +E R+++FK N+ R + + +D + + +F+DLT EFR
Sbjct: 42 WMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRAS-- 99
Query: 115 GLNRRLRLPA-----DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
R R A +A +P+ DWR GAVT +KDQG CGSCW+FSA A
Sbjct: 100 ----RNRFKAHICSTEATSFKYEHVAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAA 155
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG LSTG+L+SLSEQ+LVDCD SG D GCNGGLM+ AF++I + G+ E
Sbjct: 156 MEGITQLSTGKLISLSEQELVDCD------TSGE-DQGCNGGLMDDAFKFIEQNHGLATE 208
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHIS 289
+YPY GTDG + + AA ++ + + ++ ++ V H P+A +I+
Sbjct: 209 ANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIA---VAIDAGGFE 265
Query: 290 FSF 292
F F
Sbjct: 266 FQF 268
>gi|255586666|ref|XP_002533962.1| cysteine protease, putative [Ricinus communis]
gi|223526059|gb|EEF28418.1| cysteine protease, putative [Ricinus communis]
Length = 417
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 84/231 (36%), Positives = 131/231 (56%), Gaps = 26/231 (11%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSE 108
F +K K K Y EE + R F+ NL+ + ++++ L G+ KF+D++ E
Sbjct: 49 FQQWKEKHRKVYKHVEEAEKRLENFRRNLKYVVEKNQKKKNLGSAHTVGLNKFADMSNVE 108
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTND-------LPTDFDWRDHGAVTGVKDQGACGSC 161
FR+++L +++ P + ++ + P+ DWR G VT VKDQG CGSC
Sbjct: 109 FRQKYLS---KVKKPIKKRNNNLMTSRQRNLQSCVAPSSLDWRKKGVVTPVKDQGDCGSC 165
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGA+EG + + TG+LVSLSEQ+L+DCD + + GC+GG M+ AFE+++
Sbjct: 166 WAFSSTGAIEGINAIVTGDLVSLSEQELMDCD---------TTNYGCDGGYMDYAFEWVI 216
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDK--SKIAAAVSNFSVISSDEDQMAANL 270
GG++ E DYPYTG D G+C K +K+ + V SD + A +
Sbjct: 217 NNGGIDTEIDYPYTGVD-GTCNIAKEETKVVSVDGYEDVAESDSALLCATV 266
>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
Length = 350
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/274 (37%), Positives = 144/274 (52%), Gaps = 24/274 (8%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
+LL++ A+A D IR V SD E+ ++ H F+ F +++ K Y
Sbjct: 5 TLLIVFFCVATAAAGLSFHDSNPIRMV--SDMEKQLLQVIGESRHAVSFARFANRYGKRY 62
Query: 65 ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--NRRLRL 122
T +E RF++F NL+ + GV F+D T EFR LG N L
Sbjct: 63 DTVDEMKRRFKIFSENLQLIESTNKKRLGYTLGVNHFADWTWEEFRSHRLGAAQNCSATL 122
Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
+ + ++ LP + DWR G V+ VKDQG CGSCW+FS TGALE A+ + G+ +
Sbjct: 123 KGNHRITDVV----LPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNI 178
Query: 183 SLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
SLSEQQLVDC +G+ ++ GCNGGL + AFEYI GG+E E+ YPYTG + G
Sbjct: 179 SLSEQQLVDC--------AGAFNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTGQN-GP 229
Query: 242 CKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
CKF +A V N ++ + DE + A +
Sbjct: 230 CKFTSEDVAVQVLGSVNITLGAEDELKHAVAFAR 263
>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 133/237 (56%), Gaps = 20/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F F+K Y+++ E +RF++F+ NL + L D +A + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGLS----LPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + +L++LSEQQL+DCD D GC+GGL+++A+E +
Sbjct: 137 CWAFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
+ GG++ E DYPY + G C+ + +K V + I+ E+++ L GP+
Sbjct: 188 MNMGGIQAENDYPYEANN-GDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPI 243
>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
Length = 364
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 160/287 (55%), Gaps = 24/287 (8%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+I++ LL LL ++++A+ +D + P+ ++ +A +F F S+++K
Sbjct: 25 IIMNKSLLFLL--LVSTALTRQNDAVHTPTIKPT-----LYNINSAPLYFEKFISQYNKH 77
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
Y ++E YR+ +F+ N+ + + +AV+ + +F+D+T +E + GL L
Sbjct: 78 YKNEDEKKYRYNIFRHNIESINHKNSRNDSAVYKINRFADMTKNEVVIRHTGLASG-ELG 136
Query: 124 ADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ + ++ PT FDWR VT VKDQG CG+CW+F+ GALE + +
Sbjct: 137 VNFCETIVVDGPGQRQRPTSFDWRTLNKVTSVKDQGMCGACWAFAGLGALESQYAIKYDR 196
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
L+ LSEQQLVDCDH D GC+GGL+++A+E I++ GGVE++ DYPY +
Sbjct: 197 LIDLSEQQLVDCDH---------VDMGCDGGLIHTAYEEIMRMGGVEQDFDYPYRA-ERQ 246
Query: 241 SCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
C K AA V S + + +E+++ +L++H GP+A V ++++
Sbjct: 247 PCALKPHKFAAGVRSCYRYVLLNEERL-EDLLRHVGPIAIAVDAVDI 292
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/248 (38%), Positives = 130/248 (52%), Gaps = 19/248 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+K+ K+Y EE +R V++ NL+ + L +H G+ +F DLT
Sbjct: 78 HWELWKNWHQKSYHKAEE-GWRRMVWEENLKVIELHNLEQSLGLHTYQLGMNQFGDLTNE 136
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EF+ Q L R + L N +PT DWRDHG VT VK+QG CGSCW+FS
Sbjct: 137 EFQ-QMLISERHFSEGNRINGSAFLEVNYVQVPTSVDWRDHGYVTPVKNQGHCGSCWAFS 195
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALEG F +G LVSLSEQ LVDC + + GCNGG+++ AF+YIL+ G
Sbjct: 196 TTGALEGQLFRKSGRLVSLSEQNLVDCSWQ-------QGNQGCNGGIVDFAFQYILENRG 248
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIE 284
++ E YPYT D C F A V+ F I E+ + + GP++ +I+
Sbjct: 249 IDSEDCYPYTAKDTAQCAFKPECATARVTGFVDIPPHSEEALMKAVATVGPVS---VAID 305
Query: 285 LPHISFSF 292
SF F
Sbjct: 306 AHPTSFRF 313
>gi|300123574|emb|CBK24846.2| unnamed protein product [Blastocystis hominis]
Length = 305
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/241 (38%), Positives = 131/241 (54%), Gaps = 24/241 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
FS F++++ K Y E +R +VF+ N+ A++ + G+T F+D+T +EF
Sbjct: 21 FSAFEARYGKNY-LPAERAFRAKVFEYNMEWARKMNAQNHPYTVGMTPFADMTNTEFANS 79
Query: 113 FLG---LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
L L ++ PA PI+ D DWR+ GAVT VK+Q +CGSCW+FSATGA
Sbjct: 80 KLCGCMLKPKMTKPA----TPIMQRAD--ETVDWREKGAVTPVKNQASCGSCWAFSATGA 133
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG +F++ GEL+SLSEQQLVDCDH+ SGC GG M AFEY +K G+ +E
Sbjct: 134 MEGRNFVANGELISLSEQQLVDCDHQ---------SSGCGGGWMTYAFEYAMKK-GMCKE 183
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHIS 289
+DYPY D CK DK + + + V GP++ ++E I
Sbjct: 184 EDYPYHAVD-EDCKDDKCTPVVFPKGYEEVPMYDGAALKQAVSQGPVS---VAVEADSIV 239
Query: 290 F 290
F
Sbjct: 240 F 240
>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
Length = 460
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 163 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR + DL P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 223 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 280
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 281 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 331
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G +C F K +++ +S +E ++AA L K GP++
Sbjct: 332 DYSYRG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 377
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/254 (38%), Positives = 138/254 (54%), Gaps = 22/254 (8%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D LNA H+ +K+K K Y +EE +R V++ N++ + HG T
Sbjct: 22 DGSLNA--HWYRWKAKHRKLYGMREE-GWRRAVWEKNMKMIEVHNQEYSQGKHGFTMAMN 78
Query: 100 KFSDLTPSEFRRQFLGL-NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGAC 158
F D+T EFR+ G N++ + Q+ L ++P DWR+ G VT VK+QG C
Sbjct: 79 AFGDMTNEEFRQVMNGFRNQKHKKGKVFQEPSFL---EVPKSVDWREKGYVTPVKNQGQC 135
Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
GSCW+FSATGALEG F TG+L+SLSEQ LVDC P+ + GC+GGLM+ AF+
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSR---PQ----GNEGCDGGLMDYAFQ 188
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
YI + GG++ E+ YPY D SCK+ A + F I +E + + GP++
Sbjct: 189 YIKENGGLDSEESYPYDAMD-ESCKYRPEYSVANDTGFVDIPKEEKALMKAVATVGPIS- 246
Query: 279 NVASIELPHISFSF 292
+I+ H SF F
Sbjct: 247 --VAIDAGHESFQF 258
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 93/263 (35%), Positives = 131/263 (49%), Gaps = 16/263 (6%)
Query: 31 IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
I +V+ + ++E L+ E H + +K+ K Y E + RF +FK N+ +
Sbjct: 22 ISRVISRELHETETSLI--ERH-EQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFNAA 78
Query: 91 DPTAVH-GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
GV +DLT EF+ GL R +P DWR GAV
Sbjct: 79 GNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTSFKYENVTAIPASVDWRKKGAV 138
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
T +KDQG CGSCW+FS A EG H +STG+LVSLSEQ+LVDCD + D GC
Sbjct: 139 TPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRK-------GTDQGCE 191
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
GG M FE+I+K GG+ E +YPY D GSCK + + AA + + + + ++
Sbjct: 192 GGYMEDGFEFIIKNGGITTEANYPYKAVD-GSCK-NATAPAAQIKGYEKVPVNSEKALLK 249
Query: 270 LVKHGPLAGNVASIELPHISFSF 292
V + P++ SI+ SF F
Sbjct: 250 AVANQPVS---VSIDAADGSFMF 269
>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
Length = 484
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 133/227 (58%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N LR P + K + P ++DWR GAVT VKDQG CGSCW+FS TG +
Sbjct: 247 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 304
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 305 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 355
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G +C F K +++ +S +E ++AA L K GP++
Sbjct: 356 DYSYRG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPIS 401
>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
cysteine proteinase A-1; Flags: Precursor
gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 354
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/275 (36%), Positives = 143/275 (52%), Gaps = 27/275 (9%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
LL + V+ V A+I Q P D+ + A H+ FK + K + E
Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPP-----VDNFV-ASAHYGSFKKRHGKAFGGDAEEG 60
Query: 72 YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
+RF FK N++ A +P A + V+ KF+DLTP EF + +L + R D K
Sbjct: 61 HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYARHLKD-HKED 119
Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + P+ DWRD GAVT VK+QG CGSCW+FSA G +EG S LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
LV CD + D GCNGGLM+ A +I+++ G V E YPY T GG +
Sbjct: 180 MLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228
Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
D+ ++ A ++ F + DE+++A + K GP+A
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVA 263
>gi|354502593|ref|XP_003513368.1| PREDICTED: cathepsin L1-like isoform 2 [Cricetulus griseus]
Length = 330
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 91/240 (37%), Positives = 128/240 (53%), Gaps = 17/240 (7%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG---- 97
+ D L+AE H +K++ KTY EE R V++ N + + HG
Sbjct: 20 THDPSLDAEWH--EWKTQHGKTYVMDEEGQKR-AVWENNRKMIELHNEDYTKGKHGFHLE 76
Query: 98 VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
+ F DLT +EFR+ G + + P L D+P DWR HG VT VKDQG+
Sbjct: 77 MNAFGDLTNTEFRQLMTGFQSMGTTEMNVFQEPRL--GDVPKSVDWRKHGYVTPVKDQGS 134
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
C SCW+FSA G+LEG F TG+LV LSEQ LVDC ++GC+GGL SAF
Sbjct: 135 CVSCWAFSAVGSLEGQMFRKTGKLVPLSEQNLVDCSRS-------QHNNGCHGGLFTSAF 187
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+YI GG++ + YPY D G C++D AA ++ F V+ S+E+ + + GP++
Sbjct: 188 QYIKDNGGLDTSESYPYEAQD-GPCRYDPKHSAANITGFVVVPSNEEALMKAVATVGPIS 246
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 141/249 (56%), Gaps = 19/249 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
+S+ L +E H L+ S+ + Y + E RF +FK N++ + + + + G+
Sbjct: 28 RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLP-ADAQKAPI--LPTNDLPTDFDWRDHGAVTGVKDQG 156
+F+D+T EF +F GLN +P + +PI L +D+P++ DWR+ GAVT VK+QG
Sbjct: 87 EFADITSQEFLAKFTGLN----IPNSYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQG 142
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CG CW+FSA G+LEGA+ ++TG L+ SEQ+L+DC + + GCNGG M +A
Sbjct: 143 QCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFMTNA 193
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
F++I + GG+ RE DY Y G +C+ + A +S++ V+ E + + K
Sbjct: 194 FDFIKENGGISRESDYEYLGQQ-YTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQPVS 252
Query: 277 AGNVASIEL 285
G AS +L
Sbjct: 253 IGIAASQDL 261
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/278 (31%), Positives = 153/278 (55%), Gaps = 20/278 (7%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMI---RQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
L + +L++L +VLA + A+ D ++I R G +S++ +++ + + K K
Sbjct: 7 LMATILIVLFTVLAVSSAL--DMSIISYDRSHADKSGWKSDEEVMSIYEEWLV---KHGK 61
Query: 63 TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RR 119
Y EE + RF++FK NL + ++ T G+ +FSDL+ E+R ++LG R
Sbjct: 62 VYNAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKVGLNRFSDLSNEEYRSKYLGTKIDPSR 121
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ + +P + N LP DWR GAV VK+Q C CW+FSA A+EG + + TG
Sbjct: 122 MMARPSRRYSPRVADN-LPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTG 180
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L +LSEQ+L+DCD + ++GC+GGL++ AFE+I+ GG++ E+DYP+ G DG
Sbjct: 181 NLTALSEQELLDCDR--------TVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADG 232
Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++ + A + + + + ++ V + P++
Sbjct: 233 ICDQYKINARAVTIDGYERVPAYDELALKKAVANQPVS 270
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 80/192 (41%), Positives = 113/192 (58%), Gaps = 13/192 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
+ + K K Y E RF +FK NLR R + + G+ +F+DLT E+R
Sbjct: 43 YETWLVKHGKNYNGLGEKQLRFNIFKDNLRFVDERNSENLSFKLGLNRFADLTNEEYRSV 102
Query: 113 FLGLNRRLRLPADAQKA-----PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
+LG R A + ++ + LP DWR GAV G+KDQG+CGSCW+FSA
Sbjct: 103 YLGTRPRSVAVARSGRSKSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAI 162
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
A+EG + + TG+L+SLSEQ+LV+CD S + GC+GGLM+ AFE+I+K G++
Sbjct: 163 AAVEGVNQIVTGDLISLSEQELVECDT--------SYNDGCDGGLMDYAFEFIIKNEGID 214
Query: 228 REKDYPYTGTDG 239
++DYPYTG DG
Sbjct: 215 SDEDYPYTGRDG 226
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 141/249 (56%), Gaps = 19/249 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
+S+ L +E H L+ S+ + Y + E RF +FK N++ + + + + G+
Sbjct: 28 RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLP-ADAQKAPI--LPTNDLPTDFDWRDHGAVTGVKDQG 156
+F+D+T EF +F GLN +P + +PI L +D+P++ DWR+ GAVT VK+QG
Sbjct: 87 EFADITSQEFLAKFTGLN----IPNSYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQG 142
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CG CW+FSA G+LEGA+ ++TG L+ SEQ+L+DC + + GCNGG M +A
Sbjct: 143 QCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFMTNA 193
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
F++I + GG+ RE DY Y G +C+ + A +S++ V+ E + + K
Sbjct: 194 FDFIKENGGISRESDYEYLGQQ-YTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQPVS 252
Query: 277 AGNVASIEL 285
G AS +L
Sbjct: 253 IGIAASQDL 261
>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
Length = 324
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 86/237 (36%), Positives = 132/237 (55%), Gaps = 20/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F KF+K Y+++ E +RF++F+ NL + D TA + + KFSDL+
Sbjct: 21 LLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQNDSTAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + IL P + P +FDWR VT VK+QG CG+
Sbjct: 81 KEEAISKYTGLS----LPHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + L++LSEQQ +DCD ++GC+GGL+++AFE
Sbjct: 137 CWAFATLGSLESQFAIKYNRLINLSEQQFIDCDR---------VNAGCDGGLLHTAFESA 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPL 276
++ GGV+ E DYPY T G C+ + ++ V S I E+++ L GP+
Sbjct: 188 MEMGGVQMESDYPYE-TANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPI 243
>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
Length = 327
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/254 (33%), Positives = 137/254 (53%), Gaps = 20/254 (7%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
+L ++E F F K++K+Y+++EE +F FK N+R + L +AV+ + +SD+
Sbjct: 17 NLNDSEKLFEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINEKNSLSNSAVYDINFYSDM 76
Query: 105 TPSEFRRQFLGLNRRLR---------LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
+E R+ G L+ + + + P LP FDWRD +T VK+Q
Sbjct: 77 NKNELLRKQTGFKINLKKNNLDLSWNIKCNKKLINGNPAVLLPDSFDWRDRHVITSVKNQ 136
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
CGSCW+FS +E + + +L+ LSEQQLV+CD + ++GCNGGLM+
Sbjct: 137 RDCGSCWAFSTIANIESLYAIKYNKLLDLSEQQLVNCDEQ---------NNGCNGGLMHW 187
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
A E I++ GGV E D+PYT +D G CK + + N I S+ED++ L+ +GP
Sbjct: 188 AMEEIIRQGGVSNETDFPYTASD-GFCKRKQGFVNINGCN-QFILSNEDRLRELLIFNGP 245
Query: 276 LAGNVASIELPHIS 289
++ + I++ S
Sbjct: 246 ISIAIDVIDVIDYS 259
>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
Length = 324
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 132/237 (55%), Gaps = 20/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F F+K Y+++ E +RF++F+ NL + L D +A + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGLS----LPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + +L++LSEQQL+DCD D GC+GGL+++A+E +
Sbjct: 137 CWAFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
+ GG++ E DYPY + G C+ + +K V + + E+++ L GPL
Sbjct: 188 MNMGGIQAENDYPYEANN-GDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPL 243
>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 90/237 (37%), Positives = 120/237 (50%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ANL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P D + P DWR+ GAVT VK+QG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVDVEFV------GAPAAKDWREEGAVTAVKNQGMCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--K 222
+A G +E FL+ L LSEQ LV CD+ +SGC GG AF++I+
Sbjct: 151 AAIGNIECQWFLAGNPLTRLSEQMLVSCDNT---------NSGCGGGWPLVAFKWIVDRN 201
Query: 223 AGGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E+ YPY G S C + A ++ + I DE+ +AA L +GP+A
Sbjct: 202 NGTVYTEESYPYHSCIGISPPCTTSGHTVGATITGYVTIPRDENGIAAWLAVNGPVA 258
>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
Length = 333
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 96/261 (36%), Positives = 133/261 (50%), Gaps = 24/261 (9%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
+PSD + + H+ FK+ +KTYA E YR +VFK N +R AK
Sbjct: 18 IPSD--------MEIQAHWESFKATHAKTYANAAEEAYRAKVFKENAIRIAKHNDRFASG 69
Query: 94 AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
V G +++D+ E + G L+ + + DWR GAVT
Sbjct: 70 EVTFKVGYNQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVT 129
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
+KDQG CGSCWSFSATG+LEG FL LVSLSEQ LVDC + E GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
GLM+SAFEY+ GG++ E+ YPYT D G+C + + A + + V + E +
Sbjct: 183 GLMDSAFEYVKSYGGIDTEESYPYTAED-GTCLYKAANNAGVNTGYKDVQAKSESALRDA 241
Query: 270 LVKHGPLAGNVASIELPHISF 290
+ K GP++ +I+ + SF
Sbjct: 242 VEKVGPVS---VAIDASNWSF 259
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/257 (37%), Positives = 134/257 (52%), Gaps = 22/257 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKF 101
L+ AE +S FK+ K YA+ E YR +++ N L+ A+ + + V + +F
Sbjct: 22 LVGAE--WSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEF 79
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGA 157
DL EF G R R + P LP DWR GAVT VK+QG
Sbjct: 80 GDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVKNQGQ 139
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TG+LEG HF T +LVSLSEQ LVDC ++GC GGLM++AF
Sbjct: 140 CGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFG-------NNGCEGGLMDNAF 192
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
+YI G++ E YPY TD G C F++S + A + F I DE+++ + GP+
Sbjct: 193 KYIKSNKGIDTEWSYPYNATD-GVCHFNRSDVGATDTGFVDIPEGDENKLKKAVAAVGPV 251
Query: 277 AGNVASIELPHISFSFL 293
+ +I+ H SF F
Sbjct: 252 S---VAIDASHESFQFY 265
>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
Length = 332
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/260 (37%), Positives = 136/260 (52%), Gaps = 28/260 (10%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
DH L+A+ + +K+ K Y EE R +++ N++ +R H T
Sbjct: 22 DHSLDAD--WYKWKATHRKLYGLNEE-GRRRAIWEKNMKMIERHNWEHRQGKHSFTMAMN 78
Query: 100 KFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
F D+T EFR+ G +++ ++ DA A P DWR+ G VT VK+Q
Sbjct: 79 AFGDMTNEEFRKTMNGFQNQKHKKGKVFLDAGSALT------PHSVDWREKGYVTAVKNQ 132
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FSATGALEG F T +L+SLSEQ LVDC PE + GCNGGLM++
Sbjct: 133 GHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSW---PEG----NEGCNGGLMDN 185
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
AF+YI GG++ E+ YPY G D GSCK+ AA + + I E + + GP
Sbjct: 186 AFQYIKDNGGLDSEESYPYFGKD-GSCKYKPQSSAANDTGYVDIPKQEKALMKAVATVGP 244
Query: 276 LAGNVASIELPHISFSFLFT 295
++ I+ H SF F T
Sbjct: 245 IS---VGIDASHESFQFYST 261
>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
Length = 459
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/232 (40%), Positives = 133/232 (57%), Gaps = 24/232 (10%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F + +++TY TQEE +R VF N+ RA++ Q LD TA +G+TKFSDLT EFR
Sbjct: 162 FKEFVTTYNRTYGTQEEAQWRLSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRA 221
Query: 112 QFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
+L N+ + L + P ++DWR GAVT VK+QG CGSCW+FS
Sbjct: 222 IYLNPLLKENRNKMMHLAKSI-------GDHAPPEWDWRTKGAVTNVKNQGMCGSCWAFS 274
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TG +EG FL G+L+SLSEQ+L+DCD D C GGL ++A+ I GG
Sbjct: 275 VTGNVEGQWFLKQGDLLSLSEQELLDCD---------KVDKACLGGLPSNAYLAIKNLGG 325
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+E E DY Y+G +C F K +++ +S +E ++AA L K GP++
Sbjct: 326 LETEDDYSYSG-HLQTCSFSAKKAKVYINDSVELSQNEQKLAAWLAKKGPIS 376
>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
Length = 340
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 81/238 (34%), Positives = 135/238 (56%), Gaps = 15/238 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+F F ++++K Y +++E YR+ +F+ N+ ++ + +AV+ + +F+D+T +E
Sbjct: 42 YFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNEIVI 101
Query: 112 QFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+ GL L A+ + ++ P +FDWR VT VKDQG CG+CW+F+ G
Sbjct: 102 RHTGLASG-ELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCGACWAFAGLG 160
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
ALE + + L+ L+EQQLVDCD D GC+GGL+++A+E I++ GGVE+
Sbjct: 161 ALESQYAIKYDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMRMGGVEQ 211
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIEL 285
E DYPY + C K AA V N + + +E+++ L GP+A V +++L
Sbjct: 212 EFDYPYK-AERQPCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVDL 268
>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
Length = 339
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 81/238 (34%), Positives = 135/238 (56%), Gaps = 15/238 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+F F ++++K Y +++E YR+ +F+ N+ ++ + +AV+ + +F+D+T +E
Sbjct: 41 YFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNEIVI 100
Query: 112 QFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+ GL L A+ + ++ P +FDWR VT VKDQG CG+CW+F+ G
Sbjct: 101 RHTGLASG-ELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCGACWAFAGLG 159
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
ALE + + L+ L+EQQLVDCD D GC+GGL+++A+E I++ GGVE+
Sbjct: 160 ALESQYAIKYDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMRMGGVEQ 210
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIEL 285
E DYPY + C K AA V N + + +E+++ L GP+A V +++L
Sbjct: 211 EFDYPYK-AERQPCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVDL 267
>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
Length = 333
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/253 (37%), Positives = 131/253 (51%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
+ NA+ H +KS + Y T EE ++R V++ N++ + HG T
Sbjct: 22 NQTFNAQWH--KWKSTHRRLYDTNEE-EWRRAVWEKNMKMIELHNGEYSEGKHGFTMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ LP DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQLVNGYKHQKHRKGKLFQEPLML--QLPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA GALEG L TG LVSLSEQ LVDC G + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSR-------GEGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
+L G++ E+ YPY D G+CK+ AA + + I E + + GP+A
Sbjct: 190 VLNNKGLDSEESYPYEAKD-GTCKYKPEFAAANDTGYVDIPQLEKALMKAVATVGPIA-- 246
Query: 280 VASIELPHISFSF 292
+I+ H SF F
Sbjct: 247 -VAIDASHPSFQF 258
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 130/237 (54%), Gaps = 18/237 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F + ++ KTY+++EE R +VF+ N + + + + + F+DLT EF+
Sbjct: 29 FEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEFKA 88
Query: 112 QFLGLNRRLRLPADAQ--KAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
LG + P AQ ++ P +L P DWR GAVTGVKDQG CG CWSFS T
Sbjct: 89 SRLGFS-----PGRAQSIRSVGTPVQELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTT 143
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
GA+EG + + TG LVSLSEQ+LVDCD S +SGC GGLM+ A+++++K G++
Sbjct: 144 GAIEGINKIVTGSLVSLSEQELVDCDR--------SYNSGCEGGLMDYAYQFVIKNQGID 195
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
E DYPY G D K K + ++ I ++++ +V P++ + E
Sbjct: 196 SEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSE 252
>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
Length = 337
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/238 (34%), Positives = 134/238 (56%), Gaps = 15/238 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
+F F ++++K Y T++E YR+ +F+ N+ + + +A++ + +F+D+T +E
Sbjct: 39 YFEKFIAQYNKKYKTEDEKKYRYNIFRHNMESINHKNSRNDSAIYKINRFADMTKNEVVI 98
Query: 112 QFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+ GL L A+ + ++ PT FDWR VT VKDQG CG+CW+F+ G
Sbjct: 99 RHTGLASG-ELGANFCETIVVDGPAQRQRPTSFDWRTLNKVTSVKDQGMCGACWAFAGLG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
ALE + + L+ L+EQQLVDCD S D GC+GGL+++A+E I+ GGVE+
Sbjct: 158 ALESQYAIKYDRLIDLAEQQLVDCD---------SVDMGCDGGLIHTAYEQIMHMGGVEQ 208
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
E DYPY + C K AA V S + + +E+++ L GP+A V +++L
Sbjct: 209 EFDYPYR-AERQPCALKPHKFAAGVRSCYRYVLLNEERLEDLLRYVGPIAIAVDAVDL 265
>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
Length = 322
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 132/246 (53%), Gaps = 26/246 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FK ++ + Y T E YR VF+ N + + + + T + +F D+T EF
Sbjct: 22 FKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEFAA 81
Query: 112 QFLG-LNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
G LN R P IL +D LP DWR GAVT VKDQ CGSCW+FS TG
Sbjct: 82 TMNGFLNVPTRHPV-----AILEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTG 136
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVE 227
+LEG HFL G+LVSLSEQ LVDC SG + GC GGLM+ AF+YI + G++
Sbjct: 137 SLEGQHFLKDGKLVSLSEQNLVDC--------SGKFGNMGCCGGLMDQAFKYIKENKGID 188
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELP 286
E+ YPY D G C+FD S + A + F I+ +E+ + + GP++ +I+
Sbjct: 189 TEESYPYEAQD-GKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPIS---VAIDAS 244
Query: 287 HISFSF 292
H SF F
Sbjct: 245 HPSFQF 250
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 82/225 (36%), Positives = 121/225 (53%), Gaps = 28/225 (12%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL--DPTAVH-GVTKFSDLTPSEF 109
F +K + K Y EE R FK NL+ R + P H G+ +F+D++ EF
Sbjct: 51 FQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFADMSNEEF 110
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
+ +F+ + + +D P DWR G VTGVKDQG CGSCWSFS+TGA
Sbjct: 111 KNKFI--------------SKVESCDDAPYSLDWRKKGVVTGVKDQGNCGSCWSFSSTGA 156
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
+EG + + TG+L+SLSEQ+LVDCD + + GC GG M+ AFE+++ GG++ E
Sbjct: 157 IEGVNAIVTGDLISLSEQELVDCD---------TTNDGCEGGYMDYAFEWVINNGGIDTE 207
Query: 230 KDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKH 273
DYPY G GG+C K + + ++ ++ + + VK
Sbjct: 208 ADYPYIGV-GGTCNVTKEETKVVTIDGYTDVTQSDSALFCATVKQ 251
>gi|23452059|gb|AAN32912.1| cathepsin [Danio rerio]
Length = 310
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 94/245 (38%), Positives = 128/245 (52%), Gaps = 27/245 (11%)
Query: 62 KTYATQEEH----DYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQF 113
K + +++ H +R +K NL+ + L +H G+ F D+T EFR+
Sbjct: 6 KKWPSKKXHAPXXGWRRIFWKKNLKXIEMHNLXHSMGIHTYRLGMNHFGDMTHEEFRQVM 65
Query: 114 LGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
G +RR R + I ++P DWR+ G VT VKDQG CGSCW+FS TGA
Sbjct: 66 NGFKHKKDRRFRGSLFMEPXFI----EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGA 121
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LEG F TG+LVSLSEQ LVDC PE + GCNGGLM+ AF+Y+ G++ E
Sbjct: 122 LEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDQNGLDSE 174
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
+ YPY GTD C FD AA + F + S E + + GP++ +I+ H
Sbjct: 175 ESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALMKAIAAVGPVS---VAIDAGHE 231
Query: 289 SFSFL 293
SF F
Sbjct: 232 SFQFY 236
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 121/211 (57%), Gaps = 17/211 (8%)
Query: 73 RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LNR-RLRLPADAQKA 129
R+ +FK NLR + G+ F+DLT EFR Q G +R R R + +
Sbjct: 85 RYGIFKDNLRFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSYEEFRY 144
Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
+ DLP DWR+ GAV GVKDQG+CGSCW+FSA A+EG + L+TGELVSLSEQ+L
Sbjct: 145 GSVQLKDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQEL 204
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCD D GCNGGLM+ AF +++K GG++ E DYPY G G C D+SK+
Sbjct: 205 VDCDK--------GEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGY-GTRC--DRSKM 253
Query: 250 AAAV---SNFSVISSDEDQMAANLVKHGPLA 277
A V + + +++ V H P++
Sbjct: 254 NAKVVTIDGYEDVPVNDETALLKAVAHQPVS 284
>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 341
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 135/249 (54%), Gaps = 20/249 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
F +K KF K+Y ++ + R +++ N + +L + G+T+F+D+ E
Sbjct: 33 FHAWKLKFEKSYDSESDEAQRKQIWLNNRKHVLVHNILADQGLKSYRLGMTQFADMENEE 92
Query: 109 FRR---QFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
++R Q + LP LP LP DWRD G VT V++Q CGSCW+F
Sbjct: 93 YKRLVSQGCLHSFNSSLPRRGSTFFRLPKGTVLPDTVDWRDKGYVTNVQNQMDCGSCWAF 152
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
SATG+LEG HF TG+LVSLS+QQLVDC E E GCNGGLM+SAF+YI G
Sbjct: 153 SATGSLEGQHFRKTGKLVSLSKQQLVDCSGEFGNE-------GCNGGLMDSAFQYIQANG 205
Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASI 283
G++ E+ YPY D G C+++ A + + V ++E+ + + GP++ +I
Sbjct: 206 GIDTEESYPYEAED-GKCRYNPKSTGATCTGYVDVQPANEETLKEAVATIGPIS---VAI 261
Query: 284 ELPHISFSF 292
+ H SF F
Sbjct: 262 DAFHPSFQF 270
>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 122/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT +E R F+ NL + Q +P A G+TKF DL+ EF +
Sbjct: 38 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +L+T L+SLSEQ+LV CD D GCNGGLM AF+++L + G V
Sbjct: 158 NIESQWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G + +S I A + I S+ED MAA L +GP+A
Sbjct: 209 YTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIA 262
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/260 (36%), Positives = 130/260 (50%), Gaps = 25/260 (9%)
Query: 47 LNAEH--HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ----LLDPTAVHGVTK 100
LN +H F +K+ + K Y T EE + + + N + L + + +
Sbjct: 21 LNQQHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYSLKQKSYRLEMNE 80
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL-------PTNDLPTDFDWRDHGAVTGVK 153
+ DLT EF G +RL + LPT DWR HG VT VK
Sbjct: 81 YGDLTSEEFSSMMNGYRNDIRLKRKSTGGSTYLNLLSFGSQIQLPTLVDWRKHGLVTPVK 140
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
+QG CGSCWSFSATG+LEG H TG+LVSLSEQ L+DC PE + GCNGGLM
Sbjct: 141 NQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCS---TPEG----NDGCNGGLM 193
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
+ AF+YI GG++ E YPY D +C+F+ + A + F + S DE+ +
Sbjct: 194 DQAFKYIKIQGGIDTEAYYPYEAKD-DTCRFNITDSGATDTGFVDIKSGDEEMLKEAAAT 252
Query: 273 HGPLAGNVASIELPHISFSF 292
GP++ +I+ H SF F
Sbjct: 253 VGPIS---VAIDASHTSFQF 269
>gi|146078033|ref|XP_001463431.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|134067516|emb|CAM65796.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 381
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 96/236 (40%), Positives = 128/236 (54%), Gaps = 22/236 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
+L N A Q A + +P DWR+ GAVT VKDQGACGSCW+FSA
Sbjct: 98 YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSA 155
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
G +E + LVSLSEQQLV CD + D+GCNGGLM AFE++L+ G
Sbjct: 156 VGNIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYG 206
Query: 225 GVEREKDYPYTGTDGGSCK-FDKSKI--AAAVSNFSVISSDEDQMAANLVKHGPLA 277
V EK YPYT +G + + SK+ A + + +I S+E MAA L ++GP+A
Sbjct: 207 IVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIA 262
>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 122/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + YAT +E R F+ NL + Q +P A G+TKF DL+ EF +
Sbjct: 38 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L + + + + + P DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
+E +L+T L+SLSEQ+LV CD D GCNGGLM AF+++L + G V
Sbjct: 158 NIESQWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G + +S I A + I S+ED MAA L +GP+A
Sbjct: 209 YTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIA 262
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/233 (36%), Positives = 131/233 (56%), Gaps = 17/233 (7%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ ++K SKTY ++ E R +++ NLR L +H G+ D+T
Sbjct: 25 HWLMWKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDMTRE 84
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
E + F G R+R + +P + + + P DWR+ G VT VK+QG+CGSCW+FS
Sbjct: 85 EILQMFAGT--RVRPNLTRRSSPFVASAGISVPDSVDWREKGYVTEVKNQGSCGSCWAFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
A GALEG +TG++ SLS Q LVDC S + GCNGG M AF+Y++ GG
Sbjct: 143 AAGALEGQLKRTTGQVKSLSPQNLVDC-------SSKYGNKGCNGGFMTQAFQYVIDDGG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLA 277
++ ++ YPYT D G C++D+S+ AA S+++ +S DE+ + + GP++
Sbjct: 196 IDSDEAYPYTAMD-GQCRYDQSQRAANCSSYNYVSEGDEEALKQAVATIGPIS 247
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 129/237 (54%), Gaps = 31/237 (13%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSE 108
F FK + KTY Q E RF +F N+R + L + G+ KF+D++ E
Sbjct: 26 FQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEE 85
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTN-------DLPTDFDWRDHGAVTGVKDQGACGSC 161
F+ L A + P L T ++P+ DWR G VTGVKDQG CGSC
Sbjct: 86 FKTM---------LTLSASRKPTLETTSYVKTGVEIPSSVDWRKEGRVTGVKDQGDCGSC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG+ EGA+ +G+LVSLSEQQL+DC C +GC+GG ++ F+Y++
Sbjct: 137 WAFSITGSTEGAYARKSGKLVSLSEQQLIDC---CT-----DTSAGCDGGSLDDNFKYVM 188
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
K G++ E+ Y Y G D G+CK++ + + VS ++ I + DED + + GP++
Sbjct: 189 K-DGLQSEESYTYKGED-GACKYNVASVVTKVSKYTSIPAEDEDALLEAVATVGPVS 243
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 93/255 (36%), Positives = 141/255 (55%), Gaps = 19/255 (7%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTK 100
S+D ++ A H + +++S+ Y E RF VFKAN++ + GV +
Sbjct: 121 SDDSVMVARHE--QWMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQ 178
Query: 101 FSDLTPSEFR--RQFLGL-NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
F+DLT EFR + GL + +++P + + + LPT DWR GAVT +KDQG
Sbjct: 179 FADLTNDEFRSTKTNKGLKSSNMKIPTGFRYENV-SADALPTTIDWRTKGAVTPIKDQGQ 237
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CG CW+FSA A EG +STG+LVSL+EQ+LVDCD + D GC GGLM+ AF
Sbjct: 238 CGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGE-------DQGCEGGLMDDAF 290
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
++I+K GG+ E YPYT D G CK S AA + + + ++++ V + P++
Sbjct: 291 KFIIKNGGLTTESSYPYTAAD-GKCK-SGSNSAATIKGYEDVPANDEAALMKAVANQPVS 348
Query: 278 GNVASIELPHISFSF 292
+++ ++F F
Sbjct: 349 ---VAVDGGDMTFQF 360
>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
Length = 354
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/275 (36%), Positives = 144/275 (52%), Gaps = 27/275 (9%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
LL + V+ V A+I Q P+ D+ + A H+ FK + SK + E
Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPA-----VDNFV-ASAHYGSFKKRHSKAFGGDAEEG 60
Query: 72 YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
+RF FK N++ A +P A + V+ KF+DLTP EF + +L + D K
Sbjct: 61 HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYTSHLKD-HKED 119
Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + P+ DWRD GAVT VK+QG CGSCW+FSA G +EG S LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
LV CD + D GCNGGLM+ A +I+++ G V E YPY T GG +
Sbjct: 180 MLVSCD---------NVDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228
Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
D+ ++ A ++ F + DE+++A + K GP+A
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIADWVEKRGPVA 263
>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
Length = 321
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 95/253 (37%), Positives = 128/253 (50%), Gaps = 19/253 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK+++ + Y +E YR RVF+ N + K+ + + T + +F
Sbjct: 13 LATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQF 72
Query: 102 SDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
D+T EF G + R P A P + D DWR VT VKDQ CGS
Sbjct: 73 GDMTNEEFNAVMKGYKKGSRGEPKAVFTAEAGP---MAADVDWRTKALVTPVKDQEQCGS 129
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FSATGALEG HFL ELVSLSEQQLVDC + + GC GG M SAF+YI
Sbjct: 130 CWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYG-------NDGCGGGWMTSAFDYI 182
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
GG++ E YPY D SC+FD + I A + + E+ + + GP++
Sbjct: 183 KDNGGIDTESSYPYEAED-RSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPIS--- 238
Query: 281 ASIELPHISFSFL 293
+I+ H SF F
Sbjct: 239 VAIDASHFSFQFY 251
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/253 (34%), Positives = 142/253 (56%), Gaps = 19/253 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
+S+ L +E H L+ S+ + Y + E RF +FK N++ + + + + G+
Sbjct: 28 RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86
Query: 100 KFSDLTPSEFRRQFLGLN--RRLRLPA-----DAQKAPILPTNDLPTDFDWRDHGAVTGV 152
+F+D+T EF +F GLN P+ + +K L +D+P++ DWR+ GAVT V
Sbjct: 87 EFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLSDDDMPSNLDWRESGAVTQV 146
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
K QG CG CW+FSA G+LEGA+ ++TG+L+ SEQ+L+DC + + GCNGG
Sbjct: 147 KHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCT---------TNNYGCNGGF 197
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
M +AF++I++ GG+ RE DY Y G + +C+ + A +S++ V+ E + + K
Sbjct: 198 MTNAFDFIIENGGISRESDYEYLG-EQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTK 256
Query: 273 HGPLAGNVASIEL 285
G AS +L
Sbjct: 257 QPVSIGIAASQDL 269
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 96/287 (33%), Positives = 150/287 (52%), Gaps = 36/287 (12%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
LL +L++ + A A N +A + E+ ED + +++ + Y +E
Sbjct: 14 LLFVLAAWASQATARNLHEASMY-------ERHEDWM-----------AQYGRVYKDADE 55
Query: 70 HDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEF---RRQFLGLNRRLRLPAD 125
R+++FK N+ R + + +D + + +F+DLT EF R +F + +
Sbjct: 56 KSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGTSRNRF----KAHICSTE 111
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A +P+ DWR GAVT +KDQG CGSCW+FSA A+EG LSTG+L+SLS
Sbjct: 112 ATSFKYENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLS 171
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQ+LVDCD SG D GCNGGLM+ AF++I + G+ E +YPY GTDG +
Sbjct: 172 EQELVDCD------TSGE-DQGCNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKK 224
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+ AA ++ + + ++ ++ V H P+A +I+ F F
Sbjct: 225 AAHPAAKINGYEDVPANNEKALQKAVVHQPIA---VAIDAGGFEFQF 268
>gi|261289781|ref|XP_002611752.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
gi|229297124|gb|EEN67762.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
Length = 327
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 93/263 (35%), Positives = 139/263 (52%), Gaps = 23/263 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKF 101
L+N E + +FK +++ YA +EE+ R +F+ NL+ + +H GV ++
Sbjct: 18 LMNPE--WEVFKKAYNRVYAAEEEYARRL-IFEDNLKTIQMHNEEADRGLHTFRLGVNQY 74
Query: 102 SDLTPSEFRRQFLG---LNRRLRLPADAQKAPILPT-NDLPTDFDWRDHGAVTGVKDQGA 157
+D+T EF +G L+ PT D+P DWRD G VT VK+Q
Sbjct: 75 ADMTHKEFLENVIGGCLLDTNTSKSTADHVHEYDPTLTDVPDTVDWRDKGYVTPVKNQAQ 134
Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
CGSCW+FS TG+LEG HF +T +LVSLSEQ L+DC + + GC GGLM+ AF
Sbjct: 135 CGSCWAFSTTGSLEGQHFKATNKLVSLSEQNLMDCSRK-------EGNQGCQGGLMDQAF 187
Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPL 276
+YI GG++ E+ YPY + C + S A +S+++ V S DED + + GP+
Sbjct: 188 KYIKTNGGIDTEECYPYKAKN-EQCNYQASCSGATLSSYTDVKSKDEDALQQAVATVGPI 246
Query: 277 AGNVASIELPHISFSFLFTVSSP 299
+ +I+ H SF + P
Sbjct: 247 S---VAIDAGHSSFQLYHSGKPP 266
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 84/245 (34%), Positives = 132/245 (53%), Gaps = 24/245 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + K +K Y E +D +++ FK N+ + V G+ +F+DLT E+++
Sbjct: 34 FLGWMKKHNKAYHHHEFND-KYQTFKDNMDFIHNWNSKESDTVLGLNRFADLTNEEYKKT 92
Query: 113 FLGLNRRLRLPADAQKAPILPTNDL-------PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
+LG++ + L A+ +P N L P+ DWR +GAV VKDQG CGSCW+F+
Sbjct: 93 YLGMSINVNLRANQ-----VPMNGLNFERFTGPSSIDWRQNGAVAYVKDQGHCGSCWAFA 147
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGA+EGAH + TG +V+ SEQ LVDC ++GC+GGLM SAF+YI+ G
Sbjct: 148 TTGAVEGAHQIKTGNMVTFSEQHLVDCSGRYG-------NNGCDGGLMTSAFKYIIDNDG 200
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
+ E+ YPYT T C ++ + + A+S + + + + P+A +I+
Sbjct: 201 IATEEAYPYTATQ-NRCVYNTTMLGTAISGYKDVPRGSESALTAAISKQPVA---VAIDA 256
Query: 286 PHISF 290
I+F
Sbjct: 257 SPITF 261
>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
Length = 306
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 98/247 (39%), Positives = 132/247 (53%), Gaps = 26/247 (10%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FK ++ + Y T E YR VF+ N + + + + T + +F D+T EF
Sbjct: 6 FKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEFAA 65
Query: 112 QFLG-LNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
G LN R P IL +D LP DWR GAVT VKDQ CGSCW+FS TG
Sbjct: 66 TMNGFLNVPTRHPV-----AILEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTG 120
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVE 227
+LEG HFL G+LVSLSEQ LVDC SG + GC GGLM+ AF+YI + G++
Sbjct: 121 SLEGQHFLKDGKLVSLSEQNLVDC--------SGKFGNMGCCGGLMDQAFKYIKENKGID 172
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELP 286
E+ YPY D G C+FD S + A + F I+ +E+ + + GP++ +I+
Sbjct: 173 TEESYPYEAQD-GKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPIS---VAIDAS 228
Query: 287 HISFSFL 293
H SF F
Sbjct: 229 HPSFQFY 235
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 121/211 (57%), Gaps = 17/211 (8%)
Query: 73 RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LNR-RLRLPADAQKA 129
R+ +FK NLR + G+ F+DLT EFR Q G +R R R + +
Sbjct: 85 RYGIFKDNLRFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSHEEFRY 144
Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
+ DLP DWR+ GAV GVKDQG+CGSCW+FSA A+EG + L+TGELVSLSEQ+L
Sbjct: 145 GSVQLKDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQEL 204
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDCD D GCNGGLM+ AF +++K GG++ E DYPY G G C D+SK+
Sbjct: 205 VDCDK--------GEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGY-GTRC--DRSKM 253
Query: 250 AAAV---SNFSVISSDEDQMAANLVKHGPLA 277
A V + + +++ V H P++
Sbjct: 254 NAKVVTIDGYEDVPVNDETALLKAVAHQPVS 284
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 129/227 (56%), Gaps = 11/227 (4%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + SK K Y + EE +RF +FK NL G+ +F+DL+ EF+ +
Sbjct: 33 FESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKKVVNYWLGLNEFADLSHEEFKNK 92
Query: 113 FLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+LGLN L + +++ + +P DWR GAVT VK+QG+CGSCW+FS A+E
Sbjct: 93 YLGLNVDLSNRRECSEEFTYKDVSSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVE 152
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TG L SLSEQ+LVDCD + ++GCNGGLM+ AF YI+ GG+ +E+D
Sbjct: 153 GINQIVTGNLTSLSEQELVDCDT--------TYNNGCNGGLMDYAFAYIISNGGLHKEED 204
Query: 232 YPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY + G+C+ K++ +S + + + ++ + + PL+
Sbjct: 205 YPYI-MEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLS 250
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 152/295 (51%), Gaps = 36/295 (12%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
+S L+ L ++ + A+A DA I E+ E+ + ++F + Y+
Sbjct: 10 ISLALIFFLGALASQAIARTLQDASIH-------EKHEEWM-----------TRFKRVYS 51
Query: 66 TQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
+E + R+++FK N++R + + + + G+ +F+DLT EF+ NR
Sbjct: 52 DAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFADLTNEEFKTS---RNRFKGHMC 108
Query: 125 DAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
+Q P N +P+ DWR GAVT +KDQG CGSCW+FSA A+EG L+T +L+
Sbjct: 109 SSQAGPFRYENITAVPSSMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLI 168
Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
SLSEQ+LVDCD + + D GC GGLM+ AF++I + G+ E +YPY G+DG
Sbjct: 169 SLSEQELVDCDTKGE-------DQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCN 221
Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVS 297
++ AA ++ F + ++ + V P+ S+ + F F F S
Sbjct: 222 TKQEANHAAKINGFEDVPANNEGALMKAVAKQPV-----SVAIDAGGFEFQFYSS 271
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 94/274 (34%), Positives = 145/274 (52%), Gaps = 37/274 (13%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
LL +L++ + A A N +A + E+ ED ++ ++ + Y +E
Sbjct: 14 LLFVLAAWASQATARNLHEASMY-------ERHEDWMV-----------QYGREYKDADE 55
Query: 70 HDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA---- 124
R+++FK N+ R + + +D + + +F+DLT EFR R R A
Sbjct: 56 KSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRAS------RNRFKAHICS 109
Query: 125 -DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+A +P+ DWR GAVT +KDQG CGSCW+FSA A+EG LSTG+L+S
Sbjct: 110 TEATSFKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD SG D GC+GGLM+ AF++I + G+ E +YPY GTDG +
Sbjct: 170 LSEQELVDCD------TSGE-DQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNR 222
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ AA ++ + + ++ ++ V H P+A
Sbjct: 223 KKAAHPAAKINGYEDVPANNEKALQKAVAHQPIA 256
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 95/253 (37%), Positives = 128/253 (50%), Gaps = 19/253 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK+++ + Y +E YR RVF+ N + K+ + + T + +F
Sbjct: 12 LATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQF 71
Query: 102 SDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
D+T EF G + R P A P + D DWR VT VKDQ CGS
Sbjct: 72 GDMTNEEFNAVMKGYKKGSRGEPKAVFTAEAGP---MAADVDWRTKALVTPVKDQEQCGS 128
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FSATGALEG HFL ELVSLSEQQLVDC + + GC GG M SAF+YI
Sbjct: 129 CWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYG-------NDGCGGGWMTSAFDYI 181
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
GG++ E YPY D SC+FD + I A + + E+ + + GP++
Sbjct: 182 KDNGGIDTESSYPYEAED-RSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPIS--- 237
Query: 281 ASIELPHISFSFL 293
+I+ H SF F
Sbjct: 238 VAIDASHFSFQFY 250
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/227 (37%), Positives = 130/227 (57%), Gaps = 15/227 (6%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEF---RR 111
+ +++K Y +E + RF++FK N+ + GV +F DLT EF R
Sbjct: 42 WMGQYAKIYNDHQEWEKRFQIFKENVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPRN 101
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+F G + + K + T +P++ DWR GAVT VKDQG CG CW+FSA A E
Sbjct: 102 RFKGHMCSSIIRTNTYKYENVTT--VPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATE 159
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G H LSTG+L+SLSEQ+LVDCD + D GC GGLM+ AF++I++ G++ E
Sbjct: 160 GIHQLSTGKLISLSEQELVDCD-------TKGVDQGCEGGLMDDAFKFIIQNHGLDTEAK 212
Query: 232 YPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY G D G+C +++ I AA ++++ + ++ +Q V + P++
Sbjct: 213 YPYQGVD-GTCNANEASINAATITSYEDVPTNNEQALQKAVANQPIS 258
>gi|229366026|gb|ACQ57993.1| Cathepsin H precursor [Anoplopoma fimbria]
Length = 247
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 128/221 (57%), Gaps = 17/221 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
E HF + ++ ++ Y+ QE H+ RF++F N RR + + T G+ +FSD+T SEF
Sbjct: 23 EFHFKSWMAQHNRVYSMQEYHE-RFQIFSENKRRIDKHNEGNHTFTMGLNQFSDMTFSEF 81
Query: 110 RRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHG-AVTGVKDQGACGSCWSFSA 166
R+ FL + A K +ND P DWR G VT VK+QGACGSCW+FS
Sbjct: 82 RKSFLWSEPQ---NCSATKGNYF-SNDGPHPDTIDWRKKGNYVTDVKNQGACGSCWTFST 137
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG LE +STG+LV LSEQQLVDC + + + GCNGGL + AFEYI+ + G+
Sbjct: 138 TGCLESVTAISTGKLVPLSEQQLVDCAQDFN-------NHGCNGGLPSQAFEYIMYSKGL 190
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
EKDYPYT + +C + K K+AAA V + D+M
Sbjct: 191 MTEKDYPYTAFE-DTCAY-KQKLAAAFVREVVNITAYDEMG 229
>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/237 (34%), Positives = 134/237 (56%), Gaps = 20/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
+L A ++F F KF+K+Y+++ E RF++F+ NL + D TA + + KF+DL+
Sbjct: 21 VLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHNDSTAQYEINKFADLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGLS----LPLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + + ++LSEQQL+DCD D+GC+GGL+++AFE +
Sbjct: 137 CWAFATLGSLESQFAIKHNQFINLSEQQLIDCDF---------VDAGCDGGLLHTAFEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
+ GG++ E DYPY + G C+ + +K V + I+ E+++ L GP+
Sbjct: 188 MNMGGIQAESDYPYEANN-GDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPI 243
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 118/209 (56%), Gaps = 16/209 (7%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K+Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQ GSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GGLM+ AF++I+ GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 131/236 (55%), Gaps = 23/236 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
+ L+ ++ + Y E D RFRVF NLR A + + G+ +F+DLT EFR
Sbjct: 52 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 111
Query: 111 RQFLGLNRRLRLPADAQKAPILP--------TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+LG R+PA ++ + +LP DWR+ GAV VK+QG CGSCW
Sbjct: 112 AAYLGA----RIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 167
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA ++E + + TGE+V+LSEQ+LV+C + +SGCNGGLM++AF++I+K
Sbjct: 168 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFIIK 220
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG++ E DYPY D G C ++ ++ F + ++++ V H P++
Sbjct: 221 NGGIDTEGDYPYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVS 275
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/205 (41%), Positives = 119/205 (58%), Gaps = 13/205 (6%)
Query: 74 FRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILP 133
FR ANLR + + + G+T+F+DLT +EF +R + + +
Sbjct: 48 FRCHLANLRVIEAHNAGNSSFTMGITQFADLTAAEFS----AYVKRFPMNVTRPRNEVWI 103
Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
T + DWR AVT +K+QG CGSCWSFS TG++EGAH ++TG+LVSLSEQQL+DC
Sbjct: 104 TEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCS 163
Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
+ GCNGGLM+ AFEY++ GG++ E+DYPYT DG + K AA +
Sbjct: 164 TRYG-------NHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEI 216
Query: 254 SNF-SVISSDEDQMAANLVKHGPLA 277
F +V EDQ+AA V GP++
Sbjct: 217 HGFRNVPKEHEDQLAA-AVSIGPVS 240
>gi|15593252|gb|AAL02222.1|AF410882_1 cysteine protease CP14 precursor [Frankliniella occidentalis]
Length = 333
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 96/261 (36%), Positives = 133/261 (50%), Gaps = 24/261 (9%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
+PSD + + H+ FK+ +KTYA E YR +VFK N +R AK
Sbjct: 18 IPSD--------MEIQAHWESFKATHAKTYANAVEEAYRAKVFKENAIRIAKHNDRFASG 69
Query: 94 AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
V G +++D+ E + G L+ + + DWR GAVT
Sbjct: 70 EVTFKVGYNQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVT 129
Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
+KDQG CGSCWSFSATG+LEG FL LVSLSEQ LVDC + E GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182
Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
GLM+SAFEY+ GG++ E+ YPYT D G+C + + A + + V + E +
Sbjct: 183 GLMDSAFEYVKSNGGIDTEESYPYTAED-GTCLYKAANNAGVNTGYKDVQAKSESALRDA 241
Query: 270 LVKHGPLAGNVASIELPHISF 290
+ K GP++ +I+ + SF
Sbjct: 242 VEKVGPVS---VAIDASNWSF 259
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/252 (34%), Positives = 141/252 (55%), Gaps = 18/252 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
+S+ L +E H L+ S+ + Y + E RF +FK N++ + + + + G+
Sbjct: 28 RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKKNMKFIESVNKAGNLSYKLGMN 86
Query: 100 KFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI--LPTNDLPTDFDWRDHGAVTGVK 153
+F+D+T EF +F GLN P + + I L +D+P++ DWR+ GAVT VK
Sbjct: 87 EFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVK 146
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
QG CG CW+FSA G+LEGA+ ++TG+L+ SEQ+L+DC + + GCNGG M
Sbjct: 147 HQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCT---------TNNYGCNGGFM 197
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
+AF++I++ GG+ RE DY Y G + +C+ + A +S++ V+ E + + K
Sbjct: 198 TNAFDFIIENGGISRESDYEYLG-EQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ 256
Query: 274 GPLAGNVASIEL 285
G AS +L
Sbjct: 257 PVSIGIAASQDL 268
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 93/255 (36%), Positives = 138/255 (54%), Gaps = 20/255 (7%)
Query: 5 ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGE-QSEDHLLNAEHHFSLFKSKFSKT 63
+LS L +L ++ ++A++ + P ++ D +L + + K K
Sbjct: 1 MLSKLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLV---KHGKN 57
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRR 119
Y E + RF +FK NL + + G+ +F+DLT E+R +FLG NRR
Sbjct: 58 YNALGEKEKRFEIFKDNLGFIDEHNSKNLSFRLGLNRFADLTNEEYRTRFLGTRINPNRR 117
Query: 120 LR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
R + + + + LP DWR GAV GVKDQG+CGSCW+FSA A+EG + L+T
Sbjct: 118 NRKVNSQTNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLAT 177
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+I+ + E+DYPY D
Sbjct: 178 GDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAID 229
Query: 239 GGSCKFDKSKIAAAV 253
G + D+++ A V
Sbjct: 230 G---RCDQNRKNAKV 241
>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
Length = 343
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 157/283 (55%), Gaps = 20/283 (7%)
Query: 8 SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
+L++LL+ + L + D+ ++ + + S ++ +A +F F S+++K Y +
Sbjct: 4 TLIILLVVNALLNW----RDNELVDAAGTAANKPSLYNINSAPQYFEQFISQYNKQYKNE 59
Query: 68 EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
E +RF +F N+ ++ + +AV+ + +F+D+T +E + GL L ++
Sbjct: 60 AEKRHRFNIFMHNIEEINQKNSRNDSAVYKINRFADMTKNEVVIRHTGLASIGELNSNFC 119
Query: 128 KAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
+ ++ P+ FDWR + VT VKDQ CG+CW+F++ GALE + + L+ L
Sbjct: 120 ETVVVDGPGQRQRPSSFDWRTYNKVTSVKDQSMCGACWAFASLGALESQYAIKYDRLIDL 179
Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
+EQQLVDCD D GC+GGL+++A+E I++ GGVE+E DYPY + C
Sbjct: 180 AEQQLVDCDF---------VDMGCDGGLIHTAYEQIMQMGGVEQEFDYPYRA-ERQPCAL 229
Query: 245 DKSKIAAAVSN-FSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
K AA V F + +E+++ +L++H GP+A V +++L
Sbjct: 230 KPHKFAAGVRKCFRYVLRNEERL-EDLLRHVGPIAIAVDAVDL 271
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/242 (36%), Positives = 129/242 (53%), Gaps = 21/242 (8%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLG----- 115
K Y E + RF +FK NL + D G+ KF+DLT EFR +LG
Sbjct: 62 KNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGLNKFADLTNEEFRSVYLGRKKSS 121
Query: 116 ----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
L + + + ++LP DWR +GAV VKDQG CGSCW+FS A+E
Sbjct: 122 SSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAVAKVKDQGQCGSCWAFSTIAAVE 181
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G + + TGEL+SLSEQ+LVDCD S +SGC+GGLM+ A+E+I+ GG++ + D
Sbjct: 182 GINQIVTGELLSLSEQELVDCDT--------SYNSGCDGGLMDYAYEFIINNGGIDTDAD 233
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFS 291
YPYT DG ++ K+ + +F + ++++ V H P++ +IE +F
Sbjct: 234 YPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQPVS---VAIEAGGSTFQ 290
Query: 292 FL 293
F
Sbjct: 291 FY 292
>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
Length = 459
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 93/226 (41%), Positives = 134/226 (59%), Gaps = 12/226 (5%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
F F + +++TY T+EE +R +F +N+ RA++ Q LD TA +GVTKFSDLT EFR
Sbjct: 162 FKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 221
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
+L + ++A + + P ++DWR GAVT VKDQG CGSCW+FS TG +E
Sbjct: 222 IYLNPLLKEEPGVKMRRAKSV-GDSAPPEWDWRSKGAVTEVKDQGMCGSCWAFSVTGNVE 280
Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
G FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E D
Sbjct: 281 GQWFLNRGALLSLSEQELLDCD---------KVDKACMGGLPSNAYSAIKTLGGLETEDD 331
Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y Y G +C F K +++ ++ +E ++AA L K GP++
Sbjct: 332 YSYHG-HLQACSFSAEKAKVYINDSVELTKNEQKLAAWLAKKGPIS 376
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 131/236 (55%), Gaps = 23/236 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
+ L+ ++ + Y E D RFRVF NLR A + + G+ +F+DLT EFR
Sbjct: 109 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 168
Query: 111 RQFLGLNRRLRLPADAQKAPILP--------TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+LG R+PA ++ + +LP DWR+ GAV VK+QG CGSCW
Sbjct: 169 AAYLGA----RIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 224
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA ++E + + TGE+V+LSEQ+LV+C + +SGCNGGLM++AF++I+K
Sbjct: 225 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFIIK 277
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG++ E DYPY D G C ++ ++ F + ++++ V H P++
Sbjct: 278 NGGIDTEGDYPYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVS 332
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 91/259 (35%), Positives = 138/259 (53%), Gaps = 23/259 (8%)
Query: 43 EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
++ L+ + H + +K + YA +E + R+ VFK N+ R + + T V +
Sbjct: 29 DNELIMQKRHIE-WMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQ 87
Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQK--API----LPTNDLPTDFDWRDHGAVTGVKD 154
F+DLT EFR + G L + +Q +P + + LP DWR GAVT +K+
Sbjct: 88 FADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKN 147
Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
QG+CG CW+FSA A+EGA + G+L+SLSEQQLVDCD + D GC GGLM+
Sbjct: 148 QGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD---------TNDFGCEGGLMD 198
Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
+AFE+I GG+ E +YPY G D +C K+ A +++ + + +++Q V H
Sbjct: 199 TAFEHIKATGGLTTESNYPYKGED-ATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAH 257
Query: 274 GPLAGNVASIELPHISFSF 292
P++ IE F F
Sbjct: 258 QPVS---VGIEGGGFDFQF 273
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/244 (36%), Positives = 133/244 (54%), Gaps = 16/244 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F + +K+ K YA+ EE +RF VFK NL T G+ F+DLT EF+
Sbjct: 66 FEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTTYWLGLNAFADLTHDEFKAT 125
Query: 113 FLGLNR-RLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+LGL + + D++ + + +D+P DWR GAVT VK+QG CGSCW+FS A+
Sbjct: 126 YLGLRQPETKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSCWAFSTVAAV 185
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG + + TG L SLSEQ+LVDC S ++GCNGG+M++AF YI +GG+ E+
Sbjct: 186 EGINQIVTGNLTSLSEQELVDC--------STDGNNGCNGGVMDNAFSYIASSGGLRTEE 237
Query: 231 DYPYTGTDGGSC--KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
YPY + G C K + +S + + ++++Q + H PL+ +IE
Sbjct: 238 AYPYL-MEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLS---VAIEASGR 293
Query: 289 SFSF 292
F F
Sbjct: 294 HFQF 297
>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
Length = 334
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 91/249 (36%), Positives = 125/249 (50%), Gaps = 17/249 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
N + H+ +K+ + Y EE ++R V++ N + H + F D
Sbjct: 24 NLDAHWHQWKATHRRLYGMNEE-EWRRAVWEKNKKIIDLHNQEYSEGKHAFRMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ G + P+L D+P DW G VT VK+QG CGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKHKKGKLFHEPLLV--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF+YI
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNQGCNGGLMDNAFQYIKDN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG++ E+ YPY TD SC + AA + F I E + + GP++ +I
Sbjct: 194 GGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250
Query: 284 ELPHISFSF 292
+ H SF F
Sbjct: 251 DAGHTSFQF 259
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 131/234 (55%), Gaps = 23/234 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVH--GVTKFSDLTPSEF 109
F + K K Y +E + +F+ F+ NLR ++ + H G+ KF+D++ EF
Sbjct: 51 FKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNEEF 110
Query: 110 RRQFLG-----LNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGS 160
R ++ ++R+ + Q D PT DWR +G VTGVKDQG CGS
Sbjct: 111 REVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGS 170
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FS+TGA+EG + L+ G+L+SLSEQ+LVDCD S + GC GG M+ AFE++
Sbjct: 171 CWAFSSTGAIEGINALANGDLISLSEQELVDCD---------STNDGCEGGYMDYAFEWV 221
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKH 273
+ GG++ E DYPYTG D G+C K + A ++ + ++ +E + ++K
Sbjct: 222 MSNGGIDTETDYPYTGED-GTCNTTKEETKAVSIDGYEDVAEEESALFCAVLKQ 274
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 94/252 (37%), Positives = 136/252 (53%), Gaps = 19/252 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKF 101
++ + ++ +K++ K Y + EE R +++ NL + + L T G+ +F
Sbjct: 21 FIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGMNQF 80
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACG 159
+DL EF G R A + P+N D+PT DWR G VT VK+Q CG
Sbjct: 81 ADLKNEEFVSLMNGF-RGNSSKATRGSTFLPPSNVFDMPTMVDWRTKGYVTPVKNQLQCG 139
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSATG+LEG HF TG+LVSLSEQ LVDC + + GC GGLM+ AF+Y
Sbjct: 140 SCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGK-------EGNMGCEGGLMDQAFQY 192
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAG 278
IL GG++ E YPYT D G C F+K+ I A + ++ V + E + + GP++
Sbjct: 193 ILDVGGIDTEMSYPYTAMD-GQCHFNKANIGATDTGYTDVTTGSESALQMAVASVGPIS- 250
Query: 279 NVASIELPHISF 290
+I+ H SF
Sbjct: 251 --VAIDASHQSF 260
>gi|261328616|emb|CBH11594.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
gambiense DAL972]
Length = 220
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 93/244 (38%), Positives = 124/244 (50%), Gaps = 41/244 (16%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VKDQG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + D GC GGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYT 235
YPY
Sbjct: 214 YPYV 217
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/248 (37%), Positives = 131/248 (52%), Gaps = 20/248 (8%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
H+ L+K+ SK Y +EE +R +++ NL + + L H G+ F D+T
Sbjct: 27 HWELWKNWHSKKYHEKEE-GWRRMIWEKNLNKIELHNLEHSMGKHSYRLGMNHFGDMTHE 85
Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
EFR+ G R+ A + + N + P+ DWR+ G VT VKDQG CGSCW+FS
Sbjct: 86 EFRQIMNGYQRKTERKAIG--SLFMEPNFMVAPSAVDWREKGYVTPVKDQGQCGSCWAFS 143
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
TGALZG +F G+LVSLSEQ LVDC PE + GC GGLM+ AF+Y+ G
Sbjct: 144 TTGALZGQNFRKMGKLVSLSEQNLVDCSR---PE----GNEGCGGGLMDQAFQYVKDNQG 196
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY GTD C +D + + F + S E + + GP++ +I+
Sbjct: 197 LDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDIPSGKEHALMKAVASVGPVS---VAID 253
Query: 285 LPHISFSF 292
H SF F
Sbjct: 254 AGHESFQF 261
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/233 (36%), Positives = 128/233 (54%), Gaps = 26/233 (11%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEF---R 110
+ S++ K Y +E + RF++F N+ + D ++ GV +F+DLT EF R
Sbjct: 41 WMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADLTNDEFTSSR 100
Query: 111 RQFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
+F G + R + A +P+ DWR GAVT VK+QG CG CW+FS
Sbjct: 101 NKFKGHMCSSITRTSTFKYENASA-------IPSSVDWRKKGAVTPVKNQGQCGCCWAFS 153
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
A A EG H LSTG+L+SLSEQ+LVDCD + D GC GGLM+ AF++I++ G
Sbjct: 154 AVAATEGIHKLSTGKLISLSEQELVDCD-------TKGVDQGCEGGLMDDAFKFIIQNHG 206
Query: 226 VEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
+ E +YPY G D G+C +K I A ++ + + ++ +Q V + P++
Sbjct: 207 LNTEANYPYQGVD-GTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPIS 258
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/250 (36%), Positives = 129/250 (51%), Gaps = 19/250 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+ H+ L+K+ K+Y EE +R V++ NL+ + L +H G+ +F DLT
Sbjct: 26 DRHWKLWKNWHQKSYHEAEE-GWRRTVWEENLKAIQLHNLEQSLGLHTYRLGMNQFGDLT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF+ G R + L N +PT DWRDHG VT VK+QG CGSCW+
Sbjct: 85 NEEFQEILTG-ERHFSKGNRINGSAFLEANFVQVPTSVDWRDHGYVTPVKNQGHCGSCWA 143
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TGALEG F +G L+SLSEQ LVDC + + GC+GG+++ AF+YIL+
Sbjct: 144 FSTTGALEGQLFRKSGRLISLSEQNLVDCSWQ-------QGNQGCHGGIVDLAFQYILQN 196
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
G++ E YPYT D C F A V+ F I E+ + + GP++
Sbjct: 197 QGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPPHSEEALMKAVATVGPVS---VG 253
Query: 283 IELPHISFSF 292
I+ SF F
Sbjct: 254 IDASSTSFRF 263
>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
Length = 342
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 95/247 (38%), Positives = 131/247 (53%), Gaps = 22/247 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
++ +K + K+Y +E+ R +++ NLR + H G+ + SDLTPSE
Sbjct: 40 WTEYKETYGKSYDMKED-VVRRSLWEGNLRHISMHNVKHDLGKHSFSMGINELSDLTPSE 98
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+R Q LGL R L K + +P DWRD G VT VK+QGACGSCW+FS+TG
Sbjct: 99 YR-QRLGL--RPALGERTGKKFVYNGEKVPEHVDWRDKGYVTPVKNQGACGSCWAFSSTG 155
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+LEG HF TG+LVSLSEQ LVDC + ++GCNGG M++AF Y+ G++
Sbjct: 156 SLEGQHFRLTGQLVSLSEQNLVDCTKKYG-------NAGCNGGWMDNAFNYVKANNGIDT 208
Query: 229 EKDYPYTGTDGGSCKFDKS---KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
E YPY G D C +D S K A + V DE + + GP++ I+
Sbjct: 209 EAFYPYEGHD-DWCGYDGSPGHKGANCTGHVDVQQGDELALKQAVATVGPVS---VGIDA 264
Query: 286 PHISFSF 292
H SF
Sbjct: 265 THRSFQL 271
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 128 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 187
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 188 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 247
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 248 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 298
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 299 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 352
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262
>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
Length = 333
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/254 (36%), Positives = 134/254 (52%), Gaps = 20/254 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
DH L A+ ++ +K+ ++ Y EE +R V++ N++ ++ H T
Sbjct: 22 DHSLEAQ--WTKWKAMHNRLYGMNEE-GWRRAVWEKNMKMIEQHNQEYREGKHSFTMAMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G R + P+ + P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSATGALEG F TG+LVSLSEQ LVDC P+ + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS---GPQ----GNEGCNGGLMDYAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
+ GG++ E+ YPY T+ SCK++ A + F I E + + GP++
Sbjct: 190 VQDNGGLDSEESYPYEATE-ESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSFL 293
+++ H SF F
Sbjct: 247 -VAVDAGHQSFQFY 259
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/245 (37%), Positives = 125/245 (51%), Gaps = 17/245 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
+ +K SK Y T+EE D R ++++ NL++ + +H G+ K++DL E
Sbjct: 28 WEAWKQTHSKQY-TKEEEDNRRKIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYADLRGEE 86
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
F + GL K P DWRD G VT VKDQG CGSCW+FS TG
Sbjct: 87 FVQMMNGLKFDASRERQGIKFLSYAKFQAPDSVDWRDEGYVTPVKDQGQCGSCWAFSTTG 146
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+LEG HF STG L SLSEQ LVDC ++GC GGLM+ AF+YI G++
Sbjct: 147 SLEGQHFRSTGVLTSLSEQNLVDC-------SISYGNNGCEGGLMDYAFQYIKDNLGIDT 199
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPH 287
E YPY D +C+F + A S + V S DED + +GP++ +I+ H
Sbjct: 200 EDKYPYEAED-DTCRFSPDNVGATDSGYVDVDSGDEDALKEACAANGPIS---VAIDASH 255
Query: 288 ISFSF 292
SF
Sbjct: 256 ESFQL 260
>gi|381283083|gb|AFG19440.1| cathepsin L, partial [Larimichthys crocea]
Length = 257
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/225 (40%), Positives = 122/225 (54%), Gaps = 19/225 (8%)
Query: 76 VFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI 131
V++ NLR+ + L H G+ F D+T EFR+ G R+ + +
Sbjct: 2 VWEMNLRKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYKRKAE--GKFKGSLF 59
Query: 132 LPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
+ N L P DWRD+G VT VKDQG CGSCW+FS TGALEG HF TG+LVSLSEQ L
Sbjct: 60 MEPNFLEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNL 119
Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
VDC PE + GCNGGLM+ AF+Y+ G++ E YPY GTD C +D +
Sbjct: 120 VDCSR---PE----GNEGCNGGLMDQAFQYVKDNHGLDSEDSYPYLGTDDQPCHYDPNYN 172
Query: 250 AAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFL 293
+A + F V S E + + GP++ +I+ H SF F
Sbjct: 173 SANDTGFVDVPSGKEHALMKAVAAVGPVS---VAIDAGHESFQFY 214
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/228 (37%), Positives = 128/228 (56%), Gaps = 15/228 (6%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
+ ++ + Y +E + R+ +FK N+ R + D GV KF+DLT EFR +
Sbjct: 43 WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYH 102
Query: 115 GLNRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
G R+ +L + + + L +D+PT DWR+ GAVT VKDQG CG CW+FS A+EG
Sbjct: 103 GYKRQSSKLMSSSFRYENL--SDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGI 160
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
L TG L+SLSEQQLVDC + + GC GGLM++AF+YI++ GG+ E +YP
Sbjct: 161 IKLQTGNLISLSEQQLVDCT---------AGNKGCQGGLMDTAFQYIIRNGGLTSEDNYP 211
Query: 234 YTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
Y G D G+C +K + A ++ + + + + V P++ V
Sbjct: 212 YQGVD-GTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVGV 258
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/231 (37%), Positives = 129/231 (55%), Gaps = 28/231 (12%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRR 111
F + + K Y + EE R ++F+ NL+ ++ G+ KF+DLT EF+
Sbjct: 43 FDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFKT 102
Query: 112 QFLGLN-------RRLRLPADAQKAPILPTN--------DLPTDFDWRDHGAVTGVKDQG 156
++ G N RR L A+ P+L + + DWR GAVTGVKDQ
Sbjct: 103 RYFGKNSKQWRDRRRTELEG-AELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQA 161
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FS TGA+EG +F+STG+LVSLSEQ+LV CD + + GC GG M+ A
Sbjct: 162 QCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACD---------ATNYGCEGGDMDYA 212
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQM 266
F ++++ GG++ EKDY YTG D +C +K +K ++ ++ +S D+ +
Sbjct: 213 FTWVIQNGGIDTEKDYSYTGVD-STCNTNKEAKKIVSIDGYTDVSPDDSAL 262
>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
Length = 323
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/237 (36%), Positives = 133/237 (56%), Gaps = 21/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F +F+K Y ++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 21 LLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+
Sbjct: 80 KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGA 135
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQ++DCD D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
+K GGV+ E DYPY D +C+ + +K V + + I+ E+++ L GP+
Sbjct: 187 IKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPI 242
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/248 (36%), Positives = 135/248 (54%), Gaps = 23/248 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDLTPSE 108
+ +FK+ KTY Q E +R ++F N ++ + + + + + + F DL E
Sbjct: 27 WHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHE 86
Query: 109 FRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
F+ L ++ D ++ L P+N +LP DWR GAVT VKDQG CGSCWSFS
Sbjct: 87 FK----ALMNGFKMSPDTKRNGELYFPSNSNLPKTVDWRQKGAVTPVKDQGQCGSCWSFS 142
Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
ATG+LEG FL TG+LVSLSEQ LVDC + ++GC GGLM+ AF+Y+ G
Sbjct: 143 ATGSLEGQVFLKTGKLVSLSEQNLVDC-------STSYGNNGCEGGLMDQAFQYVSDNKG 195
Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
++ E YPY + +C+F K+K+ + + + DE + L GP++ +I+
Sbjct: 196 IDTEASYPYEARE-NTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPIS---VAID 251
Query: 285 LPHISFSF 292
H SF F
Sbjct: 252 ANHGSFQF 259
>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
Length = 338
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/266 (34%), Positives = 135/266 (50%), Gaps = 20/266 (7%)
Query: 16 SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
S++ V + D +R++ G++ L A F F ++K Y E+ + RF+
Sbjct: 8 SMVHVLVLFSIDQCKVREL----GQRRLYSLEEAPTLFEQFIKDYNKEYDESEKEE-RFK 62
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
+F NL+ AV+G+ KFSDL+ EF + + GL R + K LP +
Sbjct: 63 IFVNNLKDINAMNERSSNAVYGINKFSDLSKEEFIKYYTGLKREESPSNEDHKKTDLPES 122
Query: 136 ---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDC 192
P FDWR G V+ +K+Q CGSCW+FSA +E H + TG+L+ +SEQQL+DC
Sbjct: 123 FNVTAPDQFDWRKKGVVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDC 182
Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
D DSGC+GGL A Y + A G K YPY + G C++D SK+
Sbjct: 183 D---------KYDSGCSGGLPWDALRYFV-ANGAMSLKSYPYVAKE-GKCRYDSSKVEIR 231
Query: 253 VSNFSVISS-DEDQMAANLVKHGPLA 277
+ + + S EDQ+ +L GPL+
Sbjct: 232 LKGYKIFSKISEDQIKEHLYNIGPLS 257
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 89/256 (34%), Positives = 138/256 (53%), Gaps = 18/256 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
+S+ L +E H L+ S+ + Y + E RF +FK N++ + + + + G+
Sbjct: 28 RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86
Query: 100 KFSDLTPSEFRRQFLGLN-RRLRLPADAQKAPILPTNDL-----PTDFDWRDHGAVTGVK 153
+F+D+T EF +F GLN L + TNDL P++ DWR+ GAVT VK
Sbjct: 87 EFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKTNDLSDDDMPSNLDWRESGAVTQVK 146
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
QG CG CW+FSA G+LEGA+ ++TG L+ SEQ+L+DC + + GCNGG M
Sbjct: 147 HQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFM 197
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
+AF++I++ GG+ RE DY Y G +C+ + A +S++ V+ E + + K
Sbjct: 198 TNAFDFIIENGGISRESDYEYLGQQ-YTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ 256
Query: 274 GPLAGNVASIELPHIS 289
G AS +L S
Sbjct: 257 PVSIGIAASQDLQFYS 272
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 87/252 (34%), Positives = 140/252 (55%), Gaps = 18/252 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
+S+ L +E H L+ S+ + Y + E RF +FK N++ + + + + G+
Sbjct: 28 RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86
Query: 100 KFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI--LPTNDLPTDFDWRDHGAVTGVK 153
+F+D+T EF +F GLN P + + I L +D+P++ DWR+ GAVT VK
Sbjct: 87 EFADITSQEFLAKFTGLNIPNSYLSPSPVSSTEFKINDLSDDDMPSNLDWRESGAVTQVK 146
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
QG CG CW+FSA G+LEGA+ ++TG L+ SEQ+L+DC + + GCNGG M
Sbjct: 147 HQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFM 197
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
+AF++I++ GG+ RE DY Y G + +C+ + A +S++ V+ E + + K
Sbjct: 198 TNAFDFIIENGGISRESDYEYLG-EQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ 256
Query: 274 GPLAGNVASIEL 285
G AS +L
Sbjct: 257 PVSIGIAASQDL 268
>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 92/244 (37%), Positives = 133/244 (54%), Gaps = 18/244 (7%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSE 108
+H F+ F +K+ K+Y T+EE+D+R ++FK NL + + D T G+ KF+D T +E
Sbjct: 40 DHAFTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRLGLNKFADYTEAE 99
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
++R LG + K P ND +W + GAVT VKDQG CGSCWSFSATG
Sbjct: 100 YKR-LLGFGGQKNKNPRNIKVLGAPKND---GVNWVEQGAVTPVKDQGQCGSCWSFSATG 155
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
A+EG + G L SLSEQQLVDC + GC GG M+ AF+Y+ + +E
Sbjct: 156 AMEGHAKIQFGTLYSLSEQQLVDCSQ-------AEGNEGCGGGWMDQAFQYVEQT-ALET 207
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
E YPY D +C+ + + S V ++ +++ A L K GP++ +IE +
Sbjct: 208 EDQYPYEAVD-DTCRASSAGVVKVDSFVDVTPNNVNELKAALDK-GPVS---VAIEADQM 262
Query: 289 SFSF 292
F F
Sbjct: 263 VFQF 266
>gi|354466410|ref|XP_003495667.1| PREDICTED: pro-cathepsin H-like [Cricetulus griseus]
Length = 333
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 130/229 (56%), Gaps = 15/229 (6%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
HF + ++ KTY++ E ++YR + F N R+ + T G+ +FSD+T +E +R
Sbjct: 32 HFKSWMTQHQKTYSSVE-YNYRLKTFANNWRKIHAHNQRNHTFKMGLNQFSDMTFAEIKR 90
Query: 112 QFLGLNRRLRLPADAQKAPIL-PTNDLPTDFDWRDHGA-VTGVKDQGACGSCWSFSATGA 169
++L + A K L T LP DWR G V+ VK+QG+CGSCW+FS TGA
Sbjct: 91 KYLWSEPQ---NCSATKGNYLRGTGPLPPSMDWRKKGNFVSAVKNQGSCGSCWTFSTTGA 147
Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
LE A +++G+++SL+EQQLVDC + + GC GGL + AFEYIL G+ E
Sbjct: 148 LESAVAIASGKMLSLAEQQLVDCAQNFN-------NHGCEGGLPSQAFEYILYNKGIMGE 200
Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLA 277
YPY G D G CKFD K A V + + I+ +DE M + + P++
Sbjct: 201 DTYPYRGKD-GHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNPVS 248
>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
Length = 323
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 86/237 (36%), Positives = 133/237 (56%), Gaps = 21/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F +F+K Y ++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 21 LLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKDQND-SAKYEINKFSDLS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+
Sbjct: 80 KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGA 135
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQ++DCD D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
+K GGV+ E DYPY D +C+ + +K V + + I+ E+++ L GP+
Sbjct: 187 IKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPI 242
>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
Length = 332
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 136/227 (59%), Gaps = 14/227 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
F F + +++TY ++EE +R VF N+ RA++ Q LD TA +G+TKFSDLT EF
Sbjct: 35 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 94
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
+L N L+ + + +P NDL P ++DWR GAVT VK+QG CGSCW+FS TG +
Sbjct: 95 IYL--NPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 152
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+ I GG+E E
Sbjct: 153 EGQWFLNRGTLLSLSEQELLDCD---------KVDKACLGGLPSNAYAAIKNLGGLETED 203
Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
DY Y G +C F +++ +S +E+++AA L + GP++
Sbjct: 204 DYGYQG-HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPIS 249
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 84/225 (37%), Positives = 127/225 (56%), Gaps = 15/225 (6%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
+ ++ + Y +E + R+ +FK N+ R + D GV KF+DLT EFR +
Sbjct: 8 WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYH 67
Query: 115 GLNRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
G R+ +L + + + L +D+PT DWR+ GAVT VKDQG CG CW+FS A+EG
Sbjct: 68 GYKRQSSKLMSSSFRYENL--SDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGI 125
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
L TG L+SLSEQQLVDC + + GC GGLM++AF+YI++ GG+ E +YP
Sbjct: 126 IKLQTGNLISLSEQQLVDCT---------AGNKGCQGGLMDTAFQYIIRNGGLTSEDNYP 176
Query: 234 YTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
Y G D G+C +K + A ++ + + + + V P++
Sbjct: 177 YQGVD-GTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVS 220
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 99/294 (33%), Positives = 149/294 (50%), Gaps = 25/294 (8%)
Query: 4 LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
+ LS LL L + + D +++ P D S D L+ F + S K
Sbjct: 1 MALSKLLPLAMCMSFFVVTSFGKDFSIV-GYWPED-LTSMDRLIEL---FEEWISNHGKI 55
Query: 64 YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRR 119
Y T EE +RF VFK NL+ + GV +F+DLT EF+ +LGL +R
Sbjct: 56 YETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRT 115
Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
+ P + ++ DLP DWR GAVT VK+QG+CGSCW+FS A+EG + + G
Sbjct: 116 RQSPEEFTYKDVV---DLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGG 172
Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
L SLSEQ+L+DCD ++GC+GGLM+ AF +I+ +GG+ +E+DYPY +
Sbjct: 173 NLTSLSEQELIDCDR--------PYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVE- 223
Query: 240 GSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+C K ++ +S + + + + + H PL+ +IE F F
Sbjct: 224 STCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLS---VAIEASGRDFQF 274
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 131/236 (55%), Gaps = 23/236 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
+ L+ ++ + Y E D RFRVF NLR A + + G+ +F+DLT EFR
Sbjct: 49 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 108
Query: 111 RQFLGLNRRLRLPADAQKAPILP--------TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
+LG R+PA ++ + +LP DWR+ GAV VK+QG CGSCW
Sbjct: 109 AAYLGA----RIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 164
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA ++E + + TGE+V+LSEQ+LV+C + +SGCNGGLM++AF++I+K
Sbjct: 165 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFIIK 217
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG++ E DYPY D G C ++ ++ F + ++++ V H P++
Sbjct: 218 NGGIDTEGDYPYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVS 272
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 92/235 (39%), Positives = 123/235 (52%), Gaps = 19/235 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK----IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 263
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/281 (33%), Positives = 151/281 (53%), Gaps = 30/281 (10%)
Query: 6 LSSLLLLLLS-SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTY 64
++S+L + +S ++L+ ++ V+ + + P + AEHH + ++FS+ Y
Sbjct: 1 MTSILFMFVSLTILSMSLKVSQATSRVTFHEP----------IVAEHH-QQWMTRFSRVY 49
Query: 65 ATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
+ + E RF VFK NL+ ++ + D T GV +F+D T EF GL +P
Sbjct: 50 SDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIP 109
Query: 124 ADAQKAPILPTNDL-------PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
+ ++P+ + P DWR GAVT VK QG CG CW+FS+ A+EG +
Sbjct: 110 SSEFVDEMIPSWNWNVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKI 169
Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G LVSLSEQQL+DCD E D+GCNGG+M+ AF YI+K G+ E YPY
Sbjct: 170 VGGNLVSLSEQQLLDCDRE--------RDNGCNGGIMSDAFSYIIKNRGIASEASYPYQE 221
Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
T+ G+C+++ +K +A + F + S+ ++ V P++
Sbjct: 222 TE-GTCRYN-AKPSAWIRGFQTVPSNNERALLEAVSRQPVS 260
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 87/252 (34%), Positives = 140/252 (55%), Gaps = 18/252 (7%)
Query: 41 QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
+S+ L +E H L+ S+ + Y + E RF +FK N++ + + + + G+
Sbjct: 28 RSQPELSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86
Query: 100 KFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI--LPTNDLPTDFDWRDHGAVTGVK 153
+F+D+T EF +F GLN P + + I L +D+P++ DWR+ GAVT VK
Sbjct: 87 EFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVK 146
Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
QG CG CW+FSA G+LEGA+ ++TG L+ SEQ+L+DC + + GCNGG M
Sbjct: 147 HQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFM 197
Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
+AF++I++ GG+ RE DY Y G + +C+ + A +S++ V+ E + + K
Sbjct: 198 TNAFDFIIENGGISRESDYEYQG-EQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ 256
Query: 274 GPLAGNVASIEL 285
G AS +L
Sbjct: 257 PVSIGIAASQDL 268
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 93/244 (38%), Positives = 130/244 (53%), Gaps = 18/244 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSE 108
+S +K+ K Y EE +R V++ NL+ K+ H T F DLT E
Sbjct: 29 WSQWKAAHGKLYDENEE-GWRRAVWEKNLKVIKQHNQEYSQGKHSFTMAMNAFGDLTNEE 87
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
F++ GL + R + +AP P + P+ DWR G VT VK+QG CGSCW+FSATG
Sbjct: 88 FKQVMNGLKSQKRKEGNVFQAP--PFAETPSSVDWRKKGYVTPVKNQGPCGSCWAFSATG 145
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
ALEG F T LVSLSEQ LVDC + GC+GGLM+ AF+Y+ GG++
Sbjct: 146 ALEGQMFRKTKRLVSLSEQNLVDCSQ-------AEGNEGCSGGLMDYAFQYVKDNGGLDS 198
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
E+ YPY D SCK+ + AA + F I +E+ + + GP++ A+I+
Sbjct: 199 EESYPYRAQD-ESCKYKPEQSAANDTGFMDIHPEEESLKLAVATVGPIS---AAIDASLS 254
Query: 289 SFSF 292
+F F
Sbjct: 255 TFQF 258
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 93/250 (37%), Positives = 133/250 (53%), Gaps = 22/250 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ LFK++FSK Y T+ E +R +VF N + A+ +L V + F DL E
Sbjct: 31 WELFKTQFSKAYNTEIEEKFRMKVFMDNKHKIARHNKLFQNGEVSYELEMNHFGDLLHHE 90
Query: 109 FRRQFLGLNRRLRLPA--DAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSF 164
F + G LR + +P ++ P DWR GAVT VK+QG CGSCW+F
Sbjct: 91 FVKTVNGYRHSLRRVTGDEIDSVTFIPAYNVTVPDSVDWRTEGAVTEVKNQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKA 223
S TG+LEG HF +T +L SLSEQ L+DC SG ++GC+GGLM++AF YI
Sbjct: 151 STTGSLEGQHFRNTKQLTSLSEQNLIDC--------SGKYGNNGCSGGLMDNAFAYIKSN 202
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
G++ E+ YPY G D C++ + A F I DE+++ + GP++ +
Sbjct: 203 KGIDTEQSYPYEGID-DKCRYKPQESGATDKGFVDIPQGDEEKLKLAVATVGPIS---VA 258
Query: 283 IELPHISFSF 292
I+ H SF F
Sbjct: 259 IDASHQSFQF 268
>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
Length = 443
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/234 (38%), Positives = 124/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VK+QGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D+GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCD---------DMDNGCSGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 209 YTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.391
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,631,252,965
Number of Sequences: 23463169
Number of extensions: 193130953
Number of successful extensions: 473898
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4839
Number of HSP's successfully gapped in prelim test: 1605
Number of HSP's that attempted gapping in prelim test: 457566
Number of HSP's gapped (non-prelim): 6881
length of query: 300
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 159
effective length of database: 9,050,888,538
effective search space: 1439091277542
effective search space used: 1439091277542
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)