BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy5095
(192 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|340503366|gb|EGR29962.1| hypothetical protein IMG5_145110 [Ichthyophthirius multifiliis]
Length = 1095
Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats.
Identities = 71/183 (38%), Positives = 110/183 (60%), Gaps = 11/183 (6%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA-GLEAEADY-PFRN 58
++ESQYAIKH L+P S+ QL++C+ N GC GG A +YL+ + GLE DY ++N
Sbjct: 915 VIESQYAIKHQKLVPFSEQQLVDCDDINDGCHGGLMTDAYKYLQQSGGLEFAEDYGDYKN 974
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+ +C +D KV+ ++ ++ + + ++ LY GP+ AG+N LLQ Y +
Sbjct: 975 KK---EKCKFDLNKVQAKIKEWQQIDEDEEIIKKQLYQNGPIAAGVNARLLQFYKSGIFD 1031
Query: 118 KNDVCPSENLNHAVVIVGYGM-RHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
+ C S+ +NHA++IVGYG+ + WI++N WG+ WG DGYF + RG CGI +Y
Sbjct: 1032 PKE-CDSD-INHAILIVGYGVEKDGQKYWIIKNQWGKDWGM-DGYFKLARGKKQCGIHTY 1088
Query: 176 GGI 178
I
Sbjct: 1089 ASI 1091
>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 344
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/183 (37%), Positives = 106/183 (57%), Gaps = 10/183 (5%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADY--PFRN 58
++ESQYA+K+G LL S+ L++C+ NQGC+GG A Q+L+ +G AD ++N
Sbjct: 163 VIESQYALKYGELLHFSEQMLLDCDNINQGCRGGLMTDAYQFLQQSGGIQTADTYGDYKN 222
Query: 59 QNGVTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+ + C +D KVK +V D + + +T RR L GP+ G+N LQ Y G ++
Sbjct: 223 KKDI---CNFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVAVGINARTLQFYEGGIVD 279
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYG 176
+ + +NHAV+IVGYG+ +P W+++N WG WG G+F + RG CGI +Y
Sbjct: 280 PKNC--DDKINHAVLIVGYGVEEGIPYWLIKNQWGAEWGI-KGFFKLIRGKKQCGIHTYA 336
Query: 177 GIC 179
I
Sbjct: 337 SIA 339
>gi|114679921|ref|YP_758371.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
gi|39598652|gb|AAR28838.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
Length = 359
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 68/175 (38%), Positives = 105/175 (60%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQYAI+H LL LS+ QL++C+ +QGC GG + A Q L+ GLE+E YP++
Sbjct: 181 IESQYAIRHDRLLDLSEQQLVDCDQIDQGCSGGLMHLAFQEILQMGGLESELVYPYQ--- 237
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
GV C + RK V++SD ++ D R ++Y GP+ ++ + DY ++
Sbjct: 238 GVDYACRLNPRKFDVKLSDCHRYDLRDERKLRELVYTVGPIAVAIDCIDIIDYKSGIV-- 295
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+C + LNHAV++VG+G+ P WI++NSWG WG + GYF ++R N CG+
Sbjct: 296 -SMCNNNGLNHAVLLVGFGIEFDTPYWILKNSWGNDWG-EKGYFRLKRNINGCGM 348
>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
Length = 322
Score = 123 bits (308), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 66/172 (38%), Positives = 101/172 (58%), Gaps = 8/172 (4%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
E Y KH L+ LS+ QL++C+ N GC GG + Y++ GL+ E+ YP+ G
Sbjct: 145 EGAYYRKHKQLVSLSEQQLVDCSTSINYGCNGGFLDATFPYIEQYGLQTESSYPY---TG 201
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C YD+ KV ++S+++ +GS++ + GP+ M+ + L Y+ + N
Sbjct: 202 VDGSCKYDSSKVVTKISNYVSLHGSESKVLEPVGSIGPVAITMDASYLSSYSSGIYAANK 261
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
C + NLNHAV++VGYG ++ WIV+NSWG WG + GYF + RG+N CG
Sbjct: 262 -CTTTNLNHAVLVVGYGSQNGQNYWIVKNSWGSGWG-EQGYFRLLRGSNECG 311
>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
Length = 337
Score = 123 bits (308), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 99/175 (56%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFRNQN 60
LE+ YAIKH L+ LS+ QLI+C+ N C GG + A + L +AG L E DYP++
Sbjct: 159 LETLYAIKHNYLINLSEQQLIDCDSANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQ--- 215
Query: 61 GVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G G C D +K + VS +F + ++ L GP+ ++ A + Y+ +I
Sbjct: 216 GTKGVCKIDNKKFALSVSSCKRYIFQNEENLKKELITMGPIAMAIDAASISTYSKGIIH- 274
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG V W ++NSWG WG +DGYF V+R NACG+
Sbjct: 275 --FCENLGLNHAVLLVGYGTEGGVSYWTLKNSWGSDWG-EDGYFRVKRNINACGL 326
>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
Length = 337
Score = 122 bits (307), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 99/175 (56%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFRNQN 60
LE+ YAIKH L+ LS+ QLI+C+ N C GG + A + L +AG L E DYP++
Sbjct: 159 LETLYAIKHNYLINLSEQQLIDCDSANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQ--- 215
Query: 61 GVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G G C D +K + VS +F + ++ L GP+ ++ A + Y+ +I
Sbjct: 216 GTKGICKIDNKKFALSVSSCKRYIFQNEENLKKELITTGPIAMAIDAASISTYSKGIIH- 274
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG V W ++NSWG WG +DGYF V+R NACG+
Sbjct: 275 --FCENLGLNHAVLLVGYGTEGGVSYWTLKNSWGSDWG-EDGYFRVKRNINACGL 326
>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
Length = 365
Score = 122 bits (306), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 64/175 (36%), Positives = 105/175 (60%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQYAI+H L+ LS+ QL++C+ + GC GG + A Q L G+E EADYP++
Sbjct: 187 IESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELLLMGGVETEADYPYQ--- 243
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G C D RK+ V+++ ++ D + ++Y GP+ ++ + +Y ++ +
Sbjct: 244 GSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMDIINYRRGILNQ 303
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C +LNHAV+++G+G+ + VP WI++NSWG WG ++GY V R NACG+
Sbjct: 304 ---CHIYDLNHAVLLIGWGIENNVPYWIIKNSWGEDWG-ENGYLRVRRNVNACGL 354
>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
Length = 343
Score = 122 bits (305), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 66/174 (37%), Positives = 97/174 (55%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
LESQYAIK+ L+ L++ QL++C+ + GC GG + A Q ++ G+E E DYP+R +
Sbjct: 165 LESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMQMGGVEQEFDYPYRAER 224
Query: 61 GVTGRCAYDARKVK--VRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
CA K VR V + +L H GP+ ++ L DY G ++
Sbjct: 225 Q---PCALKPHKFAAGVRKCFRYVLRNEERLEDLLRHVGPIAIAVDAVDLTDYYGGIV-- 279
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP W ++NSWG +DGY V RG N+CG+
Sbjct: 280 -SFCENNGLNHAVLLVGYGVENNVPFWTLKNSWGSDYGEDGYVRVRRGVNSCGL 332
>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
Length = 276
Score = 122 bits (305), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 109/186 (58%), Gaps = 11/186 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E QYAIK G L+ LS+ +L++C+ ++GC+GG N Q K GLE+E+DYP++
Sbjct: 96 IEGQYAIKTGKLVSLSEQELVDCDTIDKGCEGGLPSNAYKQIEKLGGLESESDYPYK--- 152
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G +C ++ +VKV ++ +V + + L GP+ G+N +Q Y G +
Sbjct: 153 GADSKCKFNKAEVKVTINSSVVISKDEKEIAAWLAKNGPISIGINANAMQFYMGGIAHPW 212
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYGG 177
+ C +LNH V+IVGYG+++ P WI++NSWG WG + GY+ + RG CG+ +
Sbjct: 213 KIFCNPSSLNHGVLIVGYGVKNGTPYWIIKNSWGPSWG-EKGYYLIYRGGGCCGLNT--- 268
Query: 178 ICTRTL 183
+CT +
Sbjct: 269 MCTSAV 274
>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
Australia]
Length = 367
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/175 (36%), Positives = 105/175 (60%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQYAI+H L+ LS+ QL++C+ + GC GG + A Q L G+E EADYP++
Sbjct: 189 IESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELLLMGGVETEADYPYQ--- 245
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G C D RK+ V+++ ++ D + ++Y GP+ ++ + +Y ++ +
Sbjct: 246 GSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMDIINYRRGILNQ 305
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C +LNHAV+++G+G+ + VP WI++NSWG WG ++GY V R NACG+
Sbjct: 306 ---CHIYDLNHAVLLIGWGIENNVPYWIIKNSWGEDWG-ENGYLRVRRNVNACGL 356
>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
Length = 337
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/174 (37%), Positives = 98/174 (56%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LESQYAIK+ L+ L++ QL++C+ + GC GG + A + + H G+E E DYP+R +
Sbjct: 159 LESQYAIKYDRLIDLAEQQLVDCDSVDMGCDGGLIHTAYEQIMHMGGVEQEFDYPYRAER 218
Query: 61 GVTGRCAYDARKVK--VRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
CA K VR V + +L + GP+ ++ L DY G ++
Sbjct: 219 Q---PCALKPHKFAAGVRSCYRYVLLNEERLEDLLRYVGPIAIAVDAVDLTDYYGGIV-- 273
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP WI++NSWG +DGY V RG N+CG+
Sbjct: 274 -SFCENNGLNHAVLLVGYGVENNVPFWIIKNSWGSDYGEDGYVRVRRGVNSCGM 326
>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
Length = 344
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 101/174 (58%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQYAIK+ + LS+ QL++C+ + GC GG + A + + GLE E DYP+R+
Sbjct: 166 LESQYAIKYNEHVDLSEQQLVDCDTIDMGCAGGLLHTAYEEIMAMGGLEYEEDYPYRS-- 223
Query: 61 GVTGRCAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
V G C + K +V V + V D + +L+ GP+ ++ L DY G +I
Sbjct: 224 -VQGPCRLQSDKFEVSVDNCYRYVLYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITS 282
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP W+++NSWG ++G+ V+R N+CG+
Sbjct: 283 ---CKNYGLNHAVLLVGYGIENGVPFWVLKNSWGSDYGENGFVRVKRNVNSCGM 333
>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
Length = 324
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 105/177 (59%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ QLI+C+ + GC GG + A + + G++AE DYP+ N
Sbjct: 146 LESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVMNMGGIQAENDYPYEANN 205
Query: 61 GVTGRCAYDARK--VKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G C +A K VKV+ V + + +L GPL ++ + + +Y +IR
Sbjct: 206 G---DCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPLPVAIDASDIVNYKRGVIR- 261
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
C + LNHAV++VGY + + VP WI++N+WG WG + GYF V++ NACGI++
Sbjct: 262 --YCANHGLNHAVLLVGYAVENGVPFWILKNTWGTDWG-EQGYFRVQQNINACGIQN 315
>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
Length = 337
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 98/174 (56%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LESQYAIK+ L+ L++ QL++C+ + GC GG + A + + H G+E E DYP++
Sbjct: 159 LESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMHIGGVEQEYDYPYK--- 215
Query: 61 GVTGRCAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
V CA K V V + V + +L H GP+ ++ L DY G +I
Sbjct: 216 AVRLPCAVKPHKFAVGVRNCYRYVLLSEERLEDLLRHVGPIAIAVDAVDLTDYYGGVI-- 273
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP W ++NSWG ++GY + RG N+CG+
Sbjct: 274 -SFCENNGLNHAVLLVGYGIENNVPYWTIKNSWGSDYGENGYVRIRRGVNSCGM 326
>gi|339244637|ref|XP_003378244.1| cathepsin F [Trichinella spiralis]
gi|316972865|gb|EFV56511.1| cathepsin F [Trichinella spiralis]
Length = 317
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/180 (37%), Positives = 107/180 (59%), Gaps = 13/180 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+ES +AIK G L+ LS+ Q+I+C+ N+GC+GG KA + ++ +G++AE+DYP+
Sbjct: 95 IESAWAIKFGDLISLSEQQIIDCDKINRGCRGGQPLKAYHEIIRMSGVQAESDYPY---T 151
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIR-K 118
G+ G C + K+KV ++D ++ + ++ T LY +GP+ MN +L Y +I+
Sbjct: 152 GLHGSCKLNKEKIKVYINDTVLLHKNETTIANYLYEHGPVAVRMNADILMLYRKGIIKPT 211
Query: 119 NDVCPSENLNHAVVIVGYGMRHQV-----PVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
C LNH I+GYG + P WI++NSWG WG ++GYF + RG ACG+
Sbjct: 212 KSSCNPNFLNHGATIIGYGKESWLHWWSNPYWIIKNSWGVDWG-ENGYFRLYRGNEACGV 270
>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
Length = 367
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 105/175 (60%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQYAI+H L+ LS+ QL++C+ + GC GG + A Q L G+E EADYP++
Sbjct: 189 IESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELLLMGGVETEADYPYQ--- 245
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G C D RK+ V+++ ++ D + ++Y GP+ ++ + +Y ++ +
Sbjct: 246 GSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMDIINYRRGILNQ 305
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C +LNHAV+++G+G+ + VP WI++NSWG WG ++G+ V R NACG+
Sbjct: 306 ---CHIYDLNHAVLLIGWGIENNVPYWIIKNSWGEDWG-ENGFLRVRRNVNACGL 356
>gi|15617524|ref|NP_258322.1| cathepsin-like cysteine proteinase [Spodoptera litura NPV]
gi|37077642|sp|Q91BH1.1|CATV_NPVST RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15553260|gb|AAL01738.1|AF325155_50 cathepsin-like cysteine proteinase [Spodoptera litura NPV]
Length = 337
Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 101/175 (57%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQYAI H +L+ LS+ QL++C+ +QGC GG + A Q ++ G+E E DYP++
Sbjct: 159 IESQYAIMHDSLIDLSEQQLLDCDRVDQGCDGGLMHLAFQEIIRIGGVEHEIDYPYQ--- 215
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR--RMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G+ C K+ VR+S ++ D + +LY GP+ ++ + DY +
Sbjct: 216 GIEYACRLAPSKLAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCVDIIDYRSGIAT- 274
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
VC LNHAV++VGYG+ + P WI +NSWG WG ++GYF R NACG+
Sbjct: 275 --VCNDNGLNHAVLLVGYGIENDTPYWIFKNSWGSNWG-ENGYFRARRNINACGM 326
>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
Length = 338
Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 113/187 (60%), Gaps = 13/187 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES +AIK G L+ +S+ QL++C+ Y+ GC GG A++Y G + YP+ + G
Sbjct: 160 VESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGGLPWDALRYFVANGAMSLKSYPYVAKEG 219
Query: 62 VTGRCAYDARKVKVRVSDFLVFN--GSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C YD+ KV++R+ + +F+ D + LY+ GPL ++ + ++ Y G ++ +
Sbjct: 220 ---KCRYDSSKVEIRLKGYKIFSKISEDQIKEHLYNIGPLSIAIDVSPIKPYVGGIVMEE 276
Query: 119 -NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYG 176
++VC +NHAV++VGYG + V WIV+NSWG WG ++GYF +ERG N C + +
Sbjct: 277 CHEVC---QVNHAVLLVGYGKEYSVEYWIVKNSWGPNWG-ENGYFRMERGVN-CLLLTST 331
Query: 177 GICTRTL 183
GI T +
Sbjct: 332 GITTAVI 338
>gi|22549430|ref|NP_689203.1| cath gene product [Mamestra configurata NPV-B]
gi|215401259|ref|YP_002332563.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|22476609|gb|AAM95015.1| putative cysteine proteinase [Mamestra configurata NPV-B]
gi|198448759|gb|ACH88549.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|390165231|gb|AFL64878.1| cathepsin [Mamestra brassicae MNPV]
gi|401665635|gb|AFP95747.1| putative cysteine proteinase [Mamestra brassicae MNPV]
Length = 341
Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 98/174 (56%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LESQYAIK+ L+ L++ QL++C+ + GC GG + A + + H G+E E DYP++
Sbjct: 163 LESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMHIGGVEQEYDYPYK--- 219
Query: 61 GVTGRCAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
V CA K V V + V + +L H GP+ ++ L DY G +I
Sbjct: 220 AVRLPCAVKPHKFAVGVRNCYRYVLLSEERLEDLLRHVGPIAIAVDAVDLTDYYGGVI-- 277
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP W ++NSWG ++GY + RG N+CG+
Sbjct: 278 -SFCENNGLNHAVLLVGYGVENNVPYWTIKNSWGPDYGENGYVRIRRGVNSCGM 330
>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
Length = 364
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 98/174 (56%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQYAIK+ L+ LS+ QL++C+ + GC GG + A + ++ G+E + DYP+R +
Sbjct: 186 LESQYAIKYDRLIDLSEQQLVDCDHVDMGCDGGLIHTAYEEIMRMGGVEQDFDYPYRAER 245
Query: 61 GVTGRCAYDARKVK--VRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
CA K VR V + +L H GP+ ++ + DY G ++
Sbjct: 246 Q---PCALKPHKFAAGVRSCYRYVLLNEERLEDLLRHVGPIAIAVDAVDITDYYGGIV-- 300
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP WI++NSWG +DGY V RG N+CG+
Sbjct: 301 -SFCENNGLNHAVLLVGYGVENNVPYWILKNSWGSDYGEDGYVRVRRGVNSCGM 353
>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
Length = 322
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 68/175 (38%), Positives = 101/175 (57%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKA-IQYLKHAGLEAEADYPFRNQN 60
+E Q+ IK G L+ LSK QL++C++ +GC GG + + ++ + GLE+E DYP+
Sbjct: 141 VEGQWFIKTGQLVSLSKQQLVDCDMAAEGCNGGWPSSSYLEIMDMGGLESENDYPYV--- 197
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDY-NGKLIRK 118
GV CA + K+ ++ D +V S+ L +GPL +N LQ Y +G L
Sbjct: 198 GVEQTCALNKEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLLNAVALQHYQSGILHPS 257
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ CP ++LNHAV+ VGY +P WI++NSWG WG + GYF + RG CGI
Sbjct: 258 HKDCPDDDLNHAVLTVGYDREGDMPYWIIKNSWGTDWG-EKGYFRLFRGDCVCGI 311
>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
Length = 340
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 98/174 (56%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
LESQYAIK+ L+ L++ QL++C+ + GC GG + A Q ++ G+E E DYP++ +
Sbjct: 162 LESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMRMGGVEQEFDYPYKAER 221
Query: 61 GVTGRCAYDARKVK--VRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
CA K VR V + +L + GP+ ++ L DY G ++
Sbjct: 222 QP---CALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVDLTDYYGGIV-- 276
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP WI++NSWG +DGY V RG N+CG+
Sbjct: 277 -SFCKNNGLNHAVLLVGYGVENNVPYWIIKNSWGSDYGEDGYVRVRRGVNSCGM 329
>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
Length = 339
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 98/174 (56%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
LESQYAIK+ L+ L++ QL++C+ + GC GG + A Q ++ G+E E DYP++ +
Sbjct: 161 LESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMRMGGVEQEFDYPYKAER 220
Query: 61 GVTGRCAYDARKVK--VRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
CA K VR V + +L + GP+ ++ L DY G ++
Sbjct: 221 QP---CALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVDLTDYYGGIV-- 275
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP WI++NSWG +DGY V RG N+CG+
Sbjct: 276 -SFCKNNGLNHAVLLVGYGVENNVPYWIIKNSWGSDYGEDGYVRVRRGVNSCGM 328
>gi|9634237|ref|NP_037776.1| ORF16 cathepsin [Spodoptera exigua MNPV]
gi|37077857|sp|Q9J8B9.1|CATV_NPVSE RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|6960476|gb|AAF33546.1|AF169823_16 ORF16 cathepsin [Spodoptera exigua MNPV]
Length = 337
Score = 119 bits (298), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 100/175 (57%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
LESQYAIK+ L+ LS+ QL++C+ + GC GG + A Q +K G+E E DY ++ +
Sbjct: 159 LESQYAIKYDRLIDLSEQQLVDCDFVDMGCDGGLIHTAYEQIMKMGGVEQEFDYSYKAER 218
Query: 61 GVTGRCAYDARKVKVRVSD---FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
CA K V + +++ N + +L + GP+ ++ L DY G ++
Sbjct: 219 QP---CALKPHKFATGVRNCYRYVILN-EERLEDLLRYVGPIAIAVDAVDLTDYYGGIV- 273
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP WI++NSWG +DGY V RG N+CG+
Sbjct: 274 --SFCENNGLNHAVLLVGYGVENNVPYWIIKNSWGSDYGEDGYVRVRRGVNSCGM 326
>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
Length = 344
Score = 119 bits (297), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 100/174 (57%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQYAIK+ + LS+ QL++C+ + GC GG + A + + G+E E DYP+R+
Sbjct: 166 LESQYAIKYNEHIDLSEQQLVDCDTIDMGCAGGLLHTAYEEIMSMGGVEYEEDYPYRS-- 223
Query: 61 GVTGRCAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
V G C + K +V V + + D + +L+ GP+ ++ L DY G +I
Sbjct: 224 -VQGPCRIENDKFQVSVDNCYRYILYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITS 282
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG + +P W+++NSWG ++G+ V+R N+CG+
Sbjct: 283 ---CKNYGLNHAVLLVGYGTENGIPFWVLKNSWGTDYGENGFVRVKRNVNSCGM 333
>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
Length = 327
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 97/175 (55%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E Q+ IK G L+ LSK QL++C+ QGC GG + ++ + GLE+E+DYP+
Sbjct: 146 VEGQWFIKTGQLVSLSKQQLVDCDRAAQGCNGGWPASSYLEIMYMGGLESESDYPYV--- 202
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
GV CA + K+ ++ D +V + L +GPL +N LQ Y +++
Sbjct: 203 GVEQTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQHYQSGVLKPT 262
Query: 120 -DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
D CP LNHAV+ VGY +P WI++NSWG WG + GYF + RG CGI
Sbjct: 263 FDECPDTELNHAVLTVGYDKEGDMPYWIIKNSWGTDWG-EKGYFRLFRGDCTCGI 316
>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
Length = 324
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 104/177 (58%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ QLI+C+ + GC GG + A + + G++AE DYP+ N
Sbjct: 146 LESQFAIKHDQLINLSEQQLIDCDFVDVGCDGGLLHTAYEAVMNMGGIQAENDYPYEANN 205
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G C +A K VRV + + + +L GP+ ++ + + Y +IR
Sbjct: 206 G---PCRVNAAKFVVRVKKCYRYVTLFEEKLKDLLRIVGPIPVAIDASDIVGYKRGIIR- 261
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
C + LNHAV++VGYG+ + +P WI++N+WG WG + GYF V++ NACGI++
Sbjct: 262 --YCENHGLNHAVLLVGYGVENGIPFWILKNTWGADWG-EQGYFRVQQNINACGIKN 315
>gi|449139100|gb|AGE89905.1| cathepsin-like cysteine proteinase [Spodoptera littoralis NPV]
Length = 336
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 100/175 (57%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQYAI H +L+ LS+ QL++C+ +QGC GG + A Q ++ G+E E DYP++
Sbjct: 158 IESQYAILHDSLIDLSEQQLLDCDRIDQGCDGGLMHLAFQEIMRIGGVEHEIDYPYQ--- 214
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR--RMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G+ C K VR+S ++ D + +LY GP+ ++ + DY +
Sbjct: 215 GIEYACRSAPSKFAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCRDIIDYRSGIAT- 273
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
VC LNHAV++VGYG+ + P WI +NSWG WG ++GYF R NACG+
Sbjct: 274 --VCNDNGLNHAVLLVGYGIENDTPYWIFKNSWGSNWG-ENGYFRARRNINACGM 325
>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 106/179 (59%), Gaps = 15/179 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ QLI+C+ + GC GG + A + + G++AE DYP+ N
Sbjct: 146 LESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVMNMGGIQAENDYPYEANN 205
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C +A K V+V VF + + +L GP+ ++ + + +Y ++
Sbjct: 206 G---DCRANAAKFVVKVKKCYRYITVF--EEKLKDLLRSVGPIPVAIDASDIVNYKRGIM 260
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C + LNHAV++VGY +++ VP WI++N+WG WG + GYF V++ NACGI++
Sbjct: 261 K---YCANHGLNHAVLLVGYAVQNGVPFWILKNTWGADWG-EQGYFRVQQNINACGIQN 315
>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 105/179 (58%), Gaps = 15/179 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH + LS+ QLI+C+ + GC GG + A + + G++AE+DYP+ N
Sbjct: 146 LESQFAIKHNQFINLSEQQLIDCDFVDAGCDGGLLHTAFEAVMNMGGIQAESDYPYEANN 205
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C +A K V+V VF + + +L GP+ ++ + + +Y ++
Sbjct: 206 G---DCRANAAKFVVKVKKCYRYITVF--EEKLKDLLRSVGPIPVAIDASDIVNYKRGIM 260
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C + LNHAV++VGY + + VP WI++N+WG WG + GYF V++ NACGI++
Sbjct: 261 K---YCANHGLNHAVLLVGYAVENGVPFWILKNTWGADWG-EQGYFRVQQNINACGIQN 315
>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
Length = 443
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 99/180 (55%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q AI G L+ LS+ +L+ C+ + GC GG + A +L + + EA YP+ +
Sbjct: 147 IEGQNAIATGNLVSLSEQELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIATEASYPYVS 206
Query: 59 QNGVTGRCAY--DARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG+ C+Y D + V +S+F G++ +++YGPL G++ + Q Y G +
Sbjct: 207 GNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAFVFNYGPLSIGVDASTWQSYAGGI 266
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIES 174
I CP ++H V+IVGY P WI++NSW WG +DGY V +G+N CG+ S
Sbjct: 267 IT---YCPDVQIDHGVLIVGYDDTAPTPYWIIKNSWTANWG-EDGYIRVAKGSNMCGLTS 322
>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
Length = 443
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 99/180 (55%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q AI G L+ LS+ +L+ C+ + GC GG + A +L + + EA YP+ +
Sbjct: 147 IEGQNAIATGNLVSLSEQELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIATEASYPYVS 206
Query: 59 QNGVTGRCAY--DARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG+ C+Y D + V +S+F G++ +++YGPL G++ + Q Y G +
Sbjct: 207 GNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAFVFNYGPLSIGVDASTWQSYAGGI 266
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIES 174
I CP ++H V+IVGY P WI++NSW WG +DGY V +G+N CG+ S
Sbjct: 267 IT---YCPDVQIDHGVLIVGYDDTAPTPYWIIKNSWTANWG-EDGYIRVAKGSNMCGLTS 322
>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
Length = 326
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 99/179 (55%), Gaps = 18/179 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ K G LL LS+ QLI+C+ +QGC GG + ++ GLE +DYP+ ++
Sbjct: 148 VEGQWFRKTGDLLGLSEQQLIDCDHSDQGCDGGYPPQTYSAIEEMGGLELRSDYPYTGKD 207
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGS-------DTFRRMLYHYGPLVAGMNGALLQDYNG 113
G+ C D K V NGS T + L GPL +G+N LLQ Y
Sbjct: 208 GI---CYMDQSKFVAYV------NGSTRLPWCEKTQAKSLKEIGPLSSGLNAVLLQLYKR 258
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
++R P+E LNHAV+ VGYGM H++P WIV+NSWG+ + GYF + RG CGI
Sbjct: 259 GIMRPRWCNPAE-LNHAVLTVGYGMEHRMPYWIVKNSWGKRFGEKGYFRIYRGDGTCGI 316
>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 104/177 (58%), Gaps = 15/177 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
C ++ K V+V D +V+ + + +L GP+ ++ A + +Y +I
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYIIVY--EEKLKDLLRLVGPIPMAIDAADIVNYKQGII 259
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ C LNHAV++VGYG+ + VP W +N+WG WG +DG+F V++ NACG+
Sbjct: 260 K---YCFDSGLNHAVLLVGYGVENNVPYWTFKNTWGTDWG-EDGFFRVQQNINACGM 312
>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 105/177 (59%), Gaps = 15/177 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
C ++ K V+V D +V+ + + +L GP+ ++ A + +Y +I
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYIIVY--EEKLKDLLRLVGPIPMAIDAADIVNYKQGII 259
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ C + LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+
Sbjct: 260 K---YCFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGM 312
>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 104/177 (58%), Gaps = 15/177 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
C ++ K V+V D +V+ + + +L GP+ ++ A + +Y +I
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYIIVY--EEKLKDLLRLVGPIPMAIDAADIVNYKQGII 259
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ C LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+
Sbjct: 260 K---YCFDSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGM 312
>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
Length = 322
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 68/175 (38%), Positives = 96/175 (54%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ IK G L+ LSK QL++C+ +GC GG + +KH GLE+E+DYP+
Sbjct: 141 VEGQWFIKTGQLVSLSKQQLVDCDRVAEGCNGGWPVSSYLEIKHMGGLESESDYPYV--- 197
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDY-NGKLIRK 118
G CA + K+ ++ D +V + L +GPL +N LQ Y +G L
Sbjct: 198 GAEQTCALNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAVALQHYQSGVLNPT 257
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ CP LNHAV+ VGY +P WI++NSWG WG + GYF + RG CGI
Sbjct: 258 YEECPDTELNHAVLTVGYDKEGDMPYWIIKNSWGTDWG-EKGYFRLFRGDYTCGI 311
>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
Length = 323
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 105/177 (59%), Gaps = 15/177 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
C ++ K V+V D +V+ + + +L GP+ ++ A + +Y +I
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYIIVY--EEKLKDLLRLVGPIPMAIDAADIVNYKQGII 259
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ C + LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+
Sbjct: 260 K---YCFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGM 312
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/175 (36%), Positives = 101/175 (57%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFRNQN 60
+ES +AIK G L+ LS+ +LI+C++ ++GC GG A + +K G LE E YP+ +N
Sbjct: 281 IESLWAIKTGKLISLSEQELIDCDVIDKGCNGGLPINAFREIKRMGGLEPEDQYPYEAKN 340
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDY-NGKLIRK 118
G C ++ V + D + ++T + + GPL G++ LL Y +G L
Sbjct: 341 GT---CHLVRAQIAVSIDDAVEIPRNETVMKAWIAQRGPLSVGIDAELLSYYKSGILHPS 397
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
CP +NH V+I GYG+ + +P W ++NSWG +WG ++GYF + RG N CG+
Sbjct: 398 KSRCPPSKINHGVLITGYGIENNLPYWTIKNSWGEQWG-ENGYFQLMRGKNICGV 451
>gi|633096|dbj|BAA04664.1| prepro NTP [Paragonimus westermani]
Length = 245
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 98/175 (56%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E Q+ IK G L+ LSK QL++C++ +GC GG + ++ + GLE+E+DYP+
Sbjct: 64 VEGQWFIKTGQLVSLSKQQLVDCDMAAEGCNGGWPASSYLEIMYMGGLESESDYPYV--- 120
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
GV CA + K+ ++ D +V + L +GPL +N LQ Y +++
Sbjct: 121 GVEQTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPT 180
Query: 120 -DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ CP LNHAV+ VGY +P WI++NSWG WG + GYF + RG CGI
Sbjct: 181 FEECPDTELNHAVLTVGYDKEGDMPYWIIKNSWGTDWG-EKGYFRLFRGDCTCGI 234
>gi|340504799|gb|EGR31212.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 250
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/182 (36%), Positives = 104/182 (57%), Gaps = 10/182 (5%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADY-PFRN 58
++ESQYA+K+ L+ S+ QLI+C+ N GC+GG A + ++ GLE DY + N
Sbjct: 71 VIESQYALKYNKLVNFSEQQLIDCDSINDGCRGGLMTDAYKAIQEMGGLETSEDYGEYLN 130
Query: 59 QNGVTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G+C D+ KV +V + + + + RR L GP+ G+N LQ Y G ++
Sbjct: 131 S---KGQCKIDSNKVSAKVINWYQISEDEEAIRRELVQNGPIAVGVNARFLQFYQGGILD 187
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYG 176
+C +++NHAV+IVGYG + WI++N WG+ WG +GYF + RG CG+ +Y
Sbjct: 188 PK-LC-DDSINHAVLIVGYGEENGKKYWIIKNQWGKSWGI-NGYFKLVRGKKQCGVHTYA 244
Query: 177 GI 178
I
Sbjct: 245 SI 246
>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
Length = 323
Score = 116 bits (291), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 104/177 (58%), Gaps = 15/177 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
C ++ K V+V D +V+ + + +L GP+ ++ A + +Y +I
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYIIVY--EEKLKDLLPLVGPIPMAIDAADIVNYKQGII 259
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ C LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+
Sbjct: 260 K---YCFDSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGM 312
>gi|438000427|ref|YP_007250532.1| v-cath protein [Thysanoplusia orichalcea NPV]
gi|429842964|gb|AGA16276.1| v-cath protein [Thysanoplusia orichalcea NPV]
Length = 323
Score = 116 bits (291), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 102/175 (58%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQYAIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+
Sbjct: 145 LESQYAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYE--- 201
Query: 61 GVTGRCAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
C + K VRV D V + + +L GP+ ++ A + +Y +IR
Sbjct: 202 ANNNNCRMNGNKFAVRVKDCYRYVTVYEEKLKDLLRVAGPIPMAIDAADIVNYKQGVIR- 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + +P WI +N+WG WG +DGYF V++ NACG+
Sbjct: 261 --YCFNSGLNHAVLLVGYGVENNIPFWIFKNTWGTDWG-EDGYFRVQQNINACGM 312
>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
Length = 323
Score = 116 bits (291), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 104/177 (58%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
C ++ K V+V D + + + +L GP+ ++ A + +Y +I+
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIK- 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
C + LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+ +
Sbjct: 261 --YCFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGMRN 314
>gi|55735421|gb|AAV59468.1| cathepsin [Bombyx mori NPV]
Length = 323
Score = 116 bits (290), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 102/175 (58%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
C ++ K V+V D + + + +L GP+ ++ A + +Y +I+
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIK- 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+
Sbjct: 261 --YCFDSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGM 312
>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
Length = 323
Score = 116 bits (290), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 104/177 (58%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
C ++ K V+V D + + + +L GP+ ++ A + +Y +I+
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIK- 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
C + LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+ +
Sbjct: 261 --YCFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGMRN 314
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 116 bits (290), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 67/181 (37%), Positives = 106/181 (58%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E QYAIKHG LL LS+ +L++C+ + GC GG + A + ++ GLE E+DYP+ ++
Sbjct: 850 IEGQYAIKHGELLSLSEQELVDCDKLDSGCNGGLPDTAYRAIEELGGLELESDYPYDAED 909
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C ++ KVKV + L ++T + L GP+ G+N +Q Y G +
Sbjct: 910 ---EKCHFNKNKVKVNIVSGLNITSNETQMAQWLVKNGPMSIGINANAMQFYMGGVSHPF 966
Query: 119 NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+C ++L+H V+IVGYG+ + +P WI++NSWG RWG + GY+ V RG CG
Sbjct: 967 KFLCSPDSLDHGVLIVGYGVKFYPIFKKTMPYWIIKNSWGPRWG-EQGYYRVYRGDGTCG 1025
Query: 172 I 172
+
Sbjct: 1026 V 1026
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 116 bits (290), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 104/186 (55%), Gaps = 14/186 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E QYAIKHG LL LS+ +L++C+ ++GC GG + A + ++ GLE E+DYP+ +N
Sbjct: 588 VEGQYAIKHGQLLSLSEQELVDCDHLDEGCNGGLPDNAYRAIEQLGGLELESDYPYEAEN 647
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C + VKV ++ + ++T + L GP+ G+N +Q Y G +
Sbjct: 648 ---EKCHFKQNLVKVELASAVNITSNETQIAQWLVQNGPIAIGINANAMQFYMGGVSHPL 704
Query: 120 DV-CPSENLNHAVVIVGYG------MRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
+ C NLNH V+IVGYG +P WI++NSWG+ WG + GY+ V RG CG
Sbjct: 705 KILCNPNNLNHGVLIVGYGTSRYPLFHKNLPYWIIKNSWGKSWG-EQGYYRVYRGDGTCG 763
Query: 172 IESYGG 177
+ +
Sbjct: 764 LNTMAS 769
>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
Length = 322
Score = 116 bits (290), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 97/175 (55%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E Q+ IK G L+ LSK QL++C+ QGC GG + ++ + GLE+E+DYP+
Sbjct: 141 VEGQWFIKTGQLVSLSKQQLVDCDRAAQGCNGGWPASSYLEIMYMGGLESESDYPYV--- 197
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
GV CA + K+ ++ D +V + L +GPL +N LQ Y +++
Sbjct: 198 GVEQTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPT 257
Query: 120 -DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ CP LNHAV+ VGY +P WI++NSWG WG + GYF + RG CGI
Sbjct: 258 FEECPDTELNHAVLTVGYDKEGDMPYWIIKNSWGTDWG-EKGYFRLFRGDCTCGI 311
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 64/173 (36%), Positives = 91/173 (52%), Gaps = 7/173 (4%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
E+ Y K G L+ LS+ QL++C+ N GC GG ++ Y+K GLEAE+ YP++ G
Sbjct: 145 EAAYYRKAGKLVSLSEQQLVDCSTDINAGCNGGYLDETFTYVKSKGLEAESTYPYK---G 201
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C Y A KV +VS D + + GP+ ++ L Y I ++
Sbjct: 202 TDGSCKYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDATYLSSYESG-IYED 260
Query: 120 DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
D C LNH V++VGYG + WIV+NSWG + GYF + RG N CG+
Sbjct: 261 DWCSPSELNHGVLVVGYGTSNGKKYWIVKNSWGGSFGESGYFRLLRGKNECGV 313
>gi|1666270|emb|CAA49713.1| envelope glycoprotein [Autographa californica nucleopolyhedrovirus]
Length = 208
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 104/177 (58%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 30 LESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 89
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
C ++ K V+V D + + + +L GP+ ++ A + +Y +I+
Sbjct: 90 N---NCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIK- 145
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
C + LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+ +
Sbjct: 146 --YCFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGMRN 199
>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
castaneum]
Length = 1726
Score = 115 bits (289), Expect = 6e-24, Method: Composition-based stats.
Identities = 64/181 (35%), Positives = 104/181 (57%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E QYA++HG LL S+ +L++C+ +QGC GG + A + + K GLE E DYP+ ++
Sbjct: 1540 VEGQYALRHGKLLEFSEQELVDCDTDDQGCNGGLMDTAYRSIEKIGGLETEQDYPYDAED 1599
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C ++ +V+V+ L + ++T + L GP+ +N +Q Y G +
Sbjct: 1600 ---EKCHFNRTLARVQVTGALNISHNETDMAKWLVANGPISIAINANAMQFYMGGVSHPF 1656
Query: 119 NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
+C +NL+H V+IVGYG+ + +P WIV+NSWG WG + GY+ V RG CG
Sbjct: 1657 KFLCSPKNLDHGVLIVGYGVHNYPLFKKSLPYWIVKNSWGTGWG-EQGYYRVYRGDGTCG 1715
Query: 172 I 172
+
Sbjct: 1716 L 1716
>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
Length = 1761
Score = 115 bits (289), Expect = 6e-24, Method: Composition-based stats.
Identities = 64/181 (35%), Positives = 104/181 (57%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E QYA++HG LL S+ +L++C+ +QGC GG + A + + K GLE E DYP+ ++
Sbjct: 1575 VEGQYALRHGKLLEFSEQELVDCDTDDQGCNGGLMDTAYRSIEKIGGLETEQDYPYDAED 1634
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C ++ +V+V+ L + ++T + L GP+ +N +Q Y G +
Sbjct: 1635 ---EKCHFNRTLARVQVTGALNISHNETDMAKWLVANGPISIAINANAMQFYMGGVSHPF 1691
Query: 119 NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
+C +NL+H V+IVGYG+ + +P WIV+NSWG WG + GY+ V RG CG
Sbjct: 1692 KFLCSPKNLDHGVLIVGYGVHNYPLFKKSLPYWIVKNSWGTGWG-EQGYYRVYRGDGTCG 1750
Query: 172 I 172
+
Sbjct: 1751 L 1751
>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
Length = 324
Score = 115 bits (289), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 104/175 (59%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
LESQ+AIK+ L+ LS+ Q I+C+ N GC GG + A + ++ G++ E+DYP+ N
Sbjct: 146 LESQFAIKYNRLINLSEQQFIDCDRVNAGCDGGLLHTAFESAMEMGGVQMESDYPYETAN 205
Query: 61 GVTGRCAYDARK--VKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G +C + + V VR + + + +L GP+ ++ + + +Y ++R+
Sbjct: 206 G---QCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRRGIMRQ 262
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C + LNHAV++VGY + + +P WI++N+WG WG +DGYF V++ NACGI
Sbjct: 263 ---CANHGLNHAVLLVGYAVENNIPYWILKNTWGTDWG-EDGYFRVQQNINACGI 313
>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
Length = 325
Score = 115 bits (289), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 98/176 (55%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G L+ LSK QL++C+ + GC GG + +K GLE ++DYP+
Sbjct: 145 IEGQWFLKTGYLVSLSKQQLVDCDTVDNGCYGGYPPYTYKEIKRMGGLELQSDYPY---T 201
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM-LYHYGPLVAGMNGALLQDY-NGKLIRK 118
G C D K+ ++ D +V + + L +GP+ +N LQ Y +G L
Sbjct: 202 GWGHGCRLDRSKLFAKIDDSIVLEADEEKQAAWLAEHGPMSTCLNAKYLQFYQSGILHPS 261
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+C E LNHAV+ VGY +H +P WI++NSWG WG +DGYF + RG CGI+
Sbjct: 262 KAMCSPEGLNHAVLTVGYDTKHGIPYWIIKNSWGTSWG-EDGYFRIYRGDGTCGID 316
>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
Length = 359
Score = 115 bits (289), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 99/175 (56%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES+ A+K G+L+ LS+ QL++CN N GC GG + A+QY++ AGL E +YP++ NG
Sbjct: 146 IESRLALKTGSLVSLSEQQLLDCNRVNAGCDGGVLSYALQYVESAGLTTEDEYPYKAWNG 205
Query: 62 VTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
C + V + L++ S++ GP+ +N LLQ Y K I
Sbjct: 206 T---CNSTHKPVAAYTKGYTLIYTRSESDLMKAVAEGPVAVALNADLLQ-YYSKGIFNPS 261
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
C S +NH ++VGY +P WI++NSWG WG ++GYF + +G N CGI S
Sbjct: 262 AC-SSTVNHGGLVVGYEENATLPYWIIKNSWGATWG-ENGYFRMAKGYNLCGITS 314
>gi|4972585|gb|AAD34707.1|AF071801_1 cysteine proteinase [Paragonimus westermani]
Length = 229
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 100/175 (57%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGF--NKAIQYLKHAGLEAEADYPFRNQ 59
+E Q+ +K G L+ LSK QL++C++ + GC GGG+ N ++ ++ GLE ++DYP+
Sbjct: 49 VEGQWFLKTGQLVSLSKQQLVDCDVMDYGC-GGGWPTNAYMEIMRMGGLELQSDYPYV-- 105
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
GV +C + K+ ++ D +V + L +GPL + +N LQ Y +
Sbjct: 106 -GVQQQCYLNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSSALNAGYLQFYQSGISHP 164
Query: 119 N-DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ + C +LNHAV+ VGY + VP WI++NSWG ++GYF + RG CGI
Sbjct: 165 SYEECSPASLNHAVLTVGYDTENGVPYWIIKNSWGTGWGENGYFRLYRGDGTCGI 219
>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
Length = 325
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 100/175 (57%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E Q+ +K G L+ LSK QL++C++ + GC GG N ++ ++ GLE ++DYP+
Sbjct: 145 VEGQWFLKTGQLVSLSKQQLVDCDVMDYGCGGGWPTNAYMEIMRMGGLELQSDYPYV--- 201
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
GV +C + K+ ++ D +V + L +GPL + +N LQ Y + +
Sbjct: 202 GVQQQCYLNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSSALNAGYLQFYQSGISHPS 261
Query: 120 -DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ C +LNHAV+ VGY + VP WI++NSWG WG ++GYF + RG CGI
Sbjct: 262 YEECSPASLNHAVLTVGYDTENGVPYWIIKNSWGTGWG-ENGYFRLYRGDGTCGI 315
>gi|27819101|gb|AAO23117.1| cysteine proteinase [Bombyx mori NPV]
Length = 323
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 103/177 (58%), Gaps = 15/177 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNELINLSEQQMIGCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
C ++ K V+V D +V+ + + +L GP+ ++ A + +Y +I
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYIIVY--EEKLKDLLRLVGPIPMAIDAADIVNYKQGII 259
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ C LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+
Sbjct: 260 K---YCFDSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGM 312
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/187 (36%), Positives = 105/187 (56%), Gaps = 18/187 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E QYAIKH LL LS+ +L++C+ ++GC GG A + ++ GLE E+DYP+ +
Sbjct: 698 VEGQYAIKHNQLLSLSEQELVDCDSLDEGCNGGDMENAYKAIERLGGLELESDYPY---D 754
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM---LYHYGPLVAGMNGALLQDYNGKLIR 117
+C + K KV+V N + +RM L GP+ G+N +Q Y G +
Sbjct: 755 AKDEKCHFLQNKAKVQVVS--AVNITSDEKRMAQWLVKNGPISVGINANAMQFYFGGVSH 812
Query: 118 K-NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA 169
N +C +NL+H V+IVGYG+ ++P WI++NSWG RWG + GY+ V RG
Sbjct: 813 PLNFLCNPKNLDHGVLIVGYGISKYPLFHKELPYWIIKNSWGPRWG-ERGYYRVYRGDGT 871
Query: 170 CGIESYG 176
CG+ +
Sbjct: 872 CGVNTMA 878
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/186 (37%), Positives = 105/186 (56%), Gaps = 14/186 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E QYAIKH LL LS+ +L++C+ ++GC GG + A + + K GLE E+DYP+ +N
Sbjct: 846 VEGQYAIKHNKLLSLSEQELVDCDDLDEGCNGGLPDNAYRAIEKLGGLELESDYPYEAEN 905
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
RC + KV+V + ++T + L GP+ G+N +Q Y G +
Sbjct: 906 ---ERCHFKKNMAKVQVGSAVNITSNETQIAQWLVANGPISIGINANAMQFYMGGVSHPF 962
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+C +NL+H V+IVGYG + ++P WIV+NSWG RWG + GY+ V RG CG
Sbjct: 963 KFLCNPKNLDHGVLIVGYGTSNYPLFHKKLPYWIVKNSWGDRWG-EQGYYRVYRGDGTCG 1021
Query: 172 IESYGG 177
+ +
Sbjct: 1022 LNTMAS 1027
>gi|47779249|gb|AAT38521.1| cysteine protease [Bombyx mori NPV]
Length = 323
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 105/179 (58%), Gaps = 15/179 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + + G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEANCRMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
C ++ K V+V D +V+ + + +L GP+ ++ A + +Y +I
Sbjct: 205 N---NCRMNSNKFLVQVKDCYRYIIVY--EEKLKDLLRLVGPIPMAIDAADIVNYKQGII 259
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+ C + LNHAV++VGYG+ + +P W +N+WG WG +DG+F V++ NACG+ +
Sbjct: 260 K---YCFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EDGFFRVQQNINACGMRN 314
>gi|402585860|gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]
Length = 242
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 100/175 (57%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+ES +AIK G L+ LS+ +LI+C++ + GC GG A + +K GLE E YP++ +N
Sbjct: 62 IESLWAIKTGNLISLSEQELIDCDVIDNGCNGGLPINAFREIKRMGGLEPEDQYPYKAKN 121
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDY-NGKLIRK 118
G C ++ V + D + ++T + + GPL G++ LL Y +G L
Sbjct: 122 GT---CHLVRAQIAVTIDDAIEIPRNETVMKAWIAQRGPLSVGIDAELLAYYKSGILHPS 178
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
CP +NH V+I GYG+ + +P W ++NSWG WG ++GYF + RG + CG+
Sbjct: 179 KSRCPPSKINHGVLITGYGIENGLPYWTIKNSWGEEWG-ENGYFRLMRGKDICGV 232
>gi|23577865|ref|NP_703114.1| viral cathepsin [Rachiplusia ou MNPV]
gi|37077115|sp|Q8B9D5.1|CATV_NPVR1 RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|23476510|gb|AAN28057.1| viral cathepsin [Rachiplusia ou MNPV]
Length = 323
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 103/177 (58%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E+DYP+ N
Sbjct: 145 LESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADN 204
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
C + K V+V D + + + +L GP+ ++ A + +Y +I+
Sbjct: 205 N---NCRMNTNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIK- 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
C + LNHAV++VGYG+ + +P W +N+WG WG ++G+F V++ NACG+ +
Sbjct: 261 --YCFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-EEGFFRVQQNINACGMRN 314
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats.
Identities = 61/181 (33%), Positives = 103/181 (56%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G L+ LS+ +L++C+ +QGC GG + A + ++ GLE+E DYP+
Sbjct: 2491 IEGQWKMKTGDLVSLSEQELVDCDKLDQGCNGGLPDNAYRAIEQLGGLESEDDYPYE--- 2547
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G +C+++ +V++S + ++T + L +GP+ G+N +Q Y G +
Sbjct: 2548 GSDDKCSFNKTLARVQISGAVNITSNETDMAKWLVKHGPISIGINANAMQFYMGGISHPW 2607
Query: 119 NDVCPSENLNHAVVIVGYGMR------HQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+C NL+H V+IVGYG + +P WI++NSWG WG + GY+ V RG CG
Sbjct: 2608 RMLCNPSNLDHGVLIVGYGAKDYPLFHKHLPYWIIKNSWGTSWG-EQGYYRVYRGDGTCG 2666
Query: 172 I 172
+
Sbjct: 2667 V 2667
>gi|302794759|ref|XP_002979143.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
gi|300152911|gb|EFJ19551.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
Length = 227
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 97/180 (53%), Gaps = 8/180 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E + +K L+ L + QL++C+ + GC+GG A +Y+K GLEAE DYP++ +N
Sbjct: 42 VEGAHFLKSRELISLREEQLVDCDRMDGGCKGGDMLNAYEYIKAKGLEAEEDYPYQEENY 101
Query: 62 VT-----GRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
RC + KV ++++ V D L GPL +N + DY G +
Sbjct: 102 KEYMFPHHRCHFRPSKVAATIANYSTVSEDEDQIAANLVKNGPLSIALNANYIMDYMGGV 161
Query: 116 IRKNDVCPS-ENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+CP +N+NHAV++VGYGM P WI++NSW +DGYF + RG CG+ +
Sbjct: 162 ACPR-ICPGGDNMNHAVLLVGYGMDGDKPYWILKNSWSENYGEDGYFRLCRGFGVCGMNT 220
>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
Length = 324
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 104/179 (58%), Gaps = 15/179 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+A+KH L+ LS+ Q+I+C+ + GC GG + A + +K G++ E DYP+ N
Sbjct: 146 LESQFAMKHNQLIDLSEQQMIDCDSVDAGCNGGLLHTAFEAVIKMGGVQLEKDYPYEAAN 205
Query: 61 GVTGRCAYDARKVKVRVSD----FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
C ++ K V+V D +V+ + + +L GP+ ++ A + +Y +I
Sbjct: 206 N---NCRMNSNKFLVKVKDCYRYIIVY--EEKLKDLLRSVGPIPMAIDAADIVNYKQGII 260
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+ C + LNHAV++VGYG+ + +P W +N+WG WG + GYF +++ NACG+ +
Sbjct: 261 K---YCLNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG-ESGYFRLQQNINACGMRN 315
>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
Length = 363
Score = 113 bits (283), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 69/192 (35%), Positives = 103/192 (53%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC---------NIYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + ++ G L+ LS+ QL++C N + GC GG N A +Y LK GL+ E
Sbjct: 167 LEGSHFLQTGELVSLSEQQLVDCDHECDPAEYNSCDSGCNGGLMNNAFEYILKAGGLQKE 226
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
ADYP+ G G C +D K+ V++F +V D L GPL G+N A +Q
Sbjct: 227 ADYPY---TGRDGTCKFDKSKIAASVANFSVVSTDEDQIAANLVTNGPLAIGINAAWMQT 283
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDGYFT 162
Y G+ + +C ++H V++VGYG + P WI++NSWG WG +DGY+
Sbjct: 284 YIGQ-VSCPYICSKTKMDHGVLLVGYGSAGYAPLRFKEKPYWIIKNSWGEDWG-EDGYYK 341
Query: 163 VERGTNACGIES 174
+ G NACG+++
Sbjct: 342 LCSGYNACGMDT 353
>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
Length = 308
Score = 113 bits (282), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 102/177 (57%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E +AIK L+ LS+ +L++C+I +QGC GG + A + ++ GLEAE+DYP+ +
Sbjct: 128 IEGAWAIKTSKLVSLSEQELVDCDIIDQGCNGGLPSNAYREIIRMGGLEAESDYPY---D 184
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G +C + + V ++D L + + L GP+ G+N LQ Y +
Sbjct: 185 GRGEKCHLMKKDIAVYINDSLQLPHDEEKMAAWLVAKGPISIGLNANPLQFYRHGIAHPW 244
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
V C ++L+H V+IVGYG P WI++NSWG +WG ++GYF + RG N CGI+
Sbjct: 245 RVFCSPKHLDHGVLIVGYGSETDKPYWIIKNSWGTKWG-EEGYFRLFRGKNVCGIQE 300
>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
[Strongylocentrotus purpuratus]
Length = 453
Score = 113 bits (282), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 106/175 (60%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+E Q+ IK G L+ LS+ +L++C+ + GC+GG + A + +K G +E YP+R +N
Sbjct: 273 MEGQWQIKKGELISLSEQELVDCDKVDGGCEGGEMSDAYEAIIKLGGAMSEEKYPYRGEN 332
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C ++ V+V+++ ++ + ++T L +GP+ G+N ++Q Y G +
Sbjct: 333 E---KCKFNMTDVRVKINGYVNISKNETEMAGWLAAHGPISIGINALMMQFYFGGIAHPW 389
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ C ++L+H V+IVGY ++ P WIV+NSWG+ WG ++GY+ V RG CG+
Sbjct: 390 KIFCSPDSLDHGVLIVGYSVKDGEPYWIVKNSWGKDWG-EEGYYLVYRGDGTCGL 443
>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
Length = 320
Score = 113 bits (282), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 92/166 (55%), Gaps = 9/166 (5%)
Query: 11 GTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGVTGRCAY 68
G L+ LS+ QL++C N GC GG + Y++ GLEAEA YP++ ++G C +
Sbjct: 151 GKLVSLSEQQLVDCTYGTVNFGCDGGYLEETFPYIQETGLEAEASYPYKARDGT---CKF 207
Query: 69 DARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENL 127
DA KV +++D++ + G + GP+ M+ + Y + +C S++L
Sbjct: 208 DASKVVTKINDYVYWYGDEEALLEATATIGPISVAMDANYIDSYASGVFSSR-LCSSDDL 266
Query: 128 NHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
NH V++VGYG + V W+V+NSW WG + GY + RG N CGI
Sbjct: 267 NHGVLVVGYGSENGVNYWLVKNSWAEDWG-ESGYLKLLRGQNECGI 311
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 71/192 (36%), Positives = 100/192 (52%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E +K G L+ LS+ QL++C+ + + GC GG A QY LK GL+ E
Sbjct: 193 MEGANFMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQYALKAGGLQRE 252
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G+ G C +D KV V++F + D L GPL G+N A +Q
Sbjct: 253 EDYPY---TGIDGSCKFDNTKVAAMVANFSTVSIDEDQIAANLVKNGPLAVGINAAFMQT 309
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWG-RWGPDDGYFT 162
Y G + VC +NL+H V++VGYG P WI++NSWG WG +DGY+
Sbjct: 310 YVGG-VSCPYVCNKQNLDHGVLLVGYGAAGYAPGRLKNKPFWIIKNSWGPDWG-EDGYYK 367
Query: 163 VERGTNACGIES 174
+ RG N CGI +
Sbjct: 368 LCRGHNVCGINT 379
>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
Length = 336
Score = 112 bits (281), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 98/177 (55%), Gaps = 7/177 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES AIKHG L+ LS+ QL++C+ ++ C G + A QYL G +E YP++ G
Sbjct: 159 VESVNAIKHGNLVELSEQQLVDCDSKDEACDSGLPDNAQQYLVSHGAISEQSYPYK---G 215
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
C YD+ +V VR+S+F S+ LY PL + +L Y K I N+
Sbjct: 216 YAANCTYDSSQVVVRLSNFEKVVLSECQMAEKLYSTAPLSIVIAAEVLGTYT-KGILVNE 274
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYG 176
S++LNHAV++VGYG WI++NSWG WG + GYF ++RG N I YG
Sbjct: 275 CEQSQDLNHAVLLVGYGNEGGTNFWILKNSWGTNWG-EGGYFRIKRGVNCLMITDYG 330
>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
Length = 327
Score = 112 bits (281), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 103/174 (59%), Gaps = 10/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ES YAIK+ LL LS+ QL+ C+ N GC GG + A++ ++ G+ E D+P+ +
Sbjct: 151 IESLYAIKYNKLLDLSEQQLVNCDEQNNGCNGGLMHWAMEEIIRQGGVSNETDFPYTASD 210
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
G C V + + + + D R +L GP+ ++ + DY+ + +
Sbjct: 211 GF---CKRKQGFVNINGCNQFILSNEDRLRELLIFNGPISIAIDVIDVIDYSQGI---SS 264
Query: 121 VCPSEN-LNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
C ++N LNHAV++VGYG+++ +P WI++NSWG +WG ++GYF V+R N+CG+
Sbjct: 265 TCRNDNGLNHAVLLVGYGVKNNIPYWILKNSWGSQWG-ENGYFRVQRNINSCGM 317
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 112 bits (281), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 91/174 (52%), Gaps = 9/174 (5%)
Query: 3 ESQYAIKHGTLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
E YA K G L+ LS+ QLI+C + GC GG + +Y+ GL++E Y ++ ++G
Sbjct: 146 EGAYARKSGKLVSLSEQQLIDCCTDTSAGCDGGSLDDNFKYVMKDGLQSEESYTYKGEDG 205
Query: 62 VTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
C Y+ V +VS + D + GP+ GM+ + L Y+ +
Sbjct: 206 A---CKYNVASVVTKVSKYTSIPAEDEDALLEAVATVGPVSVGMDASYLSSYDSGIYEDQ 262
Query: 120 DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
D P+ LNHA++ VGYG + WI++NSWG WG + GYF + RG N CGI
Sbjct: 263 DCSPA-GLNHAILAVGYGTENGKDYWIIKNSWGASWG-EQGYFRLARGKNQCGI 314
>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
Length = 887
Score = 112 bits (281), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 105/185 (56%), Gaps = 14/185 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E QYAIKHG LL LS+ +L++C+ ++GC GG + A + + K GLE E+DYP+ +N
Sbjct: 701 IEGQYAIKHGRLLSLSEQELVDCDDLDEGCNGGLPDNAYRAIEKLGGLELESDYPYEAEN 760
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C + KV+++ + ++T + L GP+ G+N +Q Y G +
Sbjct: 761 ---EKCHFKKNLAKVQLASAVNITSNETQMAQWLVQNGPISIGINANAMQFYVGGVSHPF 817
Query: 119 NDVCPSENLNHAVVIVGYG------MRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+C +NL+H V+IVGYG ++P W ++NSWG RWG + GY+ V RG CG
Sbjct: 818 KFLCNPKNLDHGVLIVGYGTSDYPLFHKKLPYWTIKNSWGKRWG-EQGYYRVYRGDGTCG 876
Query: 172 IESYG 176
+ +
Sbjct: 877 LNTLA 881
>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
Length = 367
Score = 112 bits (281), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 103/195 (52%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L LS+ QL++C+ + GC GG N A +Y LK GLE E
Sbjct: 168 LEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALKAGGLERE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
ADYP+ +G G C +D KV VS+F V + D L +GPL +N A +Q
Sbjct: 228 ADYPYTGTDG--GTCKFDKSKVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S+ +H V++VGYG + P WI++NSWG+ WG ++G
Sbjct: 286 YVGGV-----SCPYICSKRQDHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 340 YYKICRGRNICGVDS 354
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 112 bits (280), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 66/174 (37%), Positives = 98/174 (56%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ +K G LL LS+ QLI+C+ ++GC GG K +K GLE +DYP++
Sbjct: 423 IEGQWFLKTGELLSLSEQQLIDCDNVDEGCNGGYPPKTYGAVIKMGGLELNSDYPYK--- 479
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDY-NGKLIRK 118
+ +C D +K+KV ++D +VF ++ + L GPL + +N L+ Y G +
Sbjct: 480 ALAEKCHMDRQKLKVYINDSVVFPRNEHLQAEALKLMGPLSSALNANPLKFYKTGIMHLP 539
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C LNHAV+ VGYG + +P W V+NSWG +DGYF + RG CGI
Sbjct: 540 VASCFPRALNHAVLTVGYGTENGLPYWTVKNSWGTAFGEDGYFRIYRGGGTCGI 593
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 82/155 (52%), Gaps = 6/155 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G LL LS Q+++C+ + GC GG + + + GL+ +ADY ++
Sbjct: 72 IEGQWFLKSGELLHLSVQQVLDCDHVDHGCNGGYPPQVYRQVNQMGGLQLDADYSYK--- 128
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G+C D K + V+ ++ + ++ F+ L GPL + +N LQ Y ++
Sbjct: 129 AAVGKCHTDRSKFRAYVNSSVILSQNEQFQANKLKTIGPLASTLNARTLQFYRKGIMHPT 188
Query: 120 -DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR 153
C LNHAV+ VGYG +P WIV+NSW R
Sbjct: 189 PSACNPGQLNHAVLTVGYGTEQGMPYWIVKNSWSR 223
>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
Length = 325
Score = 112 bits (280), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 96/176 (54%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G L+ LSK QL++C+ + GC GG + +K GLE ++ YP+ +
Sbjct: 145 VEGQWFLKTGRLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEIKRMGGLELQSAYPYTSWK 204
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM-LYHYGPLVAGMNGALLQDY-NGKLIRK 118
C D K+ ++ D +V + + L +GP+ +N LQ Y +G L
Sbjct: 205 QA---CRIDRSKLVAKIDDSIVLETDEEKQAAWLAEHGPMSTCLNAGPLQFYQSGILHPS 261
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+C E LNHAV+ VGY H VP W VRNSWG RWG ++GYF + RG CGI+
Sbjct: 262 KAMCSPEGLNHAVLTVGYDTEHGVPYWTVRNSWGTRWG-ENGYFRIYRGDGTCGID 316
>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
distachyon]
Length = 373
Score = 112 bits (279), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 73/196 (37%), Positives = 102/196 (52%), Gaps = 28/196 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC---------NIYNQGCQGGGFNKAIQYLKHAG-LEAE 51
+E + + G LL LS+ QL++C N + GC GG A YL AG L +
Sbjct: 174 VEGAHFVATGKLLNLSEQQLVDCDHTCDAVAKNECDSGCSGGLMTNAYTYLIRAGGLMEQ 233
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNGALLQ 109
A YP+ G G C +DA KV VRV+ F + D R L GPL G+N A +Q
Sbjct: 234 AAYPY---TGAQGTCRFDANKVAVRVTSFTAVPPDDEDQIRASLVRAGPLAVGLNAAFMQ 290
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDGYF 161
Y G + +CP + +NH V++VGYG R P+ WI++NSWG+ WG + GY+
Sbjct: 291 TYLGG-VSCPLLCPRKLINHGVLLVGYGARGLAPLRLGYRPYWIIKNSWGKEWG-EGGYY 348
Query: 162 TVERGT---NACGIES 174
+ RG N CG++S
Sbjct: 349 RLCRGARNRNVCGVDS 364
>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
Length = 1118
Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats.
Identities = 62/170 (36%), Positives = 101/170 (59%), Gaps = 8/170 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES +AIK G L+ +S+ QL++C+ Y+ GC GG A++Y G + YP+ + G
Sbjct: 87 VESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGGLPWDALRYFVANGAMSLKSYPYVAKEG 146
Query: 62 VTGRCAYDARKVKVRVSDFLVFN--GSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C YD+ KV++R+ ++ D + LY+ GPL + + L YNG ++ +
Sbjct: 147 ---KCRYDSSKVEIRLKEYKHKEKLSEDQIKEHLYNIGPLSIAITSSPLASYNGGILIE- 202
Query: 120 DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
+ S +NHAV++VGYG + V WIV+NSWG+ WG ++GYF ++ G N
Sbjct: 203 ECHRSYLINHAVLLVGYGKENGVKYWIVKNSWGQNWG-ENGYFRMKMGVN 251
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 67/187 (35%), Positives = 106/187 (56%), Gaps = 11/187 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKA-IQYLKHAGLEAEADYPFRNQ 59
+ES AIK G L+ +S+ QL++C+ +N GC GG +K+ Y G + YP+
Sbjct: 938 VESINAIKTGKLIDVSEQQLVDCDEWNFGCSGGIACSKSHFSYFHKKGAMSLESYPYV-- 995
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G G+C Y++ KV +R+ D+ F D + LY+ GPL ++ + + Y G ++
Sbjct: 996 -GKEGQCRYNSSKVVIRLKDYQYFIALSEDEIKEYLYNIGPLSIDIDSSQIHHYKGGIVI 1054
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYG 176
K + + NHAV++VGYG + V WIV+NSWG+ WG + GYF ++RG N C + +
Sbjct: 1055 K-ECQEVKKTNHAVLLVGYGKENGVEYWIVKNSWGQNWG-EKGYFRIQRGVN-CLLLAKD 1111
Query: 177 GICTRTL 183
GI T +
Sbjct: 1112 GITTAVI 1118
Score = 91.7 bits (226), Expect = 1e-16, Method: Composition-based stats.
Identities = 54/160 (33%), Positives = 88/160 (55%), Gaps = 14/160 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES +AIK G L+ +S+ QL++C+ + GC GG A++Y + G + YP+ QN
Sbjct: 638 VESIHAIKTGKLVHVSEQQLVDCDSQDSGCSGGLTWNAMRYFRTNGAVSLKSYPYVAQN- 696
Query: 62 VTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI--- 116
C YD+ KV +R+ D+ + D + LY+ G L + L Y G ++
Sbjct: 697 --ENCRYDSNKVVIRLKDYKHITQLSEDQIKEHLYNIGLLSIDITSTQLTWYEGGILIEE 754
Query: 117 -RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWG 155
R++D+ ++HAV++V YG + V WIV+NSWG+ G
Sbjct: 755 CRRSDL-----VDHAVLLVEYGKENSVEYWIVKNSWGQNG 789
Score = 40.0 bits (92), Expect = 0.43, Method: Composition-based stats.
Identities = 16/33 (48%), Positives = 25/33 (75%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG 34
+ES +AIK G L+ +S+ QL++C+ Y+ GC GG
Sbjct: 421 VESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGG 453
>gi|242045644|ref|XP_002460693.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
gi|241924070|gb|EER97214.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
Length = 373
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 71/187 (37%), Positives = 95/187 (50%), Gaps = 25/187 (13%)
Query: 8 IKHGTLLPLSKSQLIEC---------NIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
+ G LL LS+ QL++C N N GC GG A YL K GL + YP+
Sbjct: 181 LATGKLLELSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYAYLMKSGGLMEQRAYPY- 239
Query: 58 NQNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKL 115
G G C +D K VRV++F D R L GPL G+N A +Q Y G
Sbjct: 240 --TGAPGPCRFDPAKAAVRVANFTAVPAGDEAQIRAALVRRGPLAVGLNAAFMQTYVGG- 296
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQV-------PVWIVRNSWG-RWGPDDGYFTVERGT 167
+ +CP +NH V++VGYG R P WI++NSWG RWG + GY+ + RG+
Sbjct: 297 VSCPLLCPRAWVNHGVLLVGYGARGFAALRLGYRPYWIIKNSWGERWG-EQGYYRLCRGS 355
Query: 168 NACGIES 174
N CG++S
Sbjct: 356 NVCGVDS 362
>gi|358339045|dbj|GAA32724.2| cathepsin F, partial [Clonorchis sinensis]
Length = 271
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/174 (39%), Positives = 94/174 (54%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K G LL LS+ QL++C+ ++GC GG K + K GLE +DYP+
Sbjct: 91 VEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEIEKMGGLELASDYPY---T 147
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
GV G C + K V+D V S+ + + L GPL + +N LLQ Y G +I
Sbjct: 148 GVDGICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 207
Query: 120 D-VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+C LNHAV+ VGYG +P WIV+NSWG + GYF + RG CGI
Sbjct: 208 PFLCNPHGLNHAVLTVGYGTEFGIPYWIVKNSWGVGFGEKGYFRIFRGAGTCGI 261
>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/174 (39%), Positives = 94/174 (54%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K G LL LS+ QL++C+ ++GC GG K + K GLE +DYP+
Sbjct: 148 VEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEIEKMGGLELASDYPY---T 204
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
GV G C + K V+D V S+ + + L GPL + +N LLQ Y G +I
Sbjct: 205 GVDGICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264
Query: 120 D-VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+C LNHAV+ VGYG +P WIV+NSWG + GYF + RG CGI
Sbjct: 265 PFLCNPHGLNHAVLTVGYGTEFGIPYWIVKNSWGVGFGEKGYFRIFRGAGTCGI 318
>gi|53748485|emb|CAH59428.1| cysteine protease 2 [Plantago major]
Length = 245
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/194 (35%), Positives = 99/194 (51%), Gaps = 29/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN----------IYNQGCQGGGFNKAIQY-LKHAGLEA 50
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL+
Sbjct: 47 LEGANYLATGELISLSEQQLVDCDHECDPEEGADSCDAGCNGGLMNNAFEYALKAGGLQK 106
Query: 51 EADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQ 109
E DYP+ ++G C +D K+ V +F V + D L YGPL G+N A +Q
Sbjct: 107 EKDYPYTGKDGT---CKFDKTKIAASVHNFSVVSIDEDQIAANLVKYGPLAVGINAAWMQ 163
Query: 110 DYNGKLIRKNDVCP---SENLNHAVVIVGYGMRH------QVPVWIVRNSWGRWGPDDGY 160
Y G + CP ++L+H V+IVGYG + P WI++NSWG + GY
Sbjct: 164 TYIGGV-----SCPYICGKSLDHGVLIVGYGTGYAPVRLKNKPYWIIKNSWGESWGESGY 218
Query: 161 FTVERGTNACGIES 174
+ + RG N CG+ES
Sbjct: 219 YKICRGRNVCGVES 232
>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 367
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 102/195 (52%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L LS+ QL++C+ + GC GG N A +Y LK GLE E
Sbjct: 168 LEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALKAGGLERE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +G G C +D KV VS+F V + D L +GPL +N A +Q
Sbjct: 228 EDYPYTGTDG--GTCKFDKSKVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S+ +H V++VGYG + P WI++NSWG+ WG ++G
Sbjct: 286 YVGGV-----SCPYICSKRQDHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 340 YYKICRGRNICGVDS 354
>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
Length = 323
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 104/177 (58%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AI H L+ LS+ Q+I+C+ + GC+GG + A + + G++ E DYP+ + N
Sbjct: 145 LESQFAIAHDRLINLSEQQMIDCDSVDVGCEGGLLHTAFEAIISMGGVQIENDYPYESSN 204
Query: 61 GVTGRCAYDARK--VKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
C D K V V+ + + + + +L GP+ ++ + + +Y +I+
Sbjct: 205 NY---CRMDPTKFVVGVKQCNRYITIYEEKLKDVLRLAGPIPVAIDASDILNYEQGIIK- 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
C + LNHAV++VGYG+ + VP WI++NSWG WG + G+F +++ NACGI++
Sbjct: 261 --YCANNGLNHAVLLVGYGVENNVPYWILKNSWGTDWG-EQGFFKIQQNVNACGIKN 314
>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
Length = 369
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 100/192 (52%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG A +Y L+ GLE E
Sbjct: 170 LEGANFLATGELVSLSEQQLVDCDHLCDPEEAGACDSGCNGGLMTTAYEYVLQSGGLEKE 229
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G C +D K+ V++F V + D L +GPL G+N +Q
Sbjct: 230 KDYPYTGKDGT---CKFDKSKIAAAVANFSVVSLDEDQIAANLVKHGPLSVGINAVFMQT 286
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDGYFT 162
Y G + +C NL+H V++VGYG P+ WIV+NSWG WG ++GY+
Sbjct: 287 YIGG-VSCPYICSKRNLDHGVLLVGYGAAGYAPIRFKDKPYWIVKNSWGENWG-EEGYYK 344
Query: 163 VERGTNACGIES 174
+ RG N CGI+S
Sbjct: 345 ICRGNNICGIDS 356
>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
rotundata]
Length = 884
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 101/186 (54%), Gaps = 14/186 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E QYAIKH LL LS+ +L++C+ + GC GG A + + K GLE E DYP+ +N
Sbjct: 698 IEGQYAIKHKKLLSLSEQELVDCDNLDDGCGGGYMINAYKTVEKLGGLELETDYPYDARN 757
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C + K KV+V+ L + N + L GP+ G+N +Q Y G +
Sbjct: 758 ---EKCHFLKNKAKVQVASALNITNDEKKMAQWLVKNGPISVGINANAMQFYFGGVSHPF 814
Query: 119 NDVCPSENLNHAVVIVGYG------MRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+C NL+H V+IVGY + ++P WI++NSWG +WG + GY+ V RG CG
Sbjct: 815 KFLCDPANLDHGVLIVGYATSTYPLFKKKLPYWIIKNSWGPKWG-EQGYYRVYRGDGTCG 873
Query: 172 IESYGG 177
+ +
Sbjct: 874 VNAMAS 879
>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 102/195 (52%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L LS+ QL++C+ + GC GG N A +Y LK GLE E
Sbjct: 168 LEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALKAGGLERE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +G G C +D KV VS+F V + D L +GPL +N A +Q
Sbjct: 228 EDYPYTGTDG--GTCKFDKSKVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S+ +H V++VGYG + P WI++NSWG+ WG ++G
Sbjct: 286 YVGGV-----SCPYICSKRQDHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 340 YYKICRGRNICGVDS 354
>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
Length = 715
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 107/186 (57%), Gaps = 11/186 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+AI L+ LS+ +L++C+ ++GC GG ++A + ++ GLE E DY +R N
Sbjct: 535 IEGQWAISKKKLVSLSEQELVDCDKVDEGCNGGLPSQAYKEIIRLGGLETETDYKYRGHN 594
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C+ D K++V+++ + + ++T L GP+ G+N +Q Y G +
Sbjct: 595 E---KCSMDKSKIRVKINGSVSISSNETEMAAWLVKNGPISIGINAFAMQFYMGGISHPW 651
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYGG 177
+ C + L+H V+IVGYG++ P WI++NSWG WG + GY+ V RG CG+ +
Sbjct: 652 KIFCNPKELDHGVLIVGYGVKGSKPYWIIKNSWGPDWG-EKGYYLVYRGAGVCGLNT--- 707
Query: 178 ICTRTL 183
+CT +
Sbjct: 708 MCTSAV 713
>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
Length = 367
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 102/195 (52%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L LS+ QL++C+ + GC GG N A +Y LK GLE E
Sbjct: 168 LEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALKAGGLERE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +G G C +D KV VS+F V + D L +GPL +N A +Q
Sbjct: 228 EDYPYTGTDG--GTCKFDKSKVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S+ +H V++VGYG + P WI++NSWG+ WG ++G
Sbjct: 286 YVGGV-----SCPYICSKRQDHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 340 YYKICRGRNICGVDS 354
>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
Length = 537
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 107/186 (57%), Gaps = 14/186 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G LL LS+ +L++C+ + GC GG + A + ++ GLE E +YP+ ++
Sbjct: 352 IEGQWKLKTGKLLSLSEQELVDCDKMDDGCDGGYMDNAYRAIEQLGGLETEEEYPYEAED 411
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C+++ KV++S + + ++T + L H GP+ G+N +Q Y G +
Sbjct: 412 D---KCSFNKSLSKVQISGAVNISSNETNMAKWLVHNGPISIGINANAMQFYVGGVSHPW 468
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
+C +N++H V+IVGYG++ Q+P W+V+NSWG WG + GY+ V RG CG
Sbjct: 469 KALCNPKNIDHGVLIVGYGIKEYPLFNKQLPYWVVKNSWGPGWG-EQGYYRVFRGDGTCG 527
Query: 172 IESYGG 177
+ +
Sbjct: 528 VNTMAS 533
>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
Length = 326
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 97/177 (54%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+
Sbjct: 148 VEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQ--KMGGLELASDYPY-- 203
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIR 117
GV G C D K ++ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 204 -TGVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMR 262
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 263 PR-LCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 318
>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
Length = 803
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 104/184 (56%), Gaps = 14/184 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E QYAIK G L+ LS+ +L++C+ Y+ GC+GG F A ++ GLE E+DYP+ +
Sbjct: 618 IEGQYAIKTGNLVSLSEQELVDCDKYDDGCEGGLFETAYHAIEELGGLELESDYPY---S 674
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C +++ +V+V ++ + + N + L GP+ G+N +Q Y G +
Sbjct: 675 GRDNTCHFNSSEVRVSITSSVNISNDETDMAKWLVANGPISIGINANAMQFYLGGVSHPL 734
Query: 119 NDVCPSENLNHAVVIVGYGM-------RHQVPVWIVRNSWGRWGPDDGYFTVERGTNACG 171
+C + L+H V+IVGYG+ RH +P W+++NSW + GY+ + RG +CG
Sbjct: 735 KFLCDPKTLDHGVLIVGYGIHRTWLLHRH-LPYWLIKNSWSSYWGAKGYYMLYRGDGSCG 793
Query: 172 IESY 175
+ +
Sbjct: 794 VNQW 797
>gi|449512065|ref|XP_002196301.2| PREDICTED: cathepsin O-like, partial [Taeniopygia guttata]
Length = 193
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 97/178 (54%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEA--EADYPFRNQ 59
+ES YAIK TL LS Q+I+C+ N GC GG A+ +L ++ +++Y F+ Q
Sbjct: 13 IESAYAIKRNTLEELSVQQVIDCSYNNYGCNGGSTVSALSWLNQTKVKLVRDSEYTFKAQ 72
Query: 60 NGVTGRCAYDARK-VKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
TG C Y R V ++ F ++ S + RML +GPL ++ QDY G +
Sbjct: 73 ---TGLCHYFERSDFGVSITGFASYDFSGQEEEMMRMLVSWGPLAVTVDAVSWQDYLGGI 129
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ +P WIV+NSWG WG DGY V+ G N CGI
Sbjct: 130 IQYH--CSSGRANHAVLITGFDRTGSIPYWIVQNSWGPTWGI-DGYVRVKMGGNVCGI 184
>gi|326918260|ref|XP_003205408.1| PREDICTED: cathepsin O-like, partial [Meleagris gallopavo]
Length = 283
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 99/178 (55%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEA--EADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L ++ +++Y F+ Q
Sbjct: 103 IESAYAIKGNNLEELSVQQVIDCSYSNYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQ 162
Query: 60 NGVTGRCAYDARK-VKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
TG C Y AR V ++ F ++ S + R+L +GPL ++ QDY G +
Sbjct: 163 ---TGLCHYFARSDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGI 219
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ +P WIV+NSWGR WG DGY V+ G+N CGI
Sbjct: 220 IQYH--CSSGKANHAVLITGFDRTGSIPYWIVQNSWGRTWGI-DGYVRVKIGSNVCGI 274
>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
Length = 381
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 97/187 (51%), Gaps = 25/187 (13%)
Query: 8 IKHGTLLPLSKSQLIEC---------NIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
+ G L+ LS+ QL++C N N GC GG A YL + GL ++ YP+
Sbjct: 189 LATGELVDLSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYSYLMESGGLMEQSAYPY- 247
Query: 58 NQNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKL 115
G G C +D +V VRV++F D R L GPL G+N A +Q Y G
Sbjct: 248 --TGAAGPCRFDPTQVAVRVANFTAVPAGDEAQIRAALVRRGPLAVGLNAAFMQTYVGG- 304
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQV-------PVWIVRNSWGR-WGPDDGYFTVERGT 167
+ +CP +NH V++VGYG R P WI++NSWG+ WG + GY+ + RG+
Sbjct: 305 VSCPLICPRAWVNHGVLLVGYGARGFAALRLGYRPYWIIKNSWGKQWG-EQGYYRLCRGS 363
Query: 168 NACGIES 174
N CG++S
Sbjct: 364 NVCGVDS 370
>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
Length = 328
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 94/174 (54%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K G LL LS+ QL++C+ ++GC GG K + K GLE +DYP+
Sbjct: 148 VEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEIEKMGGLELASDYPY---T 204
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
GV G C + K V++ V S+ + + L GPL + +N LLQ Y G +I
Sbjct: 205 GVDGICYMNQSKFVAYVNESTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264
Query: 120 D-VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+C LNHAV+ VGYG +P WIV+NSWG + GYF + RG CGI
Sbjct: 265 PFLCNPHGLNHAVLTVGYGTEFGIPYWIVKNSWGVGFGEKGYFRIFRGAGTCGI 318
>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
Length = 325
Score = 109 bits (273), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 102/177 (57%), Gaps = 14/177 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG--LEAEADYPFRNQ 59
+ESQY+IK+ + LS QL++C+ N GC GG + A++ + +AG + E DYP++
Sbjct: 146 IESQYSIKYNKQISLSVQQLVDCDTSNMGCAGGLLHTALEQIINAGGGVLQEEDYPYK-- 203
Query: 60 NGVTGRCAYDARKVKVRV---SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV +C V+V ++V N + + +L GP+ ++ A + DY+ +I
Sbjct: 204 -GVDKQCNLPHNNFAVQVLGCYRYIVMN-EEKLKDVLRAVGPIPVAIDAASIVDYSRGII 261
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
R C LNHAV++VGYG++ VP W ++N+WG WG + GYF V + N+CGI
Sbjct: 262 R---TCTYYGLNHAVLLVGYGVQDGVPYWTLKNTWGDDWG-EHGYFRVRQNVNSCGI 314
>gi|358339355|dbj|GAA47435.1| cathepsin F [Clonorchis sinensis]
Length = 1157
Score = 109 bits (273), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 104/201 (51%), Gaps = 17/201 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRN 58
+E QY ++ LL LS+ QL++C+ +QGC GG G + IQ L GLE EADYP+
Sbjct: 496 IEGQYFMRVHRLLSLSEQQLVDCDRIDQGCAGGTPYGAFEGIQQL--GGLELEADYPYL- 552
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C + + V ++ + D + L+ +GPL G+NGALLQ Y+ +++
Sbjct: 553 --GHQDNCQSNPLRFVVSINGSVQLPKDEDQIAQYLFDHGPLSVGINGALLQYYSSGIMQ 610
Query: 118 KN-DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDG------YFTVERGTNA 169
D C +NHA + VG+G VP W ++NSWG WG +D Y T+ERGT
Sbjct: 611 PLWDNCNPAEMNHAGLAVGFGFEQDVPYWTIKNSWGMLWGEEDNIKQAEFYQTLERGTAL 670
Query: 170 CGIESYGGICTRTLNGVFLLL 190
G+ + + FL L
Sbjct: 671 YGVTQFSDLTGEEFQETFLGL 691
Score = 102 bits (254), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 8/171 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI--QYLKHAGLEAEADYPFRNQ 59
+E Q+ K G L+ LSK QL++C+ ++GC GGG+ A + GLE E DY + +
Sbjct: 745 IEGQWFRKTGQLVSLSKQQLVDCDRSSRGC-GGGYPPATYDSIRRIGGLEIELDYRYTGR 803
Query: 60 NGVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDY-NGKLIR 117
+GV C + RK V S + +T L ++GP+ +N LLQ Y +G +
Sbjct: 804 DGV---CHQNPRKFVAYVNSSVALTKDENTIAEWLSYHGPISMALNARLLQFYVSGIMHP 860
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTN 168
CP ++++HAV+ VG+G + VP WIV+NSWG ++GYF + RG +
Sbjct: 861 PAAYCPVKDISHAVLSVGFGTKGNVPFWIVKNSWGTLWGEEGYFRIYRGDD 911
Score = 72.8 bits (177), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 76/152 (50%), Gaps = 11/152 (7%)
Query: 20 QLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQNGVTGR--CAYDARKVKVR 76
QL++C+ ++GC+GG + + + GL+ DYP+ + R C ++ ++
Sbjct: 24 QLVDCDHVDRGCEGGFPLDAFMAVQRLGGLQLSIDYPY-----IASRQACQFNPKQAVAF 78
Query: 77 VSDFLVFNGSDTF-RRMLYHYGPLVAGMNGALLQDYN-GKLIRKNDVCPSENLNHAVVIV 134
V+ F ++ L+ GPL G+N L+ YN G L + C E LNHA + V
Sbjct: 79 VTGFAALPRNELLIAEYLHRNGPLSVGLNSRTLKFYNSGILNLAAEQCDPEALNHAALAV 138
Query: 135 GYGMRHQVPVWIVRNSWGR-WGPDDGYFTVER 165
G+G P WI++N++G+ WG F ER
Sbjct: 139 GFGTDESTPFWIIKNTFGKDWGEQLDEFEDER 170
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 71/142 (50%), Gaps = 7/142 (4%)
Query: 18 KSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQNGVTGRCAYDARKVKVR 76
+++++C+ + GC GG A + ++ GLE YP+ G C D R
Sbjct: 246 SAEVVDCDHADHGCSGGFPIHAYECVQRLGGLELAVRYPYV---GYQQYCQADPRYFVAY 302
Query: 77 VSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN-DVCPSENLNHAVVIV 134
++ + S+ + L +GPL ++ LLQ Y ++ + C E LNHAV+ V
Sbjct: 303 INGSVALPKDSEQIAKFLATFGPLSVVLDARLLQYYRSGILNPSVAYCNPEELNHAVLSV 362
Query: 135 GYGMRHQVPVWIVRNSWG-RWG 155
G+G +P WI++NSWG +WG
Sbjct: 363 GFGTEQGIPYWIIKNSWGEQWG 384
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 56/111 (50%), Gaps = 7/111 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI--QYLKHAGLEAEADYPFRNQ 59
+E Q+ K G LL LS+ QLI+C+ + GC GGG+ +K GLE ADYP+
Sbjct: 1032 IEGQWFKKTGQLLTLSEQQLIDCDSVDDGC-GGGYPPDTYGDIVKMGGLELNADYPYIAA 1090
Query: 60 NGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQ 109
+GV C + K + V+ LV D L GPL AG+N LQ
Sbjct: 1091 DGV---CKMERSKFRAYVNKSLVLPTKEDQQAVWLSKNGPLSAGINADYLQ 1138
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 109 bits (273), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 97/176 (55%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ +K G L+ LSK QL++C++ + GC GG + ++ GLEA+ DYP+
Sbjct: 145 VEGQWFLKTGQLVSLSKQQLVDCDVQDSGCDGGYPPTTYGEIIRMGGLEAQRDYPYV--- 201
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM-LYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C D K+ +++ +V ++ + + +GP+ +G+N LQ Y + +
Sbjct: 202 GREQPCKLDESKLLAKINSSIVLEANEKKQAAYIAEHGPMSSGINAVTLQFYQSGISHPS 261
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
C + LNH V+ VGYG VP WI++NSWG WG + GYF + RG CGIE
Sbjct: 262 KSQCQPDWLNHGVLSVGYGTEDGVPYWIIKNSWGTGWG-EKGYFRLYRGDGTCGIE 316
>gi|224049669|ref|XP_002196637.1| PREDICTED: cathepsin O [Taeniopygia guttata]
Length = 299
Score = 109 bits (273), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 103/194 (53%), Gaps = 20/194 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEA--EADYPFRNQ 59
+ES YAIK TL LS Q+I+C+ N GC GG A+ +L ++ +++Y F+ Q
Sbjct: 119 IESAYAIKRNTLEELSVQQVIDCSYNNYGCNGGSTVSALSWLNQTKVKLVRDSEYTFKAQ 178
Query: 60 NGVTGRCAYDARK-VKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
TG C Y R V ++ F ++ S + RML +GPL ++ QDY G +
Sbjct: 179 ---TGLCHYFERSDFGVSITGFAAYDFSGQEEEMMRMLVSWGPLAVTVDAVSWQDYLGGI 235
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
I+ + C S NHAV+I G+ +P WIV+NSWG WG DGY V+ G N CGI
Sbjct: 236 IQYH--CSSGRANHAVLITGFDRTGSIPYWIVQNSWGPTWGI-DGYVRVKMGGNVCGIAD 292
Query: 175 YGGICTRTLNGVFL 188
T++ VF+
Sbjct: 293 -------TVSAVFV 299
>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
Length = 434
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 102/181 (56%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+E +AIK LL LS+ +LI+C+ + GC GG + + +K GLE E DYP+ +N
Sbjct: 248 IEGLWAIKKHELLSLSEQELIDCDKIDNGCNGGYMPETYEAIMKLGGLETETDYPYEAEN 307
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C + ++KV+++ + S+ + LY GP+ AG+N +Q Y G +
Sbjct: 308 E---KCNLNKTEIKVKINGAVNLTKSELDIAKWLYKNGPVSAGLNANAMQFYLGGISHPP 364
Query: 120 DV-CPSENLNHAVVIVGYG------MRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
+ C E +H ++IVGYG ++ +P WI++NSWG+ WG + GY+ + RG+ CG
Sbjct: 365 KILCNPEEQDHGILIVGYGIHKSSILKRTIPYWIIKNSWGKHWG-EKGYYRLYRGSGVCG 423
Query: 172 I 172
I
Sbjct: 424 I 424
>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
Length = 326
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 96/177 (54%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+
Sbjct: 148 VEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQ--KMGGLELASDYPY-- 203
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIR 117
GV G C D K ++ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 204 -TGVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMR 262
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 263 PK-WCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 318
>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 101/174 (58%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ QLI+C+ + GC GG + A + ++ G++AE DYP+ +
Sbjct: 146 LESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTAYEAVMQMGGVQAENDYPYEGSD 205
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G + ++ VF + + +L GP+ ++ + + +Y ++R
Sbjct: 206 GNCRVDVAKFVVKVKKCYRYIAVF--EEKLKDLLRIVGPIPVAIDASDIVNYRRGIMR-- 261
Query: 120 DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP WI++N+WG WG + GYF V++ NACGI
Sbjct: 262 -YCSNYGLNHAVLLVGYGVENNVPYWILKNTWGEDWG-EQGYFRVQQNINACGI 313
>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
Length = 363
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/186 (35%), Positives = 96/186 (51%), Gaps = 18/186 (9%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNIYNQ-----GCQGGGFNKAIQYLKHAG-LEAEADYPF 56
E + + G LL LS+ QL++C+ ++ GC GG A +YL AG LE E YP+
Sbjct: 171 EGAHFVSTGKLLSLSEQQLVDCDQADKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPY 230
Query: 57 RNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
G G C +D KV VRV +F + L +GPL G+N +Q Y G
Sbjct: 231 ---TGKRGHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGG- 286
Query: 116 IRKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGYFTVERGTN 168
+ +C N+NH V++VGYG + P WI++NSWG+ ++GY+ + RG +
Sbjct: 287 VSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHD 346
Query: 169 ACGIES 174
CGI S
Sbjct: 347 ICGINS 352
>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
Length = 443
Score = 109 bits (272), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 101/181 (55%), Gaps = 11/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+AI G L+ +S+ +L+ C+ + GC GG + A +L H G + EA+YP+ +
Sbjct: 147 IEGQHAIATGQLVAVSEQELVSCDPIDDGCNGGLMDNAFGWLISAHKGQIATEANYPYVS 206
Query: 59 QNGVTGRCAY--DARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG+ C+ +++ V +S F + + ++ +GPL G++ + Q Y G +
Sbjct: 207 GNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGPLSIGVDASTWQSYAGGI 266
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIES 174
+ CP + ++H V+IVG+ P WI++NSW WG ++GY V +G+N CG+ S
Sbjct: 267 M---SYCPQDQIDHGVLIVGFDDTASTPYWIIKNSWTANWG-EEGYIRVAKGSNQCGLTS 322
Query: 175 Y 175
+
Sbjct: 323 H 323
>gi|13625989|gb|AAK35220.1|AF362769_1 pre-procathepsin L [Paragonimus westermani]
Length = 235
Score = 108 bits (271), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 96/176 (54%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G L+ LSK QL++C+ + GC GG + +K GLE ++ YP+
Sbjct: 55 VEGQWFLKTGRLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEIKRMGGLELQSAYPY---T 111
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM-LYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C D K+ ++ D +V ++ + L +GP+ +N LQ Y ++ +
Sbjct: 112 GWEQACRLDRSKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPS 171
Query: 120 D-VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ C E LNHAV+ VGY VP W VRNSWG RWG ++GYF + RG CGI+
Sbjct: 172 EYACSPEGLNHAVLTVGYDTERGVPYWTVRNSWGTRWG-ENGYFRIYRGDGTCGID 226
>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
Length = 325
Score = 108 bits (271), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 96/176 (54%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G L+ LSK QL++C+ + GC GG + +K GLE ++ YP+
Sbjct: 145 VEGQWFLKTGRLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEIKRMGGLELQSAYPY---T 201
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM-LYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C D K+ ++ D +V ++ + L +GP+ +N LQ Y ++ +
Sbjct: 202 GWEQACRLDRSKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPS 261
Query: 120 D-VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ C E LNHAV+ VGY VP W VRNSWG RWG ++GYF + RG CGI+
Sbjct: 262 EYACSPEGLNHAVLTVGYDTERGVPYWTVRNSWGTRWG-ENGYFRIYRGDGTCGID 316
>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
Length = 428
Score = 108 bits (271), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 101/181 (55%), Gaps = 11/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+AI G L+ +S+ +L+ C+ + GC GG + A +L H G + EA+YP+ +
Sbjct: 132 IEGQHAIATGQLVAVSEQELVSCDPIDDGCNGGLMDNAFGWLISAHKGQIATEANYPYVS 191
Query: 59 QNGVTGRCAY--DARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG+ C+ +++ V +S F + + ++ +GPL G++ + Q Y G +
Sbjct: 192 GNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGPLSIGVDASTWQSYAGGI 251
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIES 174
+ CP + ++H V+IVG+ P WI++NSW WG ++GY V +G+N CG+ S
Sbjct: 252 M---SYCPQDQIDHGVLIVGFDDTASTPYWIIKNSWTANWG-EEGYIRVAKGSNQCGLTS 307
Query: 175 Y 175
+
Sbjct: 308 H 308
>gi|393904668|gb|EFO15826.2| hypothetical protein LOAG_12683 [Loa loa]
Length = 202
Score = 108 bits (271), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 100/177 (56%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E +AIK G L+ LS+ +LI+C++ +QGC+GG N + ++ GLE+E DYP+ +
Sbjct: 22 IEGAWAIKKGKLISLSEQELIDCDVIDQGCKGGLPLNAYKEIIRMGGLESEKDYPY---D 78
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G +C +++ V ++D + + + GP+ G+N LQ Y +
Sbjct: 79 GHGEKCHLVRKEIAVYINDSIQLPDDEIKIAAWVAKKGPVSIGVNAGPLQFYRHGISHPW 138
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
C ++NH V+IVGYG P WI++NSWG +WG ++GY+ + RG N CG++
Sbjct: 139 KAFCLPSHINHGVLIVGYGQEANKPYWIIKNSWGTKWG-ENGYYRLYRGKNVCGVKE 194
>gi|312095086|ref|XP_003148243.1| hypothetical protein LOAG_12683 [Loa loa]
Length = 195
Score = 108 bits (271), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 100/177 (56%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E +AIK G L+ LS+ +LI+C++ +QGC+GG N + ++ GLE+E DYP+ +
Sbjct: 15 IEGAWAIKKGKLISLSEQELIDCDVIDQGCKGGLPLNAYKEIIRMGGLESEKDYPY---D 71
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G +C +++ V ++D + + + GP+ G+N LQ Y +
Sbjct: 72 GHGEKCHLVRKEIAVYINDSIQLPDDEIKIAAWVAKKGPVSIGVNAGPLQFYRHGISHPW 131
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
C ++NH V+IVGYG P WI++NSWG +WG ++GY+ + RG N CG++
Sbjct: 132 KAFCLPSHINHGVLIVGYGQEANKPYWIIKNSWGTKWG-ENGYYRLYRGKNVCGVKE 187
>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
Length = 620
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YA+K+G L S+ +L++C+ + C GG + A + +K GLE EA+YP+ +
Sbjct: 433 IEGLYALKYGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYEAKK 492
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+V DF+ G++T + L GP+ G+N +Q Y G +
Sbjct: 493 K---QCHFNKTMSHVQVKDFVDLPKGNETAMQEWLVSNGPISIGINANAMQFYRGGVSHP 549
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V++VGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 550 WKALCSKKNLDHGVLVVGYGVSDYPNYHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 608
Query: 171 GI 172
G+
Sbjct: 609 GV 610
>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
Length = 371
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 101/192 (52%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G LL L++ +L++C+ + GC GG A +Y L+ GLE E
Sbjct: 172 LEGTNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGGLMTTAYEYVLQSGGLEKE 231
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G C +D K+ V++F V + D L +GPL G+N +Q
Sbjct: 232 KDYPYTGRDGT---CKFDKSKIAAAVANFSVVSLDEDQIAANLVKHGPLSVGINSIFMQT 288
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDGYFT 162
Y G + +C +NL+H V+IVGYG P+ WI++NSWG WG ++GY+
Sbjct: 289 YIGG-VSCPYICSKKNLDHGVLIVGYGAAGYAPIRFKDKPYWIIKNSWGENWG-EEGYYK 346
Query: 163 VERGTNACGIES 174
+ RG N CG++S
Sbjct: 347 ICRGNNICGVDS 358
>gi|344238391|gb|EGV94494.1| Ras-specific guanine nucleotide-releasing factor 1 [Cricetulus
griseus]
Length = 1632
Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats.
Identities = 67/183 (36%), Positives = 96/183 (52%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C N N GC+GG ++A +Y L + G+ E YP+R
Sbjct: 1447 LESAVAIASGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDTYPYR- 1505
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDY--NGK 114
G G C +D +K V D + N + Y P+ + D+ K
Sbjct: 1506 --GKDGHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFE--VTDDFMLYQK 1561
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + +P WIV+NSWG WG D GYF +ERG N CG
Sbjct: 1562 GIYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGTNWG-DKGYFLIERGKNMCG 1620
Query: 172 IES 174
+ +
Sbjct: 1621 LAA 1623
>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 95/174 (54%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ K G LL LS+ L++C+ + GC GG N AIQ K GLE +DYP+ G
Sbjct: 151 QWFRKTGHLLALSEQPLVDCDYLDGGCDGGYPPQTNTAIQ--KMGGLELASDYPY---TG 205
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K ++ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 206 VGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPR- 264
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 265 LCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 318
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 101/195 (51%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A Y+ AG ++ E
Sbjct: 160 LEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFDYILQAGGVQTE 219
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +G C +D KV V++F V + D L +GPL G+N +Q
Sbjct: 220 KDYPY---SGRDETCKFDKSKVAATVANFSVVSLDEDQIAANLVKHGPLAVGINAIFMQT 276
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP +NL+H V++VGYG P+ WI++NSWG WG +DG
Sbjct: 277 YIGGV-----SCPYICGKNLDHGVLLVGYGAAGYAPIRFKDKPFWIIKNSWGESWG-EDG 330
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 331 YYKICRGKNVCGVDS 345
>gi|194352748|emb|CAQ00102.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 99/192 (51%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYLKHAG-LEAE 51
+E + G LL LS+ QL++C+ N GC GG A +YL +G L +
Sbjct: 173 VEGANFVATGKLLDLSEQQLVDCDHTCDAVAKTECNSGCSGGLMTNAYRYLMSSGGLMEQ 232
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
A YP+ G G C +D KV VRV++F D R L GPL G+N A +Q
Sbjct: 233 AAYPY---TGAQGPCRFDRGKVAVRVANFTAVPLDEDQMRAALVRGGPLAVGLNAAFMQT 289
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRH-------QVPVWIVRNSWG-RWGPDDGYFT 162
Y G + +CP +NH V++VGYG R P W+++NSWG +WG + GY+
Sbjct: 290 YVGG-VSCPLICPRAMVNHGVLLVGYGARGFSALRLGYRPYWLIKNSWGAQWG-EGGYYK 347
Query: 163 VERGTNACGIES 174
+ RG N CG++S
Sbjct: 348 LCRGRNVCGVDS 359
>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
Length = 326
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 94/174 (54%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+ G
Sbjct: 151 QWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQ--KMGGLELASDYPY---TG 205
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K V+ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 206 VGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK- 264
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 265 WCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 318
>gi|189528132|ref|XP_695717.3| PREDICTED: cathepsin O [Danio rerio]
Length = 334
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/175 (38%), Positives = 95/175 (54%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFRNQ 59
+ES A L LS Q+I+C+ NQGC GG +A+ +L + L+ +EA+YPF+
Sbjct: 154 IESVSAKGGEKLQQLSVQQVIDCSYQNQGCNGGSPVEALYWLTQSKLKLVSEAEYPFKGA 213
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+GV V VR F+G + L +GPLV ++ QDY G +I+
Sbjct: 214 DGVCQFFPQAHAGVAVRNYSAYDFSGQEEVMMSALVDFGPLVVIVDAISWQDYLGGIIQH 273
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I GY +VP WIVRNSWG WG DDGY ++ G + CG+
Sbjct: 274 H--CSSHKANHAVLITGYDTTGEVPYWIVRNSWGTSWG-DDGYAYIKIGNDVCGV 325
>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
Length = 367
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 100/189 (52%), Gaps = 17/189 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E+Q+AIK+ + LS Q+++C+ GC GG ++ + L +GL +E DYP++
Sbjct: 162 VEAQWAIKYHQAVQLSVQQVLDCDRCGNGCNGGFVWDAFLTVLNTSGLASEQDYPYKG-T 220
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
T RC + + DFL+ + + R L GP+ +N LLQ Y +IR
Sbjct: 221 VKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVIRAT 280
Query: 120 D-VCPSENLNHAVVIVGYGMR-----------HQVPVWIVRNSWG-RWGPDDGYFTVERG 166
C +NH+V++VG+G H +P WI++NSWG WG ++GYF + RG
Sbjct: 281 PATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWG-EEGYFRLHRG 339
Query: 167 TNACGIESY 175
+N CGI Y
Sbjct: 340 SNTCGITKY 348
>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 94/174 (54%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+ G
Sbjct: 151 QWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQ--KMGGLELASDYPY---TG 205
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K V+ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 206 VGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK- 264
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 265 WCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 318
>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 100/174 (57%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIKH L+ LS+ QLI+C+ + GC GG + A + ++ G++AE DYP+ +
Sbjct: 146 LESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTAYEAVMQMGGVQAENDYPYEGSD 205
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G + ++ VF + + +L GP+ ++ + + +Y ++R
Sbjct: 206 GNCRVDVAKFVVKVKKCYRYIAVF--EEKLKDLLRIVGPIPVAIDASDIVNYRRGIMR-- 261
Query: 120 DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C + NHAV++VGYG+ + VP WI++N+WG WG + GYF V++ NACGI
Sbjct: 262 -YCSNYGFNHAVLLVGYGVENNVPYWILKNTWGEDWG-EQGYFRVQQNINACGI 313
>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 95/174 (54%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+ G
Sbjct: 151 QWFRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQ--KMGGLELASDYPY---TG 205
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K ++ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 206 VGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPR- 264
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 265 LCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 318
>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 95/174 (54%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+ G
Sbjct: 151 QWFRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQ--KMGGLELASDYPY---TG 205
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K ++ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 206 VGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPR- 264
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 265 LCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 318
>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
kowalevskii]
Length = 352
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 105/188 (55%), Gaps = 15/188 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPF--RN 58
+E Q+ IK GTL+ LS+ +L++C+ +QGC GG + A Q ++ G+ +E DYP+ R+
Sbjct: 172 IEGQWKIKKGTLVSLSEQELVDCDKLDQGCNGGLPSNAYQEIMRFGGIMSEDDYPYTGRD 231
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
Q+ C +A KV ++ + + + L GP+ G+N +Q Y G +
Sbjct: 232 QD-----CKLNATLNKVYINGSMNISKDEGDMASWLAANGPISIGINANAMQFYFGGVSH 286
Query: 118 KNDV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
+ C ENL+H V+IVGYG + P WI++NSWGR WG +GY+ V RG CG+
Sbjct: 287 PWKIFCNPENLDHGVLIVGYGTKDGTPYWIIKNSWGRSWGV-EGYYLVYRGGGVCGLNE- 344
Query: 176 GGICTRTL 183
+CT +
Sbjct: 345 --MCTSAI 350
>gi|6649593|gb|AAF21470.1|U85983_1 cysteine proteinase [Clonorchis sinensis]
Length = 259
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 94/174 (54%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+ G
Sbjct: 84 QWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQ--KMGGLELASDYPY---TG 138
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K V+ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 139 VGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK- 197
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 198 WCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 251
>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
Length = 360
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 100/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y L + G+ E
Sbjct: 161 LEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCDSGCNGGLMNSAFEYILNNGGVMRE 220
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ NG G C +D K+ V++F +V D L GPL +N +Q
Sbjct: 221 EDYPYSGTNG--GTCKFDKAKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQT 278
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S+ LNH V++VGYG Q P WI++NSWG WG ++G
Sbjct: 279 YVGGV-----SCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWG-ENG 332
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 333 YYKICRGRNICGVDS 347
>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 629
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YA+K G L S+ +L++C+ + C GG + A + +K GLE EA+YP+ +
Sbjct: 442 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYEAKK 501
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+VS F+ G++T + L +GP+ G+N +Q Y G +
Sbjct: 502 Q---QCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAMQFYRGGVSHP 558
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V+IVGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 559 WKALCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 617
Query: 171 GI 172
G+
Sbjct: 618 GV 619
>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 477
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YA+K G L S+ +L++C+ + C GG + A + +K GLE EA+YP+ +
Sbjct: 290 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYEAKK 349
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+VS F+ G++T + L +GP+ G+N +Q Y G +
Sbjct: 350 Q---QCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAMQFYRGGVSHP 406
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V+IVGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 407 WKALCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 465
Query: 171 GI 172
G+
Sbjct: 466 GV 467
>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
Length = 586
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 100/181 (55%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E QYA+K LL LS+ +LI+C+ + GC GG +A + +++ GLE E+DYP+
Sbjct: 398 IEGQYALKSKELLSLSEQELIDCDNLDNGCGGGLMTQAFEAVENLGGLETESDYPYEGHA 457
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C VKV +S + V + + L +GPL G+N +Q Y G +
Sbjct: 458 DRKG-CQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPI 516
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
+ +C ++L+H V IVGYG+ +P W+++NSWG WG + GY+ + RG +CG
Sbjct: 517 HALCSPKSLDHGVAIVGYGVHRTKYTHKNLPYWLIKNSWGPGWG-EKGYYLLYRGDGSCG 575
Query: 172 I 172
+
Sbjct: 576 V 576
>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
Length = 627
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YA+K G L S+ +L++C+ + C GG + A + +K GLE EA+YP+ +
Sbjct: 440 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYEAKK 499
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+VS F+ G++T + L +GP+ G+N +Q Y G +
Sbjct: 500 Q---QCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAMQFYRGGVSHP 556
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V+IVGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 557 WKALCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 615
Query: 171 GI 172
G+
Sbjct: 616 GV 617
>gi|402870704|ref|XP_003899346.1| PREDICTED: cathepsin O [Papio anubis]
Length = 321
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 141 VESAYAIKGKPLEDLSVQQVIDCSYTNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 200
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 201 NGLCHYFSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 261 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 312
>gi|355687683|gb|EHH26267.1| hypothetical protein EGK_16186 [Macaca mulatta]
gi|384945482|gb|AFI36346.1| cathepsin O preproprotein [Macaca mulatta]
Length = 321
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 141 VESAYAIKGKPLEDLSVQQVIDCSYTNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 200
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 201 NGLCHYFSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 261 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 312
>gi|355749637|gb|EHH54036.1| hypothetical protein EGM_14772, partial [Macaca fascicularis]
Length = 311
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 131 VESAYAIKGKPLEDLSVQQVIDCSYTNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 190
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 191 NGLCHYFSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 250
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 251 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 302
>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
Length = 586
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 99/181 (54%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E QYA+K LL LS+ +LI+C+ + GC GG +A + +++ GLE E+DYP+
Sbjct: 398 IEGQYALKSKELLSLSEQELIDCDNLDNGCGGGLMTQAFEAVENLGGLETESDYPYEGHA 457
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C VKV +S + V + + L +GPL G+N +Q Y G +
Sbjct: 458 DRKG-CQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPI 516
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ +C ++L+H V IVGYG+ +P W ++NSWG +WG GY+ + RG +CG
Sbjct: 517 HALCSPKSLDHGVAIVGYGVHKYPYLNATLPFWTIKNSWGDKWGM-QGYYLLYRGDGSCG 575
Query: 172 I 172
+
Sbjct: 576 V 576
>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
Length = 359
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/196 (35%), Positives = 100/196 (51%), Gaps = 31/196 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN----------IYNQGCQGGGFNKAIQY-LKHAGLEA 50
LE + + G L+ LS+ QL++C+ + GC GG N A +Y L + G+
Sbjct: 161 LEGAHFLSTGELVSLSEQQLVDCDHQQCDPEEAGSCDSGCNGGLMNSAFEYILNNGGVMR 220
Query: 51 EADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQ 109
E DYP+ NG G C +D K+ V++F +V D L GPL +N +Q
Sbjct: 221 EEDYPYSGTNG--GTCKFDKAKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQ 278
Query: 110 DYNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDD 158
Y G + CP S+ LNH V++VGYG Q P WI++NSWG WG ++
Sbjct: 279 TYVGGV-----SCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWG-EN 332
Query: 159 GYFTVERGTNACGIES 174
GY+ + RG N CG++S
Sbjct: 333 GYYKICRGRNICGVDS 348
>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 367
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 95/190 (50%), Gaps = 22/190 (11%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAEA 52
E + + G LL LS+ QL++C+ + GC GG A +YL AG LE E
Sbjct: 171 EGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEER 230
Query: 53 DYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQDY 111
YP+ G G C +D KV VRV +F + L +GPL G+N +Q Y
Sbjct: 231 SYPY---TGKRGHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTY 287
Query: 112 NGKLIRKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGYFTVE 164
G + +C N+NH V++VGYG + P WI++NSWG+ ++GY+ +
Sbjct: 288 IGG-VSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLC 346
Query: 165 RGTNACGIES 174
RG + CGI S
Sbjct: 347 RGHDICGINS 356
>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
Length = 326
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 93/174 (53%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+ G
Sbjct: 151 QWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQ--KMGGLELASDYPY---TG 205
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K V+ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 206 VGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK- 264
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
C +NHAV+ VGYG+++ P WIV+NSWG + GYF + RG CGI S
Sbjct: 265 WCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEKGYFRIYRGDGTCGINS 318
>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
Length = 356
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 100/176 (56%), Gaps = 12/176 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+A++H L+ LS+ QLI+C+ + GC GG + A + ++ G++ E DYPF +N
Sbjct: 177 VESQFAMRHNRLIDLSEQQLIDCDSVDMGCNGGLLHTAFEEIMRMGGVQTELDYPFVGRN 236
Query: 61 GVTGRCAYDARK---VKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
RC D + V + V + + +L GP+ ++ A + +Y +I
Sbjct: 237 R---RCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVIS 293
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP W+ +N+WG WG ++GYF V + NACG+
Sbjct: 294 S---CENNGLNHAVLLVGYGVENGVPYWVFKNTWGDDWG-ENGYFRVRQNVNACGM 345
>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 376
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 102/199 (51%), Gaps = 31/199 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYLKHAG-LEAE 51
+E + G LL LS+ QL++C+ + GC GG A YL +G L +
Sbjct: 171 VEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYAYLMSSGGLMEQ 230
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVF--------NGSDTFRRMLYHYGPLVAGM 103
+ YP+ G G C +DA +V VRV++F V +G R L +GPL G+
Sbjct: 231 SAYPY---TGAQGTCRFDANRVAVRVANFTVVAPPGGNDGDGDAQMRAALVRHGPLAVGL 287
Query: 104 NGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRH-------QVPVWIVRNSWGR-WG 155
N A +Q Y G + VCP +NH V++VGYG R P WI++NSWG+ WG
Sbjct: 288 NAAYMQTYVGG-VSCPLVCPRAWVNHGVLLVGYGERGFAALRLGHRPYWIIKNSWGKAWG 346
Query: 156 PDDGYFTVERGTNACGIES 174
+ GY+ + RG N CG+++
Sbjct: 347 -EQGYYRLCRGRNVCGVDT 364
>gi|5881566|dbj|BAA84280.1| Cysteine proteinase [Clonorchis sinensis]
Length = 232
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 94/174 (54%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+ G
Sbjct: 57 QWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQ--KMGGLELASDYPY---TG 111
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K ++ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 112 VGGICHMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK- 170
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 171 WCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 224
>gi|432091112|gb|ELK24324.1| Cathepsin W [Myotis davidii]
Length = 370
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 101/197 (51%), Gaps = 28/197 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E+Q+ IK + +S +L++C GC GG ++ I L ++GL +E DYPF Q
Sbjct: 162 IEAQWGIKTRQSVEVSVQELLDCGRCGDGCSGGFVWDAFITVLNNSGLASEKDYPF--QG 219
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHY---GPLVAGMNGALLQDY-NGKLI 116
V +C K + DF++ SD +R+ ++ GP+ +N LLQ Y NG +
Sbjct: 220 AVRAKCQAKKHKKVAWIQDFIML--SDNEQRIAWYLATEGPITVTINKKLLQQYQNGVIK 277
Query: 117 RKNDVCPSENLNHAVVIVGYGM-----------------RHQVPVWIVRNSWG-RWGPDD 158
C +N++H V++VG+G R P WI++NSWG WG +
Sbjct: 278 ATQTTCDPQNVDHVVLLVGFGKTKSVEGRQAKGVPGHSRRRSTPYWILKNSWGANWG-EK 336
Query: 159 GYFTVERGTNACGIESY 175
GYF + RG+NACGI Y
Sbjct: 337 GYFRLHRGSNACGITKY 353
>gi|395735444|ref|XP_002815290.2| PREDICTED: cathepsin O [Pongo abelii]
Length = 318
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 138 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 197
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 198 NGLCHYFSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 257
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 258 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 309
>gi|114596533|ref|XP_517502.2| PREDICTED: cathepsin O [Pan troglodytes]
gi|410212082|gb|JAA03260.1| cathepsin O [Pan troglodytes]
gi|410330245|gb|JAA34069.1| cathepsin O [Pan troglodytes]
Length = 318
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 138 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 197
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 198 NGLCHYFSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 257
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 258 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 309
>gi|397504019|ref|XP_003822607.1| PREDICTED: cathepsin O [Pan paniscus]
Length = 321
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 141 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 200
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 201 NGLCHYFSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 261 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 312
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 99/190 (52%), Gaps = 24/190 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE +K G L+ LS+ QL++C+ + GC GG A QY LK GLE E
Sbjct: 172 LEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKE 231
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G C+++ K+ VS+F V + + L GPL G+N A +Q
Sbjct: 232 EDYPY---TGKDGTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQT 288
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWG-RWGPDDGYFT 162
Y G + VC NL+H V++VGYG P+ W+++NSWG WG ++GY+
Sbjct: 289 YVGG-VSCPYVCSKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWG-ENGYYK 346
Query: 163 VERGTNACGI 172
+ RG N CGI
Sbjct: 347 LCRGHNVCGI 356
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 99/190 (52%), Gaps = 24/190 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE +K G L+ LS+ QL++C+ + GC GG A QY LK GLE E
Sbjct: 172 LEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKE 231
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G C+++ K+ VS+F V + + L GPL G+N A +Q
Sbjct: 232 EDYPY---TGKDGTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQT 288
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWG-RWGPDDGYFT 162
Y G + VC NL+H V++VGYG P+ W+++NSWG WG ++GY+
Sbjct: 289 YVGG-VSCPYVCSKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWG-ENGYYK 346
Query: 163 VERGTNACGI 172
+ RG N CGI
Sbjct: 347 LCRGHNVCGI 356
>gi|301607871|ref|XP_002933519.1| PREDICTED: cathepsin O-like [Xenopus (Silurana) tropicalis]
Length = 370
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/184 (36%), Positives = 99/184 (53%), Gaps = 13/184 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAIK TL LS Q+I+C+ + GC GG N A+++L L ++Y F+ +
Sbjct: 190 VESAYAIKWHTLEELSVQQVIDCSYLDSGCNGGSTNGALKWLYQTKTKLVRASEYNFKAK 249
Query: 60 NGVTGRCAYDARK---VKVRVSDFLVFNGS-DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
TG C Y + V + + F+G+ D +ML GP+V +N QDY G +
Sbjct: 250 ---TGLCHYFPKTDFGVSINGYETQDFSGTEDAMMKMLVDLGPMVVIVNAVSWQDYLGGI 306
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
I+ + C S NHAV+++GY P WIV+NSWG WG DGY ++ G N CGI
Sbjct: 307 IQHH--CSSGAPNHAVLVIGYDKTGDTPYWIVKNSWGTAWGA-DGYVYIKMGENICGIAD 363
Query: 175 YGGI 178
+ +
Sbjct: 364 FVAV 367
>gi|297293584|ref|XP_001093045.2| PREDICTED: cathepsin O [Macaca mulatta]
Length = 421
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 241 VESAYAIKGKPLEDLSVQQVIDCSYTNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 300
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 301 NGLCHYFSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 360
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 361 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 412
>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
Length = 366
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 99/190 (52%), Gaps = 24/190 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE +K G L+ LS+ QL++C+ + GC GG A QY LK GLE E
Sbjct: 172 LEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKE 231
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G C+++ K+ VS+F V + + L GPL G+N A +Q
Sbjct: 232 EDYPY---TGKDGTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQT 288
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWG-RWGPDDGYFT 162
Y G + VC NL+H V++VGYG P+ W+++NSWG WG ++GY+
Sbjct: 289 YVGG-VSCPYVCSKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWG-ENGYYK 346
Query: 163 VERGTNACGI 172
+ RG N CGI
Sbjct: 347 LCRGHNVCGI 356
>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
Length = 709
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 102/195 (52%), Gaps = 34/195 (17%)
Query: 8 IKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYLKHAG-LEAEADYPFR 57
+ G LL LS+ QL++C+ + GC GG A YL +G L ++ YP+
Sbjct: 180 LATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYAYLMSSGGLMEQSAYPY- 238
Query: 58 NQNGVTGRCAYDARKVKVRVSDFLVF---------NGSDTFRRMLYHYGPLVAGMNGALL 108
G G C +DA +V VRV++F V +G R L +GPL G+N A +
Sbjct: 239 --TGAQGACRFDANRVAVRVANFTVVAPAAGPGGNDGDAQMRAALVRHGPLAVGLNAAYM 296
Query: 109 QDYNGKLIRKNDVCPSENLNHAVVIVGYGMR--------HQVPVWIVRNSWGR-WGPDDG 159
Q Y G + VCP +NH V++VGYG R H+ P WI++NSWG+ WG + G
Sbjct: 297 QTYVGG-VSCPLVCPRAWVNHGVLLVGYGERGFAALRLGHR-PYWIIKNSWGKAWG-EQG 353
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG+++
Sbjct: 354 YYRLCRGRNVCGVDT 368
>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ +S+ QL++C+ +QGC GG A +Y LK G+E E
Sbjct: 168 LEGAHFLTTGNLISMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVERE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
YP+ + G C ++ ++ VS+F V + D + GPL G+N +Q
Sbjct: 228 ETYPYIGSD--RGSCKFNKSQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y K CP S NL+H VV+VGYG P+ WI++NSWG WG +DG
Sbjct: 286 Y-----MKGVSCPYICSRNLDHGVVLVGYGSAGYAPIRFKEKPYWIIKNSWGESWG-EDG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG NACG++S
Sbjct: 340 YYKICRGHNACGVDS 354
>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
Length = 442
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 94/180 (52%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+AI G L+ LS+ +L+ C+ + GC GG + A +L H G + EA YP+ +
Sbjct: 147 IEGQHAIATGQLVSLSEQELVSCDTVDDGCSGGLMDNAFGWLLSAHNGQITTEASYPYVS 206
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM---LYHYGPLVAGMNGALLQDYNGKL 115
NG+ C +++ V + + T R M ++ YGPL G++ + Q Y G +
Sbjct: 207 GNGIVPACTFNSNSNPVGATITSFHDIPKTERDMAAFVFKYGPLSIGVDASSWQSYIGGI 266
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESY 175
+ C ++H V+IVG+ P WI++NSW + GY V +G+N CG+ S+
Sbjct: 267 LSH---CSDVQIDHGVLIVGFDDTASTPYWIIKNSWSSMWGEQGYIRVAKGSNQCGLTSF 323
>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 368
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 100/204 (49%), Gaps = 28/204 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 168 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMRE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G T C D K+ VS+F V + + L GPL +N A +Q
Sbjct: 228 EDYPYTGKDGAT--CKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAAYMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y G + CP LNH V++VGYG + P WI++NSWG +DG+
Sbjct: 286 YIGGV-----SCPYICMRRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGETWGEDGF 340
Query: 161 FTVERGTNACGIESYGGICTRTLN 184
+ + RG N CG++S T T++
Sbjct: 341 YKICRGRNVCGVDSLVSTVTATVS 364
>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
Length = 599
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 66/182 (36%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YAIK G L S+ +L++C+ + C GG + A + +K GLE E++YP+
Sbjct: 412 IEGAYAIKTGDLQEFSEQELLDCDSKDSACNGGLMDNAYKAIKDIGGLEYESEYPYE--- 468
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G +C ++ V+VS F+ G++T + L GP+ G+N +Q Y G +
Sbjct: 469 GKKKQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHP 528
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+ +C +NL+H V+IVGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 529 WSPLCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 587
Query: 171 GI 172
G+
Sbjct: 588 GV 589
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 97/175 (55%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E +AIK G L+ LS+ +LI+C+ ++GC GG N + + GLE E YP++ +N
Sbjct: 257 IEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLPINAFREIQRMGGLEPEDQYPYKARN 316
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDY-NGKLIRK 118
G C + V + D + ++T + + GPL G++ LL Y +G L
Sbjct: 317 GT---CHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDAKLLAYYKSGILHPS 373
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
CP ++H V+I GYG+ + +P W ++NSWG +WG +DGYF + G + CG+
Sbjct: 374 RSRCPPSGIDHGVLITGYGVENGLPYWTIKNSWGDQWG-EDGYFRLMLGKDVCGV 427
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 97/175 (55%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E +AIK G L+ LS+ +LI+C+ ++GC GG N + + GLE E YP++ +N
Sbjct: 292 IEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLPINAFREIQRMGGLEPEDQYPYKARN 351
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDY-NGKLIRK 118
G C + V + D + ++T + + GPL G++ LL Y +G L
Sbjct: 352 GT---CHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDAKLLAYYKSGILHPS 408
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
CP ++H V+I GYG+ + +P W ++NSWG +WG +DGYF + G + CG+
Sbjct: 409 RSRCPPSGIDHGVLITGYGVENGLPYWTIKNSWGDQWG-EDGYFRLMLGKDVCGV 462
>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
Length = 321
Score = 106 bits (265), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 94/175 (53%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E Q+ IK G L+ LSK QL++C+ GC GG + ++ + GLE++ DYP+
Sbjct: 141 VEGQWFIKTGQLVSLSKQQLVDCDRAADGCNGGWPASSYLEIMHMGGLESQDDYPYA--- 197
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGS-DTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
GV +C + ++ ++ D + S D L +GPL +N LQ Y +I +
Sbjct: 198 GVKEQCFMEKERLLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPS 257
Query: 120 -DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C +LNHAV+ VGY +P WI++NSW WG + GYF + RG CGI
Sbjct: 258 YEECSPVDLNHAVLTVGYDKEGDMPYWIIKNSWNVEWG-EKGYFRLYRGDGTCGI 311
>gi|327358519|gb|AEA51106.1| cathepsin F, partial [Oryzias melastigma]
Length = 255
Score = 106 bits (265), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 96/178 (53%), Gaps = 6/178 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ +K+GTLL LS+ +L++C+ +Q C+GG + A + + K GLE E DY +
Sbjct: 75 IEGQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAIEKLGGLETETDYSY---T 131
Query: 61 GVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G RC + RKV + S + L GP+ +N +Q Y +
Sbjct: 132 GKKQRCDFTNRKVAAYINSSVELPKDEKEIAAWLAENGPISVALNAFAMQFYKKGVSHPW 191
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESYG 176
+ C ++HAV++VGYG R+ +P W ++NSWG + GY+ + RG+NACGI G
Sbjct: 192 KIFCNPWMIDHAVLLVGYGERNGIPFWAIKNSWGEDYGEQGYYYLHRGSNACGINKMG 249
>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
Length = 326
Score = 106 bits (265), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 93/174 (53%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ K G LL LS+ QL++C+ + GC GG AIQ K GLE +DYP+ G
Sbjct: 151 QWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQ--KMGGLELASDYPY---TG 205
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K V+ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 206 VGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK- 264
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
C +NH V+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 265 WCDPAGVNHGVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 318
>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 365
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK G+ E
Sbjct: 166 LEGAHFLSTGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYILKSGGVMRE 225
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + +G C +D K+ V++F V + D L GPL +N A +Q
Sbjct: 226 EDYPYSGAD--SGTCKFDKTKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAAYMQT 283
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGM-------RHQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S LNH V++VGYG + P WI++NSWG WG ++G
Sbjct: 284 YIGGV-----SCPYVCSRRLNHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWG-ENG 337
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 338 YYKICRGRNICGVDS 352
>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 92/174 (52%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K G LL LS+ QL++C+ +GC GG K + K GLE +DYP+
Sbjct: 148 VEGQWFRKTGDLLALSEQQLVDCDHLEKGCNGGYPPKTYGEIEKMGGLELASDYPY---T 204
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
GV G C + K V+D V S+ + + L GPL + +N LLQ Y G +I
Sbjct: 205 GVDGICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264
Query: 120 D-VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+C LNHAV+ VGYG +P WIV+NS G + GYF + RG CGI
Sbjct: 265 PFLCNPHGLNHAVLTVGYGTEFGIPYWIVKNSLGVGFGEKGYFRIFRGAGTCGI 318
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 92/174 (52%), Gaps = 8/174 (4%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
E YA+ G L S+ QL++C N GC GG + Y++ GLE E+DYP+ G
Sbjct: 148 EGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPYIQTNGLELESDYPY---TG 204
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
G C+YD+ KV +VS ++ ++ + GP+ +N LQ Y +I +
Sbjct: 205 YDGSCSYDSSKVVTKVSSYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSGII-DDK 263
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
C E L+H V+ VGY + + W+++NSWG WG + GYF RG N CG++
Sbjct: 264 YCDPEWLDHGVLAVGYNSENGLDYWLIKNSWGADWG-ESGYFRFLRGQNICGVK 316
>gi|71895793|ref|NP_001026300.1| cathepsin O precursor [Gallus gallus]
gi|53127320|emb|CAG31043.1| hypothetical protein RCJMB04_1m17 [Gallus gallus]
Length = 320
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 95/177 (53%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEA--EADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L ++ +++Y F+ Q
Sbjct: 140 IESAYAIKGHNLEELSVQQVIDCSYSNYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQ 199
Query: 60 NGVTGRCAY-DARKVKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
TG C Y V ++ F ++ S + R+L +GPL ++ QDY G +
Sbjct: 200 ---TGLCHYFPHSDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGI 256
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ +P WIV+NSWGR DGY V+ G+N CGI
Sbjct: 257 IQYH--CSSGKANHAVLITGFDTTGSIPYWIVQNSWGRTWGIDGYVRVKIGSNVCGI 311
>gi|167427531|gb|ABZ80402.1| cathepsin L6, partial [Fasciola hepatica]
Length = 306
Score = 106 bits (264), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 67/185 (36%), Positives = 101/185 (54%), Gaps = 10/185 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY K T + S+ QL++C+ N GC+GGG +A +YLK GLE E+ YP++
Sbjct: 121 IEGQYVKKFQTRVSFSEQQLVDCSTIPGNHGCRGGGMRRAYEYLKKNGLEPESSYPYK-- 178
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
V G+C Y + +V++ LV +G++T + ++ GP ++ I
Sbjct: 179 -AVEGQCQYKSDLALAKVTNSQLVRSGNETQLKNLIGAEGPASVAVDVKPDFSMYRSGIY 237
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGIESY 175
++ C S +NHAV+ VGYG + WIV+NSWG RWG + GY + R N CGI S
Sbjct: 238 QSQTCSSRRMNHAVLAVGYGTEGGMDYWIVKNSWGPRWG-EAGYIRMARNRNNMCGIASA 296
Query: 176 GGICT 180
G + T
Sbjct: 297 GSLPT 301
>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
Length = 459
Score = 106 bits (264), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/175 (34%), Positives = 99/175 (56%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G LL LS+ +L++C+ ++ C GG + A +K GLE E DY + +
Sbjct: 279 VEGQWFLKQGDLLSLSEQELVDCDTLDKACMGGLPSNAYSAIKTLGGLETEDDYSY---H 335
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C++ A KVKV ++D + + + L GP+ +N +Q Y + R
Sbjct: 336 GHLQTCSFTAEKVKVYINDSVELSKDEQKLAAWLAKKGPISIAINAFGMQFYRRGISRPL 395
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+C ++HAV++VGYG R VP W ++NSWG WG ++GY+ + RG+ ACG+
Sbjct: 396 RLLCSPWFIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EEGYYYLHRGSRACGV 449
>gi|40806502|gb|AAR92156.1| putative cysteine protease 3 [Iris x hollandica]
Length = 292
Score = 106 bits (264), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 99/194 (51%), Gaps = 28/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + G L LS+ Q+++C+ +QGC GG N A QYL K GLE+E
Sbjct: 91 LEGANFLATGKLETLSEQQMVDCDHECDAEEPDDCDQGCNGGLMNTAFQYLQKVGGLESE 150
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D K+K V +F V + + L +GPL +N +Q
Sbjct: 151 KDYPYTGTD--RGTCKFDESKIKASVHNFSVVSIDEEQIAANLVKHGPLAIAINAVFMQT 208
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDGY 160
Y G + CP ++L+H V++VGYG P+ WI++NSWG ++GY
Sbjct: 209 YIGGV-----SCPYICGKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGETWGENGY 263
Query: 161 FTVERGTNACGIES 174
+ + RG N CG++S
Sbjct: 264 YKICRGRNVCGVDS 277
>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
Length = 361
Score = 105 bits (263), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 100/196 (51%), Gaps = 31/196 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN----------IYNQGCQGGGFNKAIQY-LKHAGLEA 50
LE + + G L+ LS+ QL++C+ + GC+GG N A +Y L + G+
Sbjct: 161 LEGAHFLSTGKLVSLSEQQLVDCDHEQCDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMR 220
Query: 51 EADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQ 109
E DYP+ G G C +D K+ V++F +V D L GPL +N +Q
Sbjct: 221 EEDYPYSGTAG--GTCKFDQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQ 278
Query: 110 DYNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDD 158
Y G + CP S+ LNH V++VGYG Q P WI++NSWG WG ++
Sbjct: 279 TYVGGV-----SCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWG-EN 332
Query: 159 GYFTVERGTNACGIES 174
GY+ + RG N CG++S
Sbjct: 333 GYYKICRGRNVCGVDS 348
>gi|332217574|ref|XP_003257933.1| PREDICTED: cathepsin O [Nomascus leucogenys]
Length = 318
Score = 105 bits (263), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 92/175 (52%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 138 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 197
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 198 NGLCHYFLGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 257
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 258 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 309
>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
Length = 360
Score = 105 bits (263), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC+GG N A +Y L + G+ E
Sbjct: 161 LEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMRE 220
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G C +D K+ V++F +V D L GPL +N +Q
Sbjct: 221 EDYPYSGTAG--GTCKFDQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQT 278
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S+ LNH V++VGYG Q P WI++NSWG WG ++G
Sbjct: 279 YVGGV-----SCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWG-ENG 332
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 333 YYKICRGRNVCGVDS 347
>gi|395852405|ref|XP_003798729.1| PREDICTED: cathepsin W [Otolemur garnettii]
Length = 367
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 104/189 (55%), Gaps = 17/189 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+E+ + IK+ + +S +L++CN GCQGG ++ I L ++GL +E DYPF+ +
Sbjct: 162 IEALWGIKYHQSVEVSVQELLDCNRCGDGCQGGFVWDAFITVLNNSGLASEKDYPFK-AS 220
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIR-K 118
T RC + + + DF++ ++ + L +GP+ +N LLQ Y +I+ K
Sbjct: 221 VKTHRCLANKYRKVAWIQDFIMLEDNEHKIAQYLATHGPITVTINMKLLQHYKKGVIKAK 280
Query: 119 NDVCPSENLNHAVVIVGYGMR-----------HQVPVWIVRNSWG-RWGPDDGYFTVERG 166
C + +NH+V++VG+G P WI++NSWG WG ++GYF + RG
Sbjct: 281 PTTCDPQLVNHSVLLVGFGAETVSSQSHLRPHRSTPYWILKNSWGAHWG-EEGYFRLHRG 339
Query: 167 TNACGIESY 175
+N+CGI Y
Sbjct: 340 SNSCGITKY 348
>gi|350587549|ref|XP_003482436.1| PREDICTED: cathepsin O-like [Sus scrofa]
Length = 209
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 67/178 (37%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L ++ ++++YPF+ Q
Sbjct: 29 VESAYAIKGQPLEVLSVQQVIDCSYNNYGCNGGSTLNALYWLNKTQVKVVSDSEYPFKAQ 88
Query: 60 NGVTGRCAY-DARKVKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG+ C Y V + D+ ++ S D + L GPL+ ++ QDY G +
Sbjct: 89 NGL---CHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVDAVSWQDYLGGI 145
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
I+ + C S NHAV++ G+ P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 146 IQHH--CSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSAWGI-DGYALVKMGGNICGI 200
>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 93/191 (48%), Gaps = 23/191 (12%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECN----------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
E + + G LL LS+ QL++C+ + GC GG A +YL AG LE E
Sbjct: 171 EGAHFVSTGKLLSLSEQQLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEE 230
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
YP+ G G C +D KV VRV +F D L GPL G+N +Q
Sbjct: 231 RSYPY---TGKRGHCKFDPEKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQT 287
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGYFTV 163
Y G + +C +NH V++VGYG + P WI++NSWG+ ++GY+ +
Sbjct: 288 YIGG-VSCPLICSKRKVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKL 346
Query: 164 ERGTNACGIES 174
RG + CGI S
Sbjct: 347 CRGHDICGINS 357
>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
Length = 367
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 101/195 (51%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y L+ G++ E
Sbjct: 169 LEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKE 228
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G C +D KV VS++ V + D L GPL G+N +Q
Sbjct: 229 KDYPY---TGRDGTCKFDKTKVAATVSNYSVVSLDEDQIAANLVKNGPLAVGINAVFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V+IVGYG P+ WI++NSWG WG ++G
Sbjct: 286 YIGGV-----SCPYICGKHLDHGVLIVGYGEGAYAPIRFKNKPYWIIKNSWGESWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 340 YYKICRGRNVCGVDS 354
>gi|426345827|ref|XP_004040600.1| PREDICTED: cathepsin O [Gorilla gorilla gorilla]
Length = 321
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 93/177 (52%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 141 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 200
Query: 60 NGVTGRCAYDARKVKVR---VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG+ + ++ DF N D + L +GPLV ++ QDY G +I
Sbjct: 201 NGLCHYFSGSHSGFSIKGYSAHDFS--NQEDEMAKALLTFGPLVVIVDAVSWQDYLGGII 258
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ + C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 259 QHH--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 312
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 67/192 (34%), Positives = 99/192 (51%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
+E + + G L+ LS+ QL++C+ + GC GG A QY++ AG LE E
Sbjct: 172 VEGAHFLNSGKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFMTNAYQYVEAAGGLELE 231
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
+DYP+ G G+C +D+ KV V+VS+F + D L GPL G+N +Q
Sbjct: 232 SDYPYE---GRDGKCKFDSNKVAVKVSNFTNIPVDEDQVAAYLIKSGPLAIGINAEFMQT 288
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWG-RWGPDDGYFT 162
Y + C NL+H V++VGY R P WI++NSWG WG D+GY+
Sbjct: 289 YIAG-VSCPIFCNKRNLDHGVLLVGYAERGFAPARLAYKPYWIIKNSWGPNWG-DNGYYK 346
Query: 163 VERGTNACGIES 174
+ RG CG+ +
Sbjct: 347 ICRGHGECGLNT 358
>gi|4557501|ref|NP_001325.1| cathepsin O preproprotein [Homo sapiens]
gi|1168795|sp|P43234.1|CATO_HUMAN RecName: Full=Cathepsin O; Flags: Precursor
gi|574804|emb|CAA54562.1| cathepsin O [Homo sapiens]
gi|29351630|gb|AAH49206.1| Cathepsin O [Homo sapiens]
gi|312153238|gb|ADQ33131.1| cathepsin O [synthetic construct]
Length = 321
Score = 105 bits (263), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 141 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 200
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + + D + L +GPLV ++ QDY G +I+
Sbjct: 201 NGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 260
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 261 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 312
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 105 bits (263), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 101/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ +QGC GG A +Y LK GLE E
Sbjct: 162 LEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQGCNGGLMTTAFEYILKSGGLERE 221
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
ADYP+ + G C ++ K+ S+F V + D L +GPL G+N +Q
Sbjct: 222 ADYPYTGTD--RGTCKFNKAKISAVASNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQT 279
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V++VGYG + P WI++NSWG WG ++G
Sbjct: 280 YVGGV-----SCPYICGKHLDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGENWG-ENG 333
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 334 YYKICRGRNVCGVDS 348
>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
Length = 335
Score = 105 bits (262), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 99/176 (56%), Gaps = 12/176 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+A++H L+ LS+ QLI+C+ + GC GG + A + ++ G++AE DYPF
Sbjct: 156 VESQFAMRHNRLVDLSEQQLIDCDSVDMGCNGGLLHTAFEEIIRMGGVQAELDYPFV--- 212
Query: 61 GVTGRCAYDARK---VKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G RC D + V + V + + +L GP+ ++ A + +Y +I
Sbjct: 213 GRDRRCGVDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVIS 272
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP W +N+WG WG ++GYF V + NACG+
Sbjct: 273 S---CENNGLNHAVLLVGYGVENGVPYWAFKNTWGDDWG-ENGYFRVRQNINACGM 324
>gi|118197532|ref|YP_874244.1| cathepsin [Ectropis obliqua NPV]
gi|113472527|gb|ABI35734.1| cathepsin [Ectropis obliqua NPV]
Length = 299
Score = 105 bits (262), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 101/176 (57%), Gaps = 13/176 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+ESQYAIKH + LS+ Q+I+C+ + GC GG + A Q ++ G++ E +YP+
Sbjct: 120 IESQYAIKHNVQINLSEQQMIDCDYVDMGCDGGLLHTAFEQMIEMGGVKHEHEYPYE--- 176
Query: 61 GVTGRCAYDARKVKVRV---SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G+ C + V++ ++V + + +L GP+ ++ + + +Y +I
Sbjct: 177 GINMNCRLNDDNFAVKIIGCYRYIVLQ-EEKLKDLLRAVGPIPIAIDASGIANYYQGVI- 234
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ C + LNHAV++VGYG+ + +P W ++N+WG WG ++GYF V + NACG+
Sbjct: 235 --NYCENHGLNHAVLLVGYGVENNIPYWTIKNTWGEDWG-ENGYFRVRQNINACGM 287
>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
Length = 473
Score = 105 bits (262), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 99/179 (55%), Gaps = 6/179 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ +K+GTLL LS+ +L++C+ +Q C+GG + A + + K GLE+E DY +
Sbjct: 293 IEGQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAIEKLGGLESETDYSY---T 349
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G +C + RKV ++ + + L GP+ +N +Q Y +
Sbjct: 350 GHKQKCDFTNRKVAAYINSSVELPKDEREIAAWLAENGPISVALNAFAMQFYKKGVSHPW 409
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESYGG 177
+ C ++HAV++VGYG R+ +P W ++NSWG + GY+ ++RG+NACGI G
Sbjct: 410 KIFCNPWMIDHAVLLVGYGERNGIPFWAIKNSWGEDYGEQGYYYLQRGSNACGINRMGS 468
>gi|307169691|gb|EFN62267.1| Cathepsin O [Camponotus floridanus]
Length = 358
Score = 105 bits (262), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 95/177 (53%), Gaps = 5/177 (2%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFRN 58
++ES YAIK+GTL P S ++I+C + GCQGG + +L + + +E YP
Sbjct: 170 VVESMYAIKNGTLYPFSVQEMIDCMPGSYGCQGGDTCALLSWLLESKTKIISENVYPLTL 229
Query: 59 QNGVTGRCAYDARKVKVRVSDFL---VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+N A+ V+++DF N +L +GP+VAG+N Q+Y G +
Sbjct: 230 RNDPCKLSKTSAKTTGVKITDFTCNSFVNAESNLLTLLGTHGPVVAGVNAISWQNYLGGI 289
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
I+ + +LNHAV IVGY M ++P +I++NSWG + GY + G N CGI
Sbjct: 290 IQYHCDGSFSHLNHAVQIVGYDMAARIPHYIIKNSWGSTFGNKGYIYIAIGKNLCGI 346
>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
Length = 605
Score = 105 bits (262), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YAIK G L S+ +L++C+ + C GG + A + +K GLE E++YP+ +
Sbjct: 418 IEGLYAIKTGELREFSEQELLDCDSTDSACNGGLMDNAYKAIKDIGGLEYESEYPYLAKK 477
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+V+DF+ G++T + L GP+ G+N +Q Y G +
Sbjct: 478 K---QCHFNKTLSHVQVADFVDLPKGNETAMQEWLLANGPISIGLNANAMQFYRGGVSHP 534
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V+IVGYG+ +P WIV+NSWG RWG + GY+ + RG N C
Sbjct: 535 WGPLCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRIYRGDNTC 593
Query: 171 GI 172
G+
Sbjct: 594 GV 595
>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 105 bits (262), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 93/174 (53%), Gaps = 10/174 (5%)
Query: 5 QYAIKHGTLLPLSKSQLIECNIYNQGCQGG---GFNKAIQYLKHAGLEAEADYPFRNQNG 61
Q+ + G LL LS QL++C+ + GC GG AIQ K GLE +DYP+ G
Sbjct: 151 QWFRETGHLLALSGQQLVDCDYLDDGCDGGYPPQTYTAIQ--KMGGLELASDYPY---TG 205
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V G C D K V+ + S+ + + L GPL + +N LQ Y G ++R
Sbjct: 206 VGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK- 264
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
C +NHAV+ VGYG+++ P WIV+NSWG ++GYF + RG CGI S
Sbjct: 265 WCDPAGVNHAVLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINS 318
>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
Length = 368
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 99/194 (51%), Gaps = 29/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 171 LEGANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCSGGLMNSAFEYTLKAGGLMRE 230
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + T C +D KV +V++F V + + L GPL +N +Q
Sbjct: 231 EDYPYTGTDKAT--CKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQT 288
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGY 160
Y G + CP S+ L+H V++VGYG + P WI++NSWG +WG + GY
Sbjct: 289 YVGGV-----SCPYICSKQLDHGVLLVGYGTGFSPIRMKEKPYWIIKNSWGEKWG-ESGY 342
Query: 161 FTVERGTNACGIES 174
+ + RG N CG++S
Sbjct: 343 YKIRRGRNVCGVDS 356
>gi|119625288|gb|EAX04883.1| cathepsin O [Homo sapiens]
Length = 336
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 156 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 215
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + + D + L +GPLV ++ QDY G +I+
Sbjct: 216 NGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 275
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 276 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 327
>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
Length = 343
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 95/185 (51%), Gaps = 7/185 (3%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRN-- 58
++E +K G L+ LS+ QLI+C+ + GC+GG A +Y+K GLEAE DYP+
Sbjct: 160 VVEGANFLKTGKLISLSEEQLIDCDYKDNGCEGGDMLSAYEYVKARGLEAEEDYPYEELG 219
Query: 59 --QNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
V G C Y KV ++++ V D L GPL + G +L Y G +
Sbjct: 220 YRHKPVRGPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYEGGV 279
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESY 175
+CP E +NH V++VGYG+ + + W +N+W ++GYF + RG C + S
Sbjct: 280 ACPR-ICPGE-INHGVLLVGYGVENGLRYWTFKNTWTDEFGENGYFRLCRGVGVCDMNSE 337
Query: 176 GGICT 180
G +
Sbjct: 338 VGTVS 342
>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
Length = 615
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YA+K G L S+ +L++C+ + C GG + A + +K GLE EA+YP++ +
Sbjct: 428 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKK 487
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+V+ F+ G++T + L GP+ G+N +Q Y G +
Sbjct: 488 N---QCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTKGPISIGINANAMQFYRGGVSHP 544
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V++VGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 545 WKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 603
Query: 171 GI 172
G+
Sbjct: 604 GV 605
>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
Length = 353
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 99/181 (54%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ YA G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 169 LEAAYAQAFGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 228
Query: 59 QNGVTGRCAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
++GV C + A+ V VRV D + D ++ + P+ A + +
Sbjct: 229 KDGV---CKFTAKNVAVRVIDSINITLGAEDELKQAVAFVRPVSVAFEVAKDFRFYNNGV 285
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+ +C S ++NHAV+ VGYG+ VP WI++NSWG WG D+GYF +E G N CG+
Sbjct: 286 YTSTICGSTPMDVNHAVLAVGYGVEDGVPYWIIKNSWGSNWG-DNGYFKMELGKNMCGVA 344
Query: 174 S 174
+
Sbjct: 345 T 345
>gi|241111179|ref|XP_002399230.1| cysteine protease and A protease inhibitor, putative [Ixodes
scapularis]
gi|215492918|gb|EEC02559.1| cysteine protease and A protease inhibitor, putative [Ixodes
scapularis]
Length = 363
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 96/177 (54%), Gaps = 12/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRN 58
+E+ +A+ GTL S Q+I+C N N GC GG A+++LK L ++ YPF+
Sbjct: 177 VETMHALAAGTLTGFSVQQMIDCSNNSNHGCNGGDTCAALKWLKVNRIKLVRDSVYPFK- 235
Query: 59 QNGVTGRCAYDARKVKVRVSDFL---VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
VTG C + A V VSD+ + + ML + GPLV ++ QDY G +
Sbjct: 236 --AVTGSCQHPASDVTAEVSDYTCDRLVGNEERMIDMLANVGPLVVAVDATTWQDYLGGV 293
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
I+ + C + NHAV IVGY + VP +IVRNSWG+ +DGY + G N CGI
Sbjct: 294 IQFH--CDA-GRNHAVQIVGYDLTGDVPYYIVRNSWGKQFGNDGYLYIAVGKNLCGI 347
>gi|449272742|gb|EMC82496.1| Cathepsin O, partial [Columba livia]
Length = 275
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/178 (37%), Positives = 97/178 (54%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEA--EADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L ++ +++Y F+ Q
Sbjct: 95 IESAYAIKGHNLEELSVQQVIDCSYNNYGCSGGSTVSALSWLNQTKVKLVRDSEYAFKAQ 154
Query: 60 NGVTGRCAYDARK-VKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
TG C Y V ++ F ++ S + RML ++GPL ++ QDY G +
Sbjct: 155 ---TGLCHYFGHSDFGVSITGFAAYDFSGQEEEMMRMLVNWGPLAVTVDAVSWQDYLGGI 211
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ +P WIV+NSWG WG DGY V+ G+N CGI
Sbjct: 212 IQYH--CSSGRANHAVLITGFDRTGSIPYWIVQNSWGPAWGI-DGYVRVKIGSNVCGI 266
>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
Length = 377
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL+EC+ + GC GG N A +Y LK GL E
Sbjct: 178 LEGANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLKAGGLMKE 237
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D K+ VS+F V + D L GPL +N +Q
Sbjct: 238 EDYPYTGTD--RGSCKFDKTKIAASVSNFSVISLDEDQIAANLVKIGPLAVAINAVFMQT 295
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 296 YVGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWG-ENG 349
Query: 160 YFTVERGTNACGIES 174
++ + RG N CG++S
Sbjct: 350 FYKICRGRNVCGVDS 364
>gi|301777930|ref|XP_002924382.1| PREDICTED: cathepsin O-like [Ailuropoda melanoleuca]
Length = 300
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 92/175 (52%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 120 VESAYAIKGEPLEALSVQQVIDCSYNNYGCSGGSTVSALHWLNKTQVKLVRDSEYPFKAQ 179
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + + D + L +GPLV ++ QDY G +I+
Sbjct: 180 NGLCHYFSDSQSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVVVDAVSWQDYLGGIIQH 239
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 240 H--CSSGEANHAVLITGFDKIGSTPYWIVRNSWGSSWGV-DGYARVKMGGNICGI 291
>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
Length = 329
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 97/178 (54%), Gaps = 12/178 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
LE QYAIK G L+ S+ +L++C+ N GCQGG + A +Y + E E+DY + +
Sbjct: 148 LEGQYAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDYAFKYWETNLAEKESDYTYTAK 207
Query: 60 NGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKL 115
NG +C Y+A+ + S F + D + + + GP+ M+ + Q Y+
Sbjct: 208 NG---KCKYNAQLGVTKDSSFTDIPSENCDALKEAVANKGPIAVAMDASHTSFQMYHSG- 263
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I +C L+H V++VGYG + V W+++NSWG WG DGYF +E ++ CGI
Sbjct: 264 IYTPFLCSKTKLDHGVLVVGYGTDNGVDYWLIKNSWGMAWGM-DGYFKIEMKSDKCGI 320
>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
Length = 343
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 95/185 (51%), Gaps = 7/185 (3%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRN-- 58
++E +K G L+ LS+ QLI+C+ + GC+GG A +Y+K GLEA+ DYP+
Sbjct: 160 VVEGANFLKTGKLISLSEEQLIDCDYKDNGCEGGDMLSAYEYVKARGLEADEDYPYEELG 219
Query: 59 --QNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
V G C Y KV ++++ V D L GPL + G +L Y G +
Sbjct: 220 YRHKPVRGPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYEGGV 279
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESY 175
+CP E +NH V++VGYG+ + + W +NSW ++GYF + RG C + S
Sbjct: 280 ACPR-ICPGE-INHGVLLVGYGVENGLRYWTFKNSWTDEFGENGYFRLCRGVGVCDMTSE 337
Query: 176 GGICT 180
G +
Sbjct: 338 VGTVS 342
>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
Length = 615
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YA+K G L S+ +L++C+ + C GG + A + +K GLE EA+YP++ +
Sbjct: 428 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKK 487
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+V+ F+ G++T + L GP+ G+N +Q Y G +
Sbjct: 488 N---QCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHP 544
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V++VGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 545 WKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 603
Query: 171 GI 172
G+
Sbjct: 604 GV 605
>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
Precursor
gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
Length = 614
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YA+K G L S+ +L++C+ + C GG + A + +K GLE EA+YP++ +
Sbjct: 427 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKK 486
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+V+ F+ G++T + L GP+ G+N +Q Y G +
Sbjct: 487 N---QCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQFYRGGVSHP 543
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V++VGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 544 WKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 602
Query: 171 GI 172
G+
Sbjct: 603 GV 604
>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 100/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK GLE E
Sbjct: 168 LEGAHYLATGELVSLSEQQLVDCDHECDPEEYGACDSGCSGGLMNNAFEYALKAGGLERE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C ++ KV VS+F V + D L +GPL +N +Q
Sbjct: 228 KDYPYTGND--RGACKFEKSKVAASVSNFSVVSLDEDQIAANLVKHGPLSVAINAVFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S++ +H V++VGYG + P WI++NSWG WG ++G
Sbjct: 286 YIGGV-----SCPYICSKHQDHGVLLVGYGAAGYAPIRFKEKPFWIIKNSWGENWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + R N CG++S
Sbjct: 340 YYKICRARNICGVDS 354
>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
Length = 881
Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats.
Identities = 65/183 (35%), Positives = 107/183 (58%), Gaps = 14/183 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E QYAIK+ LL LS+ +L++C+ ++GC GG A + + K GLE E+DYP+ +N
Sbjct: 695 VEGQYAIKYKKLLSLSEQELLDCDTLDEGCNGGYMENAYKAIEKLGGLELESDYPYDGRN 754
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C + + KV+V + ++T + L GP+ G+N +Q Y G +
Sbjct: 755 ---EKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQFYIGGVSHPF 811
Query: 119 NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ +C ++L+H V+IVGYG+ ++P WI++NSWG RWG ++GY+ V RG CG
Sbjct: 812 HFLCNPKDLDHGVLIVGYGISKYPLFHKELPYWIIKNSWGSRWG-ENGYYRVYRGDGTCG 870
Query: 172 IES 174
+ +
Sbjct: 871 VNA 873
>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
Length = 475
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YA+K G L S+ +L++C+ + C GG + A + +K GLE EA+YP++ +
Sbjct: 288 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKK 347
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+V+ F+ G++T + L GP+ G+N +Q Y G +
Sbjct: 348 N---QCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQFYRGGVSHP 404
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V++VGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 405 WKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 463
Query: 171 GI 172
G+
Sbjct: 464 GV 465
>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
Length = 313
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 97/194 (50%), Gaps = 28/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 117 LEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMRE 176
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +G G C D K+ VS+F V + D L GPL +N A +Q
Sbjct: 177 KDYPYTGTDG--GSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQT 234
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y G + CP S LNH V++VGYG + P WI++NSWG ++G+
Sbjct: 235 YIGGV-----SCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGF 289
Query: 161 FTVERGTNACGIES 174
+ + +G N CG++S
Sbjct: 290 YKICKGRNICGVDS 303
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/183 (38%), Positives = 105/183 (57%), Gaps = 15/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
LE QY K+G L+PLS+SQL++C + N+GC GG A +Y+K G+E+E+DYP++
Sbjct: 199 LEGQYFRKNGKLVPLSESQLVDCSGSFGNEGCNGGFMENAFKYVKSVGGIESESDYPYKA 258
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSD-TFRRMLYHYGPLVAGMNG--ALLQDYNGK 114
+ CA+D KV VS + V +GS+ + + ++ GP+ ++ + Q Y G
Sbjct: 259 RQRT---CAFDKTKVIATVSGCVDVESGSESSLKEVVSEVGPVSVAIDAGHSSFQLYAGG 315
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERG-TNACG 171
+ +C + LNH V+ VGYG Q WIV+NSWG RWG +GY + R N CG
Sbjct: 316 -VYDEPLCSTSRLNHGVLCVGYGTSLQGKDYWIVKNSWGVRWGV-EGYIKMSRNKNNQCG 373
Query: 172 IES 174
I S
Sbjct: 374 IAS 376
>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
Length = 377
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL+EC+ + GC GG N A +Y LK GL E
Sbjct: 178 LEGANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLKAGGLMKE 237
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D K+ VS+F V + D L GPL +N +Q
Sbjct: 238 EDYPYTGTD--RGSCKFDKTKIAASVSNFSVISLDEDQIAANLVKNGPLAVAINAVFMQT 295
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 296 YVGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWG-ENG 349
Query: 160 YFTVERGTNACGIES 174
++ + RG N CG++S
Sbjct: 350 FYKICRGRNVCGVDS 364
>gi|431901237|gb|ELK08303.1| Cathepsin O [Pteropus alecto]
Length = 322
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/180 (37%), Positives = 93/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 142 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALYWLNKTQVKLVRDSEYPFKAQ 201
Query: 60 NGVTGRCAYDARKVKVR---VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG+ A ++ DF + D + L +GPLV ++ QDY G +I
Sbjct: 202 NGLCLYFADTHSGFSIKGYSAHDFS--DQEDEMAKALLTFGPLVGIVDAVSWQDYLGGII 259
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+ + C S NHAV+I G+ P WIVRNSWG WG DGY V+ G N CGI +
Sbjct: 260 QHH--CSSGEANHAVIITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGDNTCGIADF 316
>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
Length = 361
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 97/194 (50%), Gaps = 28/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 165 LEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMRE 224
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +G G C D K+ VS+F V + D L GPL +N A +Q
Sbjct: 225 KDYPYTGTDG--GSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQT 282
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y G + CP S LNH V++VGYG + P WI++NSWG ++G+
Sbjct: 283 YIGGV-----SCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGF 337
Query: 161 FTVERGTNACGIES 174
+ + +G N CG++S
Sbjct: 338 YKICKGRNICGVDS 351
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 85/171 (49%), Gaps = 5/171 (2%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
E Y G L+ LS+ QLI+C N GC GG + Y++ GL +E+ YP+ G
Sbjct: 143 EGAYYKSTGKLVSLSEQQLIDCTTNVNDGCDGGYLEETFPYVQQTGLVSESSYPY---TG 199
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDV 121
G C V +VS +++ G + GP+ M+ + Y + ++ +
Sbjct: 200 RDGNCRISESDVVTKVSKYVLLGGEADLLEAVGSVGPVSVAMDATYIYSYASG-VYESSL 258
Query: 122 CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
C +LNH V++VGYG + W+++NSWG + GY + RGTN CGI
Sbjct: 259 CSLYSLNHGVLVVGYGTQDGKDYWLIKNSWGNTWGEQGYLKLLRGTNECGI 309
>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
mellifera]
Length = 881
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 65/183 (35%), Positives = 107/183 (58%), Gaps = 14/183 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E QYAIK+ LL LS+ +L++C+ ++GC GG A + + K GLE E+DYP+ +N
Sbjct: 695 VEGQYAIKYKKLLSLSEQELLDCDTLDEGCNGGYMENAYKAIEKLGGLELESDYPYDGRN 754
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C + + KV+V + ++T + L GP+ G+N +Q Y G +
Sbjct: 755 ---EKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQFYIGGVSHPF 811
Query: 119 NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ +C ++L+H V+IVGYG+ ++P WI++NSWG RWG ++GY+ V RG CG
Sbjct: 812 HFLCNPKDLDHGVLIVGYGISKYPLFHKKLPYWIIKNSWGSRWG-ENGYYRVYRGDGTCG 870
Query: 172 IES 174
+ +
Sbjct: 871 VNA 873
>gi|327273973|ref|XP_003221753.1| PREDICTED: cathepsin O-like [Anolis carolinensis]
Length = 376
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 100/179 (55%), Gaps = 13/179 (7%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA--GLEAEADYPFRN 58
++ES +AIK L LS Q+I+C+ N GC+GG A+ ++ L +++Y F+
Sbjct: 195 IIESVHAIKRNVLEELSVQQVIDCSYINSGCRGGSPVGALGWINQTRVKLVRDSEYHFQA 254
Query: 59 QNGVTGRCAYDAR-KVKVRVSDFLVFNGSD---TFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ TG C Y +R V + + ++ SD +++L +GPL ++ A QDY G
Sbjct: 255 E---TGLCRYFSRADFGVSIKGYAAYDLSDQEDKMKKLLLEWGPLAVVVDAASWQDYLGG 311
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+I+ + C S NHAV+I GY +P WIV+NSWG WG DGY ++ G+N CGI
Sbjct: 312 IIQYH--CSSGEPNHAVLITGYDTTGSIPFWIVKNSWGPAWG-IDGYVRIKIGSNVCGI 367
>gi|74834619|sp|O97397.1|CATLL_PHACE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
gi|4210800|emb|CAA76927.1| thiol protease [Phaedon cochleariae]
Length = 324
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 102/178 (57%), Gaps = 11/178 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+ESQ AIK G+ +PLS QL++C+ N GC GG +Y+K GLE++ADYP+
Sbjct: 143 IESQSAIKSGSKVPLSPQQLVDCSTSYGNHGCNGGFAVNGFEYVKDNGLESDADYPY--- 199
Query: 60 NGVTGRC-AYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+G +C A D + V ++ + S+T + + GP+ A + G ++ Y G +
Sbjct: 200 SGKEDKCKANDKSRSVVELTGYKKVTASETSLKEAVGTIGPISAVVFGKPMKSYGGGIF- 258
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN-ACGIE 173
+ C +NL+H V +VGYG+ + WI++N+WG WG + GY + R T+ +CG+E
Sbjct: 259 DDSSCLGDNLHHGVNVVGYGIENGQKYWIIKNTWGADWG-ESGYIRLIRDTDHSCGVE 315
>gi|410956684|ref|XP_003984969.1| PREDICTED: cathepsin O [Felis catus]
Length = 390
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L H L +++YPF+ Q
Sbjct: 210 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKTHVKLVRDSEYPFKAQ 269
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + + D + L +GPLV ++ QDY G +I+
Sbjct: 270 NGLCRYFSDSHSGFPIKGYSAYDFSDQEDEMAKALVTFGPLVVVVDAVSWQDYLGGIIQH 329
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 330 H--CSSGEANHAVLITGFDKIGNTPYWIVRNSWGSSWG-VDGYAHVKMGGNICGI 381
>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 363
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 99/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK G+ E
Sbjct: 164 LEGAHFLSTGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYILKSGGVMRE 223
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D K+ V++F V + D L GPL +N A +Q
Sbjct: 224 EDYPYSGTD--RGNCKFDKAKIAASVANFSVISLDEDQIAANLVKNGPLAVAINAAYMQT 281
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGM-------RHQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S L+H V++VGYG + P WI++NSWG WG ++G
Sbjct: 282 YIGGV-----SCPYICSRRLDHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWG-ENG 335
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 336 YYKICRGRNICGVDS 350
>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
Length = 368
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 100/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG A +Y LK GLE E
Sbjct: 168 LEGAHFLATGELVSLSEQQLVDCDHECDPEEYGACDSGCNGGLMTTAFEYTLKAGGLERE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D K+ VS+F V + D L +GPL G+N +Q
Sbjct: 228 EDYPYTGND--RGPCKFDRNKIVASVSNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ +H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 286 YMGGV-----SCPYICSKRQDHGVLLVGYGSAGYAPIRLKDKPFWIIKNSWGESWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG+++
Sbjct: 340 YYRICRGRNICGVDA 354
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 99/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 169 LEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 228
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + C +D KV RV++F V + D L GPL +N +Q
Sbjct: 229 EDYPYTGTD--RDACKFDKNKVAARVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQT 286
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWG-RWGPDDG 159
Y G + CP S L+H V++VGYG + P WI++NSWG +WG ++G
Sbjct: 287 YIGGV-----SCPYICSRRLDHGVLLVGYGSAGYSPVRMKEKPFWIIKNSWGEKWG-ENG 340
Query: 160 YFTVERGTNACGIES 174
++ + RG N CG++S
Sbjct: 341 FYKICRGRNVCGVDS 355
>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|321460289|gb|EFX71333.1| hypothetical protein DAPPUDRAFT_189155 [Daphnia pulex]
Length = 266
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/189 (34%), Positives = 105/189 (55%), Gaps = 17/189 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YA+++G LL LS+ +L++C+ + GC GG A + + GLE E+DYP+ N
Sbjct: 80 VEGIYAVRNGDLLSLSEQELVDCDKLDSGCNGGLPENAYKAIHDIGGLETESDYPY---N 136
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G +C +++ +V+V+ + + ++T + L GP+ G+N +Q Y G +
Sbjct: 137 GHENKCKFNSNITRVQVTGGVEISTNETEMAQWLIQNGPISIGINANAMQYYRGGVSHPW 196
Query: 120 DV-CPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
V C ++H V+IVGYG+ +P WIV+NSWG RWG + GY+ V RG CG
Sbjct: 197 KVLCRPGGIDHGVLIVGYGVSQYPKFNKTLPYWIVKNSWGTRWG-EQGYYRVFRGDGTCG 255
Query: 172 IESYGGICT 180
+ +CT
Sbjct: 256 LNQ---MCT 261
>gi|354466410|ref|XP_003495667.1| PREDICTED: pro-cathepsin H-like [Cricetulus griseus]
Length = 333
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/183 (36%), Positives = 96/183 (52%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C N N GC+GG ++A +Y L + G+ E YP+R
Sbjct: 148 LESAVAIASGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDTYPYR- 206
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
G G C +D +K V D + N + Y P+ + D+ K
Sbjct: 207 --GKDGHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFE--VTDDFMLYQK 262
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + +P WIV+NSWG WG D GYF +ERG N CG
Sbjct: 263 GIYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGTNWG-DKGYFLIERGKNMCG 321
Query: 172 IES 174
+ +
Sbjct: 322 LAA 324
>gi|345780796|ref|XP_539782.3| PREDICTED: cathepsin O [Canis lupus familiaris]
Length = 456
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAIK L +S Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 276 VESAYAIKGKPLADISVQQVIDCSYNNYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQ 335
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + +R S + + D ++L +GPLV ++ QDY G +I+
Sbjct: 336 NGLCHYFSDSYSGFSIRGYSAYDFSDQEDEMAKVLLTFGPLVVVVDAVSWQDYLGGIIQH 395
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 396 H--CSSGEANHAVLITGFDKIGSTPYWIVRNSWGSSWG-VDGYAHVKMGGNICGI 447
>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
Length = 410
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 100/175 (57%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G LL LS+ +L++C+ ++ C GG + A +K GLE E DY + +
Sbjct: 230 VEGQWFLKRGDLLSLSEQELVDCDKVDKACMGGLPSNAYSAIKTLGGLETEDDYSY---S 286
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C++ A+K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 287 GHLQTCSFSAQKAKVYINDSVELSHNEQELAAWLAKNGPISIAINAFGMQFYRHGISRPL 346
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+C ++HAV++VGYG R VP W ++NSWG WG ++GY+ + RG+ ACG+
Sbjct: 347 RPLCSRWFIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EEGYYYLHRGSGACGV 400
>gi|14602252|ref|NP_148795.1| ORF11 cathepsin [Cydia pomonella granulovirus]
gi|13124000|sp|O91466.1|CATV_GVCPM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|14591773|gb|AAK70678.1| ORF11 cathepsin [Cydia pomonella granulovirus]
Length = 333
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/174 (34%), Positives = 98/174 (56%), Gaps = 10/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ES Y IK+ L LS+ L+ C+ N GC GG + A++ L+ G+ + + P+
Sbjct: 157 IESLYNIKYDKALNLSEQHLVNCDNINNGCAGGLMHWALESILQEGGVVSAENEPYY--- 213
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
G G C ++ + S V + R +L GP+ ++ + L +Y + D
Sbjct: 214 GFDGVCKKSPFELSISGSRRYVLQNENKLRELLVVNGPISVAIDVSDLINYKAGIA---D 270
Query: 121 VCPS-ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+C + E LNHAV++VGYG+++ VP WI++NSWG WG ++GYF V+R N+CG+
Sbjct: 271 ICENNEGLNHAVLLVGYGVKNDVPYWILKNSWGAEWG-EEGYFRVQRDKNSCGM 323
>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
Length = 373
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 103/205 (50%), Gaps = 30/205 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 173 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMRE 232
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G T C D K+ VS+F V + D L GPL +N A +Q
Sbjct: 233 EDYPYTGKDGPT--CKLDKSKIVASVSNFSVISIDEDQIAANLVKNGPLAVAINAAYMQT 290
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP + LNH V++VGYG + P WI++NSWG WG ++G
Sbjct: 291 YIGGV-----SCPYICARRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGESWG-ENG 344
Query: 160 YFTVERGTNACGIESYGGICTRTLN 184
++ + +G N CG++S + T++
Sbjct: 345 FYKICKGRNICGVDSLVSTVSATVS 369
>gi|345307542|ref|XP_001510786.2| PREDICTED: cathepsin O-like [Ornithorhynchus anatinus]
Length = 358
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 92/175 (52%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAI+ L LS Q+I+C+ N GC GG A+ +L L +A+Y F+ Q
Sbjct: 178 IESAYAIRGKPLEELSVQQVIDCSYNNFGCSGGSTINALNWLNKTQVKLVRDAEYSFKAQ 237
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G+ + + +R F+G D ++L +GPL ++ QDY G +I+
Sbjct: 238 TGICHYFSGSHYGISIRGYSAYDFSGQEDEMVKVLLSFGPLAVIVDAVSWQDYLGGIIQH 297
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I GY VP WIVRNSWG WG +GY V+ G N CGI
Sbjct: 298 H--CSSGEANHAVLITGYDKSGSVPYWIVRNSWGSSWGV-NGYAHVKMGANICGI 349
>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 97/194 (50%), Gaps = 28/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 164 LEGAHFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMRE 223
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +G G C D K+ VS+F V + D L GPL +N A +Q
Sbjct: 224 EDYPYTGTDG--GSCKLDRSKIVASVSNFSVVSINEDQIAANLVKNGPLAVAINAAYMQT 281
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y G + CP S LNH V+++GYG + P WI++NSWG ++G+
Sbjct: 282 YIGGV-----SCPYICSRRLNHGVLLMGYGSSGYSQARLKEKPYWIIKNSWGESWGENGF 336
Query: 161 FTVERGTNACGIES 174
+ + +G N CG++S
Sbjct: 337 YKICKGRNICGVDS 350
>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
Length = 373
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 174 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMRE 233
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D KV +V++F V + D L+ GPL +N +Q
Sbjct: 234 EDYPYTGTD--RGTCKFDNTKVAAKVANFSVVSLDEDQIAANLFKNGPLAVAINAVFMQT 291
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG PV WI++NSWG WG ++G
Sbjct: 292 YIGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPVRMKDKPYWIIKNSWGENWG-ENG 345
Query: 160 YFTVERGTNACGIES 174
++ + RG N CG++S
Sbjct: 346 FYRICRGRNICGVDS 360
>gi|309752918|gb|ADO85436.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 91/173 (52%), Gaps = 8/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ES Y IKH L LS+ LI C+ N GC GG + A++ L+ G+ +E D P+
Sbjct: 163 IESLYNIKHNKELDLSEQHLINCDSINNGCGGGLMHWALETILQQGGIVSEKDEPYY--- 219
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
G+ C V + V + R +L GP+ ++ + DY + D
Sbjct: 220 GLDAVCKPKQFNVSISGCTRYVLKNENKLRELLIANGPISMAVDIIDVIDYKEGIT---D 276
Query: 121 VCPSEN-LNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+C + N LNHAV++VGYG+ + +P WI++NSWG + GY V+R N+CG+
Sbjct: 277 ICENMNGLNHAVLLVGYGVHNNIPYWIMKNSWGEEWGEKGYLRVQRNINSCGL 329
>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 363
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 94/180 (52%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L LS+ L+ C+ GC GG ++A Q++ + + E YP+ +
Sbjct: 159 IEGQWKVAGHELTSLSEQMLLSCDTREDGCGGGLMDRAFQWIVSSNKGNVFTEQSYPYAS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+G RC + V ++SD++ + L GP+ + LQ Y G ++
Sbjct: 219 TDGDVPRCNKSGKVVGAKISDYVDLPQDENAIAEWLAKNGPVAIAVEATSLQRYTGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYG 176
C SE L+H V++VGY + P WI++NSWG+ WG ++GY +E+GTN C +++Y
Sbjct: 279 S---CISEQLDHGVLLVGYDDTSKPPYWIIKNSWGKGWG-EEGYIRIEKGTNQCLMKNYA 334
>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
Length = 366
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 98/180 (54%), Gaps = 12/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E +A+K L+ LS+ QL++C+ + GC+GG N ++ ++ GLE E DY + +
Sbjct: 182 IEGAWAVKTAQLISLSEQQLVDCDRLDDGCEGGLPVNAYLEIIRLGGLEKEEDYKYTAR- 240
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+G+C ++ K V ++D +V D R + GP+ G+N + Y + +
Sbjct: 241 --SGKCKFNHTKSAVYINDTVVLPEDEDAIARYVSENGPVAVGLNADAMMFYRSGIAHPS 298
Query: 120 DV-CPSENLNHAVVIVGYGMRHQV----PVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ C + +NH V IVGY ++ + P WI++NSWG WG + GY+ + RG CGI+
Sbjct: 299 RLMCSPDGINHGVTIVGYDVKESLFWSTPYWIIKNSWGPNWG-EKGYYYLYRGKGVCGID 357
>gi|288804650|ref|YP_003429335.1| cathepsin [Pieris rapae granulovirus]
gi|270161225|gb|ACZ63497.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 91/173 (52%), Gaps = 8/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ES Y IKH L LS+ LI C+ N GC GG + A++ L+ G+ +E D P+
Sbjct: 163 IESLYNIKHNKELDLSEQHLINCDSINNGCGGGLMHWALETILQQGGIVSEKDEPYY--- 219
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
G+ C V + V + R +L GP+ ++ + DY + D
Sbjct: 220 GLDAVCKPKQFNVSISGCTRYVLKNENKLRELLIANGPISMAVDIIDVIDYKEGIT---D 276
Query: 121 VCPSEN-LNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+C + N LNHAV++VGYG+ + +P WI++NSWG + GY V+R N+CG+
Sbjct: 277 ICENMNGLNHAVLLVGYGVHNNIPYWIMKNSWGEEWGEKGYLRVQRNINSCGL 329
>gi|395545396|ref|XP_003774588.1| PREDICTED: cathepsin W [Sarcophilus harrisii]
Length = 358
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 98/182 (53%), Gaps = 13/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+E+ +AI + L LS +L++C QGC+GG ++ + L +GL E DYP+R Q
Sbjct: 164 VEALWAINYQQLFKLSVQELLDCRRCGQGCEGGFVWDAYMTILNQSGLAEEQDYPYRPQ- 222
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT------FRRMLYHYGPLVAGMNGALLQDYNGK 114
++ C +K + + DFL+ + + + L GP+ +N LL+ Y
Sbjct: 223 -LSKGC--QKKKKRAWIHDFLMLHKEENSPSPPDMAQYLAEKGPITVTINSRLLKSYIRG 279
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+I+ + C + ++H V +VG+G H WI++NSWG WG + GYF + RG NACGI
Sbjct: 280 VIKPGNNCDPKYVDHVVQLVGFGQIHNFTYWILKNSWGSSWG-EKGYFRLHRGRNACGIT 338
Query: 174 SY 175
+
Sbjct: 339 KF 340
>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
Length = 373
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 174 LEGANYLATGKLVSLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLKAGGLMRE 233
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D K+ +V++F V + D L GPL +N +Q
Sbjct: 234 EDYPYTGTD--RGACQFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQT 291
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG WG + G
Sbjct: 292 YIGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGENWG-ESG 345
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 346 YYKICRGRNICGVDS 360
>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
Length = 354
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 96/180 (53%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ YA G + LS+ QL++C N GC GG ++A +Y+K+ GLE E YP+
Sbjct: 170 LEAAYAQAFGKSISLSEQQLVDCAGPFNNFGCHGGLPSQAFEYIKYNGGLETEEAYPYTG 229
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
++GV C + A V V+V D + D + + P+ + +
Sbjct: 230 KDGV---CKFSAENVAVQVLDSVNITLGAEDELKHAVAFVRPVSVAFQVVNGFHFYENGV 286
Query: 117 RKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+D C S+++NHAV+ VGYG+ + VP W+++NSWG ++GYF +E G N CG+ +
Sbjct: 287 FTSDTCGSTSQDVNHAVLAVGYGVENGVPYWLIKNSWGESWGENGYFKMELGKNMCGVAT 346
>gi|68086379|gb|AAH98219.1| Cathepsin O [Mus musculus]
Length = 312
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFRNQ 59
+ES AI+ +L LS Q+I+C+ N GC GG A+++L L+ A++ YPF+
Sbjct: 132 IESARAIQGKSLDYLSVQQVIDCSFNNSGCLGGSPPCALRWLNETQLKLVADSQYPFK-- 189
Query: 60 NGVTGRCA-YDARKVKVRVSDFLVFN---GSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
V G+C + + V V DF +N D R L +GPLV ++ QDY G +
Sbjct: 190 -AVNGQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGI 248
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ P W+VRNSWG WG +GY V+ G N CGI
Sbjct: 249 IQHH--CSSGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGV-EGYAHVKMGGNVCGI 303
>gi|157862755|gb|ABV90500.1| cathepsin L, partial [Fasciola gigantica]
Length = 251
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY + S+ QL++C + N GC GG KA +YL+H GLE E+ YP+R
Sbjct: 66 MEGQYMKSQRINISFSEQQLVDCSGDFGNHGCSGGLMEKAYEYLRHFGLETESSYPYRAD 125
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C YD + ++SD+ + + D + ++ GP ++ + I
Sbjct: 126 EGP---CQYDKQLGVAQLSDYYIVHSQDEVALKNLIGVEGPAAVALDVNIDFMMYKSGIY 182
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGIESY 175
++++C S LNHA++ VGYG WIV+NSWG RWG + GY + R N CGI +
Sbjct: 183 QDEICSSRYLNHALLAVGYGTEDGTEYWIVKNSWGSRWG-EHGYIRLARNRDNMCGIATL 241
Query: 176 GGI 178
+
Sbjct: 242 ASL 244
>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
Length = 365
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 99/195 (50%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ ++GC GG N A +Y LK G+
Sbjct: 168 LEGAHFLATGNLVSLSEQQLVDCDHECDPEEYGACDRGCNGGLMNTAFEYILKAGGVVRG 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G C +D K+ VS+F + D L GPL G+N +Q
Sbjct: 228 EDYPY---TGTDGHCKFDKTKIAASVSNFSTVSIDEDQIAANLVKNGPLAVGINAIFMQS 284
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S +LNH V++VGYG + P W+++NSWG+ WG + G
Sbjct: 285 YAGGV-----SCPFICSTSLNHGVLLVGYGSAGYSPIRFKEKPYWLLKNSWGQNWG-EHG 338
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 339 YYKICRGHNICGVDS 353
>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
Length = 615
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 102/182 (56%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E +A+K G L S+ +L++C+ + C GG + A + +K GLE EA+YP++ +
Sbjct: 428 IEGLHAVKTGDLKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKK 487
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+V+ F+ G++T + L GP+ G+N +Q Y G +
Sbjct: 488 N---QCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHP 544
Query: 119 -NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V++VGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 545 WKALCSKKNLDHGVLVVGYGVSEYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 603
Query: 171 GI 172
G+
Sbjct: 604 GV 605
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 60/174 (34%), Positives = 91/174 (52%), Gaps = 8/174 (4%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
E YA+ G L S+ QL++C N GC GG + Y++ GLE E+DYP+ G
Sbjct: 148 EGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPYIQTNGLELESDYPY---TG 204
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
G C+Y++ KV +VS ++ ++ + GP+ +N LQ Y +I +
Sbjct: 205 YDGYCSYESSKVVTKVSSYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSGII-DDK 263
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
C E L+H V+ VGY + W+++NSWG WG + GYF RG N CG++
Sbjct: 264 YCDPEYLDHGVLAVGYDSENGRDYWLIKNSWGADWG-ESGYFRFLRGQNICGVK 316
>gi|395861575|ref|XP_003803057.1| PREDICTED: cathepsin O [Otolemur garnettii]
Length = 320
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 93/175 (53%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES AIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 140 VESACAIKGEPLEDLSVQQVIDCSYNNYGCNGGSTVNALNWLNKMQVKLVKDSEYPFKAQ 199
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + + ++ S++ D + L +GPLV ++ QDY G +I+
Sbjct: 200 NGLCHYFSGSHSGISIKDYSEYDFNEQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 259
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 260 H--CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNICGI 311
>gi|357473427|ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula]
gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 98/194 (50%), Gaps = 28/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK G+ E
Sbjct: 164 LEGAHFLSTGKLVSLSEQQLVDCDHECDPEQPGSCDAGCNGGLMNSAFEYILKSGGVMRE 223
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D +K+ V++F V + D L GPL +N +Q
Sbjct: 224 EDYPYSGTD--RGSCKFDKKKIAASVANFSVVSLDEDQIAANLVKNGPLAIALNAVYMQT 281
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGM-------RHQVPVWIVRNSWGRWGPDDGY 160
Y G + CP S+ L+H V++VGYG + P WI++NSWG ++GY
Sbjct: 282 YVGGV-----SCPYICSKRLDHGVLLVGYGSGAYSPIRLKEKPYWIIKNSWGETWGENGY 336
Query: 161 FTVERGTNACGIES 174
+ + RG N CG++S
Sbjct: 337 YKICRGRNICGVDS 350
>gi|281354027|gb|EFB29611.1| hypothetical protein PANDA_013700 [Ailuropoda melanoleuca]
Length = 266
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 91/174 (52%), Gaps = 7/174 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 96 VESAYAIKGEPLEALSVQQVIDCSYNNYGCSGGSTVSALHWLNKTQVKLVRDSEYPFKAQ 155
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + + D + L +GPLV ++ QDY G +I+
Sbjct: 156 NGLCHYFSDSQSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVVVDAVSWQDYLGGIIQH 215
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G N CG
Sbjct: 216 H--CSSGEANHAVLITGFDKIGSTPYWIVRNSWGSSWGV-DGYARVKMGGNICG 266
>gi|255585361|ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
gi|223526784|gb|EEF29008.1| cysteine protease, putative [Ricinus communis]
Length = 381
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 99/191 (51%), Gaps = 24/191 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
+E I G LL LS+ QL++C+ + GC GG A +YL AG LE E
Sbjct: 185 IEGANFIATGKLLNLSEQQLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIEAGGLEDE 244
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
YP+ G G+C +D +K+ VRV +F + L H+GPL G+N +Q
Sbjct: 245 ISYPY---TGKPGKCKFDEKKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQT 301
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRH-------QVPVWIVRNSWG-RWGPDDGYFT 162
Y G + +C + +NH V++VGYG + P WI++NSWG RWG ++GY+
Sbjct: 302 YIGG-VSCPLICGKKWINHGVLLVGYGAKGFSILRLGYKPYWIIKNSWGKRWG-EEGYYR 359
Query: 163 VERGTNACGIE 173
+ +G CG++
Sbjct: 360 ICKGYGMCGMD 370
>gi|195729975|gb|ACG50798.1| cathepsin L1 [Fascioloides magna]
Length = 327
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 91/182 (50%), Gaps = 8/182 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY K T + S+ QL++C N N GC GG +A +YL+ GLE E+ YP+R
Sbjct: 142 MEGQYIKKFRTTVSFSEQQLVDCTRNYGNSGCNGGWMERAFEYLRRNGLETESSYPYR-- 199
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
V C Y+++ +V+ + + + + M+ GP+ ++ I
Sbjct: 200 -AVDDHCRYESQLGVAKVTGYYTEHSGNEVSLMNMVGGEGPVAVAVDVQSDFSMYKSGIY 258
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIESYG 176
+++ C + +NHAV+ VGYG WI++NSWG W D GY R N CGI SY
Sbjct: 259 QSETCSTYYVNHAVLAVGYGTESGTDYWILKNSWGSWWGDQGYIRFARNRNNMCGIASYA 318
Query: 177 GI 178
+
Sbjct: 319 SV 320
>gi|170784978|pdb|2P7U|A Chain A, The Crystal Structure Of Rhodesain, The Major Cysteine
Protease Of T. Brucei Rhodesiense, Bound To Inhibitor
K777
gi|171848756|pdb|2P86|A Chain A, The High Resolution Crystal Structure Of Rohedsain, The
Major Cathepsin L Protease From T. Brucei Rhodesiense,
Bound To Inhibitor K11002
Length = 215
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 87/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 34 IEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVS 93
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + D L GPL ++ DYNG ++
Sbjct: 94 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 153
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 154 S---CTSEQLDHGVLLVGYNDASNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 203
>gi|348582234|ref|XP_003476881.1| PREDICTED: cathepsin O-like [Cavia porcellus]
Length = 478
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 67/178 (37%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES +AI+ L LS Q+I+C+ N GC GG A+ +LK L +++YPF+ Q
Sbjct: 298 VESAWAIRGEPLEDLSAQQVIDCSYNNFGCNGGSPLSALTWLKKTRVKLVKDSEYPFKAQ 357
Query: 60 NGVTGRCAY-DARKVKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG+ C Y + + D+ ++ S D R+L GPLV ++ QDY G +
Sbjct: 358 NGL---CHYFSSSHPGFSIQDYAAYDFSAQEDEMARVLLLSGPLVVIVDAVSWQDYLGGV 414
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV++ G+ P WIVRNSWG WG DGY V+ +N CGI
Sbjct: 415 IQHH--CSSGEANHAVLVTGFDQTGSTPYWIVRNSWGSSWG-VDGYAYVKMRSNVCGI 469
>gi|148683493|gb|EDL15440.1| cathepsin O [Mus musculus]
Length = 312
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFRNQ 59
+ES AI+ +L LS Q+I+C+ N GC GG A+++L L+ A++ YPF+
Sbjct: 132 IESARAIQGKSLDYLSVQQVIDCSFNNSGCLGGSPLCALRWLNETQLKLVADSQYPFK-- 189
Query: 60 NGVTGRCA-YDARKVKVRVSDFLVFN---GSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
V G+C + + V V DF +N D R L +GPLV ++ QDY G +
Sbjct: 190 -AVNGQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGI 248
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ P W+VRNSWG WG +GY V+ G N CGI
Sbjct: 249 IQHH--CSSGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGV-EGYAHVKMGGNVCGI 303
>gi|28278727|gb|AAH44664.1| Ctso protein [Mus musculus]
Length = 292
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFRNQ 59
+ES AI+ +L LS Q+I+C+ N GC GG A+++L L+ A++ YPF+
Sbjct: 112 IESARAIQGKSLDYLSVQQVIDCSFNNSGCLGGSPLCALRWLNETQLKLVADSQYPFK-- 169
Query: 60 NGVTGRCA-YDARKVKVRVSDFLVFN---GSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
V G+C + + V V DF +N D R L +GPLV ++ QDY G +
Sbjct: 170 -AVNGQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGI 228
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ P W+VRNSWG WG +GY V+ G N CGI
Sbjct: 229 IQHH--CSSGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGV-EGYAHVKMGGNVCGI 283
>gi|26340204|dbj|BAC33765.1| unnamed protein product [Mus musculus]
Length = 312
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFRNQ 59
+ES AI+ +L LS Q+I+C+ N GC GG A+++L L+ A++ YPF+
Sbjct: 132 IESARAIQGKSLDYLSVQQVIDCSFNNSGCLGGSPLCALRWLNETQLKLVADSQYPFK-- 189
Query: 60 NGVTGRCA-YDARKVKVRVSDFLVFN---GSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
V G+C + + V V DF +N D R L +GPLV ++ QDY G +
Sbjct: 190 -AVNGQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGI 248
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ P W+VRNSWG WG +GY V+ G N CGI
Sbjct: 249 IQHH--CSSGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGV-EGYAHVKMGGNVCGI 303
>gi|29244082|ref|NP_808330.1| cathepsin O precursor [Mus musculus]
gi|67460397|sp|Q8BM88.1|CATO_MOUSE RecName: Full=Cathepsin O; Flags: Precursor
gi|26329979|dbj|BAC28728.1| unnamed protein product [Mus musculus]
gi|74139152|dbj|BAE38466.1| unnamed protein product [Mus musculus]
gi|74141620|dbj|BAE38573.1| unnamed protein product [Mus musculus]
Length = 312
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFRNQ 59
+ES AI+ +L LS Q+I+C+ N GC GG A+++L L+ A++ YPF+
Sbjct: 132 IESARAIQGKSLDYLSVQQVIDCSFNNSGCLGGSPLCALRWLNETQLKLVADSQYPFK-- 189
Query: 60 NGVTGRCA-YDARKVKVRVSDFLVFN---GSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
V G+C + + V V DF +N D R L +GPLV ++ QDY G +
Sbjct: 190 -AVNGQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGI 248
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ P W+VRNSWG WG +GY V+ G N CGI
Sbjct: 249 IQHH--CSSGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGV-EGYAHVKMGGNVCGI 303
>gi|291401083|ref|XP_002716930.1| PREDICTED: cathepsin O [Oryctolagus cuniculus]
Length = 309
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 64/175 (36%), Positives = 95/175 (54%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES +AIK L LS Q+I+C+ N GC GG A+++L L +++YPF+ +
Sbjct: 129 VESTWAIKGHPLEDLSVQQVIDCSYNNYGCSGGSTLSALKWLNKTQVRLVNDSEYPFKAR 188
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+G+ + ++ S + + D + L YGPLV ++ QDY G +I+
Sbjct: 189 SGLCHYFPSSHSGLSIKGYSAYDFSDQEDEMAKSLLIYGPLVVIVDAVSWQDYLGGVIQH 248
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ +P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 249 H--CSSGEANHAVLITGFDKTGSIPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 300
>gi|348511930|ref|XP_003443496.1| PREDICTED: cathepsin O-like [Oreochromis niloticus]
Length = 338
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 101/178 (56%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA--GLEAEADYPFRNQ 59
++S +AI L LS Q+++C+ N GC GG +A+ +LK L +++YP++ +
Sbjct: 158 IQSVHAIGGSQLEQLSVQQVVDCSYQNAGCNGGSTTRALNWLKQTRVKLVTQSEYPYKAK 217
Query: 60 NGVTGRCAYDARKVK-VRVSDFLVFNGSDTFRRM---LYHYGPLVAGMNGALLQDYNGKL 115
+ C + ++ V + +F + S + M L YGPLVA ++ QDY G +
Sbjct: 218 TEI---CHFFSQSHGGVAIKNFTTHDFSGQEKAMMGQLVQYGPLVAIVDAVSWQDYLGGI 274
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S+ NHA++IVGY +P WIV+NSWG RWG ++GY ++ G N CGI
Sbjct: 275 IQHH--CSSQWSNHAILIVGYDTTGDIPYWIVQNSWGTRWG-NEGYVYIKIGGNICGI 329
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 169 LEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 228
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D KV V++F V + D L GPL +N +Q
Sbjct: 229 EDYPYTGMD--RGACKFDKNKVAAGVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQT 286
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S L+H V++VGYG PV WI++NSWG WG ++G
Sbjct: 287 YIGGV-----SCPYICSRRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWG-ENG 340
Query: 160 YFTVERGTNACGIES 174
++ + RG N CG++S
Sbjct: 341 FYKICRGRNICGVDS 355
>gi|228861649|ref|YP_002854669.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
gi|226425097|gb|ACO53509.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
Length = 334
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 99/176 (56%), Gaps = 12/176 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+ESQYAI++ L LS+ Q+I+C+ + GC GG + A Q ++ G+E E YP+
Sbjct: 155 IESQYAIRNNVHLDLSEQQMIDCDYVDMGCYGGLLHTAFEQMIQMGGVEEERQYPYE--- 211
Query: 61 GVTGRCAYDARK---VKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
GV C + + VKV+ + + + +L GPL ++ + + +Y +I
Sbjct: 212 GVNNNCRLKSDERFVVKVKGCYRYLVMREEKLKDLLRAVGPLPMAIDASSIFNYYRGVI- 270
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C + LNHAV++VGYG+ + VP W +N+WG WG +DGYF V + +ACG+
Sbjct: 271 --NYCGNNGLNHAVLLVGYGVENGVPFWTFKNTWGDDWG-EDGYFRVRQNVDACGM 323
>gi|296195327|ref|XP_002745330.1| PREDICTED: cathepsin O [Callithrix jacchus]
Length = 453
Score = 102 bits (255), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 92/175 (52%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES AIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 273 VESACAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQ 332
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 333 NGLCHYFSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 392
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV++ G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 393 H--CSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 444
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 102 bits (254), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 102/182 (56%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYN-QGCQGGGFNKAIQYLKHAG-LEAEADYPFRNQ 59
+E Y +K G L+ LS+ L++C + GC GG +KA++Y++ AG + +E DYP+
Sbjct: 143 VEGAYFLKTGKLVSLSEQNLVDCAKEDCYGCSGGYMDKALEYIETAGGIMSENDYPYE-- 200
Query: 60 NGVTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNGAL-LQDYNGKLI 116
G+ +C +D+ KV ++S+F N D + + GP+ ++ + Q Y+ ++
Sbjct: 201 -GIDDKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPISVAIDASFNFQLYDSGIL 259
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGI 172
+ C S+ +LNH V++VGYG + WIV+NSWG WG DGY + R N CGI
Sbjct: 260 -DDSSCYSDFNSLNHGVLVVGYGTEKEQDYWIVKNSWGADWGM-DGYIWMSRNKNNQCGI 317
Query: 173 ES 174
+
Sbjct: 318 AT 319
>gi|308462787|ref|XP_003093674.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
gi|308249538|gb|EFO93490.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
Length = 392
Score = 102 bits (254), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 95/179 (53%), Gaps = 8/179 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ESQYAI+ GTL LS+ +L++C+ + GC GG +KA+ ++ GLE E DYP+
Sbjct: 210 VESQYAIRKGTLWSLSEQELVDCDGESYGCGGGFLDKALGWVLGNGLETEDDYPYECTQ- 268
Query: 62 VTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRK 118
+C + K +V V + + + D+ + GP+ M+ + NG
Sbjct: 269 -HDQCYINGGKTRVTVDEGWSLGRDEDSIADWVASVGPVAFAMSVPNSFTAYSNGVYNPS 327
Query: 119 NDVCPSENLN-HAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C E+L HA+ ++GYG P WIV+NSWG WG D GY + RG NACG+ +
Sbjct: 328 EHECRDESLGYHAMTLIGYGTEGNQPYWIVKNSWGSSWG-DQGYMRLARGNNACGMRDF 385
>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 99/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG A +Y LK GLE E
Sbjct: 167 LEGAHFLATGELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLMTNAFEYILKAGGLERE 226
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C ++ K+ V++F V + D L GPL G+N +Q
Sbjct: 227 EDYPYTGSD--RGPCKFERAKIAASVNNFSVVSVDEDQIAANLVQNGPLAVGINAVFMQT 284
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ +H VV+VGYG PV WI++NSWG WG ++G
Sbjct: 285 YIGGV-----SCPYICSKRQDHGVVLVGYGSAGYAPVRLKDKPFWIIKNSWGENWG-ENG 338
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG+++
Sbjct: 339 YYKICRGRNVCGVDA 353
>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
Length = 471
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 102/181 (56%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E +A++ G L S+ +L++C+ + C GG + A + + K GLE E+DYP+ +
Sbjct: 283 IEGLHAVRTGVLEQYSEQELLDCDTSDSACNGGLPDNAYEAIEKIGGLELESDYPYHARK 342
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C +++ K+ V+V + ++T + L GP+ G+N +Q Y G +
Sbjct: 343 D---QCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPISIGINANAMQFYRGGVSHPP 399
Query: 120 DV-CPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ C +NL+H V+IVGYG+ + +P WIV+NSWG +WG + GY+ V RG N CG
Sbjct: 400 HILCSRKNLDHGVLIVGYGVSDYPMFKKTLPYWIVKNSWGKKWG-EQGYYRVYRGDNTCG 458
Query: 172 I 172
+
Sbjct: 459 V 459
>gi|351700981|gb|EHB03900.1| Cathepsin H [Heterocephalus glaber]
Length = 334
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 99/183 (54%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP+
Sbjct: 149 LESAVAIASGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNG--K 114
G G C + +K V D + N + + Y P+ + +D+ +
Sbjct: 208 --GKDGHCRFQPQKAIAFVKDIVNITLNDEEAMVEAVALYNPVSFAYE--VTEDFMSYKR 263
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG+ H VP WIV+NSWG +WG ++GYF +ERG N CG
Sbjct: 264 GIYSSTSCHKTPDKVNHAVLAVGYGVDHGVPYWIVKNSWGTQWG-NNGYFLIERGKNMCG 322
Query: 172 IES 174
+ +
Sbjct: 323 LAA 325
>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
Length = 1165
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 101/181 (55%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E + IK L S+ +L++C+ + CQGG + A + + K GLE E++YP+ +
Sbjct: 978 IEGLHQIKTKVLEEYSEQELLDCDAVDSACQGGYMDDAYKAIEKIGGLELESEYPYLAKK 1037
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
T C +++ +V VRV + ++T + L GP+ G+N +Q Y G +
Sbjct: 1038 QKT--CHFNSTEVHVRVKGAVDLPKNETAMAQYLVANGPISIGLNANAMQFYRGGISHPW 1095
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+C +NL+H V+IVGYG++ +P WIV+NSWG +WG + GY+ + RG N CG
Sbjct: 1096 KPLCSKKNLDHGVLIVGYGVKEYPMFNKTMPYWIVKNSWGPKWG-EQGYYRIFRGDNTCG 1154
Query: 172 I 172
+
Sbjct: 1155 V 1155
>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 371
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 23/192 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG A +Y+ K GLE E
Sbjct: 174 LEGANFLSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLMTSAFEYIVKAGGLERE 233
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C + K+ ++F V N +D L GPL G+N +Q
Sbjct: 234 EDYPYTGTD--RGSCKFQNGKIAASAANFSVISNDADQIAANLVKNGPLAIGINAVFMQT 291
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDGYFT 162
Y K I +C NL+H V++VGYG + P WI++NSWG WG ++GY+
Sbjct: 292 YM-KGISCPYICSKRNLDHGVLLVGYGAAGFAPIRLKEKPYWIIKNSWGENWG-ENGYYF 349
Query: 163 VERGTNACGIES 174
+ +G N CG ES
Sbjct: 350 ICKGKNICGSES 361
>gi|91992514|gb|ABE72973.1| cathepsin L [Aedes aegypti]
Length = 265
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 101/181 (55%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E + IK L S+ +L++C+ + CQGG + A + + K GLE E++YP+ +
Sbjct: 78 IEGLHQIKTKVLEEYSEQELLDCDAVDSACQGGYMDDAYKAIEKIGGLELESEYPYLAKK 137
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
T C +++ +V VRV + ++T + L GP+ G+N +Q Y G +
Sbjct: 138 QKT--CHFNSTEVHVRVKGAVDLPKNETAMAQYLVANGPISIGLNANAMQFYRGGISHPW 195
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+C +NL+H V+IVGYG++ +P WIV+NSWG +WG + GY+ + RG N CG
Sbjct: 196 KPLCSKKNLDHGVLIVGYGVKEYPMFNKTMPYWIVKNSWGPKWG-EQGYYRIFRGDNTCG 254
Query: 172 I 172
+
Sbjct: 255 V 255
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 102 bits (253), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 68/181 (37%), Positives = 103/181 (56%), Gaps = 15/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFRN 58
LE Q+ K G L+ LS+SQL++C+ N+GC GG + A +Y+K G LE+E DYP++
Sbjct: 176 LEGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYKP 235
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGK 114
+ G C +D KV + + V +GS++ ++ + GP+ ++ + Q Y G
Sbjct: 236 KQGT---CKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGG 292
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERG-TNACG 171
+ + + C SE L+H V+ VGYG Q WIV+NSWG WG +DGY + R N CG
Sbjct: 293 VYDEPE-CSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWG-EDGYVKMSRNKKNQCG 350
Query: 172 I 172
I
Sbjct: 351 I 351
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/183 (36%), Positives = 97/183 (53%), Gaps = 15/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAG-LEAEADYPFRN 58
LE +A+K G L+ LS+ QL++C++ N GC GG A QY+K AG + E YP+
Sbjct: 144 LEGLHALKTGHLVSLSEQQLMDCSVKYGNNGCDGGNMRSAFQYIKDAGGDDTEESYPYTA 203
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM--LYHYGPLVAGMNGAL--LQDYNGK 114
+N C +D +KV ++ D M LY GP+ M+ L Q Y K
Sbjct: 204 KNE---SCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMDAGLKTFQFYK-K 259
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGT-NACG 171
I + +C + +LNH V ++GYG P W+V+NSWG+ WG DGYF + R N CG
Sbjct: 260 GIYSDYLCSNTHLNHGVTLIGYGESSDGSPYWLVKNSWGKDWGI-DGYFMLARYVGNMCG 318
Query: 172 IES 174
+ +
Sbjct: 319 VAT 321
>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
Length = 617
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 101/182 (55%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E YAIK G L S+ +L++C+ + C GG + A + +K GLE E++YP+ +
Sbjct: 430 IEGLYAIKTGELEEFSEQELLDCDSTDSACNGGLMDNAYKAIKDIGGLEYESEYPYAAKK 489
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V++S F+ G++T + L GP+ G+N +Q Y G +
Sbjct: 490 M---QCHFNRTMSHVQLSGFVDLPKGNETAMQEWLLSNGPISIGLNANAMQFYRGGVSHP 546
Query: 119 -NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+C +NL+H V+IVGYG+ +P WIV+NSWG RWG + GY+ + RG N C
Sbjct: 547 WAPLCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRIYRGDNTC 605
Query: 171 GI 172
G+
Sbjct: 606 GV 607
>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C S+ L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSKQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|261328618|emb|CBH11596.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
gambiense DAL972]
Length = 404
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 113 IEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVS 172
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 173 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 232
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 233 S---CTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 282
>gi|1594287|gb|AAC48340.1| cathepsin L-like cysteine proteinase [Toxocara canis]
Length = 360
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 97/180 (53%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES YA+ G L LS+ QL++CN+ N C GG +KA++Y+ GL E DYP+
Sbjct: 178 VESAYALGTGELRSLSEQQLLDCNLENNACDGGDVDKALRYVYDEGLMREYDYPYVAHRQ 237
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRKND 120
T C +++ + FL + + +L HYGP+ G+N A ++ Y G + D
Sbjct: 238 DT--CQLRGETTRIKAAVFLHQDEASIIDWLL-HYGPVNVGINVTADMKAYKGG-VYTPD 293
Query: 121 VCPSENL---NHAVVIVGYGMRHQV--PVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
EN H++ IVGYG + WIV+NSWG+ +G +DGY RG N+CGIE
Sbjct: 294 KWECENKIIGTHSINIVGYGTWNATNQKYWIVKNSWGQSYGIEDGYVYFARGINSCGIED 353
>gi|344293694|ref|XP_003418556.1| PREDICTED: cathepsin O-like [Loxodonta africana]
Length = 327
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 91/175 (52%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES AIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 147 VESACAIKGEPLEDLSVQQVIDCSYSNYGCNGGSTLSALNWLNKMQVKLVKDSEYPFKAQ 206
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + + D + L +GPL+ ++ QDY G +I+
Sbjct: 207 NGLCQYFSVSHSGFSIKGYSAYDFSDREDEMAKALLTFGPLIVVVDAVSWQDYLGGVIQH 266
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV++ G+ P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 267 H--CSSGEANHAVLVTGFDTTGSTPYWIVRNSWGSSWGV-DGYAHVKMGANICGI 318
>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
Length = 1785
Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats.
Identities = 58/181 (32%), Positives = 99/181 (54%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E + IK L S+ +LI+C+ + GC GG + A + + K GLE E +YP++ +
Sbjct: 1598 IEGLHQIKTKKLEAYSEQELIDCDTVDNGCNGGYMDDAFKAIEKLGGLELEDEYPYQAKA 1657
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTF-RRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
T C ++ VRV + ++TF + L GP+ G+N +Q Y G +
Sbjct: 1658 QKT--CHFNKTLSHVRVKGAVDMPKNETFIAQYLIENGPIAIGLNANAMQFYRGGISHPW 1715
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ +C + ++H V+IVGYG++ +P W ++NSWG +WG + GY+ + RG N+CG
Sbjct: 1716 HLLCSHKQIDHGVLIVGYGVKEYPLFNKTLPYWTIKNSWGPKWG-EQGYYRIYRGDNSCG 1774
Query: 172 I 172
+
Sbjct: 1775 V 1775
>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
Length = 450
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSEQLDHGVLLVGYNDSSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|410960470|ref|XP_003986812.1| PREDICTED: pro-cathepsin H [Felis catus]
Length = 321
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 98/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AIK G LL L++ QL++C N N GCQGG ++A +Y+++ G+ E YP++
Sbjct: 136 LESAIAIKTGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKG 195
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
Q+G C + K V D + N + + Y P+ + D+ K
Sbjct: 196 QDG---DCKFQPSKAIAFVKDVANITINDEEAMVEAVALYNPVSFAFE--VTDDFMMYRK 250
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ + C + +NHAV+ VGYG + +P WIV+NSWG +WG GYF +ERG N CG
Sbjct: 251 GVYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGPQWG-MKGYFLIERGKNMCG 309
Query: 172 IES 174
+ +
Sbjct: 310 LAA 312
>gi|344284284|ref|XP_003413898.1| PREDICTED: pro-cathepsin H-like [Loxodonta africana]
Length = 335
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 95/182 (52%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G LL L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYKG 209
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
Q+ V C + +K V D + N + + Y P+ + D+ K
Sbjct: 210 QDDV---CKFQPKKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFE--VTDDFMKYSK 264
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
I + C + +NHAV+ VGYG +P WIV+NSWG + DGYF +ERG N CG+
Sbjct: 265 GIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPYWGMDGYFLIERGKNMCGL 324
Query: 173 ES 174
+
Sbjct: 325 AA 326
>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 450
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 451
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L+ LS+ L+ C+ + GC GG + A ++ ++ + EA YP+ +
Sbjct: 159 IEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG +C + ++ ++D + + D L GPL ++ DYNG ++
Sbjct: 219 GNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
C SE L+H V++VGY P WI++NSW +DGY +E+GTN C
Sbjct: 279 S---CTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328
>gi|161598418|gb|ABX74953.1| cysteine protease [Leishmania panamensis]
Length = 441
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 93/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+ + +L+ LS+ +L+ C+ ++GC GG +A +L K+ + A YP+ +
Sbjct: 159 IESQWYVTTHSLITLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNKNGAVYTGASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSD--FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D + + DT L GP+ ++ + Y G ++
Sbjct: 219 GNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMSYTGGIL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C LNH V++VGY M +VP W+++NSWG WG + GY V +GTN C I+ Y
Sbjct: 279 TS---CDGRQLNHGVLLVGYNMTGEVPYWLIKNSWGENWG-EKGYVRVRKGTNECLIQEY 334
>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
Length = 330
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 97/191 (50%), Gaps = 21/191 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + ++ G L+ LS+ QL++C+ + GC GG A Y +K GLE E
Sbjct: 137 IEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMKSGGLETE 196
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G+C ++A K+ V++F + D L +GPL G+N +Q
Sbjct: 197 TDYPYTGNS--NGKCQFNANKIVASVANFSTVSLDEDQIAANLVKHGPLAIGINAVFMQT 254
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDGYFTV 163
Y G + +C +++H V++VGYG + P+ WI++NSWG + GY+ +
Sbjct: 255 YIGG-VSCPIICSKHHIDHGVLLVGYGAKGYAPIRFTEKPYWIIKNSWGATWGEQGYYKI 313
Query: 164 ERGTNACGIES 174
RG CG+ +
Sbjct: 314 CRGHGMCGMNT 324
>gi|241062152|gb|ACS66748.1| cysteine protease [Leishmania guyanensis]
Length = 441
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 93/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+ + +L+ LS+ +L+ C+ ++GC GG +A +L K+ + A YP+ +
Sbjct: 159 IESQWYVTTHSLITLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNKNGAVYTGASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSD--FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D + + DT L GP+ ++ + Y G ++
Sbjct: 219 GNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMSYTGGIL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C LNH V++VGY M +VP W+++NSWG WG + GY V +GTN C I+ Y
Sbjct: 279 TS---CDGRQLNHGVLLVGYNMTGEVPYWLIKNSWGENWG-EKGYVRVRKGTNECLIQEY 334
>gi|355681662|gb|AER96817.1| Cathepsin O precursor [Mustela putorius furo]
Length = 265
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 91/173 (52%), Gaps = 7/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAIK L LS Q+I+C+ N GCQGG A+ +L L +++YPF+ Q
Sbjct: 96 VESAYAIKGKPLEDLSVQQVIDCSYNNYGCQGGSTLSALNWLNKTQVRLVRDSEYPFKAQ 155
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + + D + L +GPLV ++ QDY G +I+
Sbjct: 156 NGLCHYFSDSQSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVVVDAVSWQDYLGGIIQH 215
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G N C
Sbjct: 216 H--CSSGEANHAVLITGFDKIGNTPYWIVRNSWGSSWGV-DGYAHVKMGGNIC 265
>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
Length = 397
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYLKHAG-LEAE 51
+E I G LL LS+ QL++C+ + GC GG A YL AG +E E
Sbjct: 201 IEGANFIATGKLLSLSEQQLVDCDHMCDLKEKDDCDDGCSGGLMTTAFNYLIEAGGIEEE 260
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQD 110
YP+ G G C ++ KV V+V +F ++ + H GPL G+N +Q
Sbjct: 261 VTYPY---TGKRGECKFNPEKVAVKVRNFAKIPEDESQIAANVVHNGPLAIGLNAVFMQT 317
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRH-------QVPVWIVRNSWG-RWGPDDGYFT 162
Y G + +C + +NH V++VGYG R P WI++NSWG RWG + GY+
Sbjct: 318 YIGG-VSCPLICDKKRINHGVLLVGYGSRGFSILRLGYKPYWIIKNSWGKRWG-EHGYYR 375
Query: 163 VERGTNACGIES 174
+ RG N CG+ +
Sbjct: 376 LCRGHNMCGMST 387
>gi|443696723|gb|ELT97360.1| hypothetical protein CAPTEDRAFT_147978 [Capitella teleta]
Length = 274
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 98/179 (54%), Gaps = 12/179 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E Q+AI+ LL LS+ +L++C+ + GC GG + ++ GLE E DYP+
Sbjct: 90 VEGQWAIQKKKLLSLSEQELVDCDKVDLGCNGGLPLQAYKEIMRIGGLETEKDYPYE--- 146
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G +C ++ +V+V ++ + + + D + L+ GP+ G+N +Q Y G +
Sbjct: 147 GKGDKCVFEKAEVEVNITGAVNISSNEDDMKAWLWKNGPISIGLNANAMQFYMGGVSHPF 206
Query: 119 NDVCPSENLNHAVVIVGYGMRH----QVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ +C +L+H V+I GYG++ P W ++NSWG WG + GY+ + RG CG+
Sbjct: 207 SFLCSPSSLDHGVLITGYGIKQGWMSDSPFWAIKNSWGESWG-EKGYYLLYRGAGVCGV 264
>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
Length = 364
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 98/194 (50%), Gaps = 29/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y+ AG ++ E
Sbjct: 166 LEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILGAGGVQRE 225
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G C +D K+ V+++ V + D L GPL G+N +Q
Sbjct: 226 EDYPYA---GRDSSCKFDKSKIAASVANYSVISLDEDQIAANLVKNGPLAVGINAVYMQT 282
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDGY 160
Y G + CP ++ L+H V IVGYG P+ WI++NSWG ++GY
Sbjct: 283 YIGGV-----SCPYICAKRLDHGVQIVGYGESGYAPIRFKEKPYWIIKNSWGESWGENGY 337
Query: 161 FTVERGTNACGIES 174
+ + RG NACG++S
Sbjct: 338 YKICRGQNACGVDS 351
>gi|9635308|ref|NP_059206.1| ORF58 [Xestia c-nigrum granulovirus]
gi|13124001|sp|Q9PYY5.1|CATV_GVXN RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|6175702|gb|AAF05172.1|AF162221_58 ORF58 [Xestia c-nigrum granulovirus]
Length = 346
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 91/173 (52%), Gaps = 8/173 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ES Y IKH L LS+ QL++C+ N GC GG + A + ++ G+ EA YP+
Sbjct: 166 IESLYHIKHNVSLDLSEQQLVDCDKVNNGCNGGLMSWAFEGIIRAGGISYEAPYPY---T 222
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
GV G C R V++ R++L+ GP+ ++ L +Y + +
Sbjct: 223 GVDGVCKNTTRYVQLSGCYAYDLRSEKKLRQVLHEKGPVSVAIDVVDLTNYKSGVAKHCS 282
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
V LNH V++VGYG + V W ++NSWG WG + G+F ++R N+CGI
Sbjct: 283 V--DHGLNHGVLLVGYGQENDVKYWTLKNSWGSDWG-EQGFFRIKRDVNSCGI 332
>gi|341887744|gb|EGT43679.1| hypothetical protein CAEBREN_04647 [Caenorhabditis brenneri]
Length = 394
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 97/179 (54%), Gaps = 8/179 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E+Q+A+K G LL LS+ +L++C++ + GC GG N A+ + GLE EADYP+
Sbjct: 212 IETQFALKKGALLSLSEQELVDCDVLSYGCNGGYLNTALLFAIEKGLETEADYPYVAIQ- 270
Query: 62 VTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRKN 119
+C+ +K++V++ D + + D + GP+ M + Y G + +
Sbjct: 271 -QKQCSIQTQKIRVKIDDGYHLKANEDQIADWVAREGPVSFLMPVPKSIMFYRGGIFNPS 329
Query: 120 DV-CPSENL-NHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C ++ + NH + IVG+G WIV+NSWG RWG + GY + RG N CG +Y
Sbjct: 330 MAECRAQAVGNHVMAIVGFGREGNQKFWIVKNSWGTRWG-EQGYLKMARGVNICGFTNY 387
>gi|146386356|gb|ABQ23966.1| cathepsin H [Oryctolagus cuniculus]
Length = 215
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 98/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C N N GC+GG ++A +Y L + G+ E YP+R
Sbjct: 31 LESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYPYR- 89
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNG--K 114
+ GRC + +K V D + N + + Y P+ + +D+ K
Sbjct: 90 --AMEGRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFE--VTEDFMQYRK 145
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + VP WIV+NSWG WG +GYF +ERG N CG
Sbjct: 146 GIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGM-NGYFYIERGKNMCG 204
Query: 172 IES 174
+ +
Sbjct: 205 LAA 207
>gi|403272508|ref|XP_003928101.1| PREDICTED: cathepsin O [Saimiri boliviensis boliviensis]
Length = 465
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 92/175 (52%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES AIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 285 VESACAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLSALNWLNKMQVKLVKDSEYPFKAQ 344
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ S + N D + L +GPLV ++ QDY G +I+
Sbjct: 345 NGLCHYFSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQH 404
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV++ G+ P WIVRNSWG WG DGY V+ G+N CGI
Sbjct: 405 H--CSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSSWGV-DGYAHVKMGSNVCGI 456
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 97/195 (49%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 175 LEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 234
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D KV V++F V + D L GPL N +Q
Sbjct: 235 EDYPYTGMD--RGACKFDKDKVAAGVANFSVVSLDEDQIAANLVKNGPLAVATNAVFMQT 292
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S L+H V++VGYG PV WI++NSWG WG ++G
Sbjct: 293 YIGGV-----SCPYICSRRLDHGVLLVGYGSAGYAPVRMKEKPYWIIKNSWGESWG-ENG 346
Query: 160 YFTVERGTNACGIES 174
++ + RG N CG++S
Sbjct: 347 FYKICRGRNICGVDS 361
>gi|33333714|gb|AAQ11975.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 323
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 97/180 (53%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N+GC GG +A +++ G++ E YP++
Sbjct: 142 IEGQFFKKNGTLVSLSAQELVDCATEYYGNEGCNGGLMGQAFDFVEDEGIQTEESYPYKA 201
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + C + V +V + + R + GP+ ++ + L Y+ ++ +
Sbjct: 202 KRSI---CQMNGEYV-TKVKTYHLLLNEQEIARAVSAKGPVAVAIDASQLSFYDQGIVDE 257
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI +Y
Sbjct: 258 KCKCSKKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGNY 316
>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
Length = 475
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 94/174 (54%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ +K+GTLL LS+ +L++C+ +Q C+GG + A + + K GLE E+DY +
Sbjct: 295 IEGQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAIEKLGGLETESDYSY---T 351
Query: 61 GVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G RC + KV + S + L GP+ +N +Q Y +
Sbjct: 352 GHKQRCDFTTGKVAAYINSSVELPKDEKEIAAWLAENGPVSVALNAFAMQFYRKGISHPL 411
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ C ++HAV++VGYG R +P W ++NSWG + GY+ + RG+NACGI
Sbjct: 412 KIFCNPWMIDHAVLLVGYGERKGIPFWAIKNSWGEDYGEQGYYYLYRGSNACGI 465
>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 96/194 (49%), Gaps = 28/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC G N A +Y LK GL E
Sbjct: 165 LEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGRLMNSAFEYTLKTGGLMRE 224
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +G G C D K+ VS+F V + D L GPL +N A +Q
Sbjct: 225 KDYPYTGTDG--GSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQT 282
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y G + CP S LNH V++VGYG + P WI++NSWG ++G+
Sbjct: 283 YIGGV-----SCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGF 337
Query: 161 FTVERGTNACGIES 174
+ + +G N CG++S
Sbjct: 338 YKICKGRNICGVDS 351
>gi|334265690|ref|YP_004376219.1| cathepsin [Clostera anachoreta granulovirus]
gi|315451014|gb|ADU24593.1| cathepsin [Clostera anachoreta granulovirus]
gi|327553705|gb|AEB00299.1| cathepsin [Clostera anachoreta granulovirus]
Length = 332
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 95/174 (54%), Gaps = 10/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ES + IK+G L LS+ L+ C+ N GC GG + A++ L GL AE D P+ +
Sbjct: 156 IESLHKIKYGVELDLSEQHLVNCDPLNNGCDGGLMHWALENILYEGGLVAERDEPYFGYD 215
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
V C + V + R +L GP+ ++ + DY + D
Sbjct: 216 AV---CKPKRLSSTISGCTRFVLQNENRLRELLVVNGPVSVAIDVIDVIDYKEGIA---D 269
Query: 121 VCPSEN-LNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+C ++N LNHAV++VGYG+ + VP WI++NSWG WG ++G+F V+R N+CGI
Sbjct: 270 MCHNKNGLNHAVLLVGYGVDNDVPYWILKNSWGENWG-ENGFFRVQRNVNSCGI 322
>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
Length = 338
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 102/175 (58%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+ESQY IK+ + LS+ Q+++C+ N GC GG + A++Y ++ G++ E DY +
Sbjct: 161 IESQYYIKNKQYVDLSEQQIVDCDPINNGCNGGLMSWAMEYVMRSGGVQLEEDYQYVGNE 220
Query: 61 GVTGRCAYDARKVKVRVSDFLVFN--GSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
GV C ++ V V++S + ++ + R +L GP+ ++ + +Y + +
Sbjct: 221 GV---CKNNSANV-VQISGCVSYDLRNEERLRELLVSNGPISVAIDVMDVTNYQSGIAKH 276
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
V + LNHAV++VGYG+++ P W+ +NSWG WG ++GYF V R N+CG+
Sbjct: 277 CSV--AHGLNHAVLLVGYGVQNNTPYWVFKNSWGSDWG-ENGYFRVLRDVNSCGM 328
>gi|387765908|gb|AFJ95133.1| cathepsin-L [Toxocara canis]
Length = 360
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 97/180 (53%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES YA+ G L LS+ QL++CN+ N C GG +KA++Y+ GL E DYP+
Sbjct: 178 VESAYALGTGELRSLSEHQLLDCNLENNACDGGDVDKALRYVYDEGLMREYDYPYVAHRQ 237
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRKND 120
T C +++ + FL + + +L HYGP+ G+N A ++ Y G + D
Sbjct: 238 DT--CQLRGETTRIKAAVFLHQDEASIIDWLL-HYGPVNVGINVTADMKAYKGG-VYTPD 293
Query: 121 VCPSENL---NHAVVIVGYGMRHQV--PVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
EN H++ IVGYG + WIV+NSWG+ +G +DGY RG N+CGIE
Sbjct: 294 RWECENKIIGTHSINIVGYGTWNATNQKYWIVKNSWGQSYGIEDGYVYFARGINSCGIED 353
>gi|291410711|ref|XP_002721635.1| PREDICTED: cathepsin H [Oryctolagus cuniculus]
Length = 333
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 98/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C N N GC+GG ++A +Y L + G+ E YP+R
Sbjct: 148 LESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYPYR- 206
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNG--K 114
+ GRC + +K V D + N + + Y P+ + +D+ K
Sbjct: 207 --AMEGRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFE--VTEDFMQYRK 262
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + VP WIV+NSWG WG +GYF +ERG N CG
Sbjct: 263 GIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWG-MNGYFYIERGKNMCG 321
Query: 172 IES 174
+ +
Sbjct: 322 LAA 324
>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
Length = 257
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 97/194 (50%), Gaps = 29/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC---------NIYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + + G L+ LS+ QL++C N + GC GG A +Y LK GL+ E
Sbjct: 57 VEGAHFLATGELVSLSEQQLVDCDHECDAEQQNECDAGCGGGLMTTAFEYTLKAGGLQRE 116
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G+C +D K+ V++F V D L +GPL G+N A +Q
Sbjct: 117 KDYPY---TGRDGKCHFDKSKIAASVANFSVVGLDEDQIAANLVKHGPLAVGINAAWMQT 173
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y G + CP + +H V++VGYG + P WI++NSWG + GY
Sbjct: 174 YVGGV-----SCPLICFKRQDHGVLLVGYGSAGFAPIRLKEKPYWIIKNSWGESWGEQGY 228
Query: 161 FTVERGTNACGIES 174
+ + RG N CG+++
Sbjct: 229 YKICRGRNICGVDA 242
>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
Length = 367
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 97/191 (50%), Gaps = 21/191 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + ++ G L+ LS+ QL++C+ + GC GG A Y +K GLE E
Sbjct: 174 IEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMKSGGLETE 233
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G+C ++A K+ V++F + D L +GPL G+N +Q
Sbjct: 234 TDYPYTGNS--NGKCQFNANKIVASVANFSTVSLDEDQIAANLVKHGPLAIGINAVFMQT 291
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDGYFTV 163
Y G + +C +++H V++VGYG + P+ WI++NSWG + GY+ +
Sbjct: 292 YIGG-VSCPIICSKHHIDHGVLLVGYGAKGYAPIRFTEKPYWIIKNSWGATWGEQGYYKI 350
Query: 164 ERGTNACGIES 174
RG CG+ +
Sbjct: 351 CRGHGMCGMNT 361
>gi|358255491|dbj|GAA57187.1| cathepsin L [Clonorchis sinensis]
Length = 368
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 96/182 (52%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAG-LEAEADYPFRN 58
+E I + L LS QLI+C++ N GC GG + +YLK +G LE + DYP+ +
Sbjct: 182 VEGHTYIHNNQLETLSTQQLIDCSLEYGNGGCTGGDSVTSFKYLKESGGLERDRDYPYVS 241
Query: 59 QNGV--TGRCAYDARKVKVRVSDFLV--FNGSDTFRRMLYHYGPLVAGMNGAL--LQDYN 112
+ C +D K V+ F+V ++ D + + YGP+ ++ L +DY
Sbjct: 242 DKTIRPNPECKFDWTKCAAEVTGFVVLPYHDEDAILQAVGFYGPVAISVDSRLQSFKDYK 301
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
G + +D +N +H++V+VGYG + P WI++NSWG + GY + RG N CG+
Sbjct: 302 GDIY--SDPLCGKNSDHSMVVVGYGEENGTPYWIIKNSWGEHWGEKGYLRLRRGVNMCGV 359
Query: 173 ES 174
S
Sbjct: 360 AS 361
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 100/195 (51%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y L+ G++ E
Sbjct: 172 LEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKE 231
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G C +D KV VS++ V + + L GPL +N +Q
Sbjct: 232 KDYPY---TGRDGTCKFDKTKVAATVSNYSVVSLDEEQIAANLVKNGPLAVAINAVFMQT 288
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 289 YVGGV-----SCPYICGKHLDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWG-ENG 342
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 343 YYKICRGRNVCGVDS 357
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 97/195 (49%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 169 LEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 228
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D KV V++F + D L GPL +N +Q
Sbjct: 229 EDYPYTGMD--RGACKFDKNKVAAGVANFSAVSLDEDQIAANLVKNGPLAVAINAVFMQT 286
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S L+H V++VGYG PV WI++NSWG WG ++G
Sbjct: 287 YIGGV-----SCPYICSRRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWG-ENG 340
Query: 160 YFTVERGTNACGIES 174
++ + RG N CG++S
Sbjct: 341 FYKICRGRNICGVDS 355
>gi|442736236|gb|AGC65593.1| cathepsin [Achaea janata granulovirus]
Length = 338
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 98/176 (55%), Gaps = 13/176 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
ESQYAIKHG + S+ L++C+ N GC GG + A + ++ G+ E DYP+
Sbjct: 161 FESQYAIKHGKHVDFSEQHLLDCDQLNYGCDGGLMHWAFEEIIRMGGVVLEYDYPY---T 217
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
GV CA + + +S + ++ D R +L GP+ ++ + DY ++
Sbjct: 218 GVESFCANNV-NMYTTISGCVQYDLRDEEKLRELLVTNGPIAVALDIVDIVDYKSGVVS- 275
Query: 119 NDVCPSEN-LNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C + N LNHAV++VGYG+ + W+++NSWG WG ++GYF ++R N+CGI
Sbjct: 276 --FCGTNNGLNHAVLLVGYGVDKTIEYWLLKNSWGTDWG-EEGYFRIKRNRNSCGI 328
>gi|5777611|emb|CAB53397.1| cysteine protease [Medicago sativa]
Length = 209
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 97/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC---------NIYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C N + GC GG N A +Y L+ G+ +E
Sbjct: 13 LEGANYLATGKLVSLSEQQLVDCDHVCDPEERNSCDSGCNGGLMNNAFEYILQSGGVVSE 72
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DY + G G C +D K+ VS+F V + D L GPL +N A +Q
Sbjct: 73 KDYAY---TGRDGSCKFDKSKIVASVSNFSVVSLDEDQIAANLVKNGPLAVAINAAWMQT 129
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDGYFT 162
Y + +C L+H V++VG+G P+ WI++NSWG+ WG ++GY+
Sbjct: 130 YMSG-VSCPHICAKARLDHGVLLVGFGSGGYAPIRLKEKPYWIIKNSWGQNWG-EEGYYK 187
Query: 163 VERGTNACGIES 174
+ RG N CG++S
Sbjct: 188 ICRGRNVCGVDS 199
>gi|312192187|gb|ADQ43790.1| cathepsin [Dione juno MNPV tmk1/ARG/2003]
Length = 166
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 91/160 (56%), Gaps = 10/160 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIK+ L+ LS+ QLI+C+ + GC+GG + A + ++ G++ E DYP+ +N
Sbjct: 13 LESQFAIKYNRLINLSEQQLIDCDSVDAGCEGGLLHTAYEAIMEMGGVQVEHDYPYERRN 72
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G C D K V V + + + +L GPL ++ + + +Y +IR
Sbjct: 73 G---DCRVDTAKFVVNVKKCYRYITVLEEKLKDLLRIVGPLPVAIDASDIVNYKRGIIR- 128
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPD 157
C + LNHAV++VGY + + VP WI++N+WG WG D
Sbjct: 129 --YCSNHGLNHAVLLVGYAVENGVPYWILKNTWGTDWGED 166
>gi|268581031|ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae]
Length = 379
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 95/176 (53%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E+Q+AIK G L+ LS+ ++++C+ N GC GG A++++K GLE E YP+
Sbjct: 197 IEAQHAIKKGILVSLSEQEMVDCDGRNNGCSGGYRPYAMRFVKENGLETEKSYPYSALKH 256
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGS-DTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRK 118
+C KV + D+ + + S + + GP+ GMN A+ +G
Sbjct: 257 --DQCMLHQNDTKVYIDDYRMLSTSEENIADWVGTKGPVTFGMNVVKAMYSYRSGIFNPS 314
Query: 119 NDVCPSENLN-HAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C +++ HA+ IVGYG WIV+NSWG WG DGYF + RG N+CG+
Sbjct: 315 AEDCAEKSMGAHALTIVGYGGEGTSAYWIVKNSWGTSWG-SDGYFRLARGVNSCGL 369
>gi|324513891|gb|ADY45690.1| Cysteine proteinase [Ascaris suum]
Length = 398
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 97/179 (54%), Gaps = 9/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES YAI G L LS+ QL++CN+ N C GG +KA++Y+ GL E DYP+
Sbjct: 216 VESAYAIGTGELKSLSEQQLLDCNVENNACDGGDIDKALRYVYEEGLMTEYDYPYVAHRQ 275
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRKND 120
T C +++ + FL + L H GP+ G+N A ++ Y G + N
Sbjct: 276 ET--CYLRGETTRIKAAVFL-HQDEASIIDWLIHNGPVNVGVNVTADMKAYKGGVYTPNK 332
Query: 121 -VCPSENL-NHAVVIVGYGMRHQV--PVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
C ++ + HA+ IVGYG ++ WIV+NSWG+ +G ++GY RG N+CGIE
Sbjct: 333 WECENKIIGTHAMNIVGYGTWNKTNEKYWIVKNSWGQSYGVENGYVYFARGINSCGIED 391
>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 366
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 100/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG A +Y LK GL E
Sbjct: 167 LEGAHFLSTGGLVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLKAGGLMRE 226
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++ G C +D K+ V++F V + + L GPL G+N +Q
Sbjct: 227 EDYPYTGRD--RGPCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVGINAVFMQT 284
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 285 YIGGV-----SCPYICGKHLDHGVLLVGYGSGAYAPIRFKEKPYWIIKNSWGESWG-EEG 338
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 339 YYKICRGRNVCGVDS 353
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 100/195 (51%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG A +Y L+ GL E
Sbjct: 167 LEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLQAGGLMRE 226
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++ G C +D KV V++F V + + L GPL G+N +Q
Sbjct: 227 KDYPYTGRD--RGPCKFDKSKVAASVANFSVVSLDEEQIAANLVQNGPLAVGINAVFMQT 284
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 285 YIGGV-----SCPYICGKHLDHGVLLVGYGSGAYAPIRFKEKPYWIIKNSWGESWG-EEG 338
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 339 YYKICRGRNVCGVDS 353
>gi|344239864|gb|EGV95967.1| Cathepsin O [Cricetulus griseus]
Length = 291
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES AI+ L LS Q+I+C+ N GC GG A+ +L L +++YPF+ +
Sbjct: 111 IESACAIQGKPLDYLSVQQVIDCSFNNYGCSGGSPLSALSWLNKTQVKLMEDSEYPFKAE 170
Query: 60 NGVTGRCAY-DARKVKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG+ C Y + V + DF ++ S D + L ++GPLV ++ QDY G +
Sbjct: 171 NGL---CRYFPQSQSGVSIKDFSAYDFSGQEDEMAKALLNFGPLVVIVDAVSWQDYLGGI 227
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ P W+V NSWG WG DGY V+ G N CGI
Sbjct: 228 IQHH--CSSGEANHAVLITGFDKTGNTPYWMVHNSWGNSWGI-DGYAHVKMGGNVCGI 282
>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 377
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 97/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
+E I G LL LS+ QL++C+ + GC GG A +YL + GLE E
Sbjct: 171 IEGANFIATGKLLNLSEQQLVDCDSQCDITESTTCDNGCMGGLMTNAYKYLLQSGGLEEE 230
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
+ YP+ G G C +D KV VR+++F + + L +GPL G+N +Q
Sbjct: 231 SSYPY---TGAKGECKFDPGKVAVRITNFTNIPVDENQIAAYLVKHGPLAVGLNAIFMQT 287
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRH-------QVPVWIVRNSWG-RWGPDDGYFT 162
Y G + +C + LNH V++VGY + P WI++NSWG RWG DGY+
Sbjct: 288 YIGG-VSCPLICSKKWLNHGVLLVGYRAKGFSILRLGNKPYWIIKNSWGKRWGV-DGYYK 345
Query: 163 VERGTNACGIES 174
+ RG CG+ +
Sbjct: 346 LCRGHGMCGMNT 357
>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
Length = 352
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 101/192 (52%), Gaps = 25/192 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC----------NIYNQGCQGGGFNKAIQYL-KHAGLEA 50
+E Q+ + GTL+ LS+ L++C N+ N GC GG A Y+ K+ G++
Sbjct: 158 VEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGIQT 217
Query: 51 EADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQ 109
EA YP+ V G C +++ +V ++S F + ++T L++ GPL + Q
Sbjct: 218 EATYPY---TAVDGECKFNSAQVGAKISSFTMVPQNETQIASYLFNNGPLAIAADAEEWQ 274
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQV-----PVWIVRNSWGR-WGPDDGYFTV 163
Y G + D + L+H ++IVGYG + + P WI++NSWG WG + GY V
Sbjct: 275 FYMGGVF---DFPCGQTLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADWG-EAGYLKV 330
Query: 164 ERGTNACGIESY 175
ER T+ CG+ ++
Sbjct: 331 ERNTDKCGVANF 342
>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
[Glycine max]
Length = 374
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 97/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
+E + G L+ LS+ QL++C+ + GC GG A YL + GLE E
Sbjct: 168 IEGANFLATGKLVSLSEQQLLDCDNKCEITEKTSCDNGCNGGLMTNAYNYLLESGGLEEE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
+ YP+ G G C +D K+ VR+++F + + L GPL G+N +Q
Sbjct: 228 SSYPY---TGERGECKFDPEKITVRITNFTNIPVDENQIAAYLVKNGPLAMGVNAIFMQT 284
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRH-------QVPVWIVRNSWG-RWGPDDGYFT 162
Y G + +C + LNH V++VGYG + P WI++NSWG +WG +DGY+
Sbjct: 285 YIGG-VSCPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGKKWG-EDGYYK 342
Query: 163 VERGTNACGIES 174
+ RG CGI +
Sbjct: 343 LCRGHGMCGINT 354
>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 99/184 (53%), Gaps = 18/184 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
LE Q K G L+ LS+ QL++C+ Y N+GC GG N A +Y G E+E+DYP+
Sbjct: 155 LEGQLKRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMNDAFRYWMRNGAESESDYPY--- 211
Query: 60 NGVTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMN----GALLQDYNG 113
+ G+C +++ KV +VS F+ D + + GP+ ++ G +L
Sbjct: 212 TAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSVAIDATSSGFMLYK--- 268
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGM-RHQVPVWIVRNSWGR-WGPDDGYFTVERGT-NAC 170
K I +++ C + L+HAV++VGY + + WIV+NSWG WG GY + R N C
Sbjct: 269 KGIYQDNTCSQQYLDHAVLVVGYDADKTRQKYWIVKNSWGEDWG-QRGYIWMARDKGNMC 327
Query: 171 GIES 174
GI +
Sbjct: 328 GIAT 331
>gi|121531590|gb|ABM55480.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 321
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 96/178 (53%), Gaps = 11/178 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGC-QGGGFNKAIQYLKHAGLEAEADYPFRN 58
LE Q AI H PLS+ QL++C+ N C +GG A +Y+K G+EA + YP++
Sbjct: 143 LEGQNAIHHKVKTPLSERQLLDCSSSYGNGDCDEGGLMTNAFKYIKAKGIEAGSSYPYQ- 201
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G G C Y+A+K +R+ F S+ ++ + GP+ ++ L+ Y G +I
Sbjct: 202 --GRVGSCRYNAQKTILRIKGFKELRASEVELKKAVGTIGPISVAVSSEHLRLYGGGVIT 259
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
+ ++L+HAV+ VGYG + W +RNSWG+ D GYF + R N CG+ S
Sbjct: 260 TRCI---KDLDHAVLAVGYGSENGRKYWKIRNSWGKTWGDHGYFKLARDAGNLCGVAS 314
>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 95/180 (52%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L LS+ L+ C+ + GCQGG ++A++++ + + E YP+ +
Sbjct: 159 IEGQWKVAGHELTSLSEQMLVSCDNMDDGCQGGLMDRALKWIVSSNKGNVFTEESYPYDS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+G C + V ++S + + + L GP+ ++ + DY G ++
Sbjct: 219 TDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVDASSFLDYTGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYG 176
C S+ LNH V++VGY + P WI++NSWG +WG ++GY VE+GTN C ++ Y
Sbjct: 279 S---CSSDALNHDVLLVGYDDSSKPPYWIIKNSWGKKWG-EEGYIRVEKGTNQCLMKEYA 334
>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
Length = 355
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 99/195 (50%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + + G L+ LS+ QL++C+ + GC GG A +Y LK GL+ E
Sbjct: 165 VEGAHFLATGELVSLSEQQLVDCDHECDPEQQDSCDAGCSGGLMTTAFEYTLKAGGLQRE 224
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G+C +D K+ V++F V D L +GPL G+N A +Q
Sbjct: 225 KDYPY---TGKXGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQT 281
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP + +H V++VGYG P+ WI++NSWG WG + G
Sbjct: 282 YVGGV-----SCPLICFKRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWG-EHG 335
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG+++
Sbjct: 336 YYKICRGHNICGVDA 350
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 104/181 (57%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
+E Q+ + G L+ LS+ QL++C+ N GC GG + A +Y+K H G++ E YP+ + N
Sbjct: 186 IEGQHYLATGKLVSLSEQQLVDCSSSNDGCDGGLMDLAFEYVKEHKGIDTEVHYPYVSGN 245
Query: 61 -GVTGRCAYDARKVKVRVSDFL-VFNGSDTF-RRMLYHYGPLVAGMNGAL--LQDYNGKL 115
G +C++D + V V+ ++ + G + ++ + +GP+ G+N L Y
Sbjct: 246 TGYARQCSFDPKYAAVNVTGYVDIPEGQELLLQQAVGFHGPISVGINAGLPSFMAYESG- 304
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGIE 173
I + C +L+H V++VGYG+ + VP W+++NSWG WG ++GY + R N CG+
Sbjct: 305 IYSDHRCNPHDLDHGVLVVGYGVDNGVPYWLIKNSWGEDWG-ENGYVRILRNHNNLCGVA 363
Query: 174 S 174
+
Sbjct: 364 T 364
>gi|121531592|gb|ABM55481.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 318
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 95/178 (53%), Gaps = 11/178 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGF-NKAIQYLKHAGLEAEADYPFRN 58
LE Q AI H PLS+ QL++C+ N C GG A +Y+K G+EA + YP++
Sbjct: 140 LEGQNAIHHKVKTPLSEQQLLDCSSSYGNGDCDEGGLMTNAFKYIKAKGIEAGSSYPYQ- 198
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G G C Y+A+K +R+ F S+ ++ + GP+ ++ L+ Y G +I
Sbjct: 199 --GRVGSCRYNAQKTILRIKGFKELRASEVELKKAVGTIGPISVAVSSEHLRLYGGGVIT 256
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
+ ++L+HAV+ VGYG + W +RNSWG+ D GYF + R N CG+ S
Sbjct: 257 TRCI---KDLDHAVLAVGYGSENGRKYWKIRNSWGKTWGDHGYFKLARDAGNLCGVAS 311
>gi|354474585|ref|XP_003499511.1| PREDICTED: cathepsin O-like [Cricetulus griseus]
Length = 311
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES AI+ L LS Q+I+C+ N GC GG A+ +L L +++YPF+ +
Sbjct: 131 IESACAIQGKPLDYLSVQQVIDCSFNNYGCSGGSPLSALSWLNKTQVKLMEDSEYPFKAE 190
Query: 60 NGVTGRCAY-DARKVKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG+ C Y + V + DF ++ S D + L ++GPLV ++ QDY G +
Sbjct: 191 NGL---CRYFPQSQSGVSIKDFSAYDFSGQEDEMAKALLNFGPLVVIVDAVSWQDYLGGI 247
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ P W+V NSWG WG DGY V+ G N CGI
Sbjct: 248 IQHH--CSSGEANHAVLITGFDKTGNTPYWMVHNSWGNSWG-IDGYAHVKMGGNVCGI 302
>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 168 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMRE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + C +D K+ +V++F V + D L GPL +N +Q
Sbjct: 228 EDYPYTGND--LQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 286 YIGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 340 YYKICRGRNVCGVDS 354
>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 96/186 (51%), Gaps = 24/186 (12%)
Query: 8 IKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
I G LL LS+ QL++C+ N GC GG A +YL + GLE E+ YP+
Sbjct: 217 IATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPY- 275
Query: 58 NQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G +G+C + + K+ V+VS+F + L GPL G+N +Q Y G +
Sbjct: 276 --TGRSGQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIGG-V 332
Query: 117 RKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+C +NH V++VGYG ++P W+++NSWG RWG + GY+ + RG
Sbjct: 333 SCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWG-EHGYYRLCRGHG 391
Query: 169 ACGIES 174
CGI +
Sbjct: 392 MCGINT 397
>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 361
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 100/202 (49%), Gaps = 29/202 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC---------NIYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + + G L+ LS+ QL++C N + GC GG A +Y LK GL+ E
Sbjct: 161 VEGAHFLATGELVSLSEQQLVDCDHECDPVEKNDCDAGCNGGLMTTAFEYTLKAGGLQLE 220
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +NG +C +D ++ VS+F V D L +GPL G+N A +Q
Sbjct: 221 KDYPYTGRNG---KCHFDKSRIAASVSNFSVVGLDEDQIAANLLKHGPLAVGINAAWMQT 277
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDGY 160
Y + CP + +H V++VGYG P+ WI++NSWG+ + GY
Sbjct: 278 Y-----VRGVSCPLICFKRQDHGVLLVGYGSEGFAPIRLKNKPYWIIKNSWGKTWGEHGY 332
Query: 161 FTVERGTNACGIESYGGICTRT 182
+ + RG + CG+++ T T
Sbjct: 333 YKICRGHHICGVDAMVSTVTAT 354
>gi|26245861|gb|AAN77406.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 196
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 95/178 (53%), Gaps = 11/178 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGF-NKAIQYLKHAGLEAEADYPFRN 58
+E Q AI + PLS+ QL++C+ N C GG KA Y+ G+EAE+ YP+
Sbjct: 17 VEGQNAIHNKVKTPLSEQQLLDCSASYGNGDCHDGGLMTKAFNYIIDNGIEAESSYPYVE 76
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
Q C YDA+K V++ + + D ++ + GP+ GM+ L Y G ++
Sbjct: 77 Q---MTECQYDAKKTIVQIKGYKKLLADEDELKKAVGAVGPISVGMSSENLHMYGGGIL- 132
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
+D C +++HAV++VGYG + W V+NSWG +DGYF +ER N C I S
Sbjct: 133 -DDQCYF-DMDHAVLVVGYGEANGKKFWRVKNSWGTTWGEDGYFRIERDADNLCDIAS 188
>gi|403355691|gb|EJY77431.1| Cathepsin H [Oxytricha trifallax]
Length = 363
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 102/181 (56%), Gaps = 15/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LES + +K+G LS+ QL++C N N GC GG + A +YLK + G+ E YP+
Sbjct: 168 LESHFLLKYGQFRNLSEQQLVDCAGNYDNHGCNGGLPSHAFEYLKDNGGIAEETSYPYV- 226
Query: 59 QNGVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPL-VAGMNGALLQDYNGKL 115
VT CA ++ V V+ V D ++ +Y +GP+ +A + +DY
Sbjct: 227 --AVTNTCALKKGSQSVGVKGGAVNVSLSEDDLKQAIYSHGPVSIAFQVASDFRDYRAG- 283
Query: 116 IRKNDVCPS--ENLNHAVVIVGYGM-RHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
+ + VC + +++NHAV+ VG+G ++V WI++NSWG WG D GYF +ERG N CG
Sbjct: 284 VYTSKVCKNGPQDVNHAVLAVGFGTDENKVDYWIIKNSWGAVWG-DQGYFKMERGVNMCG 342
Query: 172 I 172
+
Sbjct: 343 V 343
>gi|8547325|gb|AAF76330.1|AF271385_1 cathepsin L [Fasciola hepatica]
Length = 326
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 93/184 (50%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C + N GC GG A +YLK GLE E+ YP+R
Sbjct: 141 MEGQYMKNQRTSISFSEQQLVDCSRDFGNYGCNGGLMENAYEYLKRFGLETESSYPYR-- 198
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMN--GALLQDYNGKL 115
V G+C Y+ + +V+ + + D + ++ GP ++ + +G
Sbjct: 199 -AVEGQCRYNEQLGVAKVTGYYTVHSGDEVELQNLVGAEGPAAVALDVESDFMMYRSG-- 255
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYF-TVERGTNACGIES 174
I ++ C + LNH V+ VGYG++ WIV+NSWG W +DGY V + N CGI S
Sbjct: 256 IYQSQTCSPDRLNHGVLAVGYGIQDGTDYWIVKNSWGTWWGEDGYIRMVRKRGNMCGIAS 315
Query: 175 YGGI 178
+
Sbjct: 316 LASV 319
>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 377
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 178 LEGANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKE 237
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D K+ V++F V + + L GPL +N +Q
Sbjct: 238 QDYPYTGTD--RGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQT 295
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWG-RWGPDDG 159
Y K CP S++L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 296 Y-----IKGVSCPYICSKHLDHGVLLVGYGSDGYAPIRLKDKPYWIIKNSWGANWG-ENG 349
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 350 YYKICRGRNICGVDS 364
>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 168 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMRE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + C +D K+ +V++F V + D L GPL +N +Q
Sbjct: 228 EDYPYTGND--LQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 286 YIGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 340 YYKICRGRNVCGVDS 354
>gi|213512532|ref|NP_001134063.1| Cathepsin O precursor [Salmo salar]
gi|209730446|gb|ACI66092.1| Cathepsin O precursor [Salmo salar]
Length = 341
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 90/176 (51%), Gaps = 9/176 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA--GLEAEADYPFRNQ 59
+ES YA LS Q+I+C+ NQGC GG +A+ +LK L +++YP++ +
Sbjct: 161 IESVYAKSGQPFKQLSVQQVIDCSYKNQGCNGGSITRALSWLKQTRVKLVKQSEYPYKAE 220
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM---LYHYGPLVAGMNGALLQDYNGKLI 116
G+ + V V DF + S M L +GPL ++ QDY G ++
Sbjct: 221 TGICH--LFSQSHDGVLVKDFAAHDYSGHEEAMMGRLVEWGPLAVTVDAISWQDYLGGIM 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ + C + NHAV++ GY VP WIV+NSWG ++GY ++ G N CGI
Sbjct: 279 QHH--CSCHHANHAVLVTGYDTTGDVPYWIVQNSWGTSWGNEGYVYIKMGGNVCGI 332
>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
Length = 361
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 103/180 (57%), Gaps = 12/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+E Q+ + LL LS+ +L++C+ + GC+GG +A++ ++ GLE E++YP++
Sbjct: 172 VEGQWFLSRSKLLSLSEQELVDCDHGDHGCKGGYMGQAMKAVIEMGGLETESEYPYK--- 228
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
GV G C ++ + K RV F+ ++T L +GP+ G+N +Q Y G +
Sbjct: 229 GVDGTCEFNKTESKARVQSFVGLPQNETELAYWLMKHGPVSIGINANAMQFYFGGISHPW 288
Query: 119 NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+C +L+H V++VG+G+ R VP WIV+NSWG++ + GY+ V RG CG+
Sbjct: 289 KFLCSPTDLDHGVLLVGFGVDKRSFRRKPVPYWIVKNSWGKYWGEKGYYRVYRGDGTCGV 348
>gi|395542489|ref|XP_003773162.1| PREDICTED: cathepsin O-like [Sarcophilus harrisii]
Length = 407
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 92/175 (52%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAIK +L LS Q+I+C+ N GC GG A+ +L L +++Y F+ Q
Sbjct: 227 IESAYAIKGESLEDLSVQQVIDCSYNNFGCSGGSTVNALNWLNKTQVRLVRDSEYSFKAQ 286
Query: 60 NGVTGRCAYDARKVKVR-VSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G+ + V ++ S + + D ++L YGPL ++ QDY G +I+
Sbjct: 287 TGLCHYFSGSHAGVSIKGYSSYDFSDKEDEMAKVLLAYGPLAVIVDAISWQDYLGGIIQH 346
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 347 H--CSSGEANHAVLITGFDKTGNTPYWIVRNSWGTSWGV-DGYAFVKMGANICGI 398
>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 96/186 (51%), Gaps = 24/186 (12%)
Query: 8 IKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
I G LL LS+ QL++C+ N GC GG A +YL + GLE E+ YP+
Sbjct: 217 IATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPY- 275
Query: 58 NQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G +G+C + + K+ V+VS+F + L GPL G+N +Q Y G +
Sbjct: 276 --TGRSGQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIGG-V 332
Query: 117 RKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+C +NH V++VGYG ++P W+++NSWG RWG + GY+ + RG
Sbjct: 333 SCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWG-EHGYYRLCRGHG 391
Query: 169 ACGIES 174
CGI +
Sbjct: 392 MCGINT 397
>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
Length = 366
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 166 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMRE 225
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + C +D K+ +V++F V + D L GPL +N +Q
Sbjct: 226 EDYPYTGND--LQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQT 283
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 284 YIGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWG-ENG 337
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 338 YYKICRGRNVCGVDS 352
>gi|293345419|ref|XP_001070844.2| PREDICTED: cathepsin O-like [Rattus norvegicus]
Length = 307
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 68/175 (38%), Positives = 95/175 (54%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFRNQ 59
+ES AI+ L LS Q+I+C+ N GC+GG A+ +L L+ A++ YPF+ +
Sbjct: 131 VESAGAIQGKPLDYLSVQQVIDCSFNNYGCRGGSPLGALSWLNETQLKLVADSQYPFKAE 190
Query: 60 NGVTGRCAYDARKVK-VRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ C Y + V +S F N D R L +GPLV ++ QDY G +I+
Sbjct: 191 NGL---CRYFPQSFNYVYISSFGS-NQEDEMARALLSFGPLVVIVDAVSWQDYLGGIIQH 246
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV+I G+ P W+VRNSWG WG +GY V+ G N CGI
Sbjct: 247 H--CSSGEANHAVLITGFDKTGNTPYWMVRNSWGNSWG-VEGYAYVKMGGNVCGI 298
>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
Length = 374
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 96/192 (50%), Gaps = 23/192 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG A +Y LK GLE E
Sbjct: 174 LEGANFLATGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLMTSAFEYTLKAGGLERE 233
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + +C +D K+ V S+F V + + L GPL G+N +Q
Sbjct: 234 EDYPYTGTD--HSKCKFDKTKIAVSASNFSVVSLDENQIAANLVTNGPLAIGINAMFMQT 291
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDGYFT 162
Y G + +C L+H V++VGYG + P WI++NSWG WG + GY+
Sbjct: 292 YIGG-VSCPYICSKRLLDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGESWG-EKGYYK 349
Query: 163 VERGTNACGIES 174
+ RG N CG++S
Sbjct: 350 ICRGRNICGMDS 361
>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 99/195 (50%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + + G L+ LS+ QL++C+ + GC GG A +Y LK GL+ E
Sbjct: 163 VEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLE 222
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G+C +D K+ V++F V D L +GPL G+N A +Q
Sbjct: 223 KDYPY---TGKDGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQT 279
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP + +H V++VGYG P+ WI++NSWG WG + G
Sbjct: 280 YVGGV-----SCPLICFKRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWG-EHG 333
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG+++
Sbjct: 334 YYKICRGHNICGVDA 348
>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 387
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 96/195 (49%), Gaps = 29/195 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 177 LEGANFLATGELVSLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKE 236
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQ 109
DYP+ + T C +D K+ +++F V N D L GPL +N +Q
Sbjct: 237 QDYPYAGIDRNT--CNFDKSKIAASIANFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQ 294
Query: 110 DYNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG ++G
Sbjct: 295 TYIGGV-----SCPFICSKRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGENG 349
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 350 YYKICRGRNICGVDS 364
>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
Length = 327
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 96/189 (50%), Gaps = 22/189 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYLKHAG-LEAE 51
+E I G LL LS+ QL++C+ + GC GG A +YL AG L+ E
Sbjct: 131 VEGANFIATGKLLNLSEQQLVDCDRVCDKTDKASCDDGCGGGLMTNAYRYLIEAGGLQEE 190
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
+ YP+ G +G C +D K+ V+V++F + + L H+GPL G+N +Q
Sbjct: 191 SSYPY---TGKSGECKFDPEKIAVKVANFTSIAVDENQIAANLVHHGPLAIGLNAIFMQT 247
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQV-------PVWIVRNSWGRWGPDDGYFTV 163
Y G + +C + LNH V++VGYG R P WI++NSWG + GY+ +
Sbjct: 248 YIGG-VSCPLICGKKWLNHGVLLVGYGARGYSILRFGYKPYWIIKNSWGNHWGEKGYYRL 306
Query: 164 ERGTNACGI 172
RG CG+
Sbjct: 307 CRGHGMCGM 315
>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 99/195 (50%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + + G L+ LS+ QL++C+ + GC GG A +Y LK GL+ E
Sbjct: 163 VEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLE 222
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G+C +D K+ V++F V D L +GPL G+N A +Q
Sbjct: 223 KDYPY---TGKDGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQT 279
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP + +H V++VGYG P+ WI++NSWG WG + G
Sbjct: 280 YVGGV-----SCPLICFKRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWG-EHG 333
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG+++
Sbjct: 334 YYKICRGHNICGVDA 348
>gi|321452489|gb|EFX63862.1| hypothetical protein DAPPUDRAFT_334897 [Daphnia pulex]
Length = 221
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 94/185 (50%), Gaps = 11/185 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA--GLEAEADYPFRNQ 59
LE + K+GTLL LS+ QL++C Y+ GC GG A YLK+ G ++ YP+
Sbjct: 41 LEFNWCKKNGTLLALSEQQLVDCEPYHNGCGGGWVTNAWNYLKYGSGGSAKQSLYPY--- 97
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRML--YHYGPLVAGMNGALLQDYNGKLIR 117
T C + + + ++ + F +T + L + +GP + Y I
Sbjct: 98 TATTTTCRFCSSMIGAQILTYGEFQPLNTTKMQLAVFVFGPFPVAITVTNSFFYYLGGIY 157
Query: 118 KNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+ C P+ +NHAVV+VGYG + + WIVRNSWG WG GY ++RG N C IE
Sbjct: 158 NDVACDNPAIGVNHAVVVVGYGTENGIDYWIVRNSWGTGWG-QWGYVFIQRGVNKCKIEQ 216
Query: 175 YGGIC 179
Y +C
Sbjct: 217 YPAVC 221
>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 99/204 (48%), Gaps = 28/204 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 168 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMKE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G T C D K+ VS+F V + + L GPL +N +Q
Sbjct: 228 EDYPYTGKDGKT--CKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y G + CP + LNH V++VGYG + P WI++NSWG ++G+
Sbjct: 286 YIGGV-----SCPYICTRRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGETWGENGF 340
Query: 161 FTVERGTNACGIESYGGICTRTLN 184
+ + +G N CG++S T ++
Sbjct: 341 YKICKGRNICGVDSLVSTVTAAVS 364
>gi|307206026|gb|EFN84119.1| Cathepsin O [Harpegnathos saltator]
Length = 353
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 90/175 (51%), Gaps = 5/175 (2%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQN 60
ES YA+K+GTL P S ++I+C + GCQGG + +L + E+ YP ++
Sbjct: 169 ESMYAMKNGTLYPFSVQEMIDCMPGDFGCQGGDICSLLSWLLTSKTKIIPESAYPLTRRD 228
Query: 61 GVTGRCAYDARKVKVRVSDFL---VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
A+ V ++DF + D +L +GP+ A +N Q+Y G +I+
Sbjct: 229 DQCKLLKLSAKTSGVGITDFTCDSFADAEDELLALLASHGPVAAAVNAISWQNYLGGVIQ 288
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ +LNHAV IVGY + +P +IV+NSWG D GY + G+N CGI
Sbjct: 289 YHCDGSFSSLNHAVQIVGYDLSAGIPHYIVKNSWGTAFGDKGYLYISIGSNLCGI 343
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 97/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
+E + G L+ LS+ QL++C+ + GC GG A YL + GLE E
Sbjct: 173 IEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEE 232
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
+ YP+ G G C +D K+ V++++F + + L GPL G+N +Q
Sbjct: 233 SSYPY---TGERGECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQT 289
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRH-------QVPVWIVRNSWG-RWGPDDGYFT 162
Y G + +C + LNH V++VGYG + P WI++NSWG +WG +DGY+
Sbjct: 290 YIGG-VSCPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWG-EDGYYK 347
Query: 163 VERGTNACGIES 174
+ RG CGI +
Sbjct: 348 LCRGHGMCGINT 359
>gi|237651947|gb|ACR08662.1| cathepsin F, partial [Drosophila silvestris]
Length = 186
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 99/178 (55%), Gaps = 15/178 (8%)
Query: 6 YAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQNGVTG 64
YAI+ G L S+ +L++C+ + C GG + A + +K GLE E++YP+ +
Sbjct: 3 YAIRTGELQEFSEQELLDCDSTDSACNGGLMDNAYKAIKDIGGLEYESEYPYAAKKM--- 59
Query: 65 RCAYDARKVKVRVSDFLVF-NGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK-NDV 121
+C ++ V++S F+ G++T + L GP+ G+N +Q Y G + +
Sbjct: 60 QCHFNRTLSHVQISGFVDLPKGNETAMQEWLLSNGPISIGLNANAMQFYRGGVSHPWAPL 119
Query: 122 CPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
C +NL+H V+IVGYG+ +P WIV+NSWG RWG + GY+ + RG N CG+
Sbjct: 120 CSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGQRWG-EQGYYRIYRGDNTCGV 176
>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
Length = 371
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 28/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS QL++C+ + GC GG N A +Y LK G+ E
Sbjct: 172 LEGAHFLATGNLVSLSTQQLLDCDTECDPEEYDACDDGCNGGLMNNAFEYILKAGGVAQE 231
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C ++ K+ V++F V + D L GPL G+N +Q
Sbjct: 232 EDYPYTGTD--RGLCRFNKTKIAASVANFSVVSLDEDQIAANLVKNGPLAVGINAVFMQT 289
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y + CP S L+H V++VGYG + P WI++NSWG + GY
Sbjct: 290 YKSGV-----SCPYICSSTLDHGVLLVGYGSAGYSPIRFKEKPYWIIKNSWGESWGEQGY 344
Query: 161 FTVERGTNACGIES 174
+ + RG N CG++S
Sbjct: 345 YKICRGHNICGVDS 358
>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 100/195 (51%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + + G L+ LS+ QL++C+ + GC GG + A +Y LK GL+ E
Sbjct: 163 VEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYTLKAGGLQLE 222
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G+C +D K+ V++F V D L +GPL G+N A +Q
Sbjct: 223 KDYPY---TGKDGKCHFDKSKICAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQT 279
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP + +H V++VGYG P+ WI++NSWG WG + G
Sbjct: 280 YVGGV-----SCPLICFKRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWG-EHG 333
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG+++
Sbjct: 334 YYKICRGHNICGVDA 348
>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
Length = 458
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 97/175 (55%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K G LL LS+ +L++C+ ++ C GG + A +K GLE E DY + N
Sbjct: 278 VEGQWFLKRGDLLSLSEQELVDCDKLDKACLGGLPSNAYSAIKTLGGLETEDDYGY---N 334
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y +
Sbjct: 335 GHLQTCNFSAEKAKVYINDSVELSQNEQKLAAWLAKNGPISIAINAFGMQFYRHGISHPL 394
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+C ++HAV++VGYG R +P W ++NSWG WG ++GY+ + RG+ ACG+
Sbjct: 395 RPLCSPWLIDHAVLLVGYGNRSDIPFWAIKNSWGTDWG-EEGYYYLHRGSGACGV 448
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 95/181 (52%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ YA HG + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 172 LEAAYAQAHGKGISLSEQQLVDCGRGFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPY-- 229
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C + V V+V D + D + + P+ K +
Sbjct: 230 -TGVDGSCKFVPENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGV 288
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIE 173
++ C S ++NHAV+ VGYG+ +P W+++NSW G WG D+GYF +E G N CG+
Sbjct: 289 YTSNSCGSTPMDVNHAVLAVGYGVEDGIPYWLIKNSWGGNWG-DNGYFKMEMGKNMCGVA 347
Query: 174 S 174
+
Sbjct: 348 T 348
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 97/180 (53%), Gaps = 12/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
+E Q+ +K+G LL LS+ Q+++C+ + GC GG A++Y++ + GLE E YP++
Sbjct: 288 IEGQHFLKNGELLSLSEQQMVDCSWLDFGCNGGQPMLAMEYVRFNGGLELETAYPYK--- 344
Query: 61 GVTGRCAYDARKVKVRVSDFLV--FNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLI 116
GV G C D + +++ F + F ++ + GP+ GM+ G Q Y I
Sbjct: 345 GVGGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPISVGMDASGEDFQHYKSG-I 403
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIES 174
+ C S L+HAV+ VGYG W+V+NSW WG + GYF + R N CGI +
Sbjct: 404 YNPESCSSIGLDHAVLAVGYGTSDDGDYWLVKNSWNTSWG-EKGYFKLPRNKGNKCGIAT 462
>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
Precursor
gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
Length = 368
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 99/204 (48%), Gaps = 28/204 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 168 LEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G T C D K+ VS+F V + + L GPL +N +Q
Sbjct: 228 EDYPYTGKDGKT--CKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y G + CP + LNH V++VGYG + P WI++NSWG ++G+
Sbjct: 286 YIGGV-----SCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGF 340
Query: 161 FTVERGTNACGIESYGGICTRTLN 184
+ + +G N CG++S T++
Sbjct: 341 YKICKGRNICGVDSMVSTVAATVS 364
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 97/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +YL + G+ E
Sbjct: 165 LEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQE 224
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DY + G G C +D KV VS+F V + + L GPL G+N A +Q
Sbjct: 225 KDYAY---TGRDGSCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQT 281
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDGYFT 162
Y + VC L+H V++VG+G P+ WIV+NSWG+ WG + GY+
Sbjct: 282 YMSG-VSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWG-EQGYYK 339
Query: 163 VERGTNACGIES 174
+ RG N CG++S
Sbjct: 340 ICRGRNVCGVDS 351
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 91/177 (51%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LE +A K G L+ LS+ L++C+ + GCQGG A +Y++ G++ E YP++ +N
Sbjct: 147 LEGAHAKKTGKLVSLSEQNLVDCDKKDHGCQGGLMTTAFKYIEENKGIDTEESYPYKAKN 206
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA--LLQDYNGKLI 116
G RC + + V + +D ++ + GP+ M+ + Q Y I
Sbjct: 207 G---RCEFKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAMDASHSSFQLYKSG-I 262
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+C S L+H V++VGYG W+V+NSWG+ WG +GYF + N CGI
Sbjct: 263 YDPKICSSRKLDHGVLVVGYGKEDGEEYWLVKNSWGKNWGM-EGYFKIASKKNLCGI 318
>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
Length = 343
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 98/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQ-GCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES YA G + LS+ QL++C YN GC GG ++A +Y+K+ GLE E YP+
Sbjct: 159 LESAYAQAFGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGLETEEVYPYTG 218
Query: 59 QNGVTGRCAYDARKVKVRV--SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
QNG+ C + + V V+V S + D + + P+ ++ D+ K
Sbjct: 219 QNGL---CKFTSENVAVQVLGSVNITLGAEDELKHAVAFARPVSVAFQ--VVDDFRLYKK 273
Query: 115 LIRKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACG 171
+ C S ++NHAV+ VGYG+ VP W+++NSW G WG D GYF +E G N CG
Sbjct: 274 GVYTGTTCGSTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGGEWG-DHGYFKMEMGKNMCG 332
Query: 172 IES 174
+ +
Sbjct: 333 VAT 335
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 97/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +YL + G+ E
Sbjct: 165 LEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQE 224
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DY + G G C +D KV VS+F V + + L GPL G+N A +Q
Sbjct: 225 KDYAY---TGRDGSCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQT 281
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDGYFT 162
Y + VC L+H V++VG+G P+ WIV+NSWG+ WG + GY+
Sbjct: 282 YMSG-VSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWG-EQGYYK 339
Query: 163 VERGTNACGIES 174
+ RG N CG++S
Sbjct: 340 ICRGRNVCGVDS 351
>gi|341876229|gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditis brenneri]
Length = 389
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 96/176 (54%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E+Q+AIK G L+ LS+ ++++C+ N GC GG A++++K GLE+E +YP+
Sbjct: 207 VEAQHAIKKGQLVSLSEQEMVDCDGRNNGCSGGYRPYAMRFVKENGLESEKEYPYSALK- 265
Query: 62 VTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRK 118
+C +V + DF ++ + + GP+ GMN A+ +G
Sbjct: 266 -HDQCFLKQNDTRVFIDDFRMLSTNEEDIANWVGTKGPVTFGMNVVKAMYSYRSGIFNPS 324
Query: 119 NDVCPSENLN-HAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
++ C +++ HA+ IVGYG WIV+NSWG WG GYF + RG N+CG+
Sbjct: 325 SEDCAEKSMGAHALTIVGYGGEGSSAFWIVKNSWGTSWG-SSGYFRLARGVNSCGL 379
>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 366
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 96/194 (49%), Gaps = 28/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 166 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMRE 225
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + C +D K+ +V++F V + D L GPL +N +Q
Sbjct: 226 EDYPYTGND--LQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFVQT 283
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDGY 160
Y G + CP S+ L+H V++VGYG P+ WI++NSWG ++GY
Sbjct: 284 YIGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGY 338
Query: 161 FTVERGTNACGIES 174
+ + RG N CG++S
Sbjct: 339 YKICRGRNVCGVDS 352
>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
Length = 335
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 94/180 (52%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ES++ + +L+ LS+ +L+ C+ ++GC GG +A +L ++ + A YP+ +
Sbjct: 151 IESKWYLATHSLISLSEQELVSCDDVDEGCNGGLMGQAFDWLLNNRNGAVYTGASYPYVS 210
Query: 59 QNGVTGRCAYDARKVKVRVSD--FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D + + DT L GP+ ++ + Y G ++
Sbjct: 211 GNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMSYTGGVL 270
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSWG WG + GY V +GTN C I+ Y
Sbjct: 271 TS---CDGKQLNHGVLLVGYNMTGEVPYWVIKNSWGENWG-EKGYVRVRKGTNECLIQEY 326
>gi|338717354|ref|XP_001492337.3| PREDICTED: pro-cathepsin H-like [Equus caballus]
Length = 323
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 97/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G LL L++ QL++C N N GCQGG ++A +Y+++ G+ E YP++
Sbjct: 138 LESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKG 197
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
Q+G C + K V D + N + Y P+ + +D+ K
Sbjct: 198 QDG---DCKFQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFE--VTEDFMMYRK 252
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + +P WIV+NSWG WG +GYF +ERG N CG
Sbjct: 253 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGM-NGYFLIERGKNMCG 311
Query: 172 IES 174
+ +
Sbjct: 312 LAA 314
>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
pulchellus]
Length = 475
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 103/180 (57%), Gaps = 12/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+E Q+ + LL LS+ +L++C+ + GC+GG +A++ ++ GLE E++YP++
Sbjct: 286 VEGQWFLSRSKLLSLSEQELVDCDHGDHGCKGGYMGQAMKAVIEMGGLETESEYPYK--- 342
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
GV G C ++ + K RV F+ ++T L +GP+ G+N +Q Y G +
Sbjct: 343 GVDGTCEFNKTESKARVQSFVGLPQNETELAYWLMKHGPVSIGINANAMQFYFGGISHPW 402
Query: 119 NDVCPSENLNHAVVIVGYGM------RHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+C +L+H V++VG+G+ R VP WIV+NSWG++ + GY+ V RG CG+
Sbjct: 403 KFLCSPTDLDHGVLLVGFGVDKRSFRRKPVPYWIVKNSWGKYWGEKGYYRVYRGDGTCGV 462
>gi|13124011|sp|Q9YWK4.1|CATV_NPVBS RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3882976|gb|AAC77812.1| cathepsin [Buzura suppressaria NPV]
Length = 331
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 57/173 (32%), Positives = 96/173 (55%), Gaps = 6/173 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFRNQN 60
+ESQYAIK+ + LS+ Q+I+C+ + GC GG + A + + G L E +YP+ N
Sbjct: 149 IESQYAIKNNVHIDLSEQQMIDCDYVDMGCDGGLLHTAFEQMIQMGELVQEHEYPYAGVN 208
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKND 120
+ VKV+ V + + +L GP+ ++ + + +Y+ +I
Sbjct: 209 KPCELRGDETGVVKVKGCYRYVVFREEKLKDLLRAVGPIPMAIDASGIVNYHHGIIH--- 265
Query: 121 VCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
C + LNHAV++VGYG+ + VP W +N+WG+ WG ++GYF V + +ACG+
Sbjct: 266 YCENYGLNHAVLLVGYGVENNVPFWTFKNTWGKDWG-EEGYFRVRQNVDACGM 317
>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 365
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 99/195 (50%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + + G L+ LS+ QL++C+ + GC GG A +Y LK GL+ E
Sbjct: 165 VEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTLKAGGLQLE 224
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G+C +D K+ V++F V D L +GPL G+N A +Q
Sbjct: 225 KDYPY---TGKDGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQT 281
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP + +H V++VGYG P+ WI++NSWG WG + G
Sbjct: 282 YVGGV-----SCPLICFKRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWG-EHG 335
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG+++
Sbjct: 336 YYKICRGHNICGVDA 350
>gi|334347644|ref|XP_001379528.2| PREDICTED: cathepsin W-like [Monodelphis domestica]
Length = 619
Score = 99.8 bits (247), Expect = 4e-19, Method: Composition-based stats.
Identities = 59/178 (33%), Positives = 97/178 (54%), Gaps = 7/178 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+E+ +AI + LS ++++C+ + C+GG ++ + L+ GL E DYP+++Q
Sbjct: 384 VEALWAIHYEQHFELSVQEVLDCDRCGKACKGGFVWDAFLTILRQRGLARERDYPYQDQL 443
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + + DFL+ + L GP+ +N ALL+ Y +IR
Sbjct: 444 SRKG-CQKKQNRTG-WIQDFLMLPKEENAMAEHLALKGPITVTINQALLKTYRKGVIRPK 501
Query: 120 DVCPSENLNHAVVIVGYGMRHQV-PVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
D C ++H+V++VG+G + WI++NSWG WG ++GYF + RGTNACGI Y
Sbjct: 502 DDCDPNQVDHSVLLVGFGQNTKDGAYWILKNSWGSDWG-EEGYFRLRRGTNACGITKY 558
>gi|313224805|emb|CBY20597.1| unnamed protein product [Oikopleura dioica]
Length = 343
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 99/185 (53%), Gaps = 18/185 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFRN 58
LES + I H LS+ QL++C + N GC GG + A +Y+ + G LE E DY +
Sbjct: 158 LESAHLIHHKKAYNLSEQQLVDCAQDFDNHGCNGGLPSHAFEYIHYVGGLEEEQDYSYHA 217
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFN----GSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ G+ C +D K V + VFN D L ++ P+ + +
Sbjct: 218 EEGL---CEFDPTKTAGTVRE--VFNITETDEDQLTIALAYFNPVSVAFEVVDGFRFYKE 272
Query: 115 LIRKNDVCPS--ENLNHAVVIVGYGM--RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA 169
+ ++D C S E++NHAV+ VGYGM + + P +IV+NSWG WG D+G+F ++RG N
Sbjct: 273 GVYQSDTCKSGPEDVNHAVLAVGYGMCKKCETPYFIVKNSWGAEWG-DEGFFKIKRGENM 331
Query: 170 CGIES 174
CGI +
Sbjct: 332 CGIAT 336
>gi|5679322|gb|AAD46920.1|AF167986_1 putative cysteine proteinase GmPM33 [Glycine max]
Length = 363
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 96/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
+E + G L+ LS QL++C+ + GC GG A YL + GLE E
Sbjct: 156 IEGANFLATGKLVSLSDQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEE 215
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
+ YP+ G G C +D K+ V++++F + + L GPL G+N +Q
Sbjct: 216 SSYPY---TGERGECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQT 272
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRH-------QVPVWIVRNSWG-RWGPDDGYFT 162
Y G + +C + LNH V++VGYG + P WI++NSWG +WG +DGY+
Sbjct: 273 YIGG-VSCPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWG-EDGYYK 330
Query: 163 VERGTNACGIES 174
+ RG CGI +
Sbjct: 331 LCRGHGMCGINT 342
>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
Length = 353
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/178 (33%), Positives = 97/178 (54%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG----GFNKAIQYLKHAGLEAEADYPFR 57
+E Q+ + G L LS+ +L++C+ ++GC+GG ++ + L GLE E DYP+
Sbjct: 172 IEGQWYLNKGKLYSLSEQELVDCDKIDEGCKGGLPLNAYHSIMNRL--GGLETEKDYPYV 229
Query: 58 NQNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+NG +C + + V ++ + + ++T L +GP+ G+N + Y G +
Sbjct: 230 AKNG---KCKLNKSEEVVYINSSVKVSTNETDLAAWLVAHGPVAIGINSVNMLHYKGGIA 286
Query: 117 R-KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
N C + L+H V+IVGYG P WI++NSWG WG + GY+ V RG ACG+
Sbjct: 287 HPTNKDCNPKLLDHGVLIVGYGEEKSTPYWIIKNSWGTDWG-EKGYYRVVRGIGACGL 343
>gi|126331447|ref|XP_001375261.1| PREDICTED: cathepsin O-like [Monodelphis domestica]
Length = 414
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 92/178 (51%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES YAIK +L LS Q+I+C+ N GC GG A+ +L L +++Y F+ Q
Sbjct: 234 IESAYAIKGESLEDLSVQQVIDCSYNNFGCSGGSTVNALNWLNKTQVRLVKDSEYSFKAQ 293
Query: 60 NGVTGRCAY-DARKVKVRVSDFLVFNGSDTFRRM---LYHYGPLVAGMNGALLQDYNGKL 115
TG C Y V + D+ ++ S M L +GPL ++ QDY G +
Sbjct: 294 ---TGLCHYFSGSHAGVSIKDYSSYDFSGKENEMANVLLAFGPLAVIVDAVSWQDYLGGI 350
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 351 IQHH--CSSGEANHAVLITGFDRTGNTPYWIVRNSWGTSWG-VDGYAFVKMGANVCGI 405
>gi|42564157|gb|AAS20590.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 322
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 94/178 (52%), Gaps = 11/178 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGF-NKAIQYLKHAGLEAEADYPFRN 58
LE Q AI + PLS+ QL++C+ N C GG +A Y+ G+EAE+ YP+
Sbjct: 143 LEGQNAIHNKVKTPLSEQQLLDCSASYGNGDCDDGGLMTEAFDYIIDNGIEAESSYPYVE 202
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
Q C YDA+K V++ + + D ++ + GP+ GM+ L Y G ++
Sbjct: 203 Q---MTECQYDAKKTIVQIKGYKKLLADEDELKKAVGTVGPISVGMSSENLHMYGGGVL- 258
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
+D C ++HAV++VGYG + W V+NSWG +DGYF +ER N C I S
Sbjct: 259 -DDQCYF-GMDHAVLVVGYGEANGKKFWKVKNSWGTTWGEDGYFRIERDANNLCDIAS 314
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 99/183 (54%), Gaps = 13/183 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYN-QGCQGGGFNKAIQYLKHAGLEAEADYPFRNQN 60
+E+ + +K G L+ LS+ L++C GC GG +KA++Y++ G+ +E DYP+
Sbjct: 143 VEAAHFLKTGNLVSLSEQNLVDCAKDTCYGCGGGWMDKALEYIEKGGIMSEKDYPYE--- 199
Query: 61 GVTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNG-ALLQDYNGKLIR 117
GV C +D KV ++S+F N + + + GP+ ++ A Q Y ++
Sbjct: 200 GVDDNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPISVAIDASATFQLYVSGILD 259
Query: 118 KNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGIE 173
+ C +E +LNH V++VGYG + WI++NSWG WG DGY + R N CGI
Sbjct: 260 DTE-CSNEFDSLNHGVLVVGYGTENGKDYWIIKNSWGVNWGM-DGYIRMSRNKNNQCGIT 317
Query: 174 SYG 176
+ G
Sbjct: 318 TDG 320
>gi|62751833|ref|NP_001015747.1| cathepsin L1 precursor [Xenopus (Silurana) tropicalis]
gi|58477061|gb|AAH89683.1| MGC107932 protein [Xenopus (Silurana) tropicalis]
Length = 333
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 96/181 (53%), Gaps = 15/181 (8%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQN 60
++ES+Y I+ LL LS+ QL++C+ N+GC GG KA++Y+ G+ +Y + +
Sbjct: 148 VMESRYCIRTKELLNLSEQQLVDCDEINEGCCGGFPIKALEYVAQHGVMRNKEYEYSQKK 207
Query: 61 GVTGRCAYDARK-VKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRK 118
C YD+ K + + VS F + G + + GP+ G+ + Q Y+ +
Sbjct: 208 AT---CEYDSDKAIHMNVSKFYILPGEENMATSVAIEGPITVGIGVSSDFQLYSEGIFEG 264
Query: 119 NDVCPSENLNHAVVIVGYGMRH-------QVPVWIVRNSWGRWGPDDGYFTVERGTNACG 171
+ C +E+ NHAV+IVGYG H WI++NSWG+ +DGY ++R N C
Sbjct: 265 D--C-AESPNHAVIIVGYGTEHANDKEEEDKDYWIIKNSWGKEWGEDGYVKMKRNINQCS 321
Query: 172 I 172
I
Sbjct: 322 I 322
>gi|417399160|gb|JAA46608.1| Putative pro-cathepsin h [Desmodus rotundus]
Length = 336
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AIK G +L LS+ QL++C N N GCQGG ++A +Y+++ G+ E YP+
Sbjct: 151 LESAIAIKTGKMLSLSEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMEEDSYPYE- 209
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C + K V D + N + Y P+ K I
Sbjct: 210 --GKDSNCRFQPEKAIAFVKDVANITLNDEAAMVEAVALYNPVSFAFEVTSDFMLYRKGI 267
Query: 117 RKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+ C + +NHAV+ VGYG ++ P WIV+NSWG + +GYF +ERGTN CG+ +
Sbjct: 268 YSSTSCHKTPDKVNHAVLAVGYGEQNGKPYWIVKNSWGPYWGMNGYFLIERGTNMCGLAA 327
>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 60/183 (32%), Positives = 99/183 (54%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+E + G L+ LS+ +L++C+ + GC GG ++A + ++ GLE E YP+ +
Sbjct: 175 IEGAWFKATGDLVSLSEQELVDCDQKDSGCNGGLMDQAFEEVIRIGGLETEQQYPY---D 231
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
GV C ++ KV++ DF+ + + L +GPL +N +Q Y G +
Sbjct: 232 GVQETCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGGISHPL 291
Query: 119 NDVCPSENLNHAVVIVGYGM--------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA 169
+ +C + L+H V++VGYG+ RH P W ++NSWG RWG +DGY+ V RG
Sbjct: 292 SFLCSQDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWGPRWG-EDGYYRVARGKGV 350
Query: 170 CGI 172
CG+
Sbjct: 351 CGV 353
>gi|332024588|gb|EGI64786.1| Cathepsin O [Acromyrmex echinatior]
Length = 356
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 87/175 (49%), Gaps = 5/175 (2%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQN 60
ES YAI++GTL S ++I+C N GCQGG + +L + +E DYP Q
Sbjct: 172 ESMYAIENGTLHSFSVQEMIDCMPGNFGCQGGDICSLLSWLLASKTRIISEIDYPLTLQT 231
Query: 61 GVTGRCAYDARKVKVRVSDFL---VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
A+ VR++DF + +L +GP+ +N Q+Y G +I+
Sbjct: 232 DTCRLHKISAKTSGVRITDFTCDSFVDAETELLTLLVTHGPVAVAVNAISWQNYLGGIIQ 291
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
N +LNHAV IVGY ++P +I++NSWG + GY + G N CGI
Sbjct: 292 YNCDSSFNSLNHAVQIVGYDTEARIPHYIIKNSWGPSFGNKGYIYIAVGKNLCGI 346
>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
Length = 371
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 101/198 (51%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
LE + + G L LS+ Q ++C+ + GC GG A YL+ AG LE+E
Sbjct: 170 LEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESE 229
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G+C +D K+ V +F V + + L +GPL G+N A +Q
Sbjct: 230 KDYPY---TGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQT 286
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 287 YIGGV-----SCPYICGRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWG-ENG 340
Query: 160 YFTVERGTNA---CGIES 174
Y+ + RG+N CG++S
Sbjct: 341 YYKICRGSNVRNKCGVDS 358
>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
Length = 610
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/182 (33%), Positives = 97/182 (53%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E A+K G L S+ +L++C+ + C GG + A + ++ GLE E++YP++ +
Sbjct: 423 IEGLNAVKTGQLKEFSEQELLDCDTKDSACNGGLPDNAYKAIQEIGGLEYESEYPYKARK 482
Query: 61 GVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+C ++ V+V+ F L N + L GP+ G+N +Q Y G +
Sbjct: 483 E---QCHFNKTLAHVQVTGFVDLPKNNETAMQEWLIANGPISIGINANAMQFYRGGVSHP 539
Query: 119 NDV-CPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+ C NL+H V+IVGYG+ +P WIV+NSWG RWG + GY+ V RG N C
Sbjct: 540 WKILCEKSNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG-EQGYYRVYRGDNTC 598
Query: 171 GI 172
G+
Sbjct: 599 GV 600
>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
Length = 319
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 100/195 (51%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE Y + G L+ LS+ QL++C+ + GC GG N A +Y L+ G++ E
Sbjct: 121 LEGAYYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKE 180
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G C +D KV VS++ +V + L GPL +N +Q
Sbjct: 181 KDYPYTGRDGT---CKFDKTKVAATVSNYSVVCLDEEQIAANLVKNGPLAVAINAVFMQT 237
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 238 YVGGV-----SCPYICGKHLDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWG-ENG 291
Query: 160 YFTVERGTNACGIES 174
Y + RG N CG++S
Sbjct: 292 YDEICRGRNVCGVDS 306
>gi|301775254|ref|XP_002923050.1| PREDICTED: cathepsin H-like [Ailuropoda melanoleuca]
Length = 307
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 96/181 (53%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AIK G LL L++ QL++C + N GCQGG ++A +Y+++ G+ E YP++
Sbjct: 122 LESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKG 181
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAG--MNGALLQDYNGK 114
Q+G C + K V D + N + + P+ + G + G
Sbjct: 182 QDG---DCKFQPSKAIAFVKDVANITINDEQAMVEAVALFNPVSFAFEVTGDFMMYRKGV 238
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ + +NHAV+ VGYG ++ VP WIV+NSWG +WG GYF +ERG N CG+
Sbjct: 239 YSSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSWGPQWGM-HGYFLIERGKNMCGLA 297
Query: 174 S 174
+
Sbjct: 298 A 298
>gi|281350252|gb|EFB25836.1| hypothetical protein PANDA_012122 [Ailuropoda melanoleuca]
Length = 294
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 96/181 (53%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AIK G LL L++ QL++C + N GCQGG ++A +Y+++ G+ E YP++
Sbjct: 109 LESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKG 168
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAG--MNGALLQDYNGK 114
Q+G C + K V D + N + + P+ + G + G
Sbjct: 169 QDG---DCKFQPSKAIAFVKDVANITINDEQAMVEAVALFNPVSFAFEVTGDFMMYRKGV 225
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ + +NHAV+ VGYG ++ VP WIV+NSWG +WG GYF +ERG N CG+
Sbjct: 226 YSSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSWGPQWG-MHGYFLIERGKNMCGLA 284
Query: 174 S 174
+
Sbjct: 285 A 285
>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
Length = 371
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 101/198 (51%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
LE + + G L LS+ Q ++C+ + GC GG A YL+ AG LE+E
Sbjct: 170 LEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESE 229
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G+C +D K+ V +F V + + L +GPL G+N A +Q
Sbjct: 230 KDYPY---TGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQT 286
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 287 YIGGV-----SCPYICGRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWG-ENG 340
Query: 160 YFTVERGTNA---CGIES 174
Y+ + RG+N CG++S
Sbjct: 341 YYKICRGSNVRNKCGVDS 358
>gi|348551380|ref|XP_003461508.1| PREDICTED: pro-cathepsin H-like [Cavia porcellus]
Length = 335
Score = 99.8 bits (247), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 96/182 (52%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GC+GG ++A +Y L + G+ E YP++
Sbjct: 150 LESAVAIASGKMLSLAEQQLVDCAQDFNNHGCEGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL- 115
G G C + +K V D + N + + Y P+ + +D+
Sbjct: 209 --GKDGHCRFQPQKAIAFVKDVVNITLNDEEAMVEAVALYNPVSFAFE--VTEDFISYQS 264
Query: 116 -IRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
I + C + +NHAV+ VGYG+++ VP WIV+NSWG DGYF +ERG N CG+
Sbjct: 265 GIYSSTSCHKTPDKVNHAVLAVGYGVQNGVPYWIVKNSWGTAWGQDGYFLIERGKNMCGL 324
Query: 173 ES 174
+
Sbjct: 325 AA 326
>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
Length = 363
Score = 99.8 bits (247), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 100/203 (49%), Gaps = 31/203 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + + G L+ LS+ QL++C+ + GC GG A +Y LK GL+ E
Sbjct: 163 VEGAHFLATGELVSLSEQQLVDCDHECDAEEKSECDAGCNGGLMTTAFEYTLKAGGLQRE 222
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G+C +D K+ V++F V D L +GPL G+N A +Q
Sbjct: 223 KDYPY---TGRDGKCHFDKSKIAASVANFSVIGLDEDQIAANLVKHGPLAVGINAAWMQT 279
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y + CP + +H V++VGYG + P WI++NSWG WG + G
Sbjct: 280 Y-----MRGVSCPLICFKRQDHGVLLVGYGSAGFAPIRLKEKPYWIIKNSWGENWG-EHG 333
Query: 160 YFTVERGTNACGIESYGGICTRT 182
Y+ + RG N CG+++ T T
Sbjct: 334 YYKICRGHNICGVDAMVSTVTAT 356
>gi|1619905|gb|AAB16997.1| thiol protease isoform A, partial [Glycine max]
Length = 318
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 96/195 (49%), Gaps = 32/195 (16%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAGLEAEA 52
LE + + G L+ LS+ QL++C+ + GC GG N A + L+ G++ E
Sbjct: 121 LEVSFYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEILQSGGVQKEK 180
Query: 53 DYPFRNQNGVTGRCAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
D P+ G G C +D K KV +D + V + L GPL +N +Q
Sbjct: 181 DIPY---TGRDGTCKFD--KTKVAATDLIKRVSLDEEQIAANLVKNGPLAVAINAVFMQT 235
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V++VGYG P+ WI++NSWG WG +DG
Sbjct: 236 YVGGV-----SCPYICGKHLDHGVLLVGYGEGRYAPIRFKNKPYWIIKNSWGESWGENDG 290
Query: 160 YFTVERGTNACGIES 174
Y + RG N CG+++
Sbjct: 291 YDEICRGRNVCGVDA 305
>gi|42564153|gb|AAS20589.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 322
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 94/178 (52%), Gaps = 11/178 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGF-NKAIQYLKHAGLEAEADYPFRN 58
LE Q AI + PLS+ QL++C+ N C GG +A Y+ G+EAE+ YP+
Sbjct: 143 LEGQNAIHNKVKTPLSEQQLLDCSASYGNGDCDDGGLMTEAFDYIIDNGIEAESSYPYVE 202
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
Q C YDA+K V++ + + D ++ + GP+ GM+ L Y G ++
Sbjct: 203 Q---MTECQYDAKKTIVQIKGYKKLLADEDELKKAVGTVGPISVGMSSENLHMYGGGVL- 258
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
+D C ++HAV++VGYG + W V+NSWG +DGYF +ER N C I S
Sbjct: 259 -DDQCYF-GMDHAVLVVGYGEANGKKFWKVKNSWGTTWGEDGYFRIERDADNLCDIAS 314
>gi|324514421|gb|ADY45863.1| Viral cathepsin [Ascaris suum]
Length = 399
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 97/176 (55%), Gaps = 8/176 (4%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQN 60
++ES AI L+ LS+ +LI+C+ + GC GG A +Y++ G+ +E DYP++ +
Sbjct: 219 VVESMNAIAKNPLISLSEQELIDCDTDDNGCSGGYRPYAFRYVRRHGIVSEKDYPYKGKE 278
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRK 118
+CA + +V ++ ++ N D +++ GP+ G+N +G K
Sbjct: 279 --QSQCAANGTRVYIKSVKYIGRN-EDAMADFVFYRGPISVGINVTKEFFHYRSGVFTPK 335
Query: 119 NDVCPSENL-NHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C ++ +HAV +VGYG ++ W+++NSWG +WG DGY +RG N CGI
Sbjct: 336 KEDCEEDSQGSHAVAVVGYGSQNGEDYWLIKNSWGKKWGM-DGYVLYKRGENCCGI 390
>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 95/180 (52%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L LS+ L+ C+ + GCQGG ++A++++ + + E YP+ +
Sbjct: 159 IEGQWKVAGHELTSLSEQMLVSCDNMDDGCQGGLMDRALKWIVSSNKGNVFTEESYPYDS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+G C + V ++S + + + L GP+ ++ + DY G ++
Sbjct: 219 TDGDVPPCNMSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDASSFLDYKGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYG 176
C S+ LNH V++VGY + P WI++NSWG +WG ++GY VE+GTN C ++ Y
Sbjct: 279 S---CSSDALNHDVLLVGYDDTSKPPYWIIKNSWGKKWG-EEGYIRVEKGTNQCLMKEYA 334
>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 368
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 168 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDFGCNGGLMNSAFEYTLKAGGLMRE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + C +D K+ +V++F V + D L GPL +N +Q
Sbjct: 228 EDYPYTGND--LQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 286 YIGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWG-ENG 339
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 340 YYKICRGRNVCGVDS 354
>gi|52546916|gb|AAU81591.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 190
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 95/186 (51%), Gaps = 29/186 (15%)
Query: 10 HGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQ 59
H L+ LS+ QL++C+ + GC GG N A +Y LK GL E DYP+
Sbjct: 1 HEELVSLSEQQLVDCDHECDPEEKDSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 60
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ +C +D KV +V++F V + + L GPL +N +Q Y G +
Sbjct: 61 D--RAKCKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGV--- 115
Query: 119 NDVCP---SENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
CP S+ +H V++VGYG + P WI++NSWG +WG + GY+ + RG N
Sbjct: 116 --SCPYICSKRQDHGVLLVGYGSGFAPIRMKEKPYWIIKNSWGEKWG-ESGYYKICRGRN 172
Query: 169 ACGIES 174
CG++S
Sbjct: 173 VCGVDS 178
>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
Length = 352
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 95/185 (51%), Gaps = 20/185 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ YA HG + LS+ QL++C N GC GG ++A +Y+K+ G+ E +YP+
Sbjct: 168 LEAAYAQAHGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGIALEKEYPYTA 227
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
++ C + A V VRV D + D + + P+ Q +G +
Sbjct: 228 KDEA---CKFTAENVAVRVLDSVNITLGAEDELKHAVAFARPVSVA-----FQVVDGFRL 279
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNA 169
K V S+ ++NHAV+ VGYG+ + VP WI++NSWG D GYF +E G N
Sbjct: 280 YKEGVYTSDTCGNTPMDVNHAVLAVGYGVENNVPYWIIKNSWGSTWGDHGYFKMELGKNM 339
Query: 170 CGIES 174
CG+ +
Sbjct: 340 CGVAT 344
>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 93/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGEYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI +Y
Sbjct: 261 RCRCSNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGTY 319
>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 93/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGDYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI+ Y
Sbjct: 261 KCRCSNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIDYY 319
>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 94/180 (52%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+ + +L+ LS+ +L+ C+ ++GC GG +A +L ++ + YP+ +
Sbjct: 159 IESQWYLATHSLISLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGVSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSD--FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D + + DT L GP+ ++ + Y G ++
Sbjct: 219 GNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMSYTGGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSWG+ WG + GY V +GTN C I+ Y
Sbjct: 279 TS---CDGKQLNHGVLLVGYNMTGEVPYWLIKNSWGKNWG-EKGYVRVRKGTNECLIQEY 334
>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
Length = 471
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 101/181 (55%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E +A++ G L S+ +L++C+ + C GG + A + + K GLE E+DYP+ +
Sbjct: 283 IEGLHAVRTGVLEQYSEQELLDCDTSDSACNGGLPDNAYEAIEKIGGLELESDYPYHARK 342
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C +++ K+ V+V + ++T + L GP+ G+N +Q Y G +
Sbjct: 343 D---QCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPISIGINANAMQFYRGGVSHPP 399
Query: 120 DV-CPSENLNHAVVIVGYGM------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ C +NL+H V+IVGY + + +P WIV+NSWG +WG + GY+ V RG N CG
Sbjct: 400 HILCSRKNLDHGVLIVGYRVSDYPMFKKTLPYWIVKNSWGKKWG-EQGYYRVYRGDNTCG 458
Query: 172 I 172
+
Sbjct: 459 V 459
>gi|17569349|ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans]
gi|351061560|emb|CCD69414.1| Protein R09F10.1 [Caenorhabditis elegans]
Length = 383
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 96/176 (54%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E+Q AIK G L+ LS+ ++++C+ N GC GG A++++K GLE+E +YP+
Sbjct: 201 VEAQNAIKKGKLVSLSEQEMVDCDGRNNGCSGGYRPYAMKFVKENGLESEKEYPYSALK- 259
Query: 62 VTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRK 118
+C +V + DF ++ N + + GP+ GMN A+ +G
Sbjct: 260 -HDQCFLKENDTRVFIDDFRMLSNNEEDIANWVGTKGPVTFGMNVVKAMYSYRSGIFNPS 318
Query: 119 NDVCPSENLN-HAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C +++ HA+ I+GYG + WIV+NSWG WG GYF + RG N+CG+
Sbjct: 319 VEDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSWGA-SGYFRLARGVNSCGL 373
>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 60/183 (32%), Positives = 99/183 (54%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+E + G L+ LS+ +L++C+ + GC GG ++A + ++ GLE E YP+ +
Sbjct: 175 IEGAWFKATGDLISLSEQELVDCDQKDSGCNGGLMDQAFEEVIRIGGLETEQQYPY---D 231
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
GV C ++ KV++ DF+ + + L +GPL +N +Q Y G +
Sbjct: 232 GVQETCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGGVSHPL 291
Query: 119 NDVCPSENLNHAVVIVGYGM--------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA 169
+ +C + L+H V++VGYG+ RH P W ++NSWG RWG +DGY+ V RG
Sbjct: 292 SFLCSPDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWGPRWG-EDGYYRVARGKGV 350
Query: 170 CGI 172
CG+
Sbjct: 351 CGV 353
>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 94/174 (54%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ +K+GTL+ LS+ +L++C+ +Q C GG + A + + K GLE E DY +
Sbjct: 295 IEGQWFLKNGTLVSLSEQELVDCDGLDQACNGGLPSNAYEAIEKLGGLETETDYSYI--- 351
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + +KV ++ + + + L GP+ +N +Q Y +
Sbjct: 352 GKKQSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAFAMQFYRKGVSHPL 411
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ C ++HAV++VGYG R +P W ++NSWG + GY+ + RG+NACGI
Sbjct: 412 KIFCNPWMIDHAVLMVGYGERKGIPFWAIKNSWGEDYGEQGYYYLHRGSNACGI 465
>gi|296085959|emb|CBI31400.3| unnamed protein product [Vitis vinifera]
Length = 257
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 100/193 (51%), Gaps = 26/193 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
+E + I LL LS+ QL++C+ + GC+GG A +YL AG LE E
Sbjct: 50 VEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEEE 109
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQ 109
+ YP+ ++G C + +V VRV +F + N + ++ H GPL G+N +Q
Sbjct: 110 SSYPYTGKHG---ECKFKPDRVAVRVVNFTEVPINENQIAANLVCH-GPLAVGLNAIFMQ 165
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQV-------PVWIVRNSWG-RWGPDDGYF 161
Y G + +CP +NH V++VGYG + P WI++NSWG RWG + GY+
Sbjct: 166 TYIGG-VSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGKRWG-EHGYY 223
Query: 162 TVERGTNACGIES 174
+ RG CG+ +
Sbjct: 224 RLCRGHGMCGMNT 236
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 99/184 (53%), Gaps = 16/184 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG A QY+ + G+++EA YP++
Sbjct: 148 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG +C YD++K S + L F D + + + GP+ ++ + Y+ L
Sbjct: 208 AMNG---KCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAS---HYSFFL 261
Query: 116 IRKN---DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACG 171
R + ++N+NH V++VGYG + W+V+NSWG D GY + R + N CG
Sbjct: 262 YRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCG 321
Query: 172 IESY 175
I SY
Sbjct: 322 IASY 325
>gi|351707349|gb|EHB10268.1| Cathepsin O, partial [Heterocephalus glaber]
Length = 266
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 92/174 (52%), Gaps = 7/174 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES +AI+ G L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 96 VESAWAIRGGPLEDLSAQQVIDCSYNNYGCNGGSPLSALSWLNKTRVKLVRDSEYPFKAQ 155
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+G + + ++ F+G + R L +GPLV ++ QDY G +I+
Sbjct: 156 DGPCHYFSQSQPGLSIQGYSAYDFSGQEAEMARALLAHGPLVVIVDAVSWQDYLGGVIQH 215
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ C S NHAV+I G+ P WIVRNSWG WG GY V+ G+N CG
Sbjct: 216 H--CSSGRANHAVLITGFDRTDSTPYWIVRNSWGSSWG-VGGYVYVKMGSNTCG 266
>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
Length = 368
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 96/194 (49%), Gaps = 28/194 (14%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L LS+ QL++C+ + GC GG N A +Y LK G+E E
Sbjct: 168 LEGAHFLATGNLESLSEQQLVDCDRECDPEEYDACDDGCNGGLMNNAFEYILKTGGVERE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++ C ++ K+ VS+F V + D L GPL G+N +Q
Sbjct: 228 KDYPYTGRD--RSPCKFNESKIVASVSNFSVVSIDEDQIAANLVKNGPLAVGINAVFMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y + CP S L+H V++VGYG + P WI++NSW ++ + GY
Sbjct: 286 YTAGV-----SCPFLCSGELDHGVLLVGYGSAGYSPIRFKEKPYWILKNSWSKYWGEHGY 340
Query: 161 FTVERGTNACGIES 174
+ + RG N CG++S
Sbjct: 341 YRICRGQNMCGVDS 354
>gi|444519298|gb|ELV12725.1| Cathepsin O [Tupaia chinensis]
Length = 428
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 98/189 (51%), Gaps = 20/189 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES A+ L LS Q+++C ++GC GG A+ +L L E++YPF +
Sbjct: 66 VESACAMAGAPLRELSVQQVLDCAYDDRGCGGGSTLSALNWLNKTQVKLVGESEYPFTAR 125
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+G+ + A V + +L ++ S D + L GPLVA ++ QDY G +I
Sbjct: 126 DGICR--FFPASCPGVSIRGYLAYDFSAQEDEMAKALVALGPLVAVVDAVSWQDYLGGVI 183
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
+ + C S NHAV++ G+ Q P W+VRNSWGR WG DGY V+ G+N
Sbjct: 184 QHH--CSSGEANHAVLVTGFDKAGQTPYWVVRNSWGRSWGL-DGYARVKMGSN------- 233
Query: 176 GGICTRTLN 184
IC RTL+
Sbjct: 234 --ICVRTLD 240
>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
gi|1094710|prf||2106314A cathepsin L
Length = 319
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 97/178 (54%), Gaps = 9/178 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+ K G LL LS+ QL++C+ + GC GG + A + +K GL E +YP+ +N
Sbjct: 138 VESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKMGGLMLEDNYPYDAKN 197
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C V V ++ + +T LYH + GMN LLQ Y +
Sbjct: 198 E---KCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPW 254
Query: 120 DV-CPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C L+HAV++VGYG+ + P WIV+NSWG WG ++GYF + RG +CGI +
Sbjct: 255 WIFCSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWG-ENGYFRMYRGDGSCGINT 311
>gi|355681666|gb|AER96819.1| cathepsin W [Mustela putorius furo]
Length = 373
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 102/196 (52%), Gaps = 24/196 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+E+ ++I++ + +S +L++CN GC+GG ++ + L ++GL +E DYPFR
Sbjct: 163 IEALWSIRYNQSVQVSVQELLDCNRCGDGCKGGFVWDAFVTVLNNSGLASEKDYPFRGSL 222
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
A + +KV + DF++ N T L +GP+ +N LLQ Y +I+
Sbjct: 223 KRHKCLASNYKKV-AWIQDFIMLQNNEQTMANYLATHGPITVTINMKLLQQYKKGVIKAT 281
Query: 120 D-VCPSENLNHAVVIVGYGMRHQ------------------VPVWIVRNSWG-RWGPDDG 159
C +NH+V++VG+G + +P WI++NSWG WG ++G
Sbjct: 282 PATCDPYLVNHSVLLVGFGKTNSSERRRAKGGHFWPHPHRPIPYWILKNSWGAEWG-EEG 340
Query: 160 YFTVERGTNACGIESY 175
YF + RG+N CGI Y
Sbjct: 341 YFRLHRGSNTCGITKY 356
>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 94/174 (54%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ +K+GTL+ LS+ +L++C+ +Q C GG + A + + K GLE E DY +
Sbjct: 295 IEGQWFLKNGTLVSLSEQELVDCDGLDQACNGGLPSNAYEAIEKLGGLETETDYSYI--- 351
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + +KV ++ + + + L GP+ +N +Q Y +
Sbjct: 352 GKKQSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAFAMQFYRKGVSHPL 411
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ C ++HAV++VGYG R +P W ++NSWG + GY+ + RG+NACGI
Sbjct: 412 KIFCNPWMIDHAVLMVGYGERKGIPFWAIKNSWGEDYGEQGYYNLYRGSNACGI 465
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 99/184 (53%), Gaps = 16/184 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG A QY+ + G+++EA YP++
Sbjct: 156 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYK 215
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG +C YD++K S + L F D + + + GP+ ++ + Y+ L
Sbjct: 216 AMNG---KCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAS---HYSFFL 269
Query: 116 IRKN---DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACG 171
R + ++N+NH V++VGYG + W+V+NSWG D GY + R + N CG
Sbjct: 270 YRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCG 329
Query: 172 IESY 175
I SY
Sbjct: 330 IASY 333
>gi|431920312|gb|ELK18347.1| Cathepsin H [Pteropus alecto]
Length = 232
Score = 99.0 bits (245), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 99/183 (54%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AIK G +L L++ QL++C N N GC+GG ++A +Y+++ G+ E YP++
Sbjct: 47 LESAIAIKTGKMLSLAEQQLVDCAQNFNNHGCKGGLPSQAFEYIRYNKGIMGEDTYPYQG 106
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
++G C + K V D + N + + Y P+ + +D+ K
Sbjct: 107 KDGT---CKFQPEKAIAFVKDVANITINDEEAMVEAVALYNPVSFAFE--VTEDFMLYRK 161
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + P WIV+NSWG +WG +GYF +ERG N CG
Sbjct: 162 GIYSSTSCHKTPDKVNHAVLAVGYGEENGKPYWIVKNSWGPQWG-MNGYFLIERGKNMCG 220
Query: 172 IES 174
+ +
Sbjct: 221 LAA 223
>gi|1809288|gb|AAC47721.1| secreted cathepsin L 2 [Fasciola hepatica]
Length = 326
Score = 99.0 bits (245), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 89/183 (48%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q+ S+ QL++C ++ N GC GG A +YLKH GLE E+ YP++
Sbjct: 141 VEGQFRKNERASASFSEQQLVDCPRDLGNYGCGGGYMENAYEYLKHNGLETESYYPYQ-- 198
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
V G C YD R +V+ + + D + ++ GP ++ I
Sbjct: 199 -AVEGPCQYDGRLAYAKVTGYYTVHSGDEIELKNLVGTEGPAAVALDADSDFMMYQSGIY 257
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGY--FTVERGTNACGIESY 175
++ C + L HAV+ VGYG + WIV+NSWG W +DGY F RG N CGI S
Sbjct: 258 QSQTCLPDRLTHAVLAVGYGSQDGTDYWIVKNSWGTWWGEDGYIRFARNRG-NMCGIASL 316
Query: 176 GGI 178
+
Sbjct: 317 ASV 319
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 99.0 bits (245), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 95/192 (49%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +YL + G+ E
Sbjct: 165 LEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQE 224
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DY + G G C +D KV VS+F V D L GPL +N A +Q
Sbjct: 225 KDYAY---TGRDGSCKFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAINAAWMQT 281
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDGYFT 162
Y + VC L+H V++VG+G P+ WI++NSWG+ WG + GY+
Sbjct: 282 YMSG-VSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWG-EQGYYK 339
Query: 163 VERGTNACGIES 174
+ RG N CG++S
Sbjct: 340 ICRGRNVCGVDS 351
>gi|385298943|gb|AFI60244.1| cysteine protease/senescence-enhanced 1, partial [Panicum virgatum]
Length = 282
Score = 99.0 bits (245), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/186 (34%), Positives = 100/186 (53%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQ-GCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C YN GC GG ++A +Y+KH GL+ E YP++
Sbjct: 98 LEAAYTQATGKPVSLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKHNGGLDTEESYPYK- 156
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C + A V V+V D + + + + P+ + NG +
Sbjct: 157 --GVNGLCQFKASNVGVKVLDSVNITLGAENELKDAVGLVRPVSVA-----FEVINGFRL 209
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S+ ++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N
Sbjct: 210 YKSGVYTSDHCGTTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DEGYFKMEMGKN 268
Query: 169 ACGIES 174
CG+ +
Sbjct: 269 MCGVAT 274
>gi|1272388|gb|AAB17051.1| cysteine protease, partial [Spirometra mansonoides]
Length = 216
Score = 99.0 bits (245), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 93/181 (51%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E IK G L LS+ QL++C+ NQGC GG + A QY + G+EAE DY + +
Sbjct: 34 IEGAIQIKMGILPTLSEQQLVDCSWEYGNQGCNGGFMSLAFQYAQRYGVEAEVDYRYTAK 93
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG---ALLQDYNGK 114
+G C Y V V+ + D + +R + GP+ G++ + +G
Sbjct: 94 DGF---CRYQQDMVVANVTGYAELPQGDEASLQRAVAVIGPISVGIDANDPGFMSYSHGV 150
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIE 173
+ K C +++NH V+++GYG + P W+V+NSWGR + GY + R N CGI
Sbjct: 151 FVSK--TCSPDDINHGVLVIGYGTENDEPYWLVKNSWGRSWGEQGYVKMARNKNNMCGIA 208
Query: 174 S 174
S
Sbjct: 209 S 209
>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 99.0 bits (245), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 94/180 (52%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ES++ + +L+ LS+ +L+ C+ ++GC GG +A +L ++ + A YP+ +
Sbjct: 159 IESKWYLATHSLISLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGASYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSD--FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D + + DT L GP+ ++ + Y G ++
Sbjct: 219 GNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMSYTGGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSWG WG + GY V +GTN C I+ Y
Sbjct: 279 TS---CDGKQLNHGVLLVGYNMTGEVPYWLIKNSWGENWG-EKGYVRVRKGTNECLIQEY 334
>gi|2253415|gb|AAB62937.1| stress-induced cysteine proteinase [Lavatera thuringiaca]
Length = 175
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 85/160 (53%), Gaps = 21/160 (13%)
Query: 28 NQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-G 85
N GC GG A +Y LK GLE E +YP+ + G C +D K+ VS+F V +
Sbjct: 11 NAGCSGGLMTSAFEYTLKAGGLEREEEYPYTGID--RGGCKFDKTKIAASVSNFSVISVD 68
Query: 86 SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDVCPS---ENLNHAVVIVGYGMRHQV 142
D + +GPL G+N A +Q Y G + CP +L+H V++VGYG
Sbjct: 69 EDQIAANMVKHGPLAVGINAAFMQTYIGGV-----SCPYICFRSLDHGVLLVGYGAAGYA 123
Query: 143 PV-------WIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
PV WI++NSWG WG +DGY+ + RG N CG++S
Sbjct: 124 PVRFKEKPFWIIKNSWGANWG-EDGYYKICRGRNVCGVDS 162
>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
Length = 321
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 97/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
+E + I LL LS+ QL++C+ + GC+GG A +YL AG LE E
Sbjct: 125 VEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGGLEEE 184
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
+ YP+ G G C + +V VRV +F V + L +GPL G+N +Q
Sbjct: 185 SSYPY---TGKHGECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXFMQT 241
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQV-------PVWIVRNSWG-RWGPDDGYFT 162
Y G + +CP +NH V++VGYG + P WI++NSWG RWG + GY+
Sbjct: 242 YIGG-VSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGXRWG-EHGYYR 299
Query: 163 VERGTNACGIES 174
+ RG CG+ +
Sbjct: 300 LCRGHGMCGMNT 311
>gi|308476152|ref|XP_003100293.1| hypothetical protein CRE_21852 [Caenorhabditis remanei]
gi|308265817|gb|EFP09770.1| hypothetical protein CRE_21852 [Caenorhabditis remanei]
Length = 391
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 96/176 (54%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E+Q+AI+ L+ LS+ ++++C+ N GC GG A++++K GLE+E +YP+
Sbjct: 209 VEAQHAIRKNQLVSLSEQEMVDCDDKNNGCSGGYRPYAMRFVKENGLESEKEYPYSALK- 267
Query: 62 VTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRK 118
+C +V + DF ++ + + GP+ GM+ A+ +G
Sbjct: 268 -HDQCMLKQNDTRVFIDDFRMLSQNEEEIANWVGTKGPVTFGMSVTKAMYSYRSGIFNPS 326
Query: 119 NDVCPSENL-NHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
D C +++ +HA+ IVGYG + WIV+NSWG WG GYF + RG N+CG+
Sbjct: 327 ADDCAEKSMGSHALTIVGYGGEGEAAFWIVKNSWGTSWGA-SGYFRLARGVNSCGL 381
>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
Length = 366
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A +Y LK GL E
Sbjct: 166 LEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMRE 225
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
D+P+ + C +D K+ +V++F V + D L GPL +N +Q
Sbjct: 226 EDHPYTGND--LQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQT 283
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 284 YIGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWG-ENG 337
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 338 YYKICRGRNVCGVDS 352
>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
Length = 350
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/183 (34%), Positives = 97/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES YA G + LS+ QL++C N GC GG ++A +Y+K+ GLE E YP+
Sbjct: 166 LESAYAQAFGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTG 225
Query: 59 QNGVTGRCAYDARKVKVRV--SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
QNG C + + V V+V S + D + + P+ ++ D+ K
Sbjct: 226 QNG---PCKFTSEDVAVQVLGSVNITLGAEDELKHAVAFARPVSVAFE--VVDDFRLYKK 280
Query: 115 LIRKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACG 171
+ + C + ++NHAV+ VGYG+ VP W+++NSW G WG D GYF +E G N CG
Sbjct: 281 GVYTSTTCGNTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGGEWG-DHGYFKMEMGKNMCG 339
Query: 172 IES 174
+ +
Sbjct: 340 VAT 342
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 95/191 (49%), Gaps = 22/191 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
+E + + G LL LS+ QL++C+ + GC GG A +Y++ AG LE E
Sbjct: 172 VEGAHFLATGKLLSLSEQQLVDCDHQCDPEEAQACDAGCGGGLMTNAYKYVEEAGGLELE 231
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
+DYP++ ++G +C ++ KV +VS+F + D L GPL G+N +Q
Sbjct: 232 SDYPYKGRDG---KCQFNPNKVAAKVSNFTNIPIDEDQVAAYLIKSGPLAIGINAEFMQT 288
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDGYFTV 163
Y + C NL+H V++VGY P WI++NSWG D GY+ +
Sbjct: 289 YVAG-VSCPIFCNKRNLDHGVLLVGYAEHGFAPARLAYKPYWIIKNSWGPMWGDKGYYKI 347
Query: 164 ERGTNACGIES 174
RG CG+ +
Sbjct: 348 CRGHGECGLNT 358
>gi|71084302|gb|AAZ23596.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 94/180 (52%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWAVAGHRLTALSEQQLVSCDDMDSGCGGGLMTQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C ++ V R+ +++ ++T L GP+ G++ + Y+G ++
Sbjct: 219 TFGYVPECTNSSQLVPGARIDGYVMIESNETVMAAWLAKSGPISIGVDASSFMSYHGGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CAGKQLNHGVLLVGYNMTGEVPYWVIKNSWGENWG-EKGYVRVTMGVNACLLTEY 334
>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
Length = 375
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/193 (34%), Positives = 99/193 (51%), Gaps = 26/193 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
+E + I LL LS+ QL++C+ + GC+GG A +YL AG LE E
Sbjct: 179 VEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEEE 238
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQ 109
+ YP+ G G C + +V VRV +F + N + ++ H GPL G+N +Q
Sbjct: 239 SSYPY---TGKHGECKFKPDRVAVRVVNFTEVPINENQIAANLVCH-GPLAVGLNAIFMQ 294
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQV-------PVWIVRNSWG-RWGPDDGYF 161
Y G + +CP +NH V++VGYG + P WI++NSWG RWG + GY+
Sbjct: 295 TYIGG-VSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGKRWG-EHGYY 352
Query: 162 TVERGTNACGIES 174
+ RG CG+ +
Sbjct: 353 RLCRGHGMCGMNT 365
>gi|148575301|gb|ABQ95351.1| secreted cathepsin L2 [Fasciola hepatica]
Length = 326
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 88/183 (48%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q+ S+ QL++C + N GC GG A +YLKH GLE E+ YP++
Sbjct: 141 VEGQFRKNERASASFSEQQLVDCTRDFGNYGCGGGYMENAYEYLKHNGLETESYYPYQ-- 198
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
V G C YD R +V+ + + D + ++ GP ++ I
Sbjct: 199 -AVEGPCQYDGRLAYAKVTGYYTVHSGDEIELKNLVGTEGPAAVALDADSDFMMYQSGIY 257
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGY--FTVERGTNACGIESY 175
++ C + L HAV+ VGYG + WIV+NSWG W +DGY F RG N CGI S
Sbjct: 258 QSQTCLPDRLTHAVLAVGYGSQDGTDYWIVKNSWGTWWGEDGYIRFARNRG-NMCGIASL 316
Query: 176 GGI 178
+
Sbjct: 317 ASV 319
>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 95/180 (52%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L LS+ L+ C+ + GC+GG ++A++++ + + E YP+ +
Sbjct: 159 IEGQWKVAGHELTSLSEQMLVSCDNMDYGCRGGFLDRALKWIVSSNKGNVFTEESYPYDS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+G C + V ++S + + + L GP+ ++ + DY G ++
Sbjct: 219 TDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVDASSFLDYTGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYG 176
C S+ LNH V++VGY + P WI++NSWG +WG ++GY VE+GTN C ++ Y
Sbjct: 279 S---CSSDALNHGVLLVGYDDSSKPPYWIIKNSWGKKWG-EEGYIRVEKGTNQCLMKEYA 334
>gi|258618831|gb|ACV84238.1| cysteine proteinase L [Anisakis simplex]
Length = 411
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 95/176 (53%), Gaps = 6/176 (3%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQN 60
++ES AI L+ LS+ QL++C++ + GC GG A+QY++H G+ E YP+ +
Sbjct: 229 VVESMNAIAKNPLVSLSEQQLVDCDMNDNGCDGGYRPYALQYIRHNGIVPEELYPYAGKE 288
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRK 118
+ + ++V V+ ++ N S +++ GPL G+N L +G
Sbjct: 289 LDSCKLNTTVQRVYVKTVKYIRRNES-AMADFVFYKGPLSVGINVTKDLFHYQSGVFTPS 347
Query: 119 NDVCPSENL-NHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C HA+ +VGYG ++ WI++NSWG RWG DG+F +RG N+CGI
Sbjct: 348 KEDCEQNPQGTHALAVVGYGSQNGEDYWIIKNSWGKRWGM-DGFFLYKRGANSCGI 402
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 159 LEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 218
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G+C YD + S + L F + + + + GP+ G++ + + K
Sbjct: 219 ---AMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKT 275
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG W+V+NSWG D GY + R + N CGI S
Sbjct: 276 GVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIAS 335
Query: 175 Y 175
Y
Sbjct: 336 Y 336
>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
Length = 451
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 93/175 (53%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ ++ G LL LS+ +L++C+ +Q C GG + A + K GLE E DY +
Sbjct: 271 VEGQWFLRRGALLALSEQELVDCDTLDQACGGGLPSNAYTAIEKLGGLETEKDYSY---E 327
Query: 61 GVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G RC++ K +V + S + + L GP+ +N +Q Y +
Sbjct: 328 GRKERCSFSPDKARVYINSSVDLSRDEEELATWLAENGPVSIALNAFAMQFYRRGVSHPF 387
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+C ++HAV++VGYG R +P W ++NSWG WG ++GY+ + RG ACG+
Sbjct: 388 RPLCSPWFIDHAVLLVGYGHRSGIPFWAIKNSWGPDWG-EEGYYYLYRGARACGV 441
>gi|256077197|ref|XP_002574894.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230780|emb|CCD77197.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 419
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 96/178 (53%), Gaps = 9/178 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+ K G LL LS+ QL++C+ + GC GG + A + +K GL E +YP+ +N
Sbjct: 238 VESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKMGGLMLEDNYPYDAKN 297
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C V V ++ + +T LYH + GMN LLQ Y +
Sbjct: 298 E---KCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPW 354
Query: 120 DV-CPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C L+HAV++VGYG+ + P WIV+NSWG WG ++GYF + RG CGI +
Sbjct: 355 WIFCSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWG-ENGYFRMYRGDGTCGINT 411
>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 93/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+ + +L+ LS+ +L+ C+ ++GC GG +A +L ++ + YP+ +
Sbjct: 159 IESQWYLATHSLISLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGVSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSD--FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D + + DT L GP+ ++ + Y G ++
Sbjct: 219 GNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMSYTGGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSWG WG + GY V +GTN C I+ Y
Sbjct: 279 TS---CDGKQLNHGVLLVGYNMTGEVPYWLIKNSWGENWG-EKGYVRVRKGTNECLIQEY 334
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/195 (32%), Positives = 99/195 (50%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +YL G++ E
Sbjct: 167 LEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLMNNAFEYLIGSGGVQRE 226
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G C +D K+ VS++ V + + L GPL +N +Q
Sbjct: 227 KDYPY---TGRDGTCKFDKSKIAASVSNYSVISLDEEQIAANLVKNGPLAVAINAVYMQT 283
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 284 YVGGV-----SCPYICGKHLDHGVLLVGYGEGAYAPIRFKEKPYWIIKNSWGENWG-ENG 337
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 338 YYKICRGRNVCGVDS 352
>gi|256077193|ref|XP_002574892.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230781|emb|CCD77198.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 457
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 96/178 (53%), Gaps = 9/178 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+ K G LL LS+ QL++C+ + GC GG + A + +K GL E +YP+ +N
Sbjct: 276 VESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKMGGLMLEDNYPYDAKN 335
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C V V ++ + +T LYH + GMN LLQ Y +
Sbjct: 336 E---KCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPW 392
Query: 120 DV-CPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C L+HAV++VGYG+ + P WIV+NSWG WG ++GYF + RG CGI +
Sbjct: 393 WIFCSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWG-ENGYFRMYRGDGTCGINT 449
>gi|42564149|gb|AAS20588.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 322
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 93/178 (52%), Gaps = 11/178 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGF-NKAIQYLKHAGLEAEADYPFRN 58
LE Q AI + PLS+ QL++C+ N C GG +A Y+ G+EAE+ YP+
Sbjct: 143 LEGQNAIHNKVKTPLSEQQLLDCSASYGNGDCDDGGLMTEAFDYIIDNGIEAESSYPYVE 202
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
Q C YDA+K V++ + + D ++ + GP+ GM+ L Y G ++
Sbjct: 203 Q---MTECQYDAKKTIVQIKGYKKLLADEDELKKAVGTVGPISVGMSSENLHMYGGGVL- 258
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
D C ++HAV++VGYG + W V+NSWG +DGYF +ER N C I S
Sbjct: 259 -GDQCYF-GMDHAVLVVGYGEANGKKFWKVKNSWGATWGEDGYFRIERDADNLCDIAS 314
>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
Length = 368
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/204 (31%), Positives = 99/204 (48%), Gaps = 28/204 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A ++ LK GL E
Sbjct: 168 LEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEHTLKTGGLMKE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G T C D K+ VS+F V + + L GPL +N +Q
Sbjct: 228 EDYPYTGKDGKT--CKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQT 285
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGRWGPDDGY 160
Y G + CP + LNH V++VGYG + P WI++NSWG ++G+
Sbjct: 286 YIGGV-----SCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGF 340
Query: 161 FTVERGTNACGIESYGGICTRTLN 184
+ + +G N CG++S T++
Sbjct: 341 YKICKGRNICGVDSMVSTVAATVS 364
>gi|345798093|ref|XP_536212.3| PREDICTED: pro-cathepsin H [Canis lupus familiaris]
Length = 350
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/182 (36%), Positives = 95/182 (52%), Gaps = 13/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQG-GGFNKAIQYLKH-AGLEAEADYPFR 57
LES AIK G LL L++ QL++C N N GCQG G +A +Y+++ G+ E YP++
Sbjct: 164 LESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGYGAPLQAFEYIRYNKGIMGEDSYPYK 223
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
Q+G C Y K V D + N + Y P+ K
Sbjct: 224 GQDG---DCKYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVTSDFMMYRKG 280
Query: 116 IRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I + C + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N CG+
Sbjct: 281 IYSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWG-MNGYFLMERGKNMCGL 339
Query: 173 ES 174
+
Sbjct: 340 AA 341
>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 456
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 96/178 (53%), Gaps = 9/178 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+ K G LL LS+ QL++C+ + GC GG + A + +K GL E +YP+ +N
Sbjct: 275 VESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKMGGLMLEDNYPYDAKN 334
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C V V ++ + +T LYH + GMN LLQ Y +
Sbjct: 335 E---KCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPW 391
Query: 120 DV-CPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C L+HAV++VGYG+ + P WIV+NSWG WG ++GYF + RG CGI +
Sbjct: 392 WIFCSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWG-ENGYFRMYRGDGTCGINT 448
>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
Length = 501
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 103/192 (53%), Gaps = 13/192 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+ES AI G L+ LS+ +L++C+ Y+ GC GG + A +++ K+ GL++E DYP+ + N
Sbjct: 176 IESANAIATGDLIRLSEQELVDCDTYDYGCDGGNMDTAYRWIIKNGGLDSEDDYPYTSSN 235
Query: 61 GVTGRC-AYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGAL--LQDYNGKLIR 117
G G+C + K V + ++ ++ P+ G+ G+ Q Y G +
Sbjct: 236 GRDGKCDKTKSAKSVVSLDSYVEVESNEDAVLCAVATTPVTIGIVGSAYDFQLYTGGVY- 294
Query: 118 KNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESY 175
N C S+ +++HAV+IVGYG + WIV+NSWG + +GY +ER T+
Sbjct: 295 -NGQCSSKPYDIDHAVLIVGYGSQDGKDYWIVKNSWGTYWGLEGYILMERNTDI-----K 348
Query: 176 GGICTRTLNGVF 187
G+C L V+
Sbjct: 349 NGVCGMYLEPVY 360
>gi|149698347|ref|XP_001499302.1| PREDICTED: cathepsin O-like [Equus caballus]
Length = 367
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 93/178 (52%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
+ES AIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 187 VESVCAIKGEPLEDLSVQQVIDCSYNNYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQ 246
Query: 60 NGVTGRCAY-DARKVKVRVSDFLVFNGSDTFRRM---LYHYGPLVAGMNGALLQDYNGKL 115
+G+ C Y + F ++ SD +M L +GPLV ++ QDY G +
Sbjct: 247 SGL---CHYFSDSHSGFSIKGFSAYDFSDQEDQMAKALLTFGPLVVVVDAVSWQDYLGGV 303
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S NHAV+I G+ P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 304 IQHH--CSSGEANHAVLITGFDRTGSTPYWIVRNSWGSSWGV-DGYAHVKMGGNICGI 358
>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
Length = 365
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/195 (32%), Positives = 99/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L+ LS+ QL++C+ + GC GG N A++Y LK GL E
Sbjct: 166 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSALEYTLKAGGLMRE 225
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ + G C +D K+ V++F V + + L GPL +N +Q
Sbjct: 226 EDYPYSGTD--RGTCKFDETKIAASVANFSVVSLDENQIAANLVKNGPLAVAINAVFMQT 283
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP S+ L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 284 YVGGV-----SCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWG-ENG 337
Query: 160 YFTVERGTNACGIES 174
++ + +G N CG++S
Sbjct: 338 FYKICQGRNVCGVDS 352
>gi|358339356|dbj|GAA47436.1| cathepsin L [Clonorchis sinensis]
Length = 236
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 93/179 (51%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ K L+ LS+ QL++C+ ++ C GG A + + K GL +E DYP+
Sbjct: 54 IEGQWYKKTKKLVSLSEQQLLDCDKKDEACNGGFPEWAYESIVKMGGLMSEKDYPYEAHK 113
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
C + ++D + + + L GP+ GMN LQ Y G +
Sbjct: 114 ET---CNLKPNNISAYINDSVTLSKDEKELAAWLTENGPISVGMNANFLQFYFGGVSHPP 170
Query: 119 NDVCPSENLNHAVVIVGYGMRH--QVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+ +C + L+HAV++VGYG+ Q P WIV+NSWGR WG + GYF + RG CGI +
Sbjct: 171 HMLCSEQGLDHAVLLVGYGVTSFWQRPYWIVKNSWGRSWG-EKGYFRIYRGDGTCGINA 228
>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
[Cucumis sativus]
Length = 381
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 93/190 (48%), Gaps = 29/190 (15%)
Query: 7 AIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPF 56
A++ L LS+ QL++C+ + GC GG N A +Y LK GL E DYP+
Sbjct: 176 ALEGANFLXLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYPY 235
Query: 57 RNQNGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ T C +D K+ ++ F V N D L GPL +N +Q Y G
Sbjct: 236 AGIDRNT--CNFDKSKIAASIASFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIGG 293
Query: 115 LIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDGYFTVE 164
+ CP S+ L+H V++VGYG P+ WI++NSWG ++GY+ +
Sbjct: 294 V-----SCPFICSKRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKIC 348
Query: 165 RGTNACGIES 174
RG N CG++S
Sbjct: 349 RGRNICGVDS 358
>gi|4139678|pdb|8PCH|A Chain A, Crystal Structure Of Porcine Cathepsin H Determined At 2.1
Angstrom Resolution: Location Of The Mini-Chain
C-Terminal Carboxyl Group Defines Cathepsin H
Aminopeptidase Function
gi|28948781|pdb|1NB3|A Chain A, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948784|pdb|1NB3|B Chain B, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948787|pdb|1NB3|C Chain C, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948790|pdb|1NB3|D Chain D, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948793|pdb|1NB5|A Chain A, Crystal Structure Of Stefin A In Complex With Cathepsin H
gi|28948796|pdb|1NB5|B Chain B, Crystal Structure Of Stefin A In Complex With Cathepsin H
gi|28948799|pdb|1NB5|C Chain C, Crystal Structure Of Stefin A In Complex With Cathepsin H
gi|28948802|pdb|1NB5|D Chain D, Crystal Structure Of Stefin A In Complex With Cathepsin H
Length = 220
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 97/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G +L L++ QL++C N N GCQGG ++A +Y+++ G+ E YP++
Sbjct: 35 LESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKG 94
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDY--NGK 114
Q+ C + K V D + N + + Y P+ + D+ K
Sbjct: 95 QDD---HCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFE--VTNDFLMYRK 149
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N CG
Sbjct: 150 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKNMCG 208
Query: 172 IES 174
+ +
Sbjct: 209 LAA 211
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 104/185 (56%), Gaps = 15/185 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
+E Q + G L+ LS+ QL++C+ N C GG + A +Y+K + G++ EA YP+
Sbjct: 185 IEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNGIDTEASYPY-- 242
Query: 59 QNGVTG----RCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNGALLQDYN 112
+G TG C ++ ++ VRV+ ++ ++ + HYGP+ +N L +
Sbjct: 243 VSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYGPISVAINAGLPSFMS 302
Query: 113 GKL-IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNA 169
K + +D C S++L+H V++VGYG + +P W+++NSWG WG ++GY + R N
Sbjct: 303 YKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPYWLIKNSWGPHWG-ENGYVKILRDHNNL 361
Query: 170 CGIES 174
CG+ S
Sbjct: 362 CGVAS 366
>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
Length = 385
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 104/185 (56%), Gaps = 15/185 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
+E Q + G L+ LS+ QL++C+ N C GG + A +Y+K + G++ EA YP+
Sbjct: 197 IEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNGIDTEASYPY-- 254
Query: 59 QNGVTG----RCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNGALLQDYN 112
+G TG C ++ ++ VRV+ ++ ++ + HYGP+ +N L +
Sbjct: 255 VSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYGPISVAINAGLPSFMS 314
Query: 113 GKL-IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNA 169
K + +D C S++L+H V++VGYG + +P W+++NSWG WG ++GY + R N
Sbjct: 315 YKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPYWLIKNSWGPHWG-ENGYVKILRDHNNL 373
Query: 170 CGIES 174
CG+ S
Sbjct: 374 CGVAS 378
>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
vulgare]
gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 377
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 98/198 (49%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + G + LS+ QL++C+ + GC GG A YL K GLE E
Sbjct: 175 LEGANYLASGKMEVLSEQQLVDCDHECDPSEPDSCDAGCNGGLMTSAFSYLLKSGGLERE 234
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G C +D K+ V ++ +V + L YGPL G+N A +Q
Sbjct: 235 KDYPYTGKDGT---CKFDKSKIAASVQNYSVVAVDEEQIAANLVKYGPLAIGINAAYMQT 291
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG + P WI++NSWG WG D G
Sbjct: 292 YIGGV-----SCPYICGRHLDHGVLLVGYGASGFAPSRFKEKPYWIIKNSWGENWG-DKG 345
Query: 160 YFTVERGTNA---CGIES 174
Y+ + RG+N CG++S
Sbjct: 346 YYKICRGSNVRNKCGVDS 363
>gi|414589597|tpg|DAA40168.1| TPA: hypothetical protein ZEAMMB73_868349 [Zea mays]
Length = 252
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 99/186 (53%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC+GG ++A +Y+K+ GL+ E YP++
Sbjct: 68 LEAAYTQATGKAISLSEQQLVDCGFAFNNFGCKGGLPSQAFEYIKYNGGLDTEESYPYQ- 126
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C + A V V+V D + D + + P+ + +G +
Sbjct: 127 --GVNGICQFKAENVGVKVLDSVNITLGAEDELKDAVGLVRPV-----SVAFEVISGFRL 179
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K V S+ ++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N
Sbjct: 180 YKTGVYTSDHCGTTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DEGYFKMEMGKN 238
Query: 169 ACGIES 174
CG+ +
Sbjct: 239 MCGVAT 244
>gi|222641669|gb|EEE69801.1| hypothetical protein OsJ_29533 [Oryza sativa Japonica Group]
Length = 314
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQ-GCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C YN GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 130 LEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPY-- 187
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C Y V V+V D + D + + P+ Q NG +
Sbjct: 188 -TGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPV-----SVAFQVINGFRM 241
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S+ ++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N
Sbjct: 242 YKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKN 300
Query: 169 ACGIES 174
CGI +
Sbjct: 301 MCGIAT 306
>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
Length = 363
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/186 (34%), Positives = 97/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP++
Sbjct: 179 LEAAYTQATGKPISLSEQQLVDCGKPFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYK- 237
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C + A V V+V D + D + + P+ Q NG
Sbjct: 238 --GVNGICDFKAENVGVKVLDSVNITLGAEDELKDAVALVRPV-----SVAFQVVNGFRQ 290
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S+ ++NHAV+ VGYG+ + VP W+++NSWG WG D GYF +E G N
Sbjct: 291 YKSGVYTSDSCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DKGYFKMEMGKN 349
Query: 169 ACGIES 174
CG+ +
Sbjct: 350 MCGVAT 355
>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 92/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGEYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI Y
Sbjct: 261 RCRCSNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGYY 319
>gi|341886805|gb|EGT42740.1| hypothetical protein CAEBREN_23878 [Caenorhabditis brenneri]
Length = 396
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 93/179 (51%), Gaps = 8/179 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ESQYAI+ GTL LS+ +L++C+ + GC GG A+ ++ GLE E DYP+
Sbjct: 214 VESQYAIRKGTLWSLSEQELVDCDGASYGCGGGFLTSALGFILGNGLETEDDYPYSATK- 272
Query: 62 VTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRK 118
+C + K +V + + + + D + + GP+ M+ + ++G
Sbjct: 273 -HDQCWINGDKTRVWIDEGYQLTMSEDDVAEWVANVGPVSFAMSVPKSFPAYHDGIYSPS 331
Query: 119 NDVCPSENLN-HAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C E+L HA+ I+GYG WIV+NSW G WG D GY + RG NACG+ Y
Sbjct: 332 EHECKDESLGYHAMAIIGYGQEGGQNYWIVKNSWGGSWG-DQGYMRLARGVNACGMNDY 389
>gi|33622213|ref|NP_891858.1| cathepsin [Cryptophlebia leucotreta granulovirus]
gi|33569322|gb|AAQ21608.1| cathepsin [Cryptophlebia leucotreta granulovirus]
Length = 332
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 97/175 (55%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG--LEAEADYPFRNQ 59
+ES Y IK+ ++ LS+ LI C++ N GC GG + A++ + G + +E + P+
Sbjct: 155 IESLYNIKYDKVIDLSEQHLINCDLVNNGCNGGLMHWALENILQEGGGVVSEENDPYY-- 212
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G+ C ++ + + + + +L GP+ ++ + + +Y +
Sbjct: 213 -GLDSVCKKTPWELNISGCKRYILQNENKLKELLVVNGPISVAIDVSDVINYKSGIA--- 268
Query: 120 DVCPSEN-LNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
D+C + N LNHAV++VGYG +VP WI++NSWG WG +DG+F ++R N+CG+
Sbjct: 269 DICENNNGLNHAVLLVGYGEYDEVPYWILKNSWGIEWG-EDGFFRIQRNKNSCGL 322
>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 92/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGEYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI Y
Sbjct: 261 RCRCSNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGYY 319
>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 92/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGEYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI Y
Sbjct: 261 RCRCSNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGYY 319
>gi|148283737|gb|ABN50361.2| cathepsin L [Fasciola hepatica]
Length = 326
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 90/189 (47%), Gaps = 10/189 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q+ S+ QL+ C + N GC GG A +YLKH GLE E+ YP++
Sbjct: 141 VEGQFRKNERASASFSEQQLVNCTRDFGNYGCGGGYVENAYEYLKHNGLETESYYPYQ-- 198
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
V G C YD R +V+ + + D + ++ GP ++ I
Sbjct: 199 -AVEGPCQYDGRLAYAKVTGYYTVHSGDEIELKNLVGTEGPAAVALDADSDFMMYQSGIY 257
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGY--FTVERGTNACGIESY 175
++ C + L HAV+ VGYG + WIV+NSWG W +DGY F RG N CGI S
Sbjct: 258 QSQTCLPDRLTHAVLAVGYGSQDGTDYWIVKNSWGTWWGEDGYIRFARNRG-NMCGIASL 316
Query: 176 GGICTRTLN 184
+ T++
Sbjct: 317 ASVPIGTIS 325
>gi|172050735|gb|ACB70169.1| cathepsin H transcript variant 3 [Sus scrofa]
Length = 251
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 97/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G +L L++ QL++C N N GCQGG ++A +Y+++ G+ E YP++
Sbjct: 66 LESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKG 125
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDY--NGK 114
Q+ C + K V D + N + + Y P+ + D+ K
Sbjct: 126 QDD---HCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFE--VTNDFLMYRK 180
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N CG
Sbjct: 181 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKNMCG 239
Query: 172 IES 174
+ +
Sbjct: 240 LAA 242
>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 92/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGEYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI Y
Sbjct: 261 RCRCSNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGYY 319
>gi|7271893|gb|AAF44677.1|AF239266_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/187 (34%), Positives = 92/187 (49%), Gaps = 18/187 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q+ S+ QL++C N N GC GG A +YLKH+GLE ++ YP++
Sbjct: 141 MEGQFRKNERASASFSEQQLVDCTRNFGNHGCGGGYMENAYEYLKHSGLETDSYYPYQ-- 198
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKL-- 115
V G C YD R +V+D+ + D + ++ GP AL DY+ +
Sbjct: 199 -AVEGPCQYDGRLAYAKVTDYYTVHSGDEVELKNLVGTEGPAAV----ALDVDYDFMMYE 253
Query: 116 --IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGY--FTVERGTNACG 171
I ++ C + L HAV+ VGYG + WIV+NSWG + GY F RG N CG
Sbjct: 254 SGIYHSETCLPDRLTHAVLAVGYGAQDGTDYWIVKNSWGSSWGEKGYIRFARNRG-NMCG 312
Query: 172 IESYGGI 178
I S +
Sbjct: 313 IASLASV 319
>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
Length = 377
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 98/198 (49%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + G + LS+ QL++C+ + GC GG A YL K GLE E
Sbjct: 175 LEGANYLATGKMEVLSEQQLVDCDHECDPAEPDSCDAGCNGGLMTSAFSYLLKSGGLERE 234
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G C ++ K+ V +F +V + L YGPL G+N A +Q
Sbjct: 235 KDYPYTGKDGT---CKFEKSKIAASVQNFSVVAVDEEQIAANLVEYGPLAIGINAAYMQT 291
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG + P WI++NSWG WG D G
Sbjct: 292 YIGGV-----SCPYICGRHLDHGVLLVGYGASGFAPSRFKEKPYWIIKNSWGENWG-DKG 345
Query: 160 YFTVERGTNA---CGIES 174
Y+ + RG+N CG++S
Sbjct: 346 YYKICRGSNVRNKCGVDS 363
>gi|149392541|gb|ABR26073.1| oryzain gamma chain precursor [Oryza sativa Indica Group]
Length = 367
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQ-GCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C YN GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 183 LEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPY-- 240
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C Y V V+V D + D + + P+ Q NG +
Sbjct: 241 -TGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPV-----SVAFQVINGFRM 294
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S+ ++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N
Sbjct: 295 YKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKN 353
Query: 169 ACGIES 174
CGI +
Sbjct: 354 MCGIAT 359
>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
Length = 379
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 93/191 (48%), Gaps = 23/191 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--------IYNQGCQGGGFNKAIQYLKHAG-LEAEA 52
+E + G L+ LS+ QL++C+ + GC GG A YL AG LE E
Sbjct: 173 IEGANFLATGKLVSLSEQQLVDCDNKCDITKTSCDNGCNGGLMTTAYDYLMEAGGLEEET 232
Query: 53 DYPFRNQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDY 111
YP+ G G C +D KV VRVS+F + + L ++GPL +N +Q Y
Sbjct: 233 SYPY---TGAQGECKFDPNKVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAVFMQTY 289
Query: 112 NGKLIRKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWG-RWGPDDGYFTV 163
G + +C LNH V++VGY + P W ++NSWG +WG + GY+ +
Sbjct: 290 VGG-VSCPLICSKRRLNHGVLLVGYNAEGFSILRLRKKPYWTIKNSWGEQWG-EKGYYKL 347
Query: 164 ERGTNACGIES 174
RG CG+ +
Sbjct: 348 CRGHGMCGMNT 358
>gi|155970232|gb|ABU41785.1| cysteine protease [Rosa x borboniana]
Length = 357
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 100/186 (53%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + S+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 173 LEAAYVQAFGKQISPSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLDTEQAYPY-- 230
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
V G C + + V VRV D + N + + + P+ ++QD+ +
Sbjct: 231 -TAVDGACKFSSENVGVRVLDSVNITLNDEEELKHAVAFVRPVSVAFQ--VVQDFR---L 284
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V SE ++NHAV+ VGYG+ + VP W+++NSWG+ WG D+GYF +E G N
Sbjct: 285 YKSGVYTSETCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGQSWG-DNGYFKMEYGKN 343
Query: 169 ACGIES 174
CG+ +
Sbjct: 344 MCGVAT 349
>gi|37655265|gb|AAQ96835.1| cysteine proteinase [Glycine max]
Length = 215
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/174 (33%), Positives = 92/174 (52%), Gaps = 10/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ YA G + LS+ QL++C N GC GG ++A +Y+K+ GLE E YP+
Sbjct: 13 LEAAYAQAFGKSISLSEQQLVDCAGPFNNFGCHGGLPSQAFEYIKYNGGLETEEAYPYTG 72
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
++GV C + A V V+V D + D + + P+ + +
Sbjct: 73 KDGV---CKFSAENVAVQVLDSVNITLGAEDELKHAVAFVRPVSVAFQVVNGFHFYENGV 129
Query: 117 RKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTN 168
+D C S+++NHAV+ VGYG+ + VP W+++NSWG ++GYF +E G N
Sbjct: 130 FTSDTCGSTSQDVNHAVLAVGYGVENGVPYWLIKNSWGESWGENGYFKMELGKN 183
>gi|426248750|ref|XP_004018122.1| PREDICTED: pro-cathepsin H [Ovis aries]
Length = 355
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 90/181 (49%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G L L++ QL++C N N GCQGG ++A +Y+++ G+ E YP+R
Sbjct: 170 LESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYR- 228
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C Y K V D + N + + Y P+ K I
Sbjct: 229 --GEDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTADFMMYRKGI 286
Query: 117 RKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ C + +NHAV+ VGYG +P WIV+NSWG WG GYF +ERG N CG+
Sbjct: 287 YSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPHWG-MKGYFLIERGKNMCGLA 345
Query: 174 S 174
+
Sbjct: 346 A 346
>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 91/181 (50%), Gaps = 15/181 (8%)
Query: 2 LESQYAIK-HGTLLPLSKSQLIECNIY-NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY ++ L S+ QL++C+ +QGC GG + A YL+ A LE E+ YP+
Sbjct: 145 IEGQYVLQLKQNLTSFSEQQLVDCDTKEDQGCNGGLMDNAFTYLESAKLETESAYPY--- 201
Query: 60 NGVTGRCAYDARKVKVRVSDFL-------VFNGSDTFRRMLYHYGPLVAGMNGALLQDYN 112
V G C Y+ V V+ F+ V + +T L + GPL +N LQ Y
Sbjct: 202 TAVDGSCKYNQSLGVVGVASFVDIEQGKTVADTENTMGVALDNIGPLSVAINANNLQFYA 261
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
G I +C LNH V+IVG G + W V+NSWG WG + GYF + RG CG
Sbjct: 262 GG-ISNPLICNPNGLNHGVLIVGLGSENGKDFWKVKNSWGASWG-EKGYFRIVRGKGKCG 319
Query: 172 I 172
I
Sbjct: 320 I 320
>gi|47522632|ref|NP_999094.1| pro-cathepsin H precursor [Sus scrofa]
gi|5915886|sp|O46427.1|CATH_PIG RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|2735659|gb|AAB93957.1| preprocathepsin H [Sus scrofa]
gi|172050733|gb|ACB70168.1| cathepsin H [Sus scrofa]
Length = 335
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 97/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G +L L++ QL++C N N GCQGG ++A +Y+++ G+ E YP++
Sbjct: 150 LESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKG 209
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDY--NGK 114
Q+ C + K V D + N + + Y P+ + D+ K
Sbjct: 210 QDD---HCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFE--VTNDFLMYRK 264
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N CG
Sbjct: 265 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKNMCG 323
Query: 172 IES 174
+ +
Sbjct: 324 LAA 326
>gi|197258084|gb|ACH56226.1| cathepsin L-like cysteine proteinase [Radopholus similis]
Length = 417
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 93/181 (51%), Gaps = 13/181 (7%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF--RNQN 60
ES YA+ HG L LS+ +L++CN+ N C GG +KA +Y+ GL E +YP+ QN
Sbjct: 234 ESAYAVAHGHLRSLSEQELLDCNLENNACNGGSEDKAFRYIHERGLVTEDEYPYVAHRQN 293
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM--LYHYGPLVAGMN-GALLQDYNGKLIR 117
C+ D + D VF D M L ++GP+ G+ ++ Y +
Sbjct: 294 V----CSVDFGSKNLTKIDVAVFINPDEQSMMDWLINFGPVNVGIAVPPDMKPYKSGIYH 349
Query: 118 KNDV-CPSENLN-HAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+D C L HA+++VGYG + V WIV+NSW WG + GY RG NACGIE
Sbjct: 350 PSDYDCKFRVLGLHALLVVGYGESQEGVKYWIVKNSWNNTWGQEHGYVNFVRGINACGIE 409
Query: 174 S 174
Sbjct: 410 D 410
>gi|218202220|gb|EEC84647.1| hypothetical protein OsI_31538 [Oryza sativa Indica Group]
Length = 363
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQ-GCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C YN GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 179 LEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPY-- 236
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C Y V V+V D + D + + P+ Q NG +
Sbjct: 237 -TGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPV-----SVAFQVINGFRM 290
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S+ ++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N
Sbjct: 291 YKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKN 349
Query: 169 ACGIES 174
CGI +
Sbjct: 350 MCGIAT 355
>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 92/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGDYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI Y
Sbjct: 261 TCRCSNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGYY 319
>gi|326516056|dbj|BAJ88051.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 362
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ G++ E YP++
Sbjct: 177 LEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYPYKG 236
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGK 114
NGV C Y A V+V D + N D + + P+ Q +G
Sbjct: 237 VNGV---CHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKSGV 293
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+ +++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N C I
Sbjct: 294 YTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKNMCAIA 352
Query: 174 S 174
+
Sbjct: 353 T 353
>gi|119640017|gb|ABL85450.1| cathepsin L [Kudoa thyrsites]
Length = 203
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 93/177 (52%), Gaps = 9/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES YAIK G L+ S+ QL++C+ N GC GG A Y+ + G+ DYP+ + G
Sbjct: 25 IESAYAIKTGELVNFSEQQLVDCSTENHGCNGGLPEIAFLYVINNGIMKLKDYPYTAKQG 84
Query: 62 VTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKLIRK 118
C Y V VR+S F V N ++ + + GP G+N A Q Y G I
Sbjct: 85 T---CQYSPEDV-VRISSFKCVENNEESVMESVANNGPNSIGINAASRSFQFYGGG-IYF 139
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
+ S L+HAV++VGYG ++ W V+NSWG W + GY ++R G N G+ S
Sbjct: 140 DPWASSYPLDHAVLLVGYGFKNTENYWHVKNSWGPWWGEQGYINIKRDGKNFLGVTS 196
>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 92/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGDYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI Y
Sbjct: 261 KCRCSNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGYY 319
>gi|426216526|ref|XP_004002513.1| PREDICTED: cathepsin S isoform 2 [Ovis aries]
Length = 281
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 98 LEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 157
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ GRC YD + S + L F + + + + GP+ G++ + K
Sbjct: 158 ---AMDGRCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDAKQTSFFLYKT 214
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG D GY + R + N CGI +
Sbjct: 215 GVYYDPSCTQNVNHGVLVVGYGSLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAN 274
Query: 175 Y 175
+
Sbjct: 275 F 275
>gi|162815|gb|AAA30435.1| cathepsin S, partial [Bos taurus]
gi|312895|emb|CAA43971.1| cathepsin S [Bos taurus]
Length = 196
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 13 LEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 72
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G+C YD + S + L F + + + + GP+ G++ + + K
Sbjct: 73 ---AMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKT 129
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG W+V+NSWG D GY + R + N CGI +
Sbjct: 130 GVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIAN 189
Query: 175 Y 175
Y
Sbjct: 190 Y 190
>gi|115479391|ref|NP_001063289.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|115510968|sp|P25778.2|ORYC_ORYSJ RecName: Full=Oryzain gamma chain; Flags: Precursor
gi|51535997|dbj|BAD38077.1| putative oryzain gamma chain precursor [Oryza sativa Japonica
Group]
gi|113631522|dbj|BAF25203.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|215694919|dbj|BAG90110.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 362
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQ-GCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C YN GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 178 LEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPY-- 235
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C Y V V+V D + D + + P+ Q NG +
Sbjct: 236 -TGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPV-----SVAFQVINGFRM 289
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S+ ++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N
Sbjct: 290 YKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKN 348
Query: 169 ACGIES 174
CGI +
Sbjct: 349 MCGIAT 354
>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 92/180 (51%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGDYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LNH V++VGYG + V WIV+NSWG WG + GYF +++ ACGI Y
Sbjct: 261 TCRCSNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGYY 319
>gi|341888721|gb|EGT44656.1| hypothetical protein CAEBREN_22029 [Caenorhabditis brenneri]
Length = 396
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 93/179 (51%), Gaps = 8/179 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ESQYAI+ GTL LS+ +L++C+ + GC GG A+ ++ GLE E DYP+
Sbjct: 214 VESQYAIRKGTLWSLSEQELVDCDGASYGCGGGFLTSALGFILGNGLETEDDYPYSATR- 272
Query: 62 VTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRK 118
+C + K +V + + + + D + + GP+ M+ + ++G
Sbjct: 273 -HDQCWINGDKTRVWIDEGYQLTMSEDDVAEWVANVGPVSFAMSVPKSFPYYHDGIYSPS 331
Query: 119 NDVCPSENLN-HAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C E+L HA+ I+GYG WIV+NSW G WG D GY + RG NACG+ Y
Sbjct: 332 EHECKDESLGYHAMAIIGYGQEGGQNYWIVKNSWGGSWG-DQGYMRLARGVNACGMNDY 389
>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
Length = 362
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ G++ E YP++
Sbjct: 177 LEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYPYKG 236
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGK 114
NGV C Y A V+V D + N D + + P+ Q +G
Sbjct: 237 VNGV---CHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKSGV 293
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+ +++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N C I
Sbjct: 294 YTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKNMCAIA 352
Query: 174 S 174
+
Sbjct: 353 T 353
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 148 LEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ GRC YD + S + L F + + + + GP+ G++ + K
Sbjct: 208 ---AMDGRCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDAKQTSFFLYKT 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG D GY + R + N CGI +
Sbjct: 265 GVYYDPSCTQNVNHGVLVVGYGSLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAN 324
Query: 175 Y 175
+
Sbjct: 325 F 325
>gi|20301805|gb|AAM15726.1| cysteine protease [Pagumogonimus skrjabini]
Length = 165
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 83/153 (54%), Gaps = 6/153 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
+E Q+ IK G L+ LSK QL++C+ +GC GG + Q + GLE++ DYP+
Sbjct: 16 VEGQWFIKTGQLVTLSKQQLVDCDRAAEGCNGGWPVSSYQEIMVMGGLESQDDYPYV--- 72
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G +CA + K+ ++ D +V + L +GPL +N LQ Y +++ +
Sbjct: 73 GKEQQCALNKEKLVAKIDDLVVLGAYEEEHAAYLAEHGPLSTLLNAVALQHYQSGVLKPS 132
Query: 120 -DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW 151
+ CP + LNHAV+ VGY P WIV+NSW
Sbjct: 133 YEDCPDDVLNHAVLTVGYDTEGDDPYWIVKNSW 165
>gi|255211|gb|AAB23202.1| cathepsin S [cattle, spleen, Peptide Partial, 217 aa]
gi|227966|prf||1714236A cathepsin S
Length = 217
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 34 LEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 93
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G+C YD + S + L F + + + + GP+ G++ + + K
Sbjct: 94 ---AMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKT 150
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG W+V+NSWG D GY + R + N CGI +
Sbjct: 151 GVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIAN 210
Query: 175 Y 175
Y
Sbjct: 211 Y 211
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 99/195 (50%), Gaps = 31/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +YL G++ E
Sbjct: 167 LEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLMNNAFEYLIGSGGVQRE 226
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G C +D K+ VS++ V + + L GPL +N +Q
Sbjct: 227 KDYPYTGRDGT---CKFDKSKIAASVSNYSVISLDEEQIAANLVKNGPLAVAINAVYMQT 283
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V++VGYG P+ WI++NSWG WG +G
Sbjct: 284 YVGGV-----SCPYICGKHLDHGVLLVGYGEGAYAPIRFKEKPYWIIKNSWGENWG-GNG 337
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 338 YYKICRGRNVCGVDS 352
>gi|77735725|ref|NP_001029557.1| pro-cathepsin H precursor [Bos taurus]
gi|115312126|sp|Q3T0I2.1|CATH_BOVIN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|74267711|gb|AAI02387.1| Cathepsin H [Bos taurus]
gi|296475480|tpg|DAA17595.1| TPA: cathepsin H precursor [Bos taurus]
Length = 335
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 91/181 (50%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G L L++ QL++C N N GCQGG ++A +Y+++ G+ E YP+R
Sbjct: 150 LESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRG 209
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
Q+G C Y K V D + N + + + P+ K I
Sbjct: 210 QDG---DCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTADFMMYRKGI 266
Query: 117 RKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ C + +NHAV+ VGYG +P WIV+NSWG WG GYF +ERG N CG+
Sbjct: 267 YSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWG-MKGYFLIERGKNMCGLA 325
Query: 174 S 174
+
Sbjct: 326 A 326
>gi|358416284|ref|XP_874012.4| PREDICTED: cathepsin O [Bos taurus]
gi|359074588|ref|XP_002694471.2| PREDICTED: cathepsin O [Bos taurus]
Length = 313
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/175 (36%), Positives = 90/175 (51%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES AIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 133 VESVCAIKGQPLEVLSVQQVIDCSYSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQ 192
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ F+G D L GPL+ ++ QDY G +I+
Sbjct: 193 NGLCRYFSDSHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQH 252
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV++ G+ +P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 253 H--CSSGEANHAVLVTGFDKTGSIPYWIVRNSWGTSWG-IDGYVRVKMGGNVCGI 304
>gi|426247636|ref|XP_004017585.1| PREDICTED: cathepsin O [Ovis aries]
Length = 288
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/175 (36%), Positives = 91/175 (52%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES AIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 108 VESVCAIKGQPLEVLSVQQVIDCSYSNYGCNGGSPLNALYWLNKLQVKLVRDSEYPFQAQ 167
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ F+G D + L GPL+ ++ QDY G +I+
Sbjct: 168 NGLCRYFSDSHSGSSIKGYSAYDFSGQEDKMAKALLALGPLIVVVDAMSWQDYLGGIIQH 227
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV++ G+ +P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 228 H--CSSGESNHAVLVTGFDKTGSIPYWIVRNSWGTSWGI-DGYVRVKMGGNICGI 279
>gi|440910969|gb|ELR60703.1| Cathepsin H, partial [Bos grunniens mutus]
Length = 329
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 91/181 (50%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G L L++ QL++C N N GCQGG ++A +Y+++ G+ E YP+R
Sbjct: 144 LESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRG 203
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
Q+G C Y K V D + N + + + P+ K I
Sbjct: 204 QDG---DCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTADFMMYRKGI 260
Query: 117 RKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ C + +NHAV+ VGYG +P WIV+NSWG WG GYF +ERG N CG+
Sbjct: 261 YSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWG-MKGYFLIERGKNMCGLA 319
Query: 174 S 174
+
Sbjct: 320 A 320
>gi|426252044|ref|XP_004019728.1| PREDICTED: cathepsin W [Ovis aries]
Length = 375
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 100/198 (50%), Gaps = 26/198 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+E+ +AIK + +L++C+ GC+GG ++ + LK+ GL +E DYPF + +
Sbjct: 161 IEALWAIKFNRSVEERGGELLDCDRCGNGCKGGFVWDAFLTVLKNRGLASETDYPF-DGS 219
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G T RC + K + DF++ + + R L GP+ +N LLQ Y +I+
Sbjct: 220 GKTHRCLAEKHKKVAWIQDFIMLQACEQSIARHLATQGPITVTINVKLLQQYQKGVIKAT 279
Query: 120 -DVCPSENLNHAVVIVGYGM--------------------RHQVPVWIVRNSWG-RWGPD 157
C +++H+V++VG+G R + W ++NSWG WG +
Sbjct: 280 PTTCDPRHVDHSVLLVGFGKTKSVEGRQGKAASFRSYTRPRRSMAYWTLKNSWGPHWG-E 338
Query: 158 DGYFTVERGTNACGIESY 175
+GYF + RG+N CGI Y
Sbjct: 339 EGYFRLHRGSNTCGITKY 356
>gi|119640015|gb|ABL85449.1| cathepsin L [Kudoa thyrsites]
Length = 203
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 93/177 (52%), Gaps = 9/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES YAIK G L+ S+ QL++C+ N GC GG A Y+ + G+ DYP+ + G
Sbjct: 25 IESAYAIKTGELVNFSEQQLVDCSTENHGCNGGLPEIAFLYVINNGIMKLKDYPYTAKQG 84
Query: 62 VTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKLIRK 118
C Y V VR+S F V N ++ + + GP G+N A Q Y G I
Sbjct: 85 T---CQYSPEDV-VRISSFKCVKNNEESVMESVANNGPNSIGINAASRSFQFYGGG-IYF 139
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
+ S L+HAV++VGYG ++ W V+NSWG W + GY ++R G N G+ S
Sbjct: 140 DPWASSYPLDHAVLLVGYGYKNTENYWHVKNSWGPWWGEQGYINIKRDGKNFLGVTS 196
>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
Length = 359
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 95/181 (52%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQ-GCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C YN GC GG ++A +Y+K+ G++ E YP++
Sbjct: 174 LEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGIDTEESYPYKG 233
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGK 114
NGV C Y V+V+D + N D + + P+ Q +G
Sbjct: 234 VNGV---CKYRPENAAVQVADSVNITLNAEDELKNAVGLVRPVSVAFEVIDGFKQYKSGV 290
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+ +++NHAV+ VGYG+ + VP W+++NSWG WG +DGYF +E G N C +
Sbjct: 291 YTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-EDGYFKMEMGKNMCAVA 349
Query: 174 S 174
+
Sbjct: 350 T 350
>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 376
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 96/198 (48%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
LE + + G L LS+ Q+++C+ + GC GG A YL AG LE E
Sbjct: 174 LEGAHYLATGKLEVLSEQQMVDCDHECDPSEPRACDAGCNGGLMTTAFSYLAKAGGLETE 233
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G G C +D K+ +V +F V D L +GPL G+N +Q
Sbjct: 234 KDYPY---TGRGGACKFDKSKIAAQVKNFSTVAVDEDQIAANLVKHGPLAIGINAVFMQT 290
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG + P WI++NSWG WG + G
Sbjct: 291 YIGGV-----SCPFICGRHLDHGVLLVGYGSAGYAPLRFKEKPYWIIKNSWGENWG-ESG 344
Query: 160 YFTVERGT---NACGIES 174
Y+ + RG N CG++S
Sbjct: 345 YYKICRGAHVKNKCGVDS 362
>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
Length = 341
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/183 (31%), Positives = 96/183 (52%), Gaps = 11/183 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G++++A YP+
Sbjct: 157 LEAQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCDGGFMTRAFQYIIDNGGIDSDASYPY 216
Query: 57 RNQNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ V +C YD++ S ++ D + + + GP+ G++ + + K
Sbjct: 217 K---AVAEKCHYDSKSRAATCSRYMELPSGDEEALKEAVANKGPVSVGIDASHPSFFLYK 273
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
++ +EN+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 274 SGVYDEPSCTENVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNQCGIA 333
Query: 174 SYG 176
SYG
Sbjct: 334 SYG 336
>gi|342305190|dbj|BAK55649.1| cathepsin O [Oplegnathus fasciatus]
Length = 338
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 96/177 (54%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA--GLEAEADYPFRNQ 59
++S +AI L LS Q+++C+ N GC GG +A+ +LK L +++Y ++ +
Sbjct: 158 MQSVHAIGGSPLAQLSVQQVLDCSFQNHGCNGGSPFRALTWLKQTRVKLVPQSEYSYKAE 217
Query: 60 NGVTGRCAYDARK-VKVRVSDFLVFNGSDTFRRM---LYHYGPLVAGMNGALLQDYNGKL 115
G+ C + ++ V V +F + S M L +GPL A ++ QDY G +
Sbjct: 218 TGI---CHFFSQSHAGVAVKNFTAHDFSGQEEAMMGQLVEHGPLAAIVDAVSWQDYLGGI 274
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
I+ + C S+ NHAV++VGY +P WIV+NSWG ++GY ++ G N CGI
Sbjct: 275 IQHH--CSSQWSNHAVLVVGYNTTGDIPYWIVQNSWGTTWGNEGYVYIKIGGNVCGI 329
>gi|332326585|gb|AEE42616.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 92/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWAVAGHRLXXLSEQQLVSCDDKDSGCXGGLMTQAFEWLLRXMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C + V R+ +++ ++T L GP+ G++ + Y ++
Sbjct: 219 STGDVPECTNSSELVPGARIDGYVMIESNETVMAAWLAKSGPISIGVDASSFMSYESGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C ++LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CAGKHLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLXEY 334
>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
Length = 350
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 96/183 (52%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES YA G + LS+ QL++C N GC GG ++A +Y+K+ GLE E YP+
Sbjct: 166 LESAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTG 225
Query: 59 QNGVTGRCAYDARKVKVRV--SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
NG+ C + + V V+V S + D + + P+ ++ D+
Sbjct: 226 SNGL---CKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVAFE--VVHDFRLYKS 280
Query: 115 LIRKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACG 171
+ + C S ++NHAV+ VGYG+ +P W+++NSW G WG D GYF +E G N CG
Sbjct: 281 GVYTSTACGSTPMDVNHAVLAVGYGIEDGIPYWLIKNSWGGDWG-DHGYFKMEMGKNMCG 339
Query: 172 IES 174
+ +
Sbjct: 340 VAT 342
>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
Length = 358
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 96/186 (51%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 174 LEAAYKQAFGKGISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAYPYTG 233
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+NG C + + V V+V D + D + + P+ Q NG +
Sbjct: 234 KNG---ECKFSSENVGVQVLDSVNITLGAEDELKHAVAFVRPVSVA-----FQVVNGFRL 285
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K V S+ ++NHAV+ VGYG+ + VP W+++NSWG WG D GYF +E G N
Sbjct: 286 YKEGVYTSDTCGRTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DSGYFKMEMGKN 344
Query: 169 ACGIES 174
CG+ +
Sbjct: 345 MCGVAT 350
>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
Length = 371
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 100/198 (50%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYLKHAG-LEAE 51
LE + + G L LS+ Q+++C+ + GC GG A YL+ AG LE+E
Sbjct: 170 LEGAHYLATGKLEVLSEQQMVDCDHVCDTSEPDSCDSGCNGGLMTNAFSYLQKAGGLESE 229
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G +C +D K+ V +F V + + L +GPL G+N A +Q
Sbjct: 230 KDYPY---TGSDDKCKFDKSKIVASVQNFSVVSVDEGQIAANLIKHGPLAIGINAAYMQT 286
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 287 YIGGV-----SCPYICGRTLDHGVLLVGYGAAGFAPIRLKDKPYWIIKNSWGENWG-ENG 340
Query: 160 YFTVERGTNA---CGIES 174
Y+ + RG+N CG++S
Sbjct: 341 YYKICRGSNVRNKCGVDS 358
>gi|38045864|gb|AAR08900.1| cathepsin L [Fasciola gigantica]
Length = 326
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 89/184 (48%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY + S+ QL++C + N+GC GG A +YL GLE E+ YP++ +
Sbjct: 141 MEGQYMKNQKANISFSEQQLVDCSGDYGNRGCSGGFMEHAYEYLYEVGLETESSYPYKAE 200
Query: 60 NGVTGRCAYDARKVKVRVSDFLV--FNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKL 115
G C YD+R +V+ F F ++ GP ++ L G
Sbjct: 201 EGP---CKYDSRLGVAKVNGFYFDHFGVESKLAHLVGDKGPAAVAVDVESDFLMYRGGIY 257
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIES 174
+N C SE LNHA+++VGYG + WIV+NSWG D GY + R N CGI S
Sbjct: 258 ASRN--CSSEKLNHAMLVVGYGTQDGTDYWIVKNSWGSLWGDHGYIRMARNRDNMCGIAS 315
Query: 175 YGGI 178
+ +
Sbjct: 316 FASL 319
>gi|394331814|gb|AFN27126.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 93/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWALAGHRLTALSEQQLVSCDDKDSGCGGGLMLQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+G C+ ++ V R+ ++ S+T L GP+ ++ + Y ++
Sbjct: 219 SSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYESGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSWG WG ++GY V G NAC + Y
Sbjct: 279 TS---CAGDTLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-ENGYVRVTMGVNACLLTEY 334
>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
Length = 350
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 96/183 (52%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES YA G + LS+ QL++C N GC GG ++A +Y+K+ GLE E YP+
Sbjct: 166 LESAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTG 225
Query: 59 QNGVTGRCAYDARKVKVRV--SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
NG+ C + + V V+V S + D + + P+ ++ D+
Sbjct: 226 SNGL---CKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVAFE--VVHDFRLYKS 280
Query: 115 LIRKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACG 171
+ + C S ++NHAV+ VGYG+ +P W+++NSW G WG D GYF +E G N CG
Sbjct: 281 GVYTSTACGSTPMDVNHAVLAVGYGIEDGIPYWLIKNSWGGDWG-DHGYFKMEMGKNMCG 339
Query: 172 IES 174
+ +
Sbjct: 340 VAT 342
>gi|148298724|ref|NP_001091806.1| cathepsin L-like proteinase precursor [Bombyx mori]
gi|116272515|gb|ABJ97193.1| cathepsin L-like proteinase [Bombyx mori]
Length = 402
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 91/178 (51%), Gaps = 17/178 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
L++Q +HG LS Q+++C+I N GC GG A++Y GL E+ YP+
Sbjct: 226 LQAQLYKRHGEWNELSPQQIVDCSIKDGNMGCDGGSLRGALRYAAREGLVMESHYPY--- 282
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA--LLQDYNGKL 115
G G C YD+ V+ R + D + L GPL +N A Q Y+G
Sbjct: 283 VGKKGYCRYDSNLVRARPRRWATLPSGDEEAMEKALATVGPLAVAVNAAPFTFQLYSG-- 340
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ + C S +LNHA+++VGY + WI+ N WGR WG +DGY + RG N CG+
Sbjct: 341 VYDDPFCVSWHLNHAMLLVGYTQDY----WILLNWWGRNWG-EDGYMRIRRGLNRCGV 393
>gi|296478683|tpg|DAA20798.1| TPA: cathepsin O preproprotein-like [Bos taurus]
Length = 375
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/175 (36%), Positives = 90/175 (51%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES AIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 195 VESVCAIKGQPLEVLSVQQVIDCSYSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQ 254
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ F+G D L GPL+ ++ QDY G +I+
Sbjct: 255 NGLCRYFSDSHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQH 314
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV++ G+ +P WIVRNSWG WG DGY V+ G N CGI
Sbjct: 315 H--CSSGEANHAVLVTGFDKTGSIPYWIVRNSWGTSWG-IDGYVRVKMGGNVCGI 366
>gi|391341656|ref|XP_003745143.1| PREDICTED: uncharacterized protein LOC100900885 [Metaseiulus
occidentalis]
Length = 1356
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 93/179 (51%), Gaps = 9/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADY-PFRN 58
+E QY +KHG L+ ++ QL++C+ N C GG A Y+K GL ++A Y P+R
Sbjct: 1174 IEGQYFLKHGELVRFAEQQLVDCSWTSGNDACDGGLDYVAYDYIKKYGLSSDAQYGPYR- 1232
Query: 59 QNGVTGRC--AYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKL 115
G+ G+C K + + +G + R+ + GP+ ++ + +
Sbjct: 1233 --GIDGKCKDVEIENKPITTIQRYYNISGVENLRKAIAFVGPISVAIDASRPSLSFYAHG 1290
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+ ++ C S L+HAV+ VGYG+ H P W+++NSW + +DGY + + N CG+ S
Sbjct: 1291 VYEDPDCSSTELDHAVLAVGYGVLHGKPYWLIKNSWSTYWGNDGYILISQKDNMCGVAS 1349
Score = 89.0 bits (219), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 2 LESQYAIKHG--TLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADY-PF 56
LESQY + +G L S+ QL++C + N GC GG A Y+K GL + Y P+
Sbjct: 393 LESQYFLNNGKENLTRFSEQQLVDCSWDFSNTGCSGGSIESAFSYVKEYGLFTDEQYGPY 452
Query: 57 RNQNGVTGRCAYDARKVKVRVSDFLVFN---GSDTFRRMLYHYGPLVAGMNGALLQ-DYN 112
R + G +C + +S FN G + R + GP+ ++ + Y
Sbjct: 453 REEEG---KCRDTVTGTEPTISTLEGFNAIGGKECLRNYIALKGPIAVAIDASSPSFVYY 509
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
+ KN C +LNHAV+ +GYG + P W+++NSWG WG +G+ + + N CG
Sbjct: 510 SHGVYKNPAC-GRDLNHAVLAIGYGELNGEPYWLIKNSWGDIWG-SEGFMLISQENNTCG 567
Query: 172 IE 173
IE
Sbjct: 568 IE 569
>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 96/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + G L LS+ QL++C+ + GC GG N A +Y L+ G+ +E
Sbjct: 168 LEGANYLATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNGGLMNNAFEYILQSGGVVSE 227
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DY + G G C +D KV VS+F V + D L GPL +N A +Q
Sbjct: 228 KDYAY---TGRDGSCKFDKSKVVASVSNFSVVSLDEDQIAANLVKNGPLAVAINAAWMQT 284
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGM-------RHQVPVWIVRNSWGR-WGPDDGYFT 162
Y + +C L+H V+++G+G + P WI++NSWG+ WG ++GY+
Sbjct: 285 YMSG-VSCPYICAKARLDHGVLLLGFGQGGYAPIRLKEKPYWIIKNSWGQNWG-EEGYYK 342
Query: 163 VERGTNACGIES 174
+ RG N CG++S
Sbjct: 343 ICRGRNVCGVDS 354
>gi|348565006|ref|XP_003468295.1| PREDICTED: cathepsin W-like [Cavia porcellus]
Length = 375
Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 95/197 (48%), Gaps = 24/197 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKA-IQYLKHAGLEAEADYPFRNQN 60
+E+ + I++ + LS +L++C GC GG A I L ++GL +E DYPFR
Sbjct: 161 IEAMWNIRYKVSVTLSVQELLDCARCEDGCAGGYIWDAFITVLNYSGLASEKDYPFRGHA 220
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR-KN 119
+ A + RKV ++ R + GP+ +N +LQ Y +I+ +
Sbjct: 221 NIHKCLASNYRKVAWIYDYIMLPRDEQGIARYVATQGPITVIINSKILQHYKKGIIKGTS 280
Query: 120 DVCPSENLNHAVVIVGYGM--------------------RHQVPVWIVRNSWG-RWGPDD 158
C ++H V++VGYG RH +P WI++NSWG WG ++
Sbjct: 281 SKCDPWFVDHYVLLVGYGRSKAEEEKWTETDLSHSNRPPRHSIPYWILKNSWGANWG-EE 339
Query: 159 GYFTVERGTNACGIESY 175
GYF + RG+N CGI Y
Sbjct: 340 GYFRLHRGSNTCGITKY 356
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 96/192 (50%), Gaps = 24/192 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L+ LS+ QL++C+ + GC GG N A +YL + G+ E
Sbjct: 160 LEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDSGCNGGLMNNAFEYLLQSGGVVQE 219
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DY + G G C +D KV VS+F V + + L GPL +N A +Q
Sbjct: 220 KDYAY---TGRDGSCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVAINAAWMQA 276
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDGYFT 162
Y + VC L+H V++VG+G P+ WI++NSWG+ WG + GY+
Sbjct: 277 YMSG-VSCPYVCAKARLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWG-EQGYYK 334
Query: 163 VERGTNACGIES 174
+ RG N CG++S
Sbjct: 335 ICRGRNVCGVDS 346
>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
Length = 365
Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 100/184 (54%), Gaps = 18/184 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ G++ E YP++
Sbjct: 180 LEAAYTQATGKNISLSEQQLVDCAGGFNNFGCSGGLPSQAFEYIKYNGGIDTEESYPYKG 239
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAG---MNGALLQDYNG 113
NGV C Y A V+V D + N D + + P+ +NG + Y
Sbjct: 240 VNGV---CHYKAENAVVQVLDSVNITLNAEDELKNAVGLVRPVSVAFEVING--FRQYKS 294
Query: 114 KLIRKNDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNAC 170
+ +D C + +++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N C
Sbjct: 295 G-VYSSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKNMC 352
Query: 171 GIES 174
+ +
Sbjct: 353 AVAT 356
>gi|321476439|gb|EFX87400.1| hypothetical protein DAPPUDRAFT_312328 [Daphnia pulex]
Length = 330
Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 84/177 (47%), Gaps = 6/177 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
LE K LS+ L++C+ N GC GG + A YLK AG A N
Sbjct: 150 LEFSKCKKAKVTTVLSEQHLVDCDTTNGGCNGGWYVTAWTYLKKAG--GSAKQTLYNYTA 207
Query: 62 VTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
C + + +VS F + N + + L YGPL + + +
Sbjct: 208 KKNTCRFTTAMIAAKVSSFGYVQSNNATAMQLALQQYGPLAVAITVVPSFYSYASGVYDD 267
Query: 120 DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+ C + +NHAVV+VG+G + V WIVRNSWG WG GYF ++RG N CGIE+Y
Sbjct: 268 NACDGQAVNHAVVLVGWGNLNGVDYWIVRNSWGTNWGL-SGYFFMKRGVNKCGIETY 323
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 148 LEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G+C YD + S + L F + + + + GP+ G++ + + K
Sbjct: 208 ---AMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKT 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG W+V+NSWG D GY + R + N CGI +
Sbjct: 265 GVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIAN 324
Query: 175 Y 175
Y
Sbjct: 325 Y 325
>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 358
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 95/181 (52%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 174 LEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPY-- 231
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + A+ + V+V D + D + + P+ + K +
Sbjct: 232 -TGKDGGCKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGV 290
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIE 173
++ C + ++NHAV+ VGYG+ VP W+++NSW G WG D+GYF +E G N CG+
Sbjct: 291 FTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWG-DNGYFKMEMGKNMCGVA 349
Query: 174 S 174
+
Sbjct: 350 T 350
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 95/181 (52%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQ-GCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C YN GC GG ++A +Y+K + GL+ E YP+
Sbjct: 175 LEAAYVQAFGKAIFLSEQQLVDCARAYNNFGCNGGLPSQAFEYIKANGGLDTEEAYPY-- 232
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C + + + V+V D + D + + P+ +
Sbjct: 233 -TGVDGVCKFSSENIGVQVLDSVNITLGAEDELKDAVAFVRPVSVAFEVVSGFRLYKSGV 291
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+D C + ++NHAVV VGYG+ + VP W+++NSWG WG D+GYF +E G N CG+
Sbjct: 292 YTSDTCGNTPMDVNHAVVAVGYGVENDVPYWLIKNSWGADWG-DNGYFKMEMGKNMCGVA 350
Query: 174 S 174
+
Sbjct: 351 T 351
>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
Length = 352
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 95/185 (51%), Gaps = 20/185 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ YA HG + LS+ QL++C N GC GG ++A +Y+K+ G+ E +YP+
Sbjct: 168 LEAAYAQAHGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGIALEKEYPYTA 227
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
++ + + A V VRV D + D + + P+ Q +G +
Sbjct: 228 KDEAS---KFTAENVAVRVLDSVNITLGAEDELKHAVAFARPVSVA-----FQVVDGFRL 279
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNA 169
K V S+ ++NHAV+ VGYG+ + VP WI++NSWG D GYF +E G N
Sbjct: 280 YKEGVYTSDTCGNTPMDVNHAVLAVGYGVENNVPYWIIKNSWGSTWGDHGYFKMELGKNM 339
Query: 170 CGIES 174
CG+ +
Sbjct: 340 CGVAT 344
>gi|268578473|ref|XP_002644219.1| Hypothetical protein CBG17217 [Caenorhabditis briggsae]
Length = 413
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 97/179 (54%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF--RNQ 59
+E+ YAI HG LS+ L++C++ + C GG +KA +Y+ GL D P+ Q
Sbjct: 230 VEAAYAIAHGEKRNLSEQTLLDCDLDDNACDGGDEDKAFRYIHRQGLAYAVDLPYVAHRQ 289
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGAL-LQDYNGKLIRK 118
N + Y+ K+K + + + + D+ L ++GP+ GM+ ++ Y G +
Sbjct: 290 NTCSVDGHYNTTKIK---AAYFLHHDEDSMINWLVNFGPVNIGMSVIQPMRAYKGGVFTP 346
Query: 119 ND-VCPSENLN-HAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
++ C +E + HA++I GYG + WIV+NSWG WG ++GY RG NACGIE
Sbjct: 347 SEYACKNEVIGLHALLITGYGTSEKGEKYWIVKNSWGNTWGVENGYIYFARGINACGIE 405
>gi|395502422|ref|XP_003755580.1| PREDICTED: pro-cathepsin H [Sarcophilus harrisii]
Length = 334
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 92/181 (50%), Gaps = 16/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G LL L++ QL++C + N GC GG ++A +Y+ + G+ E YP+
Sbjct: 149 LESAVAIATGKLLSLAEQQLVDCAQDFNNHGCNGGLPSQAFEYIMYNKGIMGEDTYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNG--K 114
G G C + K V D D + H+ P+ + D+ K
Sbjct: 208 --GKDGTCKFQPNKAIAFVKDVANITAYDEEAMTEAVAHHNPVSFAFE--VTDDFLSYHK 263
Query: 115 LIRKNDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I N C + +NHAV+ VGYG + +P WIV+NSWG WG ++GYF +ERG N CG
Sbjct: 264 GIYSNPKCSKSPDKVNHAVLAVGYGKENGIPYWIVKNSWGTSWG-NNGYFLIERGKNMCG 322
Query: 172 I 172
+
Sbjct: 323 L 323
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 97/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQ-GCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C YN GC GG ++A +Y+K+ GL+ E YP++
Sbjct: 178 LEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYK- 236
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAG---MNGALLQDYNG 113
GV G C Y V+V D + N D + + P+ +NG + Y
Sbjct: 237 --GVNGVCHYKPENAAVQVLDSVNITLNAEDELQNAVGLVRPVSVAFEVING--FRQYKS 292
Query: 114 KLIRKNDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACG 171
+ +D C + +++NHAV+ VGYG+ + P W+++NSWG D GYF +ERG N C
Sbjct: 293 G-VYTSDHCGTTPDDVNHAVLAVGYGVENGTPYWLIKNSWGESWGDKGYFKMERGKNMCA 351
Query: 172 IES 174
+ +
Sbjct: 352 VAT 354
>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1454
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 97/181 (53%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E + +K L S+ +L++C+ + C GG + A + + K GLE E++YP+ +
Sbjct: 1267 IEGLHQVKTKKLEEYSEQELLDCDTVDSACNGGFMDDAYKAIEKIGGLELESEYPYLAKK 1326
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
T C ++ VRV + ++T + L GP+ G+N +Q Y G +
Sbjct: 1327 QKT--CHFNKTMAHVRVKGAVDLPKNETAIAQFLVANGPVSIGLNANAMQFYRGGISHPW 1384
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+C +NL+H V+IVGYG++ +P WIV+NSWG +WG + GY+ V RG N CG
Sbjct: 1385 KPLCSKKNLDHGVLIVGYGVKEYPMFNKTLPYWIVKNSWGPKWG-EQGYYRVFRGDNTCG 1443
Query: 172 I 172
+
Sbjct: 1444 V 1444
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 95/181 (52%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y K G + LS+ QL++C N GC GG ++A +Y+K + GLE E YP+
Sbjct: 174 LEAAYTQKFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLETEEAYPYTG 233
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGK 114
+NG+ C + ++ V V+V+D + D + + P+ Q +G
Sbjct: 234 KNGL---CKFSSQNVGVKVTDSVNITLGAEDELKYAVALVRPVSVAFEVVKGFKQYKSGV 290
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
++NHAV+ VGYG+ + VP W+++NSWG WG D+ YF +E G + CGI
Sbjct: 291 YTSTECGTTPMDVNHAVLAVGYGVEYGVPFWLIKNSWGADWG-DNAYFKMEMGNDMCGIA 349
Query: 174 S 174
+
Sbjct: 350 T 350
>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
Group]
gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
Length = 373
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 99/198 (50%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + G + LS+ Q+++C+ + GC GG A YL K GLE+E
Sbjct: 172 LEGANYLATGKMDVLSEQQMVDCDHECDSSEPDSCDAGCNGGLMTNAFSYLLKSGGLESE 231
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G C +D K+ V +F V + D L +GPL G+N A +Q
Sbjct: 232 KDYPYTGRDGT---CKFDKSKIVTSVQNFSVVSVDEDQIAANLVKHGPLAIGINAAYMQT 288
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG P+ WI++NSWG WG + G
Sbjct: 289 YIGGV-----SCPYICGRHLDHGVLLVGYGASGFAPIRLKDKAYWIIKNSWGENWG-EHG 342
Query: 160 YFTVERGTNA---CGIES 174
Y+ + RG+N CG++S
Sbjct: 343 YYKICRGSNVRNKCGVDS 360
>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 596
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 89/163 (54%), Gaps = 7/163 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ +KH L+ LS+ +L++C+ + GC GG + A + + K GLE E DYP+
Sbjct: 288 VEGQWFLKHKKLISLSEQELVDCDTLDSGCGGGLPSNAYKSIEKLGGLEPEKDYPYV--- 344
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G +CA KV V++ + + L GP+ G+N L+Q Y G +
Sbjct: 345 GEGEKCAIKQSDFKVFVNNSVALPKDEVKLAAWLAQNGPISIGINANLMQFYWGGISHPW 404
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGY 160
+ C ++L+H V+IVGYG + P WI++NSWG WG ++ Y
Sbjct: 405 KIFCNPKSLDHGVLIVGYGTENGTPFWIIKNSWGPDWGEEEEY 447
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 25/36 (69%), Gaps = 2/36 (5%)
Query: 138 MRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ + P WI++NSWG WG ++GY+ + RG +CG+
Sbjct: 552 LENGTPFWIIKNSWGPDWG-EEGYYRIYRGDGSCGL 586
Score = 36.2 bits (82), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 13/33 (39%), Positives = 23/33 (69%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG 34
+E Q+ +KH L+ LS+ +L++C+ + GC GG
Sbjct: 508 VEGQWFLKHKKLISLSEQELVDCDTLDSGCGGG 540
>gi|535600|gb|AAA29137.1| cathepsin [Fasciola hepatica]
Length = 326
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 92/184 (50%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+R
Sbjct: 141 MEGQYMKNEKTSISFSEQQLVDCSGPFGNYGCNGGLMENAYEYLKRFGLETESSYPYR-- 198
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMN--GALLQDYNGKL 115
V G+C Y+ + +V+ + + D + ++ P ++ + +G
Sbjct: 199 -AVEGQCRYNEQLGVAKVTGYYTVHSGDEVELQNLVGCRRPAAVALDVESDFMMYRSG-- 255
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYF-TVERGTNACGIES 174
I ++ C + LNH V+ VGYG++ WIV+NSWG W +DGY V + N CGI S
Sbjct: 256 IYQSQTCSPDRLNHGVLAVGYGIQDGTDYWIVKNSWGTWWGEDGYIRMVRKRGNMCGIAS 315
Query: 175 YGGI 178
+
Sbjct: 316 LASV 319
>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
Length = 327
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 98/176 (55%), Gaps = 6/176 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K LL LS+ QL++C+ ++GC GG +A Q L GL+ ++DYP+
Sbjct: 147 IEGQWFRKTDNLLQLSEQQLLDCDEVDEGCNGGTPQQAFKQILGMGGLQLDSDYPYE--- 203
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYN-GKLIRK 118
G G+C KVKV ++ + + + +ML GPL + +N LQ Y G L
Sbjct: 204 GREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPL 263
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+C +++LNHAV+ VGYG ++P W V+NSW ++GYF + RG CGI +
Sbjct: 264 PALCDAQSLNHAVLTVGYGKEGRLPYWTVKNSWSTMFGENGYFRIYRGDGTCGINT 319
>gi|260821804|ref|XP_002606293.1| hypothetical protein BRAFLDRAFT_57270 [Branchiostoma floridae]
gi|229291634|gb|EEN62303.1| hypothetical protein BRAFLDRAFT_57270 [Branchiostoma floridae]
Length = 246
Score = 96.7 bits (239), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 97/185 (52%), Gaps = 23/185 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
LES AI G + LS+ QL+ C N GC+GG ++A +Y+K A G+E+E DYP+
Sbjct: 58 LESVTAITFGAPMNLSEQQLVSCAQGFNNHGCEGGLPSQAWEYVKWAQGIESEKDYPYTA 117
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPL---------VAGMNGALLQ 109
++G +C ++ K V D + D +L G L VA
Sbjct: 118 KDG---KCMFNTNKTIAYVRDVVNITQGDE-DEILQAVGTLNPVSIAYQVVADFKLYKKG 173
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQV-PVWIVRNSWG-RWGPDDGYFTVERGT 167
Y+ KL ++ E++NHAV++VGYG V P WIV+NSWG WG DGYF +ER
Sbjct: 174 VYSSKLCHRD----QEHVNHAVLVVGYGEDESVIPYWIVKNSWGPSWGM-DGYFLIERNQ 228
Query: 168 NACGI 172
N CG+
Sbjct: 229 NMCGL 233
>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
Length = 358
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 93/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GLE E YP+
Sbjct: 174 LEAAYHQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLETEEAYPYTG 233
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
++G C + + V ++V D + D + + P+ + +
Sbjct: 234 EDGA---CKFSSENVGIQVLDSVNITLGAEDELKEAVGLVRPVSVAFEVVSGFRFYKSGV 290
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+D C S ++NHAV+ VGYG+ VP W+V+NSWG WG D GYF +E G N CG+
Sbjct: 291 YTSDTCGSTPMDVNHAVLAVGYGVEDGVPYWLVKNSWGENWG-DHGYFKMEMGKNMCGVA 349
Query: 174 S 174
+
Sbjct: 350 T 350
>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
Length = 316
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 98/176 (55%), Gaps = 6/176 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K LL LS+ QL++C+ ++GC GG +A Q L GL+ ++DYP+
Sbjct: 136 IEGQWFRKTDNLLQLSEQQLLDCDEVDEGCNGGTPQQAFKQILGMGGLQLDSDYPYE--- 192
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYN-GKLIRK 118
G G+C KVKV ++ + + + +ML GPL + +N LQ Y G L
Sbjct: 193 GREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPL 252
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+C +++LNHAV+ VGYG ++P W V+NSW ++GYF + RG CGI +
Sbjct: 253 PALCDAQSLNHAVLTVGYGKEGRLPYWTVKNSWSTMFGENGYFRIYRGDGTCGINT 308
>gi|332326581|gb|AEE42614.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 90/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWAVAGHRLTALSEQQLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMXTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C ++ V R+ ++ S+T L GP+ ++ + Y ++
Sbjct: 219 STGDVPACTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYXSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CAGKXLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTEY 334
>gi|321452486|gb|EFX63859.1| hypothetical protein DAPPUDRAFT_306050 [Daphnia pulex]
Length = 222
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
LE K+G+LL LS+ QL++C Y+ GC GG + A YL++ A
Sbjct: 39 LEFARCKKYGSLLALSEQQLVDCEPYDYGCGGGWYTNAWYYLQNVA-GGSAKQSLYTYTA 97
Query: 62 VTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDY--NGKLIR 117
T C + + + V++S + L + + + YGP+ + A++ + +
Sbjct: 98 TTNTCKFTSSMIGVKISSYTNLATLNAANMQLAVQTYGPISVAI--AVVNSFFSYASGVF 155
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
+ C + +NHAVVIVG+G+ +P WIVRNSWG WG GY ++RG N C IE Y
Sbjct: 156 TDTTCDNVGVNHAVVIVGWGVTTTGIPYWIVRNSWGTGWG-QAGYILIQRGVNKCSIEQY 214
>gi|165969032|ref|YP_001650932.1| peptidase [Orgyia leucostigma NPV]
gi|164663528|gb|ABY65748.1| peptidase [Orgyia leucostigma NPV]
Length = 328
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 101/178 (56%), Gaps = 14/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+ESQYAI++ + LS+ QLI+C+ + GC GG + A Q ++ G++ E +YP+
Sbjct: 147 IESQYAIRNDRHINLSEQQLIDCDYVDMGCYGGLLHTAFEQMIQMGGVKQEHEYPYA--- 203
Query: 61 GVTGRCAY-----DARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
GV +C D+ V+++ V + + +L GP+ ++ + + +Y +
Sbjct: 204 GVNKQCELNDITDDSFVVRIKGCYRYVVVREEKLKDLLRAVGPIPIAIDASGIVNYYKGV 263
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I + C + LNHAV++VGYG+ + VP W +N+WG WG ++GYF + + NACG+
Sbjct: 264 I---NYCENYGLNHAVLLVGYGVDNGVPYWTFKNTWGVDWG-ENGYFRLRQNINACGM 317
>gi|307141900|gb|ADN34745.1| putative cysteine peptidase [Echinococcus granulosus]
Length = 218
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 98/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
LE Q K G L+ LS+ QL++C ++ N+GC GG N A +Y G E+E+DYP+
Sbjct: 40 LEGQLKRKKGKLISLSEQQLVDCSTDMGNEGCNGGYMNDAFRYWMQNGAESESDYPY--- 96
Query: 60 NGVTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNGAL--LQDYNGKL 115
+ G+C +++ KV +VS F+ D + + GP+ ++ A Y K
Sbjct: 97 TAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSVAIDAASSGFMLYK-KG 155
Query: 116 IRKNDVCPSENLNHAVVIVGY--GMRHQVPVWIVRNSWGR-WGPDDGYFTVERGT-NACG 171
I +++ C + L+HAV++VGY M Q WIV+NSWG WG GY + R N CG
Sbjct: 156 IYQDNTCSQQYLDHAVLVVGYDADMAGQ-KYWIVKNSWGEDWG-QRGYIWMARDKGNMCG 213
Query: 172 IES 174
I +
Sbjct: 214 IAT 216
>gi|26245863|gb|AAN77407.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 196
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 93/178 (52%), Gaps = 11/178 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGF-NKAIQYLKHAGLEAEADYPFRN 58
LE Q AI + PLS+ QL++C+ N C GG +A Y+ G+EAE+ YP+
Sbjct: 17 LEGQNAIHNKVKTPLSEQQLLDCSASYGNGDCDDGGLMTEAFDYIIDNGIEAESSYPYVE 76
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
Q C YDA+K V++ + + D ++ + GP+ GM+ L Y G ++
Sbjct: 77 Q---MTECQYDAKKTIVQIKGYKKLLADEDELKKAVGTVGPISVGMSSENLHMYGGGVL- 132
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
+D C ++HAV++VG G + W V+NSWG +DGYF +ER N C I S
Sbjct: 133 -DDQCYF-GMDHAVLVVGCGEANGKKFWKVKNSWGTTWGEDGYFRIERDADNLCDIAS 188
>gi|218185|dbj|BAA14404.1| oryzain gamma precursor [Oryza sativa Japonica Group]
Length = 362
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 66/186 (35%), Positives = 97/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE++Y G + LS+ QL +C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 178 LEARYTQATGPPVSLSEQQLADCATRYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPY-- 235
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C Y V+V D + D + + P+ Q NG +
Sbjct: 236 -TGVNGICHYKPENAGVKVLDSVNITLVAEDELKNAVGLVRPV-----SVAFQVINGFRM 289
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S+ ++NHAV+ VGYG+ + VP W+++NSWG WG D+GYFT+E G N
Sbjct: 290 YKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFTMEMGKN 348
Query: 169 ACGIES 174
CGI +
Sbjct: 349 MCGIAT 354
>gi|326926970|ref|XP_003209669.1| PREDICTED: cathepsin H-like [Meleagris gallopavo]
Length = 323
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 89/180 (49%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G LL L++ QL++C N GC GG ++A +Y L + GL E YP+R
Sbjct: 138 LESAIAIATGKLLSLAEQQLVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRA 197
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
QNG C + K V D + D + + + P+ + K +
Sbjct: 198 QNGT---CKFQPDKAVAFVRDVINITQYDEASMVEAVGKHNPVSFAFEVTNDFMHYRKGV 254
Query: 117 RKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
N C + +NHAV+ VGYG +P WIV+NSWG DGYF +ERG N CG+ +
Sbjct: 255 YSNPRCEHTPDKVNHAVLAVGYGEEDGLPYWIVKNSWGSLWGMDGYFLIERGKNMCGLAA 314
>gi|56755191|gb|AAW25775.1| SJCHGC00511 protein [Schistosoma japonicum]
Length = 454
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 95/178 (53%), Gaps = 9/178 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+ K G LL LS+ QL++C+ + GC GG + A + ++ GL E +YP+ +N
Sbjct: 273 IESQWFRKTGKLLSLSEQQLVDCDSLDDGCNGGLPSNAYESIIRMGGLMLEDNYPYDAKN 332
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C V ++ + ++ LYH+ + GMN LLQ Y +
Sbjct: 333 E---KCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHPW 389
Query: 120 DV-CPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C L+HAV++VGYG+ + P WIV+NSWG WG + GYF + RG CGI +
Sbjct: 390 WIFCSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWG-EKGYFRMYRGDGTCGINT 446
>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
Length = 364
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 98/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
LE + + G L+ LS+ QL++C+ + GC GG A Y K AG L E
Sbjct: 165 LEGAHFLATGELVSLSEQQLVDCDHECDPDLNDACDSGCNGGLMTTAFGYTKKAGGLVRE 224
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DY + ++ G C +D K+ VS+F V + D L GPL G+N +Q
Sbjct: 225 EDYLYTGRD--RGPCKFDKSKIAASVSNFSVVSLDEDQIAANLVKNGPLSVGINAVYMQT 282
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP ++L+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 283 YIGGV-----SCPFICGKHLDHGVLLVGYGAGGYAPIRFKEKPYWIIKNSWGENWG-ENG 336
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 337 YYKICRGPNMCGVDS 351
>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
Length = 358
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 97/195 (49%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG---------FNKAIQY-LKHAGLEAE 51
LE + + G L+ LS+ QL++C+ + G N A +Y L + G+ E
Sbjct: 161 LEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCGSGCNGGLMNSAFEYILNNGGVMRE 220
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ NG G C +D K+ V++F +V D L GPL +N +Q
Sbjct: 221 EDYPYSGTNG--GTCKFDKAKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQT 278
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP S+ LNH V++VGYG Q P WI++NSWG WG ++G
Sbjct: 279 YVGGV-----SCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWG-ENG 332
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 333 YYKICRGRNICGVDS 347
>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
Length = 327
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 98/176 (55%), Gaps = 6/176 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K LL LS+ QL++C+ ++GC GG +A Q L GL+ ++DYP+
Sbjct: 147 IEGQWFRKTDNLLQLSEQQLLDCDGVDEGCNGGTPQQAFKQILGMGGLQLDSDYPYE--- 203
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYN-GKLIRK 118
G G+C KVKV ++ + + + +ML GPL + +N LQ Y G L
Sbjct: 204 GREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPL 263
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+C +++LNHAV+ VGYG ++P W V+NSW ++GYF + RG CGI +
Sbjct: 264 PALCDAQSLNHAVLTVGYGKEGRLPYWTVKNSWSTMFGENGYFRIYRGDGTCGINT 319
>gi|340370388|ref|XP_003383728.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 398
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 94/180 (52%), Gaps = 12/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LE Q+ I G L+ LS+ QL++C++ N GC GG + A +Y++ AG E+E DYP+ +N
Sbjct: 216 LEGQHFINTGNLVSLSEQQLVDCSLKNDGCNGGMLSTAFKYIESVAGEESETDYPYTAKN 275
Query: 61 GVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKLI 116
G C YD K +V+ + L D+ + GP+ ++ + Q Y+ +
Sbjct: 276 GT---CQYDPSKAVAKVTGYTALPSGDEDSLNDAVTSKGPISVCIDASHKSFQLYSEGVY 332
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGIES 174
+ C L+H V++VGYG W+V+NSWG WG GY + R N CGI +
Sbjct: 333 YEKS-CSYFLLDHCVLVVGYGTEDTADYWLVKNSWGTSWGM-KGYIRMSRNRKNNCGIAT 390
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 10/63 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQN 60
+ESQY IK GTL+PLS Q+++C NI N G +N G++A +R +
Sbjct: 143 VESQYFIKTGTLVPLSVQQILDCANITNLDEAGAIYN---------GVKAPDTVDWREKG 193
Query: 61 GVT 63
VT
Sbjct: 194 AVT 196
>gi|2582055|gb|AAB82455.1| lymphopain [Mus musculus]
Length = 371
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 94/195 (48%), Gaps = 23/195 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+++ + IKH + +S +L++C GC GG ++ + L ++GL +E DYPF+
Sbjct: 160 IQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLNNSGLASEKDYPFQGDR 219
Query: 61 GVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
RC K + DF ++ N L +GP+ +N LLQ Y +I+
Sbjct: 220 K-PHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKAT 278
Query: 120 -DVCPSENLNHAVVIVGYGM-----------------RHQVPVWIVRNSWG-RWGPDDGY 160
C ++H+V++VG+G RH P WI++NSWG WG + GY
Sbjct: 279 PSSCDPRQVDHSVLLVGFGKKKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWG-EKGY 337
Query: 161 FTVERGTNACGIESY 175
F + RG N CG+ Y
Sbjct: 338 FRLYRGNNTCGVTKY 352
>gi|31981819|ref|NP_034115.2| cathepsin W preproprotein [Mus musculus]
gi|341940311|sp|P56203.2|CATW_MOUSE RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
Precursor
gi|26353368|dbj|BAC40314.1| unnamed protein product [Mus musculus]
gi|44890089|gb|AAS48498.1| cathepsin W precursor [Mus musculus]
gi|148701190|gb|EDL33137.1| cathepsin W, isoform CRA_b [Mus musculus]
gi|162317774|gb|AAI56226.1| Cathepsin W [synthetic construct]
gi|162318342|gb|AAI56999.1| Cathepsin W [synthetic construct]
Length = 371
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 94/195 (48%), Gaps = 23/195 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+++ + IKH + +S +L++C GC GG ++ + L ++GL +E DYPF+
Sbjct: 160 IQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLNNSGLASEKDYPFQGDR 219
Query: 61 GVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
RC K + DF ++ N L +GP+ +N LLQ Y +I+
Sbjct: 220 K-PHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKAT 278
Query: 120 -DVCPSENLNHAVVIVGYGM-----------------RHQVPVWIVRNSWG-RWGPDDGY 160
C ++H+V++VG+G RH P WI++NSWG WG + GY
Sbjct: 279 PSSCDPRQVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWG-EKGY 337
Query: 161 FTVERGTNACGIESY 175
F + RG N CG+ Y
Sbjct: 338 FRLYRGNNTCGVTKY 352
>gi|6649595|gb|AAF21471.1|U85984_1 cysteine proteinase [Clonorchis sinensis]
Length = 217
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 98/176 (55%), Gaps = 6/176 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K LL LS+ QL++C+ ++GC GG +A Q L GL+ ++DYP+
Sbjct: 37 IEGQWFRKTDNLLQLSEQQLLDCDGVDEGCNGGTPQQAFKQILGMGGLQLDSDYPYE--- 93
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYN-GKLIRK 118
G G+C KVKV ++ + + + +ML GPL + +N LQ Y G L
Sbjct: 94 GREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPL 153
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+C +++LNHAV+ VGYG ++P W V+NSW ++GYF + RG CGI +
Sbjct: 154 PALCDAQSLNHAVLTVGYGKEGRLPYWTVKNSWSTMFGENGYFRIYRGDGTCGINT 209
>gi|108735858|gb|ABG00260.1| cathepsin L1 [Fasciola hepatica]
Length = 219
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 88/182 (48%), Gaps = 8/182 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
++ QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+
Sbjct: 34 MKGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYEYLKQFGLETESSYPY--- 90
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+ V G C YD + +V+ + + D + ++ GP ++ L I
Sbjct: 91 SAVEGPCRYDRKLGVAKVTGYYTVHSGDEVELQNLVGGEGPPAVALDAELDFMMYRSGIY 150
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESYG 176
+ C + L+H V+ VGYG + WIV+NSWG W +DGY + R N CGI S
Sbjct: 151 XSQTCSPDRLSHGVLAVGYGTQDGTDYWIVKNSWGTWWGEDGYIRMVRNRGNMCGIASLA 210
Query: 177 GI 178
+
Sbjct: 211 SV 212
>gi|319891283|gb|ADV74826.1| cathepsin [Agraulis vanillae MNPV]
Length = 168
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 90/161 (55%), Gaps = 10/161 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
LESQ+AIK+ L+ LS+ QLI+C+ + GC+GG + A + ++ G++ E DYP+ +N
Sbjct: 14 LESQFAIKYNRLINLSEQQLIDCDSVDAGCEGGLLHTAYEAIMEMGGVQVEHDYPYERRN 73
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G C D K V V + + + +L GPL ++ + + +Y +IR
Sbjct: 74 ---GDCRVDTAKFVVNVKKCYRYITVLEEKLKDLLRIVGPLPVAIDASDIVNYKRGIIR- 129
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDD 158
C + LNHAV++VGY + VP I++N+WG WG D+
Sbjct: 130 --YCSNHGLNHAVLLVGYAVEDGVPYRILKNTWGTDWGEDN 168
>gi|119640003|gb|ABL85443.1| cathepsin L [Kudoa thyrsites]
Length = 300
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 88/167 (52%), Gaps = 8/167 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES YAIK G L+ S+ QL++C+ N GC GG A Y+ + G+ DYP+ + G
Sbjct: 135 IESAYAIKTGELVNFSEQQLVDCSTENHGCNGGLPEIAFLYVINNGIMKLKDYPYTAKQG 194
Query: 62 VTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKLIRK 118
C Y V VR+S F V N ++ + + GP G+N A Q Y G I
Sbjct: 195 T---CQYSPEDV-VRISSFKCVKNNEESVMESVANNGPNSIGINAASRSFQFYGGG-IYF 249
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER 165
+ S L+HAV++VGYG ++ W V+NSWG W D GY ++R
Sbjct: 250 DPWASSYPLDHAVLLVGYGYKNTENYWHVKNSWGPWWGDQGYINIKR 296
>gi|41055337|ref|NP_956720.1| cathepsin S, a [Danio rerio]
gi|32451845|gb|AAH54668.1| Cathepsin S, a [Danio rerio]
Length = 239
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 97/181 (53%), Gaps = 9/181 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LE+Q + L+PLS L++C++ N+GC+GG ++A Y +++ G+++ YP+ +
Sbjct: 57 LEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFYPYEH 116
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+ GV C Y + F + + + + + GP+ G+N LL + +
Sbjct: 117 KEGV---CRYSVSGRAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSG 173
Query: 117 RKNDV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESY 175
ND C S +NHAV++VGYG + W+V+NSWG ++GY + R N CGI S+
Sbjct: 174 IYNDPKCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAWGENGYIRMARNKNMCGISSF 233
Query: 176 G 176
G
Sbjct: 234 G 234
>gi|21218381|gb|AAM44058.1|AF510740_1 cathepsin L1 [Schistosoma japonicum]
Length = 317
Score = 95.9 bits (237), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 95/178 (53%), Gaps = 9/178 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+ K G LL LS+ QL++C+ + GC GG + A + ++ GL E +YP+ +N
Sbjct: 136 IESQWFRKTGKLLSLSEQQLVDCDSLDDGCNGGLPSNAYESIIRMGGLMLEDNYPYDAKN 195
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C V ++ + ++ LYH+ + GMN LLQ Y +
Sbjct: 196 E---KCHLKVGNVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHPW 252
Query: 120 DV-CPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C L+HAV++VGYG+ + P WIV+NSWG WG + GYF + RG CGI +
Sbjct: 253 WIFCSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWG-EKGYFRMYRGDGTCGINT 309
>gi|321477694|gb|EFX88652.1| hypothetical protein DAPPUDRAFT_304724 [Daphnia pulex]
Length = 336
Score = 95.9 bits (237), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 92/179 (51%), Gaps = 13/179 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q +K GTL+ LS+ LI+C+ N GC GG ++ Y+K GL E YP++ +
Sbjct: 152 IEYQRCMKTGTLVTLSEENLIDCSQKYGNAGCNGGLALRSWNYVKDVGLNTEEAYPYQGE 211
Query: 60 NGVTGRCAYDARKVKVRVSDFLV---FNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+ C Y A V+ + N + + ++ YGP+ ++ + D+ I
Sbjct: 212 ETM---CEYSASNYGGNVTTWAYATRTNDEEAIKVVVAKYGPVAVSVDASNW-DFYSSGI 267
Query: 117 RKNDVCPSENLNHAVVIVGYG--MRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C + NHAVVIVGYG + + WIVRNSWG WG + GY +ERG N C I
Sbjct: 268 FSSPTCSNTTTNHAVVIVGYGKDTKTRKDFWIVRNSWGPEWG-EGGYINLERGVNMCAI 325
>gi|42744610|gb|AAH66625.1| Ctssa protein [Danio rerio]
Length = 321
Score = 95.9 bits (237), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 97/181 (53%), Gaps = 9/181 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LE+Q + L+PLS L++C++ N+GC+GG ++A Y +++ G+++ YP+ +
Sbjct: 139 LEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFYPYEH 198
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+ GV C Y + F + + + + + GP+ G+N LL + +
Sbjct: 199 KEGV---CRYSVSGRAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSG 255
Query: 117 RKNDV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESY 175
ND C S +NHAV++VGYG + W+V+NSWG ++GY + R N CGI S+
Sbjct: 256 IYNDPKCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAWGENGYIRMARNKNMCGISSF 315
Query: 176 G 176
G
Sbjct: 316 G 316
>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
Length = 459
Score = 95.9 bits (237), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 96/175 (54%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ + L+ LS+ +L++C+ + GC+GG ++A + ++ GLE E+ YP+ +
Sbjct: 279 IEGQWFLAKKKLVSLSEQELVDCDKVDDGCEGGLPSQAYKEIMRMGGLETESAYPY---D 335
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + + V ++D + + ++ + L GP+ G+N LQ Y +
Sbjct: 336 GRGEECHINRTEFAVYINDSVELPHDEESMKAWLVKKGPISIGINANPLQFYRHGISHPW 395
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
C LNH V++VGYG P WI++NSWG +WG ++GY+ + RG N CG+
Sbjct: 396 KFFCEPYMLNHGVLLVGYGSEKNKPYWIIKNSWGPKWG-ENGYYRLYRGKNVCGV 449
>gi|321449362|gb|EFX61852.1| hypothetical protein DAPPUDRAFT_68588 [Daphnia pulex]
Length = 198
Score = 95.9 bits (237), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 90/181 (49%), Gaps = 11/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH--AGLEAEADYPFRNQ 59
LE KHG L +S+ QL++C Y+ GC GG + A YL++ G + YP+
Sbjct: 14 LEFARCKKHGALRAISEQQLVDCEPYDYGCGGGWYTNAWYYLQYEAGGAAKRSLYPYTAT 73
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGM--NGALLQDYNGKL 115
+ CA+ + + ++S + D + +L YGP+ + + +G
Sbjct: 74 DNT---CAFSSSMIGAKISSYGDLPSFDAAYMQSVLQDYGPISVAIAVTDSFFSYASGVY 130
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
P+ +NHAVV+VG+G + + WIVRNSWG +WG GY +ERG N C IE
Sbjct: 131 TDVECDDPNAYVNHAVVVVGWGTDNGIDYWIVRNSWGTKWG-SAGYILMERGVNKCKIEK 189
Query: 175 Y 175
Y
Sbjct: 190 Y 190
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 100/181 (55%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ K G+++ LS+ L++C + N GC+GG + A +Y++ + G++ E YP+
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPY-- 209
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGK 114
NG G C + V S F+ + GS+T ++ + GP+ ++ + Q Y+
Sbjct: 210 -NGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIE 173
+ + + C SE+L+H V++VGYG + W+V+NSWG D+GY + R N CGI
Sbjct: 269 VYDEPE-CDSESLDHGVLVVGYGTLNGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCGIA 327
Query: 174 S 174
S
Sbjct: 328 S 328
>gi|1185457|gb|AAA87848.1| cathepsin L, partial [Schistosoma japonicum]
Length = 224
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 95/178 (53%), Gaps = 9/178 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+ K G LL LS+ QL++C+ + GC GG + A + ++ GL E +YP+ +N
Sbjct: 43 IESQWFRKTGKLLSLSEQQLVDCDSLDDGCNGGLPSNAYESIIRMGGLMLEDNYPYDAKN 102
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C V ++ + ++ LYH+ + GMN LLQ Y +
Sbjct: 103 E---KCHLKVGNVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHPW 159
Query: 120 DV-CPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C L+HAV++VGYG+ + P WIV+NSWG WG + GYF + RG CGI +
Sbjct: 160 WIFCSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWG-EKGYFRMYRGDGTCGINT 216
>gi|108755401|emb|CAI77919.1| cathepsin H [Guillardia theta]
gi|122890320|emb|CAJ73711.1| Cathepsin H [Guillardia theta]
Length = 353
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 102/187 (54%), Gaps = 17/187 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES +AIK G ++ LS+ QL++C + N GC GG ++A +Y+ + GL +YP+
Sbjct: 156 LESLHAIKTGEMVLLSEQQLVDCAADFKNNGCNGGLPSQAFEYIMYNGGLSKMEEYPYVC 215
Query: 59 QNG----VTGRCAYDAR----KVKVRVSDFLVFNGSD--TFRRMLYHYGPL-VAGMNGAL 107
+G G CA+D V +VS F D + + ++ + P+ VA A
Sbjct: 216 GDGHCNVTGGPCAFDPVGKPWSVGAKVSKVANFTPGDEISMKTVVGSHNPISVAFEVVAD 275
Query: 108 LQDYN-GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVER 165
L+ Y+ G V + +NHAV+ VGYG +P W ++NSWG WG D+GYF ++R
Sbjct: 276 LRHYSSGVYSSPTCVGTPDKVNHAVLAVGYGTEGGIPYWTIKNSWGFAWG-DNGYFKIQR 334
Query: 166 GTNACGI 172
G+N CGI
Sbjct: 335 GSNKCGI 341
>gi|7271891|gb|AAF44676.1|AF239265_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 92/184 (50%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C + N GC GG A +YLK GLE E+ YP+R
Sbjct: 141 MEGQYMKNQRTSISFSEQQLVDCSDDFGNFGCNGGLMENACEYLKRFGLETESSYPYR-- 198
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMN--GALLQDYNGKL 115
V G C Y+ + +V+ + + + D + ++ GP ++ + +G
Sbjct: 199 -AVEGPCRYNKQLGVAKVTGYYMVHSGDEVELQNLVGIEGPAAVALDVDSDFMMYRSG-- 255
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
I ++ C E LNH V+ VGYG + WIV+NSWG W ++GY + R N CGI S
Sbjct: 256 IYQSQTCSPEFLNHGVLAVGYGTQSGTDYWIVKNSWGPWWGENGYIRMVRNRGNMCGIAS 315
Query: 175 YGGI 178
+
Sbjct: 316 LASV 319
>gi|226468424|emb|CAX69889.1| Temporarily Assigned Gene name [Schistosoma japonicum]
Length = 454
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 95/178 (53%), Gaps = 9/178 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+ESQ+ K G LL LS+ QL++C+ + GC GG + A + ++ GL E +YP+ +N
Sbjct: 273 IESQWFRKTGKLLSLSEQQLVDCDNLDDGCNGGLPSNAYESIIRMGGLMLEDNYPYDAKN 332
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+C V ++ + ++ LYH+ + GMN LLQ Y +
Sbjct: 333 E---KCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHPW 389
Query: 120 DV-CPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C L+HAV++VGYG+ + P WIV+NSWG WG + GYF + RG CGI +
Sbjct: 390 WIFCSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWG-EKGYFRMYRGDGTCGINT 446
>gi|344275472|ref|XP_003409536.1| PREDICTED: cathepsin S-like isoform 2 [Loxodonta africana]
Length = 281
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---IYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 98 LEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYIIDNNGIDSEASYPYK 157
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
G+C YD + S + L + D + + + GP+ G++ + + K
Sbjct: 158 ---ATDGKCQYDPKNRAATCSKYTELPYGSEDALKEAVANKGPVSVGIDASRPSFFLYKS 214
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG + GY + R + N CGI S
Sbjct: 215 GVYYDPSCTDNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIAS 274
Query: 175 Y 175
+
Sbjct: 275 F 275
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 100/183 (54%), Gaps = 15/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFR 57
LE Q+ +K+G L+ L++ QL++C YNQGC GG N+A +Y+K + G++ E+ YP+
Sbjct: 140 LEGQHFLKYGELVSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGIDTESSYPYE 199
Query: 58 NQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFR-RMLYHYGPLVAGMNGA--LLQDYNG 113
++ C +++ V S F+ + GS++ R + GP+ ++ A Q Y+
Sbjct: 200 ARDNT---CRFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPISVAIDAAHRSFQSYSS 256
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACG 171
+ + C S L+HAV+ VGYG W+V+NSWG WG GY + R N CG
Sbjct: 257 GVYYEPS-CSSSQLDHAVLAVGYGSEGGQDFWLVKNSWGTSWG-SAGYINMARNRNNNCG 314
Query: 172 IES 174
I +
Sbjct: 315 IAT 317
>gi|348505824|ref|XP_003440460.1| PREDICTED: pro-cathepsin H-like [Oreochromis niloticus]
Length = 324
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 89/179 (49%), Gaps = 12/179 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G L+PLS+ QL++C + N GC GG ++A +Y+ + GL E DYP+
Sbjct: 141 LESVTAINKGKLVPLSEQQLVDCAQDFNNHGCNGGLPSQAFEYIMYNKGLMTEQDYPY-- 198
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM----LYHYGPLVAGMNGALLQDYNGK 114
G+C Y K V+ + + + ++ + + + G
Sbjct: 199 -TAFEGKCVYKPGKAAAFVNSVVNITAYNELEMVDAVGTHNPVSFAFEVTSDFMSYHQGV 257
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
++ +NHAV+ VGYG + P WIV+NSWG WG +GYF +ERG N CG+
Sbjct: 258 YTSTECHNTTDKVNHAVLAVGYGQENGTPYWIVKNSWGSSWGM-NGYFLIERGKNMCGL 315
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 98/181 (54%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K G L+ LS+ L++C+ N GC+GG + A QY+K + G++ E YP+
Sbjct: 150 LEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEA 209
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
++G C + + V + F+ + GS D ++ + GP+ ++ + Q Y+
Sbjct: 210 EDG---ECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEG 266
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+ + + C SE L+H V++VGYG+ W+V+NSW D+GY + R N CGI
Sbjct: 267 VYDETE-CSSEQLDHGVLVVGYGVEDGKKYWLVKNSWAESWGDNGYIKMSRDKDNQCGIA 325
Query: 174 S 174
S
Sbjct: 326 S 326
>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
Length = 283
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 96/179 (53%), Gaps = 12/179 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E Q+AI L+ LS+ +L++C+ + GC+GG N + ++ GLE+E YP+ ++
Sbjct: 99 IEGQWAIHRNKLVSLSEQELVDCDKLDDGCEGGLPVNAYEEIIRLGGLESEKKYPYDAED 158
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
+C + V V ++ + + ++ LY GP+ G+N +Q Y G +
Sbjct: 159 E---KCKFTVGDVAVYINSSVNISSNEADMAAWLYKNGPISIGINAFAMQFYMGGVSHPF 215
Query: 119 NDVCPSENLNHAVVIVGYGMRH----QVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ +C + L+H V+IVGYG + P WIV+NSWG WG GY+ V RG CG+
Sbjct: 216 SFLCSPDELDHGVLIVGYGTKKGWFSDSPYWIVKNSWGASWGV-QGYYLVYRGDGVCGL 273
>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
Length = 953
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 99/181 (54%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E + IK L S+ +LI+C+ + GC GG + A + ++ GLE E DYP+ +
Sbjct: 766 VEGLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAK- 824
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTF-RRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
C ++ V+V + ++T+ + L GP+ G+N +Q Y G +
Sbjct: 825 -AQKSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGLNANAMQFYRGGISHPW 883
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ +C ++++H V+IVGYG++ +P WI++NSWG RWG + GY+ + RG N+CG
Sbjct: 884 HPLCNHKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWG-EQGYYRIYRGDNSCG 942
Query: 172 I 172
+
Sbjct: 943 V 943
>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
Length = 353
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 93/190 (48%), Gaps = 21/190 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN-----IYNQ-----GCQGGGFNKAIQYL-KHAGLEA 50
+E YAIKH L+ S+ QL++C+ NQ GC GG A QYL K G+
Sbjct: 160 IEGSYAIKHKQLVSFSEQQLVDCDNNCVTFENQQSCDDGCNGGLQWSAYQYLMKAGGVVT 219
Query: 51 EADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQ 109
E DYP+ + +C ++S++ + + ++T L GP+ +N LQ
Sbjct: 220 EKDYPYYAERY---KCEVKPANFVAKLSNWTMLSTNETEMANWLAENGPIAVALNADFLQ 276
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYGMR-----HQVPVWIVRNSWGRWGPDDGYFTVE 164
+YN I C L+H V+IVGYG+ P WIV+NSWG +DGYF +
Sbjct: 277 NYNNG-IADPAWCDPTQLDHGVLIVGYGLETFWFGKPQPYWIVKNSWGYDFGEDGYFRIV 335
Query: 165 RGTNACGIES 174
+G CGI +
Sbjct: 336 KGVGRCGINT 345
>gi|209732040|gb|ACI66889.1| Cathepsin H precursor [Salmo salar]
Length = 330
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 91/181 (50%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
LES AI G L LS+ QL++C + N GC GG ++A +Y+K+ GL E DYP+
Sbjct: 145 LESVTAIATGKLPLLSEQQLVDCAQDFNNHGCMGGLPSQAFEYVKYNNGLMTEDDYPY-- 202
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAG--MNGALLQDYNGK 114
G G C + V D + D + P+ G + L +G
Sbjct: 203 -TGHDGSCNFKPELAAAFVKDVVNITSYDEKGMVDAVARLNPVSFGYEVTDDFLHYKDGV 261
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
++N+NHAV+ VGYG ++ P WIV+NSWG WG DGYF +ERG N CG+
Sbjct: 262 YSSTTCKNTTDNVNHAVLAVGYGEKNSTPYWIVKNSWGTNWGM-DGYFLIERGRNMCGLA 320
Query: 174 S 174
+
Sbjct: 321 A 321
>gi|114796866|gb|ABI79445.1| cysteine proteinase 5 [Entamoeba histolytica]
Length = 289
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 85/165 (51%), Gaps = 10/165 (6%)
Query: 14 LPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGVTGRCAYDAR 71
L LS+ QL++C++ N+GC GG + +Y+K G+ E DYP+ C YD +
Sbjct: 129 LDLSEQQLVDCSVSVGNKGCNGGSLLLSFRYVKLNGIMQEKDYPYV---AAEETCTYDKK 185
Query: 72 KVKVRVS-DFLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRKNDVCPSENLN 128
KV V+++ LV GS+ GP+ A ++ G Q Y + + C S LN
Sbjct: 186 KVAVKITGQKLVRPGSEKALMRAAAEGPVGAAIDASGVKFQLYKSGIYNSKE-CSSTQLN 244
Query: 129 HAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGI 172
H V +VGYG ++ WIVRNSWG D GY + R N CGI
Sbjct: 245 HGVAVVGYGTQNGTEYWIVRNSWGTIWGDQGYVLMSRNKNNQCGI 289
>gi|73983670|ref|XP_540846.2| PREDICTED: cathepsin W [Canis lupus familiaris]
Length = 374
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 98/196 (50%), Gaps = 24/196 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E+ + I++ + +S +L++C GC+GG ++ I L ++GL + DYPF N
Sbjct: 162 IEALWGIRYHQPVEVSVQELLDCGRCGDGCKGGFTWDAFITVLNNSGLASAKDYPFLG-N 220
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIR-K 118
RC K + DF++ G++ L GP+ +N LLQ Y +I+
Sbjct: 221 TKPHRCLAKKYKKVAWIQDFIMLQGNEQAIAWYLATKGPITVTINMKLLQHYQKGVIQAT 280
Query: 119 NDVCPSENLNHAVVIVGYGM------------------RHQVPVWIVRNSWG-RWGPDDG 159
+ C + ++H+V++VG+G H +P WI++NSWG WG ++G
Sbjct: 281 HTTCDPQRVDHSVLLVGFGKSKSVAGKQAEGGSSRPRPHHPIPYWILKNSWGAEWG-EEG 339
Query: 160 YFTVERGTNACGIESY 175
YF + RG N CGI Y
Sbjct: 340 YFRLHRGNNTCGITKY 355
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---IYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 148 LEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYIIDNNGIDSEASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
G+C YD + S + L + D + + + GP+ G++ + + K
Sbjct: 208 ---ATDGKCQYDPKNRAATCSKYTELPYGSEDALKEAVANKGPVSVGIDASRPSFFLYKS 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG + GY + R + N CGI S
Sbjct: 265 GVYYDPSCTDNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIAS 324
Query: 175 Y 175
+
Sbjct: 325 F 325
>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 479
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 92/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+A+ TL+ LS+ L+ C+ + GC GG ++A ++ H+G + E YP+ +
Sbjct: 162 IEGQWALSGNTLVSLSEQMLVSCDTVDMGCNGGLMDQAWAWIIKNHSGAVYTEVSYPYTS 221
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+G T C KV R+S + D L GP+ ++ Q Y G ++
Sbjct: 222 GDGSTASC-LSTGKVGARISGQVSLPQDEDAIEAWLEKNGPISIAVDATTWQLYFGGVVS 280
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYG 176
C + NLNH V++VGY P WIV+NSWG WG + GY + +G+N C ++ Y
Sbjct: 281 N---CFAYNLNHGVLLVGYNNSANPPYWIVKNSWGTSWG-EHGYIRLAKGSNQCMMKDYA 336
>gi|341903430|gb|EGT59365.1| hypothetical protein CAEBREN_22193 [Caenorhabditis brenneri]
Length = 410
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 96/180 (53%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF--RNQ 59
+E+ YAI HG LS+ L++C++ + C GG +KA +Y+ GL D P+ Q
Sbjct: 227 VEAAYAIAHGERRNLSEQTLLDCDLVDNACDGGDEDKAFRYIHRNGLAYAVDLPYVAHRQ 286
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNG-ALLQDYNGKLIRK 118
NG ++ ++K + + + + D+ L ++GP+ GM+ ++ Y G +
Sbjct: 287 NGCAVTDNWNTTRIK---AAYFLHHDEDSIINWLVNFGPVNIGMSVIQPMRAYKGGVFTP 343
Query: 119 ND-VCPSENLN-HAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
++ C +E + HA++I GYG + WIV+NSWG WG + GY RG NACGIE
Sbjct: 344 SEYACKNEVIGLHALLITGYGTSEKGEKYWIVKNSWGNTWGVEHGYIYFARGINACGIED 403
>gi|297819034|ref|XP_002877400.1| hypothetical protein ARALYDRAFT_323209 [Arabidopsis lyrata subsp.
lyrata]
gi|297323238|gb|EFH53659.1| hypothetical protein ARALYDRAFT_323209 [Arabidopsis lyrata subsp.
lyrata]
Length = 317
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 95/181 (52%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 133 LEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPY-- 190
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + A+ + V+V D + D + + P+ + K +
Sbjct: 191 -TGKDGGCKFSAKNIGVQVLDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGV 249
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIE 173
++ C + ++NHAV+ VGYG+ VP W+++NSW G WG D+GYF +E G N CG+
Sbjct: 250 FTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGDWG-DNGYFKMEMGKNMCGVA 308
Query: 174 S 174
+
Sbjct: 309 T 309
>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
Length = 1834
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 99/181 (54%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E + IK L S+ +LI+C+ + GC GG + A + ++ GLE E DYP+ +
Sbjct: 1647 VEGLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAK- 1705
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTF-RRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
C ++ V+V + ++T+ + L GP+ G+N +Q Y G +
Sbjct: 1706 -AQKSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGLNANAMQFYRGGISHPW 1764
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ +C ++++H V+IVGYG++ +P WI++NSWG RWG + GY+ + RG N+CG
Sbjct: 1765 HPLCNHKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWG-EQGYYRIYRGDNSCG 1823
Query: 172 I 172
+
Sbjct: 1824 V 1824
>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
Length = 1810
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 99/181 (54%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E + IK L S+ +LI+C+ + GC GG + A + ++ GLE E DYP+ +
Sbjct: 1623 VEGLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAK- 1681
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTF-RRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
C ++ V+V + ++T+ + L GP+ G+N +Q Y G +
Sbjct: 1682 -AQKSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGLNANAMQFYRGGISHPW 1740
Query: 119 NDVCPSENLNHAVVIVGYGMRH------QVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
+ +C ++++H V+IVGYG++ +P WI++NSWG RWG + GY+ + RG N+CG
Sbjct: 1741 HPLCNHKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWG-EQGYYRIYRGDNSCG 1799
Query: 172 I 172
+
Sbjct: 1800 V 1800
>gi|431910254|gb|ELK13327.1| Cathepsin W [Pteropus alecto]
Length = 210
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 92/182 (50%), Gaps = 22/182 (12%)
Query: 14 LPLSKSQLIECNIYNQGCQGGGFNKA-IQYLKHAGLEAEADYPFRNQNGVTGRCAYDARK 72
L L +L++C GC+GG A I L ++GL +E DYP++ + T +C K
Sbjct: 10 LSLFGPELVDCTRCGNGCEGGFIWDAFITVLNNSGLASEKDYPYQGKVR-THKCQAKKHK 68
Query: 73 VKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIR-KNDVCPSENLNHA 130
+ DF++ + R L GP+ +N LLQ Y +I+ ++ C ++H+
Sbjct: 69 NVAWIQDFIMLPDCEMKIARYLATEGPITVTINMKLLQQYQTGVIKATSNTCDPHLVDHS 128
Query: 131 VVIVGYGM----------------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
V++VG+G RH +P WI++NSWG WG + GYF + RG+N CGI
Sbjct: 129 VLLVGFGKSKSVEGRRAEAVSSKSRHSIPYWILKNSWGASWG-EKGYFRLHRGSNTCGIT 187
Query: 174 SY 175
Y
Sbjct: 188 KY 189
>gi|119640001|gb|ABL85442.1| cathepsin L [Kudoa thyrsites]
gi|119640005|gb|ABL85444.1| cathepsin L [Kudoa thyrsites]
Length = 300
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 88/167 (52%), Gaps = 8/167 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES YAIK G L+ S+ QL++C+ N GC GG A Y+ + G+ DYP+ + G
Sbjct: 135 IESAYAIKTGELVNFSEQQLVDCSTENHGCNGGLPEIAFLYVINNGIMKLKDYPYTAKQG 194
Query: 62 VTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKLIRK 118
C Y V VR+S F V N ++ + + GP G+N A Q Y G I
Sbjct: 195 T---CQYSPEDV-VRISSFKCVENNEESVMESVANNGPNSIGINAASRSFQFYGGG-IYS 249
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER 165
+ S L+HAV++VGYG ++ W V+NSWG W + GY ++R
Sbjct: 250 DPWASSYPLDHAVLLVGYGYKNTENYWHVKNSWGPWWGEQGYINIKR 296
>gi|328909405|gb|AEB61370.1| cathepsin S-like protein, partial [Equus caballus]
Length = 281
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI---YNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG A QY+ + G++++A YP++
Sbjct: 98 LEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDNNGIDSDASYPYK 157
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G+C YD++ S + L F D + + + GP+ ++ + + K
Sbjct: 158 ---AMDGKCRYDSKNRAATCSKYTELPFGSEDDLKEAVANKGPVSVAIDASHPSFFLYKS 214
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG D GY + R + N CGI +
Sbjct: 215 GVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGINFGDKGYIRMARNSGNHCGIAN 274
Query: 175 Y 175
Y
Sbjct: 275 Y 275
>gi|449270628|gb|EMC81287.1| Cathepsin H, partial [Columba livia]
Length = 261
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 87/180 (48%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G LL L++ QL++C N GC GG ++A +Y L + GL E YP+R
Sbjct: 76 LESAIAIATGKLLSLAEQQLVDCAQAFNNHGCSGGLPSQAFEYILYNRGLMGEDTYPYRA 135
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFN--GSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+NG C + K V D + D + + P+ + K +
Sbjct: 136 ENGT---CKFQPEKAIAFVRDVINITQYDEDGMVEAVGKHNPVSFAFEVTSNFMHYRKGV 192
Query: 117 RKNDVCP--SENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
N C + +NHAV+ VGYG P WIV+NSWG DGYF +ERG N CG+ +
Sbjct: 193 YSNPRCEHTPDKVNHAVLAVGYGEEDGTPFWIVKNSWGPLWGMDGYFLIERGKNMCGLAA 252
>gi|332326593|gb|AEE42620.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 90/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWAVAGHRLTALSEQQLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C ++ V R+ ++ S+T L GP+ G++ + Y ++
Sbjct: 219 SXGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIGVDASSFMSYESGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CAGBXLNHGVLLVGYNXTGEVPYWVIKNSWGEDWG-EKGYVRVAMGVNACLLTEY 334
>gi|394331822|gb|AFN27130.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 92/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWALAGHRLTALSEQQLVSCDDKDSGCGGGLMLQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+G C+ ++ V R+ ++ S+T L GP+ ++ + Y ++
Sbjct: 219 SSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYESGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C LNH V++VGY M +VP W+++NSWG WG ++GY V G NAC + Y
Sbjct: 279 TS---CAGITLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-ENGYVRVTMGVNACLLTEY 334
>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 96/181 (53%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y+ G + LS+ QL++C N GC GG ++A +Y+K + GL+ E YP+
Sbjct: 176 LEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGAL-LQDYNGKL 115
+NG+ C + + V V+V D + D + + P+ + Y +
Sbjct: 236 KNGL---CKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGV 292
Query: 116 IRKNDVCPSE-NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+ + ++NHAV+ VGYG+ + VP W+++NSWG WG DDGYF +E G N CGI
Sbjct: 293 YSSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DDGYFKMEMGKNMCGIA 351
Query: 174 S 174
+
Sbjct: 352 T 352
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 99/174 (56%), Gaps = 12/174 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQ-GCQGGGFNKAIQY-LKHAGLEAEADYPFRNQ 59
+E+ AI G L+ LS+ +L++C+ N GC+GG + A Q+ + + G++ EADYP+
Sbjct: 170 IEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGIDTEADYPY--- 226
Query: 60 NGVTGRC--AYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALL--QDYNGKL 115
GV G C A + +KV V + ++ + SD+ P+ GM+G+ L Q Y G +
Sbjct: 227 TGVDGTCNTAKEEKKV-VSIEGYVDVDPSDSALLCATVQQPISVGMDGSALDFQLYTGGI 285
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ +++HA++IVGYG + WIV+NSWG WG +GYF + R T+
Sbjct: 286 YDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGM-EGYFYIRRNTS 338
>gi|410914437|ref|XP_003970694.1| PREDICTED: cathepsin O-like [Takifugu rubripes]
Length = 328
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 95/177 (53%), Gaps = 11/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA--GLEAEADYPFRNQ 59
++S +AI L+ LS Q+++C+ N+GC GG A+++L L +++YP++ Q
Sbjct: 148 VQSVHAIGGSQLVELSVQQVLDCSFQNKGCNGGTPVAALKWLTQTRVKLVPQSEYPYKAQ 207
Query: 60 NGVTGRCAY-DARKVKVRVSDFLVFNGSDTFRRMLYH---YGPLVAGMNGALLQDYNGKL 115
T C + V V +F + S M+ H +GPL ++ QDY G +
Sbjct: 208 ---TRMCHFFSGSHGGVGVKNFTALDFSGQEEAMMGHLVKHGPLSVVVDALSWQDYLGGI 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
I+ + C S+ NHAV++VGY +P WIV+NSWG D GY ++ G+N CGI
Sbjct: 265 IQYH--CSSKRSNHAVLVVGYDTTGDIPYWIVQNSWGTTWGDKGYVYMKVGSNICGI 319
>gi|4902840|emb|CAB43538.1| cysteine proteinase A [Leishmania major]
Length = 229
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 99/179 (55%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+ESQ+A+K+ +L+ LS+ L+ C+ + GC GG ++A++++ H G + E YP+ +
Sbjct: 37 IESQWALKNHSLVSLSEQMLVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPTEKSYPYAS 96
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G + C +D + R+S ++ + + GP+ ++ Q Y G ++
Sbjct: 97 AGGTSPPC-HDKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQLYFGGVVT 155
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+C +LNH V++VG+ R + P WIV+NSWG WG + GY + G+N C +++Y
Sbjct: 156 ---LCFGLSLNHGVLVVGFNKRAKPPYWIVKNSWGTSWG-EKGYTRLAMGSNQCLLKNY 210
>gi|149725427|ref|XP_001494683.1| PREDICTED: cathepsin W-like [Equus caballus]
Length = 373
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 102/198 (51%), Gaps = 27/198 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+E+ +AI + + +S QL++C+ GC+GG ++ + L ++GL +E DYPFR +
Sbjct: 162 IEALWAITYHQSVEVSIQQLLDCDRCGNGCKGGFVWDAFLTVLNNSGLASEKDYPFRG-D 220
Query: 61 GVTGRCAYDARKVKVR-VSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
RC A+K KV + DF+ L +GP+ +N LLQ Y +I+
Sbjct: 221 AKPHRC--QAKKPKVAWIQDFIRLPEDEQKIAEYLATHGPITVTINMKLLQQYQKGVIKA 278
Query: 119 N-DVCPSENLNHAVVIVGYGMRHQVP------------------VWIVRNSWG-RWGPDD 158
C ++L+H+V++VG+G V WI++NSWG +WG ++
Sbjct: 279 TPTTCDPQHLDHSVLLVGFGGGKSVEGRRPGAVSSQSRPRRSSSYWILKNSWGAKWG-EE 337
Query: 159 GYFTVERGTNACGIESYG 176
GYF + RG+N CGI Y
Sbjct: 338 GYFRLHRGSNTCGITKYA 355
>gi|56758920|gb|AAW27600.1| SJCHGC00098 protein [Schistosoma japonicum]
gi|226476138|emb|CAX72159.1| cathepsin L, a [Schistosoma japonicum]
Length = 331
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 94/181 (51%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC+GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCEGGYMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAG---MNGALLQDYNGK 114
G C Y K V+V F+ D T ++ +Y YGP+ G +N ++ Y
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALNSLIM--YKSG 263
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+ ND C ++NHAV++VGYG H W+++NSWG GYF + R N CG+
Sbjct: 264 VFESND-CKYADINHAVLVVGYGKEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVA 322
Query: 174 S 174
S
Sbjct: 323 S 323
>gi|194462412|gb|ACF72674.1| cysteine proteinase type I [Leishmania tarentolae]
Length = 218
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 90/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+ + L+ LS+ QL+ C+ + GC GG ++A ++L + + E YP+ +
Sbjct: 35 IESQWFLAGHPLVNLSEQQLVSCDDVDSGCSGGLMSQAFEWLLNNTNGNVYTEDSYPYLS 94
Query: 59 QNGVTGRCA-YDARKVKVRVSDFLVFNGS-DTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ D V ++ +V + D L GP+ ++ Y G ++
Sbjct: 95 ANGYAPECSNSDELAVGAQIDGHVVIESNEDEMAAWLAKNGPIAIAVDATAFMSYEGGVL 154
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C E LNH V++V Y ++P W+++NSWG WG ++ Y V +GTN C + Y
Sbjct: 155 T---ACNGEQLNHGVLLVAYNTTGELPYWVIKNSWGASWG-EEAYVRVAKGTNECLLNEY 210
>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
Length = 364
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 95/184 (51%), Gaps = 22/184 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E Q+ + L+ LS QL++C++ ++GC GG + + ++ GLE E YP+ +
Sbjct: 186 IEGQWFLAKKKLVSLSAQQLLDCDVVDEGCNGGFPLDAYKEIVRMGGLEPEDKYPYEAK- 244
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGS-------DTFRRMLYHYGPLVAGMNGALLQDYN 112
A + ++ SD V+ NGS + R L GP+ G+ +Q Y
Sbjct: 245 ---------AEQCRLVPSDIAVYINGSVELPHDEEKMRAWLVKKGPISIGITVDDIQFYK 295
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
G + R C ++ H ++VGYG+ +P WI++NSWG WG +DGY+ + RG NAC
Sbjct: 296 GGVSRPT-TCRLSSMIHGALLVGYGVEKNIPYWIIKNSWGPNWG-EDGYYRMVRGENACR 353
Query: 172 IESY 175
I +
Sbjct: 354 INRF 357
>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
Length = 362
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ YA G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 178 LEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPY-- 235
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G+ G C + + + V+V D + D + + P+ + K +
Sbjct: 236 -TGLDGTCKFSSENIGVQVLDSVNITLGAEDELKHAVAFVRPVSVAFEVVHDFRFYKKGV 294
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+ C S ++NHAV+ VGYG+ V W+++NSWG WG D+GYF +E G N CG+
Sbjct: 295 YTSGTCGSTPMDVNHAVLAVGYGVEDGVAYWLIKNSWGENWG-DNGYFKMELGKNMCGVA 353
Query: 174 S 174
+
Sbjct: 354 T 354
>gi|157868354|ref|XP_001682730.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
gi|68126185|emb|CAJ07238.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
Length = 354
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 99/179 (55%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+ESQ+A+K+ +L+ LS+ L+ C+ + GC GG ++A++++ H G + E YP+ +
Sbjct: 162 IESQWALKNHSLVSLSEQMLVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPTEKSYPYAS 221
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G + C +D + R+S ++ + + GP+ ++ Q Y G ++
Sbjct: 222 AGGTSPPC-HDKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQLYFGGVVT 280
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+C +LNH V++VG+ R + P WIV+NSWG WG + GY + G+N C +++Y
Sbjct: 281 ---LCFGLSLNHGVLVVGFNKRAKPPYWIVKNSWGTSWG-EKGYIRLAMGSNQCLLKNY 335
>gi|321467301|gb|EFX78292.1| hypothetical protein DAPPUDRAFT_305243 [Daphnia pulex]
Length = 328
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 90/181 (49%), Gaps = 11/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH--AGLEAEADYPFRNQ 59
LE KHG L +S+ QL++C Y+ GC GG + A YL++ G + YP+
Sbjct: 144 LEFARCKKHGALRAISEQQLVDCEPYDYGCGGGWYTNAWYYLQYEAGGAAKRSLYPYTAT 203
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGM--NGALLQDYNGKL 115
+ CA+ + + ++S + D + +L YGP+ + + +G
Sbjct: 204 DNT---CAFSSSMIGAKISSYGDLPSFDAAYMQSVLQDYGPISVAIAVTDSFFSYASGVY 260
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
P+ +NHAVV+VG+G + + WIVRNSWG +WG GY +ERG N C IE
Sbjct: 261 TDVECDDPNAYVNHAVVVVGWGTDNGIDYWIVRNSWGTKWG-SAGYILMERGVNKCKIEK 319
Query: 175 Y 175
Y
Sbjct: 320 Y 320
>gi|226476540|emb|CAX72162.1| cathepsin L, a [Schistosoma japonicum]
Length = 331
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 95/181 (52%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC+GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCEGGYMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAG---MNGALLQDYNGK 114
G C Y K V+V F+ D T ++ +Y YGP+ G +N ++ Y
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALNSLIM--YKSG 263
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+ ND C ++NHAV++VGYG H W+++NSWG + GYF + R N CG+
Sbjct: 264 VFESND-CKYGDINHAVLVVGYGNEHGKDYWLIKNSWGDFWGSKGYFKLRRNKHNMCGVA 322
Query: 174 S 174
S
Sbjct: 323 S 323
>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
Length = 323
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 90/176 (51%), Gaps = 10/176 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q AIK G+ PLS QL++C+ N GC GG N A Y+K GLE++A YP+
Sbjct: 143 VEGQAAIKSGSKTPLSVQQLVDCSTEGGNSGCNGGLMNGAFDYIKANGLESDAKYPY--- 199
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G C D V+++ + S+ + + + GP+ + L + Y G +
Sbjct: 200 TGTDDSCKADKSSSLVKLTGYKKVASSEASLKEAVGTVGPISVAVYADLWRSYGGGIFN- 258
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGT-NACGI 172
N +C L+H V VGYG + W V+NSWG WG ++GY + R T + CGI
Sbjct: 259 NILCLGFGLDHGVTAVGYGTDNGKKYWPVKNSWGESWG-EEGYIRMARDTLHNCGI 313
>gi|344275468|ref|XP_003409534.1| PREDICTED: cathepsin K-like [Loxodonta africana]
Length = 329
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPVGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|432961003|ref|XP_004086527.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin O-like [Oryzias latipes]
Length = 333
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 98/178 (55%), Gaps = 13/178 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRNQ 59
++S A+ L LS QL++C+ N+GC GG A+ +L L A+YP++ +
Sbjct: 153 VQSARAVGGSRLQRLSVQQLLDCSFTNKGCGGGSPTAALSWLLQTREKLVTAAEYPYQAE 212
Query: 60 NGVTGRCAYDARKVK-VRVSDFLV--FNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ C + ++ + V V +F V F G + L +GPLVA ++ QDY G +
Sbjct: 213 AQI---CRFFSQTHQGVAVKNFTVHNFRGQEPAMMAQLVEHGPLVAVVDAVSWQDYLGGI 269
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
I+ + C S+ NHAV++VGY VP WIV+NSWG WG ++GY ++ G + CGI
Sbjct: 270 IQHH--CSSQWPNHAVLVVGYDTSGDVPYWIVQNSWGTSWG-NEGYVYIKMGGDVCGI 324
>gi|321476443|gb|EFX87404.1| hypothetical protein DAPPUDRAFT_307061 [Daphnia pulex]
Length = 332
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/178 (35%), Positives = 92/178 (51%), Gaps = 7/178 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
LE K T + LS+ QL++C+ + GC GG + A Y+K+AG A+ + N
Sbjct: 151 LEFAQCKKDSTRVVLSERQLVDCDRLDSGCNGGMYTDAWTYIKNAGGCAKQTL-YSPYNA 209
Query: 62 VTGRCAYDARKVKVRVS--DFLVFNGSDTFRRMLYHYGPL-VAGMNGALLQDYNGKLIRK 118
C + + V +VS DFL N + + +GP+ VA ++G +
Sbjct: 210 RKNFCKFRSSMVGAQVSTFDFLPANNPLAMQVAMEQHGPIAVAIAVVPSFLSFHGDVYDD 269
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
N C +NHAVV+VG+G + V W+VRNSWG WG GY ++RG N CGIESY
Sbjct: 270 N-ACDGAEINHAVVVVGWGTLNGVDYWMVRNSWGTNWGLS-GYIRIKRGVNKCGIESY 325
>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
Length = 331
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 95/176 (53%), Gaps = 11/176 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA-GLEAEADYPFRNQN 60
+ESQ+A+ L LS Q+++C+ ++ GC GG + A Y+ A GL+A A+YP+
Sbjct: 153 IESQWALAGHKLTGLSMQQIVDCSWWDDGCGGGFPSYAYDYVIDAPGLDALANYPY---T 209
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM---LYHYGPLVAGMNGALLQDYNGKLIR 117
V G CA+ +V ++S + +M L +GP+ ++ Y G + R
Sbjct: 210 AVGGSCAFKESQVVAKISSWTYTTTDSNEHQMANYLAQHGPISVCVDAESWPSYTGGVYR 269
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C + +++H V+ VGY + P WI+RNSWG WG +GY +E GT+AC +
Sbjct: 270 AS-ACGT-SIDHCVLAVGYNLTANPPYWIIRNSWGTSWGL-EGYMHLEFGTDACAV 322
>gi|71084306|gb|AAZ23598.1| cysteine protease [Leishmania major]
Length = 327
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 99/179 (55%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+ESQ+A+K+ +L+ LS+ L+ C+ + GC GG ++A++++ H G + E YP+ +
Sbjct: 135 IESQWALKNHSLVSLSEQMLVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPTEESYPYAS 194
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G + C +D + R+S ++ + + GP+ ++ Q Y G ++
Sbjct: 195 AGGTSPPC-HDKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQLYFGGVVT 253
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+C +LNH V++VG+ R + P WIV+NSWG WG + GY + G+N C +++Y
Sbjct: 254 ---LCFGWSLNHGVLVVGFNKRAKPPYWIVKNSWGTSWG-EKGYIRLAMGSNQCLLKNY 308
>gi|332326583|gb|AEE42615.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 89/180 (49%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWAVADHRLXXLSEQQLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C ++ V R+ ++ S+T L GP+ ++ + Y ++
Sbjct: 219 STGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYESGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CAGDALNHGVLLVGYNRTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTEY 334
>gi|332326587|gb|AEE42617.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 89/180 (49%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWAVAGHRLTALSEQQLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C ++ V R+ ++ S+T L GP+ ++ + Y ++
Sbjct: 219 STGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYESGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CAGDALNHGVLLVGYNXTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTEY 334
>gi|440911897|gb|ELR61520.1| Cathepsin O, partial [Bos grunniens mutus]
Length = 276
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 90/175 (51%), Gaps = 7/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES AIK L LS Q+I+C+ N GC GG A+ +L L +++YPF+ Q
Sbjct: 96 VESVCAIKGQPLGVLSVQQVIDCSYSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQ 155
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
NG+ + ++ F+G D L GPL+ ++ QDY G +I+
Sbjct: 156 NGLCRYFSDSHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQH 215
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ C S NHAV++ G+ +P WIV+NSWG WG DGY V+ G N CGI
Sbjct: 216 H--CSSGEANHAVLVTGFDKTGSIPYWIVQNSWGTSWGI-DGYVRVKMGGNICGI 267
>gi|8393221|ref|NP_059016.1| cathepsin S preproprotein [Rattus norvegicus]
gi|399190|sp|Q02765.1|CATS_RAT RecName: Full=Cathepsin S; Flags: Precursor
gi|203650|gb|AAA40994.1| cathepsin S precursor [Rattus norvegicus]
Length = 330
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 92/184 (50%), Gaps = 15/184 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYLKHAGLEAEADYPFR 57
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ +++EA YP++
Sbjct: 146 LEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDTSIDSEASYPYK 205
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ +C YD + S + L F + + + GP+ G++ A + L
Sbjct: 206 ---AMDEKCLYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGIDDASHSSF--FL 260
Query: 116 IRK---NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACG 171
+ +D +EN+NH V++VGYG W+V+NSWG D GY + R N CG
Sbjct: 261 YQSGVYDDPSCTENMNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCG 320
Query: 172 IESY 175
I SY
Sbjct: 321 IASY 324
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 94/181 (51%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG KA QY+ + G+++E YP++
Sbjct: 156 LEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNNGIDSEVSYPYK 215
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G C YD++ S + L F D + + + GP+ ++ + K
Sbjct: 216 ---AMDGNCRYDSKHRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAKHSSFFLYKS 272
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG + GY + R + N CGI S
Sbjct: 273 GVYYDPSCTQNVNHGVLVVGYGNLNGRDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIAS 332
Query: 175 Y 175
Y
Sbjct: 333 Y 333
>gi|332326589|gb|AEE42618.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 89/180 (49%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWAVAGHRLTALSEQQLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C ++ V R+ ++ S+T L GP+ ++ + Y ++
Sbjct: 219 STGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYESGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CAGDALNHGVLLVGYNRTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTEY 334
>gi|226467484|emb|CAX69618.1| cathepsin L, a [Schistosoma japonicum]
Length = 353
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
LE Q +K L+PLS QLI+C + C YLKH G+E+E DY F G
Sbjct: 175 LEGQLKLKTNKLIPLSAQQLIDCT-GDHECVENPLPVGFDYLKHKGVESEDDYKFV---G 230
Query: 62 VTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVA--GMNGALLQDYNGKLIR 117
C Y+A KV + S + ++ D ++ LY YGP+ M L +G LI
Sbjct: 231 NVENCTYNASKVVITASSYSQVLPISEDELQKALYTYGPIAVTIAMTQEFLAYESGVLIP 290
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIESYG 176
+ C + +V++VGYG+ ++P W+++ S G D GY + R +N C I SY
Sbjct: 291 TD--CQDKEAFESVLLVGYGIEDEIPYWLIKFSLGTEFGDQGYIKLARNHSNMCHIASYA 348
>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
Length = 360
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 93/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QLI+C N GC GG ++A +Y+K+ GL+ E YP++
Sbjct: 176 LEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQ- 234
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C + V V+V D + D + + P+ +
Sbjct: 235 --GVNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGV 292
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+D C + ++NHAV+ VGYG+ VP W+++NSWG WG D+GYF +E G N CG+
Sbjct: 293 YTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWG-DEGYFKMEMGKNMCGVA 351
Query: 174 S 174
+
Sbjct: 352 T 352
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 98/182 (53%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K G L+ LS+ L++C+ N GC+GG + A +Y+K + G++AE YP+
Sbjct: 149 LEGQHFLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKANDGIDAEESYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNG--ALLQDYNGK 114
+ +C + V + F+ G D ++ + GP+ ++ + Q Y+
Sbjct: 208 --AMDDKCRFKKEDVGATDTGFVDIEGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERG-TNACGI 172
+ + + C SE L+H V+ VGYG++ W+V+NSW G WG D+GY + R N CGI
Sbjct: 266 VYDEPE-CSSEELDHGVLAVGYGVKDGKKYWLVKNSWGGSWG-DNGYILMSRDKNNQCGI 323
Query: 173 ES 174
S
Sbjct: 324 AS 325
>gi|395856027|ref|XP_003800444.1| PREDICTED: cathepsin K [Otolemur garnettii]
Length = 329
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 93/177 (52%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSDNDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ G++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVGIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+N+NHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNVNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|394331826|gb|AFN27132.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 92/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWALAGHGLTALSEQQLVSCDDKDNGCSGGLMLQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLI 116
+G C+ ++ V R+ ++ S+T + L GP+ ++ + Y ++
Sbjct: 219 SSGYVPECSNSSQLVPGARIEGYMTIESSETVKGAWLAKNGPISIAVDASSFMSYQSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CAGDALNHGVLLVGYNRTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTEY 334
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 98/182 (53%), Gaps = 15/182 (8%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECN-IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
++E I G L+ LS+ +L++C+ YNQGC GG + A Q+ +K+ GL E DYP+R
Sbjct: 36 VVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRG 95
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLI 116
+G +++ V + + + N +R + Y P+ ++ G + Q Y +
Sbjct: 96 SDGKCNSLLKNSKVVTIDGYEDVPTNDETALKRAV-SYQPVSVAIDAGGRVFQHYQSGIF 154
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-----TNAC 170
C ++ ++HAVV VGYG + V WIVRNSWG +WG +DGY +ER + C
Sbjct: 155 TGE--CGTK-MDHAVVAVGYGSENGVDYWIVRNSWGQKWG-EDGYIRIERNLASSKSGKC 210
Query: 171 GI 172
GI
Sbjct: 211 GI 212
>gi|407399825|gb|EKF28451.1| cysteine peptidase, putative, partial [Trypanosoma cruzi
marinkellei]
Length = 257
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/163 (33%), Positives = 87/163 (53%), Gaps = 9/163 (5%)
Query: 16 LSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRNQNGVTGRCAYDARK 72
LS+ L+ C+ N C GG KA +++ + + E YP+R+ NG T C RK
Sbjct: 3 LSEQMLVSCDKTNDSCDGGSQLKAFKWIVEQNNGTVYTEKSYPYRSCNGRTPPCIKFRRK 62
Query: 73 VKVRVSDFLVFNGSDTFRRM-LYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAV 131
V ++D+ ++T + L YGPL A ++ + Y G ++ C S L HAV
Sbjct: 63 VGATITDYFSVKKNETKVAIALAAYGPLSAVIDASSWMIYTGGVLTN---CVSAALGHAV 119
Query: 132 VIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
++VGY VP W ++NSWG+ WG ++GY + +G+N C ++
Sbjct: 120 LLVGYNDSAPVPYWTIKNSWGKQWG-EEGYIRIAKGSNQCLVK 161
>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWALAGHRLTALSEQQLVSCDDKDNGCAGGLMLQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C+ ++ V R+ +L S+T L GP+ ++ + Y ++
Sbjct: 219 STGYVPECSNSSQLVPGARIDGYLTIESSETVMAAWLAKNGPISIAVDASSFMSYQSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY +VP W+++NSWG WG ++GY V G NAC + Y
Sbjct: 279 TS---CAGDALNHGVLLVGYNRTGEVPYWVIKNSWGENWG-ENGYVRVTMGVNACLLTEY 334
>gi|332326591|gb|AEE42619.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 90/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWAVAXHGLVRLSEQQLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C + V R+ +++ +T L GP+ ++ + Y ++
Sbjct: 219 STGDVPECTNSSELVPGARIDGYVMIESXETVMAAWLAKSGPISIAVDASPFMSYESGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CVGKXLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTEY 334
>gi|119640007|gb|ABL85445.1| cathepsin L [Kudoa thyrsites]
Length = 300
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 88/167 (52%), Gaps = 8/167 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES YAIK G L+ S+ QL++C+ N GC GG A Y+ + G+ DYP+ + G
Sbjct: 135 IESAYAIKTGELVNFSEQQLVDCSTENHGCNGGLPEIAFLYVINNGIMKLKDYPYTAKQG 194
Query: 62 VTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKLIRK 118
C Y V VR+S F V N ++ + + GP G+N A Q Y G I
Sbjct: 195 T---CQYSPEDV-VRISSFKCVENNGESVMESVANNGPNSIGINAASRSFQFYGGG-IYF 249
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER 165
+ S L+HAV++VGYG ++ W V+NSWG W + GY ++R
Sbjct: 250 DPWASSYPLDHAVLLVGYGFKNTENYWHVKNSWGPWWGEQGYINIKR 296
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 99/185 (53%), Gaps = 12/185 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ K G+++ LS+ L+ C + N GC+GG + A +Y++ + G++ E YP+
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPY-- 209
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGK 114
NG G C + V S F+ + GS+T ++ + GP+ ++ + Q Y+
Sbjct: 210 -NGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIE 173
+ + + C SE+L+H V++VGYG + W V+NSWG D+GY + R N CGI
Sbjct: 269 VYDEPE-CDSESLDHGVLVVGYGTLNGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCGIA 327
Query: 174 SYGGI 178
S I
Sbjct: 328 SSASI 332
>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
Length = 332
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 96/181 (53%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI---YNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG A QY+ + G++++A YP++
Sbjct: 149 LEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDNNGIDSDASYPYK 208
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G+C YD++ S + L F D + + + GP+ ++ + + K
Sbjct: 209 ---AMDGKCRYDSKNRAATCSKYTELPFGSEDDLKEAVANKGPVSVAIDASHPSFFLYKS 265
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG D GY + R + N CGI +
Sbjct: 266 GVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGINFGDKGYIRMARNSGNHCGIAN 325
Query: 175 Y 175
Y
Sbjct: 326 Y 326
>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 91/180 (50%), Gaps = 11/180 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Q+ K+GTL+ LS +L++C N GC+GG +A +++ G++ E YP+
Sbjct: 145 IEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDEGIQTEESYPYEG 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
+ + KVK V R + GP+ + + L Y+ ++ +
Sbjct: 205 RRSSCKKSGEYVTKVKTYVFPL----DEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260
Query: 119 NDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C + E+LN V++VGYG + V WIV+NSWG WG + GYF +++ ACGI Y
Sbjct: 261 RCRCSNKREDLNPGVLVVGYGSENGVDYWIVKNSWGADWG-EKGYFRLKKDVKACGIGYY 319
>gi|334324659|ref|XP_001371004.2| PREDICTED: cathepsin K-like [Monodelphis domestica]
Length = 332
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+
Sbjct: 151 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYI--- 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
G C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 GEDESCMYNPTGKAAKCRGYREIPEGSEKALKRAVARVGPVAVAIDASLSSFQFYSKGVY 267
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG +WG + GY + R NACGI
Sbjct: 268 YDENCNSDNLNHAVLAVGYGIQRGTKHWIIKNSWGEQWG-NKGYILMARNKNNACGI 323
>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
Length = 361
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 100/184 (54%), Gaps = 18/184 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C NQGC GG ++A +Y+K+ GL+ E YP+
Sbjct: 177 LEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTG 236
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
++GV C + + V V+V D + N + L H LV ++ A + +G K
Sbjct: 237 KDGV---CKFSSENVGVQVLDSV--NITLGAEDELQHAVGLVRPVSVAF-EVVDGFRFYK 290
Query: 119 NDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNAC 170
+ V S ++NHAVV VGYG+ VP W+++NSWG WG D GYF ++ G N C
Sbjct: 291 SGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWG-DHGYFKIKMGKNMC 349
Query: 171 GIES 174
GI +
Sbjct: 350 GIAT 353
>gi|229596403|ref|XP_001009843.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225565321|gb|EAR89598.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 324
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 88/179 (49%), Gaps = 19/179 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
LE YAI G L S+ Q+++C+ N GC GG A +Y+ G+E EADYP++ G
Sbjct: 148 LEGAYAIATGNLTSFSEQQIVDCSKANAGCNGGDLPPAYKYVVQNGIETEADYPYK---G 204
Query: 62 VTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYG-PLVAGMNGALLQDYNGKLIRK 118
V +CAYDA KV + F+ N D L P+ + Q Y +I
Sbjct: 205 VNQKCAYDASKVVFKPKSFVQVTPNSPDQLAIALNKEPVPICIEADQKAFQFYTSGIISS 264
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYF----TVERGTNACGI 172
C + NL+H V+ VGY WIV+NSWG WG ++GY T +G CGI
Sbjct: 265 G--CGT-NLDHCVLAVGYDADS----WIVKNSWGASWG-ENGYVRIARTTAKGPGVCGI 315
>gi|30141461|emb|CAD54747.1| cysteine proteinase a [Leishmania guyanensis]
Length = 222
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/184 (30%), Positives = 93/184 (50%), Gaps = 10/184 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+A+ TL+ LS+ L+ C+ + GC GG ++A ++ H+G + E +P+ +
Sbjct: 37 IEGQWALSGNTLVSLSEQMLVSCDTVDMGCNGGLMDQAWAWIIKNHSGAVYTEVSHPYTS 96
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+G T C KV R+S + D L GP+ ++ Q Y G ++
Sbjct: 97 GDGSTASC-LSTGKVGARISGQVSLPQDEDAIEAWLEKNGPIAIAVDATTWQLYFGGVVL 155
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYG 176
C + LNH V++VGY + P WIV+NSWG WG + GY + +G+N C ++ Y
Sbjct: 156 N---CFAYQLNHGVLLVGYNNSAKPPYWIVKNSWGTSWG-EHGYIRLAKGSNQCMMKDYA 211
Query: 177 GICT 180
T
Sbjct: 212 MTAT 215
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/182 (36%), Positives = 96/182 (52%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q K G L+PLS+ L++C+ N+GC GG + A QY+K + GL+ YP+
Sbjct: 147 LEGQVFRKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEA 206
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMN--GALLQDYNGKL 115
NG C Y+ + +V F+ S+ + + GP+ G++ Q Y G +
Sbjct: 207 LNGT---CRYNPKYSAAKVVGFMSIPPSENALMKAVATVGPISVGIDIKHKSFQFYKGGM 263
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
+ D C S NLNHAV++VGYG W+V+NSWGR WG DGY + + N CGI
Sbjct: 264 YYEPD-CSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDWGM-DGYIKMAKDWNNNCGI 321
Query: 173 ES 174
S
Sbjct: 322 AS 323
>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 272
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 97/190 (51%), Gaps = 24/190 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI---------YNQGCQGGGFNKAIQYL-KHAGLEAE 51
+E + I G L+ LS+ QL++C++ + GC GG + A++Y+ +H G++ E
Sbjct: 77 IEGAHFISTGKLVELSEQQLVDCDVGCDPDVPNACDSGCNGGLPSNAMEYIVEHGGIDTE 136
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
YP+ G G C K+ + +F V + L YGPL G+N A +Q
Sbjct: 137 KSYPYV---GEKGECKAKKGKLGATLKNFSFVSDDEKQMAAALVKYGPLSIGINAAWMQS 193
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV-------WIVRNSWG-RWGPDDGYFT 162
Y G + +C +E+L+H V+IVGYG PV WIV+NSW WG + GY+
Sbjct: 194 YIGGVACPW-LCDAESLDHGVLIVGYGSSGFAPVRWAPEPYWIVKNSWSPAWG-EGGYYR 251
Query: 163 VERGTNACGI 172
+ + +CGI
Sbjct: 252 ICKDKGSCGI 261
>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
Length = 367
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 99/195 (50%), Gaps = 30/195 (15%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQY-LKHAGLEAE 51
+E + + G L+ LS+ QL++C+ + GC GG A +Y LK GL+ E
Sbjct: 166 VEGAHFLATGELVSLSEQQLVDCDHECDAEQKSECDAGCGGGLMTTAFEYTLKAGGLQRE 225
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +NG +C +D K+ V+++ V D L +GPL G+N A +Q
Sbjct: 226 KDYPYTGRNG---QCHFDKSKIAASVTNYSVVGLDEDQIAANLVKHGPLAVGINSAWMQT 282
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGRWGPDDGY 160
Y G + CP ++ +H V++VGYG P+ WI++NSWG + GY
Sbjct: 283 YIGGV-----SCPLVCFKHQDHGVLLVGYGSAGFAPIRLKAKPYWIIKNSWGEHWGEHGY 337
Query: 161 FTVERGT-NACGIES 174
+ + RG N CG+++
Sbjct: 338 YKICRGQHNICGVDA 352
>gi|195997891|ref|XP_002108814.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
gi|190589590|gb|EDV29612.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
Length = 333
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 90/184 (48%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ S +A+ + LS QL+ C+ + GC+GG F A +L LE E+ P+
Sbjct: 153 IASAWALAGNSFTELSVQQLLSCDNMDGGCRGGSFYLACNWLTKNRVPLETESANPYL-- 210
Query: 60 NGVTGRCAYDARKVKVRVSDF----LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
G +C A + + F ++ S + L GPL ++ +DY G +
Sbjct: 211 -GKRDKCVKHATNTGIILKKFTTSNFIYQESSSMIAALNQNGPLSIAVDATSWRDYVGGI 269
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI-ES 174
I+ + C + LNHAV +VGY + VP WIVRNSWG D GY ++ G N CGI ES
Sbjct: 270 IQHH--CDGKVLNHAVQVVGYKLDAPVPYWIVRNSWGEDFGDHGYIYIKMGKNVCGIAES 327
Query: 175 YGGI 178
G +
Sbjct: 328 VGWV 331
>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
Length = 567
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 92/177 (51%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ ++ G LL LS+ +L++C+ +Q C GG + A ++ GLE E DY +
Sbjct: 387 VEGQWFLRRGALLTLSEQELVDCDTLDQACGGGLPSNAYTAIETLGGLETEKDYSY---E 443
Query: 61 GVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G RC++ K + + S + L GP+ +N +Q Y +
Sbjct: 444 GRKERCSFSPDKARAYINSSVDLSRDEQEIAAWLAENGPVSIALNAFAMQFYRRGVSHPF 503
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R +P W ++NSWG WG ++GY+ + RG ACG+ +
Sbjct: 504 RPLCSPWFIDHAVLLVGYGDRSGIPFWAIKNSWGPDWG-EEGYYYLYRGARACGMNT 559
>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
Length = 360
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP++
Sbjct: 176 LEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQ- 234
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C + V V+V D + D + + P+ + G +
Sbjct: 235 --GVNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPV-----SVAFEVITGFRL 287
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S+ ++NHAV+ VGYG+ VP W+++NSWG WG D+GYF +E G N
Sbjct: 288 YKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWG-DEGYFKMEMGKN 346
Query: 169 ACGIES 174
CG+ +
Sbjct: 347 MCGVAT 352
>gi|2677828|gb|AAB97142.1| cysteine protease [Prunus armeniaca]
Length = 358
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 95/182 (52%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y + LS+ QL++C N GC GG ++A +Y+K+ GL+ EA YP+
Sbjct: 174 LEAAYVQAFRKQISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLDTEAAYPYV- 232
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G G C + A V V+V D + D + L H V ++ A + ++ +
Sbjct: 233 --GTDGACKFSAENVGVQVLDSVNITLGD--EQELKHAVAFVRPVSVAFQVVKSFRIYKS 288
Query: 119 ----NDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+D C S ++NHAV+ VGYG VP W+++NSWG D+GYF +E G N CG+
Sbjct: 289 GVYTSDTCGSSPMDVNHAVLAVGYGEEGGVPFWLIKNSWGESWGDNGYFKMEFGKNMCGV 348
Query: 173 ES 174
+
Sbjct: 349 AT 350
>gi|363737841|ref|XP_001232765.2| PREDICTED: pro-cathepsin H [Gallus gallus]
Length = 327
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 87/180 (48%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G LL L++ QL++C N GC GG ++A +Y L + GL E YP+R
Sbjct: 142 LESAIAIATGKLLSLAEQQLVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRA 201
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
QNG C + K V D + D + + P+ + K +
Sbjct: 202 QNGT---CKFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGV 258
Query: 117 RKNDVCP--SENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
N C + +NHAV+ VGYG P WIV+NSWG DGYF +ERG N CG+ +
Sbjct: 259 YSNPRCEHTPDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAA 318
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 95/182 (52%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE Q+ I GTL+ LS+ QL++C+ N GC GG + + +YLK AG E E +YP+
Sbjct: 142 LEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNYPYTA 201
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF--NGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
+NGV C YD+ V ++ D+ + + + GP+ ++ + Q YN
Sbjct: 202 ENGV---CRYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQLYNSG 258
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGI 172
+ + C S L+H V+ +GYG W+V+NSWG WG +GY + R N CGI
Sbjct: 259 VYYAS-TCSSTQLDHGVLAIGYGTEDGKDYWLVKNSWGTSWGM-EGYIKMSRNRNNNCGI 316
Query: 173 ES 174
+
Sbjct: 317 AT 318
>gi|167427527|gb|ABZ80400.1| cathepsin L4, partial [Fasciola hepatica]
Length = 303
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 88/184 (47%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY + S+ QL++C + N GC GG A +YL+ GLE E+ YP++ +
Sbjct: 118 VEGQYTKNQKANISFSEQQLVDCSGDYGNHGCNGGFMENAYEYLERRGLETESSYPYKAE 177
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMN--GALLQDYNGKL 115
G C YD+R V V + + + ++ GP ++ L G
Sbjct: 178 EGP---CKYDSRLGVVEVFGYFIEHSGIESKLAHLVGDKGPAAVAVDVESDFLMYRGGIY 234
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIES 174
+N C SE+LNH +++VGYG + WIV+NSWG D GY + R N CGI S
Sbjct: 235 ASRN--CSSESLNHGILVVGYGTQDGTDYWIVKNSWGSLWGDHGYIRMARNRDNMCGIAS 292
Query: 175 YGGI 178
+
Sbjct: 293 AASV 296
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 96/181 (53%), Gaps = 16/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI--YNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
+E IK G L+ LS+ QLI+C++ YN+GC GG A +++K + GL E DYP+
Sbjct: 160 IEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPY-- 217
Query: 59 QNGVTGRCAYDARKVKV-RVSDFLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKL 115
G+ G C + K KV + + ++ ++ P+ G++ G + Q Y+ +
Sbjct: 218 -TGIEGTCDQEKSKNKVVTIQGYQKVAQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGV 276
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG----TNACG 171
+ C + NLNH V +VGYG+ WIV+NSWG ++GY +ERG T CG
Sbjct: 277 F--TNYCGT-NLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCG 333
Query: 172 I 172
I
Sbjct: 334 I 334
>gi|118363825|ref|XP_001015136.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89296903|gb|EAR94891.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 355
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 97/180 (53%), Gaps = 15/180 (8%)
Query: 2 LESQYAIKHGTL-LPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFR 57
LES YA+K G + S+ QL++C QGC GG +K +YL +AG ++ EADYP+
Sbjct: 156 LESHYALKTGKKPIQFSEQQLVDCARKFDTQGCDGGLPSKGFEYLAYAGGIQTEADYPYE 215
Query: 58 NQNGVTGRCAYDARKV--KVRVSDFLVFNGSDTFRRMLYHYGPLVAG--MNGALLQDYNG 113
G +C +++ K +V S + F + L +YGP+ +N +G
Sbjct: 216 ---GKDKKCRFNSSKAVAQVEKSFNITFQDENELIYHLANYGPVAIAYEVNDDFDNYKDG 272
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
N E++NHAV+ VGY M + +IV+NSWG+ WG + GYF +E G+N CG+
Sbjct: 273 VFTSSNCSTDPEDVNHAVLAVGYNMTGKY--FIVKNSWGKDWGMN-GYFYIELGSNMCGL 329
>gi|197359120|gb|ACH69776.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 261
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 92/180 (51%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E+ YAI HG L LS+ +L++C++ N C GG +KA +++ GL E DYP+ Q
Sbjct: 77 VETSYAIAHGELRNLSEQELLDCDLANNACNGGDDDKAFRFIHEHGLMREEDYPYVAQR- 135
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMN-GALLQDYNGKLIRK 118
C + D F SD L ++GP+ G+N ++ Y G +
Sbjct: 136 -QNSCLLNEYSGPTTKLDLAYFIASDENAMLEWLVNFGPINVGINVPPDMKLYKGGVYTP 194
Query: 119 NDVCPSENL--NHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ N+ HA+ I+GYG WIV+NSWG ++G +DGY + RG N+CGIE
Sbjct: 195 SPWDCKNNILGTHALNIMGYGTWEDGQKYWIVKNSWGPKYGIEDGYVYMARGENSCGIED 254
>gi|2914594|pdb|1MEM|A Chain A, Crystal Structure Of Cathepsin K Complexed With A Potent
Vinyl Sulfone Inhibitor
gi|28374044|pdb|1NL6|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374045|pdb|1NL6|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374047|pdb|1NLJ|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374048|pdb|1NLJ|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|47168617|pdb|1Q6K|A Chain A, Cathepsin K Complexed With T-butyl(1s)-1-cyclohexyl-2-
Oxoethylcarbamate
gi|55670045|pdb|1TU6|A Chain A, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|55670046|pdb|1TU6|B Chain B, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|62738654|pdb|1YK7|A Chain A, Cathepsin K Complexed With A Cyanopyrrolidine Inhibitor
gi|73535690|pdb|1YK8|A Chain A, Cathepsin K Complexed With A Cyanamide-Based Inhibitor
gi|73535721|pdb|1YT7|A Chain A, Cathepsin K Complexed With A Constrained Ketoamide
Inhibitor
gi|93278849|pdb|2BDL|A Chain A, Cathepsin K Complexed With A Pyrrolidine Ketoamide-Based
Inhibitor
gi|114793438|pdb|2ATO|A Chain A, Crystal Structure Of Human Cathepsin K In Complex With
Myocrisin
gi|114793448|pdb|2AUX|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|114793451|pdb|2AUZ|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|126030469|pdb|2FTD|A Chain A, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
Substituted Azepan-3-One Compound
gi|126030470|pdb|2FTD|B Chain B, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
Substituted Azepan-3-One Compound
gi|157830076|pdb|1ATK|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor E-64
gi|157830085|pdb|1AU0|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Symmetric Diacylaminomethyl
Ketone Inhibitor
gi|157830086|pdb|1AU2|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Propanone Inhibitor
gi|157830087|pdb|1AU3|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Pyrrolidinone Inhibitor
gi|157830088|pdb|1AU4|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Pyrrolidinone Inhibitor
gi|157830146|pdb|1AYU|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Symmetric Biscarbohydrazide
Inhibitor
gi|157830147|pdb|1AYV|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Thiazolhydrazide Inhibitor
gi|157830148|pdb|1AYW|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent
Benzyloxybenzoylcarbohydrazide Inhibitor
gi|157830300|pdb|1BGO|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Peptidomimetic Inhibitor
gi|197305045|pdb|3C9E|A Chain A, Crystal Structure Of The Cathepsin K : Chondroitin Sulfate
Complex.
gi|290560385|pdb|3KW9|A Chain A, X-Ray Structure Of Cathepsin K Covalently Bound To A
Triazine Ligand
gi|290560386|pdb|3KWZ|A Chain A, Cathepsin K In Complex With A Non-Selective 2-Cyano-
Pyrimidine Inhibitor
gi|290560387|pdb|3KX1|A Chain A, Cathepsin K In Complex With A Selective 2-Cyano-Pyrimidine
Inhibitor
gi|293651910|pdb|3KWB|X Chain X, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|293651911|pdb|3KWB|Y Chain Y, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|308198615|pdb|3O1G|A Chain A, Cathepsin K Covalently Bound To A 2-Cyano Pyrimidine
Inhibitor With A Benzyl P3 Group.
gi|327200584|pdb|3O0U|A Chain A, Cathepsin K Covalently Bound To A Cyano-Pyrimidine
Inhibitor With Improved Selectivity Over Herg
gi|394986262|pdb|4DMX|A Chain A, Cathepsin K Inhibitor
gi|394986263|pdb|4DMY|A Chain A, Cathepsin K Inhibitor
gi|394986264|pdb|4DMY|B Chain B, Cathepsin K Inhibitor
Length = 215
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 34 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQ- 92
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 93 --EESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 150
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 151 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 206
>gi|75765285|pdb|1U9V|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abe854
gi|75765286|pdb|1U9W|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abi491
gi|75765287|pdb|1U9X|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abj688
gi|160286063|pdb|2R6N|A Chain A, Crystal Structure Of A Pyrrolopyrimidine Inhibitor In
Complex With Human Cathepsin K
Length = 217
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 36 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQ- 94
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 95 --EESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 152
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 153 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 208
>gi|341878255|gb|EGT34190.1| hypothetical protein CAEBREN_02333 [Caenorhabditis brenneri]
Length = 410
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 95/195 (48%), Gaps = 24/195 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E+QYA+K G LL LS+ +L++C++ + GC GG N A+ + GLE EADYP+
Sbjct: 212 IETQYAMKKGALLSLSEQELVDCDVLSYGCNGGYLNTALLFAIEKGLETEADYPYVAIQ- 270
Query: 62 VTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPL------------------VAG 102
+C+ +K++V++ D + + D + GP+ V
Sbjct: 271 -HKQCSIQTQKIRVKIDDGYHLKANEDQIADWVAREGPVSFCKLLLFLFFFKFFKCSVMP 329
Query: 103 MNGALLQDYNGKLIRKNDVCPSENL-NHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGY 160
+ +++ G C + + NH + IVGYG WIV+NSWG WG + GY
Sbjct: 330 VPKSIMFYRGGIFNPSMAECRGQAVGNHVMAIVGYGREGNQKYWIVKNSWGTSWG-EQGY 388
Query: 161 FTVERGTNACGIESY 175
+ RG N CG +Y
Sbjct: 389 LKMARGVNICGFTNY 403
>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
Length = 348
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 97/198 (48%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L LS+ Q+++C+ + GC GG A YL K GL++E
Sbjct: 145 LEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKSGGLQSE 204
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G C +D K+ +V +F V + D L +GPL +N A +Q
Sbjct: 205 KDYPY---AGRENTCKFDKSKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQT 261
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG P+ WI++NSWG WG + G
Sbjct: 262 YIGGV-----SCPFICGRHLDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWG-EKG 315
Query: 160 YFTVERG---TNACGIES 174
Y+ + RG N CG++S
Sbjct: 316 YYKICRGPHDKNKCGVDS 333
>gi|315364648|pdb|3OVZ|A Chain A, Cathepsin K In Complex With A Covalent Inhibitor With A
Ketoamide Warhead
Length = 213
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 32 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQ- 90
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 91 --EESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 148
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 149 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 204
>gi|229595078|ref|XP_001020175.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225566400|gb|EAR99930.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 375
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 97/180 (53%), Gaps = 15/180 (8%)
Query: 2 LESQYAIKHGTL-LPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFR 57
LES YA+K G + S+ QL++C QGC GG +K +YL +AG ++ EADYP+
Sbjct: 156 LESHYALKTGKKPIQFSEQQLVDCARKFDTQGCDGGLPSKGFEYLAYAGGIQTEADYPYE 215
Query: 58 NQNGVTGRCAYDARKV--KVRVSDFLVFNGSDTFRRMLYHYGPLVAG--MNGALLQDYNG 113
G +C +++ K +V S + F + L +YGP+ +N +G
Sbjct: 216 ---GKDKKCRFNSSKAVAQVEKSFNITFQDENELIYHLANYGPVAIAYEVNDDFDNYEDG 272
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
N E++NHAV+ VGY M + +IV+NSWG+ WG + GYF +E G+N CG+
Sbjct: 273 VFTSSNCSTDPEDVNHAVLAVGYNMTGKY--FIVKNSWGKDWGMN-GYFYIELGSNMCGL 329
>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
Length = 476
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 99/179 (55%), Gaps = 6/179 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
+E Q+ +KHG LL LS+ +L++C+ + C+GG + A + ++ GLEAE DY + +
Sbjct: 296 IEGQWFLKHGKLLSLSEQELVDCDGLDHACRGGLPSNAYEAIEGLGGLEAENDYTY---S 352
Query: 61 GVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G +C++ KV + S + + + L GP+ +N +Q Y +
Sbjct: 353 GHKQKCSFATEKVAAYINSSVELPSDENEMAAWLAENGPVSVALNAFAMQFYKKGVSHPW 412
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESYGG 177
+ C ++HAV++VGYG R+ +P W ++NSWG ++GY+ + +G+NACGI G
Sbjct: 413 MILCNPWMIDHAVLLVGYGERNGIPFWAIKNSWGEDYGEEGYYYLYKGSNACGINKMGS 471
>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
Length = 384
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 97/198 (48%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L LS+ Q+++C+ + GC GG A YL K GL++E
Sbjct: 181 LEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKSGGLQSE 240
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G C +D K+ +V +F V + D L +GPL +N A +Q
Sbjct: 241 KDYPY---AGRENTCKFDKSKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQT 297
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG P+ WI++NSWG WG + G
Sbjct: 298 YIGGV-----SCPFICGRHLDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWG-EKG 351
Query: 160 YFTVERG---TNACGIES 174
Y+ + RG N CG++S
Sbjct: 352 YYKICRGPHDKNKCGVDS 369
>gi|50513589|pdb|1SNK|A Chain A, Cathepsin K Complexed With Carbamate Derivatized
Norleucine Aldehyde
Length = 214
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 33 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQ- 91
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 92 --EESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 149
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 150 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 205
>gi|343472975|emb|CCD15017.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 293
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 88/180 (48%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ I L LS L+ C+ N GC+GG ++A Q++ + E YP+ +
Sbjct: 91 IEGQWKIAGHELTSLSGQMLVSCDKKNYGCEGGLMDRAFQWIVSSNKGNVFTEQSYPYDS 150
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C + V ++S ++ + L GP+ ++ + Y G ++
Sbjct: 151 SWGDVPACNMSGKVVGAKISSYVDLPQDENAIAEWLAKNGPVAIAVDATSFRSYTGGVLT 210
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYG 176
C S L+H V++VGY + P WI++NSWG+ WG + GY +E+GTN C ++ Y
Sbjct: 211 S---CISRRLDHGVLLVGYDDTSKPPYWIIKNSWGKGWG-EWGYIRIEKGTNQCLVQEYA 266
>gi|225706914|gb|ACO09303.1| Cathepsin H precursor [Osmerus mordax]
Length = 328
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 89/180 (49%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI G LL LS+ QL++C N GC GG ++A +Y+K+ GL E DYP+
Sbjct: 145 LESVTAISTGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKYNKGLMTEDDYPYTA 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM--LYHYGPLVAG--MNGALLQDYNGK 114
Q+G C + + V D + D + + P+ + + ++G
Sbjct: 205 QDGT---CKFKPERAAAFVKDVVNITMYDEMGMVDAVARLNPVSMAYEVTSDFMHYHSGV 261
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
++ +NHAV+ VGY + P WIV+NSWG + GYF +ERG N CG+ +
Sbjct: 262 YSSSECHNTTDTVNHAVLAVGYDEENVTPYWIVKNSWGPFWGMKGYFFIERGKNMCGLSA 321
>gi|149030666|gb|EDL85703.1| cathepsin S [Rattus norvegicus]
Length = 291
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 93/182 (51%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP+
Sbjct: 107 LEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDNGGIDSEASYPY 166
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ + +C YD + S + L F + + + GP+ G++ + + +
Sbjct: 167 K---AMDEKCHYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGIDASHSSFFLYQ 223
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D +EN+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 224 SGVYDDPSCTENVNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIA 283
Query: 174 SY 175
SY
Sbjct: 284 SY 285
>gi|403302734|ref|XP_003942008.1| PREDICTED: cathepsin K isoform 1 [Saimiri boliviensis boliviensis]
Length = 329
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|452258|emb|CAA80446.1| cathepsin L-like protease [Fasciola hepatica]
Length = 326
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 87/184 (47%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q+ S+ QL++C + N GC GG A +YLKH GLE E+ YP++
Sbjct: 141 VEGQFRKNERASASFSEQQLVDCTRDFGNYGCGGGYMENAYEYLKHNGLETESYYPYQ-- 198
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRR---MLYHYGPLVAGMNGALLQDYNGKLI 116
V G C YD R +V+ + + D + P VA + Y I
Sbjct: 199 -AVEGPCQYDGRLAYAKVTGYYTVHSGDEIELKNLVGTEDLPAVALDADSDFMMYQSG-I 256
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGY--FTVERGTNACGIES 174
++ C + L HAV+ VGYG + WIV+NSWG W +DGY F RG N CGI S
Sbjct: 257 YQSQTCLPDRLTHAVLAVGYGSQDGTDYWIVKNSWGTWWGEDGYIRFARNRG-NMCGIAS 315
Query: 175 YGGI 178
+
Sbjct: 316 LASV 319
>gi|308495037|ref|XP_003109707.1| hypothetical protein CRE_07390 [Caenorhabditis remanei]
gi|308245897|gb|EFO89849.1| hypothetical protein CRE_07390 [Caenorhabditis remanei]
Length = 405
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 95/179 (53%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF--RNQ 59
+E+ YAI HG LS+ L++C++ + C GG +KA +Y+ GL D P+ Q
Sbjct: 222 VEAAYAIAHGERRNLSEQTLLDCDLVDNACDGGDEDKAFRYIHRQGLAYSVDLPYVAHRQ 281
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGAL-LQDYNGKLIRK 118
N ++ ++K + + + + D+ L ++GP+ GM+ ++ Y G +
Sbjct: 282 NNCVVNDHWNTTRIK---AAYFLHHDEDSIINWLVNFGPVNIGMSVIQPMRAYKGGVFTP 338
Query: 119 ND-VCPSENLN-HAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
++ C +E + HA++I GYG + WIV+NSWG WG + GY RG NACGIE
Sbjct: 339 SEYACKNEVIGLHALLITGYGTSDKGEKYWIVKNSWGNTWGVEHGYIYFARGINACGIE 397
>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
Length = 381
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 97/198 (48%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L LS+ Q+++C+ + GC GG A YL K GL++E
Sbjct: 178 LEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKSGGLQSE 237
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G C +D K+ +V +F V + D L +GPL +N A +Q
Sbjct: 238 KDYPY---AGRENTCKFDKSKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQT 294
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG P+ WI++NSWG WG + G
Sbjct: 295 YIGGV-----SCPFICGRHLDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWG-EKG 348
Query: 160 YFTVERG---TNACGIES 174
Y+ + RG N CG++S
Sbjct: 349 YYKICRGPHDKNKCGVDS 366
>gi|47523662|ref|NP_999467.1| cathepsin K precursor [Sus scrofa]
gi|15213940|sp|Q9GLE3.1|CATK_PIG RecName: Full=Cathepsin K; Flags: Precursor
gi|10048286|gb|AAG12340.1|AF292030_1 cathepsin K precursor [Sus scrofa]
Length = 330
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 149 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 208
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 209 E---NCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 265
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 266 YDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWG-NKGYILMARNKNNACGI 321
>gi|321476447|gb|EFX87408.1| hypothetical protein DAPPUDRAFT_207683 [Daphnia pulex]
Length = 339
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/168 (36%), Positives = 88/168 (52%), Gaps = 11/168 (6%)
Query: 14 LPLSKSQLIECNI--YNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGVTGRCAYDA- 70
+ LS+ QLI+C+ N+GC GG AI+ G+ YP++ G C Y A
Sbjct: 170 VTLSEQQLIDCDRGGLNKGCHGGFSMTAIESTMLTGIATGLQYPYKKTGG---PCKYVAN 226
Query: 71 -RKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRKNDVCPSENLN 128
+ V++ +++ + L GPL A M DY + ND C ++ N
Sbjct: 227 MKAASVKLCNYIEGGSIVDMKYALTKLGPLSATMTVTDSFADYGSGVYDSND-CDGQDPN 285
Query: 129 HAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
HAVV+VG+G ++ + WI RNSWG WG + GYF ++RG N CGIESY
Sbjct: 286 HAVVLVGWGNQNGIDYWIGRNSWGTGWGKE-GYFLIQRGVNKCGIESY 332
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 95/172 (55%), Gaps = 9/172 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+E A+ G L+ LS+ +L+EC+ N GC+GG + A ++ + + G+++E+DYP+
Sbjct: 173 MEGINALVTGDLISLSEQELVECDTSNYGCEGGYMDYAFEWVINNGGIDSESDYPY---T 229
Query: 61 GVTGRCAYDARKVKV-RVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALL--QDYNGKLIR 117
GV G C + KV + + SD+ P+ G++G+ + Q Y G +
Sbjct: 230 GVDGTCNTTKEETKVVSIDGYQDVEQSDSALLCAVAQQPVSVGIDGSAIDFQLYTGGIYD 289
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ ++++HAV+IVGYG WIV+NSWG WG DGYF ++R T+
Sbjct: 290 GSCSDDPDDIDHAVLIVGYGSEDSEEYWIVKNSWGTSWGI-DGYFYLKRDTD 340
>gi|301103045|ref|XP_002900609.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262101872|gb|EEY59924.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 376
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 94/181 (51%), Gaps = 16/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES +KHG LS+ L++C N N GC GG + A +Y+K+ GL+ E YP+
Sbjct: 192 LESHVKLKHGEFTILSEQNLLDCAQNFDNHGCNGGLPSHAFEYIKYNGGLDTEETYPYEA 251
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDY----N 112
+ G +C ++ V V+V + + R + GP+ ++ D+ +
Sbjct: 252 KEG---KCKFNTYHVGVQVDQVVNITTRNENELRAAVGSTGPVSIAFQ--VVSDFRFYES 306
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
G K +++NHAV+ VGYG+ WIV+NSWG +WG DG+F + RG+N CG
Sbjct: 307 GVYESKECRSDEKDVNHAVLAVGYGVEDGKDHWIVKNSWGSQWGM-DGFFQIARGSNMCG 365
Query: 172 I 172
+
Sbjct: 366 V 366
>gi|1834307|dbj|BAA09820.1| cysteine proteinase [Spirometra erinaceieuropaei]
gi|1834309|dbj|BAA09821.1| cysteine proteinase [Spirometra erinaceieuropaei]
Length = 336
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 91/181 (50%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E IK G L LS+ QL++C + NQGC GG +A QY + G+EAE DY + +
Sbjct: 154 IEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDYRYTER 213
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGA---LLQDYNGK 114
+GV C Y V V+ + D +R + GP+ G++ A + +G
Sbjct: 214 DGV---CRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYSHGV 270
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIE 173
+ K C ++H V++VGYG + W+V+NSWG +DGY + R N CGI
Sbjct: 271 FVSK--TCSPYAIDHGVLVVGYGAENGDAYWLVKNSWGSSWGEDGYLKMARNRNNMCGIA 328
Query: 174 S 174
S
Sbjct: 329 S 329
>gi|42564159|gb|AAS20591.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 99/181 (54%), Gaps = 16/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNK-AIQYLKHAGLEAEADYPFRN 58
LE Q AI + +PLS+ QL++C+ N C+ GG A Y+ G+EA++ YP++
Sbjct: 143 LEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDKGIEADSSYPYK- 201
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G+ C YDA+K +++ + V N + ++ + GP+ ++ +Q Y G ++
Sbjct: 202 --GIDTPCQYDAKKTVLKIKGYKNVSNSEEELKKAVGTVGPVSVAIDADPIQLYFGGIL- 258
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQV----PVWIVRNSWGR-WGPDDGYFTVER-GTNACG 171
+ + + NLNH V+ VGYG + W V+NSWG+ WG + GYF ++R N CG
Sbjct: 259 -DGLFCTHNLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDWG-EQGYFRIKRDANNLCG 316
Query: 172 I 172
I
Sbjct: 317 I 317
>gi|301767944|ref|XP_002919404.1| PREDICTED: cathepsin K-like [Ailuropoda melanoleuca]
gi|281352889|gb|EFB28473.1| hypothetical protein PANDA_008011 [Ailuropoda melanoleuca]
Length = 330
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 149 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 208
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 209 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 265
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 266 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 321
>gi|115533516|ref|NP_001041281.1| Protein R07E3.1, isoform b [Caenorhabditis elegans]
gi|85539716|emb|CAJ58500.1| Protein R07E3.1, isoform b [Caenorhabditis elegans]
Length = 348
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 96/181 (53%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF--RNQ 59
+E+ +AI HG LS+ L++C++ + C GG +KA +Y+ GL D P+ Q
Sbjct: 165 VEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGGDEDKAFRYIHRNGLANAVDLPYVAHRQ 224
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQD---YNGKLI 116
NG ++ ++K + + + + D+ L ++GP+ GM A++Q Y G +
Sbjct: 225 NGCAVNDHWNTTRIK---AAYFLHHDEDSIINWLVNFGPVNIGM--AVIQPMRAYKGGVF 279
Query: 117 RKND-VCPSENLN-HAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
++ C +E + HA++I GYG WIV+NSWG WG + GY RG NACGI
Sbjct: 280 TPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWGVEHGYIYFARGINACGI 339
Query: 173 E 173
E
Sbjct: 340 E 340
>gi|410968296|ref|XP_003990643.1| PREDICTED: cathepsin K [Felis catus]
Length = 330
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 149 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 208
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 209 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 265
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 266 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 321
>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
Length = 327
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 97/176 (55%), Gaps = 6/176 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K LL LS+ QL++C+ ++GC GG +A Q L GL+ ++DYP+
Sbjct: 147 IEGQWFRKTDNLLQLSEQQLLDCDEVDEGCNGGTPQQAFKQILGMGGLQLDSDYPYE--- 203
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYN-GKLIRK 118
G G+C KVKV ++ + + + +ML GP + +N LQ Y G L
Sbjct: 204 GREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPFSSALNALSLQFYTEGILHPL 263
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+C +++LNHAV+ VGYG ++P W V+NSW ++GYF + RG CGI +
Sbjct: 264 PALCDAQSLNHAVLTVGYGKEGRLPYWTVKNSWSTMFGENGYFRIYRGDGPCGINT 319
>gi|403302736|ref|XP_003942009.1| PREDICTED: cathepsin K isoform 2 [Saimiri boliviensis boliviensis]
Length = 383
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 202 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 261
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 262 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 318
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 319 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 374
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 98/181 (54%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
LE Q+A GTL+ LS+ L++C+ N+GC+GG ++ QY+ ++ G++ E YP++
Sbjct: 147 LEGQHAKATGTLVSLSEQNLVDCSRQEGNKGCEGGDMDQGFQYIIQNKGIDTEQCYPYKA 206
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
+N RC +D + +S F + D ++ + GP+ G++ + Q Y+
Sbjct: 207 KNH---RCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSG 263
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+ + + C S L+H V++VGYG W+V+NSWG ++GY + R N CG+
Sbjct: 264 VYNEFE-CSSTKLDHGVLVVGYGTYGSKDYWLVKNSWGTVWGNEGYIMMSRNKDNQCGVA 322
Query: 174 S 174
+
Sbjct: 323 T 323
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 96/182 (52%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ K GTL+ LS+ QL++C + N GC GG + A QY++ + G++ E YP+
Sbjct: 151 LEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGIDTEESYPYEA 210
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFN--GSDTFRRMLYHYGPLVAGMNGALL--QDYNGK 114
+NG +C Y+ + + + + D + + GP+ G++ + + Q Y
Sbjct: 211 ENG---KCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSFQFYESG 267
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGI 172
+ + D C S L+H V+ VGYG W+V+NSWG WG D GY + R +N CGI
Sbjct: 268 VYNEPD-CSSLELDHGVLAVGYGTEDGNDYWLVKNSWGLEWG-DKGYIKMSRNKSNQCGI 325
Query: 173 ES 174
+
Sbjct: 326 AT 327
>gi|7271897|gb|AAF44679.1|AF239268_1 cathepsin L, partial [Fasciola gigantica]
Length = 219
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 88/178 (49%), Gaps = 8/178 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q+ G + S+ QL++C + N GC+GG A +YL+ GLE E+ YP+R
Sbjct: 34 MEGQFMKNIGFNVSFSEQQLVDCSSDFGNNGCRGGLMEIAYEYLRRFGLEIESTYPYR-- 91
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
V G C YD R +V+ + + + D + ++ GP ++ I
Sbjct: 92 -AVEGPCRYDRRLGVAKVTGYYIVHSGDEVELQNLVGIEGPAAVALDVESDFVMYRSGIY 150
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
++ C + LNH V+ VGYG + WIV+NSWG W + GY + R N CGI S
Sbjct: 151 QSQTCSPDRLNHGVLAVGYGTQSGTDYWIVKNSWGTWWGEGGYIRMVRNRGNMCGIAS 208
>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
Length = 360
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y+ G + LS+ QL++C N GC GG ++A +Y+K + GL+ E YP+
Sbjct: 176 LEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGK 114
+NG+ C + + V V+V D + D + + P+ Q +G
Sbjct: 236 KNGL---CKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGV 292
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N CGI
Sbjct: 293 YTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKNMCGIA 351
Query: 174 S 174
+
Sbjct: 352 T 352
>gi|836934|gb|AAA95998.1| cathepsin X [Homo sapiens]
Length = 329
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
Length = 377
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 92/177 (51%), Gaps = 12/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 174 LEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPY-- 231
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + A+ + V+V D + D + + P+ + K +
Sbjct: 232 -TGKDGGCKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGV 290
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
++ C + ++NHAV+ VGYG+ VP W+++NSW G WG D+GYF +E G N C
Sbjct: 291 FTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWG-DNGYFKMEMGKNMC 346
>gi|392873946|gb|AFM85805.1| cathepsin H [Callorhinchus milii]
Length = 259
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 91/181 (50%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AIK G LL L++ QL++C N GC GG ++A +Y+K+ GLEAE DYP+
Sbjct: 74 LESAIAIKTGKLLSLAEQQLVDCAGAYKNHGCNGGLPSQAFEYIKYNGGLEAEKDYPYTA 133
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAG--MNGALLQDYNGK 114
Q+ C Y K V + + D + P+ + Q G
Sbjct: 134 QDQ---HCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNPVSIAFEVTDDFFQYEGGV 190
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
N + +NHAV+ VGYG+++ WIV+NSWG WG +GYF + RG N CG+
Sbjct: 191 YSNSNCDSTPDKVNHAVLAVGYGVQNGTKYWIVKNSWGPEWGL-NGYFYIIRGKNMCGLA 249
Query: 174 S 174
+
Sbjct: 250 A 250
>gi|402856109|ref|XP_003892642.1| PREDICTED: cathepsin K [Papio anubis]
Length = 348
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 167 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 226
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 227 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 283
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 284 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 339
>gi|77404197|ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris]
gi|122056102|sp|Q3ZKN1.1|CATK_CANFA RecName: Full=Cathepsin K; Flags: Precursor
gi|58047562|gb|AAW65150.1| cathepsin K [Canis lupus familiaris]
Length = 330
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 149 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 208
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 209 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 265
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 266 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 321
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 99/182 (54%), Gaps = 13/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN-IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQ 59
+ES AI G L+ LS+ +L++C+ YN+GC GG + A ++ +K+ G++ E DYP++ +
Sbjct: 171 MESINAIVTGNLISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKER 230
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVA-GMNGALLQDYNGKLIRK 118
NGV + +A+ VK+ + + N ++ + H +A G Q Y +
Sbjct: 231 NGVCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTG 290
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESYGG 177
C + ++H VVI GYG + + WIVRNSWG WG ++GY V+R + S G
Sbjct: 291 K--CGTA-VDHGVVIAGYGTENGMDYWIVRNSWGANWG-ENGYLRVQR-----NVASSSG 341
Query: 178 IC 179
+C
Sbjct: 342 LC 343
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y+ G + LS+ QL++C N GC GG ++A +Y+K + GL+ E YP+
Sbjct: 176 LEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGK 114
+NG+ C + + V V+V D + D + + P+ Q +G
Sbjct: 236 KNGL---CKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGV 292
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N CGI
Sbjct: 293 YTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKNMCGIA 351
Query: 174 S 174
+
Sbjct: 352 T 352
>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
Length = 364
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 97/198 (48%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + + G L LS+ Q+++C+ + GC GG A YL K GL++E
Sbjct: 161 LEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKSGGLQSE 220
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ G C +D K+ +V +F V + D L +GPL +N A +Q
Sbjct: 221 KDYPY---AGRENTCKFDKSKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQT 277
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG P+ WI++NSWG WG + G
Sbjct: 278 YIGGV-----SCPFICGRHLDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWG-EKG 331
Query: 160 YFTVERG---TNACGIES 174
Y+ + RG N CG++S
Sbjct: 332 YYKICRGPHDKNKCGVDS 349
>gi|268570635|ref|XP_002640795.1| Hypothetical protein CBG15672 [Caenorhabditis briggsae]
Length = 396
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 66/184 (35%), Positives = 95/184 (51%), Gaps = 18/184 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ESQ+AIK GTL LS+ +L++C+ + GC GG +KA+ ++ GLE E DYP+ +
Sbjct: 214 VESQFAIKKGTLWSLSEQELVDCDRDSYGCNGGFMDKALSWILGNGLETEDDYPY---DA 270
Query: 62 VT-GRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
V +C + RK +V V + + + N D + GP+ M L + + K
Sbjct: 271 VRHDQCYLNGRKTRVWVDEGYRLANNEDFIADWVDSVGPVSFAM--KLPKSFYS--YSKG 326
Query: 120 DVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
PSE N HA+ ++GYG WIV+NSWG WG D GY + RG N CG
Sbjct: 327 IYHPSERECNDPNNGYHAMTLIGYGNEGGQLYWIVKNSWGSGWG-DQGYMRLARGQNVCG 385
Query: 172 IESY 175
Y
Sbjct: 386 AGEY 389
>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 347
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 65/186 (34%), Positives = 101/186 (54%), Gaps = 21/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
L + A+K G L+ LSK QL++C+ N+GC+GG ++A +Y+++ G+E+E DYP+++
Sbjct: 163 LSAHLALKTGQLISLSKQQLLDCSRSFNNRGCKGGLPSQAFEYIRYNGGIESERDYPYKD 222
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFN--GSDTFRRMLYHYGPLVAGMNG----ALLQD-- 110
+ +C + V V+ + F D L + GP+ G++ A +
Sbjct: 223 REE---KCHFKPSLVAATVTGVVNFTQGAEDDIAVALANIGPVSIGIHSTKSFATYKKGI 279
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTN 168
Y GKL KN +NHAV+IVGY WI +NSWG WG + GYF + RG N
Sbjct: 280 YQGKLCSKN----PRKINHAVLIVGYDQTASGEKYWIGKNSWGTNWGMN-GYFWIRRGHN 334
Query: 169 ACGIES 174
ACG+ +
Sbjct: 335 ACGLAT 340
>gi|255088003|ref|XP_002505924.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226521195|gb|ACO67182.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 291
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 99/195 (50%), Gaps = 27/195 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---------NQGCQGGGFNKAIQYLKHAGLEAEA 52
+E +K G L+ LS+ QL++C+ + GC GG A++Y++ GL+ E+
Sbjct: 95 VEGANFLKTGELVSLSEQQLVDCDHTCDPSAPRNCDYGCNGGLPLNAMRYVQKHGLDTES 154
Query: 53 DYPFRNQNGVTGRCAYDAR--KVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQ 109
+YP++ GV G+CA AR VS F LV L +GPL G++ A +Q
Sbjct: 155 NYPYK---GVDGKCA-SARHGPAAASVSSFNLVSTNETQIAAALLKHGPLSIGIDAAWMQ 210
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPV---------WIVRNSWG-RWGPDDG 159
Y G + +C L+H V+IVGYG+ P WIV+NSWG WG + G
Sbjct: 211 TYVGGVACPW-ICNKAGLDHGVLIVGYGVNGTAPARPWHRRQDYWIVKNSWGPNWGVEGG 269
Query: 160 YFTVERGTNACGIES 174
Y+ + + ACG+ +
Sbjct: 270 YYHICKDRAACGLNT 284
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 97/182 (53%), Gaps = 10/182 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI---YNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C++ N+GC GG +A QY+ + G+E+EA YP++
Sbjct: 159 LEAQVKLKTGKLVSLSAQNLVDCSVGKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYK 218
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G+C YD++ S + L + D + + + GP+ ++ + + +
Sbjct: 219 ---AMDGKCQYDSKYRAATCSRYTELPEDSEDALKEAVANKGPVSVAIDASHPSFFLYRS 275
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D + ++NH V++VGYG + W+V+NSWG D GY + R + N CGI S
Sbjct: 276 GVYYDPACTLHVNHGVLVVGYGNLNGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIAS 335
Query: 175 YG 176
Y
Sbjct: 336 YA 337
>gi|4503151|ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens]
gi|1168793|sp|P43235.1|CATK_HUMAN RecName: Full=Cathepsin K; AltName: Full=Cathepsin O; AltName:
Full=Cathepsin O2; AltName: Full=Cathepsin X; Flags:
Precursor
gi|562757|emb|CAA57649.1| Cathepsin O [Homo sapiens]
gi|606923|gb|AAA65233.1| cathepsin O [Homo sapiens]
gi|1195556|gb|AAB35521.1| cathepsin O2 [Homo sapiens]
gi|16359188|gb|AAH16058.1| Cathepsin K [Homo sapiens]
gi|49456311|emb|CAG46476.1| CTSK [Homo sapiens]
gi|60823594|gb|AAX36649.1| cathepsin K [synthetic construct]
gi|119573901|gb|EAW53516.1| cathepsin K (pycnodysostosis), isoform CRA_b [Homo sapiens]
gi|307685681|dbj|BAJ20771.1| cathepsin K [synthetic construct]
gi|312150424|gb|ADQ31724.1| cathepsin K [synthetic construct]
Length = 329
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|387915132|gb|AFK11175.1| cathspsin H [Callorhinchus milii]
Length = 330
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 91/181 (50%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AIK G LL L++ QL++C N GC GG ++A +Y+K+ GLEAE DYP+
Sbjct: 145 LESAIAIKTGKLLSLAEQQLVDCAGAYKNHGCNGGLPSQAFEYIKYNGGLEAEKDYPYTA 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAG--MNGALLQDYNGK 114
Q+ C Y K V + + D + P+ + Q G
Sbjct: 205 QDQ---HCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNPVSIAFEVTDDFFQYEGGV 261
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
N + +NHAV+ VGYG+++ WIV+NSWG WG +GYF + RG N CG+
Sbjct: 262 YSNSNCDSTPDKVNHAVLAVGYGVQNGTKYWIVKNSWGPEWGL-NGYFYIIRGKNMCGLA 320
Query: 174 S 174
+
Sbjct: 321 A 321
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 94/177 (53%), Gaps = 19/177 (10%)
Query: 8 IKHGTLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQNGVTGR 65
I G L+ LS+ +L++C N YNQGC GG + A Q+ +K+ GL+ E DYP+R G G+
Sbjct: 184 IVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFIMKNGGLKTEKDYPYR---GFGGK 240
Query: 66 CAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRKNDV 121
C + KV D V +T + P+ + G + Q Y + N
Sbjct: 241 CNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAIEAGGRIFQHYQTGIFTGN-- 298
Query: 122 CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-----TNACGI 172
C + NL+HAVV VGYG + V WIVRNSWG RWG ++GY +ER + CGI
Sbjct: 299 CGT-NLDHAVVAVGYGSENGVDYWIVRNSWGPRWG-EEGYIRMERNLASSKSGKCGI 353
>gi|157862757|gb|ABV90501.1| cathepsin L, partial [Fasciola gigantica]
Length = 244
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 89/182 (48%), Gaps = 8/182 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q+ G + S+ QL++C + N GC+GG A +YL+ GLE E+ YP+R
Sbjct: 59 MEGQFMKNIGFNVSFSEQQLVDCSSDFGNNGCRGGLMEIAYEYLRRFGLEIESTYPYR-- 116
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
V G C YD R +V+ + + + D + ++ GP ++ I
Sbjct: 117 -AVEGPCRYDRRLGVAKVTGYYIVHSGDEVELQNLVGIEGPAAVALDVESDFVMYRSGIY 175
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESYG 176
++ C + LNH V+ VGYG + WIV+NSWG W + GY + R N CGI S
Sbjct: 176 QSQTCSPDRLNHGVLAVGYGTQSGTDYWIVKNSWGTWWGEGGYIRMVRNRGNMCGIASMA 235
Query: 177 GI 178
+
Sbjct: 236 SL 237
>gi|426331364|ref|XP_004026652.1| PREDICTED: cathepsin K [Gorilla gorilla gorilla]
Length = 329
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|119573900|gb|EAW53515.1| cathepsin K (pycnodysostosis), isoform CRA_a [Homo sapiens]
Length = 288
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 107 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 166
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 167 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 223
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 224 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 279
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/187 (32%), Positives = 97/187 (51%), Gaps = 19/187 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q +K GTL LS+ QL++C+ N GCQGG + A +Y++ + G+++EA YP+
Sbjct: 141 LEGQTFLKKGTLPSLSEQQLVDCSDKYGNHGCQGGLMDNAFKYIEANGGIDSEASYPYEA 200
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
+NG +C + V + + + + D + + + GP+ M+ + Q Y
Sbjct: 201 KNG---KCRFQQSAVAATCTGYKDIPHDDIDGLQDAVANVGPISVAMDASHSSFQLYAAG 257
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMR------HQVPVWIVRNSWG-RWGPDDGYFTVERGT 167
+ +C S L+H V+ VGYG + P W+V+NSWG WG GYF + R
Sbjct: 258 -VYDPLLCSSTRLDHGVLAVGYGTEPSGLFHEEKPYWLVKNSWGPDWG-QQGYFKIVRKD 315
Query: 168 NACGIES 174
N CGI +
Sbjct: 316 NKCGIAT 322
>gi|93279455|pdb|2F7D|A Chain A, A Mutant Rabbit Cathepsin K With A Nitrile Inhibitor
Length = 215
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 90/176 (51%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ ++ G+++E YP+ Q+
Sbjct: 34 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQD 93
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 94 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 150
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG + GY + R NACGI
Sbjct: 151 YDENCSSDNLNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILMARNKNNACGI 206
>gi|74136185|ref|NP_001027984.1| cathepsin K precursor [Macaca mulatta]
gi|47117667|sp|P61276.1|CATK_MACFA RecName: Full=Cathepsin K; Flags: Precursor
gi|47117668|sp|P61277.1|CATK_MACMU RecName: Full=Cathepsin K; Flags: Precursor
gi|3236470|gb|AAC23694.1| cathepsin K [Macaca fascicularis]
gi|4927694|gb|AAD33249.1| cathepsin K [Macaca mulatta]
gi|355558400|gb|EHH15180.1| hypothetical protein EGK_01237 [Macaca mulatta]
gi|355763132|gb|EHH62118.1| hypothetical protein EGM_20317 [Macaca fascicularis]
gi|380809978|gb|AFE76864.1| cathepsin K preproprotein [Macaca mulatta]
gi|383416065|gb|AFH31246.1| cathepsin K preproprotein [Macaca mulatta]
gi|384945478|gb|AFI36344.1| cathepsin K preproprotein [Macaca mulatta]
Length = 329
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
Length = 490
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/175 (34%), Positives = 100/175 (57%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K GTLL LS+ +L++C+ ++GC GG + A +K GLE E DY +R
Sbjct: 310 VEGQWFLKQGTLLSLSEQELLDCDKVDKGCMGGLPSNAYSAIKTLGGLETEEDYSYR--- 366
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C+++A K KV ++D + + ++ L GP+ +N +Q Y +
Sbjct: 367 GHLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGPISVAINAFGMQFYRHGISHPL 426
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+C ++HAV++VGYG R P W ++NSWG WG ++GY+ + RG+ ACG+
Sbjct: 427 RPLCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWG-EEGYYYLYRGSGACGV 480
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 95/181 (52%), Gaps = 16/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI--YNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
+E IK G L+ LS+ QLI+C++ YN+GC GG A +++K + GL E DYP+
Sbjct: 160 IEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPY-- 217
Query: 59 QNGVTGRCAYDARKVKV-RVSDFLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKL 115
G+ G C + K KV + + ++ ++ P+ G++ G + Q Y+ +
Sbjct: 218 -TGIEGTCDQEKAKNKVVTIQGYQKVAQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGV 276
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG----TNACG 171
C + NLNH V +VGYG+ WIV+NSWG ++GY +ERG T CG
Sbjct: 277 F--TSYCGT-NLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGISEDTGKCG 333
Query: 172 I 172
I
Sbjct: 334 I 334
>gi|49456399|emb|CAG46520.1| CTSK [Homo sapiens]
Length = 329
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|33242884|gb|AAQ01146.1| cathepsin [Petromyzon marinus]
Length = 333
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 92/184 (50%), Gaps = 14/184 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
LE Q+ G L LS+ QL++C + YN GC GG +A+QY+ + G+++E YP+ +
Sbjct: 150 LEGQHFAATGNLTSLSEQQLVDCTKSYYNNGCNGGRSERALQYIIDNNGIDSELSYPYEH 209
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGS---DTFRRMLYHYGPLVAGMNGAL--LQDYNG 113
+G +C + V + S + S + R+ + GP+ MN L + Y
Sbjct: 210 ADG---KCRFKPANVATKCSSYQFVEPSSNEEVLRQAVASVGPIAIAMNADLDTFKHYKS 266
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
L N+ ++ NHA+++VGYG WIV+NSWG WG + + N CGI
Sbjct: 267 GLF--NEPSCDKSPNHAMLVVGYGSLSGNDFWIVKNSWGEDWGEKGYIYMIRNKDNQCGI 324
Query: 173 ESYG 176
S G
Sbjct: 325 ASIG 328
>gi|395729888|ref|XP_002810309.2| PREDICTED: cathepsin K [Pongo abelii]
Length = 343
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 162 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 221
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 222 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 279 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 334
>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 357
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 92/177 (51%), Gaps = 12/177 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 174 LEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPY-- 231
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + A+ + V+V D + D + + P+ + K +
Sbjct: 232 -TGKDGGCKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGV 290
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
++ C + ++NHAV+ VGYG+ VP W+++NSW G WG D+GYF +E G N C
Sbjct: 291 FTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWG-DNGYFKMEMGKNMC 346
>gi|6978721|ref|NP_037071.1| pro-cathepsin H precursor [Rattus norvegicus]
gi|115729|sp|P00786.1|CATH_RAT RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|55886|emb|CAA68699.1| cathepsin H pre-pro-peptide [Rattus norvegicus]
gi|55391460|gb|AAH85352.1| Cathepsin H [Rattus norvegicus]
gi|149018921|gb|EDL77562.1| cathepsin H, isoform CRA_a [Rattus norvegicus]
gi|226475|prf||1514114A cathepsin H
Length = 333
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 100/186 (53%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G ++ L++ QL++C N N GCQGG ++A +Y L + G+ E YP+
Sbjct: 148 LESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIG 207
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+NG +C ++ K V + + N + Y P+ + +D+ ++
Sbjct: 208 KNG---QCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFE--VTEDF---MM 259
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S + +NHAV+ VGYG ++ + WIV+NSWG WG ++GYF +ERG N
Sbjct: 260 YKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWG-NNGYFLIERGKN 318
Query: 169 ACGIES 174
CG+ +
Sbjct: 319 MCGLAA 324
>gi|6435586|pdb|7PCK|A Chain A, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435587|pdb|7PCK|B Chain B, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435588|pdb|7PCK|C Chain C, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435589|pdb|7PCK|D Chain D, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435592|pdb|1BY8|A Chain A, The Crystal Structure Of Human Procathepsin K
Length = 314
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 133 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 192
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 193 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 249
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 250 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 305
>gi|60654335|gb|AAX29858.1| cathepsin K [synthetic construct]
gi|60654337|gb|AAX29859.1| cathepsin K [synthetic construct]
Length = 330
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|397492864|ref|XP_003817340.1| PREDICTED: cathepsin K [Pan paniscus]
Length = 343
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 162 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 221
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 222 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 279 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 334
>gi|115533514|ref|NP_001041280.1| Protein R07E3.1, isoform a [Caenorhabditis elegans]
gi|3878958|emb|CAA89070.1| Protein R07E3.1, isoform a [Caenorhabditis elegans]
Length = 402
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 96/181 (53%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF--RNQ 59
+E+ +AI HG LS+ L++C++ + C GG +KA +Y+ GL D P+ Q
Sbjct: 219 VEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGGDEDKAFRYIHRNGLANAVDLPYVAHRQ 278
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQD---YNGKLI 116
NG ++ ++K + + + + D+ L ++GP+ GM A++Q Y G +
Sbjct: 279 NGCAVNDHWNTTRIK---AAYFLHHDEDSIINWLVNFGPVNIGM--AVIQPMRAYKGGVF 333
Query: 117 RKND-VCPSENLN-HAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
++ C +E + HA++I GYG WIV+NSWG WG + GY RG NACGI
Sbjct: 334 TPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWGVEHGYIYFARGINACGI 393
Query: 173 E 173
E
Sbjct: 394 E 394
>gi|203341|gb|AAA63484.1| cathepsin H [Rattus norvegicus]
Length = 298
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 100/186 (53%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G ++ L++ QL++C N N GCQGG ++A +Y L + G+ E YP+
Sbjct: 113 LESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIG 172
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+NG +C ++ K V + + N + Y P+ + +D+ ++
Sbjct: 173 KNG---QCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFE--VTEDF---MM 224
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTN 168
K+ V S + +NHAV+ VGYG ++ + WIV+NSWG WG ++GYF +ERG N
Sbjct: 225 YKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWG-NNGYFLIERGKN 283
Query: 169 ACGIES 174
CG+ +
Sbjct: 284 MCGLAA 289
>gi|340505335|gb|EGR31675.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 229
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 100/181 (55%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLP-LSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFR 57
+ES +A+K+G P LS+ QLI+C + N GC+GG ++A +Y+ + GLE+E DYP+
Sbjct: 37 VESHWALKNGNPPPILSEQQLIDCAQDFNNFGCKGGLPSQAFEYIFYNGGLESEKDYPYM 96
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAG--MNGALLQDYNG 113
T C +DA KV ++ + F + L + GP+ +N Q +G
Sbjct: 97 ---AATRNCTFDASKVSAKLEGQYNITFQDENELLYKLANEGPISIAYQVNNDFFQYRSG 153
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVW-IVRNSWG-RWGPDDGYFTVERGTNACG 171
+ ++NHAV+ VGYG+ ++ IV+NSWG WG + GYF +ERGTN CG
Sbjct: 154 VYSSPSCSQQPSDVNHAVLAVGYGVSISGQLYYIVKNSWGPEWGIN-GYFLIERGTNMCG 212
Query: 172 I 172
+
Sbjct: 213 L 213
>gi|407036622|gb|EKE38272.1| cysteine protease, putative [Entamoeba nuttalli P19]
Length = 308
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 92/178 (51%), Gaps = 11/178 (6%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQ 59
+LE + G L S+ QL++C+ + GC+GG ++++++ + GL E+DYP++
Sbjct: 123 VLEGRVNKDLGKLYSFSEQQLVDCDTSDNGCEGGHPTNSLKFIQENNGLGLESDYPYK-- 180
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGKLI 116
V G C + V V +GS+T + ++ GP+ GM+ + Q Y I
Sbjct: 181 -AVAGTCK-KVKNVATVTGSKRVTDGSETGLQTIIAENGPVAVGMDASRPTFQLYKKGTI 238
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGI 172
+ C S +NH V VGYG WI+RNSWG WG D GYF + R + N CGI
Sbjct: 239 YSDARCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTSWG-DAGYFLLARDSNNMCGI 295
>gi|332373716|gb|AEE61999.1| unknown [Dendroctonus ponderosae]
Length = 346
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 94/176 (53%), Gaps = 11/176 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK--HAGLEAEADYPFRNQ 59
+ES AIK L LS QLI+C+ YN GC+GG ++++K + + E DYP +
Sbjct: 167 IESMQAIKTQQLTDLSIQQLIDCSSYNNGCKGGDTCALLRWIKVNNIAIMNETDYPLVLE 226
Query: 60 NGVTGRCAYDARKVKVRVSDFLV--FNG-SDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+ +C V+V + F G D ++L GP+ ++G Q+Y G +I
Sbjct: 227 DQ---KCQKTDMSEGVKVGTYQCNSFVGREDIILKLLAINGPVAVAISGETWQNYVGGVI 283
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ + C + L+HAV IVGY + +VP +IVRNSWG D+GY V G N CG+
Sbjct: 284 QFH--CEGD-LSHAVQIVGYNLTAKVPFYIVRNSWGEDFGDNGYLYVAIGGNVCGL 336
>gi|379991182|emb|CCA61803.1| cathepsin protein CatL1-MM3p, partial [Fasciola hepatica]
Length = 326
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 94/183 (51%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A QYLK GLE E+ YP+
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPY--- 197
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
V G+C Y+ + +V+ + V +GS+ + ++ GP ++ + Y+G I
Sbjct: 198 TAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYSGG-I 256
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESY 175
++ C LNHAV+ VGYG + WIV+NSWG + + GY + R N CGI S
Sbjct: 257 YQSQTCSPLGLNHAVLAVGYGTQGGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIASL 316
Query: 176 GGI 178
+
Sbjct: 317 ASL 319
>gi|324518532|gb|ADY47133.1| Cysteine proteinase [Ascaris suum]
Length = 334
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 95/180 (52%), Gaps = 11/180 (6%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQN 60
++ES YAIK+G ++ +S+ ++++C+ N GC G +A+ K G Y F
Sbjct: 157 VVESLYAIKYGDIINISEYEMLDCDFSNDGCYSGSTRRAMGRGKFHGFTETRFYSF--MR 214
Query: 61 GVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGAL---LQDYNGKLI 116
G +C R+ + V + + ++T +YGP+ +N A+ + Y ++
Sbjct: 215 GHPYQCP---RRGTIFVKNLFALSPDANTIAWFTANYGPV--ALNVAIPPNYKFYKSGIM 269
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESYG 176
R + C NHA +VG+G+ + WI++NSWG W ++G+F +ERG NAC +E++
Sbjct: 270 RDSYECWQMQPNHAAEVVGFGVEDGIEYWIMKNSWGSWWGENGFFRIERGKNACQVETFA 329
>gi|56199438|gb|AAV84208.1| cathepsin L [Culicoides sonorensis]
Length = 331
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 91/176 (51%), Gaps = 9/176 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E +Y H L++ +L++C + GC GG + A+QY++ GL E DYP++ G
Sbjct: 149 VEMRYKRFHNKSYTLAEQELVDCETTSHGCSGGWSDLALQYMRDNGLSFEKDYPYK---G 205
Query: 62 VTGRC-AYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAG-MNGALLQDYNGKLIRK 118
+C A + K V+V + + +++ Y YGPLV + Y G +
Sbjct: 206 KDEKCHASNENKSPVKVVNVCSTPKDEVSYKDHFYQYGPLVVYYFVDNNFKQYKGGIF-S 264
Query: 119 NDVCPSEN--LNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ C EN +NHAVV++GYG V W+VRNSWG+ + G+F + R + C +
Sbjct: 265 SKTCNVENAGINHAVVLMGYGSEKDVKYWLVRNSWGKSFGESGHFRILRDAHMCNL 320
>gi|67475048|ref|XP_653254.1| cysteine protease [Entamoeba histolytica HM-1:IMSS]
gi|2507251|sp|P36184.2|ACP1_ENTHI RecName: Full=Cysteine proteinase ACP1; Flags: Precursor
gi|1460065|emb|CAA60673.1| cysteine proteinase [Entamoeba histolytica]
gi|56470190|gb|EAL47868.1| cysteine protease, putative [Entamoeba histolytica HM-1:IMSS]
gi|449707486|gb|EMD47138.1| cysteine protease, putative [Entamoeba histolytica KU27]
Length = 308
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 93/178 (52%), Gaps = 11/178 (6%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQ 59
+LE + G L S+ QL++C+ + GC+GG + ++++++ + GL E+DYP++
Sbjct: 123 VLEGRVNKDLGKLYSFSEQQLVDCDASDNGCEGGHPSNSLKFIQENNGLGLESDYPYK-- 180
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGKLI 116
V G C + V V +GS+T + ++ GP+ GM+ + Q Y I
Sbjct: 181 -AVAGTCK-KVKNVATVTGSRRVTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTI 238
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGI 172
+ C S +NH V VGYG WI+RNSWG WG D GYF + R + N CGI
Sbjct: 239 YSDTKCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTSWG-DAGYFLLARDSNNMCGI 295
>gi|332220191|ref|XP_003259241.1| PREDICTED: cathepsin K [Nomascus leucogenys]
Length = 329
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|391341652|ref|XP_003745141.1| PREDICTED: counting factor associated protein D-like [Metaseiulus
occidentalis]
Length = 751
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 96/182 (52%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHG--TLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADY-PF 56
LESQY I++G S+ Q+++C + N GC+GG + A +Y++ GL E Y P+
Sbjct: 567 LESQYIIRNGKGNTTRFSEQQIVDCSWDSLNIGCKGGFPHGAFEYVQKYGLFTEDQYGPY 626
Query: 57 RNQNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGAL--LQDYNGK 114
+ G + + F + G++ R + +GP+ G++G+ + Y+
Sbjct: 627 LDDEGKCRDAEMKGEPIIPTLKSFTMMEGAECLLRHVGLHGPIAVGIHGSSDSFRAYSRG 686
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ ND +L HAV++VGYG P W+V+NSWG +WG +GY V R N CGIE
Sbjct: 687 IY--NDPTCDHSLTHAVLVVGYGSLRGEPYWLVKNSWGPKWGA-EGYILVSRKENYCGIE 743
Query: 174 SY 175
+Y
Sbjct: 744 NY 745
>gi|412992445|emb|CCO18425.1| unknown [Bathycoccus prasinos]
Length = 500
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 96/188 (51%), Gaps = 31/188 (16%)
Query: 8 IKHGTLLPLSKSQLIEC---------NIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
IK G L+ LS+ QL++C N + GC GG + A++Y+ +H GL+ E YP++
Sbjct: 311 IKTGKLVSLSEQQLLDCDVGCAPDIPNACDSGCNGGLPSNAMEYIVEHGGLDTEKSYPYK 370
Query: 58 NQNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
T C K+ +S++ ++T L YGPL G+N A +Q Y G +
Sbjct: 371 AYKEDT--CRAKEGKLGATISNYTFVGKNETHMAHALVKYGPLSIGINAAWMQSYVGGV- 427
Query: 117 RKNDVCP----SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDGYFTVE 164
CP + L+H V+IVGYG H+ P W+++NSWG WG ++GY+ +
Sbjct: 428 ----ACPWLCNKDALDHGVLIVGYGEEGFAPARLHKEPYWVIKNSWGMGWG-EEGYYRIC 482
Query: 165 RGTNACGI 172
+ CG+
Sbjct: 483 KDKGNCGV 490
>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
Length = 491
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 100/177 (56%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++ Q
Sbjct: 311 VEGQWFLKQGTLLSLSEQELLDCDKMDKACLGGLPSNAYSAIKNLGGLETEEDYSYQGQM 370
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 371 QA---CNFSAEKAKVYINDSVELSHNEQKLAAWLAKKGPISVAINAFGMQFYRHGISRPL 427
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV+IVGYG R +P W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 428 RPLCTPWLIDHAVLIVGYGNRSDIPFWAIKNSWGTDWG-EQGYYYLHRGSGACGVNT 483
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 96/172 (55%), Gaps = 9/172 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+E AI G L+ LS+ +L++C+ N GC+GG + A ++ + + G++ EA+YP+
Sbjct: 174 IEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDTEANYPY---T 230
Query: 61 GVTGRCAYDARKVKV-RVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALL--QDYNGKLIR 117
GV G C ++KV + + + +D+ P+ GM+G+ L Q Y G +
Sbjct: 231 GVDGTCNTTKEEIKVVSIDGYTDVDETDSALLCATVQQPISVGMDGSALDFQLYTGGIYD 290
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ +++HAV+IVGYG + WIV+NSWG WG +GYF ++R T+
Sbjct: 291 GDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSWGTEWGM-EGYFYIKRNTD 341
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 92/174 (52%), Gaps = 14/174 (8%)
Query: 8 IKHGTLLPLSKSQLIECN-IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQNGVTGR 65
I G L+ LS+ +L++C+ YNQGC GG + A Q+ +K+ GL E DYP+ NG
Sbjct: 139 IVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNS 198
Query: 66 CAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRKNDVCP 123
++R V + + + +R + Y P+ ++ G Q Y + C
Sbjct: 199 LLKNSRVVTIDGYEDVPSKDETALKRAV-SYQPVSVAIDAGGRAFQHYQSGIFTGK--CG 255
Query: 124 SENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG----TNACGI 172
+ N++HAVV VGYG + V WIVRNSWG RWG +DGY +ER + CGI
Sbjct: 256 T-NMDHAVVAVGYGSENGVDYWIVRNSWGTRWG-EDGYIRMERNVASKSGKCGI 307
>gi|295971911|gb|ADG63162.1| cysteine protease F [Leishmania infantum]
Length = 238
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A L+ LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 19 IESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTS 78
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C ++ V R+ +++ ++T L GP+ G++ + Y ++
Sbjct: 79 GNGDVAECLNSSKLVPGARIDGYVMIPSNETVMAAWLAENGPIAIGVDASSFMSYQSGVL 138
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 139 TS---CAGDALNHGVLLVGYNTTGGVPYWVIKNSWGEDWG-EKGYVRVAMGLNACLLSEY 194
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 92/174 (52%), Gaps = 14/174 (8%)
Query: 8 IKHGTLLPLSKSQLIECN-IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQNGVTGR 65
I G L+ LS+ +L++C+ YNQGC GG + A Q+ +K+ GL E DYP+ NG
Sbjct: 139 IVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNS 198
Query: 66 CAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRKNDVCP 123
++R V + + + +R + Y P+ ++ G Q Y + C
Sbjct: 199 LLKNSRVVTIDGYEDVPSKDETALKRAV-SYQPVSVAIDAGGRAFQHYQSGIFTGK--CG 255
Query: 124 SENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG----TNACGI 172
+ N++HAVV VGYG + V WIVRNSWG RWG +DGY +ER + CGI
Sbjct: 256 T-NMDHAVVAVGYGSENGVDYWIVRNSWGTRWG-EDGYIRMERNVASKSGKCGI 307
>gi|431896622|gb|ELK06034.1| Cathepsin K [Pteropus alecto]
Length = 330
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 149 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 208
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 209 E---SCMYNPTGKAAKCRGYKEIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRKGVY 265
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 266 YDENCNSDNLNHAVLAVGYGIQKGRKHWIIKNSWGENWG-NKGYVLMARNKNNACGI 321
>gi|417409774|gb|JAA51378.1| Putative cathepsin k, partial [Desmodus rotundus]
Length = 331
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A Y+ K+ G+++E YP+ Q+
Sbjct: 150 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFHYVQKNQGIDSEDAYPYVGQD 209
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 210 E---SCMYNPTGKAAKCRGYKEIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 266
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
+ C S+NLNHAV+ VGYG++ + WI++NSWG WG + GY + R NACGI
Sbjct: 267 YDKNCNSDNLNHAVLAVGYGIQKRKKHWIIKNSWGESWG-NKGYILMARNKNNACGI 322
>gi|146147376|gb|ABQ01982.1| cathepsin [Fasciola gigantica]
Length = 326
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 95/183 (51%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNMGCMGGLMENAYEYLKQFGLETESSYPY--- 197
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
V G+C Y+ + +V+D+ V +GS+ + ++ GP ++ + Y+G I
Sbjct: 198 TAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYSGG-I 256
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESY 175
++ C S +NHAV+ VGYG + WIV+NSWG + GY + R N CGI S
Sbjct: 257 YQSRTCSSLRVNHAVLAVGYGTQSGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASL 316
Query: 176 GGI 178
+
Sbjct: 317 ASL 319
>gi|291224868|ref|XP_002732424.1| PREDICTED: cathepsin L-like [Saccoglossus kowalevskii]
Length = 823
Score = 93.2 bits (230), Expect = 5e-17, Method: Composition-based stats.
Identities = 62/182 (34%), Positives = 90/182 (49%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
LE Q K G L LS+ QL++C+ N GC GG + A +Y+K A G+E E DYP+
Sbjct: 640 LEGQTFKKTGKLPDLSEQQLVDCSTQFGNHGCNGGLMDLAFEYIKAAPGIEGEMDYPYLA 699
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
++G RC +D KV + ++ D + + GP+ ++ Q Y
Sbjct: 700 KDG---RCMFDQSKVVATDTGYVDIPSMDENALKEAVATIGPISVAIDAGHPSFQMYKSG 756
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGI 172
+ + C SE L+H V+ VGYG W+V+NSWG WG GY + R N CGI
Sbjct: 757 VYNEPG-CSSERLDHGVLAVGYGTEDGQDYWLVKNSWGDSWG-QAGYIMMSRNMNNQCGI 814
Query: 173 ES 174
+
Sbjct: 815 AT 816
>gi|58617832|gb|AAW80535.1| cathepsin L-like cysteine protease [Leishmania donovani]
gi|58617834|gb|AAW80536.1| cathepsin L-like cysteine protease [Leishmania donovani]
Length = 247
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A L+ LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 22 IESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTS 81
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C ++ V R+ +++ ++T L GP+ G++ + Y ++
Sbjct: 82 GNGDVAECLNSSKLVPGARIDGYVMIPSNETVMAAWLAENGPIAIGVDASSFMSYQSGVL 141
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 142 TS---CAGDALNHGVLLVGYNTTGGVPYWVIKNSWGEDWG-EKGYVRVAMGLNACLLSEY 197
>gi|74152091|dbj|BAE32077.1| unnamed protein product [Mus musculus]
Length = 245
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 92/182 (50%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN----IYNQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+EA+A YP+
Sbjct: 61 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 120
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ +C Y+++ S + L F D + + GP+ G++ + + K
Sbjct: 121 K---ATDEKCHYNSKNRAATCSGYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYK 177
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D + N+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 178 SGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 237
Query: 174 SY 175
SY
Sbjct: 238 SY 239
>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
Length = 403
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 93/191 (48%), Gaps = 24/191 (12%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEA 50
++E + G LL LS+ QLI+C+ + GC GG A YL AG +E
Sbjct: 207 VVEGANFLATGKLLNLSEQQLIDCDHKCDPLNTKACDNGCHGGLMTNAYNYLMEAGGIEE 266
Query: 51 EADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQ 109
+YP+ GV G C ++ V+ +F N + L +GPL G+N A +Q
Sbjct: 267 AKNYPY---TGVQGDCKFNPDLAAVKAINFTTVNLDEKQIAANLVKHGPLAVGLNAAFMQ 323
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQV-------PVWIVRNSWG-RWGPDDGYF 161
Y G + +C +NH V++VGYG + P WI++NSWG RWG + GY+
Sbjct: 324 TYIGG-VSCPLICSKRFINHGVLLVGYGHKGFALLRLGYRPYWIIKNSWGKRWG-EHGYY 381
Query: 162 TVERGTNACGI 172
+ RG CG+
Sbjct: 382 KLCRGHGECGM 392
>gi|295922223|gb|ADG62368.1| cysteine protease [Leishmania donovani]
gi|295971913|gb|ADG63163.1| cysteine protease F [Leishmania donovani]
Length = 239
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A L+ LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 18 IESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTS 77
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C ++ V R+ +++ ++T L GP+ G++ + Y ++
Sbjct: 78 GNGDVAECLNSSKLVPGARIDGYVMIPSNETVMAAWLAENGPIAIGVDASSFMSYQSGVL 137
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 138 TS---CAGDALNHGVLLVGYNTTGGVPYWVIKNSWGEDWG-EKGYVRVAMGLNACLLSEY 193
>gi|82659048|gb|ABB88697.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 92/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWALAGHGLTALSEQQLVSCDDKDNGCGGGLMLQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+G C+ ++ V R+ ++ S+T L GP+ ++ + Y ++
Sbjct: 219 SSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY +VP W+++NSWG WG ++GY V G NAC + Y
Sbjct: 279 TS---CAGDALNHGVLLVGYNRTGEVPYWVIKNSWGEDWG-ENGYVRVTMGVNACLLTEY 334
>gi|258406688|gb|ACV72067.1| putative cysteine protease [Lathyrus sativus]
Length = 350
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES YA G + LS+ QL++C N GC GG ++A +Y+K+ GLE E YP+
Sbjct: 166 LESAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLETEETYPYTG 225
Query: 59 QNGVTGRCAYDARKVKVRV--SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG+ C + + V ++V S + D + + P+ ++ D+ +
Sbjct: 226 SNGL---CKFTSENVALKVLGSVNITLGSEDELKHAVAFARPVSVAFE--VVHDFR---L 277
Query: 117 RKNDVCPSE-------NLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTN 168
K+ V S ++NHAV+ VGYG+ +P W ++NSW G WG D GYF +E G N
Sbjct: 278 YKSGVYTSTACGNTPMDVNHAVLAVGYGIEDGIPYWHIKNSWGGDWG-DHGYFKMEMGKN 336
Query: 169 ACGIES 174
CG+ +
Sbjct: 337 MCGVAT 342
>gi|167427529|gb|ABZ80401.1| cathepsin L4, partial [Fasciola hepatica]
Length = 303
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 88/184 (47%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY + S+ QL++C + N GC GG A +YL+ GLE E+ YP++ +
Sbjct: 118 VEGQYMKNPKANISFSEQQLVDCSGDYGNHGCNGGFMENAYEYLERRGLETESSYPYKAE 177
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMN--GALLQDYNGKL 115
G C YD+R V V + + + ++ GP ++ L G
Sbjct: 178 EGP---CKYDSRLGVVEVFGYFIEHSGIESKLAHLVGDKGPAAVAVDVESDFLMYRGGIY 234
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIES 174
+N C SE LNHA+++VGYG + WIV+NSWG D GY + R N CGI S
Sbjct: 235 ASRN--CSSEKLNHAMLVVGYGTQDGTDYWIVKNSWGSLWGDHGYIRMARNRDNMCGIAS 292
Query: 175 YGGI 178
+
Sbjct: 293 AASV 296
>gi|56752859|gb|AAW24641.1| unknown [Schistosoma japonicum]
Length = 331
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC+GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCEGGYMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA-LLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVAVDSLIMYKSGVF 265
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 266 ESND-CKYADINHGVLVVGYGKEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 323
>gi|391346471|ref|XP_003747496.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 333
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 13/170 (7%)
Query: 16 LSKSQLIECNI------YNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGVT-GRCAY 68
LS+ QL++C + N GC GG IQ+ G+ E +YP+R+ N T GRC+
Sbjct: 159 LSEQQLVDCTLNRYIHNMNFGCGGGDPATTIQHALRHGISQEHEYPYRSGNTQTHGRCSS 218
Query: 69 DARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG--ALLQDYNGKLIRKNDVCPS 124
+ V + + D + +GP+ +NG + Y+G I N CP+
Sbjct: 219 TSGSVSLNNLRLMQVKAGDENALANAVATHGPIAVTLNGENSDFYSYSGG-IYNNRSCPT 277
Query: 125 ENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+ +NHAV++VGYG + P WI++NSWG ++G+ + RG+N CGI S
Sbjct: 278 Q-INHAVLLVGYGSSNGQPYWIIKNSWGSTWGENGFMKLARGSNRCGIVS 326
>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 324
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 89/176 (50%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES IK G L+ LS+ QL++C N GC GG + A++Y++ G+ +E DYP+ +N
Sbjct: 143 VESHNFIKTGKLISLSEQQLVDCVKNNSGCAGGWMDIALEYIEADGIMSEDDYPYEERNT 202
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
C ++ K V++ + +D ++ + GP+ + + + I +
Sbjct: 203 T---CRFNNSKAAVQIKSYKAIKKNDEIDLQKAVALEGPVSVAIEVTIAFQLYARGILND 259
Query: 120 DVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGI 172
C + +L HAV++ GYG + WIV+NSWG DGY + R N CGI
Sbjct: 260 PQCKNTEGDLTHAVLVTGYGSQDGKDYWIVKNSWGAEYGMDGYLRMSRNADNQCGI 315
>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 359
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 96/183 (52%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K + GL+ E YP+
Sbjct: 175 LEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 234
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
++G C Y A V V V D + N + L H LV ++ A ++ +L +
Sbjct: 235 EDGT---CKYSAENVGVEVLDSV--NITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKS 289
Query: 119 NDVCPSE------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
S ++NHAV+ VGYG+ VP W+++NSWG WG D GYF +E G N CG
Sbjct: 290 GVYSDSHCGQTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWG-DKGYFKMEMGKNMCG 348
Query: 172 IES 174
I +
Sbjct: 349 IAT 351
>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 377
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 96/198 (48%), Gaps = 34/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYL-KHAGLEAE 51
LE + G + LS+ Q ++C+ + GC GG A YL K GLE E
Sbjct: 175 LEGANYLATGKMEVLSEQQFVDCDHECDPEEPDSCDAGCNGGLMTSAFSYLLKSGGLERE 234
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++G C +D K+ V +F V + + L +GPL G+N A +Q
Sbjct: 235 KDYPYTGRDGT---CKFDKSKIVASVQNFSVVSVDEEQIAANLVKHGPLAIGINAAYMQT 291
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP +L+H V++VGYG P W+++NSWG WG + G
Sbjct: 292 YIGGV-----SCPYICGRSLDHGVLLVGYGASGFAPSRLKNKPYWVIKNSWGENWG-EKG 345
Query: 160 YFTVERGTNA---CGIES 174
Y+ + RG+N CG++S
Sbjct: 346 YYKICRGSNVRNKCGVDS 363
>gi|148908373|gb|ABR17300.1| unknown [Picea sitchensis]
Length = 357
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 99/184 (53%), Gaps = 17/184 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 172 LEAAYTQATGKTVILSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTA 231
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
++GV C YD V V+V+D + D + + P+ ++QD+ +
Sbjct: 232 KDGV---CNYDVNNVGVKVADSVNISLGAEDKLKSAVGLVRPVSVAFQ--VIQDFRFYKE 286
Query: 115 LIRKNDVCPS--ENLNHAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGTNAC 170
+ + C ++NHAV+ VGYG+ + P WI++NSWG+ WG +GYF +E G N C
Sbjct: 287 GVFTSTTCGQGPMDVNHAVLAVGYGVSEEGTPHWIIKNSWGKSWGV-EGYFKMEMGKNMC 345
Query: 171 GIES 174
G+ +
Sbjct: 346 GVAT 349
>gi|116779845|gb|ABK21448.1| unknown [Picea sitchensis]
gi|116791731|gb|ABK26088.1| unknown [Picea sitchensis]
gi|224286276|gb|ACN40847.1| unknown [Picea sitchensis]
Length = 357
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 99/184 (53%), Gaps = 17/184 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 172 LEAAYTQATGKTVILSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTA 231
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
++GV C YD V V+V+D + D + + P+ ++QD+ +
Sbjct: 232 KDGV---CNYDVNNVGVKVADSVNISLGAEDELKSAVGLVRPVSVAFQ--VIQDFRFYKE 286
Query: 115 LIRKNDVCPS--ENLNHAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGTNAC 170
+ + C ++NHAV+ VGYG+ + P WI++NSWG+ WG +GYF +E G N C
Sbjct: 287 GVFTSTTCGQGPMDVNHAVLAVGYGVSEEGTPHWIIKNSWGKSWGV-EGYFKMEMGKNMC 345
Query: 171 GIES 174
G+ +
Sbjct: 346 GVAT 349
>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
Precursor
gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
Length = 346
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 97/181 (53%), Gaps = 11/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN-IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQ 59
+ES AI G L+ LS+ +L++C+ YN+GC GG + A ++ +K+ G++ E DYP++ +
Sbjct: 51 MESINAIVTGNLISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKER 110
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVA-GMNGALLQDYNGKLIRK 118
NGV + +A+ VK+ + + N ++ + H +A G Q Y +
Sbjct: 111 NGVCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTG 170
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESYGGI 178
C + ++H VVI GYG + + WIVRNSWG ++GY V+R + S G+
Sbjct: 171 K--CGTA-VDHGVVIAGYGTENGMDYWIVRNSWGANCRENGYLRVQR-----NVSSSSGL 222
Query: 179 C 179
C
Sbjct: 223 C 223
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 93/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K + GL+ E YP+
Sbjct: 176 LEAAYGQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGK 114
+NG+ C + + V V+V D + D + + P+ Q +G
Sbjct: 236 KNGL---CKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGV 292
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
++NHAV+ VGYG+ + VP W+++NSWG WG D+GYF +E G N CGI
Sbjct: 293 YTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWG-DNGYFKMEMGKNMCGIA 351
Query: 174 S 174
+
Sbjct: 352 T 352
>gi|157864845|ref|XP_001681131.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124425|emb|CAJ02281.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVSTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|109940312|sp|Q5E968.2|CATK_BOVIN RecName: Full=Cathepsin K; Flags: Precursor
Length = 329
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---NCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
Length = 379
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 100/177 (56%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 199 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 255
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D +V + ++ L GP+ +N +Q Y + R
Sbjct: 256 GHMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 315
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 316 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 371
>gi|394331805|gb|AFN27125.1| cysteine protease [Leishmania major]
Length = 348
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVSTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|77735825|ref|NP_001029607.1| cathepsin K precursor [Bos taurus]
gi|59858469|gb|AAX09069.1| cathepsin K preproprotein [Bos taurus]
gi|83638771|gb|AAI09854.1| Cathepsin K [Bos taurus]
gi|296489554|tpg|DAA31667.1| TPA: cathepsin K [Bos taurus]
Length = 334
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 153 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 212
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 213 E---NCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRKGVY 269
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 270 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 325
>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
Length = 331
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 99/181 (54%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G++++A YP++
Sbjct: 148 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+C YD++ S + L + D + ++ + GP+ G++ + + +
Sbjct: 208 ---ATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGVDASHPSFFLYRS 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
+ ++N+NH V++VGYG+ + W+V+NSWGR ++GY + R N CGI S
Sbjct: 265 GVYYEPSCTQNVNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIAS 324
Query: 175 Y 175
+
Sbjct: 325 F 325
>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
Length = 331
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 99/181 (54%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G++++A YP++
Sbjct: 148 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+C YD++ S + L + D + ++ + GP+ G++ + + +
Sbjct: 208 ---ATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGVDASHPSFFLYRS 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
+ ++N+NH V++VGYG+ + W+V+NSWGR ++GY + R N CGI S
Sbjct: 265 GVYYEPSCTQNVNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIAS 324
Query: 175 Y 175
+
Sbjct: 325 F 325
>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
Length = 293
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 96/190 (50%), Gaps = 24/190 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI---------YNQGCQGGGFNKAIQYL-KHAGLEAE 51
+E + I G L+ LS+ QL++C++ + GC GG + A++Y+ +H G++ E
Sbjct: 98 IEGAHFISTGKLVELSEQQLLDCDVGCDPDVPNACDSGCNGGLPSNAMEYIVEHGGIDTE 157
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
YP+ G G C D + + +F V + L +GPL G+N A +Q
Sbjct: 158 KSYPYV---GEKGECKADEGTLGATLKNFSYVSSDEKQMAAALVKHGPLSIGINAAWMQT 214
Query: 111 YNGKLIRKNDVCPSENLNHAVVIVGYGMR-------HQVPVWIVRNSWG-RWGPDDGYFT 162
Y G + +C SE L+H V+IVGYG Q P WIV+NSW WG + GY+
Sbjct: 215 YIGGVACPW-LCDSEALDHGVLIVGYGSSGFAPVRWQQEPYWIVKNSWSPAWG-EGGYYR 272
Query: 163 VERGTNACGI 172
+ + +CGI
Sbjct: 273 ICKDKGSCGI 282
>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
Length = 477
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 92/177 (51%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E + I L+ LS+ +L++C+ +QGC GG N + ++ GLE E YP+ +
Sbjct: 297 VEGAWFIAKNKLVSLSEQELVDCDSMDQGCNGGLPSNAYKEIIRMGGLEPEDAYPY---D 353
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + + V ++ + + ++ L GP+ G+N LQ Y ++
Sbjct: 354 GRGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPF 413
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C LNH V+IVGYG + P WIV+NSWG WG + GYF + RG N CG++
Sbjct: 414 KIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPNWG-EAGYFKLYRGKNVCGVQE 469
>gi|4574304|gb|AAD23996.1|AF112566_1 cathepsin [Fasciola gigantica]
Length = 326
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 95/183 (51%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNMGCSGGLMENAYEYLKQFGLETESSYPY--- 197
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
V G+C Y+ + +V+D+ V +GS+ + ++ GP ++ + Y+G I
Sbjct: 198 TAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYSGG-I 256
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESY 175
++ C S +NHAV+ VGYG + WIV+NSWG + GY + R N CGI S
Sbjct: 257 YQSRTCSSLRVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASL 316
Query: 176 GGI 178
+
Sbjct: 317 ASL 319
>gi|21263041|gb|AAM44832.1|AF510856_1 cathepsin L2 [Fasciola gigantica]
Length = 326
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 96/183 (52%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNMGCMGGLMENAYEYLKQFGLETESSYPY--- 197
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
V G+C Y+ + +V+D+ V +GS+ + ++ GP ++ + Y+G I
Sbjct: 198 TAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYSGG-I 256
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESY 175
++ C S ++NHAV+ VGYG + WIV+NSWG + GY + R N CGI S
Sbjct: 257 YQSRTCSSLHVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASL 316
Query: 176 GGI 178
+
Sbjct: 317 ASL 319
>gi|440906717|gb|ELR56946.1| Cathepsin K [Bos grunniens mutus]
Length = 338
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 157 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 216
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 217 E---NCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRKGVY 273
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 274 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 329
>gi|31982433|ref|NP_031828.2| cathepsin K precursor [Mus musculus]
gi|12644320|sp|P55097.2|CATK_MOUSE RecName: Full=Cathepsin K; Flags: Precursor
gi|3550487|emb|CAA06825.1| cathepsin K [Mus musculus]
gi|12834090|dbj|BAB22783.1| unnamed protein product [Mus musculus]
gi|28277388|gb|AAH46320.1| Cathepsin K [Mus musculus]
gi|74209960|dbj|BAE21279.1| unnamed protein product [Mus musculus]
gi|148706870|gb|EDL38817.1| cathepsin K, isoform CRA_a [Mus musculus]
Length = 329
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY++ G+++E YP+ Q+
Sbjct: 148 LEGQLKKKTGKLLALSPQNLVDCVTENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQD 207
Query: 61 GVTGRCAYDA--RKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+A + K R + +R + GP+ ++ +L + + +
Sbjct: 208 E---SCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C +N+NHAV++VGYG + WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWG-NKGYALLARNKNNACGI 320
>gi|403352840|gb|EJY75943.1| Oryzain gamma chain [Oxytricha trifallax]
Length = 338
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 94/180 (52%), Gaps = 15/180 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+ES +AI+ G L LS+ QL++C+ N GC GG + Y+K GLE EADYP+
Sbjct: 155 VESHFAIQFGKLYSLSEQQLVDCSTAYDNAGCNGGLATQGYDYVKSYGLEQEADYPYLAA 214
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFN--GSDTFRRMLYHYGPLVAGMNGA-LLQDYNGKLI 116
+G C D K+ V DF + L GP ++ + + ++Y ++
Sbjct: 215 DGT---CHRDKSKIVAYVEDFHTVQTLSPSQLKAALATQGPASVSVDASGVFKNYQSGIL 271
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGY--FTVERGTNACGIE 173
N C + +LNHA++ VGYG+ + +IVRNSWG WG ++GY + G CG++
Sbjct: 272 --NAGCGT-SLNHAILAVGYGVENGQEYYIVRNSWGPSWG-ENGYIRLAIVEGQGTCGVQ 327
>gi|26245875|gb|AAN77413.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 287
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 90/178 (50%), Gaps = 8/178 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES IK G L+ LS+ QL++C N GC GG + A++Y++ G+ +E DYP+ +N
Sbjct: 106 VESHNFIKTGKLISLSEQQLVDCVKNNSGCAGGWMDIALEYIEADGIMSEDDYPYEERNT 165
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
C ++ K V++ + +D ++ + GP+ + + + I +
Sbjct: 166 T---CRFNNSKAAVQIKSYKAIKKNDEIDLQKAVALEGPVPVAIEVTIAFQLYARGILND 222
Query: 120 DVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
C + +L HAV++ GYG + WIV+NSWG DGY + R N CGI +
Sbjct: 223 PQCKNTEGDLTHAVLVTGYGSQDGKDYWIVKNSWGAEYGMDGYLRMSRNADNQCGIAT 280
>gi|426216528|ref|XP_004002514.1| PREDICTED: cathepsin K [Ovis aries]
Length = 330
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 149 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 208
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 209 E---NCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRKGVY 265
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 266 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 321
>gi|402856107|ref|XP_003892641.1| PREDICTED: cathepsin S isoform 2 [Papio anubis]
Length = 281
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 99/181 (54%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G++++A YP++
Sbjct: 98 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYK 157
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+C YD++ S + L + D + ++ + GP+ G++ + + +
Sbjct: 158 ---ATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGVDASHPSFFLYRS 214
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
+ ++N+NH V++VGYG+ + W+V+NSWGR ++GY + R N CGI S
Sbjct: 215 GVYYEPSCTQNVNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIAS 274
Query: 175 Y 175
+
Sbjct: 275 F 275
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 94/181 (51%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 157 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 216
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
G+C YD++ S + L D + + + GP+ ++ + +
Sbjct: 217 ---ATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRS 273
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG D GY + R + N CGI S
Sbjct: 274 GVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAS 333
Query: 175 Y 175
Y
Sbjct: 334 Y 334
>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
Length = 360
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 92/181 (50%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QLI+C N GC GG ++A +Y+K+ GL+ E YP++
Sbjct: 176 LEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQ- 234
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
GV G C + V +V D + D + + P+ +
Sbjct: 235 --GVNGICKFKNENVGFKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGV 292
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+D C + ++NHAV+ VGYG+ VP W+++NSWG WG D+GYF +E G N CG+
Sbjct: 293 YTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWG-DEGYFKMEMGKNMCGVA 351
Query: 174 S 174
+
Sbjct: 352 T 352
>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
Length = 330
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 93/182 (51%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+EA+A YP+
Sbjct: 146 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 205
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ + +C Y+++ S + L F D + + GP+ G++ + + K
Sbjct: 206 K---AMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYK 262
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D + N+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 263 SGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 322
Query: 174 SY 175
SY
Sbjct: 323 SY 324
>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
Length = 331
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 99/181 (54%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G++++A YP++
Sbjct: 148 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+C YD++ S + L + D + ++ + GP+ G++ + + +
Sbjct: 208 ---ATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGVDASHPSFFLYRS 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
+ ++N+NH V++VGYG+ + W+V+NSWGR ++GY + R N CGI S
Sbjct: 265 GVYYEPSCTQNVNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIAS 324
Query: 175 Y 175
+
Sbjct: 325 F 325
>gi|327289219|ref|XP_003229322.1| PREDICTED: cathepsin K-like, partial [Anolis carolinensis]
Length = 289
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 93/177 (52%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
LE+Q +K G LL LS L++C N GC GG A +Y+ + G++++ YP+ Q+
Sbjct: 108 LEAQLKMKTGKLLNLSPQNLVDCVSNNDGCGGGYMTNAFEYVHVNRGIDSDDTYPYIGQD 167
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ G++ +L + + +
Sbjct: 168 E---NCMYNPTGKAAKCRGYKEIPEGDEKALKRAVARKGPVSVGIDASLASFQFYSRGVY 224
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C ++N+NHAV+ VGYG + WIV+NSWG WG D GY + R NACGI
Sbjct: 225 YDENCNADNINHAVLAVGYGSQKGTKHWIVKNSWGEDWG-DKGYILMARNMNNACGI 280
>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
Length = 335
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 98/195 (50%), Gaps = 32/195 (16%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQY-LKHAGLEAE 51
LE + + G L+ LS QL++C+ + GC GG N A +Y L+ G++ E
Sbjct: 138 LEGSHFLATGELVSLSDQQLVDCDHVCDPEQYGACDSGCNGGLMNNAFEYILESGGVQRE 197
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFN-GSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ ++ G +A V S+F V + D L GPL G+N +Q
Sbjct: 198 EDYPYTGRD--RGPAIDEANAASV--SNFSVVSLDEDQISANLVKNGPLAIGINAVFMQT 253
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMRHQVPV-------WIVRNSWGR-WGPDDG 159
Y G + CP +NL+H V++VGYG P+ WI++NSWG WG ++G
Sbjct: 254 YIGGV-----SCPYICGKNLDHGVLLVGYGKAGYAPIRLKEKPYWIIKNSWGESWG-ENG 307
Query: 160 YFTVERGTNACGIES 174
Y+ + RG N CG++S
Sbjct: 308 YYKICRGRNVCGVDS 322
>gi|428175797|gb|EKX44685.1| hypothetical protein GUITHDRAFT_71985 [Guillardia theta CCMP2712]
Length = 354
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 106/190 (55%), Gaps = 22/190 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES +AIK G ++ LS+ QL++C + N GC GG ++A +Y+ + GL +YP+
Sbjct: 156 LESLHAIKTGEMVLLSEQQLVDCAADFKNNGCNGGLPSQAFEYIMYNGGLSKMEEYPYVC 215
Query: 59 QNG----VTGRCAYD---------ARKVKVRVSDFLVFNGSDTFRRMLYHYGPL-VAGMN 104
+G G CA+D A+KV +V++F + + + ++ + P+ VA
Sbjct: 216 GDGHCNVTGGPCAFDPVGKPWSVGAKKVS-KVANFTPGD-EISMKTVVGSHNPISVAFEV 273
Query: 105 GALLQDYN-GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFT 162
A L+ Y+ G V + +NHAV+ VGYG +P W ++NSWG WG D+GYF
Sbjct: 274 VADLRHYSSGVYSSPTCVGTPDKVNHAVLAVGYGTEGGIPYWTIKNSWGFAWG-DNGYFK 332
Query: 163 VERGTNACGI 172
++RG+N CGI
Sbjct: 333 IQRGSNMCGI 342
>gi|349604730|gb|AEQ00199.1| Cathepsin K-like protein, partial [Equus caballus]
Length = 219
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 38 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 97
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + + +
Sbjct: 98 E---SCMYNPTGKAAKCRGYREIPQGNEKALKRAVARVGPVSVAIDASLTSFQFYSRGVY 154
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 155 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 210
>gi|380026170|ref|XP_003696831.1| PREDICTED: cathepsin O-like [Apis florea]
Length = 368
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFR 57
++ES +AIK+GTL LS ++I+C N GC+GG + +L + ++ E+ YP
Sbjct: 181 VIESMFAIKNGTLHSLSVQEMIDCAKNSNFGCEGGDICSLLSWLLVSKVQILQESIYPLV 240
Query: 58 NQNGVTGRCAYDARKVK---VRVSDFL---VFNGSDTFRRMLYHYGPLVAGMNGALLQDY 111
G+TG C K +++ DF + D L +GP+ A +N Q+Y
Sbjct: 241 ---GMTGTCKLGKMTDKAFGIKIQDFTCDSFVDAEDELLIALATHGPVAAAVNALSWQNY 297
Query: 112 NGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACG 171
G +I+ + +NLNHAV I+GY VP +I++NSWG D GY + G N CG
Sbjct: 298 LGGVIQYHCDGSFDNLNHAVQIIGYDKSVAVPHYIIKNSWGSNFGDKGYMYIGIGNNLCG 357
Query: 172 I 172
I
Sbjct: 358 I 358
>gi|171948778|gb|ACB59246.1| cathepsin H [Sus scrofa]
Length = 297
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 98/186 (52%), Gaps = 19/186 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGG--GF-NKAIQYLKH-AGLEAEADYP 55
LES AI G +L L++ QL++C N N GCQGG G ++A +Y+++ G+ E YP
Sbjct: 109 LESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPGLPSQAFEYIRYNKGIMGEDTYP 168
Query: 56 FRNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDY-- 111
++ Q+ C + K V D + N + + Y P+ + D+
Sbjct: 169 YKGQDD---HCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFE--VTNDFLM 223
Query: 112 NGKLIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
K I + C + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 224 YRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGM-NGYFLIERGKN 282
Query: 169 ACGIES 174
CG+ +
Sbjct: 283 MCGLAA 288
>gi|350415610|ref|XP_003490694.1| PREDICTED: cathepsin O-like [Bombus impatiens]
Length = 355
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 95/180 (52%), Gaps = 10/180 (5%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFR 57
++ES YAIK+GTL LS ++I+C N GC+GG + +L + ++ E+ YP
Sbjct: 168 VVESMYAIKNGTLHMLSVQEMIDCAKNSNFGCEGGDICSLLSWLLASKVQIFQESTYPLV 227
Query: 58 NQNGVT--GRCAYDARKVKVRVSDFLVFNGSDTFRRMLYH---YGPLVAGMNGALLQDYN 112
+ + G+ A VK+R DF N D +L +GP+ A +N Q+Y
Sbjct: 228 GKTSMCKLGKMIDKASGVKIR--DFNCDNFVDAEDELLITVATHGPVAAAVNALSWQNYL 285
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
G +I+ + +NLNHAV IVGY +P +I++NSWG D GY + G N CGI
Sbjct: 286 GGVIQYHCDSSFDNLNHAVQIVGYDKSAAIPHYIIKNSWGTNFGDKGYMYIGIGNNLCGI 345
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 94/181 (51%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 145 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 204
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
G+C YD++ S + L D + + + GP+ ++ + +
Sbjct: 205 ---ATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRS 261
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG D GY + R + N CGI S
Sbjct: 262 GVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAS 321
Query: 175 Y 175
Y
Sbjct: 322 Y 322
>gi|403371627|gb|EJY85692.1| Cysteine protease [Oxytricha trifallax]
Length = 384
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 96/191 (50%), Gaps = 19/191 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRN 58
+E Y IK G L+ +SK QL+EC+ N GC+GG A +YLK L+++A YP+
Sbjct: 200 VEGAYQIKTGKLIEMSKQQLLECSGRPYGNSGCRGGYMTNAYKYLKDNKLQSDASYPY-- 257
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPL---VAGMNGALLQDYNGK 114
G G C +DA K V + +D T P+ + + ALL +G
Sbjct: 258 -TGTAGTCKHDASKGITNVVSYTALPANDPTALLNAVAKQPVSIAIYASSSALLAYKSG- 315
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVER----GTNA 169
I C + N+NHAV +VGYG + + WI++NSWG +WG + G+ ++R G
Sbjct: 316 -IVDTAKCGT-NVNHAVTLVGYGSENGIDYWIIKNSWGAKWG-EKGFIRIKRDMTKGPGI 372
Query: 170 CGIESYGGICT 180
CGI I T
Sbjct: 373 CGIYKLSSIPT 383
>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 91/180 (50%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ I L LS+ L+ C+ C+GG ++A +++ + + E YP+ +
Sbjct: 159 IEGQWKIAGHELTSLSEQMLVSCDTTEDNCRGGFADRAFKWIVSSNKGNVFTEESYPYAS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+G C + V ++S + + + L GP+ ++ + DY G ++
Sbjct: 219 TDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVAIAVDASTFLDYKGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYG 176
C SE L+H V++VGY + P WI++NSW + WG ++GY +E+GTN C ++ Y
Sbjct: 279 S---CSSEGLSHDVLLVGYNDTSKPPYWIIKNSWDKEWG-EEGYIRIEKGTNLCLMKEYA 334
>gi|109082090|ref|XP_001108862.1| PREDICTED: cathepsin H isoform 2 [Macaca mulatta]
Length = 335
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 97/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ +I
Sbjct: 209 --GKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MI 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
K + S + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|390476660|ref|XP_003735160.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin K [Callithrix jacchus]
Length = 329
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 90/177 (50%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG+ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNSDNLNHAVLAVGYGILKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|297297049|ref|XP_002804951.1| PREDICTED: cathepsin H [Macaca mulatta]
Length = 323
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 97/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 138 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 196
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ +I
Sbjct: 197 --GKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MI 249
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
K + S + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 250 YKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKN 308
Query: 169 ACGIES 174
CG+ +
Sbjct: 309 MCGLAA 314
>gi|256052112|ref|XP_002569622.1| cathepsin S (C01 family) [Schistosoma mansoni]
Length = 345
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 92/179 (51%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
LE Q IK GTL PLS QL++C + C + A ++K G+E++ DYPF G
Sbjct: 168 LEGQVKIKTGTLTPLSSQQLVDC-AGDHECVENPVSVAFDFIKQNGVESQQDYPF---TG 223
Query: 62 VTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVA--GMNGALLQDYNGKLIRK 118
G C YD+ K +S ++ V + + ++ +Y+ GP+ M L +G L+
Sbjct: 224 KVGNCTYDSSKKVTTISSYIQVDDNEEELQKAVYNIGPIAVRIAMTQEFLTYGSGVLLI- 282
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIESYG 176
D C +E +V++VGYG+ + +P W+V+ + G D GY + R N C I ++
Sbjct: 283 -DDCQNEEPFESVLVVGYGIENDIPYWLVKFNLGEEFGDHGYIKLARNYKNMCHIANFA 340
>gi|13928758|ref|NP_113748.1| cathepsin K precursor [Rattus norvegicus]
gi|12585195|sp|O35186.1|CATK_RAT RecName: Full=Cathepsin K; Flags: Precursor
gi|2305208|gb|AAB65743.1| cathepsin K [Rattus norvegicus]
gi|50927597|gb|AAH78793.1| Cathepsin K [Rattus norvegicus]
gi|149030667|gb|EDL85704.1| cathepsin K, isoform CRA_a [Rattus norvegicus]
Length = 329
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY++ G+++E YP+ Q+
Sbjct: 148 LEGQLKKKTGKLLALSPQNLVDCVSENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQD 207
Query: 61 GVTGRCAYDA--RKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+A + K R + +R + GP+ ++ +L + + +
Sbjct: 208 E---SCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDASLTSFQFYSRGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C +N+NHAV++VGYG + WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWG-NKGYVLLARNKNNACGI 320
>gi|226476112|emb|CAX72146.1| cathepsin L, a [Schistosoma japonicum]
Length = 331
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC+GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCEGGYMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG-ALLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALDSLIMYKSGVF 265
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 266 ESND-CKHADINHGVLVVGYGNEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 323
>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 91/180 (50%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ + L LS+ L+ C+ + GC GG + A +++ + + E YP+ +
Sbjct: 159 IEGQWKVTGHNLTSLSEQMLVSCDTEDLGCAGGLMDNAFKWIVSSNRHNVFTEESYPYAS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+ G C + V ++ D + + + L GP+ ++ Q Y G ++
Sbjct: 219 KGGNVPPCRMSGKVVGAKIRDHVDLPKDENAIAEWLAKNGPVAIAVDSTSFQSYTGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYG 176
C S+ L+H V++VGY + P WI++NSW + WG ++GY +E+GTN C +++Y
Sbjct: 279 S---CISKQLDHGVLLVGYDDTSKPPYWIIKNSWSKGWG-EEGYIRIEKGTNQCLVKNYA 334
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 97/182 (53%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
+E Q+A K G L+ LS+ L++C+ NQGC GG + A QY+ + G++ EA YP+
Sbjct: 151 VEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYPYTA 210
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGK 114
++G C ++A V +S F + GS++ + + GP+ ++ + Q Y
Sbjct: 211 KDGT---CKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTSG 267
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVER-GTNACGI 172
+ + C S +L+H V+ GYG + P W+V+NSWG WG GY + R N CGI
Sbjct: 268 VYNEKK-CSSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWG-QAGYIWMSRNANNQCGI 325
Query: 173 ES 174
+
Sbjct: 326 AT 327
>gi|328789602|ref|XP_623690.2| PREDICTED: cathepsin O-like [Apis mellifera]
Length = 368
Score = 92.4 bits (228), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 93/181 (51%), Gaps = 12/181 (6%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFR 57
++ES +AIK+GTL LS ++I+C N GC+GG + +L + ++ E+ YP
Sbjct: 181 VIESMFAIKNGTLHSLSVQEMIDCAKNSNFGCEGGDICSLLSWLLISKVQILQESIYPLV 240
Query: 58 NQNGVTGRCA---YDARKVKVRVSDFL---VFNGSDTFRRMLYHYGPLVAGMNGALLQDY 111
G+TG C + +++ DF + D L +GP+ A +N Q+Y
Sbjct: 241 ---GMTGTCKLGKMTDKTFNIKIQDFTCDSFVDAEDELLIALATHGPVAAAVNALSWQNY 297
Query: 112 NGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACG 171
G +I+ + NLNHAV I+GY VP +I++NSWG D GY + G N CG
Sbjct: 298 LGGVIQYHCDGSFNNLNHAVQIIGYDKSVAVPHYIIKNSWGSNFGDKGYMYIGIGNNLCG 357
Query: 172 I 172
I
Sbjct: 358 I 358
>gi|394331818|gb|AFN27128.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 92.4 bits (228), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ L+ C+ N GC GG +A ++L + + E YP+ +
Sbjct: 159 IESQWALAGHRLTALSEHHLVSCHDKNSGCTGGLMLQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+G C+ ++ V R+ ++ S+T L GP+ ++ + Y ++
Sbjct: 219 SSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C +LNH V++VGY +VP W+++NSWG WG ++GY V G NAC + Y
Sbjct: 279 TS---CAGISLNHGVLLVGYNRTGEVPYWVIKNSWGENWG-ENGYVRVTMGVNACLLTEY 334
>gi|355681653|gb|AER96814.1| cathepsin K [Mustela putorius furo]
Length = 329
Score = 92.4 bits (228), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 149 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 208
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K + + +R + GP+ ++ +L + K +
Sbjct: 209 E---SCMYNPTGKAAKCKGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 265
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 266 YDENCNSDNLNHAVLAVGYGVQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 321
>gi|91092016|ref|XP_970773.1| PREDICTED: similar to cathepsin-L-like midgut cysteine proteinase
[Tribolium castaneum]
gi|270001248|gb|EEZ97695.1| cathepsin L precursor [Tribolium castaneum]
Length = 314
Score = 92.4 bits (228), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 90/179 (50%), Gaps = 11/179 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E Q A+K L LS LI+C+ + GC GG A Y+ G+ E DYP+ + G
Sbjct: 134 VEGQLALKTNQLTSLSAQNLIDCSA-DFGCNGGHATNAYSYISQFGIMPEKDYPYEGKAG 192
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGAL-LQDYNGKLIRK 118
V C +DA K V+ F + +D + L GP+ A + LQ Y G ++
Sbjct: 193 V---CRFDASKSITTVTGFYDIDPNDETALQGALAMMGPIAATIEATEELQFYKGGILL- 248
Query: 119 NDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
++ C S+ +LNH V++VGYG + WIV+NSWG WG Y V N CGI S
Sbjct: 249 DEKCNSKVPDLNHGVLVVGYGSENGGDFWIVKNSWGSDWGEGGYYRPVRNHGNNCGIAS 307
>gi|378943060|gb|AFC76271.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYTS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
Length = 326
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 92/182 (50%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+EA+A YP+
Sbjct: 142 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 201
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ +C Y+++ S + L F D + + GP+ G++ + + K
Sbjct: 202 K---ATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYK 258
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D + N+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 259 SGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 318
Query: 174 SY 175
SY
Sbjct: 319 SY 320
>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
Length = 359
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 64/183 (34%), Positives = 97/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K + GL+ E YP+
Sbjct: 175 LEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 234
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
++G C Y A V V+V D + N + L H L+ ++ A ++ +L +
Sbjct: 235 EDGT---CKYSAENVGVQVLDSV--NITLGAEDELKHAVGLLRPVSIAFEVIHSFRLYKS 289
Query: 119 NDVCPSE------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
S ++NHAV+ VGYG+ VP W+++NSWG WG D GYF +E G N CG
Sbjct: 290 GVYSDSHCGQTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWG-DKGYFKMEMGKNMCG 348
Query: 172 IES 174
I +
Sbjct: 349 IAT 351
>gi|328789446|ref|XP_394277.3| PREDICTED: cathepsin J-like [Apis mellifera]
Length = 344
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 98/185 (52%), Gaps = 19/185 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
+E Q K G LLPLS+ QL++C+ N GC GG ++YL+ A GL A+ YP++
Sbjct: 166 IEGQIFKKTGMLLPLSEQQLVDCSTSTGNLGCSGGSLRNTLRYLEKAKGLMAKKYYPYKA 225
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
+ G +C + V ++ + V D + GP+ A +N + Q Y+ K
Sbjct: 226 KQG---QCRFKEDLSVVNITSWAVLPARDEKVLEAAVATIGPIAASINASPKTFQLYH-K 281
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPV-WIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ ++VC S+ +NHA++IVGY P WI++N WG WG ++GY + + N CGI
Sbjct: 282 GVYDDEVCSSDMVNHAMLIVGY-----TPTEWILKNWWGDGWG-ENGYMRLAKNKNRCGI 335
Query: 173 ESYGG 177
+Y
Sbjct: 336 ANYAA 340
>gi|91085677|ref|XP_971867.1| PREDICTED: similar to cathepsin L-like protein; cysteine proteinase
[Tribolium castaneum]
gi|270011032|gb|EFA07480.1| cathepsin L precursor [Tribolium castaneum]
Length = 329
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 90/178 (50%), Gaps = 9/178 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE+ Y I+ G+++ LS+ QL++C GC+GG A Y+ ++ G+ + +YP++
Sbjct: 149 LEAHYKIRRGSVVTLSEQQLVDCVRQAFGCRGGWMTDAYMYIARNGGINLDRNYPYK--- 205
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
G C + A K KV + + G + + M+ GP+ ++ + G +
Sbjct: 206 ASAGPCRFQASKPKVTIRGYAYLTGPNEEMLKHMVVTQGPVSVAIDASGRFASYGGGVYY 265
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGIES 174
N C HAVVIVGYG + W+V+NSWGR WG GY + R N CGI S
Sbjct: 266 NPSCARNKFTHAVVIVGYGRENGQDYWLVKNSWGRDWGL-GGYIKMARNRNNHCGIAS 322
>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
Length = 342
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 92/182 (50%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+EA+A YP+
Sbjct: 158 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 217
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ +C Y+++ S + L F D + + GP+ G++ + + K
Sbjct: 218 K---ATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYK 274
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D + N+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 275 SGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 334
Query: 174 SY 175
SY
Sbjct: 335 SY 336
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 94/177 (53%), Gaps = 19/177 (10%)
Query: 8 IKHGTLLPLSKSQLIECN-IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQNGVTGR 65
I G L+ LS+ +L++C+ YNQGC GG + A Q+ +K+ GL E DYP+R G G+
Sbjct: 184 IVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYR---GFGGK 240
Query: 66 CAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRKNDV 121
C + +V D V +T + Y P+ + G + Q Y + +
Sbjct: 241 CNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGS-- 298
Query: 122 CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA-----CGI 172
C + NL+HAVV VGYG + V WIVRNSWG RWG ++GY +ER A CGI
Sbjct: 299 CGT-NLDHAVVAVGYGSENGVDYWIVRNSWGPRWG-EEGYIRMERNLAASKSGKCGI 353
>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
Length = 343
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 92/182 (50%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+EA+A YP+
Sbjct: 159 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 218
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ +C Y+++ S + L F D + + GP+ G++ + + K
Sbjct: 219 K---ATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYK 275
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D + N+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 276 SGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 335
Query: 174 SY 175
SY
Sbjct: 336 SY 337
>gi|157864849|ref|XP_001681133.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124427|emb|CAJ02283.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 92/180 (51%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG+ WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGKDWG-EKGYVRVTMGVNACLLTGY 334
>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
Length = 340
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 93/182 (51%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+EA+A YP+
Sbjct: 156 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 215
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ + +C Y+++ S + L F D + + GP+ G++ + + K
Sbjct: 216 K---AMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYK 272
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D + N+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 273 SGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 174 SY 175
SY
Sbjct: 333 SY 334
>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 447
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 90/174 (51%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+A L LS+ L+ C+ + GC GG + A +++ +++G + E YP+ +
Sbjct: 144 IEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVS 203
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
++G C +V ++ + + + D + L GP+ ++ Y+G ++
Sbjct: 204 EDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 263
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
C SE LNH V++VGY + P WI++NSW WG + GY +E+GTN C
Sbjct: 264 S---CTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWG-EKGYIRIEKGTNQC 313
>gi|37911662|gb|AAR05023.1| cathepsin L-like protein [Tenebrio molitor]
Length = 336
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 93/187 (49%), Gaps = 13/187 (6%)
Query: 2 LESQYAIKHGTLL--PLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
+ESQ I +G +S+ QL++C GC GG N A Y+ ++ G+++E YP+
Sbjct: 154 IESQMKIANGAGYDSSVSEQQLVDCVPNALGCSGGWMNDAFTYVAQNGGIDSEGAYPYEM 213
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGA-LLQDYNGKL 115
+G C YD +V R+S ++ +G D M+ GP+ + Y+G
Sbjct: 214 ADG---NCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDADDPFGSYSGG- 269
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVER-GTNACGIE 173
+ N C + HAV+IVGYG + W+V+NSWG WG DGYF + R N CGI
Sbjct: 270 VYYNPTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGL-DGYFKIARNANNHCGIA 328
Query: 174 SYGGICT 180
+ T
Sbjct: 329 GVASVPT 335
>gi|28932708|gb|AAO60048.1| midgut cysteine proteinase 5 [Rhipicephalus appendiculatus]
Length = 329
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 92/179 (51%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K G L+ LS+ L++C+ N GC+GG + A Y+K + G++ E YP+
Sbjct: 148 LEGQHLLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFNYIKANDGIDTEEGYPYE- 206
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
V G C + V + F+ G D ++ + + P + + Q Y+ +
Sbjct: 207 --AVDGECRFKKEDVGATDTGFVDIPGGIEDDLKKASFCWPPPWLWRSPSSFQLYSEGVY 264
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIES 174
++D C SE L+H V++VGYG++ W+V+NSW D GY + R N CGI S
Sbjct: 265 DESD-CSSEQLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDKNNQCGIAS 322
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 94/177 (53%), Gaps = 19/177 (10%)
Query: 8 IKHGTLLPLSKSQLIECN-IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQNGVTGR 65
I G L+ LS+ +L++C+ YNQGC GG + A Q+ +K+ GL E DYP+R G G+
Sbjct: 184 IVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYR---GFGGK 240
Query: 66 CAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRKNDV 121
C + +V D V +T + Y P+ + G + Q Y + +
Sbjct: 241 CNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGS-- 298
Query: 122 CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA-----CGI 172
C + NL+HAVV VGYG + V WIVRNSWG RWG ++GY +ER A CGI
Sbjct: 299 CGT-NLDHAVVAVGYGSENGVDYWIVRNSWGPRWG-EEGYIRMERNLAASKSGKCGI 353
>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
Length = 463
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 93/177 (52%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ +K G+L+ LS+ +L++C+ + C GG + A + + K G+E E +Y +
Sbjct: 283 IEGQWFLKKGSLVSLSEQELVDCDGVDHACAGGLPSNAYEAIEKLGGIETEQEYSYE--- 339
Query: 61 GVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C++ KV + S + + L GP+ +N +Q Y +
Sbjct: 340 GHKNTCSFSTSKVSAYINSSVEIPKDENEIAAWLAQNGPISIALNAFAMQFYRKGISHPF 399
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+ C ++HAV++VGYG R+ P W ++NSWG WG + GY+ + RGT ACG+ +
Sbjct: 400 RILCNPWMIDHAVLLVGYGERNGTPFWAIKNSWGTDWG-EQGYYYLYRGTGACGMNT 455
>gi|426379977|ref|XP_004056662.1| PREDICTED: pro-cathepsin H [Gorilla gorilla gorilla]
Length = 335
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 209 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|118363827|ref|XP_001015137.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89296904|gb|EAR94892.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 429
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 102/181 (56%), Gaps = 17/181 (9%)
Query: 2 LESQYAIKHGTL-LPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFR 57
+ES A+K G LS+ QL++C NQGC GG ++A +Y+ +AG +E+ DYP++
Sbjct: 159 IESHLALKTGKAPFNLSQQQLVDCAGKFDNQGCDGGLPSRAFEYIAYAGGIESSRDYPYK 218
Query: 58 NQNGVTGRCAYDARKV--KVRVSDFLVFNGSDTFRRMLYHYGPL-VAGMNGALLQDYNGK 114
G G+C + +KV KV+ S + F + L GP+ +A ++Y G
Sbjct: 219 ---GKDGKCKFKPQKVVAKVQSSFNITFQDENELIYHLAKNGPVSIAYQVTDDFENYEGG 275
Query: 115 LIRKNDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
I N C + + +NHAV+ VGY + + +IV+NSWG+ WG D GYF +E G+N CG
Sbjct: 276 -IYSNPECSTDPQEVNHAVLAVGYNLTGRY--YIVKNSWGKDWGMD-GYFYIELGSNMCG 331
Query: 172 I 172
+
Sbjct: 332 L 332
>gi|66730453|ref|NP_001019413.1| cathepsin W precursor [Rattus norvegicus]
gi|62531092|gb|AAH93401.1| Cathepsin W [Rattus norvegicus]
gi|149062072|gb|EDM12495.1| cathepsin W [Rattus norvegicus]
Length = 371
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 95/195 (48%), Gaps = 23/195 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+++ + IK + +S +L++C+ GC GG ++ I L ++GL +E DYPF+
Sbjct: 160 IQTLWRIKTQQFVDVSVQELLDCDRCGNGCNGGFVWDAYITVLNNSGLASEEDYPFQGHQ 219
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
RC D + + DF + + ++ L +GP+ +N LLQ Y +I+
Sbjct: 220 K-PHRCLADKYRKVAWIQDFTMLSSNEQVIAGYLAIHGPITVTINMKLLQYYQKGVIKAT 278
Query: 120 -DVCPSENLNHAVVIVGYGM-----------------RHQVPVWIVRNSWG-RWGPDDGY 160
C +NH+V++VG+G R P WI++NSWG WG + GY
Sbjct: 279 PSTCDPHLVNHSVLLVGFGKEKGGMQTGTLLSHSRKPRRSTPYWILKNSWGAEWG-EKGY 337
Query: 161 FTVERGTNACGIESY 175
F + RG N CGI Y
Sbjct: 338 FRLYRGNNTCGIAKY 352
>gi|403223173|dbj|BAM41304.1| cysteine protease precursor TacP [Theileria orientalis strain
Shintoku]
Length = 463
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 93/181 (51%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES Y I +L LS+ +L+ C + GC+GG + A++Y+K+ G+ + AD P+ +
Sbjct: 286 VESLYKIHTDKVLDLSEQELVNCETKSHGCEGGFGDTALEYVKNKGISSSADVPYHAMDQ 345
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDV 121
+D KV ++ F+V G D + L +V + L Y + N
Sbjct: 346 TCDIKTHD----KVFINSFMVTKGKDVMNKSLVLSPTVVYIAASSELMMYKAGVF--NGA 399
Query: 122 CPSENLNHAVVIVGYGMRHQV--PVWIVRNSWG-RWGPDDGYFTVER---GTNACGIESY 175
C E LNHAV++VG G V W+++NSWG WG +DGY +ER GT+ CG+
Sbjct: 400 CAKE-LNHAVLLVGEGYDDIVGKRYWVIKNSWGPHWG-EDGYVRLERTDKGTDKCGVLDT 457
Query: 176 G 176
G
Sbjct: 458 G 458
>gi|374414520|pdb|3QJ3|A Chain A, Structure Of Digestive Procathepsin L2 Proteinase From
Tenebrio Molitor Larval Midgut
gi|374414521|pdb|3QJ3|B Chain B, Structure Of Digestive Procathepsin L2 Proteinase From
Tenebrio Molitor Larval Midgut
Length = 331
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 93/187 (49%), Gaps = 13/187 (6%)
Query: 2 LESQYAIKHGTLL--PLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
+ESQ I +G +S+ QL++C GC GG N A Y+ ++ G+++E YP+
Sbjct: 149 IESQMKIANGAGYDSSVSEQQLVDCVPNALGCSGGWMNDAFTYVAQNGGIDSEGAYPYEM 208
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGA-LLQDYNGKL 115
+G C YD +V R+S ++ +G D M+ GP+ + Y+G
Sbjct: 209 ADG---NCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDADDPFGSYSGG- 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVER-GTNACGIE 173
+ N C + HAV+IVGYG + W+V+NSWG WG DGYF + R N CGI
Sbjct: 265 VYYNPTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGL-DGYFKIARNANNHCGIA 323
Query: 174 SYGGICT 180
+ T
Sbjct: 324 GVASVPT 330
>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
Length = 340
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 92/182 (50%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+EA+A YP+
Sbjct: 156 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 215
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ +C Y+++ S + L F D + + GP+ G++ + + K
Sbjct: 216 K---ATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYK 272
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D + N+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 273 SGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 174 SY 175
SY
Sbjct: 333 SY 334
>gi|355692920|gb|EHH27523.1| Cathepsin H, partial [Macaca mulatta]
Length = 305
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 97/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 120 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 178
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 179 --GKDGDCKFRPGKAIGFVKDVANITIYAEEAMVEAVALYNPVSFAFE--VTQDF---MM 231
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
K + S + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 232 YKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKN 290
Query: 169 ACGIES 174
CG+ +
Sbjct: 291 MCGLAA 296
>gi|17556460|ref|NP_497627.1| Protein Y71H2AR.2 [Caenorhabditis elegans]
gi|351064196|emb|CCD72484.1| Protein Y71H2AR.2 [Caenorhabditis elegans]
Length = 345
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 101/193 (52%), Gaps = 9/193 (4%)
Query: 2 LESQYA-IKHGTLLPLSKSQLIECNIYN-QGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+ES YA +GTLL S+ QLI+CN +GC+ AI YL G+E EADYP+ ++
Sbjct: 115 IESMYAKATNGTLLSFSEQQLIDCNDQGYKGCEEQFAMNAIGYLATHGIETEADYPYVDK 174
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM-LYHYGPLVAGMNGA-LLQDYNGKLIR 117
+C +D+ K K+ + +V G++ ++ + +YGP M L DY +
Sbjct: 175 TN--EKCTFDSTKSKIHLKKGVVAEGNEVLGKVYVTNYGPAFFTMRAPPSLYDYKIGIYN 232
Query: 118 KN-DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+ + C S + ++VIVGYG+ + WIV+ S+G WG + GY + R NAC + +
Sbjct: 233 PSIEECTSTHEIRSMVIVGYGIEGEQKYWIVKGSFGTSWG-EQGYMKLARDVNACAMATT 291
Query: 176 GGICTRTLNGVFL 188
+ T V +
Sbjct: 292 IAVLTEIFLRVLV 304
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 91/182 (50%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
LE Q K G L+PLS+ QL++C + N GC GG ++A Y+K G E+E YP+
Sbjct: 143 LEGQTFKKTGKLVPLSEQQLVDCSGDYGNMGCGGGWMDQAFSYIKDKGEESEDGYPY--- 199
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG--ALLQDYNGKL 115
G C YDA KV + + D ++ + GP+ ++ + Q Y +
Sbjct: 200 TGTDDTCVYDASKVVATDTGYTDIPEMDENALQQAVATVGPISVAIDATHSSFQFYESGV 259
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWGR-WGPDDGYFTVERGT-NACGI 172
+ + C NL+HAV+ VGYG + + WIV+NSW WG GY + R N CGI
Sbjct: 260 YDEPE-CSQTNLDHAVLAVGYGTSEEGLDYWIVKNSWSTGWGM-QGYIEMSRNKDNQCGI 317
Query: 173 ES 174
S
Sbjct: 318 AS 319
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 94/180 (52%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ K G ++ LS+ L++C+ N GC+GG + A +Y+K + G++ E YP+
Sbjct: 154 LEGQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPY-- 211
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSD-TFRRMLYHYGPLVAGMNGALLQ-DYNGKL 115
NG G C + V + F+ + G++ ++ + GP+ ++ + + +
Sbjct: 212 -NGTDGTCHFKKSDVGATDTGFVDIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQG 270
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
+ C SENL+H V++VGYG + W+V+NSWG D GY + R N CGI S
Sbjct: 271 VYDEPECSSENLDHGVLVVGYGTKDDQDYWLVKNSWGTTWGDGGYIYMTRNKDNQCGIAS 330
>gi|149751227|ref|XP_001490649.1| PREDICTED: cathepsin K-like [Equus caballus]
Length = 329
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q+
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQD 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + + +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPQGNEKALKRAVARVGPVSVAIDASLTSFQFYSRGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|407036599|gb|EKE38251.1| cysteine proteinase, putative [Entamoeba nuttalli P19]
Length = 318
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 82/167 (49%), Gaps = 10/167 (5%)
Query: 14 LPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGVTGRCAYDAR 71
L LS+ QL++C++ N+GC GG + +Y+K G+ E DYP+ C YD +
Sbjct: 145 LDLSEQQLVDCSVSVGNKGCNGGSLLLSFRYVKLNGIMQEKDYPYV---AAEETCTYDKK 201
Query: 72 KVKVRVS-DFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLN 128
KV V+++ LV GS+ R +G Q Y + + C S LN
Sbjct: 202 KVAVKITGQKLVRPGSEKALMRAAAEGPVAAAIDASGVKFQLYKSGIYNSKE-CSSTQLN 260
Query: 129 HAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIES 174
H V +VGYG ++ WIVRNSWG D GY + R N CGI S
Sbjct: 261 HGVAVVGYGTQNGTEYWIVRNSWGTIWGDQGYVLMSRNKNNQCGIAS 307
>gi|67469932|ref|XP_650937.1| cysteine proteinase [Entamoeba histolytica HM-1:IMSS]
gi|1929343|emb|CAA62835.1| cysteine proteinase [Entamoeba histolytica]
gi|56467606|gb|EAL45551.1| cysteine proteinase, putative [Entamoeba histolytica HM-1:IMSS]
gi|449710372|gb|EMD49461.1| cysteine proteinase, putative [Entamoeba histolytica KU27]
Length = 318
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 82/167 (49%), Gaps = 10/167 (5%)
Query: 14 LPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGVTGRCAYDAR 71
L LS+ QL++C++ N+GC GG + +Y+K G+ E DYP+ C YD +
Sbjct: 145 LDLSEQQLVDCSVSVGNKGCNGGSLLLSFRYVKLNGIMQEKDYPYV---AAEETCTYDKK 201
Query: 72 KVKVRVS-DFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLN 128
KV V+++ LV GS+ R +G Q Y + + C S LN
Sbjct: 202 KVAVKITGQKLVRPGSEKALMRAAAEGPVAAAIDASGVKFQLYKSGIYNSKE-CSSTQLN 260
Query: 129 HAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIES 174
H V +VGYG ++ WIVRNSWG D GY + R N CGI S
Sbjct: 261 HGVAVVGYGTQNGTEYWIVRNSWGTIWGDQGYVLMSRNKNNQCGIAS 307
>gi|167394751|ref|XP_001741082.1| cysteine proteinase ACP1 precursor [Entamoeba dispar SAW760]
gi|165894470|gb|EDR22453.1| cysteine proteinase ACP1 precursor, putative [Entamoeba dispar
SAW760]
Length = 308
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 90/177 (50%), Gaps = 9/177 (5%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQ 59
+LE + G L S+ QL++C+ + GC+GG + ++++++ + GL E DYP++
Sbjct: 123 VLEGRVNKDLGKLYSFSEQQLVDCDSSDNGCEGGHPSNSLKFIQENNGLGLETDYPYK-- 180
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGKLI 116
V G C + V V +GS+T + ++ GP+ GM+ + Q Y I
Sbjct: 181 -AVAGTCK-KVKNVATVTGSKRVTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTI 238
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGI 172
+ C S +NH V VGYG WI+RNSWG D GYF + R + N CGI
Sbjct: 239 YSDAKCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTAWGDAGYFLLARDSNNMCGI 295
>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
Length = 340
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 92/182 (50%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN----IYNQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+EA+A YP+
Sbjct: 156 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 215
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ +C Y+++ S + L F D + + GP+ G++ + + K
Sbjct: 216 K---ATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYK 272
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D + N+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 273 SGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 174 SY 175
SY
Sbjct: 333 SY 334
>gi|42564161|gb|AAS20592.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 99/181 (54%), Gaps = 16/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNK-AIQYLKHAGLEAEADYPFRN 58
LE Q AI + +PLS+ QL++C+ N C+ GG A Y+ G+EA++ YP++
Sbjct: 143 LEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDKGIEADSSYPYK- 201
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G+ C YDA+K +++ + V + ++ + GP+ ++ +Q Y+G ++
Sbjct: 202 --GIDTPCQYDAKKTVLKIKGYRNVSISEEELKKAVGTVGPVSVAIDADPIQLYSGGIL- 258
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQV----PVWIVRNSWGR-WGPDDGYFTVER-GTNACG 171
+ + + NLNH V+ VGYG + W V+NSWG+ WG + GYF ++R N CG
Sbjct: 259 -DGLFCTHNLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDWG-EQGYFRIKRDANNLCG 316
Query: 172 I 172
I
Sbjct: 317 I 317
>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 95/182 (52%), Gaps = 12/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LESQ ++ G L LS+ QL++C+ N GC GG ++A QY++ + G+++E+ YP++
Sbjct: 143 LESQTCLRRGYLPSLSEQQLVDCSGSYGNYGCNGGWPDQAFQYIQANGGIDSESYYPYQA 202
Query: 59 QNGVTGRCAYDARKVKVRVS---DFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G C Y++ S D + + + GPL ++ + Q Y +
Sbjct: 203 R---VGTCHYNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLSIAIDASGWQSYQSGV 259
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
ND S+ +HAV++VGYG + W+V+NSWG W + GY + R N CGI +
Sbjct: 260 F--NDPSCSQTADHAVLLVGYGTYNGQDYWLVKNSWGTWWGEQGYIMMTRNANNQCGIAN 317
Query: 175 YG 176
+
Sbjct: 318 HA 319
>gi|350646652|emb|CCD58679.1| Peptidase C1 family [Schistosoma mansoni]
Length = 378
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 92/179 (51%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
LE Q IK GTL PLS QL++C + C + A ++K G+E++ DYPF G
Sbjct: 201 LEGQVKIKTGTLTPLSSQQLVDC-AGDHECVENPVSVAFDFIKQNGVESQQDYPF---TG 256
Query: 62 VTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVA--GMNGALLQDYNGKLIRK 118
G C YD+ K +S ++ V + + ++ +Y+ GP+ M L +G L+
Sbjct: 257 KVGNCTYDSSKKVTTISSYIQVDDNEEELQKAVYNIGPIAVRIAMTQEFLTYGSGVLLI- 315
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIESYG 176
D C +E +V++VGYG+ + +P W+V+ + G D GY + R N C I ++
Sbjct: 316 -DDCQNEEPFESVLVVGYGIENDIPYWLVKFNLGEEFGDHGYIKLARNYKNMCHIANFA 373
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 95/173 (54%), Gaps = 12/173 (6%)
Query: 8 IKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQNGVTGR 65
I G L+ LS+ +L++C+ YNQGC GG + A +++ K+ G++ EADYP++ +G +
Sbjct: 177 IVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQ 236
Query: 66 CAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVA-GMNGALLQDYNGKLIRKNDVCPS 124
+A+ V + + + N + ++ L H VA G Q Y+ + + +C +
Sbjct: 237 NRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVF--DGICGT 294
Query: 125 ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG----TNACGI 172
E L+H VV VGYG + WIVRNSWG RWG + GY + R T CGI
Sbjct: 295 E-LDHGVVAVGYGTENGKDYWIVRNSWGNRWG-ESGYIKMARNIAEPTGKCGI 345
>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
Length = 356
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 90/191 (47%), Gaps = 23/191 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN----------IYNQGCQGGGFNKAIQYL-KHAGLEA 50
+E YA K G L+ LS+ QL++C+ N GC GG + +++ K GL
Sbjct: 164 VEGMYAAKTGKLISLSEQQLVDCDHNCVVWEGEKTCNAGCNGGLMWSSFEHIIKTGGLVT 223
Query: 51 EADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQ 109
E YP+ V RC ++ V++S++ V + D L + GP+ +N LQ
Sbjct: 224 EESYPYE---AVDNRCRFNVSNAVVKISNWTFVSSNEDEMAAWLANNGPIAIAINADYLQ 280
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQV-----PVWIVRNSW-GRWGPDDGYFTV 163
Y K I C E LNH V+IVGYG WIV+NSW WG + GY V
Sbjct: 281 YYR-KGILNPSRCDPEELNHGVLIVGYGEEKAANGKVEKYWIVKNSWSASWG-EKGYVRV 338
Query: 164 ERGTNACGIES 174
RG CG+ +
Sbjct: 339 LRGKGVCGLNA 349
>gi|380798253|gb|AFE71002.1| pro-cathepsin H preproprotein, partial [Macaca mulatta]
Length = 242
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 97/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 57 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 115
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 116 --GKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 168
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
K + S + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 169 YKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKN 227
Query: 169 ACGIES 174
CG+ +
Sbjct: 228 MCGLAA 233
>gi|1460063|emb|CAA60672.1| cysteine protein [Entamoeba dispar]
Length = 307
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 90/177 (50%), Gaps = 9/177 (5%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQ 59
+LE + G L S+ QL++C+ + GC+GG + ++++++ + GL E DYP++
Sbjct: 122 VLEGRVNKDLGKLYSFSEQQLVDCDSSDNGCEGGHPSNSLKFIQENNGLGLETDYPYK-- 179
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGKLI 116
V G C + V V +GS+T + ++ GP+ GM+ + Q Y I
Sbjct: 180 -AVAGTCK-KVKNVATVTGSKRVTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTI 237
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGI 172
+ C S +NH V VGYG WI+RNSWG D GYF + R + N CGI
Sbjct: 238 YSDAKCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTAWGDAGYFLLARDSNNMCGI 294
>gi|13507095|gb|AAK28439.1| cysteine protease 3 precursor [Clonorchis sinensis]
Length = 320
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 95/175 (54%), Gaps = 11/175 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAI-QYLKHAGLEAEADYPFRNQN 60
+E Q+ K LL LS+ QL++C+ ++GC GG +A Q L GL+ ++DYP+
Sbjct: 147 IEGQWFRKTDNLLQLSEQQLLDCDGVDEGCNGGTPQQAFRQILGMGGLQLDSDYPYE--- 203
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFR-RMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G G+C KVKV ++ + + + +ML GPL + +N LQ L
Sbjct: 204 GREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQHPLPAL---- 259
Query: 120 DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
C +++LNHAV+ VGYG ++P W V+NSW ++GYF + RG CGI +
Sbjct: 260 --CDAQSLNHAVLTVGYGKEGRLPYWTVKNSWSTMFGENGYFRIYRGDGTCGINT 312
>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
partial [Trypanosoma vivax Y486]
Length = 323
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 90/174 (51%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+A L LS+ L+ C+ + GC GG + A +++ +++G + E YP+ +
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
++G C +V ++ + + + D + L GP+ ++ Y+G ++
Sbjct: 212 EDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
C SE LNH V++VGY + P WI++NSW WG + GY +E+GTN C
Sbjct: 272 S---CTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWG-EKGYIRIEKGTNQC 321
>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
Length = 327
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 102/182 (56%), Gaps = 13/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K GTL+ LS+ L++C+ N+GC+GG ++A +Y+K + G++ E YP++
Sbjct: 143 LEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCKGGLMDQAFKYIKTNGGIDTEECYPYKG 202
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNG-SDTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
++ +C Y A +S F+ V G D ++ GP+ G++ + Q Y+
Sbjct: 203 RD--ERKCEYKASCSGATLSSFVDVKTGDEDALKQASATIGPISVGIDASHPSFQLYDHG 260
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGI 172
+ + C S+ L+H V++VGYG + W+V+NSWG WG +GY + R N CGI
Sbjct: 261 VYHEKR-CSSKKLDHGVLVVGYGTQSTKDYWLVKNSWGADWGM-EGYIMMSRNKDNQCGI 318
Query: 173 ES 174
+
Sbjct: 319 AT 320
>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
Length = 477
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 92/177 (51%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E + + L+ LS+ +L++C+ +QGC GG N + ++ GLE E YP+ +
Sbjct: 297 VEGAWYLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIMRMGGLEPEDAYPY---D 353
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + + V ++ + + ++ L GP+ G+N LQ Y ++
Sbjct: 354 GKGETCHIVRKDIAVYINGSVELPHDEVKIQKWLVTKGPISIGLNANTLQFYRHGVVHPF 413
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C LNH V+IVGYG + P WIV+NSWG WG + GYF + RG N CG++
Sbjct: 414 KIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPTWG-ESGYFRLYRGKNVCGVQE 469
>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
Length = 478
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 92/177 (51%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E + + L+ LS+ +L++C+ +QGC GG N + ++ GLE E YP+ +
Sbjct: 298 IEGAWFLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPY---D 354
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + + V ++ + + ++ L GP+ G+N LQ Y ++
Sbjct: 355 GRGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPF 414
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C LNH V+IVGYG + P WIV+NSWG WG + GYF + RG N CG++
Sbjct: 415 KIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPTWG-EAGYFKLYRGKNVCGVQE 470
>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
Length = 331
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 94/181 (51%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI---YNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG A QY+ + G+++EA YP++
Sbjct: 148 LEAQLKLKTGKLVSLSAQNLVDCSTEKYSNKGCNGGFMTSAFQYIIDNNGIDSEASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
Q+G +C YD++ S + L F + + + + GP+ ++ + + +
Sbjct: 208 AQDG---KCQYDSKFRAATCSKYTELPFGSEEALKEAVANKGPVSVAIDASHPSFFLYRS 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D + +NH V++VGYG W+V+NSWG D GY + R + N CGI S
Sbjct: 265 GVYYDQSCTLKVNHGVLVVGYGNLDGKDYWLVKNSWGLNFGDKGYIRMARNSGNHCGIAS 324
Query: 175 Y 175
Y
Sbjct: 325 Y 325
>gi|162460343|ref|NP_001105479.1| cysteine protease2 precursor [Zea mays]
gi|1491774|emb|CAA68192.1| cysteine protease [Zea mays]
Length = 360
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C + N GC GG ++A +Y+K+ GL+ E YP++
Sbjct: 176 LEAAYTQATGKPISLSEQQLVDCGLAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQG 235
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG++ + V V+V D + D + + P+ +
Sbjct: 236 VNGIS---KFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGV 292
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+D C + ++NHAV+ VGYG+ VP W+++NSWG WG D+GYF +E G N CG+
Sbjct: 293 YTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWG-DEGYFKMEMGKNMCGVA 351
Query: 174 S 174
+
Sbjct: 352 T 352
>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
Length = 358
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/184 (35%), Positives = 98/184 (53%), Gaps = 18/184 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K+ GL+ E YP+
Sbjct: 173 LEAAYTQATGKGISLSEQQLVDCAYAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYA- 231
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGAL-----LQDYNG 113
GV G C + V V+V + + N + L H LV ++ A + Y G
Sbjct: 232 --GVNGFCHFKPENVGVKVVESV--NITLGAEDELLHAVGLVRPVSIAFEVVSGFRFYKG 287
Query: 114 KLIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNAC 170
+ +D C ++NHAV+ VGYG+ + VP W+++NSWG WG DGYF +E G N C
Sbjct: 288 G-VYTSDTCGRTQMDVNHAVLAVGYGVENGVPYWLIKNSWGEEWGV-DGYFKMELGKNMC 345
Query: 171 GIES 174
GI +
Sbjct: 346 GIAT 349
>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 441
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 84/174 (48%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+A L LS+ L+ C+ + GC GG + A +++ + E YP+ +
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C KV ++ + + + D + L GP+ ++ Y+G ++
Sbjct: 212 GGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
C SE LNH V++VGY + P WI++NSW WG + GY +E+GTN C
Sbjct: 272 S---CTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWG-EKGYIRIEKGTNQC 321
>gi|226476132|emb|CAX72156.1| cathepsin L, a [Schistosoma japonicum]
Length = 331
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCGGGFMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG-ALLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALDSLTMYKSGVF 265
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 266 ESND-CKHADINHGVLVVGYGEEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 323
>gi|380026639|ref|XP_003697053.1| PREDICTED: cathepsin J-like [Apis florea]
Length = 346
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 98/185 (52%), Gaps = 19/185 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
+E Q K G LLPLS+ QL++C+ N GC GG ++YL+ A GL A+ YP++
Sbjct: 168 IEGQIFKKTGMLLPLSEQQLVDCSTSTGNLGCSGGSLRNTLRYLEKAKGLMAKKYYPYKA 227
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
+ G C ++ V ++ + V D + GP+ A +N + Q Y+ K
Sbjct: 228 KQGP---CRFNEDLSVVNITSWAVLPARDEKVLEAAVATIGPIAASINASPKTFQLYH-K 283
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPV-WIVRNSWGR-WGPDDGYFTVERGTNACGI 172
I ++VC S+ +NHA++IVGY P WI++N WG WG ++GY + + N CG+
Sbjct: 284 GIYDDEVCSSDMVNHAMLIVGY-----TPTEWILKNWWGDGWG-ENGYMRLAKNKNRCGV 337
Query: 173 ESYGG 177
+Y
Sbjct: 338 ANYAA 342
>gi|403258371|ref|XP_003921746.1| PREDICTED: pro-cathepsin H [Saimiri boliviensis boliviensis]
Length = 336
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 95/183 (51%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 151 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 209
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
G C + K V D + D + Y P+ + QD+ +
Sbjct: 210 --GKDSDCKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFE--VTQDFMMYKR 265
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N CG
Sbjct: 266 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKNMCG 324
Query: 172 IES 174
+ +
Sbjct: 325 LAA 327
>gi|395535911|ref|XP_003769964.1| PREDICTED: cathepsin K [Sarcophilus harrisii]
Length = 332
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 88/176 (50%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY++ G+++E YP+ Q+
Sbjct: 151 LEGQLKKKTGKLLNLSPQNLVDCVSKNDGCGGGYMTNAFQYVQENRGIDSEDAYPYIGQD 210
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 211 E---SCMYNPTGKAAKCRGYREIPEGSEKALKRAVARVGPVAVAIDASLSSFQFYSKGVY 267
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGI 172
++ C +NLNHAV+ VGYG++ WI++NSWG + GY + R NACGI
Sbjct: 268 YDENCNGDNLNHAVLAVGYGIQRGTKHWIIKNSWGEEWGNKGYILMARNKKNACGI 323
>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 454
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 88/174 (50%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+A L LS+ L+ C+ + GC GG + A +++ +++G + E YP+ +
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C KV ++ + + + D + L GP+ ++ Y+G ++
Sbjct: 212 GGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
C SE LNH V++VGY + P WI++NSW WG + GY +E+GTN C
Sbjct: 272 S---CTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWG-EKGYIRIEKGTNQC 321
>gi|348531519|ref|XP_003453256.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/183 (31%), Positives = 97/183 (53%), Gaps = 14/183 (7%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFR 57
+LE Q+ K G L+ LS+ QL++C+ N GC GG +A+QY++ + G++ E YP++
Sbjct: 150 VLEGQHFRKTGKLVSLSEQQLMDCSHSFGNNGCNGGSVKRALQYIQANGGIDTETSYPYK 209
Query: 58 NQNGVTGRCAYDARKVKVRVSDFLVFNGS--DTFRRMLYHYGPLVAGMNGAL--LQDYNG 113
+ RC Y + + + ++ S +T ++ + GP+ G++ + Q Y
Sbjct: 210 AKGQ---RCRYKPDGIGAKCTGYVHVKPSNEETLKKAVATLGPISVGIDASRHSFQFYQS 266
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACG 171
+ D C L+H + VGYG + W+++NSWG RWG D GY + R +N CG
Sbjct: 267 GVYDDPD-CSKTVLDHGALAVGYGTENGHDYWLIKNSWGLRWG-DKGYIKMSRNKSNQCG 324
Query: 172 IES 174
I S
Sbjct: 325 IAS 327
>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
Length = 478
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 92/177 (51%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E + + L+ LS+ +L++C+ +QGC GG N + ++ GLE E YP+ +
Sbjct: 298 IEGAWFLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPY---D 354
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + + V ++ + + ++ L GP+ G+N LQ Y ++
Sbjct: 355 GRGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPF 414
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C LNH V+IVGYG + P WIV+NSWG WG + GYF + RG N CG++
Sbjct: 415 KIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPTWG-EAGYFKLYRGKNVCGVQE 470
>gi|61200410|gb|AAX39778.1| cathepsin R [Mus musculus]
Length = 335
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/187 (33%), Positives = 97/187 (51%), Gaps = 17/187 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
+E+Q + G L PLS L++C+ N GC GG A QY+ H GLE+EA YP+
Sbjct: 149 IEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGS-DTFRRMLYHYGPLVAGMNGA--LLQDYNGKL 115
G G C Y+ + K ++ F+ S D + GP+ AG++ + ++Y G +
Sbjct: 208 --GKDGPCRYNPKNSKAEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGI 265
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVP----VWIVRNSWG-RWGPDDGYFTVERG-TNA 169
+ + C S+ + H V++VGYG + W+++NSWG RWG GY + + N
Sbjct: 266 YHEPN-CSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGI-RGYMKLAKDKNNH 323
Query: 170 CGIESYG 176
CGI SY
Sbjct: 324 CGIASYA 330
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K+G L+ LS+ L++C+ N GC+GG A +Y+K + G++ E YP+
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
V G C + V + ++ D ++ + GP+ ++ + Q Y+
Sbjct: 208 --AVDGECRFKKEDVGATDTGYVEIKAGCEDDLKKAVATVGPISVAIDASHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIE 173
+ + + C SE+L+H V++VGYG++ W+V+NSW D GY + R N CGI
Sbjct: 266 VYDEPE-CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIA 324
Query: 174 S 174
S
Sbjct: 325 S 325
>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
Length = 475
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 92/177 (51%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E + + L+ LS+ +L++C+ +QGC GG N + ++ GLE E YP+ +
Sbjct: 295 VEGAWFLAKNKLVSLSEQELVDCDGVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPY---D 351
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + + V ++ + + ++ L GP+ G+N LQ Y ++
Sbjct: 352 GKGETCHLVRKDIAVYINGSIELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPF 411
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ C LNH V+IVGYG + P WIV+NSWG WG + GYF + RG N CG++
Sbjct: 412 KIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPTWG-ESGYFKLYRGKNVCGVQE 467
>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
Length = 484
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY +R
Sbjct: 304 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYR--- 360
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 361 GHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINAFGMQFYRHGISRPL 420
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R +P W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 421 RPLCSPWLIDHAVLLVGYGNRSDIPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 476
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 96/183 (52%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K + GL+ E YP+
Sbjct: 174 LEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 233
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRK 118
++G C Y A V V+V D + N + L H LV ++ A + +L +
Sbjct: 234 KDGT---CKYSAENVGVQVLDSV--NITLGAEDELKHAVGLVRPVSIAFEVVKSFRLYKS 288
Query: 119 NDVCPSE------NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACG 171
S ++NHAV+ VGYG+ VP W+++NSWG WG D GYF +E G N CG
Sbjct: 289 GVYTDSHCGNTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWG-DKGYFKMEMGKNMCG 347
Query: 172 IES 174
I +
Sbjct: 348 IAT 350
>gi|226476122|emb|CAX72151.1| cathepsin L, a [Schistosoma japonicum]
Length = 331
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC+GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCEGGYMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA-LLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVAVDSLIMYKSGVF 265
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 266 ESND-CKYGDINHGVLVVGYGNEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 323
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 87/181 (48%), Gaps = 11/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 159 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVS---DFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
NG C+ + ++ V L+ + L GP+ ++ + Y +
Sbjct: 219 GNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGV 278
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIES 174
+ C + LNH V++VGY M +VP W+++NSW G WG + GY V G NAC +
Sbjct: 279 LT---ACIGKQLNHGVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSE 334
Query: 175 Y 175
Y
Sbjct: 335 Y 335
>gi|9931986|ref|NP_064680.1| cathepsin R precursor [Mus musculus]
gi|23813621|sp|Q9JIA9.1|CATR_MOUSE RecName: Full=Cathepsin R; Flags: Precursor
gi|9623188|gb|AAF90051.1|AF245399_1 cathepsin R [Mus musculus]
gi|12837970|dbj|BAB24023.1| unnamed protein product [Mus musculus]
gi|12852278|dbj|BAB29345.1| unnamed protein product [Mus musculus]
gi|16445015|gb|AAK00507.1| cathepsin R precursor [Mus musculus]
gi|71682221|gb|AAI00339.1| Cathepsin R [Mus musculus]
gi|148709367|gb|EDL41313.1| cathepsin R [Mus musculus]
Length = 334
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/187 (33%), Positives = 97/187 (51%), Gaps = 17/187 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
+E+Q + G L PLS L++C+ N GC GG A QY+ H GLE+EA YP+
Sbjct: 148 IEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATYPYE- 206
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGS-DTFRRMLYHYGPLVAGMNGA--LLQDYNGKL 115
G G C Y+ + K ++ F+ S D + GP+ AG++ + ++Y G +
Sbjct: 207 --GKDGPCRYNPKNSKAEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGI 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVP----VWIVRNSWG-RWGPDDGYFTVERG-TNA 169
+ + C S+ + H V++VGYG + W+++NSWG RWG GY + + N
Sbjct: 265 YHEPN-CSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGI-RGYMKLAKDKNNH 322
Query: 170 CGIESYG 176
CGI SY
Sbjct: 323 CGIASYA 329
>gi|157864855|ref|XP_001681136.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124430|emb|CAJ02286.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|56754277|gb|AAW25326.1| unknown [Schistosoma japonicum]
Length = 342
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC GG + A YL+ +E+E DY +
Sbjct: 160 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCGGGFMDHAFNYLESHYIESENDYKYL-- 217
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG-ALLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 218 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALDSLTMYKSGVF 276
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 277 ESND-CKYADINHGVLVVGYGKEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 334
>gi|114658412|ref|XP_001153217.1| PREDICTED: pro-cathepsin H isoform 6 [Pan troglodytes]
gi|397478882|ref|XP_003810764.1| PREDICTED: pro-cathepsin H [Pan paniscus]
gi|12803323|gb|AAH02479.1| Cathepsin H [Homo sapiens]
gi|60655259|gb|AAX32193.1| cathepsin H [synthetic construct]
gi|123979560|gb|ABM81609.1| cathepsin H [synthetic construct]
gi|123994193|gb|ABM84698.1| cathepsin H [synthetic construct]
gi|189054474|dbj|BAG37247.1| unnamed protein product [Homo sapiens]
gi|410254318|gb|JAA15126.1| cathepsin H [Pan troglodytes]
gi|410294916|gb|JAA26058.1| cathepsin H [Pan troglodytes]
gi|410331109|gb|JAA34501.1| cathepsin H [Pan troglodytes]
Length = 335
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 209 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|332252750|ref|XP_003275518.1| PREDICTED: pro-cathepsin H [Nomascus leucogenys]
Length = 335
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 97/183 (53%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
G G C + K V D + + + Y P+ + QD+ +
Sbjct: 209 --GKDGYCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDFMMYRR 264
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N CG
Sbjct: 265 GIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKNMCG 323
Query: 172 IES 174
+ +
Sbjct: 324 LAA 326
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 94/177 (53%), Gaps = 19/177 (10%)
Query: 8 IKHGTLLPLSKSQLIECN-IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQNGVTGR 65
I G L+ LS+ +L++C+ YNQGC GG + A Q+ +K+ GL E DYP+R G G+
Sbjct: 184 IVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYR---GFGGK 240
Query: 66 CAYDARKVKVRVSDFL--VFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRKNDV 121
C + +V D V +T + Y P+ + G + Q Y + +
Sbjct: 241 CNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAIEAGGRIFQHYQSGIFTGS-- 298
Query: 122 CPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA-----CGI 172
C + NL+HAVV VGYG + V WIVRNSWG RWG ++GY +ER A CGI
Sbjct: 299 CGT-NLDHAVVAVGYGSENGVDYWIVRNSWGPRWG-EEGYIRMERNLAASKSGKCGI 353
>gi|1149525|emb|CAA64218.1| preprocathepsin K [Mus musculus]
Length = 329
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY++ G+++E +P+ Q+
Sbjct: 148 LEGQLKKKTGKLLALSPQNLVDCVTENYGCGGGYMTTAFQYVQQNGGIDSEDAFPYVGQD 207
Query: 61 GVTGRCAYDA--RKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+A + K R + +R + GP+ ++ +L + + +
Sbjct: 208 E---SCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C +N+NHAV++VGYG + WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWG-NKGYALLARNKNNACGI 320
>gi|61372279|gb|AAX43816.1| cathepsin H [synthetic construct]
Length = 336
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 209 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K+G L+ LS+ L++C+ N GC+GG A +Y+K + G++ E YP+
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
V G C + V + ++ D ++ + GP+ ++ + Q Y+
Sbjct: 208 --AVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIE 173
+ + + C SE+L+H V++VGYG++ W+V+NSW D GY + R N CGI
Sbjct: 266 VYDEPE-CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIA 324
Query: 174 S 174
S
Sbjct: 325 S 325
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 94/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K+G L+ LS+ L++C+ N GC+GG A +Y+K + G++ E YP+
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
V G C + V + ++ D ++ + GP+ ++ + Q Y+
Sbjct: 208 --AVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIE 173
+ + + C SE+L+H V++VGYG++ W+V+NSW D GY + R N CGI
Sbjct: 266 VYDEPE-CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIA 324
Query: 174 S 174
S
Sbjct: 325 S 325
>gi|7271889|gb|AAF44675.1|AF239264_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 95/183 (51%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNYGCMGGLMENAYEYLKQFGLETESSYPY--- 197
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
V G+C Y+ + +V+D+ V +GS+ + ++ GP ++ + Y+G I
Sbjct: 198 TAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESDFTMYSGG-I 256
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESY 175
++ C S +NHAV+ VGYG + WIV+NSWG + GY + R N CGI S
Sbjct: 257 YQSRTCSSLRVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASL 316
Query: 176 GGI 178
+
Sbjct: 317 ASL 319
>gi|402875039|ref|XP_003901328.1| PREDICTED: pro-cathepsin H [Papio anubis]
Length = 335
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 97/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 209 --GKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
K + S + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|391339556|ref|XP_003744114.1| PREDICTED: counting factor associated protein D-like [Metaseiulus
occidentalis]
Length = 563
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 94/183 (51%), Gaps = 14/183 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E YA KHG L+ S+ QLI+C+ N GC GG +A QY+ GL + +Y
Sbjct: 377 IEGMYARKHGKLVRFSEQQLIDCSWKFGNGGCDGGQDYQAYQYIMQHGLSTDKEY--GAY 434
Query: 60 NGVTGRCAYDARKVKVRVSDFLVF---NGSDTFRRMLYHYGPLVAGMNGAL--LQDYNGK 114
G+ G+C +D +K + L + G + +R + GP+ G+ AL L Y+
Sbjct: 435 MGIDGKC-HDGPALKRELPTLLGYVNVTGENDLKRAVAFVGPISVGIFAALPSLSFYHTG 493
Query: 115 LIRKNDVCPS--ENLNHAVVIVGYGMRHQ-VPVWIVRNSWGRWGPDDGYFTVERGTNACG 171
+ D C + +L+HAV+ VGYG+ H+ WIV+NSW DDGY + N CG
Sbjct: 494 IFNDKD-CKNGLADLDHAVLAVGYGVSHEGEAFWIVKNSWSTLWGDDGYVKIAMKNNICG 552
Query: 172 IES 174
+ +
Sbjct: 553 VTT 555
>gi|401430127|ref|XP_003886478.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491231|emb|CBZ41048.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 375
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 86/180 (47%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 91 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVS 150
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D V GS L GP+ ++ + Y ++
Sbjct: 151 GNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 210
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 211 T---ACIGKQLNHGVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 266
>gi|321476446|gb|EFX87407.1| hypothetical protein DAPPUDRAFT_312322 [Daphnia pulex]
Length = 334
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/168 (35%), Positives = 94/168 (55%), Gaps = 11/168 (6%)
Query: 16 LSKSQLIECNIYNQ--GCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGVTGRCAY-DARK 72
LS+ Q+++C+ + GC+GG A +Y+ G+ + YP++ GV C Y D+ K
Sbjct: 168 LSEQQVLDCDRTDMSIGCRGGWPWDAWEYMSTNGIARTSVYPYK---GVDSVCKYVDSMK 224
Query: 73 V-KVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGAL-LQDYNGKLIRKNDVCPSENLNHA 130
V VR +++ + L ++GPLVA M DY + + +C + +NHA
Sbjct: 225 VTSVRAYNYVESRNVADMQYALTNFGPLVAAMTVVQSFMDY-ASGVYDDKICDGKLVNHA 283
Query: 131 VVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYGG 177
VV+VG+G ++ + WI RNSWG WG +GYF ++RG N C IE+Y G
Sbjct: 284 VVLVGWGNQNGIDYWIGRNSWGPGWG-KEGYFLIQRGVNKCQIETYVG 330
>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
Length = 327
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 99/182 (54%), Gaps = 13/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K GTL+ LS+ L++C+ N+GCQGG ++A +Y+K + G++ E YP++
Sbjct: 143 LEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCQGGLMDQAFKYIKTNGGIDTEECYPYKG 202
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFN--GSDTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
+N +C Y + +S ++ D + GP+ G++ + Q Y+
Sbjct: 203 KN--ERKCEYKSSCSGATLSSYVDIKTGDEDALMQASATIGPISVGIDASHPSFQLYDHG 260
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGI 172
+ + C S+ L+H V++VGYG + W+V+NSWG WG +GY + R N CGI
Sbjct: 261 VYHEKR-CSSKKLDHGVLVVGYGTDGEKDYWLVKNSWGEEWGM-EGYIKMSRNKDNQCGI 318
Query: 173 ES 174
+
Sbjct: 319 AT 320
>gi|432114312|gb|ELK36240.1| Aryl hydrocarbon receptor nuclear translocator [Myotis davidii]
Length = 897
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ ++ G+++E YP+ Q+
Sbjct: 716 LEGQLMKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQD 775
Query: 61 GVTGRCAYDA--RKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + ++ + GP+ ++ +L + K +
Sbjct: 776 E---SCMYNPTGKAAKCRGYKEIPEGNEKALKKAVARVGPISVAIDASLSSFQFYSKGVY 832
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 833 YDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWG-NKGYILMARNKNNACGI 888
>gi|378943046|gb|AFC76264.1| cathepsin L-like protease [Leishmania major]
gi|378943056|gb|AFC76269.1| cathepsin L-like protease [Leishmania major]
gi|394331745|gb|AFN27095.1| cysteine protease [Leishmania major]
Length = 348
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|60827884|gb|AAX36817.1| cathepsin H [synthetic construct]
Length = 336
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 209 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
Length = 330
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 96/180 (53%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 148 LEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYK- 206
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+ +C YD++ S + L + D + + + GP+ G++ + + +
Sbjct: 207 --AMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHSSFFLYRSG 264
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESY 175
D ++N+NH V+++GYG + W+V+NSWG + GY + R N CGI SY
Sbjct: 265 VYYDPACTQNVNHGVLVIGYGDLNGEEYWLVKNSWGSNFGERGYIRMARNKGNHCGIASY 324
>gi|444724527|gb|ELW65130.1| Cathepsin W [Tupaia chinensis]
Length = 491
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/189 (30%), Positives = 99/189 (52%), Gaps = 18/189 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+E+Q+ I++ + +S +L++C GC+GG ++ I L ++GL +E DYP+++ N
Sbjct: 287 IEAQWGIRYNQSVKVSVQELLDCGRCGDGCKGGWVWDAFITVLNNSGLASEKDYPYQS-N 345
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTF-RRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
RC KV + DF++ ++ + L +GP+ +N L+ Y +
Sbjct: 346 VDPQRCRVKRNKV-AWIQDFIMLQDNEQIIAQYLASHGPITVTINMKPLKQYRKGVFEAT 404
Query: 120 D-VCPSENLNHAVVIVGYGMRHQV-----------PVWIVRNSWG-RWGPDDGYFTVERG 166
C ++H+V++VG+G V P WI++NSWG +WG + GYF + RG
Sbjct: 405 PATCDPWLVDHSVLLVGFGSSKSVKGMRAGTASSKPYWILKNSWGAKWG-EKGYFRLHRG 463
Query: 167 TNACGIESY 175
+N CGI Y
Sbjct: 464 SNTCGIAKY 472
>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
Length = 460
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY +R
Sbjct: 280 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYR--- 336
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 337 GHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 396
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R +P W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 397 RPLCSPWLIDHAVLLVGYGNRSDIPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 452
>gi|167427523|gb|ABZ80398.1| cathepsin L3, partial [Fasciola hepatica]
Length = 306
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 85/182 (46%), Gaps = 8/182 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY K S+ QL++C N GC GG A +YLK++GLE +DYP++
Sbjct: 121 IEGQYLRKFQNQTLFSEQQLVDCTRRFGNHGCGGGWMENAYKYLKNSGLETASDYPYQ-- 178
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFR--RMLYHYGPLVAGMNGALLQDYNGKLIR 117
G +C Y +V+ + D + +M+ GP ++ I
Sbjct: 179 -GWEYQCQYRKELGVAKVTGAYTVHSGDEMKLMQMVGREGPAAVAVDAQSDFYMYESGIF 237
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIESYG 176
++ C S ++ HAV+ VGYG WI++NSWG+W +DGY R N C I S
Sbjct: 238 QSQTCTSRSVTHAVLAVGYGTESGTDYWILKNSWGKWWGEDGYMRFARNRNNMCAIASVA 297
Query: 177 GI 178
+
Sbjct: 298 SV 299
>gi|157864853|ref|XP_001681135.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|157864857|ref|XP_001681137.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124429|emb|CAJ02285.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124431|emb|CAJ02287.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|115532742|ref|NP_001040887.1| Protein Y71H2AM.25 [Caenorhabditis elegans]
gi|373220609|emb|CCD73875.1| Protein Y71H2AM.25 [Caenorhabditis elegans]
Length = 299
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/187 (32%), Positives = 100/187 (53%), Gaps = 11/187 (5%)
Query: 2 LESQYA-IKHGTLLPLSKSQLIECNIYN-QGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+ES YA +G+LL S+ QLI+C+ + +GC+ A+ Y G+E EADYP+ +
Sbjct: 115 IESMYAKATNGSLLSFSEQQLIDCDDHGFKGCEEQPAINAVSYFIFHGIETEADYPYAGK 174
Query: 60 NGVTGRCAYDARKVKVRVSD--FLVFNGSDTFRRMLYHYGPLVAGMNGA-LLQDYNGKLI 116
G+C +D+ K K+++ D F+V N + + ++ +YGP M L DY +
Sbjct: 175 EN--GKCTFDSTKSKIQLKDAEFVVSNETQG-KELVTNYGPAFFTMRAPPSLYDYKIGIY 231
Query: 117 RKN-DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ + C S + ++VIVGYG+ WIV+ S+G WG + GY + R NAC +
Sbjct: 232 NPSIEECTSTHEIRSMVIVGYGIEGVQKYWIVKGSFGTSWG-EQGYMKLARDVNACAMAD 290
Query: 175 YGGICTR 181
+ + T
Sbjct: 291 FITVPTE 297
>gi|302790930|ref|XP_002977232.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
gi|300155208|gb|EFJ21841.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
Length = 353
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 98/182 (53%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES +A G ++ LS+ QL++C N GC GG ++A +Y+++ GL+ E YP+
Sbjct: 165 LESAHAQATGKMVVLSEQQLVDCAGGYNNFGCNGGLPSQAFEYIRYNGGLDTEDSYPY-- 222
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYN--GK 114
G G+C Y+ + +V D V N ++ L H ++ A +L+D+
Sbjct: 223 -TGHDGKCTYNQNSIGAKVYD--VVNITEGAEDELIHAVAFNRPVSIAYEVLKDFRFYKS 279
Query: 115 LIRKNDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ ++VC + + +NHAV+ VGY VP WI++NSWG DGYF +E G N CGI
Sbjct: 280 GVYTSNVCGTGPDTVNHAVLAVGYNRDAPVPYWIIKNSWGESFGLDGYFYMEMGKNMCGI 339
Query: 173 ES 174
+
Sbjct: 340 AT 341
>gi|226476108|emb|CAX72144.1| cathepsin L, a [Schistosoma japonicum]
Length = 331
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC+GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCEGGYMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA-LLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALDSLIMYKSGVF 265
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C +NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 266 ESND-CKYAGINHGVLVVGYGKEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 323
>gi|157864847|ref|XP_001681132.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124426|emb|CAJ02282.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|355778231|gb|EHH63267.1| Cathepsin H, partial [Macaca fascicularis]
Length = 305
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 97/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 120 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 178
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 179 --GKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 231
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
K + S + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 232 YKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGM-NGYFLIERGKN 290
Query: 169 ACGIES 174
CG+ +
Sbjct: 291 MCGLAA 296
>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
Length = 367
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 88/174 (50%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+A L LS+ L+ C+ + GC GG + A +++ +++G + E YP+ +
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C KV ++ + + + D + L GP+ ++ Y+G ++
Sbjct: 212 GGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
C SE LNH V++VGY + P WI++NSW WG + GY +E+GTN C
Sbjct: 272 S---CTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWG-EKGYIRIEKGTNQC 321
>gi|394331739|gb|AFN27092.1| cysteine protease [Leishmania major]
Length = 348
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|378943048|gb|AFC76265.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|23110955|ref|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens]
gi|288558851|sp|P09668.4|CATH_HUMAN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|119619549|gb|EAW99143.1| cathepsin H [Homo sapiens]
Length = 335
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 209 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
cysteine proteinase A-1; Flags: Precursor
gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 354
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 96/179 (53%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA---GLEAEADYPFRN 58
+E Q+A +L+ LS+ L+ C+ ++GC GG ++A+ ++ + + EA YP+ +
Sbjct: 162 IEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTS 221
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C +D +V +++ FL + + + GP+ ++ Q Y G ++
Sbjct: 222 GGGTRPPC-HDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVV- 279
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+C + +LNH V+IVG+ + P WIV+NSWG WG + GY + G+N C +++Y
Sbjct: 280 --SLCLAWSLNHGVLIVGFNKNAKPPYWIVKNSWGSSWG-EKGYIRLAMGSNQCMLKNY 335
>gi|157862759|gb|ABV90502.1| cathepsin L, partial [Fasciola gigantica]
Length = 280
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 94/184 (51%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+R
Sbjct: 95 MEGQYMKNQRTSISFSEQQLVDCSGPWGNMGCSGGLMENAYEYLKQFGLETESSYPYR-- 152
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN--GALLQDYNGKL 115
V G+C Y+ + V+V+ + V +GS+ + ++ GP ++ + +G
Sbjct: 153 -AVEGQCRYNRQLGVVKVTGYYTVHSGSEVGLKNLVGAEGPAAVAVDVESDFMMYRSG-- 209
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
I ++ C LNHAV+ VGYG + WIV+NSWG + GY + R N CGI S
Sbjct: 210 IYQSQTCSPFGLNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIAS 269
Query: 175 YGGI 178
+
Sbjct: 270 MASL 273
>gi|48145879|emb|CAG33162.1| CTSH [Homo sapiens]
Length = 335
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 209 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|157864851|ref|XP_001681134.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124428|emb|CAJ02284.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|378943050|gb|AFC76266.1| cathepsin L-like protease [Leishmania major]
gi|378943052|gb|AFC76267.1| cathepsin L-like protease [Leishmania major]
gi|378943054|gb|AFC76268.1| cathepsin L-like protease [Leishmania major]
gi|378943058|gb|AFC76270.1| cathepsin L-like protease [Leishmania major]
gi|394331737|gb|AFN27091.1| cysteine protease [Leishmania major]
gi|394331741|gb|AFN27093.1| cysteine protease [Leishmania major]
gi|394331747|gb|AFN27096.1| cysteine protease [Leishmania major]
gi|394331749|gb|AFN27097.1| cysteine protease [Leishmania major]
gi|394331751|gb|AFN27098.1| cysteine protease [Leishmania major]
gi|394331753|gb|AFN27099.1| cysteine protease [Leishmania major]
gi|394331755|gb|AFN27100.1| cysteine protease [Leishmania major]
gi|394331757|gb|AFN27101.1| cysteine protease [Leishmania major]
gi|394331759|gb|AFN27102.1| cysteine protease [Leishmania major]
gi|394331761|gb|AFN27103.1| cysteine protease [Leishmania major]
gi|394331763|gb|AFN27104.1| cysteine protease [Leishmania major]
gi|394331765|gb|AFN27105.1| cysteine protease [Leishmania major]
gi|394331767|gb|AFN27106.1| cysteine protease [Leishmania major]
gi|394331769|gb|AFN27107.1| cysteine protease [Leishmania major]
gi|394331771|gb|AFN27108.1| cysteine protease [Leishmania major]
gi|394331773|gb|AFN27109.1| cysteine protease [Leishmania major]
gi|394331775|gb|AFN27110.1| cysteine protease [Leishmania major]
gi|394331777|gb|AFN27111.1| cysteine protease [Leishmania major]
gi|394331779|gb|AFN27112.1| cysteine protease [Leishmania major]
gi|394331781|gb|AFN27113.1| cysteine protease [Leishmania major]
gi|394331783|gb|AFN27114.1| cysteine protease [Leishmania major]
gi|394331785|gb|AFN27115.1| cysteine protease [Leishmania major]
gi|394331787|gb|AFN27116.1| cysteine protease [Leishmania major]
gi|394331789|gb|AFN27117.1| cysteine protease [Leishmania major]
gi|394331791|gb|AFN27118.1| cysteine protease [Leishmania major]
gi|394331793|gb|AFN27119.1| cysteine protease [Leishmania major]
gi|394331795|gb|AFN27120.1| cysteine protease [Leishmania major]
gi|394331797|gb|AFN27121.1| cysteine protease [Leishmania major]
gi|394331799|gb|AFN27122.1| cysteine protease [Leishmania major]
gi|394331801|gb|AFN27123.1| cysteine protease [Leishmania major]
gi|394331803|gb|AFN27124.1| cysteine protease [Leishmania major]
Length = 348
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|1749812|emb|CAA90237.1| cysteine proteinase LmCPB1 [Leishmania mexicana]
Length = 359
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 88/180 (48%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 159 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ ++ V D V GS L GP+ ++ + Y ++
Sbjct: 219 GNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + +NHAV++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 279 T---ACIGKQVNHAVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 334
>gi|256535829|gb|ACU82389.1| cathepsin L 1 [Pheronema raphanus]
Length = 328
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 95/184 (51%), Gaps = 21/184 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
LE QY I + LL S+S+L++C+ N GC+GG + A +Y + E E+DYP+ +
Sbjct: 148 LEGQYFINNDKLLSFSESELVDCSRRYGNNGCKGGLMDNAFRYWEVYKEELESDYPYVAK 207
Query: 60 NGVTGRCAYDARKVKVRVSD------FLVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDY 111
+G C Y K +S F + D R + GP+ M+ + Q Y
Sbjct: 208 DG---PCRYSQDKGVTTISSYKNVPHFSQISLQDAVRTI----GPISVAMDASHKSFQLY 260
Query: 112 NGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNAC 170
+ + +++ C L+H V++VGYG + P W+V+NSWG WG DGYF + N C
Sbjct: 261 HSGVYSESE-CSQTKLDHGVLVVGYGTSSE-PFWLVKNSWGAGWGM-DGYFEIAMRNNMC 317
Query: 171 GIES 174
G+E+
Sbjct: 318 GLET 321
>gi|1848231|gb|AAB48120.1| cathepsin L-like protease [Leishmania major]
Length = 443
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|126021|sp|P25775.1|LMCPA_LEIME RecName: Full=Cysteine proteinase A; Flags: Precursor
gi|9573|emb|CAA44094.1| cysteine proteinase [Leishmania mexicana]
Length = 354
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 96/179 (53%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA---GLEAEADYPFRN 58
+E Q+A +L+ LS+ L+ C+ ++GC GG ++A+ ++ + + EA YP+ +
Sbjct: 162 IEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTS 221
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C +D +V +++ FL + + + GP+ ++ Q Y G ++
Sbjct: 222 GGGTRPPC-HDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVV- 279
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+C + +LNH V+IVG+ + P WIV+NSWG WG + GY + G+N C +++Y
Sbjct: 280 --SLCLAWSLNHGVLIVGFNKNAKPPYWIVKNSWGSSWG-EKGYIRLAMGSNQCMLKNY 335
>gi|114796864|gb|ABI79444.1| cysteine proteinase 5 [Entamoeba histolytica]
Length = 284
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 82/160 (51%), Gaps = 9/160 (5%)
Query: 14 LPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGVTGRCAYDAR 71
L LS+ QL++C++ N+GC GG + +Y+K G+ E DYP+ C YD +
Sbjct: 129 LDLSEQQLVDCSVSVGNKGCNGGSLLLSFRYVKLNGIMQEKDYPYV---AAEETCTYDKK 185
Query: 72 KVKVRVS-DFLVFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIRKNDVCPSENLN 128
KV V+++ LV GS+ GP+VA ++ G Q Y + + C S LN
Sbjct: 186 KVAVKITGQKLVRPGSEKALMRAAAEGPVVAAIDASGVKFQLYKSGIYNSKE-CSSTQLN 244
Query: 129 HAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTN 168
H V +VGYG ++ WIVRNS G D GY + R N
Sbjct: 245 HGVAVVGYGTQNGTEYWIVRNSCGTIWGDQGYVLMSRNKN 284
>gi|394331735|gb|AFN27090.1| cysteine protease [Leishmania major]
Length = 348
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|54020916|ref|NP_001005702.1| cathepsin K (pycnodysostosis) precursor [Xenopus (Silurana)
tropicalis]
gi|49671274|gb|AAH75275.1| cathepsin K (pycnodysostosis) [Xenopus (Silurana) tropicalis]
Length = 329
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 90/178 (50%), Gaps = 8/178 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
LE Q K G L+ LS L++C+ N GC+GG A Y++ + G++++A+YP+ Q+
Sbjct: 148 LEGQLMKKTGKLVSLSPQNLVDCDTDNYGCEGGYMTNAFGYVRDNGGIDSDAEYPYVGQD 207
Query: 61 GVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + + +R + + GP+ ++ +L + K +
Sbjct: 208 E---GCHYNPADKAATCKGYKEIPVGSEKALKRAVANVGPVSVSIDASLPSFQFYKKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIES 174
+ C + +NHAV++VGYG + WI++NSWG W GY + R NACGI S
Sbjct: 265 YDSSCNPDAVNHAVLVVGYGNEKGIKHWIIKNSWGDWWGKKGYVLLARDKKNACGIAS 322
>gi|20136379|gb|AAM11647.1|AF490984_1 cathepsin L, partial [Fasciola hepatica]
Length = 311
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 93/185 (50%), Gaps = 12/185 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A QYLK GLE E+ YP+
Sbjct: 126 MEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPY--- 182
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN--GALLQDYNGKL 115
V G+C Y+ + +V+ + V +GS+ + ++ GP ++ + +G
Sbjct: 183 TAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYRSG-- 240
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
I ++ C +NHAV+ VGYG + WIV+NSWG + + GY + R N CGI S
Sbjct: 241 IYQSQTCSPLRVNHAVLAVGYGTQDGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIAS 300
Query: 175 YGGIC 179
+
Sbjct: 301 LASVA 305
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 95/173 (54%), Gaps = 12/173 (6%)
Query: 8 IKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQNGVTGR 65
I G L+ LS+ +L++C+ YNQGC GG + A +++ K+ G++ EADYP++ +G +
Sbjct: 177 IVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQ 236
Query: 66 CAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVA-GMNGALLQDYNGKLIRKNDVCPS 124
+A+ V + + + N + ++ L H VA G Q Y+ + + +C +
Sbjct: 237 NRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVF--DGLCGT 294
Query: 125 ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA----CGI 172
E L+H VV VGYG + WIVRNSWG RWG + GY + R A CGI
Sbjct: 295 E-LDHGVVAVGYGTENGKDYWIVRNSWGNRWG-ESGYIKMARNIEAPTGKCGI 345
>gi|341888719|gb|EGT44654.1| hypothetical protein CAEBREN_19265 [Caenorhabditis brenneri]
Length = 396
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 92/179 (51%), Gaps = 8/179 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ESQYAI+ GTL LS+ +L++C+ + GC GG A++++ GLE E DYP+
Sbjct: 214 VESQYAIRKGTLWSLSEQELVDCDGASYGCSGGFLTSALEFILGNGLETEDDYPYTATK- 272
Query: 62 VTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMNG--ALLQDYNGKLIRK 118
+C + K +V + + + + D + + GP+ M + + +NG
Sbjct: 273 -HDQCWINGDKTRVWIDEGYQLTMNEDDIAEWVANVGPVSFAMRAPYSFIAYHNGIYSPS 331
Query: 119 NDVCPSENLNHAVV-IVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C E + + ++ I+GYG WIV+NSWG WG + GY + RG N C + +Y
Sbjct: 332 EYQCKHEAMGYVMMAIIGYGQEGGQNYWIVKNSWGDSWG-NQGYMRLARGVNTCEMANY 389
>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
Length = 381
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY +R
Sbjct: 201 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYR--- 257
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 258 GHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINAFGMQFYRHGISRPL 317
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R +P W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 318 RPLCSPWLIDHAVLLVGYGNRSDIPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 373
>gi|189233776|ref|XP_001814509.1| PREDICTED: similar to CG5367 CG5367-PA [Tribolium castaneum]
gi|270015148|gb|EFA11596.1| cathepsin K precursor [Tribolium castaneum]
Length = 330
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 98/187 (52%), Gaps = 21/187 (11%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAG-LEAEADYPF- 56
+L++Q + L+PLS+ Q+++C++ N GC GG ++YL+ AG L +DYP+
Sbjct: 151 VLQAQIFKQTEKLVPLSEQQIVDCSVSMGNYGCGGGSLRNTLRYLEKAGGLMTYSDYPYL 210
Query: 57 -RNQNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA--LLQDY 111
R Q RC +D + V ++ + V D + GP+ A +N + Q Y
Sbjct: 211 ARQQ-----RCRFDKHRAIVNLTTWAVLPARDERALELAVAKIGPVAASINASPHTFQLY 265
Query: 112 NGKLIRKNDV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNAC 170
+ + +DV C S ++NHA++IVGY WI++N WG+ + GY + RG N C
Sbjct: 266 HSGVY--DDVACSSNHVNHAMLIVGYTKN----AWILKNWWGKHWGEKGYMRLRRGKNRC 319
Query: 171 GIESYGG 177
GI +Y
Sbjct: 320 GIANYAA 326
>gi|310751866|gb|ADP09371.1| cathepsin L-like proteinase [Fasciola hepatica]
Length = 326
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 96/184 (52%), Gaps = 12/184 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+R
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYEYLKQFGLETESSYPYR-- 198
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
V G+C Y+ + +V+ + V +GS+ + ++ GP ++ + Y+G I
Sbjct: 199 -AVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYSGG-I 256
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIES 174
++ C LNHAV+ VGYG + WIV+NSWG WG + GY + R N CGI S
Sbjct: 257 YQSQTCSPLGLNHAVLAVGYGTQGGTDYWIVKNSWGLSWG-ERGYIRMARNRGNMCGIAS 315
Query: 175 YGGI 178
+
Sbjct: 316 LASL 319
>gi|330376140|gb|AEC13302.1| cathepsin H [Gallus gallus]
Length = 329
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 86/180 (47%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G LL L++ L++C N GC GG ++A +Y L + GL E YP+R
Sbjct: 144 LESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRA 203
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
QNG C + K V D + D + + P+ + K +
Sbjct: 204 QNGT---CKFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGV 260
Query: 117 RKNDVCP--SENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
N C + +NHAV+ VGYG P WIV+NSWG DGYF +ERG N CG+ +
Sbjct: 261 YSNPRCEHTPDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAA 320
>gi|394331743|gb|AFN27094.1| cysteine protease [Leishmania major]
Length = 348
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 GNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWG-EKGYVRVTMGVNACLLTGY 334
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 86/180 (47%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 159 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D V GS L GP+ ++ + Y ++
Sbjct: 219 GNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 279 T---ACIGKQLNHGVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 334
>gi|34811401|pdb|1M6D|A Chain A, Crystal Structure Of Human Cathepsin F
gi|34811402|pdb|1M6D|B Chain B, Crystal Structure Of Human Cathepsin F
Length = 214
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 98/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 34 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 90
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV + D + + ++ L GP+ +N +Q Y + R
Sbjct: 91 GHMQSCQFSAEKAKVYIQDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 150
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 151 RPLCSPWLIDHAVLLVGYGQRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 206
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 99/184 (53%), Gaps = 21/184 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN-IYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQ 59
+E+ I G L+ LS+ +L++C+ +N+GC GG + A +++ ++ G++ E DYP++
Sbjct: 158 VEAINKIVTGKLVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYK-- 215
Query: 60 NGVTGRCAYDARKVKVRV----SDFLVFNGSDTFRRMLYHYGPLVA-GMNGALLQDYNGK 114
G GRC + KV D +N + ++ ++H VA G LQ Y
Sbjct: 216 -GFEGRCDPTRKNAKVVSIDGYEDVPAYN-ENALKKAVFHQPVSVAIEAGGRALQLYQSG 273
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVER-----GTN 168
+ C + NL+H VV+VGYG + V W+VRNSWG WG +DGYF +ER T
Sbjct: 274 VFTGR--CGT-NLDHGVVVVGYGFENGVDYWLVRNSWGTNWG-EDGYFKLERNVKKINTG 329
Query: 169 ACGI 172
CGI
Sbjct: 330 KCGI 333
>gi|401430108|ref|XP_003879535.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491914|emb|CBZ40911.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 359
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 88/180 (48%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 159 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ ++ V D V GS L GP+ ++ + Y ++
Sbjct: 219 GNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + +NHAV++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 279 T---ACIGKQVNHAVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 334
>gi|296213765|ref|XP_002753411.1| PREDICTED: pro-cathepsin H [Callithrix jacchus]
Length = 336
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 95/183 (51%), Gaps = 16/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 151 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQ- 209
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN--GK 114
G C + K V D + D + Y P+ + QD+ +
Sbjct: 210 --GKDSDCKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFE--VTQDFMMYKR 265
Query: 115 LIRKNDVC--PSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACG 171
I + C + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N CG
Sbjct: 266 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKNMCG 324
Query: 172 IES 174
+ +
Sbjct: 325 LAA 327
>gi|29708|emb|CAA30428.1| cathepsin H [Homo sapiens]
Length = 248
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 63 LESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 121
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 122 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 174
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 175 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKN 233
Query: 169 ACGIES 174
CG+ +
Sbjct: 234 MCGLAA 239
>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
Length = 473
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 89/174 (51%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ K G LL LS+ +L++C+ +Q C GG + A + +++ GLE E DY +
Sbjct: 293 IEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEAIENLGGLETETDYSY---T 349
Query: 61 GVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + KV + S + L GP+ A +N +Q Y +
Sbjct: 350 GHKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPL 409
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ C ++HAV++VG+G R+ VP W ++NSWG + GY+ + RG+ CGI
Sbjct: 410 KIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQGYYYLYRGSGLCGI 463
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 95/181 (52%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 148 LEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
V G+C YD++ S + L F + + + GP+ ++ + +
Sbjct: 208 ---AVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRS 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D ++N+NH V++VGYG + W+V+NSWG D GY + R + N CGI +
Sbjct: 265 GVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYIRMARNSENHCGIAN 324
Query: 175 Y 175
Y
Sbjct: 325 Y 325
>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
Length = 473
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 89/174 (51%), Gaps = 6/174 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ K G LL LS+ +L++C+ +Q C GG + A + +++ GLE E DY +
Sbjct: 293 IEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEAIENLGGLETETDYSY---T 349
Query: 61 GVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + KV + S + L GP+ A +N +Q Y +
Sbjct: 350 GHKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPL 409
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ C ++HAV++VG+G R+ VP W ++NSWG + GY+ + RG+ CGI
Sbjct: 410 KIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQGYYYLYRGSGLCGI 463
>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
Length = 485
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 304 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 360
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 361 GHMQSCNFSAEKAKVYINDSMELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 420
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 421 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 476
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 94/182 (51%), Gaps = 17/182 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE Q+ + G L+ LS+ QL++C+ N+GC GG + A +Y+K GLE E DYP+
Sbjct: 176 LEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIGGLEGEDDYPYTA 235
Query: 59 QNGVTGRCAYDARKVKVRVSDF----LVFNGSDTFRRMLYHYGPLVAGMNG--ALLQDYN 112
+ G +C +K + +D + D + L GP+ ++ A Q Y+
Sbjct: 236 KQG---KCHL--KKSLFKANDTGCTDVESGDEDALKDALASVGPISVAIDASHASFQSYD 290
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQV-PVWIVRNSWGRWGPDDGYFTVERGT-NAC 170
G + + + C S+NL+H V+ VGYG W+V+NSWG ++GY + R N C
Sbjct: 291 GGVYDEEE-CSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSRNKDNQC 349
Query: 171 GI 172
GI
Sbjct: 350 GI 351
>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 92/181 (50%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE+ Y G + LS+ QL++C N GC GG ++A +Y+K + GL+ E YP+
Sbjct: 173 LEAAYHQAFGKGISLSEQQLVDCARAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTG 232
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
++ C + + V VRV + + D + + P+ + +
Sbjct: 233 KDDA---CKFSSENVGVRVVESVNITLGAEDELKHAVAFVRPVSVAFEVVGSFRLYKEGV 289
Query: 117 RKNDVCPSE--NLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
C S ++NHAV+ VGYG+ + +P W+++NSWG WG D+GYF +E G N CGI
Sbjct: 290 YTTSTCGSTPMDVNHAVLAVGYGVENGIPYWLIKNSWGEDWG-DNGYFKMEMGKNMCGIA 348
Query: 174 S 174
+
Sbjct: 349 T 349
>gi|130502110|ref|NP_001076110.1| cathepsin K precursor [Oryctolagus cuniculus]
gi|1168794|sp|P43236.1|CATK_RABIT RecName: Full=Cathepsin K; AltName: Full=Protein OC-2; Flags:
Precursor
gi|454187|dbj|BAA03125.1| OC-2 protein [Oryctolagus cuniculus]
Length = 329
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 92/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ ++ G+++E YP+ Q+
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQD 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+N+NHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDENCSSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWG-NKGYILMARNKNNACGI 320
>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
Length = 379
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY +R
Sbjct: 199 VEGQWFLNQGTLLSLSEQELLDCDKIDKACMGGLPSSAYSAIKNLGGLETEDDYSYR--- 255
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C++ K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 256 GHMQACSFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 315
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R +P W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 316 RPLCSPWLIDHAVLLVGYGNRSDIPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 371
>gi|383852029|ref|XP_003701533.1| PREDICTED: cathepsin J-like [Megachile rotundata]
Length = 341
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 98/185 (52%), Gaps = 19/185 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
++ Q + G L+PLS+ QLI+C+ N GC GG ++YL+ A GL ++A YP++
Sbjct: 163 IQGQIFKRTGALIPLSEQQLIDCSTSTGNLGCSGGSLRNTLRYLEKAKGLMSQAYYPYKA 222
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA--LLQDY-NG 113
+ G RC + V V+ + V D + GP+ A +N + Q Y NG
Sbjct: 223 KQG---RCRFQEDLSVVNVTSWAVLPARDEKALEAAVATIGPIAASVNASPRTFQLYHNG 279
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+ +++C S+ +NHAV+IVGY WI++N WG WG ++GY + + N CG+
Sbjct: 280 --VYDDELCSSDMVNHAVLIVGYTPTE----WILKNWWGDGWG-ENGYMRLAKMKNRCGV 332
Query: 173 ESYGG 177
+Y
Sbjct: 333 ANYAA 337
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 86/180 (47%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 159 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D V GS L GP+ ++ + Y ++
Sbjct: 219 GNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 279 T---ACIGKQLNHGVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 334
>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
Length = 331
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 98/181 (54%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY---NQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG A QY+ + G++++A YP++
Sbjct: 148 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYK 207
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ +C YD++ S + L ++ D + + + GP+ G++ + + +
Sbjct: 208 ---AMDQKCQYDSKYRAATCSKYTELPYSREDVLKEAVANKGPVSVGVDASHPSFFLYRS 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
+ ++N+NH V++VGYG + W+V+NSWGR ++GY + R N CGI S
Sbjct: 265 GVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIAS 324
Query: 175 Y 175
+
Sbjct: 325 F 325
>gi|401416326|ref|XP_003872658.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|14348750|emb|CAC41275.1| CPB2 protein [Leishmania mexicana]
gi|322488882|emb|CBZ24132.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 359
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 87/180 (48%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 159 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D V GS L GP+ ++ + Y ++
Sbjct: 219 GNGYLPECSNSSELVVGAQIDSHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + +NHAV++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 279 T---ACIGKEVNHAVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 334
>gi|157864843|ref|XP_001681130.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124424|emb|CAJ02280.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L+ LS+ QL+ C+ + GC GG +A +++ + + E YP+ +
Sbjct: 159 IESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYTS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C+ + R+ ++ S+ L GP+ ++ + Y+ ++
Sbjct: 219 TFGYVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C E LNH V++VGY M +VP W+++NSWG+ WG + GY V G NAC + Y
Sbjct: 279 TS---CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGKDWG-EKGYVRVTMGVNACLLTGY 334
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 86/180 (47%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 249 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVS 308
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D V GS L GP+ ++ + Y ++
Sbjct: 309 GNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 368
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 369 ---TACIGKQLNHGVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 424
>gi|226476110|emb|CAX72145.1| cathepsin L, a [Schistosoma japonicum]
Length = 331
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 91/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC+GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCEGGFMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG-ALLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALDSLTMYKSGVF 265
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
N+ C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 266 ESNE-CKYGDINHGVLVVGYGKEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 323
>gi|29710|emb|CAA34734.1| unnamed protein product [Homo sapiens]
Length = 335
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 209 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|401758202|gb|AFQ01136.1| cathepsin O2-like protease [Chilo suppressalis]
Length = 368
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 92/177 (51%), Gaps = 9/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYL--KHAGLEAEADYPFRN 58
+ES AI G L LS ++I+C + NQGC GG + +L + +E E DYP +
Sbjct: 185 MESMAAINTGKLPALSVQEVIDCARLGNQGCSGGDICLLLDWLMITNTPVEVEKDYPLQL 244
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM---LYHYGPLVAGMNGALLQDYNGKL 115
NGV C VRV+ F + T +++ L +GP+ +N Q+Y G +
Sbjct: 245 TNGV---CKAKKNTTGVRVTSFTCDDFVGTEQKIIEALALHGPVAVAVNALTWQNYLGGV 301
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
I+ + + +LNHAV +VGY + VP +I +NSWG +GY + G+N CG+
Sbjct: 302 IQYHCSGDAMDLNHAVQLVGYDLTADVPYYIAKNSWGSDFGLNGYIHLAIGSNICGL 358
>gi|295321664|pdb|3H7D|A Chain A, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
gi|295321665|pdb|3H7D|E Chain E, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
Length = 215
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 88/177 (49%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY+ K+ G+++E YP+ Q
Sbjct: 34 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQ- 92
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 93 --EESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 150
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG WI++NSWG WG GY + R NACGI
Sbjct: 151 YDESCNSDNLNHAVLAVGYGESKGNKHWIIKNSWGENWGM-GGYIKMARNKNNACGI 206
>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
Length = 347
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/191 (34%), Positives = 95/191 (49%), Gaps = 23/191 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN----------IYNQGCQGGGFNKAIQY-LKHAGLEA 50
+E Q+AIK G L+ LS+ QL++C+ + GC GG A QY +K+ GL+
Sbjct: 155 VEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQYVIKNGGLDT 214
Query: 51 EADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQ 109
E YP+ GV C ++ V +S + + + L GP+ +N LQ
Sbjct: 215 EDSYPYE---GVDDTCRFNKSNVAATISSWTSISSDENQMAAWLAANGPISIAINAEWLQ 271
Query: 110 DYNGKLIRKNDVCPSENLNHAVVIVGYG-----MRHQVPVWIVRNSWGR-WGPDDGYFTV 163
Y I C ++L+H V+IVGYG + + WIV+NSWG WG +DGYF +
Sbjct: 272 YYTSG-ISDPWFCNPQDLDHGVLIVGYGVGKSWLGSEENYWIVKNSWGSDWG-EDGYFRI 329
Query: 164 ERGTNACGIES 174
RG CG+ S
Sbjct: 330 IRGKGKCGLNS 340
>gi|16506815|gb|AAL23962.1|AF426248_1 truncated cathepsin H [Homo sapiens]
Length = 323
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 138 LESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 196
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 197 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 249
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 250 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKN 308
Query: 169 ACGIES 174
CG+ +
Sbjct: 309 MCGLAA 314
>gi|229595080|ref|XP_001020177.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225566401|gb|EAR99932.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 405
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 96/180 (53%), Gaps = 15/180 (8%)
Query: 2 LESQYAIKHGTL-LPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAG-LEAEADYPFR 57
LES YA+K G + S+ QL++C +GC GG +K +YL +AG ++ EADYP+
Sbjct: 207 LESHYALKTGKKPIQFSEQQLVDCARKFDTKGCSGGLPSKGFEYLAYAGGIQNEADYPYE 266
Query: 58 NQNGVTGRCAYDARK--VKVRVSDFLVFNGSDTFRRMLYHYGPLVAG--MNGALLQDYNG 113
G C +++ K V+V+ S + F + L +YGP+ +N NG
Sbjct: 267 ---GEDKNCRFNSSKTVVQVQKSYNITFQDENELIYHLANYGPVTIAYQVNSDFDNYKNG 323
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
N E++NHAV+ VGY M + +I +NSWG WG + GYF +E G+N CG+
Sbjct: 324 VFTSSNCSKDPEDVNHAVLAVGYNMTGKY--FIAKNSWGNDWGMN-GYFYIELGSNMCGL 380
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 99/181 (54%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ K G ++ LS+ L++C+ N GC+GG + A +Y+K + G++ E YP+
Sbjct: 175 LEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPY-- 232
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTF-RRMLYHYGPLVAGMNGA--LLQDYNGK 114
NG G C ++ V + F+ + G++ ++ + GP+ ++ + Q Y+
Sbjct: 233 -NGTDGICHFEKSDVGATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQG 291
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIE 173
+ + + C SE+L+H V++VGYG + W+V+NSWG DDGY + R N CGI
Sbjct: 292 VYDEPE-CSSESLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDDGYIYMTRNKENQCGIA 350
Query: 174 S 174
S
Sbjct: 351 S 351
>gi|56756677|gb|AAW26511.1| unknown [Schistosoma japonicum]
Length = 331
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCGGGFMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG-ALLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALDSLTMYKSGVF 265
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 266 ESND-CKYGDINHGVLVVGYGKEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 323
>gi|440907378|gb|ELR57532.1| Cathepsin W [Bos grunniens mutus]
Length = 382
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 101/206 (49%), Gaps = 34/206 (16%)
Query: 2 LESQYAIKHGTLLPLSKS--------QLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEA 52
+E+ +AIK + +S +L++C+ GC+GG ++ + L ++GL +E
Sbjct: 160 IEALWAIKFRHFVEVSVQRMAGGRGWELLDCDRCGNGCRGGFVWDAFLTVLNNSGLASEK 219
Query: 53 DYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDY 111
DYPF + +G T RC K + DF++ + + R L GP+ +N LLQ Y
Sbjct: 220 DYPF-DGSGKTHRCLAKKYKKVAWIQDFIILQACEQSMARHLATEGPITVTINMTLLQQY 278
Query: 112 NGKLIRKN-DVCPSENLNHAVVIVGYGM--------------------RHQVPVWIVRNS 150
+I+ C ++H+V++VG+G R + W ++NS
Sbjct: 279 QKGVIKATPTTCDPTQVDHSVLLVGFGKTKSGEGRQGKAASFGSYARPRRSMAYWTLKNS 338
Query: 151 WG-RWGPDDGYFTVERGTNACGIESY 175
WG +WG ++GYF + RG+N CGI +
Sbjct: 339 WGPQWG-EEGYFRLHRGSNTCGITKF 363
>gi|58617840|gb|AAW80539.1| cathepsin L-like cysteine protease [Leishmania donovani]
Length = 225
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 90/179 (50%), Gaps = 10/179 (5%)
Query: 3 ESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRNQ 59
ESQ+A L+ LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 1 ESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGFVFTEKSYPYTSG 60
Query: 60 NGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
NG C ++ V R+ +++ ++T L GP+ ++ + Y ++
Sbjct: 61 NGDVAECLNSSKLVPGARIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLT 120
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 121 S---CAGDALNHGVLLVGYNKIGEVPYWVIKNSWGEDWG-EKGYVRVAMGRNACLLSEY 175
>gi|226476130|emb|CAX72155.1| cathepsin L, a [Schistosoma japonicum]
Length = 241
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC+GG + A YL+ +E+E DY +
Sbjct: 59 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCEGGYMDHAFNYLESHYIESENDYKYL-- 116
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG-ALLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 117 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALDSLIMYKSGVF 175
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 176 ESND-CKYGDINHGVLVVGYGNEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 233
>gi|114559412|ref|XP_001171151.1| PREDICTED: cathepsin K isoform 4 [Pan troglodytes]
gi|410221358|gb|JAA07898.1| cathepsin K [Pan troglodytes]
gi|410248298|gb|JAA12116.1| cathepsin K [Pan troglodytes]
gi|410301088|gb|JAA29144.1| cathepsin K [Pan troglodytes]
gi|410351445|gb|JAA42326.1| cathepsin K [Pan troglodytes]
Length = 329
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 91/177 (51%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A +Y+ K+ G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFEYVQKNRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + + +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSRGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 265 FDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGI 320
>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
Length = 440
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 87/174 (50%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ I L LS+ L+ C+ + GC GG + A +++ + E YP+ +
Sbjct: 159 IEGQWKIAGHELTSLSEQMLVSCDTNDFGCGGGFSDPAFKWIVSSNKGNVFTEQSYPYAS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C + V ++ D + + + L GP+ ++ Q Y G ++
Sbjct: 219 GGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKKGPVAIAVDATSFQSYTGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNAC 170
C SE+L+H V++VGY + P WI++NSWG+ WG ++GY +E+GTN C
Sbjct: 279 S---CISEHLDHGVLLVGYDDTSKPPYWIIKNSWGKGWG-EEGYIRIEKGTNQC 328
>gi|16506813|gb|AAL23961.1|AF426247_1 cathepsin H [Homo sapiens]
Length = 335
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 98/186 (52%), Gaps = 22/186 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LES AI G +L L++ QL++C + N GCQGG ++A +Y L + G+ E YP++
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIMGEDTYPYQ- 208
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G G C + K V D + + + Y P+ + QD+ ++
Sbjct: 209 --GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE--VTQDF---MM 261
Query: 117 RKNDVCPS-------ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ + S + +NHAV+ VGYG ++ +P WIV+NSWG +WG +GYF +ERG N
Sbjct: 262 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG-MNGYFLIERGKN 320
Query: 169 ACGIES 174
CG+ +
Sbjct: 321 MCGLAA 326
>gi|7770062|ref|NP_036137.1| cathepsin J precursor [Mus musculus]
gi|6467374|gb|AAF13142.1|AF136272_1 cathepsin J precursor [Mus musculus]
gi|15418834|gb|AAK58455.1| cathepsin J [Mus musculus]
gi|148709364|gb|EDL41310.1| cathepsin J, isoform CRA_b [Mus musculus]
Length = 333
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 101/185 (54%), Gaps = 17/185 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
+E Q K G L PLS L++C+ + N+GCQ G ++A +Y LK+ GLEAEA YP+
Sbjct: 146 IEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKNKGLEAEATYPYE- 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM-LYHYGPLVAGMNGA--LLQDYNGKL 115
G G C Y + ++D++ ++ + + + GP+ A ++ + + YNG +
Sbjct: 205 --GKDGPCRYRSENASANITDYVNLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGGI 262
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVP----VWIVRNSWG-RWGPDDGYFTVERG-TNA 169
+ + C S +NHAV++VGYG V W+++NSWG WG +GY + + N
Sbjct: 263 YYEPN-CSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGM-NGYMQIAKDHNNH 320
Query: 170 CGIES 174
CGI S
Sbjct: 321 CGIAS 325
>gi|84028184|sp|Q9R014.2|CATJ_MOUSE RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
protein; AltName: Full=Cathepsin P; AltName:
Full=Catlrp-p; Flags: Precursor
gi|5306071|gb|AAD41898.1|AF158182_1 preprocathepsin P [Mus musculus]
gi|12838143|dbj|BAB24099.1| unnamed protein product [Mus musculus]
gi|74199838|dbj|BAE20748.1| unnamed protein product [Mus musculus]
gi|74355544|gb|AAI03770.1| Cathepsin J [Mus musculus]
gi|148709363|gb|EDL41309.1| cathepsin J, isoform CRA_a [Mus musculus]
Length = 334
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 101/185 (54%), Gaps = 17/185 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
+E Q K G L PLS L++C+ + N+GCQ G ++A +Y LK+ GLEAEA YP+
Sbjct: 147 IEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKNKGLEAEATYPYE- 205
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRM-LYHYGPLVAGMNGA--LLQDYNGKL 115
G G C Y + ++D++ ++ + + + GP+ A ++ + + YNG +
Sbjct: 206 --GKDGPCRYRSENASANITDYVNLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGGI 263
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVP----VWIVRNSWG-RWGPDDGYFTVERG-TNA 169
+ + C S +NHAV++VGYG V W+++NSWG WG +GY + + N
Sbjct: 264 YYEPN-CSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGM-NGYMQIAKDHNNH 321
Query: 170 CGIES 174
CGI S
Sbjct: 322 CGIAS 326
>gi|195473621|ref|XP_002089091.1| GE26053 [Drosophila yakuba]
gi|194175192|gb|EDW88803.1| GE26053 [Drosophila yakuba]
Length = 338
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 94/177 (53%), Gaps = 17/177 (9%)
Query: 9 KHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAG-LEAEADYPFRNQNGVTGR 65
+ G +L LSK Q+++C++ NQGC GG ++YL+ G + E DYP+ + G+
Sbjct: 167 RTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLRYLQSTGGIMREEDYPYAAR---KGK 223
Query: 66 CAY--DARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKLIRKNDV 121
C + D V V L + + H GP+ +N + Q Y+ I + +
Sbjct: 224 CQFVPDLSVVNVTSWAILPVRDEQAIQAAVAHIGPVAISINASPKTFQLYSDG-IYDDPL 282
Query: 122 CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYGG 177
C S ++NHA+V++G+G + WI++N WG+ WG ++GY + +G N CG+ +Y
Sbjct: 283 CSSASVNHAMVVIGFGKDY----WILKNWWGQNWG-ENGYIRIRKGVNMCGMANYAA 334
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/182 (30%), Positives = 94/182 (51%), Gaps = 11/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI----YNQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE+Q +K G L+ LS L++C+ N GC GG +A QY+ + G++++A YP+
Sbjct: 163 LEAQLKLKTGKLVSLSAQNLVDCSTNEKYENHGCNGGCMTEAFQYIIDNNGIDSDASYPY 222
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ ++G +C Y+ S + L + D + + + GP+ G++ +L + K
Sbjct: 223 KAKDG---KCQYNPANRAATCSRYTELPYGSEDALKEAVANKGPVSVGIDASLPSFFLYK 279
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
D ++N+NH V++ GYG W+V+NSWG D GY + R N CGI
Sbjct: 280 SGVYYDPSCTQNVNHGVLVTGYGNLDGKDYWLVKNSWGLSFGDKGYIRIARNRGNHCGIA 339
Query: 174 SY 175
++
Sbjct: 340 NF 341
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 92/172 (53%), Gaps = 9/172 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+E AI G L+ LS+ +L++C+ N GC+GG + A ++ + + G++ EADYP+
Sbjct: 157 IEGVNAIVTGDLISLSEQELVDCDTTNDGCEGGYMDYAFEWVINNGGIDTEADYPYI--- 213
Query: 61 GVTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMNGALL--QDYNGKLIR 117
GV G C + KV D + SD+ P+ G++G+ L Q Y G +
Sbjct: 214 GVGGTCNVTKEETKVVTIDGYTDVTQSDSALFCATVKQPISVGIDGSTLDFQLYTGGIYD 273
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ ++++HAV+IVGYG WIV+NSWG WG +G+ + R TN
Sbjct: 274 GDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGI-EGFIYIRRNTN 324
>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
Length = 496
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/179 (30%), Positives = 92/179 (51%), Gaps = 12/179 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E +A+K G L+ LS+ +L++C+ +QGC GG N + ++ GL E +Y + +
Sbjct: 312 VEGVWAVKKGELVSLSEQELVDCDTLDQGCSGGYPSNAYKEIIRLGGLTTETNYSY---D 368
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G G C + + KV ++D + +T + GP+ G+N + Y +
Sbjct: 369 GNQGTCRFKTQNAKVYINDSVSLPEDETEIAAYIRENGPVAVGINAFAMMFYRHGIAHPW 428
Query: 119 NDVCPSENLNHAVVIVGYGMRHQV----PVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+C + L+H V IVGY + Q P WI++NSWG WG + GY+ + RG CG+
Sbjct: 429 RFLCSPDALDHGVAIVGYDVEKQSKKPKPYWIIKNSWGTHWG-EGGYYMLYRGAGVCGV 486
>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
Length = 354
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 96/179 (53%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA---GLEAEADYPFRN 58
+E Q+A +L+ LS+ L+ C+ ++GC GG ++A+ ++ + + EA YP+ +
Sbjct: 162 IEGQWAASGHSLVSLSEQMLVSCDNVDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTS 221
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C +D +V +++ FL + + + GP+ ++ Q Y G ++
Sbjct: 222 GGGTRPPC-HDEGEVGAKITGFLSLPHDEERIADWVEKRGPVAVAVDATTWQLYFGGVVS 280
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+C + +LNH V+IVG+ + P WIV+NSWG WG + GY + G+N C +++Y
Sbjct: 281 ---LCLAWSLNHGVLIVGFNKNAKPPYWIVKNSWGSSWG-EKGYIRLAMGSNQCMLKNY 335
>gi|357439999|ref|XP_003590277.1| Cysteine protease [Medicago truncatula]
gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula]
Length = 514
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 92/172 (53%), Gaps = 9/172 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+E AI G L+ LS+ +L++C+ N GC+GG + A ++ + + G++ EADYP+
Sbjct: 217 IEGVNAIVTGDLISLSEQELVDCDTTNDGCEGGYMDYAFEWVINNGGIDTEADYPYI--- 273
Query: 61 GVTGRCAYDARKVKVRVSD-FLVFNGSDTFRRMLYHYGPLVAGMNGALL--QDYNGKLIR 117
GV G C + KV D + SD+ P+ G++G+ L Q Y G +
Sbjct: 274 GVGGTCNVTKEETKVVTIDGYTDVTQSDSALFCATVKQPISVGIDGSTLDFQLYTGGIYD 333
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
+ ++++HAV+IVGYG WIV+NSWG WG +G+ + R TN
Sbjct: 334 GDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGI-EGFIYIRRNTN 384
>gi|28974202|gb|AAO61485.1| cathepsin H [Sterkiella histriomuscorum]
Length = 366
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 98/183 (53%), Gaps = 15/183 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
+ES Y +K+G LS+ QL++C + N GC GG + A +Y+K + GL E YP++
Sbjct: 168 VESHYLLKYGAFRNLSEQQLVDCAGDYDNHGCSGGLPSHAFEYIKDNGGLALETTYPYKA 227
Query: 59 QNGVTGRCAYDA--RKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAG---MNGALLQDYNG 113
NG +C+ + V +R + D ++ +Y +GP+ ++G +DY
Sbjct: 228 ANG---QCSIQKGQQSVGIRGGAVNISLNEDDLKQAIYLHGPVSVAFRVIDG--FRDYKS 282
Query: 114 KLIRKNDVCPSEN-LNHAVVIVGYGM-RHQVPVWIVRNSWGRWGPDDGYFTVERGTNACG 171
+ N +NHAV+ VG+G ++V WI++NSWG D G+F ++RG N CG
Sbjct: 283 GVYAVEGCANGPNDVNHAVLAVGFGTDENKVDYWIIKNSWGAAWGDQGFFKMKRGVNMCG 342
Query: 172 IES 174
I++
Sbjct: 343 IQN 345
>gi|19698257|dbj|BAB86771.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 94/182 (51%), Gaps = 12/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LESQ ++ G L LS+ QL++C+ N GC GG + A QY++ + G+++E+ YP++
Sbjct: 143 LESQTCLRRGYLPSLSEQQLVDCSGPYGNYGCNGGWPDHAFQYVQANGGIDSESYYPYQA 202
Query: 59 QNGVTGRCAYDARKVKVRVS---DFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G C Y++ S D + + + GPL ++ + Q Y +
Sbjct: 203 R---VGTCHYNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLSIAIDASGWQSYQSGV 259
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIES 174
ND S+ +HAV++VGYG + W+V+NSWG W + GY + R N CGI +
Sbjct: 260 F--NDPSCSQTADHAVLLVGYGTYNGQDYWLVKNSWGTWWGEQGYIMMARNANNQCGIAN 317
Query: 175 YG 176
+
Sbjct: 318 HA 319
>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 380
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 86/174 (49%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L LS+ L+ C+ + GC+GG + A +++ + E YP+ +
Sbjct: 159 IEGQWKVAGHELTSLSEQMLVSCDTNDFGCEGGLMDDAFKWIVSSNKGNVFTEQSYPYAS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C + V ++ D + + L GP+ ++ Q Y G ++
Sbjct: 219 GGGNVPACDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVAIAVDATSFQSYTGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNAC 170
C SE+L+H V++VGY + P WI++NSW + WG ++GY +E+GTN C
Sbjct: 279 S---CISEHLDHGVLLVGYDDTSKPPYWIIKNSWSKGWG-EEGYIRIEKGTNQC 328
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/182 (33%), Positives = 98/182 (53%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ K G L+ LS+ L++C + NQGC GG + QY+K + G++ E +P+
Sbjct: 150 LEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGGIDTEESHPYTA 209
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
Q+G C + V + F+ + GS D ++ + GP+ ++ + Q Y+
Sbjct: 210 QDG---DCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDASHGSFQLYSQG 266
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGT-NACGI 172
+ + D C S L+H V+ VGYG+++ W+V+NSW G WG D+GY + R N CGI
Sbjct: 267 VYDEPD-CSSSQLDHGVLTVGYGVKNGKKYWLVKNSWGGDWG-DNGYILMSRDKDNQCGI 324
Query: 173 ES 174
S
Sbjct: 325 AS 326
>gi|27681979|ref|XP_225125.1| PREDICTED: cathepsin 7-like [Rattus norvegicus]
gi|109505372|ref|XP_001065135.1| PREDICTED: cathepsin 7-like [Rattus norvegicus]
Length = 331
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 65/187 (34%), Positives = 97/187 (51%), Gaps = 17/187 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
+E Q K G L PLS L++C+ GC GG A QY+K+ GLEAEA YP+
Sbjct: 145 IEGQLFKKTGKLSPLSVQNLVDCSRSFGTMGCNGGRIYNAFQYVKNNGGLEAEATYPYEA 204
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNG--ALLQDYNGKL 115
+ G C Y K V+V+ FLV + L + GP+ G++ + Y G +
Sbjct: 205 KEG---NCRYRPEKSVVKVTRFLVVPRNEEALINALVNIGPIAVGIDAQHESFKKYAGGI 261
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVP----VWIVRNSWG-RWGPDDGYFTVERGTNA- 169
+ + C ++ NH++++VG+G Q W+V+NS+G +WG + GY + RG N
Sbjct: 262 YHEPN-CKRDSPNHSMLLVGFGYEGQESEGRKYWLVKNSYGEQWG-EKGYMKIPRGQNNY 319
Query: 170 CGIESYG 176
CGI SY
Sbjct: 320 CGIASYA 326
>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 337
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 103/184 (55%), Gaps = 15/184 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE+ +AIK G L+ LS+ QL++C N GC GG ++A +Y+K+ G+E+E++Y +
Sbjct: 151 LEAHHAIKTGQLISLSEQQLVDCAGAFNNHGCNGGLPSQAFEYIKYNGGIESESNYNYTA 210
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPL-VAGMNGALLQDYNGKL 115
++GV C +++ V VSD + + + + GP+ +A Q Y +
Sbjct: 211 KDGV---CRFNSSLVAATVSDVVNITKDAEGDIGTAVANVGPVSIAFEVTKSFQHYKKGV 267
Query: 116 IR-KNDVCPS--ENLNHAVVIVGYGM-RHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
+ + +VC + +NHAV++VGY + WIV+NSW WG D GYF + RG NAC
Sbjct: 268 YQGEIEVCSQSPDKVNHAVLVVGYNQTKLGEEYWIVKNSWSASWGMD-GYFWIRRGHNAC 326
Query: 171 GIES 174
G+ +
Sbjct: 327 GLAT 330
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 93/181 (51%), Gaps = 9/181 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
LE+Q +K G L+ LS L++C++ N+GC GG +A QY+ + G+++E YP+
Sbjct: 146 LEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMA 205
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
QNG C Y+ S ++ +D + + + GP+ ++ + +
Sbjct: 206 QNGT---CQYNVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDATQPTFFLYRSG 262
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIESY 175
+D ++ +NH V++VGYG ++ W+V+NSWG D GY + R N CGI SY
Sbjct: 263 VYDDPRCTQEVNHGVLVVGYGTLNEKDFWLVKNSWGERFGDGGYIRMSRNHANHCGIASY 322
Query: 176 G 176
Sbjct: 323 A 323
>gi|302763927|ref|XP_002965385.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
gi|300167618|gb|EFJ34223.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
Length = 353
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/182 (33%), Positives = 98/182 (53%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES +A G ++ LS+ QL++C N GC GG ++A +Y+++ GL+ E YP+
Sbjct: 165 LESAHAQATGKMVVLSEQQLVDCAGGYNNFGCSGGLPSQAFEYIRYNGGLDTEDSYPYTA 224
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYN--GK 114
+G +C Y+ + +V D V N ++ L H ++ A +L+D+
Sbjct: 225 HDG---KCMYNQNSIGAKVYD--VVNITEGAEDELIHAVAFNRPVSIAYEVLKDFRFYKS 279
Query: 115 LIRKNDVCPS--ENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
+ ++VC + + +NHAV+ VGY VP WI++NSWG DGYF +E G N CGI
Sbjct: 280 GVYTSNVCGTGPDTVNHAVLAVGYNRDAPVPYWIIKNSWGESFGLDGYFYMEMGKNMCGI 339
Query: 173 ES 174
+
Sbjct: 340 AT 341
>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
Length = 484
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 304 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 360
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 361 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 420
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 421 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 476
>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
Length = 473
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K GLE E DY + +
Sbjct: 293 VEGQWFLNRGTLLSLSEQELLDCDKVDKACMGGVPSNAYSAIKTLGGLETEEDYSY---H 349
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C++ A K KV ++D + + ++ L GP+ +N +Q Y +
Sbjct: 350 GHLQACSFSAEKAKVYINDSVELSQNEYKLAAWLAKNGPISVAINAFGMQFYRHGIAHPL 409
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV+IVGYG R VP W ++NSWG WG ++GY+ + RG+ ACG+ +
Sbjct: 410 RPLCSPWLIDHAVLIVGYGNRSDVPFWAIKNSWGTDWG-EEGYYYLHRGSGACGVNT 465
>gi|26245871|gb|AAN77411.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 200
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 98/181 (54%), Gaps = 16/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNK-AIQYLKHAGLEAEADYPFRN 58
LE Q AI + +PLS+ QL++C+ N C+ GG A Y+ G+EA++ YP++
Sbjct: 17 LEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDKGIEADSSYPYK- 75
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C YDA+K +++ + V + ++ + GP+ ++ +Q Y+G ++
Sbjct: 76 --GTDTPCQYDAKKTVLKIKGYKNVSISEEELKKAVGTVGPVSVAIDADPIQLYSGGIL- 132
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQV----PVWIVRNSWGR-WGPDDGYFTVER-GTNACG 171
+ + + NLNH V+ VGYG + W V+NSWG+ WG + GYF ++R N CG
Sbjct: 133 -DGLFCTHNLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDWG-EQGYFRIKRDANNLCG 190
Query: 172 I 172
I
Sbjct: 191 I 191
>gi|545734|gb|AAB30089.1| cysteine protease [Fasciola sp.]
gi|2662308|dbj|BAA23743.1| cathepsin L [Fasciola hepatica]
Length = 325
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 94/183 (51%), Gaps = 11/183 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNYGCMGGLMENAYEYLKQFGLETESSYPY--- 197
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
V G+C Y+ + +V+D+ V +GS+ + ++ GP ++ + Y+G I
Sbjct: 198 TAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYSGG-I 256
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
++ C S +NHAV+ VGYG + WIV+NSWG WG RG N CGI S
Sbjct: 257 YQSRTCSSLRVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERYIRMVRNRG-NMCGIASL 315
Query: 176 GGI 178
+
Sbjct: 316 ASL 318
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 98/181 (54%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K+G L+ LS+ L++C+ N GC+GG A +Y+K + G++ E YP++
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYK- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGK 114
V G C + V + ++ + GS+ ++ + GP+ ++ + Q Y+
Sbjct: 208 --AVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIE 173
+ + + C SE+L+H V++VGYG++ W+V+NSW D GY + R N CGI
Sbjct: 266 VYDEPE-CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIA 324
Query: 174 S 174
S
Sbjct: 325 S 325
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 94/181 (51%), Gaps = 10/181 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI---YNQGCQGGGFNKAIQYL-KHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 157 LEAQLKLKTGKLVSLSVQNLVDCSTGKYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYK 216
Query: 58 NQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKL 115
+ G+C YD + S + L F + + + + GP+ ++ + + +
Sbjct: 217 ---AMDGKCQYDVKNRAATCSKYVELPFGNEEALKEAVANKGPVSVAIDASHPSFFLYRS 273
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
D + N+NH V+ VGYG + W+V+NSWG + GY + R + N CGI S
Sbjct: 274 GVYYDKACTLNVNHGVLAVGYGNYNGKDYWLVKNSWGLHFGEQGYIRMARNSGNHCGIAS 333
Query: 175 Y 175
Y
Sbjct: 334 Y 334
>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 381
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 94/198 (47%), Gaps = 35/198 (17%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN---------IYNQGCQGGGFNKAIQYLKHAG-LEAE 51
LE + G L LS+ QL++C+ + GC GG A YL AG LE E
Sbjct: 180 LEGANYLATGKLEVLSEQQLVDCDHECDPSEPRACDAGCNGGLMTTAFSYLAKAGGLETE 239
Query: 52 ADYPFRNQNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGALLQD 110
DYP+ +N C +D K+ +V +F V D L +GPL G+N +Q
Sbjct: 240 KDYPYTGRNSA---CKFDKSKIAAQVKNFSTVAIDEDQIAANLVKHGPLAIGINAVFMQT 296
Query: 111 YNGKLIRKNDVCP---SENLNHAVVIVGYGMR-------HQVPVWIVRNSWGR-WGPDDG 159
Y G + CP +L+H V +VGYG + P WI++NSWG WG + G
Sbjct: 297 YIGGV-----SCPYICGRHLDH-VFLVGYGSAGYAPLRFKEKPYWIIKNSWGENWG-ESG 349
Query: 160 YFTVERG---TNACGIES 174
Y+ + RG N CG++S
Sbjct: 350 YYKICRGPHVKNKCGVDS 367
>gi|15128493|dbj|BAB62718.1| plerocercoid growth factor/cysteine protease [Spirometra
erinaceieuropaei]
gi|15130639|dbj|BAB62799.1| plerocercoid growth factor-2/cysteine protease [Spirometra
erinaceieuropaei]
Length = 336
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 90/181 (49%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E IK G L LS+ QL++C + NQGC GG +A QY + G+EAE DY + +
Sbjct: 154 IEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDYRYTER 213
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGA---LLQDYNGK 114
+GV C Y V V+ + D +R + GP+ G++ A + +G
Sbjct: 214 DGV---CRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYSHGV 270
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIE 173
+ K C ++H V++VGYG + W+V+NSWG + GY + R N CGI
Sbjct: 271 FVSK--TCSPYAIDHGVLVVGYGAENGEAYWLVKNSWGSSWGEGGYVKMARNRNNMCGIA 328
Query: 174 S 174
S
Sbjct: 329 S 329
>gi|348531585|ref|XP_003453289.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 366
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 96/183 (52%), Gaps = 14/183 (7%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFR 57
+LE Q+ K G L+ LS+ QL++C+ N GC GG +A QY++ + G++ EA YP+
Sbjct: 182 VLEGQHFRKTGKLVSLSEQQLMDCSHSFGNNGCNGGSVKRAFQYIQANGGIDTEASYPYE 241
Query: 58 NQNGVTGRCAYDARKVKVRVSDFLVFNGS--DTFRRMLYHYGPLVAGMNGA--LLQDYNG 113
+ +C Y + + + ++ S D + + GP+ G++ + + Y
Sbjct: 242 AKGQ---QCRYKPDGIGAKCTGYVEVKPSNEDALKEAVATIGPISVGIDASHNSFRFYQS 298
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACG 171
+ + D C LNH V+ VGYG + W+++NSWG RWG D GY + R +N CG
Sbjct: 299 GVYDEPD-CSKTVLNHDVLAVGYGTENGHDYWLIKNSWGIRWG-DKGYIKMSRNKSNQCG 356
Query: 172 IES 174
I S
Sbjct: 357 IAS 359
>gi|295971915|gb|ADG63164.1| cysteine protease F [Leishmania donovani]
Length = 240
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A L+ LS+ QL+ C+ + GC GG +A ++L + + E YP+ +
Sbjct: 19 IESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTS 78
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C ++ V ++ +++ ++T L GP+ ++ + Y ++
Sbjct: 79 GNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVL 138
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY +VP W+++NSWG WG + GY V G NAC + Y
Sbjct: 139 TS---CAGDALNHGVLLVGYNKTGEVPYWVIKNSWGEDWG-EKGYVRVAMGRNACLLSEY 194
>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
Length = 392
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 212 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 268
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 269 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 328
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 329 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 384
>gi|10798511|emb|CAC12806.1| cathepsin L1 [Fasciola hepatica]
Length = 311
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 91/183 (49%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+R
Sbjct: 126 MEGQYMKNEKTSISFSEQQLVDCSGPWGNNGCSGGLMENAYEYLKRFGLETESSYPYR-- 183
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
V G+C Y+ + +V+ + V +GS+ + ++ GP + I
Sbjct: 184 -AVEGQCRYNEQLGVAKVTGYYTVHSGSEVELKNLVGSEGPAAIAVEAESDFMMYRSGIY 242
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIESY 175
++ C LNHAV+ VGYG + WIV+NSWG WG + GY + R N CGI S
Sbjct: 243 QSQTCLPFALNHAVLAVGYGTQDGTDYWIVKNSWGLSWG-ERGYIRMARNRGNMCGIASL 301
Query: 176 GGI 178
+
Sbjct: 302 ASL 304
>gi|226476102|emb|CAX72141.1| cathepsin L, a [Schistosoma japonicum]
Length = 331
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCGGGFMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG-ALLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALDSLTMYKSGVF 265
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 266 ESND-CKYGDINHGVLVVGYGNEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 323
>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
Length = 338
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 158 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 214
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 215 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 274
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 275 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 330
>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 366
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 86/180 (47%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 159 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLYTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D V GS L GP+ ++ + Y ++
Sbjct: 219 GNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 279 T---ACIGKQLNHGVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 334
>gi|110739710|dbj|BAF01762.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
Length = 300
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 95/173 (54%), Gaps = 12/173 (6%)
Query: 8 IKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQNGVTGR 65
I G L+ LS+ +L++C+ YNQGC GG + A +++ K+ G++ EADYP++ +G +
Sbjct: 14 IVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQ 73
Query: 66 CAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVA-GMNGALLQDYNGKLIRKNDVCPS 124
+A+ V + + + N + ++ L H VA G Q Y+ + + +C +
Sbjct: 74 NRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVF--DGLCGT 131
Query: 125 ENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYF----TVERGTNACGI 172
E L+H VV VGYG + WIVRNSWG RWG + GY +E T CGI
Sbjct: 132 E-LDHGVVAVGYGTENGKGYWIVRNSWGNRWG-ESGYIKMARNIEAPTGKCGI 182
>gi|321472587|gb|EFX83556.1| hypothetical protein DAPPUDRAFT_47691 [Daphnia pulex]
Length = 211
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 86/177 (48%), Gaps = 5/177 (2%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
LE K+ + + + QL++C+ + GC GG + A Y K A A++
Sbjct: 30 LEFAACKKNKSSAAVPEQQLVDCDTVDGGCNGGFYTNAWDYHKKANGSAKS--SLYGYTA 87
Query: 62 VTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
V G C +++ V +V+ + + S + L +YGPL + Y + +
Sbjct: 88 VKGTCKFNSSMVGAKVASYTKVQVKNSTAMQTALVNYGPLAVAITVINSFQYYSSGVYND 147
Query: 120 DVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWGRWGPDDGYFTVERGTNACGIESY 175
C ++ +NHAV IVGYG+++ + W+VRNSWG GY + RGTN C IE Y
Sbjct: 148 VACNNQTINHAVTIVGYGVQNTTINYWVVRNSWGTGWGQKGYILMLRGTNLCHIEEY 204
>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
Length = 490
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 310 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 366
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 367 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 426
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 427 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 482
>gi|440297066|gb|ELP89796.1| cysteine proteinase ACP1 precursor, putative [Entamoeba invadens
IP1]
Length = 306
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/168 (33%), Positives = 84/168 (50%), Gaps = 10/168 (5%)
Query: 11 GTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQNGVTGRCAYD 69
G L S+ QLI+C+ + GC GG + + ++K+ G+ E YP++ +G C
Sbjct: 130 GKLYSYSEQQLIDCDTTDNGCSGGHPDNSFTFIKNNKGITLETSYPYKAADGT---CNTA 186
Query: 70 ARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNG--ALLQDYNGKLIRKNDVCPSEN 126
+ V V +GS+T + + YGP+ GM+ A Q Y I + C
Sbjct: 187 VKNVATVAGHKRVTDGSETGLQEITATYGPVAVGMDASRASFQLYKKGTIYNDANCKRIV 246
Query: 127 LNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERG-TNACGI 172
++H V +VGYG WI+RNSWG WG D+GYF + R N CGI
Sbjct: 247 MDHCVTLVGYGKNTDGEYWIIRNSWGTSWG-DEGYFLLARNQNNRCGI 293
>gi|410910990|ref|XP_003968973.1| PREDICTED: cathepsin K-like [Takifugu rubripes]
Length = 329
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/183 (32%), Positives = 94/183 (51%), Gaps = 12/183 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
LE Q K G L+PLS L++C+ N GC+GG +K+ Y+ ++ G+++E+ YP+ +
Sbjct: 146 LEGQMKRKTGFLVPLSPQNLLDCSTSDGNLGCRGGYISKSYSYIIRNGGVDSESFYPYEH 205
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGAL--LQDYNGK 114
Q G +C Y + S F + D T + + GP+ +N L Y G
Sbjct: 206 QKG---KCRYSVKGKAGYCSRFHILPQGDEETLKATVARVGPVAVAVNAMLASFHLYRGG 262
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERG-TNACGIE 173
L + C + +NHAV++VGYG W+V+NSWG ++GY + R N CGI
Sbjct: 263 LYNVPN-CNPKFINHAVLVVGYGSSEGQDFWLVKNSWGSAWGEEGYIRLARNKKNLCGIA 321
Query: 174 SYG 176
S+
Sbjct: 322 SFA 324
>gi|260830531|ref|XP_002610214.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
gi|229295578|gb|EEN66224.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
Length = 274
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/193 (32%), Positives = 100/193 (51%), Gaps = 28/193 (14%)
Query: 2 LESQYAIKHGTLLPLSK---SQLIECNIYNQGCQGGGFNKAIQYLKHA-----GLEAEAD 53
+E Q+AIK G L LS+ S++ C+I N ++ K + GLE+E
Sbjct: 97 IEGQWAIKKGNLPDLSEQHTSKIESCHI----------NPIVKRTKRSIDGKSGLESEKA 146
Query: 54 YPFRNQNGVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYN 112
YP+ ++ +C D KV+V + S + + L GP+ G+N +Q Y
Sbjct: 147 YPYEAKDE---QCHMDYSKVQVYINSSVNISKDENDMASWLAENGPISIGINAFPMQFYM 203
Query: 113 GKLIRKNDV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNAC 170
G + + C E L+H V+IVGYG + + P WI++NSWG+ WG ++GY+ V RG C
Sbjct: 204 GGISHPWRIFCNPEELDHGVLIVGYGTKDETPYWIIKNSWGKNWG-EEGYYLVYRGGGVC 262
Query: 171 GIESYGGICTRTL 183
G+ + +CT ++
Sbjct: 263 GLNT---MCTSSV 272
>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 503
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 86/180 (47%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 219 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSYPYVS 278
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D V GS L GP+ ++ + Y ++
Sbjct: 279 GNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 338
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 339 ---TACIGKQLNHGVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 394
>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
Length = 489
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 98/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY +R
Sbjct: 309 VEGQWFLNQGTLLSLSEQELLDCDKIDKACMGGLPSSAYSAIKNLGGLETEDDYSYR--- 365
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 366 GHMQACNFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 425
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 426 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 481
>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
Length = 517
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 337 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 393
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 394 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 453
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 454 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 509
>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 88/179 (49%), Gaps = 9/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ I L LS+ L+ C+ + GC+ G + A +++ + E YP+ +
Sbjct: 159 IEGQWKIAGHELTSLSEQMLVSCDTNDLGCRAGFMDTAFKWIVSPNDGNVFTEQSYPYAS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C + V + D + + + + L GP+ ++ Q Y G ++
Sbjct: 219 GGGNVPACNKSGKVVGANIDDHVHILDNENAIAEWLAKNGPVAIAVDATSFQRYTGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C S+ +N A ++VGY + P WI++NSWG+ WG ++GY +E+GTN C ++ Y
Sbjct: 279 S---CISKEVNSAALLVGYDDTSKPPYWIIKNSWGKGWG-EEGYIRIEKGTNQCRMKDY 333
>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
Length = 302
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 122 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 178
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D + + ++ L GP+ +N +Q Y + R
Sbjct: 179 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 238
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIES 174
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 239 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNT 294
>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 88/179 (49%), Gaps = 9/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ I L LS+ L+ C+ + GC+ G + A +++ + + E YP+ +
Sbjct: 159 IEGQWKIAGHELTSLSEQMLVSCDTNDLGCRAGFMDTAFKWIVSSNNGNVFTEQSYPYAS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C + V + D + + + + L GP+ ++ Q Y G ++
Sbjct: 219 GGGNVPTCNKSGKVVGANIDDHVHILDNENAIAEWLAKKGPVAIAVDATSFQSYTGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C S+ +N A ++VGY + P WI++NSW + WG ++GY +E+GTN C ++ Y
Sbjct: 279 S---CISKEVNSAALLVGYDDTSKPPYWIIKNSWSKGWG-EEGYIRIEKGTNQCRMKEY 333
>gi|394331820|gb|AFN27129.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 90/180 (50%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+A+ L LS+ QL+ C+ + GC+GG +A ++L + + E YP+ +
Sbjct: 159 IESQWALAGHRLTALSEQQLVSCDDKDNGCRGGLMLQAFEWLLRNMNGTMFTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVK-VRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
G C+ ++ V R+ ++ S+T L GP+ ++ + Y ++
Sbjct: 219 STGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C LNH V++V Y +VP W+++NSWG WG ++GY V G NAC + Y
Sbjct: 279 TS---CAGMPLNHGVLLVWYNRTGEVPYWVIKNSWGENWG-ENGYVRVTMGVNACLLTEY 334
>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
Length = 548
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 100/181 (55%), Gaps = 8/181 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 368 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 424
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C + A K KV ++D +V + ++ L GP+ +N +Q Y + R
Sbjct: 425 GHMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 484
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYGG 177
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + G+ ACG+ +
Sbjct: 485 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWG-EKGYYYLHCGSEACGVNTMAS 543
Query: 178 I 178
+
Sbjct: 544 L 544
>gi|24583376|ref|NP_609387.1| CG5367 [Drosophila melanogaster]
gi|22946140|gb|AAF52922.2| CG5367 [Drosophila melanogaster]
Length = 338
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 93/177 (52%), Gaps = 17/177 (9%)
Query: 9 KHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAG-LEAEADYPFRNQNGVTGR 65
+ G +L LSK Q+++C++ NQGC GG + YL+ G + + DYP+ + G +
Sbjct: 167 RTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMRDQDYPYVARKG---K 223
Query: 66 CAY--DARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKLIRKNDV 121
C + D V V L + + H GP+ +N + Q Y+ I + +
Sbjct: 224 CQFVPDLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDG-IYDDPL 282
Query: 122 CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYGG 177
C S ++NHA+V++G+G + WI++N WG+ WG ++GY + +G N CGI +Y
Sbjct: 283 CSSASVNHAMVVIGFGKDY----WILKNWWGQNWG-ENGYIRIRKGVNMCGIANYAA 334
>gi|56756955|gb|AAW26649.1| unknown [Schistosoma japonicum]
Length = 331
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 10/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GC GG + A YL+ +E+E DY +
Sbjct: 149 IEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCGGGFMDHAFNYLESHYIESENDYKYL-- 206
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG-ALLQDYNGKLI 116
G C Y K V+V F+ D T ++ +Y YGP+ G+ L Y +
Sbjct: 207 -GYDANCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQYGPISVGIVALDSLIMYKSGVF 265
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIES 174
ND C ++NH V++VGYG H W+++NSWG GYF + R N CG+ S
Sbjct: 266 ESND-CKYGDINHGVLVVGYGKEHGKDYWLIKNSWGDLWGSKGYFKLRRNKHNMCGVAS 323
>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 332
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/175 (32%), Positives = 84/175 (48%), Gaps = 10/175 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ N GC GG +A +L + L E YP+ +
Sbjct: 159 IEGQWYLAGHELVSLSEQQLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLHTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D V GS L GP+ ++ + Y ++
Sbjct: 219 GNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
C + LNH V++VGY M +VP W+++NSW G WG + GY V G NAC
Sbjct: 279 T---ACIGKQLNHGVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNAC 329
>gi|348542778|ref|XP_003458861.1| PREDICTED: digestive cysteine proteinase 3-like [Oreochromis
niloticus]
Length = 218
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 96/181 (53%), Gaps = 13/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ K G L+ LS+ QL++C N +N GC GG A +Y+K + G++ E Y +
Sbjct: 36 LEGQHFKKTGNLVSLSEQQLVDCSRNFFNHGCDGGWMIPAFKYIKDNGGIQTEESYTYEA 95
Query: 59 QNGVTGRCAYDARKVKVRVSDF-LVFNGSDTFRRMLYHYGPLVAGMNGA--LLQDYNGKL 115
++G RC Y+A V + S + V + ++ + GP+ ++ + Q Y +
Sbjct: 96 RDG---RCHYNANFVGAQCSGYGTVKQDEEALKQAVAAIGPISIAVDASHESFQLYQSGV 152
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIE 173
+ C + NLNHAV+ VGYG + W+V+NSWG WG + GY + R N CGI
Sbjct: 153 YDE-PWCSNINLNHAVLAVGYGTENGHDYWLVKNSWGSEWG-NKGYIKMTRNKDNQCGIA 210
Query: 174 S 174
+
Sbjct: 211 T 211
>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 452
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 88/174 (50%), Gaps = 9/174 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL--KHAG-LEAEADYPFRN 58
+E Q+A L LS+ L+ C+ + GC GG + A +++ +++G + E YP+ +
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDSKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVS 211
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C +V ++ + + + D + L GP+ ++ Y+G ++
Sbjct: 212 GGGEEPPCKPRGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNAC 170
C SE LNH V++VGY + P WI++NSW WG + GY +E+GTN C
Sbjct: 272 S---CTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWG-EKGYIRIEKGTNQC 321
>gi|348586441|ref|XP_003478977.1| PREDICTED: cathepsin K-like [Cavia porcellus]
Length = 329
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 90/177 (50%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY++ + G+++E YP+ Q
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQENRGIDSEDAYPYVGQE 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREIPVGNEKALKRAVARVGPVSVAIDASLSSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C E+LNHA++ VGYGM+ WI++NSWG WG + GY + R NACGI
Sbjct: 265 YDESCNGEDLNHALLAVGYGMQRGNKHWILKNSWGENWG-NKGYVLLARNKNNACGI 320
>gi|346469497|gb|AEO34593.1| hypothetical protein [Amblyomma maculatum]
Length = 557
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 92/183 (50%), Gaps = 8/183 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADY-PFRN 58
LE Y K G L+ LS+ QL++C N N GC GG +A +Y++ GL ++ DY +
Sbjct: 374 LEGAYFRKTGKLVRLSEQQLVDCSWNSGNNGCDGGEDFRAYEYIRKHGLASDEDYGAYIG 433
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ---DYNGKL 115
Q+GV +A ++ ++ D L + GP+ ++ AL NG
Sbjct: 434 QDGVCHDTKVNATISTIK--SYINITNRDDLLTALANVGPVSVSIDAALRSFSFYSNGVF 491
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIESY 175
+++L+HAV+ VGYG + P W+++NSW + +DGY + + N CG+ +
Sbjct: 492 YDPKCRNDTDSLDHAVLAVGYGTLQEQPYWLIKNSWSTYWGNDGYVLISQKDNNCGVATQ 551
Query: 176 GGI 178
G I
Sbjct: 552 GTI 554
>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
Length = 475
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 88/176 (50%), Gaps = 6/176 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E Q+ +K G L+ LS+ +L++C+ +Q C GG + A + + K G+E E DY +
Sbjct: 295 IEGQWFVKTGKLVSLSEQELVDCDTADQACGGGLPSNAYEAIEKLGGVETETDYSY---T 351
Query: 61 GVTGRCAYDARKVKVRV-SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G C + KV + S + + L GP+ +N +Q Y +
Sbjct: 352 GKKQSCDFTTDKVTAYINSSVELSKDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPL 411
Query: 120 DV-CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+ C ++HAV++VGYG R P W ++NSWG + GY+ + RG+ CGI +
Sbjct: 412 KIFCNPWMIDHAVLLVGYGERQGKPFWAIKNSWGEDYGEQGYYYLYRGSRLCGINT 467
>gi|121531600|gb|ABM55485.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 326
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 94/181 (51%), Gaps = 16/181 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGC-QGGGFNKAIQYLKHAGLEAEADYPF-R 57
LE Q AI + + LS+ QL++C+ N C +GG + A +Y++ G+++E YP+ R
Sbjct: 143 LEGQNAILNNVKISLSEQQLLDCSAAYGNGNCKEGGDMSAAFEYVRDYGIQSEKSYPYIR 202
Query: 58 NQNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
Q C YDA K +++ + V + R+ + GP+ MN LQ Y +I
Sbjct: 203 KQT----ECQYDASKTILKIKGYKNVTTSEEGLRKAVGAIGPISIAMNSDPLQLYYSGII 258
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQ----VPVWIVRNSWGRWGPDDGYFTVER-GTNACG 171
S +L+H V++VGYG Q W V+NSWG+ ++GYF ++R N CG
Sbjct: 259 SGKGC--SHDLDHGVLVVGYGKASQWSGETKFWRVKNSWGKIWGENGYFRIKRDANNLCG 316
Query: 172 I 172
I
Sbjct: 317 I 317
>gi|297688135|ref|XP_002821545.1| PREDICTED: cathepsin W [Pongo abelii]
Length = 376
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/198 (29%), Positives = 96/198 (48%), Gaps = 26/198 (13%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFRNQN 60
+E+ + I + +S +L++C GC GG ++ I L ++GL +E DYPF+ +
Sbjct: 162 IETLWRINFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKV 221
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
RC + + DF++ N + L YGP+ +N LLQ Y +I+
Sbjct: 222 RAH-RCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKLLQLYRKGVIKAT 280
Query: 120 -DVCPSENLNHAVVIVGYG--------------------MRHQVPVWIVRNSWG-RWGPD 157
C + ++H+V++VG+G H P WI++NSWG +WG +
Sbjct: 281 PTTCDPQLVDHSVLLVGFGNVKSEEGIWAETVLSQSQPQPPHPTPYWILKNSWGAQWG-E 339
Query: 158 DGYFTVERGTNACGIESY 175
GYF + RG+N CGI +
Sbjct: 340 KGYFRLHRGSNTCGITKF 357
>gi|195339771|ref|XP_002036490.1| GM11735 [Drosophila sechellia]
gi|194130370|gb|EDW52413.1| GM11735 [Drosophila sechellia]
Length = 338
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 95/177 (53%), Gaps = 17/177 (9%)
Query: 9 KHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAG-LEAEADYPFRNQNGVTGR 65
+ G +L LSK Q+++C++ NQGC GG + YL+ G + + DYP+ + G +
Sbjct: 167 RTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLTYLQSTGGIMRDQDYPYVARKG---K 223
Query: 66 CAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGA--LLQDYNGKLIRKNDV 121
C + A V VS + + D + + H GP+ +N + Q Y+ I + +
Sbjct: 224 CQFVADLSVVNVSSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDG-IYDDPL 282
Query: 122 CPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYGG 177
C S ++NHA+V++G+ + WI++N WG+ WG ++GY V +G N CG+ +Y
Sbjct: 283 CSSASVNHAMVVIGFAKDY----WILKNWWGQNWG-ENGYIRVRKGVNMCGLANYAA 334
>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
Length = 340
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 92/181 (50%), Gaps = 11/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY----NQGCQGGGFNKAIQYL-KHAGLEAEADYPF 56
LE Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+EA+A YP+
Sbjct: 156 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 215
Query: 57 RNQNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGK 114
+ + +C Y+++ S + L F D + + GP+ G++ + + K
Sbjct: 216 K---AMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYK 272
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+D + N+NH V++VGYG W+V+NSWG D GY + R N CGI
Sbjct: 273 SGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 174 S 174
S
Sbjct: 333 S 333
>gi|7327279|gb|AAB26209.2| cysteine proteinase precursor [Entamoeba histolytica]
Length = 284
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 87/177 (49%), Gaps = 10/177 (5%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQN 60
+LE + G L S+ QL++C+ + GC+ G N ++ GL E+DYP++
Sbjct: 100 VLEGRVNKDLGKLYSFSEQQLVDCDASDNGCERGPSNSLKFIQENNGLGLESDYPYK--- 156
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGKLIR 117
V G C + V V +GS+T + ++ GP+ GM+ + Q Y I
Sbjct: 157 AVAGTCK-KVKNVATVTGSRRVTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTIY 215
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGI 172
+ C S +NH V VGYG WI+RNSWG WG D GYF + R + N CGI
Sbjct: 216 SDTKCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTSWG-DAGYFLLARDSNNMCGI 271
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 93/181 (51%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LE ++ +K+G L+ LS+ L++C+ N GC+GG A +Y+K G++ E YP+
Sbjct: 149 LEGRHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKENDGIDTEKSYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNG--SDTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
V G C + V + ++ D ++ + GP+ ++ + Q Y+
Sbjct: 208 --AVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIE 173
+ + + C SE+L+H V++VGYG++ W+V+NSW D GY + R N CGI
Sbjct: 266 VYDEPE-CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIA 324
Query: 174 S 174
S
Sbjct: 325 S 325
>gi|113931178|ref|NP_001039033.1| cathepsin W [Xenopus (Silurana) tropicalis]
gi|89269052|emb|CAJ83515.1| cathepsin W [Xenopus (Silurana) tropicalis]
Length = 303
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 94/184 (51%), Gaps = 14/184 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGG-GFNKAIQYLKHAGLEAEADYPFRNQN 60
+E+Q+AI G + LS+ Q+I+CN GC GG ++ + L+ GL +E YP+
Sbjct: 112 IEAQWAIL-GQTISLSEQQVIDCNTCRNGCSGGYAWDAFMTVLQQGGLTSEKSYPY---T 167
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLI--- 116
G C V + DF + ++T + H G L +N A L+ Y ++
Sbjct: 168 GHVSNCRKGFEAVGW-IHDFEMLKKNETAMASHVAHKGTLTVTINKAPLKHYQKGIVDTL 226
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
R N C ++H V+IVGY ++P WI++NSWG WG + G+F + R NACGI Y
Sbjct: 227 RSN--CDPNYVDHVVLIVGYRGGGKLPQWILKNSWGEDWG-EKGFFRMFRDKNACGITKY 283
Query: 176 GGIC 179
C
Sbjct: 284 PVTC 287
>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 524
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 91/180 (50%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG---LEAEADYPFRN 58
+E Q+ I L LS+ L+ C+ C GG ++A +++ + + E YP+ +
Sbjct: 238 IEGQWKIAGHELTSLSEQMLVSCDTTEDNCGGGFADRAFKWIVSSNKGNVFTERSYPYAS 297
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
+G C + V ++S + + + L GP+ ++ + DY G ++
Sbjct: 298 IDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVAIAVDASTFLDYKGGVLT 357
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESYG 176
C S+++NH V++VGY + P WI++NSW + WG ++GY +E+GTN C ++ Y
Sbjct: 358 S---CSSKHVNHEVLLVGYNDTSKPPYWIIKNSWDKEWG-EEGYIRIEKGTNLCLMKEYA 413
>gi|407401839|gb|EKF28997.1| cysteine peptidase, putative, partial [Trypanosoma cruzi
marinkellei]
Length = 281
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 85/157 (54%), Gaps = 9/157 (5%)
Query: 21 LIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRNQNGVTGRCAYDARKVKVRV 77
L+ C+ + GC GG KA +++ + + E YP+R+ G+T C RKV +
Sbjct: 2 LVSCDKTDNGCSGGWPLKAFRWIVQENNGAVYTEKSYPYRSCFGITPPCIKFRRKVGATI 61
Query: 78 SDFLVFNGSDT-FRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGY 136
+D++ ++T +L YGPL A ++ L Y G ++ C ++ HAV++VGY
Sbjct: 62 TDYVTLPENETKIATVLAAYGPLSAVIDLTSLIFYTGGVLTN---CVADKSIHAVLLVGY 118
Query: 137 GMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
VP W ++NSWG RWG ++GY + +G+N C +
Sbjct: 119 NDSAAVPYWTIKNSWGKRWG-EEGYIRIAKGSNQCRV 154
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 97/181 (53%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K+G L+ LS+ L++C+ N GC+GG A +Y+K + G++ E YP+
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGK 114
V G C + V + ++ + GS+ ++ + GP+ ++ + Q Y+
Sbjct: 208 --AVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIE 173
+ + + C SE+L+H V++VGYG++ W+V+NSW D GY + R N CGI
Sbjct: 266 VYDEPE-CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIA 324
Query: 174 S 174
S
Sbjct: 325 S 325
>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 88/179 (49%), Gaps = 9/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ I L LS+ L+ C+ + GC+ G + A +++ + E YP+ +
Sbjct: 159 IEGQWKIAGHELTSLSEQMLVSCDTNDLGCRAGFMDTAFKWIVSPNDGNVFTEQSYPYAS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G C + V + D + + + + L GP+ ++ Q Y G ++
Sbjct: 219 GGGNVPACNKSGKVVGANIRDHVHILDNENAIAEWLAKNGPVAIAVDATSFQRYTGGVLT 278
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
C S+ +N A ++VGY + P WI++NSWG+ WG ++GY +E+GTN C ++ Y
Sbjct: 279 S---CISKEVNSAALLVGYDDTSKPPYWIIKNSWGKGWG-EEGYIRIEKGTNQCRMKDY 333
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 97/181 (53%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K+G L+ LS+ L++C+ N GC+GG A +Y+K + G++ E YP+
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGK 114
V G C + V + ++ + GS+ ++ + GP+ ++ + Q Y+
Sbjct: 208 --AVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIE 173
+ + + C SE+L+H V++VGYG++ W+V+NSW D GY + R N CGI
Sbjct: 266 VYDEPE-CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIA 324
Query: 174 S 174
S
Sbjct: 325 S 325
>gi|307195722|gb|EFN77562.1| Cathepsin K [Harpegnathos saltator]
Length = 345
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 96/184 (52%), Gaps = 17/184 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHA-GLEAEADYPFRN 58
++ Q + G L+PLS QL++C+ N+GC GG ++YL+ + GL A+ +YP++
Sbjct: 167 IQGQIFKQTGMLIPLSAQQLVDCSTATGNRGCAGGSLRNTLRYLERSKGLMAKTEYPYKA 226
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNG--ALLQDYNGK 114
Q+G +C + V ++ + + D + GP+ +N Q Y+ K
Sbjct: 227 QDG---QCKFHRDLSVVNITSWAILPARDETALEAAVASVGPIAVSINAMPKTFQLYH-K 282
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIE 173
+ + +C S+ +NHA++IVGY WI++N WG WG ++GY + + N CGI
Sbjct: 283 GVYDDHLCSSDTVNHAMLIVGYTPTE----WILKNWWGENWG-ENGYMRLAKNKNRCGIA 337
Query: 174 SYGG 177
+Y
Sbjct: 338 NYAA 341
>gi|377823949|gb|AFB77219.1| cathepsin L1 [Fasciola gigantica]
Length = 326
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 93/183 (50%), Gaps = 10/183 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C+ N GC GG A +YLK GLE E+ YP+
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNYGCMGGLMENAYEYLKQFGLETESSYPY--- 197
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
V G+C Y+ + +V+D+ V +GS+ + ++ GP ++ + Y G I
Sbjct: 198 TAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYRGG-I 256
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESY 175
++ C +NHAV+ VGYG + WIV+NSWG + GY + R N CGI S
Sbjct: 257 YQSQTCSPLGVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASL 316
Query: 176 GGI 178
+
Sbjct: 317 ASL 319
>gi|6959769|gb|AAF33213.1|AF217445_1 cysteine protease Cys1b, partial [Babesia equi]
Length = 273
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 88/181 (48%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+ES Y IK G L LS+ +L+ C + GC+GG NKA++Y+K G+ D P+ N
Sbjct: 89 VESLYLIKKGQALDLSEQELVNCEENSNGCEGGLPNKALEYIKAKGISHSKDLPYHAAN- 147
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDV 121
C + KV + F G D + L +VA Y G +
Sbjct: 148 --EECVVSSSD-KVFIHSFFANTGLDILNKSLVVSPTIVAIAASKEFTAYKGGIFTGE-- 202
Query: 122 CPSENLNHAVVIVGYGMRHQV--PVWIVRNSWGR-WGPDDGYFTVER---GTNACGIESY 175
C E LNHAV++VG G WIV+NSWG WG ++G+F +ER G++ C I +
Sbjct: 203 CAPE-LNHAVLLVGEGHDEATGKRFWIVKNSWGTDWG-ENGFFRLERTDEGSDKCDILEF 260
Query: 176 G 176
G
Sbjct: 261 G 261
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 97/181 (53%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K+G L+ LS+ L++C+ N GC+GG A +Y+K + G++ E YP+
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGK 114
V G C + V + ++ + GS+ ++ + GP+ ++ + Q Y+
Sbjct: 208 --AVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIE 173
+ + + C SE+L+H V++VGYG++ W+V+NSW D GY + R N CGI
Sbjct: 266 VYDEPE-CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIA 324
Query: 174 S 174
S
Sbjct: 325 S 325
>gi|291224892|ref|XP_002732436.1| PREDICTED: cathepsin H-like [Saccoglossus kowalevskii]
Length = 302
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 92/180 (51%), Gaps = 13/180 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
LES AI TL+ LS+ QLI+C N GC GG +A +Y+ + GL A+ DY ++
Sbjct: 116 LESATAIAKSTLISLSEQQLIDCAQAFNNHGCNGGLPAQAFEYIHYNDGLMADIDYQYKA 175
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSDT--FRRMLYHYGPLVAGMNGALLQDYNGKLI 116
++G +C YD K VS + D +Y +GP+ + A +
Sbjct: 176 KDG---KCKYDPSKAAAFVSKIVNITKGDEDGILNAVYKHGPVSIAYDVASDFHLYHSGV 232
Query: 117 RKNDVCP--SENLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+ VC E++NHAV+ G+ + + W+V+NSWG WG D GYF +ER N CG+
Sbjct: 233 YSSTVCKIDPEHVNHAVLATGFNETAEGLKYWMVKNSWGPDWGLD-GYFWIERNKNMCGL 291
>gi|323451555|gb|EGB07432.1| hypothetical protein AURANDRAFT_2413 [Aureococcus anophagefferens]
Length = 263
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 87/176 (49%), Gaps = 8/176 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
LE + I TL LS+ L++C+ + GC GG + A ++++ + G+ +EADY +
Sbjct: 88 LEGAFEIAGNTLTSLSEQNLVDCDTTDSGCNGGLMDNAFKWIQSNGGICSEADYAY---T 144
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNG--ALLQDYNGKLIRK 118
G C KV V +G + + GP+ + ++ Q Y+ ++
Sbjct: 145 AAKGTCKTTCDKVATLSGHTDVPSGDEDALKTAVAIGPVSIAIEADKSVFQSYSSGILDS 204
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGIES 174
+ C + NL+H V++VGYG W V+NSWG + GY + RG+N CGI S
Sbjct: 205 S-ACGT-NLDHGVLVVGYGTDDGSEYWKVKNSWGTTWGESGYVRIARGSNICGIAS 258
>gi|10798513|emb|CAC12807.1| procathepsin L3 [Fasciola hepatica]
Length = 306
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 84/182 (46%), Gaps = 8/182 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY K S+ QL++C N GC GG A +YLK++GLE +DYP++
Sbjct: 121 IEGQYVKKFQNQTLFSEQQLVDCTRRFGNHGCGGGWMENAYKYLKNSGLETASDYPYQ-- 178
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFR--RMLYHYGPLVAGMNGALLQDYNGKLIR 117
G +C Y +V+ + D + M+ GP A ++ I
Sbjct: 179 -GWEYQCQYRKELGVAKVTGAYTVHSGDEMKLMPMVRKKGPAAAAVDAQPDFYMYESGIF 237
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESYG 176
++ C S + HAV+ VG+G WI++NSWG+W +DGY R N C I S
Sbjct: 238 QSQYCSSRRVTHAVLAVGHGTESGTDYWILKNSWGKWWGEDGYMRFARNRGNMCAIASVA 297
Query: 177 GI 178
+
Sbjct: 298 SV 299
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 97/181 (53%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K G L+ LS+ LI+C+ N+GC GG + A +Y+K + G++ E YP+
Sbjct: 129 LEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEESYPYE- 187
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMNGA--LLQDYNGK 114
+ G C + V + F+ + GS D ++ + GP+ ++ + Q Y+
Sbjct: 188 --AMDGDCRFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYSEG 245
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIE 173
+ + + C SE L+H V+ VGYG+++ W+V+NSW D+GY + R N CGI
Sbjct: 246 VYDEPN-CSSEELDHGVLAVGYGVKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQCGIA 304
Query: 174 S 174
S
Sbjct: 305 S 305
>gi|226470468|emb|CAX70514.1| Cathepsin L-like proteinase precursor [Schistosoma japonicum]
Length = 249
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 66/182 (36%), Positives = 93/182 (51%), Gaps = 14/182 (7%)
Query: 3 ESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQN 60
E QYAI LL LS Q I+C + N GC GG ++YL+ GLE E YP+
Sbjct: 66 EIQYAIHKLKLLYLSVQQFIDCTHSYGNNGCHGGDTLSLLKYLQTIGLETEEMYPYT--- 122
Query: 61 GVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMN--GALLQDYNGKLI 116
GV C ++ V VR + + NGS++ R ++ GP V MN L +G I
Sbjct: 123 GVDQECMANSSNVTVRSIGYKSIQNGSESDLRDVICSEGPYVVTMNIDENFLHYKSG--I 180
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIES 174
++ C NLN ++ ++GY + WI++NSWG WG +DG+ V R N CGI S
Sbjct: 181 YQSIYCNESNLNQSMAVIGYDSNEGIDYWILKNSWGTNWG-EDGFVYVRRNYGNMCGIAS 239
Query: 175 YG 176
+
Sbjct: 240 FA 241
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 97/181 (53%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECN--IYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q+ +K+G L+ LS+ L++C+ N GC+GG A +Y+K + G++ E YP+
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDT-FRRMLYHYGPLVAGMNGA--LLQDYNGK 114
V G C + V + ++ + GS+ ++ + GP+ ++ + Q Y+
Sbjct: 208 --AVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEG 265
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVER-GTNACGIE 173
+ + + C SE+L+H V++VGYG++ W+V+NSW D GY + R N CGI
Sbjct: 266 VYDEPE-CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIA 324
Query: 174 S 174
S
Sbjct: 325 S 325
>gi|403302732|ref|XP_003942007.1| PREDICTED: cathepsin S isoform 2 [Saimiri boliviensis boliviensis]
Length = 289
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 94/180 (52%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 107 LEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYK- 165
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+C YD++ S + L + D + + + GP+ G++ + + +
Sbjct: 166 --ATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSG 223
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESY 175
D ++ +NH V+++GYG + W+V+NSWG + GY + R N CGI SY
Sbjct: 224 VYYDPACTQKVNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASY 283
>gi|270006364|gb|EFA02812.1| cathepsin O precursor [Tribolium castaneum]
Length = 326
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 87/174 (50%), Gaps = 14/174 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAG--LEAEADYPFRNQ 59
+ES AIK LS ++I+C N+GC GG + ++K ++ ADY
Sbjct: 154 VESMNAIKTNKSEELSVQEIIDCAGNNKGCNGGDICTLLSWIKATNFTIQRHADY----- 208
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
G+C + V VR DF++ D R+L GPL +N Q+Y G +I +
Sbjct: 209 ---GGKCGRGSAGVHVR--DFILVGSEDVMLRLLADNGPLAVAINAQTWQNYIGGVIEYH 263
Query: 120 -DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
D PS+ LNHAV IVGY + +P +IVRN+WG D G+ + N CGI
Sbjct: 264 CDGDPSK-LNHAVQIVGYDLTASIPHYIVRNTWGVDFGDGGFLYIAVDKNMCGI 316
>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
Length = 329
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 89/177 (50%), Gaps = 10/177 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY++ G+++E YP+ Q+
Sbjct: 148 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQQNRGIDSEDAYPYVGQD 207
Query: 61 GVTGRCAYD--ARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQ-DYNGKLIR 117
C Y+ + K R + +R + GP+ ++ +L + K +
Sbjct: 208 E---SCMYNPTGKAAKCRGYREVPVGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 264
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERG-TNACGI 172
++ C +NLNHAV+ VGYG++ WI++NSWG WG + GY + R N CGI
Sbjct: 265 YDESCDGDNLNHAVLAVGYGIQRGHKHWILKNSWGENWG-NKGYVLLARNKNNTCGI 320
>gi|148709374|gb|EDL41320.1| cathepsin 7, isoform CRA_c [Mus musculus]
Length = 277
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 64/187 (34%), Positives = 98/187 (52%), Gaps = 17/187 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKH-AGLEAEADYPFRN 58
+E Q K G L+PLS L++C++ +GC GG A QY+K+ GLEAEA YP+
Sbjct: 91 IEGQLFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATYPYEA 150
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNG--ALLQDYNGKL 115
+ C Y + V+V+ F V + + L +GP+ ++G A Y G +
Sbjct: 151 K---AKHCRYRPERSVVKVNRFFVVPRNEEALLQALVTHGPIAVAIDGSHASFHSYRGGI 207
Query: 116 IRKNDVCPSENLNHAVVIVGYGMR-HQVP---VWIVRNSWG-RWGPDDGYFTVERG-TNA 169
+ C + L+H +++VGYG H+ W+++NS G RWG ++GY + RG N
Sbjct: 208 YHEPK-CRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWG-ENGYMKLPRGQNNY 265
Query: 170 CGIESYG 176
CGI SY
Sbjct: 266 CGIASYA 272
>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
Length = 339
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 94/180 (52%), Gaps = 9/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYL-KHAGLEAEADYPFRN 58
LE+Q +K G L+ LS L++C+ N+GC GG +A QY+ + G+++EA YP++
Sbjct: 157 LEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYK- 215
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
+C YD++ S + L + D + + + GP+ G++ + + +
Sbjct: 216 --ATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSG 273
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGT-NACGIESY 175
D ++ +NH V+++GYG + W+V+NSWG + GY + R N CGI SY
Sbjct: 274 VYYDPACTQKVNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASY 333
>gi|340710428|ref|XP_003393792.1| PREDICTED: cathepsin O-like [Bombus terrestris]
Length = 355
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 95/180 (52%), Gaps = 10/180 (5%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLE--AEADYPFR 57
++ES YAIK+GTL LS ++I+C N GC+GG + +L + ++ E+ YP
Sbjct: 168 VVESMYAIKNGTLYMLSVQEMIDCAKNKNFGCEGGDIYSLLSWLLASKVQIFQESTYPLV 227
Query: 58 NQNGV--TGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYH---YGPLVAGMNGALLQDYN 112
+ + G+ +A VK+R DF N D +L +GP+ A +N Q+Y
Sbjct: 228 GKTSMCKLGKMIDNAFGVKIR--DFNCDNFVDAEDELLIKVATHGPVAAVVNALSWQNYL 285
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRWGPDDGYFTVERGTNACGI 172
G +I+ + +N NHAV I+GY +P +I++NSWG D GY + G N CGI
Sbjct: 286 GGVIQYHCDSTYDNRNHAVQIIGYDKSAAIPHYIIKNSWGTNFGDKGYMYIAIGNNLCGI 345
>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
Length = 477
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 98/175 (56%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K GTLL LS+ +L++C+ ++ C GG + A ++ GLE E DY +R
Sbjct: 297 VEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAIRTLGGLETEDDYSYR--- 353
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C++ A K KV ++D + + ++ L GP+ +N +Q Y +
Sbjct: 354 GHLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGPISVAINAFGMQFYRHGISHPL 413
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+C ++HAV++VGYG R P W ++NSWG WG ++GY+ + RG+ ACG+
Sbjct: 414 RPLCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTNWG-EEGYYYLHRGSGACGV 467
>gi|218478062|dbj|BAH03397.1| cathepsin L-like cysteine peptidase [Taenia asiatica]
Length = 338
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 93/179 (51%), Gaps = 9/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYN--QGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
LE +A K G L+ LS+ QL++C++ N GC GG + A +YL+ +E E+ YP+R
Sbjct: 156 LEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYLEEHSIEPESAYPYRAT 215
Query: 60 NGVTGRCAYDARKVKVRVSDF-LVFNGSDT-FRRMLYHYGPLVAGMNGALLQ-DYNGKLI 116
+G C Y+ V+D + G++T + GP+ ++ + L + I
Sbjct: 216 DGP---CRYNESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGI 272
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
K+ C S+ LNH V+ +GYG + P W+V+NSWG RWG + N CG+ S
Sbjct: 273 YKSHWCSSKFLNHGVLAIGYGKQEGKPYWLVKNSWGTRWGMKGYIMMAKDYHNMCGVAS 331
>gi|30141463|emb|CAD54748.1| cysteine proteinase b [Leishmania guyanensis]
Length = 174
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 81/160 (50%), Gaps = 9/160 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+ESQ+ + +L+ LS+ +L+ C+ ++GC GG +A +L K+ + A YP+ +
Sbjct: 17 IESQWYVTTHSLITLSEQELVSCDDVDEGCNGGLMLQAFDWLLBNKNGAVYTGASYPYVS 76
Query: 59 QNGVTGRCAYDARKVKVRVSD--FLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D + + DT L GP+ ++ + Y G ++
Sbjct: 77 GNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMSYTGGIL 136
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WG 155
C LNH V++VGY M +VP W+++NSWG WG
Sbjct: 137 TS---CDGRQLNHGVLLVGYNMTGEVPYWLIKNSWGENWG 173
>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
Length = 460
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 99/175 (56%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K GTLL LS+ +L++C+ ++ C GG + A ++ GLE E DY +R
Sbjct: 280 VEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAIRTLGGLETEDDYSYR--- 336
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C++ A K KV ++D + + ++ L GP+ +N +Q Y +
Sbjct: 337 GRLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVSIAINAFGMQFYRHGISHPL 396
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGI 172
+C ++HAV++VGYG R +P W ++NSWG WG ++GY+ + RG+ ACG+
Sbjct: 397 RPLCSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWG-EEGYYYLHRGSGACGV 450
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 86/180 (47%), Gaps = 10/180 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL---KHAGLEAEADYPFRN 58
+E Q+ + L+ LS+ QL+ C+ + GC GG +A +L + L E YP+ +
Sbjct: 159 IEGQWYLAGHELVSLSEQQLVSCDDMDNGCSGGLMLQAFDWLLQNTNGHLHTEDSYPYVS 218
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGALLQDYNGKLI 116
NG C+ + V D V GS L GP+ ++ + Y ++
Sbjct: 219 GNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL 278
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSW-GRWGPDDGYFTVERGTNACGIESY 175
C + LNH V++VGY M +VP W+++NSW G WG + GY V G NAC + Y
Sbjct: 279 ---TACIGKQLNHGVLLVGYDMTGEVPYWVIKNSWGGDWG-EQGYVRVVMGVNACLLSEY 334
>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
Length = 460
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 98/175 (56%), Gaps = 8/175 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKH-AGLEAEADYPFRNQN 60
+E Q+ +K GTLL LS+ +L++C+ ++ C GG + A ++ GLE E DY +R
Sbjct: 280 VEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAIRTLGGLETEDDYSYR--- 336
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD-TFRRMLYHYGPLVAGMNGALLQDYNGKLIRK- 118
G C++ A K KV ++D + + ++ L GP+ +N +Q Y +
Sbjct: 337 GHLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGPISVAINAFGMQFYRHGISHPL 396
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGI 172
+C ++HAV++VGYG R P W ++NSWG WG ++GY+ + RG+ ACG+
Sbjct: 397 RPLCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTNWG-EEGYYYLHRGSGACGV 450
>gi|390470786|ref|XP_003734355.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin W [Callithrix jacchus]
Length = 373
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 97/195 (49%), Gaps = 23/195 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQ-YLKHAGLEAEADYPFRNQN 60
+E+ ++I + +S +L++C GC GG A LK++G+ +E+DYPF+
Sbjct: 162 IEALWSINFLKFVNVSVQELLDCGRCGDGCHGGYVWDAFSTVLKNSGVVSESDYPFQANF 221
Query: 61 GVTGRCAYDARKVKVRVSDFLVF-NGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR-K 118
G RC + DF+ + + L YGP+ +N LQ Y +I+ +
Sbjct: 222 G-PHRCHAKTYNKVAWIMDFIFLPDDXQRIAQYLTTYGPITVTINAKHLQLYQKGVIKAR 280
Query: 119 NDVCPSENLNHAVVIVGYGM---------------RH--QVPVWIVRNSWG-RWGPDDGY 160
C + ++H+V++VG+G RH P WI++NSWG +WG ++GY
Sbjct: 281 PTTCDPQFVDHSVLLVGFGSEKSEGMGAKTVSSQSRHPRSTPYWILKNSWGAQWG-EEGY 339
Query: 161 FTVERGTNACGIESY 175
F + RG+N CGI Y
Sbjct: 340 FRLHRGSNTCGITKY 354
>gi|473159|emb|CAA83538.1| cathepsin L [Schistosoma mansoni]
Length = 317
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 88/177 (49%), Gaps = 6/177 (3%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q KH L+ LS+ QL++C+ N GCQGG +++ YL+ +E+E DY +
Sbjct: 136 VEGQLVKKHKKLISLSEQQLVDCSYKYGNDGCQGGTMDQSFAYLEKYPIESEKDYKYIGH 195
Query: 60 NGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKN 119
+ + D L + ++ LYHYGP+ ++ I ++
Sbjct: 196 DSSCHFRKSKGVVKVKKFVD-LPARDEEKLQKALYHYGPISVAIDALDDLILYKSGIYES 254
Query: 120 DVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIES 174
C S LNH V+ VGYG ++ W+++NSWG WG +GYF + R N CGI +
Sbjct: 255 KQCSSFLLNHGVLAVGYGRENRKDYWLIKNSWGTTWGM-NGYFKLRRNKHNMCGIAT 310
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.141 0.450
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,331,927,563
Number of Sequences: 23463169
Number of extensions: 142617423
Number of successful extensions: 244333
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4146
Number of HSP's successfully gapped in prelim test: 2487
Number of HSP's that attempted gapping in prelim test: 230406
Number of HSP's gapped (non-prelim): 7635
length of query: 192
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 58
effective length of database: 9,215,130,721
effective search space: 534477581818
effective search space used: 534477581818
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 72 (32.3 bits)
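
The footer statistics can be cross-checked against the per-hit scores with the usual Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The sketch below is not part of the BLAST output. It assumes that the second Lambda/K pair listed above (0.267, 0.0410) is the gapped set and the first pair (0.322, 0.141) the ungapped set, and the function and constant names are illustrative only. Reported Expect values are only expected to agree in order of magnitude, since the hits use compositional matrix adjustment.

```python
import math

# Values copied from the statistics footer above.
UNGAPPED = (0.322, 0.141)    # assumed: first Lambda/K pair (ungapped)
GAPPED = (0.267, 0.0410)     # assumed: second Lambda/K pair (gapped)
QUERY_LEN = 192              # "length of query"
HSP_LEN = 134                # "effective HSP length"
EFF_DB_LEN = 9215130721      # "effective length of database", as printed
SEARCH_SPACE = 534477581818  # "effective search space used"


def bit_score(raw_score, params=GAPPED):
    """Convert a raw alignment score to bits: (Lambda*S - ln K) / ln 2."""
    lam, k = params
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(raw_score, space=SEARCH_SPACE, params=GAPPED):
    """Approximate E-value: search space * 2^(-bits)."""
    return space * 2.0 ** (-bit_score(raw_score, params))


# Effective query length and search space follow from the footer values.
eff_query_len = QUERY_LEN - HSP_LEN
assert eff_query_len * EFF_DB_LEN == SEARCH_SPACE

# Raw scores taken from the hit headers and cutoff lines above.
for raw in (220, 221, 72):
    print(f"S = {raw}: {bit_score(raw):.1f} bits")
print(f"S1 = 41 (ungapped): {bit_score(41, UNGAPPED):.1f} bits")
print(f"E(S = 220) is roughly {expect(220):.1e}")
```

Under these assumptions, raw scores 220 and 221 convert to the 89.4 and 89.7 bits shown in the hit headers, and the S1 and S2 cutoffs convert to the 21.9 and 32.3 bits printed in the footer.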