BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy1727
(240 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
Length = 586
Score = 196 bits (499), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 91/151 (60%), Positives = 115/151 (76%), Gaps = 6/151 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG--SNRACHLNKEEIRVKI 119
+L+DCD +D GCGGGLM+ AFE + + GGLE E DYPY+G + C L K +++V I
Sbjct: 416 ELIDCDNLDNGCGGGLMTQAFEAVENL--GGLETESDYPYEGHADRKGCQLKKSDVKVSI 473
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
VNVS+DE ++AK+LVK+GP++V +NANAMQFY GGVSHP+ LC +LDHGV I
Sbjct: 474 SKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPIHALCSP--KSLDHGVAI 531
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYGVH+TK+THK PYW+IKNSWGP WGEK
Sbjct: 532 VGYGVHRTKYTHKNLPYWLIKNSWGPGWGEK 562
Score = 45.1 bits (105), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 5/71 (7%)
Query: 3 ATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQI-----RGEGT 57
T K + D+L+ F +F+ HNK Y + EE +R RIF AN+KK+++ +G
Sbjct: 264 TTKKNNIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAI 323
Query: 58 HLALKLVDCDK 68
+ A + D K
Sbjct: 324 YGATQFADLTK 334
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 196 bits (498), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 98/167 (58%), Positives = 118/167 (70%), Gaps = 22/167 (13%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDK+D+GC GGL A+ I + GGLE E DYPY + CH NK +++V I S
Sbjct: 868 ELVDCDKLDSGCNGGLPDTAYRAI--EELGGLELESDYPYDAEDEKCHFNKNKVKVNIVS 925
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+N++S+ET+MA++LVKNGPM++ INANAMQFY GGVSHP KFLC D+LDHGVLIVG
Sbjct: 926 GLNITSNETQMAQWLVKNGPMSIGINANAMQFYMGGVSHPFKFLC--SPDSLDHGVLIVG 983
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQ 228
YGV KF Y I K KTMP+WIIKNSWGPRWGEQ
Sbjct: 984 YGV---KF------YPIFK---------KTMPYWIIKNSWGPRWGEQ 1012
>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
Length = 803
Score = 196 bits (497), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 91/149 (61%), Positives = 108/149 (72%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDK D GC GGL A+ I + GGLE E DYPY G + CH N E+RV I S
Sbjct: 636 ELVDCDKYDDGCEGGLFETAYHAI--EELGGLELESDYPYSGRDNTCHFNSSEVRVSITS 693
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VN+S+DET+MAK+LV NGP+++ INANAMQFY GGVSHPLKFLC LDHGVLIVG
Sbjct: 694 SVNISNDETDMAKWLVANGPISIGINANAMQFYLGGVSHPLKFLCDPK--TLDHGVLIVG 751
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG+H+T H+ PYW+IKNSW +WG K
Sbjct: 752 YGIHRTWLLHRHLPYWLIKNSWSSYWGAK 780
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 193 bits (491), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 90/149 (60%), Positives = 109/149 (73%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GC GG M NA++ I + GGLE E DYPY + CH + + +V++ S
Sbjct: 716 ELVDCDSLDEGCNGGDMENAYKAI--ERLGGLELESDYPYDAKDEKCHFLQNKAKVQVVS 773
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VN++SDE MA++LVKNGP++V INANAMQFYFGGVSHPL FLC NLDHGVLIVG
Sbjct: 774 AVNITSDEKRMAQWLVKNGPISVGINANAMQFYFGGVSHPLNFLCNPK--NLDHGVLIVG 831
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG+ K HK PYWIIKNSWGP WGE+
Sbjct: 832 YGISKYPLFHKELPYWIIKNSWGPRWGER 860
>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
rotundata]
Length = 884
Score = 190 bits (483), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 90/149 (60%), Positives = 109/149 (73%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GCGGG M NA++T+ KLGG LE E DYPY N CH K + +V++ S
Sbjct: 716 ELVDCDNLDDGCGGGYMINAYKTV-EKLGG-LELETDYPYDARNEKCHFLKNKAKVQVAS 773
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+N+++DE +MA++LVKNGP++V INANAMQFYFGGVSHP KFLC NLDHGVLIVG
Sbjct: 774 ALNITNDEKKMAQWLVKNGPISVGINANAMQFYFGGVSHPFKFLCDPA--NLDHGVLIVG 831
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y K PYWIIKNSWGP WGE+
Sbjct: 832 YATSTYPLFKKKLPYWIIKNSWGPKWGEQ 860
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats.
Identities = 87/149 (58%), Positives = 106/149 (71%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GC GGL NA+ I KLGG LE E DYPY+ N CH K +V++ S
Sbjct: 864 ELVDCDDLDEGCNGGLPDNAYRAI-EKLGG-LELESDYPYEAENERCHFKKNMAKVQVGS 921
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VN++S+ET++A++LV NGP+++ INANAMQFY GGVSHP KFLC NLDHGVLIVG
Sbjct: 922 AVNITSNETQIAQWLVANGPISIGINANAMQFYMGGVSHPFKFLCNP--KNLDHGVLIVG 979
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG HK PYWI+KNSWG WGE+
Sbjct: 980 YGTSNYPLFHKKLPYWIVKNSWGDRWGEQ 1008
Score = 38.9 bits (89), Expect = 1.9, Method: Composition-based stats.
Identities = 16/35 (45%), Positives = 26/35 (74%)
Query: 18 MFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQI 52
+F +F+ +N++YAT+EE + RL IFR NL I++
Sbjct: 726 LFENFVNTYNRTYATEEERNLRLSIFRENLGIIRL 760
>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
castaneum]
Length = 1726
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 88/149 (59%), Positives = 107/149 (71%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD D GC GGLM A+ +I K+GG LE E+DYPY + CH N+ RV++
Sbjct: 1558 ELVDCDTDDQGCNGGLMDTAYRSI-EKIGG-LETEQDYPYDAEDEKCHFNRTLARVQVTG 1615
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+N+S +ET+MAK+LV NGP+++AINANAMQFY GGVSHP KFLC NLDHGVLIVG
Sbjct: 1616 ALNISHNETDMAKWLVANGPISIAINANAMQFYMGGVSHPFKFLC--SPKNLDHGVLIVG 1673
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YGVH K PYWI+KNSWG WGE+
Sbjct: 1674 YGVHNYPLFKKSLPYWIVKNSWGTGWGEQ 1702
>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
Length = 1761
Score = 187 bits (476), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 88/149 (59%), Positives = 107/149 (71%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD D GC GGLM A+ +I K+GG LE E+DYPY + CH N+ RV++
Sbjct: 1593 ELVDCDTDDQGCNGGLMDTAYRSI-EKIGG-LETEQDYPYDAEDEKCHFNRTLARVQVTG 1650
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+N+S +ET+MAK+LV NGP+++AINANAMQFY GGVSHP KFLC NLDHGVLIVG
Sbjct: 1651 ALNISHNETDMAKWLVANGPISIAINANAMQFYMGGVSHPFKFLC--SPKNLDHGVLIVG 1708
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YGVH K PYWI+KNSWG WGE+
Sbjct: 1709 YGVHNYPLFKKSLPYWIVKNSWGTGWGEQ 1737
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 187 bits (474), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 88/149 (59%), Positives = 107/149 (71%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDK+D GC GGL NA+ I + GGLE E DYPY+GS+ C NK RV+I
Sbjct: 2509 ELVDCDKLDQGCNGGLPDNAYRAI--EQLGGLESEDDYPYEGSDDKCSFNKTLARVQISG 2566
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VN++S+ET+MAK+LVK+GP+++ INANAMQFY GG+SHP + LC NLDHGVLIVG
Sbjct: 2567 AVNITSNETDMAKWLVKHGPISIGINANAMQFYMGGISHPWRMLCNPS--NLDHGVLIVG 2624
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG HK PYWIIKNSWG WGE+
Sbjct: 2625 YGAKDYPLFHKHLPYWIIKNSWGTSWGEQ 2653
>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
Length = 887
Score = 186 bits (472), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 88/149 (59%), Positives = 106/149 (71%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GC GGL NA+ I KLGG LE E DYPY+ N CH K +V++ S
Sbjct: 719 ELVDCDDLDEGCNGGLPDNAYRAI-EKLGG-LELESDYPYEAENEKCHFKKNLAKVQLAS 776
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VN++S+ET+MA++LV+NGP+++ INANAMQFY GGVSHP KFLC NLDHGVLIVG
Sbjct: 777 AVNITSNETQMAQWLVQNGPISIGINANAMQFYVGGVSHPFKFLCNPK--NLDHGVLIVG 834
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG HK PYW IKNSWG WGE+
Sbjct: 835 YGTSDYPLFHKKLPYWTIKNSWGKRWGEQ 863
Score = 43.5 bits (101), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 27/35 (77%)
Query: 18 MFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQI 52
+FN+F+ +N++Y+T EE + RLRIFR NL IQ+
Sbjct: 581 LFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQL 615
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 85/149 (57%), Positives = 109/149 (73%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GC GGL NA+ I + GGLE E DYPY+ N CH + ++V++ S
Sbjct: 606 ELVDCDHLDEGCNGGLPDNAYRAI--EQLGGLELESDYPYEAENEKCHFKQNLVKVELAS 663
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VN++S+ET++A++LV+NGP+A+ INANAMQFY GGVSHPLK LC +NL+HGVLIVG
Sbjct: 664 AVNITSNETQIAQWLVQNGPIAIGINANAMQFYMGGVSHPLKILCNP--NNLNHGVLIVG 721
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG + HK PYWIIKNSWG WGE+
Sbjct: 722 YGTSRYPLFHKNLPYWIIKNSWGKSWGEQ 750
>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
Length = 434
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 86/149 (57%), Positives = 108/149 (72%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCDK+D GC GG M +E I+ KLGG LE E DYPY+ N C+LNK EI+VKI
Sbjct: 266 ELIDCDKIDNGCNGGYMPETYEAIM-KLGG-LETETDYPYEAENEKCNLNKTEIKVKING 323
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VN++ E ++AK+L KNGP++ +NANAMQFY GG+SHP K LC + DHG+LIVG
Sbjct: 324 AVNLTKSELDIAKWLYKNGPVSAGLNANAMQFYLGGISHPPKILCNP--EEQDHGILIVG 381
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG+HK+ + PYWIIKNSWG HWGEK
Sbjct: 382 YGIHKSSILKRTIPYWIIKNSWGKHWGEK 410
Score = 44.3 bits (103), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 24/32 (75%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
F F+ K NK Y +KEE+ KR RIFRAN+KKI
Sbjct: 134 FKDFVLKFNKVYFSKEEFKKRFRIFRANMKKI 165
>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
Length = 881
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 86/145 (59%), Positives = 105/145 (72%), Gaps = 4/145 (2%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCD +D GC GG M NA++ I KLGG LE E DYPY G N CH K+ +V++ VN
Sbjct: 716 DCDTLDEGCNGGYMENAYKAI-EKLGG-LELESDYPYDGRNEKCHFFKKNAKVQVVGAVN 773
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
++S+ET+MA++L+KNGP+++ INANAMQFY GGVSHP FLC +LDHGVLIVGYG+
Sbjct: 774 ITSNETKMAQWLIKNGPISIGINANAMQFYIGGVSHPFHFLCNPK--DLDHGVLIVGYGI 831
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGE 209
K HK PYWIIKNSWG WGE
Sbjct: 832 SKYPLFHKELPYWIIKNSWGSRWGE 856
>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
mellifera]
Length = 881
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 86/145 (59%), Positives = 105/145 (72%), Gaps = 4/145 (2%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCD +D GC GG M NA++ I KLGG LE E DYPY G N CH K+ +V++ VN
Sbjct: 716 DCDTLDEGCNGGYMENAYKAI-EKLGG-LELESDYPYDGRNEKCHFFKKNAKVQVVGAVN 773
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
++S+ET+MA++L+KNGP+++ INANAMQFY GGVSHP FLC +LDHGVLIVGYG+
Sbjct: 774 ITSNETKMAQWLIKNGPISIGINANAMQFYIGGVSHPFHFLCNPK--DLDHGVLIVGYGI 831
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGE 209
K HK PYWIIKNSWG WGE
Sbjct: 832 SKYPLFHKKLPYWIIKNSWGSRWGE 856
>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
Length = 715
Score = 180 bits (456), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 91/149 (61%), Positives = 107/149 (71%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDKVD GC GGL S A++ II GGLE E DY Y+G N C ++K +IRVKI
Sbjct: 553 ELVDCDKVDEGCNGGLPSQAYKEIIRL--GGLETETDYKYRGHNEKCSMDKSKIRVKING 610
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V++SS+ETEMA +LVKNGP+++ INA AMQFY GG+SHP K C LDHGVLIVG
Sbjct: 611 SVSISSNETEMAAWLVKNGPISIGINAFAMQFYMGGISHPWKIFCN--PKELDHGVLIVG 668
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YGV +K PYWIIKNSWGP WGEK
Sbjct: 669 YGVKGSK------PYWIIKNSWGPDWGEK 691
>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
Length = 1834
Score = 179 bits (454), Expect = 8e-43, Method: Composition-based stats.
Identities = 83/165 (50%), Positives = 116/165 (70%), Gaps = 5/165 (3%)
Query: 47 LKKIQIRGEGTHLALKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SN 105
L +I+ + ++ +L+DCDKVD GCGGG M +AF+ I + GGLE E DYPY+ +
Sbjct: 1650 LHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAI--EQLGGLELENDYPYEAKAQ 1707
Query: 106 RACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFL 165
++CH N+ V+++ V++ +ET +AKYL+KNGP+A+ +NANAMQFY GG+SHP L
Sbjct: 1708 KSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGLNANAMQFYRGGISHPWHPL 1767
Query: 166 CKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C ++DHGVLIVGYG+ + +K PYWIIKNSWGP WGE+
Sbjct: 1768 CN--HKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQ 1810
>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
Length = 1810
Score = 179 bits (454), Expect = 8e-43, Method: Composition-based stats.
Identities = 83/165 (50%), Positives = 116/165 (70%), Gaps = 5/165 (3%)
Query: 47 LKKIQIRGEGTHLALKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SN 105
L +I+ + ++ +L+DCDKVD GCGGG M +AF+ I + GGLE E DYPY+ +
Sbjct: 1626 LHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAI--EQLGGLELENDYPYEAKAQ 1683
Query: 106 RACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFL 165
++CH N+ V+++ V++ +ET +AKYL+KNGP+A+ +NANAMQFY GG+SHP L
Sbjct: 1684 KSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGLNANAMQFYRGGISHPWHPL 1743
Query: 166 CKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C ++DHGVLIVGYG+ + +K PYWIIKNSWGP WGE+
Sbjct: 1744 CN--HKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQ 1786
>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
pulchellus]
Length = 475
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 83/149 (55%), Positives = 107/149 (71%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD D GC GG M A + +I GGLE E +YPYKG + C NK E + ++QS
Sbjct: 304 ELVDCDHGDHGCKGGYMGQAMKAVIEM--GGLETESEYPYKGVDGTCEFNKTESKARVQS 361
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+V + +ETE+A +L+K+GP+++ INANAMQFYFGG+SHP KFLC +LDHGVL+VG
Sbjct: 362 FVGLPQNETELAYWLMKHGPVSIGINANAMQFYFGGISHPWKFLCSP--TDLDHGVLLVG 419
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+GV K F K PYWI+KNSWG +WGEK
Sbjct: 420 FGVDKRSFRRKPVPYWIVKNSWGKYWGEK 448
>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
Length = 361
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 83/149 (55%), Positives = 107/149 (71%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD D GC GG M A + +I GGLE E +YPYKG + C NK E + ++QS
Sbjct: 190 ELVDCDHGDHGCKGGYMGQAMKAVIEM--GGLETESEYPYKGVDGTCEFNKTESKARVQS 247
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+V + +ETE+A +L+K+GP+++ INANAMQFYFGG+SHP KFLC +LDHGVL+VG
Sbjct: 248 FVGLPQNETELAYWLMKHGPVSIGINANAMQFYFGGISHPWKFLCS--PTDLDHGVLLVG 305
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+GV K F K PYWI+KNSWG +WGEK
Sbjct: 306 FGVDKRSFRRKPVPYWIVKNSWGKYWGEK 334
>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
Length = 586
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 83/149 (55%), Positives = 107/149 (71%), Gaps = 6/149 (4%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG--SNRACHLNKEEIRVKI 119
+L+DCD +D GCGGGLM+ AFE + + GGLE E DYPY+G + C L K +++V I
Sbjct: 416 ELIDCDNLDNGCGGGLMTQAFEAVENL--GGLETESDYPYEGHADRKGCQLKKSDVKVSI 473
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
VNVS+DE ++AK+LVK+GP++V +NANAMQFY GGVSHP+ LC +LDHGV I
Sbjct: 474 SKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPIHALCSP--KSLDHGVAI 531
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
VGYGVHK + + P+W IKNSWG WG
Sbjct: 532 VGYGVHKYPYLNATLPFWTIKNSWGDKWG 560
Score = 45.4 bits (106), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 5/71 (7%)
Query: 3 ATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQI-----RGEGT 57
T K + D+L+ F +F+ HNK Y + EE +R RIF AN+KK+++ +G
Sbjct: 264 TTKKNNIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAI 323
Query: 58 HLALKLVDCDK 68
+ A + D K
Sbjct: 324 YGATQFADLTK 334
>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
Length = 537
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 84/149 (56%), Positives = 107/149 (71%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDK+D GC GG M NA+ I + GGLE E++YPY+ + C NK +V+I
Sbjct: 370 ELVDCDKMDDGCDGGYMDNAYRAI--EQLGGLETEEEYPYEAEDDKCSFNKSLSKVQISG 427
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VN+SS+ET MAK+LV NGP+++ INANAMQFY GGVSHP K LC N+DHGVLIVG
Sbjct: 428 AVNISSNETNMAKWLVHNGPISIGINANAMQFYVGGVSHPWKALCNP--KNIDHGVLIVG 485
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG+ + +K PYW++KNSWGP WGE+
Sbjct: 486 YGIKEYPLFNKQLPYWVVKNSWGPGWGEQ 514
>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
Length = 276
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 87/149 (58%), Positives = 105/149 (70%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GC GGL SNA++ I KLGG LE E DYPYKG++ C NK E++V I S
Sbjct: 114 ELVDCDTIDKGCEGGLPSNAYKQI-EKLGG-LESESDYPYKGADSKCKFNKAEVKVTINS 171
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V +S DE E+A +L KNGP+++ INANAMQFY GG++HP K C +L+HGVLIVG
Sbjct: 172 SVVISKDEKEIAAWLAKNGPISIGINANAMQFYMGGIAHPWKIFCN--PSSLNHGVLIVG 229
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YGV PYWIIKNSWGP WGEK
Sbjct: 230 YGVKNG------TPYWIIKNSWGPSWGEK 252
>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
[Strongylocentrotus purpuratus]
Length = 453
Score = 174 bits (441), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 88/159 (55%), Positives = 110/159 (69%), Gaps = 11/159 (6%)
Query: 53 RGEGTHLA-LKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+GE L+ +LVDCDKVD GC GG MS+A+E II KLGG + EK YPY+G N C N
Sbjct: 281 KGELISLSEQELVDCDKVDGGCEGGEMSDAYEAII-KLGGAMSEEK-YPYRGENEKCKFN 338
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
++RVKI YVN+S +ETEMA +L +GP+++ INA MQFYFGG++HP K C D
Sbjct: 339 MTDVRVKINGYVNISKNETEMAGWLAAHGPISIGINALMMQFYFGGIAHPWKIFCSP--D 396
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+LDHGVLIVGY V +PYWI+KNSWG WGE+
Sbjct: 397 SLDHGVLIVGYSVKDG------EPYWIVKNSWGKDWGEE 429
>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
Length = 953
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 83/165 (50%), Positives = 116/165 (70%), Gaps = 5/165 (3%)
Query: 47 LKKIQIRGEGTHLALKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SN 105
L +I+ + ++ +L+DCDKVD GCGGG M +AF+ I + GGLE E DYPY+ +
Sbjct: 769 LHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAI--EQLGGLELENDYPYEAKAQ 826
Query: 106 RACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFL 165
++CH N+ V+++ V++ +ET +AKYL+KNGP+A+ +NANAMQFY GG+SHP L
Sbjct: 827 KSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGLNANAMQFYRGGISHPWHPL 886
Query: 166 CKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C ++DHGVLIVGYG+ + +K PYWIIKNSWGP WGE+
Sbjct: 887 C--NHKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQ 929
>gi|321460289|gb|EFX71333.1| hypothetical protein DAPPUDRAFT_189155 [Daphnia pulex]
Length = 266
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 87/169 (51%), Positives = 114/169 (67%), Gaps = 7/169 (4%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ I G L+L +LVDCDK+D+GC GGL NA++ I GGLE E DYPY
Sbjct: 78 GNVEGIYAVRNGDLLSLSEQELVDCDKLDSGCNGGLPENAYKAIHDI--GGLETESDYPY 135
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
G C N RV++ V +S++ETEMA++L++NGP+++ INANAMQ+Y GGVSHP
Sbjct: 136 NGHENKCKFNSNITRVQVTGGVEISTNETEMAQWLIQNGPISIGINANAMQYYRGGVSHP 195
Query: 162 LKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
K LC+ G +DHGVLIVGYGV + +K PYWI+KNSWG WGE+
Sbjct: 196 WKVLCRPG--GIDHGVLIVGYGVSQYPKFNKTLPYWIVKNSWGTRWGEQ 242
>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
Length = 620
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 81/150 (54%), Positives = 104/150 (69%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPY+ + CH NK V+++
Sbjct: 451 ELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYEAKKKQCHFNKTMSHVQVKD 508
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++LV NGP+++ INANAMQFY GGVSHP K LC NLDHGVL+V
Sbjct: 509 FVDLPKGNETAMQEWLVSNGPISIGINANAMQFYRGGVSHPWKALC--SKKNLDHGVLVV 566
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 567 GYGVSDYPNYHKTLPYWIVKNSWGPRWGEQ 596
Score = 37.4 bits (85), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 11 DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
DK+EH +F+ F + + Y + E RLRIFR NLK I+
Sbjct: 308 DKVEH--LFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIE 346
>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
Length = 1165
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 80/150 (53%), Positives = 106/150 (70%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGS-NRACHLNKEEIRVKIQ 120
+L+DCD VD+ C GG M +A++ I K+GG LE E +YPY + CH N E+ V+++
Sbjct: 996 ELLDCDAVDSACQGGYMDDAYKAI-EKIGG-LELESEYPYLAKKQKTCHFNSTEVHVRVK 1053
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
V++ +ET MA+YLV NGP+++ +NANAMQFY GG+SHP K LC NLDHGVLIV
Sbjct: 1054 GAVDLPKNETAMAQYLVANGPISIGLNANAMQFYRGGISHPWKPLC--SKKNLDHGVLIV 1111
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV + +K PYWI+KNSWGP WGE+
Sbjct: 1112 GYGVKEYPMFNKTMPYWIVKNSWGPKWGEQ 1141
>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
Length = 1785
Score = 169 bits (429), Expect = 7e-40, Method: Composition-based stats.
Identities = 81/165 (49%), Positives = 111/165 (67%), Gaps = 5/165 (3%)
Query: 47 LKKIQIRGEGTHLALKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SN 105
L +I+ + + +L+DCD VD GC GG M +AF+ I KLGG LE E +YPY+ +
Sbjct: 1601 LHQIKTKKLEAYSEQELIDCDTVDNGCNGGYMDDAFKAI-EKLGG-LELEDEYPYQAKAQ 1658
Query: 106 RACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFL 165
+ CH NK V+++ V++ +ET +A+YL++NGP+A+ +NANAMQFY GG+SHP L
Sbjct: 1659 KTCHFNKTLSHVRVKGAVDMPKNETFIAQYLIENGPIAIGLNANAMQFYRGGISHPWHLL 1718
Query: 166 CKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C +DHGVLIVGYGV + +K PYW IKNSWGP WGE+
Sbjct: 1719 CS--HKQIDHGVLIVGYGVKEYPLFNKTLPYWTIKNSWGPKWGEQ 1761
>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
Length = 615
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 80/150 (53%), Positives = 103/150 (68%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPYK CH N+ V++
Sbjct: 446 ELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYKAKKNQCHFNRTLSHVQVAG 503
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ NGP+++ INANAMQFY GGVSHP K LC NLDHGVL+V
Sbjct: 504 FVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHPWKALC--SKKNLDHGVLVV 561
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV + HK PYWI+KNSWGP WGE+
Sbjct: 562 GYGVSEYPNFHKTLPYWIVKNSWGPRWGEQ 591
Score = 36.6 bits (83), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 5/51 (9%)
Query: 4 TAKPHHH---DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
T K H H DK +H +F+ F + + Y + E RLRIFR NLK I+
Sbjct: 293 THKKHSHRALDKADH--LFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIE 341
>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
Length = 283
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/180 (49%), Positives = 110/180 (61%), Gaps = 14/180 (7%)
Query: 29 SYATKEEYHKRLRIFRANLKKIQIRGEGTHLALKLVDCDKVDAGCGGGLMSNAFETIISK 88
+++T E + I R L + + +LVDCDK+D GC GGL NA+E II
Sbjct: 92 AFSTTENIEGQWAIHRNKLVSLSEQ--------ELVDCDKLDDGCEGGLPVNAYEEIIRL 143
Query: 89 LGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINA 148
GGLE EK YPY + C ++ V I S VN+SS+E +MA +L KNGP+++ INA
Sbjct: 144 --GGLESEKKYPYDAEDEKCKFTVGDVAVYINSSVNISSNEADMAAWLYKNGPISIGINA 201
Query: 149 NAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
AMQFY GGVSHP FLC D LDHGVLIVGYG K F+ PYWI+KNSWG WG
Sbjct: 202 FAMQFYMGGVSHPFSFLCSP--DELDHGVLIVGYGTKKGWFSD--SPYWIVKNSWGASWG 257
>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
Length = 615
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 80/150 (53%), Positives = 102/150 (68%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPYK CH N+ V++
Sbjct: 446 ELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYKAKKNQCHFNRTLSHVQVAG 503
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ NGP+++ INANAMQFY GGVSHP K LC NLDHGVL+V
Sbjct: 504 FVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHPWKALC--SKKNLDHGVLVV 561
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 562 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 591
>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
Precursor
gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
Length = 614
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 80/150 (53%), Positives = 102/150 (68%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPYK CH N+ V++
Sbjct: 445 ELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYKAKKNQCHFNRTLSHVQVAG 502
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ NGP+++ INANAMQFY GGVSHP K LC NLDHGVL+V
Sbjct: 503 FVDLPKGNETAMQEWLLANGPISIGINANAMQFYRGGVSHPWKALC--SKKNLDHGVLVV 560
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 561 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 590
Score = 37.7 bits (86), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 8 HHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
H DK++H +F F + + Y + E RLRIFR NLK I+
Sbjct: 299 HRFDKVDH--LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIE 340
>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
Length = 475
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 80/150 (53%), Positives = 102/150 (68%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPYK CH N+ V++
Sbjct: 306 ELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYKAKKNQCHFNRTLSHVQVAG 363
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ NGP+++ INANAMQFY GGVSHP K LC NLDHGVL+V
Sbjct: 364 FVDLPKGNETAMQEWLLANGPISIGINANAMQFYRGGVSHPWKALC--SKKNLDHGVLVV 421
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 422 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 451
Score = 37.4 bits (85), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 8 HHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
H DK++H +F F + + Y + E RLRIFR NLK I+
Sbjct: 160 HRFDKVDH--LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIE 201
>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
Length = 599
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 80/150 (53%), Positives = 103/150 (68%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPY+G + CH N+ V++
Sbjct: 430 ELLDCDSKDSACNGGLMDNAYKAI--KDIGGLEYESEYPYEGKKKQCHFNRTLSHVQVSG 487
Query: 122 YVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ NGP+++ INANAMQFY GGVSHP LC NLDHGVLIV
Sbjct: 488 FVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHPWSPLC--SKKNLDHGVLIV 545
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 546 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 575
Score = 37.7 bits (86), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 27/41 (65%), Gaps = 2/41 (4%)
Query: 11 DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
+K++H +F+ F K+ + YA E+ RLRIFR +LK IQ
Sbjct: 287 NKVDH--LFHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQ 325
>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
Length = 610
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 80/150 (53%), Positives = 103/150 (68%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGL NA++ I + GGLE E +YPYK CH NK V++
Sbjct: 441 ELLDCDTKDSACNGGLPDNAYKAI--QEIGGLEYESEYPYKARKEQCHFNKTLAHVQVTG 498
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ ++ET M ++L+ NGP+++ INANAMQFY GGVSHP K LC+ NLDHGVLIV
Sbjct: 499 FVDLPKNNETAMQEWLIANGPISIGINANAMQFYRGGVSHPWKILCE--KSNLDHGVLIV 556
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 557 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 586
Score = 36.6 bits (83), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 8 HHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
H DK+EH +F+ F K + Y E RLRIFR NL+ I+
Sbjct: 294 HSLDKVEH--LFHKFQIKFERRYVNSVERQMRLRIFRQNLRIIE 335
>gi|91992514|gb|ABE72973.1| cathepsin L [Aedes aegypti]
Length = 265
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 80/150 (53%), Positives = 106/150 (70%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGS-NRACHLNKEEIRVKIQ 120
+L+DCD VD+ C GG M +A++ I K+GG LE E +YPY + CH N E+ V+++
Sbjct: 96 ELLDCDAVDSACQGGYMDDAYKAI-EKIGG-LELESEYPYLAKKQKTCHFNSTEVHVRVK 153
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
V++ +ET MA+YLV NGP+++ +NANAMQFY GG+SHP K LC NLDHGVLIV
Sbjct: 154 GAVDLPKNETAMAQYLVANGPISIGLNANAMQFYRGGISHPWKPLCS--KKNLDHGVLIV 211
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV + +K PYWI+KNSWGP WGE+
Sbjct: 212 GYGVKEYPMFNKTMPYWIVKNSWGPKWGEQ 241
>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
Length = 471
Score = 166 bits (421), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 80/149 (53%), Positives = 101/149 (67%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGL NA+E I K+GG LE E DYPY CH N +I VK++
Sbjct: 301 ELLDCDTSDSACNGGLPDNAYEAI-EKIGG-LELESDYPYHARKDQCHFNSTKIHVKVKG 358
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+V++ +ET +A++L+ NGP+++ INANAMQFY GGVSHP LC NLDHGVLIVG
Sbjct: 359 HVDLPKNETAIAQWLIANGPISIGINANAMQFYRGGVSHPPHILC--SRKNLDHGVLIVG 416
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YGV K PYWI+KNSWG WGE+
Sbjct: 417 YGVSDYPMFKKTLPYWIVKNSWGKKWGEQ 445
>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
kowalevskii]
Length = 352
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/157 (54%), Positives = 105/157 (66%), Gaps = 13/157 (8%)
Query: 55 EGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+GT ++L +LVDCDK+D GC GGL SNA++ I+ GG+ E DYPY G ++ C LN
Sbjct: 180 KGTLVSLSEQELVDCDKLDQGCNGGLPSNAYQEIMRF--GGIMSEDDYPYTGRDQDCKLN 237
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+V I +N+S DE +MA +L NGP+++ INANAMQFYFGGVSHP K C +
Sbjct: 238 ATLNKVYINGSMNISKDEGDMASWLAANGPISIGINANAMQFYFGGVSHPWKIFCN--PE 295
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
NLDHGVLIVGYG T PYWIIKNSWG WG
Sbjct: 296 NLDHGVLIVGYG------TKDGTPYWIIKNSWGRSWG 326
>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
Length = 615
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 79/150 (52%), Positives = 101/150 (67%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPYK CH N+ V++
Sbjct: 446 ELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYKAKKNQCHFNRTLSHVQVAG 503
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ GP+++ INANAMQFY GGVSHP K LC NLDHGVL+V
Sbjct: 504 FVDLPKGNETAMQEWLLTKGPISIGINANAMQFYRGGVSHPWKALC--SKKNLDHGVLVV 561
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 562 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 591
Score = 37.7 bits (86), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 5/51 (9%)
Query: 4 TAKPHHH---DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
T K H H DK++H +F+ F + + Y + E RLRIFR NLK I+
Sbjct: 293 THKKHSHRGLDKVDH--LFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIE 341
>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1454
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 78/150 (52%), Positives = 105/150 (70%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGS-NRACHLNKEEIRVKIQ 120
+L+DCD VD+ C GG M +A++ I K+GG LE E +YPY + CH NK V+++
Sbjct: 1285 ELLDCDTVDSACNGGFMDDAYKAI-EKIGG-LELESEYPYLAKKQKTCHFNKTMAHVRVK 1342
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
V++ +ET +A++LV NGP+++ +NANAMQFY GG+SHP K LC NLDHGVLIV
Sbjct: 1343 GAVDLPKNETAIAQFLVANGPVSIGLNANAMQFYRGGISHPWKPLC--SKKNLDHGVLIV 1400
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV + +K PYWI+KNSWGP WGE+
Sbjct: 1401 GYGVKEYPMFNKTLPYWIVKNSWGPKWGEQ 1430
Score = 37.7 bits (86), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 8 HHHDKLEHVA--MFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
HH+ K E + +F+ F +HN++Y + E+ R RIF+ NL KI+
Sbjct: 1133 HHYSKSEDHSRHLFDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIE 1178
>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 629
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 78/150 (52%), Positives = 103/150 (68%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPY+ + CH N+ V++
Sbjct: 460 ELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYEAKKQQCHFNRTLSHVQVSG 517
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ +GP+++ +NANAMQFY GGVSHP K LC NLDHGVLIV
Sbjct: 518 FVDLPKGNETAMQEWLLTHGPISIGLNANAMQFYRGGVSHPWKALC--SKKNLDHGVLIV 575
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 576 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 605
>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
Length = 627
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 78/150 (52%), Positives = 103/150 (68%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPY+ + CH N+ V++
Sbjct: 458 ELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYEAKKQQCHFNRTLSHVQVSG 515
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ +GP+++ +NANAMQFY GGVSHP K LC NLDHGVLIV
Sbjct: 516 FVDLPKGNETAMQEWLLTHGPISIGLNANAMQFYRGGVSHPWKALC--SKKNLDHGVLIV 573
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 574 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 603
>gi|443696723|gb|ELT97360.1| hypothetical protein CAPTEDRAFT_147978 [Capitella teleta]
Length = 274
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 81/149 (54%), Positives = 100/149 (67%), Gaps = 6/149 (4%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDKVD GC GGL A++ I+ GGLE EKDYPY+G C K E+ V I
Sbjct: 108 ELVDCDKVDLGCNGGLPLQAYKEIMRI--GGLETEKDYPYEGKGDKCVFEKAEVEVNITG 165
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VN+SS+E +M +L KNGP+++ +NANAMQFY GGVSHP FLC +LDHGVLI G
Sbjct: 166 AVNISSNEDDMKAWLWKNGPISIGLNANAMQFYMGGVSHPFSFLCS--PSSLDHGVLITG 223
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG+ + + P+W IKNSWG WGEK
Sbjct: 224 YGIKQGWMSD--SPFWAIKNSWGESWGEK 250
>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 477
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 78/150 (52%), Positives = 103/150 (68%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPY+ + CH N+ V++
Sbjct: 308 ELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYEAKKQQCHFNRTLSHVQVSG 365
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ +GP+++ +NANAMQFY GGVSHP K LC NLDHGVLIV
Sbjct: 366 FVDLPKGNETAMQEWLLTHGPISIGLNANAMQFYRGGVSHPWKALCS--KKNLDHGVLIV 423
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 424 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 453
>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
Length = 605
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 79/150 (52%), Positives = 101/150 (67%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPY + CH NK V++
Sbjct: 436 ELLDCDSTDSACNGGLMDNAYKAI--KDIGGLEYESEYPYLAKKKQCHFNKTLSHVQVAD 493
Query: 122 YVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ NGP+++ +NANAMQFY GGVSHP LC NLDHGVLIV
Sbjct: 494 FVDLPKGNETAMQEWLLANGPISIGLNANAMQFYRGGVSHPWGPLC--SKKNLDHGVLIV 551
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 552 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 581
Score = 37.4 bits (85), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 27/41 (65%), Gaps = 2/41 (4%)
Query: 11 DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
+K++H +F+ F K+ + YA E+ RLRIFR NL+ IQ
Sbjct: 293 NKVDH--LFHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQ 331
>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
Length = 471
Score = 164 bits (414), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 79/149 (53%), Positives = 100/149 (67%), Gaps = 4/149 (2%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGL NA+E I K+GG LE E DYPY CH N +I VK++
Sbjct: 301 ELLDCDTSDSACNGGLPDNAYEAI-EKIGG-LELESDYPYHARKDQCHFNSTKIHVKVKG 358
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+V++ +ET +A++L+ NGP+++ INANAMQFY GGVSHP LC NLDHGVLIVG
Sbjct: 359 HVDLPKNETAIAQWLIANGPISIGINANAMQFYRGGVSHPPHILC--SRKNLDHGVLIVG 416
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y V K PYWI+KNSWG WGE+
Sbjct: 417 YRVSDYPMFKKTLPYWIVKNSWGKKWGEQ 445
>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
Length = 617
Score = 163 bits (413), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 78/150 (52%), Positives = 100/150 (66%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPY CH N+ V++
Sbjct: 448 ELLDCDSTDSACNGGLMDNAYKAI--KDIGGLEYESEYPYAAKKMQCHFNRTMSHVQLSG 505
Query: 122 YVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ NGP+++ +NANAMQFY GGVSHP LC NLDHGVLIV
Sbjct: 506 FVDLPKGNETAMQEWLLSNGPISIGLNANAMQFYRGGVSHPWAPLC--SKKNLDHGVLIV 563
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWGP WGE+
Sbjct: 564 GYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 593
Score = 39.7 bits (91), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 31/48 (64%), Gaps = 4/48 (8%)
Query: 6 KPHHH--DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
K +HH +K+EH +F+ F K+ + YA E+ RLRIFR NL+ I+
Sbjct: 298 KRNHHTLNKIEH--LFHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIE 343
>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 596
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 79/149 (53%), Positives = 104/149 (69%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D+GCGGGL SNA+++I KLGG LE EKDYPY G C + + + +V + +
Sbjct: 306 ELVDCDTLDSGCGGGLPSNAYKSI-EKLGG-LEPEKDYPYVGEGEKCAIKQSDFKVFVNN 363
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE ++A +L +NGP+++ INAN MQFY+GG+SHP K C +LDHGVLIVG
Sbjct: 364 SVALPKDEVKLAAWLAQNGPISIGINANLMQFYWGGISHPWKIFCNP--KSLDHGVLIVG 421
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG T P+WIIKNSWGP WGE+
Sbjct: 422 YG------TENGTPFWIIKNSWGPDWGEE 444
Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 16/24 (66%), Positives = 22/24 (91%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETI 85
+LVDCD +D+GCGGGL SNA+++I
Sbjct: 526 ELVDCDTLDSGCGGGLPSNAYKSI 549
Score = 41.2 bits (95), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 17/26 (65%), Positives = 18/26 (69%)
Query: 209 EKTMPFWIIKNSWGPRWGEQVTKSIY 234
E PFWIIKNSWGP WGE+ IY
Sbjct: 553 ENGTPFWIIKNSWGPDWGEEGYYRIY 578
Score = 37.7 bits (86), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 13/16 (81%), Positives = 15/16 (93%)
Query: 195 PYWIIKNSWGPHWGEK 210
P+WIIKNSWGP WGE+
Sbjct: 557 PFWIIKNSWGPDWGEE 572
>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
Length = 459
Score = 160 bits (404), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 80/148 (54%), Positives = 96/148 (64%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDKVD GC GGL S A++ I+ GGLE E YPY G CH+N+ E V I
Sbjct: 297 ELVDCDKVDDGCEGGLPSQAYKEIMRM--GGLETESAYPYDGRGEECHINRTEFAVYIND 354
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE M +LVK GP+++ INAN +QFY G+SHP KF C+ M L+HGVL+VG
Sbjct: 355 SVELPHDEESMKAWLVKKGPISIGINANPLQFYRHGISHPWKFFCEPYM--LNHGVLLVG 412
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG K K PYWIIKNSWGP WGE
Sbjct: 413 YGSEKNK------PYWIIKNSWGPKWGE 434
>gi|237651947|gb|ACR08662.1| cathepsin F, partial [Drosophila silvestris]
Length = 186
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 78/150 (52%), Positives = 99/150 (66%), Gaps = 5/150 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD D+ C GGLM NA++ I K GGLE E +YPY CH N+ V+I
Sbjct: 17 ELLDCDSTDSACNGGLMDNAYKAI--KDIGGLEYESEYPYAAKKMQCHFNRTLSHVQISG 74
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ +ET M ++L+ NGP+++ +NANAMQFY GGVSHP LC NLDHGVLIV
Sbjct: 75 FVDLPKGNETAMQEWLLSNGPISIGLNANAMQFYRGGVSHPWAPLCS--KKNLDHGVLIV 132
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV HK PYWI+KNSWG WGE+
Sbjct: 133 GYGVSDYPNFHKTLPYWIVKNSWGQRWGEQ 162
>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
Length = 451
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/158 (52%), Positives = 106/158 (67%), Gaps = 13/158 (8%)
Query: 56 GTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
G LAL +LVDCD +D CGGGL SNA+ T I KLGG LE EKDY Y+G C +
Sbjct: 280 GALLALSEQELVDCDTLDQACGGGLPSNAY-TAIEKLGG-LETEKDYSYEGRKERCSFSP 337
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ RV I S V++S DE E+A +L +NGP+++A+NA AMQFY GVSHP + LC
Sbjct: 338 DKARVYINSSVDLSRDEEELATWLAENGPVSIALNAFAMQFYRRGVSHPFRPLCSPWF-- 395
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+DH VL+VGYG H++ P+W IKNSWGP WGE+
Sbjct: 396 IDHAVLLVGYG-HRSGI-----PFWAIKNSWGPDWGEE 427
Score = 43.5 bits (101), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 5/69 (7%)
Query: 2 EATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANL---KKIQ--IRGEG 56
+A A D ++ +++F FL +NKSYA E +RL IF NL +K+Q RG
Sbjct: 137 QAPAPAAQEDSVQLISLFKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRGSA 196
Query: 57 THLALKLVD 65
+ K D
Sbjct: 197 EYGVTKFSD 205
>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
Length = 478
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 79/148 (53%), Positives = 95/148 (64%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL SNA++ II GGLE E YPY G CHL +++I V I
Sbjct: 316 ELVDCDSVDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDGRGETCHLVRKDIAVYING 373
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE EM K+LV GP+++ +NAN +QFY GV HP K C+ M L+HGVLIVG
Sbjct: 374 SVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVG 431
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG K PYWI+KNSWGP WGE
Sbjct: 432 YGKDGRK------PYWIVKNSWGPTWGE 453
>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
Length = 478
Score = 156 bits (395), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 79/148 (53%), Positives = 95/148 (64%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL SNA++ II GGLE E YPY G CHL +++I V I
Sbjct: 316 ELVDCDSVDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDGRGETCHLVRKDIAVYING 373
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE EM K+LV GP+++ +NAN +QFY GV HP K C+ M L+HGVLIVG
Sbjct: 374 SVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVG 431
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG K PYWI+KNSWGP WGE
Sbjct: 432 YGKDGRK------PYWIVKNSWGPTWGE 453
>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 156 bits (394), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 81/150 (54%), Positives = 105/150 (70%), Gaps = 6/150 (4%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD+ D+GC GGLM AFE +I GGLE E+ YPY G C+ K +V+I
Sbjct: 193 ELVDCDQKDSGCNGGLMDQAFEEVIRI--GGLETEQQYPYDGVQETCNFEKSLSKVQIDD 250
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
++++ DE E+A+ L ++GP+++AINA MQFY GG+SHPL FLC D LDHGVL+VG
Sbjct: 251 FMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGGISHPLSFLC--SQDGLDHGVLMVG 308
Query: 182 YGV-HKTKFTHK-IQPYWIIKNSWGPHWGE 209
YGV H T + H+ +PYW IKNSWGP WGE
Sbjct: 309 YGVEHHTTWRHRHPRPYWKIKNSWGPRWGE 338
>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 156 bits (394), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 85/170 (50%), Positives = 113/170 (66%), Gaps = 9/170 (5%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ + G ++L +LVDCD+ D+GC GGLM AFE +I GGLE E+ YPY
Sbjct: 173 GNIEGAWFKATGDLISLSEQELVDCDQKDSGCNGGLMDQAFEEVIRI--GGLETEQQYPY 230
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
G C+ K +V+I ++++ DE E+A+ L ++GP+++AINA MQFY GGVSHP
Sbjct: 231 DGVQETCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGGVSHP 290
Query: 162 LKFLCKGGMDNLDHGVLIVGYGV-HKTKFTHK-IQPYWIIKNSWGPHWGE 209
L FLC D LDHGVL+VGYGV H T + H+ +PYW IKNSWGP WGE
Sbjct: 291 LSFLCS--PDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWGPRWGE 338
>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
Length = 477
Score = 156 bits (394), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 78/148 (52%), Positives = 96/148 (64%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GC GGL SNA++ II GGLE E YPY G CHL +++I V I
Sbjct: 315 ELVDCDSMDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDGRGETCHLVRKDIAVYING 372
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE EM K+LV GP+++ +NAN +QFY GV HP K C+ M L+HGVLIVG
Sbjct: 373 SVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVG 430
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG K PYWI+KNSWGP+WGE
Sbjct: 431 YGKDGRK------PYWIVKNSWGPNWGE 452
>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
Length = 475
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 78/148 (52%), Positives = 95/148 (64%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL SNA++ II GGLE E YPY G CHL +++I V I
Sbjct: 313 ELVDCDGVDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDGKGETCHLVRKDIAVYING 370
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + DE EM K+LV GP+++ +NAN +QFY GV HP K C+ M L+HGVLIVG
Sbjct: 371 SIELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVG 428
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG K PYWI+KNSWGP WGE
Sbjct: 429 YGKDGRK------PYWIVKNSWGPTWGE 450
>gi|357605801|gb|EHJ64782.1| cysteine proteinase inhibitor precursor [Danaus plexippus]
Length = 148
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 76/127 (59%), Positives = 91/127 (71%), Gaps = 3/127 (2%)
Query: 84 TIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMA 143
T I +LGG LE E DYPY+G N C NK +V+I VN+SS+ET+MAK+L +NGP++
Sbjct: 2 TAIEQLGG-LELESDYPYEGENDKCVFNKTMSKVQISGAVNISSNETDMAKWLTQNGPIS 60
Query: 144 VAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSW 203
+ INANAMQFY GG+SHP K LC NLDHGVLIVGYGV HK PYWI+KNSW
Sbjct: 61 IGINANAMQFYMGGISHPWKVLCN--PTNLDHGVLIVGYGVKNYPLFHKRLPYWIVKNSW 118
Query: 204 GPHWGEK 210
G WGE+
Sbjct: 119 GKSWGEQ 125
>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
Length = 353
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/149 (54%), Positives = 101/149 (67%), Gaps = 9/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDK+D GC GGL NA+ +I+++LGG LE EKDYPY N C LNK E V I S
Sbjct: 190 ELVDCDKIDEGCKGGLPLNAYHSIMNRLGG-LETEKDYPYVAKNGKCKLNKSEEVVYINS 248
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V VS++ET++A +LV +GP+A+ IN+ M Y GG++HP C + LDHGVLIVG
Sbjct: 249 SVKVSTNETDLAAWLVAHGPVAIGINSVNMLHYKGGIAHPTNKDCNPKL--LDHGVLIVG 306
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG K+ PYWIIKNSWG WGEK
Sbjct: 307 YGEEKS------TPYWIIKNSWGTDWGEK 329
>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
Length = 496
Score = 153 bits (387), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 80/158 (50%), Positives = 101/158 (63%), Gaps = 7/158 (4%)
Query: 53 RGEGTHLA-LKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+GE L+ +LVDCD +D GC GG SNA++ II GGL E +Y Y G+ C
Sbjct: 320 KGELVSLSEQELVDCDTLDQGCSGGYPSNAYKEIIRL--GGLTTETNYSYDGNQGTCRFK 377
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +V I V++ DETE+A Y+ +NGP+AV INA AM FY G++HP +FLC D
Sbjct: 378 TQNAKVYINDSVSLPEDETEIAAYIRENGPVAVGINAFAMMFYRHGIAHPWRFLCSP--D 435
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGV IVGY V K + K +PYWIIKNSWG HWGE
Sbjct: 436 ALDHGVAIVGYDVEKQ--SKKPKPYWIIKNSWGTHWGE 471
>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
Length = 308
Score = 152 bits (385), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 95/149 (63%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GC GGL SNA+ II GGLE E DYPY G CHL K++I V I
Sbjct: 146 ELVDCDIIDQGCNGGLPSNAYREIIRM--GGLEAESDYPYDGRGEKCHLMKKDIAVYIND 203
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + DE +MA +LV GP+++ +NAN +QFY G++HP + C +LDHGVLIVG
Sbjct: 204 SLQLPHDEEKMAAWLVAKGPISIGLNANPLQFYRHGIAHPWRVFCSP--KHLDHGVLIVG 261
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG K PYWIIKNSWG WGE+
Sbjct: 262 YGSETDK------PYWIIKNSWGTKWGEE 284
>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
Length = 474
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 78/149 (52%), Positives = 99/149 (66%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD CGGGL SNA+E I KLGG LE E DY Y G ++C +++ I S
Sbjct: 312 ELVDCDTVDQACGGGLPSNAYEAI-EKLGG-LETETDYSYTGKKQSCDFTTDKVIAYINS 369
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V +S+DE E+A +L +NGP++VA+NA AMQFY GVSHPLK C M +DH VL+VG
Sbjct: 370 SVELSTDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVG 427
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG + K P+W IKNSWG +GE+
Sbjct: 428 YGERQGK------PFWAIKNSWGEDYGEQ 450
Score = 44.3 bits (103), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 33/50 (66%), Gaps = 3/50 (6%)
Query: 11 DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLK---KIQIRGEGT 57
D +E + F F+ ++N++Y+++EE +RLR+F NLK K+Q +GT
Sbjct: 168 DSVELLGQFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGT 217
>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
Length = 477
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 75/148 (50%), Positives = 95/148 (64%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL SNA++ I+ GGLE E YPY G CH+ +++I V I
Sbjct: 315 ELVDCDSVDQGCNGGLPSNAYKEIMRM--GGLEPEDAYPYDGKGETCHIVRKDIAVYING 372
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE ++ K+LV GP+++ +NAN +QFY GV HP K C+ M L+HGVLIVG
Sbjct: 373 SVELPHDEVKIQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVG 430
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG K PYWI+KNSWGP WGE
Sbjct: 431 YGKDGRK------PYWIVKNSWGPTWGE 452
>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
Length = 567
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 97/149 (65%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D CGGGL SNA+ T I LGG LE EKDY Y+G C + ++ R I S
Sbjct: 405 ELVDCDTLDQACGGGLPSNAY-TAIETLGG-LETEKDYSYEGRKERCSFSPDKARAYINS 462
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V++S DE E+A +L +NGP+++A+NA AMQFY GVSHP + LC +DH VL+VG
Sbjct: 463 SVDLSRDEQEIAAWLAENGPVSIALNAFAMQFYRRGVSHPFRPLCSPWF--IDHAVLLVG 520
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG P+W IKNSWGP WGE+
Sbjct: 521 YGDRSGI------PFWAIKNSWGPDWGEE 543
Score = 43.5 bits (101), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 3 ATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLK---KIQIRGEGT 57
+++ P D +E +++F FL +NKSYA E +RL IF NL+ K+Q +G+
Sbjct: 254 SSSLPRMGDSVELISLFKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQELDQGS 311
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 77/161 (47%), Positives = 101/161 (62%), Gaps = 20/161 (12%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGCGGGLM+NA++ + + GGLE E DYPYKG + C N
Sbjct: 190 QLVDCDHQCDPEEAQACDAGCGGGLMTNAYKYV--EEAGGLELESDYPYKGRDGKCQFNP 247
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ K+ ++ N+ DE ++A YL+K+GP+A+ INA MQ Y GVS P+ C N
Sbjct: 248 NKVAAKVSNFTNIPIDEDQVAAYLIKSGPLAIGINAEFMQTYVAGVSCPI--FCN--KRN 303
Query: 173 LDHGVLIVGYGVH---KTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGY H + +K PYWIIKNSWGP WG+K
Sbjct: 304 LDHGVLLVGYAEHGFAPARLAYK--PYWIIKNSWGPMWGDK 342
>gi|358339356|dbj|GAA47436.1| cathepsin L [Clonorchis sinensis]
Length = 236
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 94/149 (63%), Gaps = 8/149 (5%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCDK D C GG A+E+I+ GGL EKDYPY+ C+L I I
Sbjct: 72 QLLDCDKKDEACNGGFPEWAYESIVKM--GGLMSEKDYPYEAHKETCNLKPNNISAYIND 129
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V +S DE E+A +L +NGP++V +NAN +QFYFGGVSHP LC LDH VL+VG
Sbjct: 130 SVTLSKDEKELAAWLTENGPISVGMNANFLQFYFGGVSHPPHMLCS--EQGLDHAVLLVG 187
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YGV T F + PYWI+KNSWG WGEK
Sbjct: 188 YGV--TSFWQR--PYWIVKNSWGRSWGEK 212
>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
Length = 475
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 97/149 (65%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD D CGGGL SNA+E I KLGG +E E DY Y G ++C +++ I S
Sbjct: 313 ELVDCDTADQACGGGLPSNAYEAI-EKLGG-VETETDYSYTGKKQSCDFTTDKVTAYINS 370
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V +S DE E+A +L +NGP++VA+NA AMQFY GVSHPLK C M +DH VL+VG
Sbjct: 371 SVELSKDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVG 428
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG + K P+W IKNSWG +GE+
Sbjct: 429 YGERQGK------PFWAIKNSWGEDYGEQ 451
Score = 43.1 bits (100), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 37/60 (61%), Gaps = 5/60 (8%)
Query: 3 ATAKP--HHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLK---KIQIRGEGT 57
+T+KP D +E + F F+ ++N++Y+++E+ +RLRIF NLK K+Q GT
Sbjct: 159 STSKPVEETEDFVELLGQFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGT 218
>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
Length = 367
Score = 147 bits (370), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 80/158 (50%), Positives = 98/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + DAGCGGGLM+ AFE + GGL+ EKDYPY G N CH +K
Sbjct: 184 QLVDCDHECDAEQKSECDAGCGGGLMTTAFEYTLK--AGGLQREKDYPYTGRNGQCHFDK 241
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + +Y V DE ++A LVK+GP+AV IN+ MQ Y GGVS PL +C +
Sbjct: 242 SKIAASVTNYSVVGLDEDQIAANLVKHGPLAVGINSAWMQTYIGGVSCPL--VC---FKH 296
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG K +PYWIIKNSWG HWGE
Sbjct: 297 QDHGVLLVGYGSAGFAPIRLKAKPYWIIKNSWGEHWGE 334
>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
gi|1094710|prf||2106314A cathepsin L
Length = 319
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 78/168 (46%), Positives = 103/168 (61%), Gaps = 12/168 (7%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ R G L+L +LVDCD +D GC GGL SNA+E+II GGL E +YPY
Sbjct: 136 GNVESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPY 193
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
N CHL + + V I S VN++ DETE+A +L N ++V +NA +QFY G+SHP
Sbjct: 194 DAKNEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHP 253
Query: 162 LKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
C + LDH VL+VGYGV + K +P+WI+KNSWG WGE
Sbjct: 254 WWIFCSKYL--LDHAVLLVGYGV-----SEKNEPFWIVKNSWGVEWGE 294
>gi|256077197|ref|XP_002574894.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230780|emb|CCD77197.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 419
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 78/167 (46%), Positives = 103/167 (61%), Gaps = 12/167 (7%)
Query: 46 NLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK 102
N++ R G L+L +LVDCD +D GC GGL SNA+E+II GGL E +YPY
Sbjct: 237 NVESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYD 294
Query: 103 GSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPL 162
N CHL + + V I S VN++ DETE+A +L N ++V +NA +QFY G+SHP
Sbjct: 295 AKNEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPW 354
Query: 163 KFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
C + LDH VL+VGYGV + K +P+WI+KNSWG WGE
Sbjct: 355 WIFCSKYL--LDHAVLLVGYGV-----SEKNEPFWIVKNSWGVEWGE 394
>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 456
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 78/167 (46%), Positives = 103/167 (61%), Gaps = 12/167 (7%)
Query: 46 NLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK 102
N++ R G L+L +LVDCD +D GC GGL SNA+E+II GGL E +YPY
Sbjct: 274 NVESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYD 331
Query: 103 GSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPL 162
N CHL + + V I S VN++ DETE+A +L N ++V +NA +QFY G+SHP
Sbjct: 332 AKNEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPW 391
Query: 163 KFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
C + LDH VL+VGYGV + K +P+WI+KNSWG WGE
Sbjct: 392 WIFCSKYL--LDHAVLLVGYGV-----SEKNEPFWIVKNSWGVEWGE 431
>gi|256077193|ref|XP_002574892.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230781|emb|CCD77198.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 457
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 78/167 (46%), Positives = 103/167 (61%), Gaps = 12/167 (7%)
Query: 46 NLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK 102
N++ R G L+L +LVDCD +D GC GGL SNA+E+II GGL E +YPY
Sbjct: 275 NVESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYD 332
Query: 103 GSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPL 162
N CHL + + V I S VN++ DETE+A +L N ++V +NA +QFY G+SHP
Sbjct: 333 AKNEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPW 392
Query: 163 KFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
C + LDH VL+VGYGV + K +P+WI+KNSWG WGE
Sbjct: 393 WIFCSKYL--LDHAVLLVGYGV-----SEKNEPFWIVKNSWGVEWGE 432
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 80/158 (50%), Positives = 101/158 (63%), Gaps = 17/158 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE +I GG++ EKDYPY G + C +K
Sbjct: 185 QLVDCDHVCDPEEYGSCDSGCNGGLMNNAFEYLIGS--GGVQREKDYPYTGRDGTCKFDK 242
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + +Y +S DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C +
Sbjct: 243 SKIAASVSNYSVISLDEEQIAANLVKNGPLAVAINAVYMQTYVGGVSCP--YICG---KH 297
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 298 LDHGVLLVGYGEGAYAPIRFKEKPYWIIKNSWGENWGE 335
>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
Length = 427
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/150 (48%), Positives = 94/150 (62%), Gaps = 9/150 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLNKEEIRVKIQ 120
+L+DCD D C GGL A++ I+ GGL EKDYPY+ ++CHL + I I
Sbjct: 262 QLLDCDTKDEACNGGLPEWAYDEIVKM--GGLMSEKDYPYEAMKEQSCHLRRPNISAYIN 319
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+ SDE ++A +LV+NGP++V +NAN +QFY GG+SHP LC LDH VL+V
Sbjct: 320 GSATLPSDEAKLAAWLVQNGPISVGVNANFLQFYLGGISHPPHMLCSEA--GLDHAVLLV 377
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV T +PYWI+KNSWG WGEK
Sbjct: 378 GYGVS----TFLRRPYWIVKNSWGGGWGEK 403
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/158 (50%), Positives = 100/158 (63%), Gaps = 17/158 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE I+ GG++ EKDYPY G + C +K
Sbjct: 190 QLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQS--GGVQKEKDYPYTGRDGTCKFDK 247
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + +Y VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C +
Sbjct: 248 TKVAATVSNYSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--YICG---KH 302
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 303 LDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWGE 340
>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
Length = 367
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/158 (50%), Positives = 99/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE I+ GG++ EKDYPY G + C +K
Sbjct: 187 QLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQS--GGVQKEKDYPYTGRDGTCKFDK 244
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + +Y VS DE ++A LVKNGP+AV INA MQ Y GGVS P ++C +
Sbjct: 245 TKVAATVSNYSVVSLDEDQIAANLVKNGPLAVGINAVFMQTYIGGVSCP--YICG---KH 299
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVLIVGYG K +PYWIIKNSWG WGE
Sbjct: 300 LDHGVLIVGYGEGAYAPIRFKNKPYWIIKNSWGESWGE 337
>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
Length = 373
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/160 (50%), Positives = 101/160 (63%), Gaps = 18/160 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++R AC +
Sbjct: 192 QLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGTDRGACQFD 249
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I K+ ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 250 KTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YICS---K 304
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 305 RLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGENWGES 344
>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
Length = 368
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/159 (50%), Positives = 100/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCDK---------VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD+ D GC GGLM+NAFE I+ GG+E EKDYPY G +R+ C N
Sbjct: 186 QLVDCDRECDPEEYDACDDGCNGGLMNNAFEYILKT--GGVEREKDYPYTGRDRSPCKFN 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I + ++ VS DE ++A LVKNGP+AV INA MQ Y GVS P FLC G
Sbjct: 244 ESKIVASVSNFSVVSIDEDQIAANLVKNGPLAVGINAVFMQTYTAGVSCP--FLCSG--- 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG + K +PYWI+KNSW +WGE
Sbjct: 299 ELDHGVLLVGYGSAGYSPIRFKEKPYWILKNSWSKYWGE 337
>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
Length = 371
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 100/158 (63%), Gaps = 17/158 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
++VDCD V D+GC GGLM+NAF + + GGLE EKDYPY GS+ C +K
Sbjct: 188 QMVDCDHVCDTSEPDSCDSGCNGGLMTNAFSYL--QKAGGLESEKDYPYTGSDDKCKFDK 245
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +Q++ VS DE ++A L+K+GP+A+ INA MQ Y GGVS P ++C
Sbjct: 246 SKIVASVQNFSVVSVDEGQIAANLIKHGPLAIGINAAYMQTYIGGVSCP--YICG---RT 300
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 301 LDHGVLLVGYGAAGFAPIRLKDKPYWIIKNSWGENWGE 338
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/160 (45%), Positives = 100/160 (62%), Gaps = 20/160 (12%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGC GG M+NA++ + + GGLE E DYPY+G + C +
Sbjct: 190 QLVDCDHQCDREEADACDAGCNGGFMTNAYQYV--EAAGGLELESDYPYEGRDGKCKFDS 247
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ VK+ ++ N+ DE ++A YL+K+GP+A+ INA MQ Y GVS P+ C N
Sbjct: 248 NKVAVKVSNFTNIPVDEDQVAAYLIKSGPLAIGINAEFMQTYIAGVSCPI--FCN--KRN 303
Query: 173 LDHGVLIVGY---GVHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGY G + +K PYWIIKNSWGP+WG+
Sbjct: 304 LDHGVLLVGYAERGFAPARLAYK--PYWIIKNSWGPNWGD 341
>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
Length = 366
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 96/149 (64%), Gaps = 6/149 (4%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD++D GC GGL NA+ II GGLE E+DY Y + C N + V I
Sbjct: 200 QLVDCDRLDDGCEGGLPVNAYLEIIRL--GGLEKEEDYKYTARSGKCKFNHTKSAVYIND 257
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE +A+Y+ +NGP+AV +NA+AM FY G++HP + +C D ++HGV IVG
Sbjct: 258 TVVLPEDEDAIARYVSENGPVAVGLNADAMMFYRSGIAHPSRLMCSP--DGINHGVTIVG 315
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y V ++ F PYWIIKNSWGP+WGEK
Sbjct: 316 YDVKESLFWS--TPYWIIKNSWGPNWGEK 342
>gi|53748485|emb|CAH59428.1| cysteine protease 2 [Plantago major]
Length = 245
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 96/158 (60%), Gaps = 17/158 (10%)
Query: 62 KLVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+LVDCD DAGC GGLM+NAFE + GGL+ EKDYPY G + C +
Sbjct: 65 QLVDCDHECDPEEGADSCDAGCNGGLMNNAFEYALK--AGGLQKEKDYPYTGKDGTCKFD 122
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VS DE ++A LVK GP+AV INA MQ Y GGVS P ++C
Sbjct: 123 KTKIAASVHNFSVVSIDEDQIAANLVKYGPLAVGINAAWMQTYIGGVSCP--YICG---K 177
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+LDHGVLIVGYG K +PYWIIKNSWG WGE
Sbjct: 178 SLDHGVLIVGYGTGYAPVRLKNKPYWIIKNSWGESWGE 215
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 99/158 (62%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGLM++A++ + GGLE E+DYPY G + C NK
Sbjct: 190 QLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKS--GGLEKEEDYPYTGKDGTCSFNK 247
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ VS DE ++A LVKNGP++V INA MQ Y GGVS P ++C N
Sbjct: 248 NKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGGVSCP--YVCS--KRN 303
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYW+IKNSWGP+WGE
Sbjct: 304 LDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGE 341
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 99/158 (62%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGLM++A++ + GGLE E+DYPY G + C NK
Sbjct: 190 QLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKS--GGLEKEEDYPYTGKDGTCSFNK 247
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ VS DE ++A LVKNGP++V INA MQ Y GGVS P ++C N
Sbjct: 248 NKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGGVSCP--YVCS--KRN 303
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYW+IKNSWGP+WGE
Sbjct: 304 LDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGE 341
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 79/157 (50%), Positives = 100/157 (63%), Gaps = 17/157 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE +I GG++ EKDYPY G + C +K
Sbjct: 185 QLVDCDHVCDPEEYGSCDSGCNGGLMNNAFEYLIGS--GGVQREKDYPYTGRDGTCKFDK 242
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + +Y +S DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C +
Sbjct: 243 SKIAASVSNYSVISLDEEQIAANLVKNGPLAVAINAVYMQTYVGGVSCP--YICG---KH 297
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWG 208
LDHGVL+VGYG K +PYWIIKNSWG +WG
Sbjct: 298 LDHGVLLVGYGEGAYAPIRFKEKPYWIIKNSWGENWG 334
>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
Length = 366
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 99/158 (62%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGLM++A++ + GGLE E+DYPY G + C NK
Sbjct: 190 QLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKS--GGLEKEEDYPYTGKDGTCSFNK 247
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ VS DE ++A LVKNGP++V INA MQ Y GGVS P ++C N
Sbjct: 248 NKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGGVSCP--YVCS--KRN 303
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYW+IKNSWGP+WGE
Sbjct: 304 LDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGE 341
>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
Length = 368
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 100/158 (63%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G+++A C +
Sbjct: 189 QLVDCDHECDPEEKDSCDSGCSGGLMNSAFEYTLK--AGGLMREEDYPYTGTDKATCKFD 246
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
++ K+ ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 247 NTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--YICS---K 301
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG + K +PYWIIKNSWG WGE
Sbjct: 302 QLDHGVLLVGYGTGFSPIRMKEKPYWIIKNSWGEKWGE 339
Score = 37.0 bits (84), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 8 HHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKK 49
HH EH F F ++ K+YA+ EE+H R +F+ANL++
Sbjct: 45 HHMLNAEH--HFTLFKKRFGKTYASDEEHHYRFSVFKANLRR 84
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 99/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V DAGC GGLM+NAF+ I+ GG++ EKDYPY G + C +K
Sbjct: 178 QLVDCDHVCDPEEYGACDAGCNGGLMNNAFDYILQ--AGGVQTEKDYPYSGRDETCKFDK 235
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + ++ VS DE ++A LVK+GP+AV INA MQ Y GGVS P ++C N
Sbjct: 236 SKVAATVANFSVVSLDEDQIAANLVKHGPLAVGINAIFMQTYIGGVSCP--YICG---KN 290
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +P+WIIKNSWG WGE
Sbjct: 291 LDHGVLLVGYGAAGYAPIRFKDKPFWIIKNSWGESWGE 328
Score = 40.4 bits (93), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 2 EATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQI 52
+ T HH EH F F K KSYAT+EE+ R +FRANL++ ++
Sbjct: 29 QVTDGDHHMLNAEH--HFTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKL 77
>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
vulgare]
gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 377
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 78/161 (48%), Positives = 101/161 (62%), Gaps = 21/161 (13%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGC GGLM++AF ++ GGLE EKDYPY G + C +K
Sbjct: 193 QLVDCDHECDPSEPDSCDAGCNGGLMTSAFSYLLKS--GGLEREKDYPYTGKDGTCKFDK 250
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +Q+Y V+ DE ++A LVK GP+A+ INA MQ Y GGVS P ++C +
Sbjct: 251 SKIAASVQNYSVVAVDEEQIAANLVKYGPLAIGINAAYMQTYIGGVSCP--YICG---RH 305
Query: 173 LDHGVLIVGYGVH---KTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG ++F K PYWIIKNSWG +WG+K
Sbjct: 306 LDHGVLLVGYGASGFAPSRFKEK--PYWIIKNSWGENWGDK 344
>gi|312095086|ref|XP_003148243.1| hypothetical protein LOAG_12683 [Loa loa]
Length = 195
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 72/148 (48%), Positives = 94/148 (63%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD +D GC GGL NA++ II GGLE EKDYPY G CHL ++EI V I
Sbjct: 33 ELIDCDVIDQGCKGGLPLNAYKEIIRM--GGLESEKDYPYDGHGEKCHLVRKEIAVYIND 90
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + DE ++A ++ K GP+++ +NA +QFY G+SHP K C +++HGVLIVG
Sbjct: 91 SIQLPDDEIKIAAWVAKKGPVSIGVNAGPLQFYRHGISHPWKAFCL--PSHINHGVLIVG 148
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG K PYWIIKNSWG WGE
Sbjct: 149 YGQEANK------PYWIIKNSWGTKWGE 170
>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
Length = 319
Score = 143 bits (361), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 99/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE I+ GG++ EKDYPY G + C +K
Sbjct: 139 QLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQS--GGVQKEKDYPYTGRDGTCKFDK 196
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + +Y V DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C +
Sbjct: 197 TKVAATVSNYSVVCLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--YICG---KH 251
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 252 LDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWGE 289
>gi|56755191|gb|AAW25775.1| SJCHGC00511 protein [Schistosoma japonicum]
Length = 454
Score = 143 bits (361), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 76/168 (45%), Positives = 102/168 (60%), Gaps = 12/168 (7%)
Query: 46 NLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK 102
N++ R G L+L +LVDCD +D GC GGL SNA+E+II GGL E +YPY
Sbjct: 272 NIESQWFRKTGKLLSLSEQQLVDCDSLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPYD 329
Query: 103 GSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPL 162
N CHL + I S VN++ DE+E+A +L + ++V +NA +QFY G+SHP
Sbjct: 330 AKNEKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHPW 389
Query: 163 KFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C + LDH VL+VGYGV + K +P+WI+KNSWG WGEK
Sbjct: 390 WIFCSKYL--LDHAVLLVGYGV-----SEKNEPFWIVKNSWGVEWGEK 430
>gi|393904668|gb|EFO15826.2| hypothetical protein LOAG_12683 [Loa loa]
Length = 202
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 72/148 (48%), Positives = 94/148 (63%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD +D GC GGL NA++ II GGLE EKDYPY G CHL ++EI V I
Sbjct: 40 ELIDCDVIDQGCKGGLPLNAYKEIIRM--GGLESEKDYPYDGHGEKCHLVRKEIAVYIND 97
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + DE ++A ++ K GP+++ +NA +QFY G+SHP K C +++HGVLIVG
Sbjct: 98 SIQLPDDEIKIAAWVAKKGPVSIGVNAGPLQFYRHGISHPWKAFCL--PSHINHGVLIVG 155
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG K PYWIIKNSWG WGE
Sbjct: 156 YGQEANK------PYWIIKNSWGTKWGE 177
>gi|226468424|emb|CAX69889.1| Temporarily Assigned Gene name [Schistosoma japonicum]
Length = 454
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 76/168 (45%), Positives = 102/168 (60%), Gaps = 12/168 (7%)
Query: 46 NLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK 102
N++ R G L+L +LVDCD +D GC GGL SNA+E+II GGL E +YPY
Sbjct: 272 NIESQWFRKTGKLLSLSEQQLVDCDNLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPYD 329
Query: 103 GSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPL 162
N CHL + I S VN++ DE+E+A +L + ++V +NA +QFY G+SHP
Sbjct: 330 AKNEKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHPW 389
Query: 163 KFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C + LDH VL+VGYGV + K +P+WI+KNSWG WGEK
Sbjct: 390 WIFCSKYL--LDHAVLLVGYGV-----SEKNEPFWIVKNSWGVEWGEK 430
>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 101/158 (63%), Gaps = 13/158 (8%)
Query: 56 GTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
GT ++L +LVDCD +D C GGL SNA+E I KLGG LE E DY Y G ++C
Sbjct: 304 GTLVSLSEQELVDCDGLDQACNGGLPSNAYEAI-EKLGG-LETETDYSYIGKKQSCDFAT 361
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+++ I S V +S DE E+A +L +NGP++VA+NA AMQFY GVSHPLK C M
Sbjct: 362 KKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM-- 419
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+DH VL+VGYG K P+W IKNSWG +GE+
Sbjct: 420 IDHAVLMVGYGERKGI------PFWAIKNSWGEDYGEQ 451
Score = 40.0 bits (92), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 34/59 (57%), Gaps = 3/59 (5%)
Query: 2 EATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLK---KIQIRGEGT 57
+ + P + +E + F F+ K+NK Y++++E +RL IF NLK K+Q +G+
Sbjct: 160 DLSINPPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGS 218
>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 101/158 (63%), Gaps = 13/158 (8%)
Query: 56 GTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
GT ++L +LVDCD +D C GGL SNA+E I KLGG LE E DY Y G ++C
Sbjct: 304 GTLVSLSEQELVDCDGLDQACNGGLPSNAYEAI-EKLGG-LETETDYSYIGKKQSCDFAT 361
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+++ I S V +S DE E+A +L +NGP++VA+NA AMQFY GVSHPLK C M
Sbjct: 362 KKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM-- 419
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+DH VL+VGYG K P+W IKNSWG +GE+
Sbjct: 420 IDHAVLMVGYGERKGI------PFWAIKNSWGEDYGEQ 451
Score = 40.0 bits (92), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 34/59 (57%), Gaps = 3/59 (5%)
Query: 2 EATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLK---KIQIRGEGT 57
+ + P + +E + F F+ K+NK Y++++E +RL IF NLK K+Q +G+
Sbjct: 160 DLSINPPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGS 218
>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
Length = 371
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 99/159 (62%), Gaps = 16/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGC GGLM+ A+E ++ GGLE EKDYPY G + C +K
Sbjct: 190 ELVDCDHLCDPKKAGACDAGCNGGLMTTAYEYVLQS--GGLEKEKDYPYTGRDGTCKFDK 247
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ VS DE ++A LVK+GP++V IN+ MQ Y GGVS P ++C N
Sbjct: 248 SKIAAAVANFSVVSLDEDQIAANLVKHGPLSVGINSIFMQTYIGGVSCP--YICS--KKN 303
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVLIVGYG K +PYWIIKNSWG +WGE+
Sbjct: 304 LDHGVLIVGYGAAGYAPIRFKDKPYWIIKNSWGENWGEE 342
Score = 38.9 bits (89), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 24/33 (72%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
F F K+ KSYAT+EE+ RL +F+ANL++ +
Sbjct: 47 FTLFKSKYGKSYATQEEHDYRLSVFKANLRRAK 79
>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
Length = 364
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 78/158 (49%), Positives = 98/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE I+ GG++ E+DYPY G + +C +K
Sbjct: 184 QLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILG--AGGVQREEDYPYAGRDSSCKFDK 241
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + +Y +S DE ++A LVKNGP+AV INA MQ Y GGVS P ++C
Sbjct: 242 SKIAASVANYSVISLDEDQIAANLVKNGPLAVGINAVYMQTYIGGVSCP--YIC---AKR 296
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGV IVGYG K +PYWIIKNSWG WGE
Sbjct: 297 LDHGVQIVGYGESGYAPIRFKEKPYWIIKNSWGESWGE 334
Score = 37.0 bits (84), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 26/37 (70%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE 55
F++F K K+YATKEE+ R +F++NL++ ++ +
Sbjct: 50 FSNFKAKFGKTYATKEEHDHRFGVFKSNLRRARLHAQ 86
>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
Length = 463
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 71/149 (47%), Positives = 94/149 (63%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD C GGL SNA+E I KLGG +E E++Y Y+G C + ++ I S
Sbjct: 301 ELVDCDGVDHACAGGLPSNAYEAI-EKLGG-IETEQEYSYEGHKNTCSFSTSKVSAYINS 358
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE E+A +L +NGP+++A+NA AMQFY G+SHP + LC M +DH VL+VG
Sbjct: 359 SVEIPKDENEIAAWLAQNGPISIALNAFAMQFYRKGISHPFRILCNPWM--IDHAVLLVG 416
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG P+W IKNSWG WGE+
Sbjct: 417 YGERNGT------PFWAIKNSWGTDWGEQ 439
Score = 41.6 bits (96), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 29/47 (61%)
Query: 9 HHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE 55
+ L+ + +F F+ +NK Y+ +EE +RL+IF NLKK Q+ E
Sbjct: 156 EDEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQE 202
>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
Length = 490
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 72/146 (49%), Positives = 93/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD GC GGL SNA+ I K GGLE E+DY Y+G + C N E+ +V I V
Sbjct: 331 DCDKVDKGCMGGLPSNAYSAI--KTLGGLETEEDYSYRGHLQTCSFNAEKAKVYINDSVE 388
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L + GP++VAINA MQFY G+SHPL+ LC + +DH VL+VGYG
Sbjct: 389 LSQNEQKLAAWLAEKGPISVAINAFGMQFYRHGISHPLRPLCSPWL--IDHAVLLVGYG- 445
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 446 -----NRSATPFWAIKNSWGTDWGEE 466
>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 377
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/159 (47%), Positives = 100/159 (62%), Gaps = 17/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+ VDCD DAGC GGLM++AF ++ GGLE EKDYPY G + C +K
Sbjct: 193 QFVDCDHECDPEEPDSCDAGCNGGLMTSAFSYLLKS--GGLEREKDYPYTGRDGTCKFDK 250
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +Q++ VS DE ++A LVK+GP+A+ INA MQ Y GGVS P ++C +
Sbjct: 251 SKIVASVQNFSVVSVDEEQIAANLVKHGPLAIGINAAYMQTYIGGVSCP--YICG---RS 305
Query: 173 LDHGVLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG + K +PYW+IKNSWG +WGEK
Sbjct: 306 LDHGVLLVGYGASGFAPSRLKNKPYWVIKNSWGENWGEK 344
>gi|21218381|gb|AAM44058.1|AF510740_1 cathepsin L1 [Schistosoma japonicum]
Length = 317
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/169 (44%), Positives = 102/169 (60%), Gaps = 12/169 (7%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ R G L+L +LVDCD +D GC GGL SNA+E+II GGL E +YPY
Sbjct: 134 GNIESQWFRKTGKLLSLSEQQLVDCDSLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPY 191
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
N CHL + I S VN++ DE+E+A +L + ++V +NA +QFY G+SHP
Sbjct: 192 DAKNEKCHLKVGNVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHP 251
Query: 162 LKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C + LDH VL+VGYGV + K +P+WI+KNSWG WGEK
Sbjct: 252 WWIFCSKYL--LDHAVLLVGYGV-----SEKNEPFWIVKNSWGVEWGEK 293
>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
Length = 369
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 99/159 (62%), Gaps = 16/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGLM+ A+E ++ GGLE EKDYPY G + C +K
Sbjct: 188 QLVDCDHLCDPEEAGACDSGCNGGLMTTAYEYVLQS--GGLEKEKDYPYTGKDGTCKFDK 245
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ VS DE ++A LVK+GP++V INA MQ Y GGVS P ++C N
Sbjct: 246 SKIAAAVANFSVVSLDEDQIAANLVKHGPLSVGINAVFMQTYIGGVSCP--YICS--KRN 301
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG K +PYWI+KNSWG +WGE+
Sbjct: 302 LDHGVLLVGYGAAGYAPIRFKDKPYWIVKNSWGENWGEE 340
Score = 38.9 bits (89), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 24/33 (72%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
F F K+ KSYAT+EE+ RL +F+ANL++ +
Sbjct: 47 FTLFKSKYGKSYATQEEHDYRLSVFKANLRRAK 79
>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 376
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 98/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
++VDCD DAGC GGLM+ AF + GGLE EKDYPY G AC +K
Sbjct: 192 QMVDCDHECDPSEPRACDAGCNGGLMTTAFSYLAK--AGGLETEKDYPYTGRGGACKFDK 249
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +++++ V+ DE ++A LVK+GP+A+ INA MQ Y GGVS P F+C +
Sbjct: 250 SKIAAQVKNFSTVAVDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCP--FICG---RH 304
Query: 173 LDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 305 LDHGVLLVGYGSAGYAPLRFKEKPYWIIKNSWGENWGE 342
>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
Group]
gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
Length = 373
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 98/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
++VDCD DAGC GGLM+NAF ++ GGLE EKDYPY G + C +K
Sbjct: 190 QMVDCDHECDSSEPDSCDAGCNGGLMTNAFSYLLKS--GGLESEKDYPYTGRDGTCKFDK 247
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +Q++ VS DE ++A LVK+GP+A+ INA MQ Y GGVS P ++C +
Sbjct: 248 SKIVTSVQNFSVVSVDEDQIAANLVKHGPLAIGINAAYMQTYIGGVSCP--YICG---RH 302
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K + YWIIKNSWG +WGE
Sbjct: 303 LDHGVLLVGYGASGFAPIRLKDKAYWIIKNSWGENWGE 340
>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
Length = 458
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 95/149 (63%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDK+D C GGL SNA+ I K GGLE E DY Y G + C+ + E+ +V I
Sbjct: 296 ELVDCDKLDKACLGGLPSNAYSAI--KTLGGLETEDDYGYNGHLQTCNFSAEKAKVYIND 353
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V +S +E ++A +L KNGP+++AINA MQFY G+SHPL+ LC + +DH VL+VG
Sbjct: 354 SVELSQNEQKLAAWLAKNGPISIAINAFGMQFYRHGISHPLRPLCSPWL--IDHAVLLVG 411
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG P+W IKNSWG WGE+
Sbjct: 412 YG------NRSDIPFWAIKNSWGTDWGEE 434
>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
Length = 473
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 94/149 (63%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDK+D CGGGL SNA+E I + GGLE E DY Y G ++C + ++ I S
Sbjct: 311 ELVDCDKLDQACGGGLPSNAYEAIENL--GGLETETDYSYTGHKQSCDFSTGKVAAYINS 368
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE E+A +L +NGP++ A+NA AMQFY GVSHPLK C M +DH VL+VG
Sbjct: 369 SVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVG 426
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+G P+W IKNSWG +GE+
Sbjct: 427 FGQRNGV------PFWAIKNSWGEDYGEQ 449
Score = 45.1 bits (105), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 34/55 (61%), Gaps = 5/55 (9%)
Query: 2 EATAKPHHHDK-----LEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
+ A P H K +E + MF +F+ +N++Y+++EE KRLRIF+ N+K Q
Sbjct: 153 KVAAVPLTHSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQ 207
>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
Length = 473
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 94/149 (63%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDK+D CGGGL SNA+E I + GGLE E DY Y G ++C + ++ I S
Sbjct: 311 ELVDCDKLDQACGGGLPSNAYEAIENL--GGLETETDYSYTGHKQSCDFSTGKVAAYINS 368
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE E+A +L +NGP++ A+NA AMQFY GVSHPLK C M +DH VL+VG
Sbjct: 369 SVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVG 426
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+G P+W IKNSWG +GE+
Sbjct: 427 FGQRNGV------PFWAIKNSWGEDYGEQ 449
Score = 45.1 bits (105), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 34/55 (61%), Gaps = 5/55 (9%)
Query: 2 EATAKPHHHDK-----LEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
+ A P H K +E + MF +F+ +N++Y+++EE KRLRIF+ N+K Q
Sbjct: 153 KVAAVPLTHSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQ 207
>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
Length = 257
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/159 (49%), Positives = 97/159 (61%), Gaps = 17/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + DAGCGGGLM+ AFE + GGL+ EKDYPY G + CH +K
Sbjct: 75 QLVDCDHECDAEQQNECDAGCGGGLMTTAFEYTLK--AGGLQREKDYPYTGRDGKCHFDK 132
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ V DE ++A LVK+GP+AV INA MQ Y GGVS PL +C
Sbjct: 133 SKIAASVANFSVVGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPL--IC---FKR 187
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
DHGVL+VGYG K +PYWIIKNSWG WGE+
Sbjct: 188 QDHGVLLVGYGSAGFAPIRLKEKPYWIIKNSWGESWGEQ 226
>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
Length = 327
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/161 (47%), Positives = 103/161 (63%), Gaps = 20/161 (12%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD+V D GCGGGLM+NA+ +I GGL+ E YPY G + C +
Sbjct: 149 QLVDCDRVCDKTDKASCDDGCGGGLMTNAYRYLIE--AGGLQEESSYPYTGKSGECKFDP 206
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
E+I VK+ ++ +++ DE ++A LV +GP+A+ +NA MQ Y GGVS PL +C G
Sbjct: 207 EKIAVKVANFTSIAVDENQIAANLVHHGPLAIGLNAIFMQTYIGGVSCPL--IC--GKKW 262
Query: 173 LDHGVLIVGYGVHK---TKFTHKIQPYWIIKNSWGPHWGEK 210
L+HGVL+VGYG +F +K PYWIIKNSWG HWGEK
Sbjct: 263 LNHGVLLVGYGARGYSILRFGYK--PYWIIKNSWGNHWGEK 301
Score = 37.4 bits (85), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANL 47
F F+++HNK YAT+EEY R IF NL
Sbjct: 14 FKMFIKEHNKEYATREEYVHRFGIFGKNL 42
>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
Length = 475
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 78/158 (49%), Positives = 98/158 (62%), Gaps = 13/158 (8%)
Query: 56 GTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
GT L+L +LVDCD +D C GGL SNA+E I KLGG LE E DY Y G + C
Sbjct: 304 GTLLSLSEQELVDCDGLDQACRGGLPSNAYEAI-EKLGG-LETESDYSYTGHKQRCDFTT 361
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ I S V + DE E+A +L +NGP++VA+NA AMQFY G+SHPLK C M
Sbjct: 362 GKVAAYINSSVELPKDEKEIAAWLAENGPVSVALNAFAMQFYRKGISHPLKIFCNPWM-- 419
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+DH VL+VGYG K P+W IKNSWG +GE+
Sbjct: 420 IDHAVLLVGYGERKGI------PFWAIKNSWGEDYGEQ 451
Score = 44.3 bits (103), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 37/58 (63%), Gaps = 4/58 (6%)
Query: 3 ATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLK---KIQIRGEGT 57
+T++P + +E + F F+ K+NK Y+++EE +RLRIF NLK K+Q +G+
Sbjct: 162 STSQPLE-ESVELLGQFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGS 218
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 101/159 (63%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++R AC +
Sbjct: 187 QLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGTDRDACKFD 244
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K ++ ++ ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 245 KNKVAARVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YICS---R 299
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG + K +P+WIIKNSWG WGE
Sbjct: 300 RLDHGVLLVGYGSAGYSPVRMKEKPFWIIKNSWGEKWGE 338
>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
Length = 374
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 78/160 (48%), Positives = 100/160 (62%), Gaps = 17/160 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD V D+GC GGLM++AFE + GGLE E+DYPY G++ + C +
Sbjct: 192 QLVDCDHVCDSEDPSSCDSGCNGGLMTSAFEYTLK--AGGLEREEDYPYTGTDHSKCKFD 249
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I V ++ VS DE ++A LV NGP+A+ INA MQ Y GGVS P ++C +
Sbjct: 250 KTKIAVSASNFSVVSLDENQIAANLVTNGPLAIGINAMFMQTYIGGVSCP--YICSKRL- 306
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG K +PYWIIKNSWG WGEK
Sbjct: 307 -LDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGESWGEK 345
>gi|1185457|gb|AAA87848.1| cathepsin L, partial [Schistosoma japonicum]
Length = 224
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 76/169 (44%), Positives = 102/169 (60%), Gaps = 12/169 (7%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ R G L+L +LVDCD +D GC GGL SNA+E+II GGL E +YPY
Sbjct: 41 GNIESQWFRKTGKLLSLSEQQLVDCDSLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPY 98
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
N CHL + I S VN++ DE+E+A +L + ++V +NA +QFY G+SHP
Sbjct: 99 DAKNEKCHLKVGNVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHP 158
Query: 162 LKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C + LDH VL+VGYGV + K +P+WI+KNSWG WGEK
Sbjct: 159 WWIFCSKYL--LDHAVLLVGYGV-----SEKNEPFWIVKNSWGVEWGEK 200
>gi|52546916|gb|AAU81591.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 190
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 98/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++RA C +
Sbjct: 11 QLVDCDHECDPEEKDSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGTDRAKCKFD 68
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
++ K+ ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 69 NTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--YICS---K 123
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 124 RQDHGVLLVGYGSGFAPIRMKEKPYWIIKNSWGEKWGE 161
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 95/158 (60%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+L+DCD D GC GGLM+NA+ ++ GGLE E YPY G C +
Sbjct: 191 QLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLES--GGLEEESSYPYTGERGECKFDP 248
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
E+I VKI ++ N+ +DE ++A YLVKNGP+A+ +NA MQ Y GGVS PL +C
Sbjct: 249 EKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKR 304
Query: 173 LDHGVLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG +PYWIIKNSWG WGE
Sbjct: 305 LNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGE 342
>gi|5679322|gb|AAD46920.1|AF167986_1 putative cysteine proteinase GmPM33 [Glycine max]
Length = 363
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 95/158 (60%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+L+DCD D GC GGLM+NA+ ++ GGLE E YPY G C +
Sbjct: 174 QLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLES--GGLEEESSYPYTGERGECKFDP 231
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
E+I VKI ++ N+ +DE ++A YLVKNGP+A+ +NA MQ Y GGVS PL +C
Sbjct: 232 EKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKR 287
Query: 173 LDHGVLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG +PYWIIKNSWG WGE
Sbjct: 288 LNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGE 325
>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
Length = 367
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 100/162 (61%), Gaps = 21/162 (12%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLN 111
+LVDCD DAGC GGLM+NA++ ++ GGLE E DYPY G SN C N
Sbjct: 192 QLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMKS--GGLETETDYPYTGNSNGKCQFN 249
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+I + ++ VS DE ++A LVK+GP+A+ INA MQ Y GGVS P+ +C
Sbjct: 250 ANKIVASVANFSTVSLDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCPI--ICS--KH 305
Query: 172 NLDHGVLIVGYGVH---KTKFTHKIQPYWIIKNSWGPHWGEK 210
++DHGVL+VGYG +FT K PYWIIKNSWG WGE+
Sbjct: 306 HIDHGVLLVGYGAKGYAPIRFTEK--PYWIIKNSWGATWGEQ 345
>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
Length = 381
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 100/159 (62%), Gaps = 17/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
++VDCD D+GC GGLM+ AF ++ GGL+ EKDYPY G C +K
Sbjct: 196 QMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDK 253
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +++++ +S +E ++A LVK+GP+A+AINA MQ Y GGVS P F+C +
Sbjct: 254 SKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCP--FICG---RH 308
Query: 173 LDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG K +PYWIIKNSWG +WGEK
Sbjct: 309 LDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWGEK 347
>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
Length = 330
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 100/162 (61%), Gaps = 21/162 (12%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLN 111
+LVDCD DAGC GGLM+NA++ ++ GGLE E DYPY G SN C N
Sbjct: 155 QLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMKS--GGLETETDYPYTGNSNGKCQFN 212
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+I + ++ VS DE ++A LVK+GP+A+ INA MQ Y GGVS P+ +C
Sbjct: 213 ANKIVASVANFSTVSLDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCPI--ICS--KH 268
Query: 172 NLDHGVLIVGYGVH---KTKFTHKIQPYWIIKNSWGPHWGEK 210
++DHGVL+VGYG +FT K PYWIIKNSWG WGE+
Sbjct: 269 HIDHGVLLVGYGAKGYAPIRFTEK--PYWIIKNSWGATWGEQ 308
>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 365
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 96/158 (60%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGCGGGLM+ AFE + GGL+ EKDYPY G + CH +K
Sbjct: 183 QLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDK 240
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ + DE ++A LVK+GP+AV INA MQ Y GGVS PL +C
Sbjct: 241 SKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPL--IC---FKR 295
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG H K + YWIIKNSWG +WGE
Sbjct: 296 QDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGE 333
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 78/159 (49%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G +R AC +
Sbjct: 187 QLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGMDRGACKFD 244
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K ++ + ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 245 KNKVAAGVANFSAVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YICS---R 299
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 300 RLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGE 338
>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
Length = 377
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 76/161 (47%), Positives = 100/161 (62%), Gaps = 21/161 (13%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGC GGLM++AF ++ GGLE EKDYPY G + C K
Sbjct: 193 QLVDCDHECDPAEPDSCDAGCNGGLMTSAFSYLLKS--GGLEREKDYPYTGKDGTCKFEK 250
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +Q++ V+ DE ++A LV+ GP+A+ INA MQ Y GGVS P ++C +
Sbjct: 251 SKIAASVQNFSVVAVDEEQIAANLVEYGPLAIGINAAYMQTYIGGVSCP--YICG---RH 305
Query: 173 LDHGVLIVGYGVH---KTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG ++F K PYWIIKNSWG +WG+K
Sbjct: 306 LDHGVLLVGYGASGFAPSRFKEK--PYWIIKNSWGENWGDK 344
>gi|357473427|ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula]
gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 102/159 (64%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD DAGC GGLM++AFE I+ GG+ E+DYPY G++R +C +
Sbjct: 182 QLVDCDHECDPEQPGSCDAGCNGGLMNSAFEYILKS--GGVMREEDYPYSGTDRGSCKFD 239
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K++I + ++ VS DE ++A LVKNGP+A+A+NA MQ Y GGVS P ++C
Sbjct: 240 KKKIAASVANFSVVSLDEDQIAANLVKNGPLAIALNAVYMQTYVGGVSCP--YICS---K 294
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG + K +PYWIIKNSWG WGE
Sbjct: 295 RLDHGVLLVGYGSGAYSPIRLKEKPYWIIKNSWGETWGE 333
>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
Length = 371
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 98/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+ VDCD D+GC GGLM+ AF + + GGLE EKDYPY GS+ C +K
Sbjct: 188 QFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYL--QKAGGLESEKDYPYTGSDGKCKFDK 245
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +Q++ VS DE +++ L+K+GP+A+ INA MQ Y GGVS P ++C +
Sbjct: 246 SKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGVSCP--YICG---RH 300
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 301 LDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGE 338
>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
Length = 597
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 72/146 (49%), Positives = 93/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G +AC+ + E+ +V I V
Sbjct: 438 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQACNFSAEKAKVYINDSVE 495
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G++HPL+ LC + +DH VLIVGYG
Sbjct: 496 LSQNEQKLAAWLAKKGPISVAINAFGMQFYRHGIAHPLRPLCSPWL--IDHAVLIVGYG- 552
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 553 -----NRSEVPFWAIKNSWGTDWGEK 573
>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 96/158 (60%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGCGGGLM+ AFE + GGL+ EKDYPY G + CH +K
Sbjct: 181 QLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDK 238
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ + DE ++A LVK+GP+AV INA MQ Y GGVS PL +C
Sbjct: 239 SKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPL--IC---FKR 293
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG H K + YWIIKNSWG +WGE
Sbjct: 294 QDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGE 331
>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 78/159 (49%), Positives = 99/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++ + C +
Sbjct: 186 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I K+ ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 244 KTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YICS---K 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 299 RLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGE 337
>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 96/158 (60%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGCGGGLM+ AFE + GGL+ EKDYPY G + CH +K
Sbjct: 181 QLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDK 238
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ + DE ++A LVK+GP+AV INA MQ Y GGVS PL +C
Sbjct: 239 SKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPL--IC---FKR 293
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG H K + YWIIKNSWG +WGE
Sbjct: 294 QDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGE 331
>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
Length = 371
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 98/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+ VDCD D+GC GGLM+ AF + + GGLE EKDYPY GS+ C +K
Sbjct: 188 QFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYL--QKAGGLESEKDYPYTGSDGKCKFDK 245
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +Q++ VS DE +++ L+K+GP+A+ INA MQ Y GGVS P ++C +
Sbjct: 246 SKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGVSCP--YICG---RH 300
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 301 LDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGE 338
>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 78/159 (49%), Positives = 99/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++ + C +
Sbjct: 186 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I K+ ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 244 KTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YICS---K 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 299 RLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGE 337
>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
Length = 384
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 100/159 (62%), Gaps = 17/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
++VDCD D+GC GGLM+ AF ++ GGL+ EKDYPY G C +K
Sbjct: 199 QMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDK 256
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +++++ +S +E ++A LVK+GP+A+AINA MQ Y GGVS P F+C +
Sbjct: 257 SKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCP--FICG---RH 311
Query: 173 LDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG K +PYWIIKNSWG +WGEK
Sbjct: 312 LDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWGEK 350
>gi|37903252|gb|AAO64474.1| cathepsin F [Fundulus heteroclitus]
Length = 166
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 75/148 (50%), Positives = 92/148 (62%), Gaps = 10/148 (6%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDCD +D C GGL SNA+E I KLGG LE E DY YKG + C ++ I S
Sbjct: 5 LVDCDGLDQACRGGLPSNAYEAI-EKLGG-LETETDYSYKGHKQTCDFTDRKVAAYINSS 62
Query: 123 VNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGY 182
V +S DE E+A +L + GP++VA+NA AMQFY GVSHPLK C M +DH VL+VGY
Sbjct: 63 VEISKDEKEIAAWLAEKGPISVALNAFAMQFYKKGVSHPLKIFCNPWM--IDHAVLLVGY 120
Query: 183 GVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G P+W IKNSWG +GE+
Sbjct: 121 GERNG------TPFWAIKNSWGEDYGEQ 142
>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
Length = 348
Score = 139 bits (351), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 100/159 (62%), Gaps = 17/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
++VDCD D+GC GGLM+ AF ++ GGL+ EKDYPY G C +K
Sbjct: 163 QMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDK 220
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +++++ +S +E ++A LVK+GP+A+AINA MQ Y GGVS P F+C +
Sbjct: 221 SKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCP--FICG---RH 275
Query: 173 LDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG K +PYWIIKNSWG +WGEK
Sbjct: 276 LDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWGEK 314
>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
Length = 366
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 78/159 (49%), Positives = 99/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++ + C +
Sbjct: 184 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFD 241
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I K+ ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 242 KTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YICS---K 296
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 297 RLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGE 335
>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
Length = 364
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 100/159 (62%), Gaps = 17/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
++VDCD D+GC GGLM+ AF ++ GGL+ EKDYPY G C +K
Sbjct: 179 QMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDK 236
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +++++ +S +E ++A LVK+GP+A+AINA MQ Y GGVS P F+C +
Sbjct: 237 SKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCP--FICG---RH 291
Query: 173 LDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG K +PYWIIKNSWG +WGEK
Sbjct: 292 LDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWGEK 330
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 78/159 (49%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G +R AC +
Sbjct: 187 QLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGMDRGACKFD 244
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K ++ + ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 245 KNKVAAGVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YICS---R 299
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 300 RLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGE 338
>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
Length = 459
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/159 (45%), Positives = 95/159 (59%), Gaps = 13/159 (8%)
Query: 55 EGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+G L+L +LVDCD +D C GGL SNA+ I K GGLE E DY Y G + C
Sbjct: 287 QGDLLSLSEQELVDCDTLDKACMGGLPSNAYSAI--KTLGGLETEDDYSYHGHLQTCSFT 344
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
E+++V I V +S DE ++A +L K GP+++AINA MQFY G+S PL+ LC
Sbjct: 345 AEKVKVYINDSVELSKDEQKLAAWLAKKGPISIAINAFGMQFYRRGISRPLRLLCSPWF- 403
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+DH VL+VGYG P+W IKNSWG WGE+
Sbjct: 404 -IDHAVLLVGYG------NRSDVPFWAIKNSWGTDWGEE 435
>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
Length = 363
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 95/158 (60%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGLM+NAFE I+ GGL+ E DYPY G + C +K
Sbjct: 185 QLVDCDHECDPAEYNSCDSGCNGGLMNNAFEYILK--AGGLQKEADYPYTGRDGTCKFDK 242
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ VS+DE ++A LV NGP+A+ INA MQ Y G VS P ++C
Sbjct: 243 SKIAASVANFSVVSTDEDQIAANLVTNGPLAIGINAAWMQTYIGQVSCP--YICS--KTK 298
Query: 173 LDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
+DHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 299 MDHGVLLVGYGSAGYAPLRFKEKPYWIIKNSWGEDWGE 336
>gi|354496134|ref|XP_003510182.1| PREDICTED: cathepsin F [Cricetulus griseus]
gi|344250261|gb|EGW06365.1| Cathepsin F [Cricetulus griseus]
Length = 462
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 95/146 (65%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GG+ SNA+ I S GGLE E DY YKG +AC+ + ++ +V I V
Sbjct: 303 DCDKMDKACLGGMPSNAYTAIKSL--GGLETEDDYSYKGYVQACNFSAQKAKVYINDSVE 360
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E++MA +L + GP++VAINA MQFY G++HPL+ LC + +DH VL+VGYG
Sbjct: 361 LSKNESKMAAWLAQKGPISVAINAFGMQFYRHGIAHPLRPLCSPWL--IDHAVLLVGYGN 418
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
PYW IKNSWG +WGE+
Sbjct: 419 RSNT------PYWAIKNSWGSNWGEE 438
>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 363
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/159 (47%), Positives = 100/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM++AFE I+ GG+ E+DYPY G++R C +
Sbjct: 182 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYILKS--GGVMREEDYPYSGTDRGNCKFD 239
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ +S DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 240 KAKIAASVANFSVISLDEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCP--YICS---R 294
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +P+WIIKNSWG +WGE
Sbjct: 295 RLDHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWGE 333
>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
Length = 371
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 98/160 (61%), Gaps = 18/160 (11%)
Query: 62 KLVDCDK---------VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+L+DCD D GC GGLM+NAFE I+ GG+ E+DYPY G++R C N
Sbjct: 190 QLLDCDTECDPEEYDACDDGCNGGLMNNAFEYILK--AGGVAQEEDYPYTGTDRGLCRFN 247
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VS DE ++A LVKNGP+AV INA MQ Y GVS P ++C
Sbjct: 248 KTKIAASVANFSVVSLDEDQIAANLVKNGPLAVGINAVFMQTYKSGVSCP--YICSS--- 302
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG + K +PYWIIKNSWG WGE+
Sbjct: 303 TLDHGVLLVGYGSAGYSPIRFKEKPYWIIKNSWGESWGEQ 342
>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
[Glycine max]
Length = 374
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/158 (46%), Positives = 94/158 (59%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+L+DCD D GC GGLM+NA+ ++ GGLE E YPY G C +
Sbjct: 186 QLLDCDNKCEITEKTSCDNGCNGGLMTNAYNYLLES--GGLEEESSYPYTGERGECKFDP 243
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
E+I V+I ++ N+ DE ++A YLVKNGP+A+ +NA MQ Y GGVS PL +C
Sbjct: 244 EKITVRITNFTNIPVDENQIAAYLVKNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKR 299
Query: 173 LDHGVLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG +PYWIIKNSWG WGE
Sbjct: 300 LNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGKKWGE 337
>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 100/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM+NAFE + GGLE EKDYPY G++R AC
Sbjct: 186 QLVDCDHECDPEEYGACDSGCSGGLMNNAFEYALK--AGGLEREKDYPYTGNDRGACKFE 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K ++ + ++ VS DE ++A LVK+GP++VAINA MQ Y GGVS P ++C
Sbjct: 244 KSKVAASVSNFSVVSLDEDQIAANLVKHGPLSVAINAVFMQTYIGGVSCP--YICS---K 298
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
+ DHGVL+VGYG K +P+WIIKNSWG +WGE
Sbjct: 299 HQDHGVLLVGYGAAGYAPIRFKEKPFWIIKNSWGENWGE 337
>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
Length = 460
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 91/146 (62%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK D C GGL SNA+ I + GGLE E DY Y+G + C + E+ +V I V
Sbjct: 301 DCDKTDKACLGGLPSNAYSAI--RTLGGLETEDDYSYRGRLQTCSFSAEKAKVYINDSVE 358
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L KNGP+++AINA MQFY G+SHPL+ LC + +DH VL+VGYG
Sbjct: 359 LSKNEQKLAAWLAKNGPVSIAINAFGMQFYRHGISHPLRPLCSPWL--IDHAVLLVGYG- 415
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 416 -----NRSAIPFWAIKNSWGTDWGEE 436
>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 368
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 78/159 (49%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D GC GGLM++AFE + GGL E+DYPY G++ + C +
Sbjct: 186 QLVDCDHECDPEEAGSCDFGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I K+ ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 244 KTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YICS---K 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 299 RLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGE 337
>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
Length = 473
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 96/158 (60%), Gaps = 13/158 (8%)
Query: 56 GTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
GT L+L +LVDCD +D C GGL SNA+E I KLGG LE E DY Y G + C
Sbjct: 302 GTLLSLSEQELVDCDGLDQACRGGLPSNAYEAI-EKLGG-LESETDYSYTGHKQKCDFTN 359
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ I S V + DE E+A +L +NGP++VA+NA AMQFY GVSHP K C M
Sbjct: 360 RKVAAYINSSVELPKDEREIAAWLAENGPISVALNAFAMQFYKKGVSHPWKIFCNPWM-- 417
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+DH VL+VGYG P+W IKNSWG +GE+
Sbjct: 418 IDHAVLLVGYGERNGI------PFWAIKNSWGEDYGEQ 449
Score = 39.3 bits (90), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 34/59 (57%), Gaps = 3/59 (5%)
Query: 2 EATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLK---KIQIRGEGT 57
E T + ++ + F F+ K+ K Y+++EE +RL+IF+ NLK K+Q +G+
Sbjct: 158 EPTNSQPVEESVQLLGQFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGS 216
>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
Length = 460
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 94/146 (64%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E+DY Y+G +AC+ + ++ +V I V
Sbjct: 301 DCDKLDKACLGGLPSNAYSAI--KNLGGLETEEDYTYQGHMQACNFSAQKAKVYINDSVE 358
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G++HPL+ LC + +DH VL+VGYG
Sbjct: 359 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRRGIAHPLRPLCSPWL--IDHAVLLVGYG- 415
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 416 -----NRSATPFWAIKNSWGADWGEE 436
>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
Length = 410
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 97/159 (61%), Gaps = 11/159 (6%)
Query: 53 RGEGTHLA-LKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
RG+ L+ +LVDCDKVD C GGL SNA+ I K GGLE E DY Y G + C +
Sbjct: 238 RGDLLSLSEQELVDCDKVDKACMGGLPSNAYSAI--KTLGGLETEDDYSYSGHLQTCSFS 295
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
++ +V I V +S +E E+A +L KNGP+++AINA MQFY G+S PL+ LC
Sbjct: 296 AQKAKVYINDSVELSHNEQELAAWLAKNGPISIAINAFGMQFYRHGISRPLRPLCSRWF- 354
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+DH VL+VGYG P+W IKNSWG WGE+
Sbjct: 355 -IDHAVLLVGYG------NRSDVPFWAIKNSWGTDWGEE 386
>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D GC GGLM++AFE I+ GG+E E+ YPY GS+R +C N
Sbjct: 186 QLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILK--AGGVEREETYPYIGSDRGSCKFN 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VS DE ++A +VKNGP+AV INA MQ Y GVS P ++C
Sbjct: 244 KSQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQTYMKGVSCP--YICS---R 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
NLDHGV++VGYG K +PYWIIKNSWG WGE
Sbjct: 299 NLDHGVVLVGYGSAGYAPIRFKEKPYWIIKNSWGESWGE 337
>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/159 (47%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM+NAFE I+ GGLE E+DYPY GS+R C
Sbjct: 185 QLVDCDHECDPTEYGACDSGCNGGLMTNAFEYILK--AGGLEREEDYPYTGSDRGPCKFE 242
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I + ++ VS DE ++A LV+NGP+AV INA MQ Y GGVS P ++C
Sbjct: 243 RAKIAASVNNFSVVSVDEDQIAANLVQNGPLAVGINAVFMQTYIGGVSCP--YICS---K 297
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
DHGV++VGYG K +P+WIIKNSWG +WGE
Sbjct: 298 RQDHGVVLVGYGSAGYAPVRLKDKPFWIIKNSWGENWGE 336
Score = 37.4 bits (85), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
F F K K+YAT+EE+ R ++F+ANL++ Q
Sbjct: 51 FTAFKAKFGKNYATQEEHDYRFKVFKANLRRAQ 83
>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
Length = 473
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/146 (49%), Positives = 93/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GG+ SNA+ I K GGLE E+DY Y G +AC + E+ +V I V
Sbjct: 314 DCDKVDKACMGGVPSNAYSAI--KTLGGLETEEDYSYHGHLQACSFSAEKAKVYINDSVE 371
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L KNGP++VAINA MQFY G++HPL+ LC + +DH VLIVGYG
Sbjct: 372 LSQNEYKLAAWLAKNGPISVAINAFGMQFYRHGIAHPLRPLCSPWL--IDHAVLIVGYG- 428
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 429 -----NRSDVPFWAIKNSWGTDWGEE 449
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 79/159 (49%), Positives = 97/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D GC GGLM+ AFE I+ GGLE E DYPY G++R C N
Sbjct: 180 QLVDCDHECDSSEPDSCDQGCNGGLMTTAFEYILKS--GGLEREADYPYTGTDRGTCKFN 237
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I ++ VS DE ++A LVK+GP+AV INA MQ Y GGVS P ++C
Sbjct: 238 KAKISAVASNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQTYVGGVSCP--YICG---K 292
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
+LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 293 HLDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGENWGE 331
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 73/158 (46%), Positives = 97/158 (61%), Gaps = 16/158 (10%)
Query: 62 KLVDCDK---------VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGLM+ A++ + GGL+ E+DYPY G + +C +
Sbjct: 211 QLVDCDHECDSSEPDVCDSGCNGGLMTTAYQYALK--AGGLQREEDYPYTGIDGSCKFDN 268
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + ++ VS DE ++A LVKNGP+AV INA MQ Y GGVS P ++C N
Sbjct: 269 TKVAAMVANFSTVSIDEDQIAANLVKNGPLAVGINAAFMQTYVGGVSCP--YVCN--KQN 324
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +P+WIIKNSWGP WGE
Sbjct: 325 LDHGVLLVGYGAAGYAPGRLKNKPFWIIKNSWGPDWGE 362
Score = 37.0 bits (84), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 23/37 (62%)
Query: 13 LEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKK 49
L A F HF++K NK Y+ EE+ +R IF+ NL K
Sbjct: 69 LNAEAHFAHFVKKFNKEYSGAEEHARRFSIFKKNLHK 105
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G +R AC +
Sbjct: 193 QLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGMDRGACKFD 250
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K+++ + ++ VS DE ++A LVKNGP+AVA NA MQ Y GGVS P ++C
Sbjct: 251 KDKVAAGVANFSVVSLDEDQIAANLVKNGPLAVATNAVFMQTYIGGVSCP--YICS---R 305
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 306 RLDHGVLLVGYGSAGYAPVRMKEKPYWIIKNSWGESWGE 344
>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 366
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 99/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++ + C +
Sbjct: 184 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFD 241
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I K+ ++ VS DE ++A LVKNGP+AVAINA +Q Y GGVS P ++C
Sbjct: 242 KTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFVQTYIGGVSCP--YICS---K 296
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 297 RLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGE 335
>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 368
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 79/161 (49%), Positives = 100/161 (62%), Gaps = 22/161 (13%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G + A C L+
Sbjct: 186 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGLMREEDYPYTGKDGATCKLD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ +S DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C M
Sbjct: 244 KSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAAYMQTYIGGVSCP--YIC---MR 298
Query: 172 NLDHGVLIVGYGV---HKTKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG +F K PYWIIKNSWG WGE
Sbjct: 299 RLNHGVLLVGYGSAGYAPARFKEK--PYWIIKNSWGETWGE 337
>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
Length = 377
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 76/159 (47%), Positives = 99/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LV+CD D+GC GGLM+ AFE + GGL E+DYPY G++R +C +
Sbjct: 196 QLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLK--AGGLMKEEDYPYTGTDRGSCKFD 253
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ +S DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 254 KTKIAASVSNFSVISLDEDQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--YICS---K 308
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 309 RLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWGE 347
>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
Length = 459
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 71/146 (48%), Positives = 91/146 (62%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y G +AC + E+ +V I V
Sbjct: 300 DCDKVDKACMGGLPSNAYSAI--KTLGGLETEDDYSYHGHLQACSFSAEKAKVYINDSVE 357
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
++ +E ++A +L K GP++VAINA MQFY G+SHPL+ LC + +DH VL+VGYG
Sbjct: 358 LTKNEQKLAAWLAKKGPISVAINAFGMQFYRHGISHPLRPLCSPWL--IDHAVLLVGYG- 414
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 415 -----NRSAVPFWAIKNSWGTDWGEE 435
Score = 41.6 bits (96), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 32/57 (56%), Gaps = 3/57 (5%)
Query: 4 TAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANL---KKIQIRGEGT 57
T P ++ ++F HF+ +N++Y TKEE R+ IF +N+ +KIQ GT
Sbjct: 147 TPLPFQDFSVKMASIFKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGT 203
>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
Length = 460
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 91/146 (62%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK D C GGL SNA+ I + GGLE E DY Y+G + C + E+ +V I V
Sbjct: 301 DCDKTDKACLGGLPSNAYSAI--RTLGGLETEDDYSYRGHLQTCSFSAEKAKVYINDSVE 358
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+SHPL+ LC + +DH VL+VGYG
Sbjct: 359 LSKNEQKLAAWLAKKGPISVAINAFGMQFYRHGISHPLRPLCSPWL--IDHAVLLVGYG- 415
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG +WGE+
Sbjct: 416 -----NRSATPFWAIKNSWGTNWGEE 436
>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 99/159 (62%), Gaps = 16/159 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE I+ GG+ EKDY Y G + +C +K
Sbjct: 186 QLVDCDHVCDPEERGSCDSGCNGGLMNNAFEYILQS--GGVVSEKDYAYTGRDGSCKFDK 243
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + ++ VS DE ++A LVKNGP+AVAINA MQ Y GVS P ++C
Sbjct: 244 SKVVASVSNFSVVSLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCP--YICAKA--R 299
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL++G+G K +PYWIIKNSWG +WGE+
Sbjct: 300 LDHGVLLLGFGQGGYAPIRLKEKPYWIIKNSWGQNWGEE 338
>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 377
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 99/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++R C +
Sbjct: 196 QLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKS--GGLMKEQDYPYTGTDRGTCKFD 253
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VS DE ++A LVKNGP+AVAINA MQ Y GVS P ++C
Sbjct: 254 KSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYIKGVSCP--YICS---K 308
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
+LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 309 HLDHGVLLVGYGSDGYAPIRLKDKPYWIIKNSWGANWGE 347
Score = 42.4 bits (98), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 25/33 (75%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
F+ F +K KSYA+KEE+ R R+F+ANLK+ Q
Sbjct: 60 FSVFKQKFGKSYASKEEHDHRFRVFKANLKRAQ 92
>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
Length = 477
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 91/146 (62%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK D C GGL SNA+ I + GGLE E DY Y+G + C + E+ +V I V
Sbjct: 318 DCDKTDKACLGGLPSNAYSAI--RTLGGLETEDDYSYRGHLQTCSFSAEKAKVYINDSVE 375
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+SHPL+ LC + +DH VL+VGYG
Sbjct: 376 LSKNEQKLAAWLAKKGPISVAINAFGMQFYRHGISHPLRPLCSPWL--IDHAVLLVGYG- 432
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG +WGE+
Sbjct: 433 -----NRSATPFWAIKNSWGTNWGEE 453
>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 371
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 98/159 (61%), Gaps = 17/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD DAGC GGLM++AFE I+ GGLE E+DYPY G++R +C
Sbjct: 192 QLVDCDHECDPEEAGACDAGCNGGLMTSAFEYIVK--AGGLEREEDYPYTGTDRGSCKFQ 249
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+I ++ +S+D ++A LVKNGP+A+ INA MQ Y G+S P ++C
Sbjct: 250 NGKIAASAANFSVISNDADQIAANLVKNGPLAIGINAVFMQTYMKGISCP--YICS--KR 305
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
NLDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 306 NLDHGVLLVGYGAAGFAPIRLKEKPYWIIKNSWGENWGE 344
>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
Length = 366
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 99/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+D+PY G++ + C +
Sbjct: 184 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDHPYTGNDLQVCRFD 241
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I K+ ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 242 KTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YICS---K 296
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 297 RLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGE 335
>gi|322801532|gb|EFZ22193.1| hypothetical protein SINV_14496 [Solenopsis invicta]
Length = 781
Score = 137 bits (345), Expect = 4e-30, Method: Composition-based stats.
Identities = 65/104 (62%), Positives = 79/104 (75%), Gaps = 4/104 (3%)
Query: 80 NAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKN 139
NA+ I KLGG LE E DYPY+ N CH K ++V++ S VNV+SDET+MA++LV+N
Sbjct: 679 NAYRAI-EKLGG-LELESDYPYEAENEKCHFKKNLVKVELTSAVNVTSDETQMAQWLVQN 736
Query: 140 GPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG 183
GP+++ INANAMQFY GGVSHP KFLC NLDHGVLIVGYG
Sbjct: 737 GPISIGINANAMQFYMGGVSHPFKFLCNP--KNLDHGVLIVGYG 778
>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
Length = 360
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 100/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM++AFE I++ GG+ E+DYPY G+N C +
Sbjct: 179 QLVDCDHQCDPEEAGSCDSGCNGGLMNSAFEYILNN--GGVMREEDYPYSGTNGGTCKFD 236
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 237 KAKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCP--YVCS---K 291
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 292 KLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGE 330
>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
Length = 373
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 76/159 (47%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++R C +
Sbjct: 192 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGTDRGTCKFD 249
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
++ K+ ++ VS DE ++A L KNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 250 NTKVAAKVANFSVVSLDEDQIAANLFKNGPLAVAINAVFMQTYIGGVSCP--YICS---K 304
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 305 RLDHGVLLVGYGSAGYAPVRMKDKPYWIIKNSWGENWGE 343
>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
Length = 363
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 71/154 (46%), Positives = 97/154 (62%), Gaps = 12/154 (7%)
Query: 62 KLVDCDKVDA-----GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
+LVDCD+ D GCGGGLM+NA+E ++ GGLE E+ YPY G C + E++
Sbjct: 188 QLVDCDQADKKACDNGCGGGLMTNAYEYLME--AGGLEEERSYPYTGKRGHCKFDPEKVA 245
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
V++ ++ + DE ++A LV++GP+AV +NA MQ Y GGVS PL +C N++HG
Sbjct: 246 VRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPL--ICS--KRNVNHG 301
Query: 177 VLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGE 209
VL+VGYG +PYWIIKNSWG WGE
Sbjct: 302 VLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGE 335
>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
familiaris]
Length = 490
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 93/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I++ GGLE E DY Y+G +AC + ++ RV I +
Sbjct: 331 DCDKVDKACLGGLPSNAYSAIMTL--GGLETEDDYSYQGHLQACSFSAKKARVYINDSME 388
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+SHPL+ LC + +DH VL+VGYG
Sbjct: 389 LSQNEQKLAAWLAKKGPISVAINAFGMQFYRHGISHPLRPLCSPWL--IDHAVLLVGYG- 445
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 446 -----NRSGIPFWAIKNSWGTDWGEE 466
>gi|301784869|ref|XP_002927853.1| PREDICTED: cathepsin F-like [Ailuropoda melanoleuca]
Length = 394
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 91/146 (62%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y+G + C + ++ RV I V
Sbjct: 235 DCDKVDKACLGGLPSNAYSAI--KTLGGLETEDDYSYRGHVQTCSFSSKKARVYINDSVE 292
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++ +L +NGP++VAINA MQFY G+SHPL+ LC + +DH VL+VGYG
Sbjct: 293 LSQNEQKLVAWLAQNGPISVAINAFGMQFYRRGISHPLRPLCSPWL--IDHAVLLVGYG- 349
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 350 -----NRSGIPFWAIKNSWGTDWGEE 370
>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
Length = 359
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 100/160 (62%), Gaps = 19/160 (11%)
Query: 62 KLVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHL 110
+LVDCD D+GC GGLM++AFE I++ GG+ E+DYPY G+N C
Sbjct: 179 QLVDCDHQQCDPEEAGSCDSGCNGGLMNSAFEYILNN--GGVMREEDYPYSGTNGGTCKF 236
Query: 111 NKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGM 170
+K +I + ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 237 DKAKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCP--YVCS--- 291
Query: 171 DNLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 292 KKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGE 331
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 99/159 (62%), Gaps = 16/159 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE ++ GG+ EKDY Y G + +C +K
Sbjct: 178 QLVDCDHVCDPEEAGSCDSGCNGGLMNNAFEYLLQS--GGVVQEKDYAYTGRDGSCKFDK 235
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + ++ VS DE ++A LVKNGP+AVAINA MQ Y GVS P ++C
Sbjct: 236 SKVVASVSNFSVVSLDEEQIAANLVKNGPLAVAINAAWMQAYMSGVSCP--YVCAKA--R 291
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VG+G K +PYWIIKNSWG +WGE+
Sbjct: 292 LDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQ 330
Score = 37.0 bits (84), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQI 52
F F K +KSYATKEE+ R +F+ANL K ++
Sbjct: 43 FTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKL 76
>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
Length = 355
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 94/158 (59%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGC GGLM+ AFE + GGL+ EKDYPY G CH +K
Sbjct: 183 QLVDCDHECDPEQQDSCDAGCSGGLMTTAFEYTLK--AGGLQREKDYPYTGKXGKCHFDK 240
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ + DE ++A LVK+GP+AV INA MQ Y GGVS PL +C
Sbjct: 241 SKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPL--IC---FKR 295
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG H K + YWIIKNSWG +WGE
Sbjct: 296 QDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGE 333
>gi|327358519|gb|AEA51106.1| cathepsin F, partial [Oryzias melastigma]
Length = 255
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 96/158 (60%), Gaps = 13/158 (8%)
Query: 56 GTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
GT L+L +LVDCD +D C GGL SNA+E I KLGG LE E DY Y G + C
Sbjct: 84 GTLLSLSEQELVDCDGLDQACRGGLPSNAYEAI-EKLGG-LETETDYSYTGKKQRCDFTN 141
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ I S V + DE E+A +L +NGP++VA+NA AMQFY GVSHP K C M
Sbjct: 142 RKVAAYINSSVELPKDEKEIAAWLAENGPISVALNAFAMQFYKKGVSHPWKIFCNPWM-- 199
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+DH VL+VGYG P+W IKNSWG +GE+
Sbjct: 200 IDHAVLLVGYGERNGI------PFWAIKNSWGEDYGEQ 231
>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
Length = 484
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 71/146 (48%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G +AC+ + E+ +V I V
Sbjct: 325 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYRGHMQACNFSAEKAKVYINDSVE 382
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 383 LSQNEQKLAAWLAKKGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 439
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 440 -----NRSDIPFWAIKNSWGTDWGEK 460
>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
Length = 460
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 71/146 (48%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G +AC+ + E+ +V I V
Sbjct: 301 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYRGHMQACNFSAEKAKVYINDSVE 358
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 359 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 415
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 416 -----NRSDIPFWAIKNSWGTDWGEK 436
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 99/159 (62%), Gaps = 16/159 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE ++ GG+ EKDY Y G + +C +K
Sbjct: 183 QLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLES--GGVVQEKDYAYTGRDGSCKFDK 240
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + ++ V+ DE ++A LVKNGP+AVAINA MQ Y GVS P ++C
Sbjct: 241 SKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCP--YVC--AKSR 296
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VG+G K +PYWIIKNSWG +WGE+
Sbjct: 297 LDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQ 335
>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 366
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 97/160 (60%), Gaps = 18/160 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM+ AFE + GGL E+DYPY G +R C +
Sbjct: 185 QLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLK--AGGLMREEDYPYTGRDRGPCKFD 242
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VS DE ++A LVKNGP+AV INA MQ Y GGVS P ++C
Sbjct: 243 KSKIAASVANFSVVSLDEEQIAANLVKNGPLAVGINAVFMQTYIGGVSCP--YICG---K 297
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
+LDHGVL+VGYG K +PYWIIKNSWG WGE+
Sbjct: 298 HLDHGVLLVGYGSGAYAPIRFKEKPYWIIKNSWGESWGEE 337
Score = 36.6 bits (83), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 5 AKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
A+ HH EH F+ F K K+YAT+EE+ R RIF+ NL + +
Sbjct: 39 AEDHHLLNAEH--HFSAFKTKFAKTYATQEEHDHRFRIFKNNLLRAK 83
>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
Length = 381
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 71/146 (48%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G +AC+ + E+ +V I V
Sbjct: 222 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYRGHMQACNFSAEKAKVYINDSVE 279
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 280 LSQNEQKLAAWLAKKGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 336
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 337 -----NRSDIPFWAIKNSWGTDWGEK 357
>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
Length = 365
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 99/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D GC GGLM+ AFE I+ K GG + GE DYPY G++ C +K
Sbjct: 186 QLVDCDHECDPEEYGACDRGCNGGLMNTAFEYIL-KAGGVVRGE-DYPYTGTDGHCKFDK 243
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ VS DE ++A LVKNGP+AV INA MQ Y GGVS P F+C +
Sbjct: 244 TKIAASVSNFSTVSIDEDQIAANLVKNGPLAVGINAIFMQSYAGGVSCP--FICS---TS 298
Query: 173 LDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG + K +PYW++KNSWG +WGE
Sbjct: 299 LNHGVLLVGYGSAGYSPIRFKEKPYWLLKNSWGQNWGE 336
>gi|5777611|emb|CAB53397.1| cysteine protease [Medicago sativa]
Length = 209
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 98/159 (61%), Gaps = 16/159 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE I+ GG+ EKDY Y G + +C +K
Sbjct: 31 QLVDCDHVCDPEERNSCDSGCNGGLMNNAFEYILQS--GGVVSEKDYAYTGRDGSCKFDK 88
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ VS DE ++A LVKNGP+AVAINA MQ Y GVS P +C
Sbjct: 89 SKIVASVSNFSVVSLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCP--HICAKA--R 144
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VG+G K +PYWIIKNSWG +WGE+
Sbjct: 145 LDHGVLLVGFGSGGYAPIRLKEKPYWIIKNSWGQNWGEE 183
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 76/160 (47%), Positives = 97/160 (60%), Gaps = 18/160 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM+ AFE + GGL EKDYPY G +R C +
Sbjct: 185 QLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLQ--AGGLMREKDYPYTGRDRGPCKFD 242
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K ++ + ++ VS DE ++A LV+NGP+AV INA MQ Y GGVS P ++C
Sbjct: 243 KSKVAASVANFSVVSLDEEQIAANLVQNGPLAVGINAVFMQTYIGGVSCP--YICG---K 297
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
+LDHGVL+VGYG K +PYWIIKNSWG WGE+
Sbjct: 298 HLDHGVLLVGYGSGAYAPIRFKEKPYWIIKNSWGESWGEE 337
Score = 37.0 bits (84), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 5 AKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
A+ HH EH F+ F K K+YAT+EE+ R RIF+ NL + +
Sbjct: 39 AEDHHLLNAEH--HFSAFKTKFGKTYATQEEHDHRFRIFKNNLLRAK 83
>gi|355681647|gb|AER96812.1| cathepsin F [Mustela putorius furo]
Length = 408
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 90/146 (61%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y+G + C + ++ RV I V
Sbjct: 250 DCDKVDKACLGGLPSNAYSAI--KTLGGLETEDDYSYRGRMQTCGFSPKKARVYINDSVE 307
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E +A +L + GP++VAINA MQFY G+SHPL+ LC + +DH VL+VGYG
Sbjct: 308 LSQNEETLAAWLAEKGPISVAINAFGMQFYRHGISHPLRPLCSPWL--IDHAVLLVGYG- 364
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 365 -----NRSGTPFWAIKNSWGSDWGEE 385
>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
Precursor
gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
Length = 368
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 77/161 (47%), Positives = 99/161 (61%), Gaps = 22/161 (13%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G + + C L+
Sbjct: 186 QLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKT--GGLMKEEDYPYTGKDGKTCKLD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ +S DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 244 KSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCP--YIC---TR 298
Query: 172 NLDHGVLIVGYGV---HKTKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG +F K PYWIIKNSWG WGE
Sbjct: 299 RLNHGVLLVGYGAAGYAPARFKEK--PYWIIKNSWGETWGE 337
>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
Length = 489
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL S+A+ I K GGLE E DY Y+G +AC+ + E+ +V I V
Sbjct: 330 DCDKIDKACMGGLPSSAYSAI--KNLGGLETEDDYSYRGHMQACNFSPEKAKVYINDSVE 387
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 388 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 444
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 445 -----NRSDVPFWAIKNSWGTDWGEK 465
>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 381
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/158 (47%), Positives = 97/158 (61%), Gaps = 18/158 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGC GGLM+ AF + GGLE EKDYPY G N AC +K
Sbjct: 198 QLVDCDHECDPSEPRACDAGCNGGLMTTAFSYLAK--AGGLETEKDYPYTGRNSACKFDK 255
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I +++++ V+ DE ++A LVK+GP+A+ INA MQ Y GGVS P ++C +
Sbjct: 256 SKIAAQVKNFSTVAIDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCP--YICG---RH 310
Query: 173 LDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDH V +VGYG K +PYWIIKNSWG +WGE
Sbjct: 311 LDH-VFLVGYGSAGYAPLRFKEKPYWIIKNSWGENWGE 347
>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
Length = 365
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/159 (47%), Positives = 97/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++A E + GGL E+DYPY G++R C +
Sbjct: 184 QLVDCDHECDPEEPGSCDSGCNGGLMNSALEYTLK--AGGLMREEDYPYSGTDRGTCKFD 241
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I + ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 242 ETKIAASVANFSVVSLDENQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--YICS---K 296
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 297 RLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGE 335
>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
Length = 491
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/146 (48%), Positives = 93/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E+DY Y+G +AC+ + E+ +V I V
Sbjct: 332 DCDKMDKACLGGLPSNAYSAI--KNLGGLETEEDYSYQGQMQACNFSAEKAKVYINDSVE 389
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VLIVGYG
Sbjct: 390 LSHNEQKLAAWLAKKGPISVAINAFGMQFYRHGISRPLRPLCTPWL--IDHAVLIVGYG- 446
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 447 -----NRSDIPFWAIKNSWGTDWGEQ 467
>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
Length = 350
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 94/158 (59%), Gaps = 17/158 (10%)
Query: 62 KLVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+LVDCD DAGC GGLM +AF+ +I GGL E YPY+G + C N
Sbjct: 176 QLVDCDHNCVTYQGQQACDAGCNGGLMWSAFQYVIKT--GGLVTEDSYPYEGVDDTCRFN 233
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K + V I S+ ++ SDE +MA +L NGP+++AINA +Q Y G+S+P + C
Sbjct: 234 KSNVAVTINSWTSIPSDEGKMAAWLAANGPISIAINAEWLQTYTSGISNP--WFCN--PQ 289
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+LDHGVLIVG+G K + YWIIKNSWG WGE
Sbjct: 290 DLDHGVLIVGFGTGSNWLGEK-EDYWIIKNSWGADWGE 326
>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
Length = 363
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/158 (47%), Positives = 95/158 (60%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + DAGC GGLM+ AFE + GGL+ EKDYPY G + CH +K
Sbjct: 181 QLVDCDHECDAEEKSECDAGCNGGLMTTAFEYTLK--AGGLQREKDYPYTGRDGKCHFDK 238
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ + DE ++A LVK+GP+AV INA MQ Y GVS PL +C
Sbjct: 239 SKIAASVANFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYMRGVSCPL--IC---FKR 293
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 294 QDHGVLLVGYGSAGFAPIRLKEKPYWIIKNSWGENWGE 331
>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
Length = 338
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 179 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVE 236
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 237 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 293
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 294 -----NRSDVPFWAIKNSWGTDWGEK 314
>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/161 (47%), Positives = 99/161 (61%), Gaps = 22/161 (13%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G + + C L+
Sbjct: 186 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGLMKEEDYPYTGKDGKTCKLD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ +S DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 244 KSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCP--YIC---TR 298
Query: 172 NLDHGVLIVGYGV---HKTKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG +F K PYWIIKNSWG WGE
Sbjct: 299 RLNHGVLLVGYGSAGYAPARFKEK--PYWIIKNSWGETWGE 337
>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
Length = 379
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 95/158 (60%), Gaps = 15/158 (9%)
Query: 62 KLVDCD--------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE 113
+LVDCD D GC GGLM+ A++ ++ GGLE E YPY G+ C +
Sbjct: 191 QLVDCDNKCDITKTSCDNGCNGGLMTTAYDYLME--AGGLEEETSYPYTGAQGECKFDPN 248
Query: 114 EIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNL 173
++ V++ ++ N+ +DE ++A YLV +GP+A+A+NA MQ Y GGVS PL +C L
Sbjct: 249 KVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAVFMQTYVGGVSCPL--ICS--KRRL 304
Query: 174 DHGVLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGEK 210
+HGVL+VGY + +PYW IKNSWG WGEK
Sbjct: 305 NHGVLLVGYNAEGFSILRLRKKPYWTIKNSWGEQWGEK 342
>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
Length = 368
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/161 (47%), Positives = 99/161 (61%), Gaps = 22/161 (13%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G + + C L+
Sbjct: 186 QLVDCDHECDPEEADSCDSGCNGGLMNSAFEHTLKT--GGLMKEEDYPYTGKDGKTCKLD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ +S DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 244 KSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCP--YIC---TR 298
Query: 172 NLDHGVLIVGYGV---HKTKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG +F K PYWIIKNSWG WGE
Sbjct: 299 RLNHGVLLVGYGAAGYAPARFKEK--PYWIIKNSWGETWGE 337
>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
Length = 517
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 358 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVE 415
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 416 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 472
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 473 -----NRSDVPFWAIKNSWGTDWGEK 493
>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
Length = 484
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 325 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVE 382
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 383 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 439
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 440 -----NRSDVPFWAIKNSWGTDWGEK 460
>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
Length = 490
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 331 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVE 388
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 389 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 445
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 446 -----NRSDVPFWAIKNSWGTDWGEK 466
>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
Length = 484
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 325 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVE 382
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 383 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 439
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 440 -----NRSDVPFWAIKNSWGTDWGEK 460
>gi|34811401|pdb|1M6D|A Chain A, Crystal Structure Of Human Cathepsin F
gi|34811402|pdb|1M6D|B Chain B, Crystal Structure Of Human Cathepsin F
Length = 214
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/146 (48%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C + E+ +V IQ V
Sbjct: 55 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCQFSAEKAKVYIQDSVE 112
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 113 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYGQ 170
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 171 RSDV------PFWAIKNSWGTDWGEK 190
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/159 (45%), Positives = 98/159 (61%), Gaps = 16/159 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE ++ GG+ EKDY Y G + +C +K
Sbjct: 183 QLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQS--GGVVQEKDYAYTGRDGSCKFDK 240
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + ++ VS DE ++A LVKNGP+AV INA MQ Y GVS P ++C
Sbjct: 241 SKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCP--YVC--AKSR 296
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VG+G K +PYWI+KNSWG +WGE+
Sbjct: 297 LDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQ 335
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/159 (45%), Positives = 98/159 (61%), Gaps = 16/159 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE ++ GG+ EKDY Y G + +C +K
Sbjct: 183 QLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQS--GGVVQEKDYAYTGRDGSCKFDK 240
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ + ++ VS DE ++A LVKNGP+AV INA MQ Y GVS P ++C
Sbjct: 241 SKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCP--YVC--AKSR 296
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VG+G K +PYWI+KNSWG +WGE+
Sbjct: 297 LDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQ 335
>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
Length = 392
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 233 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVE 290
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 291 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 347
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 348 -----NRSDVPFWAIKNSWGTDWGEK 368
>gi|144228217|gb|ABO93617.1| papain-like cysteine proteinase [Vitis vinifera]
Length = 161
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D GC GGLM++AFE I+ GG+E E+ YPY GS+R +C N
Sbjct: 1 QLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILK--AGGVEREETYPYIGSDRGSCKFN 58
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VS DE ++A +VKNGP+AV INA MQ Y GVS P ++C
Sbjct: 59 KSQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQTYMKGVSCP--YICS---R 113
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
NLDHGV++VGYG K +PYWIIKNSWG WGE
Sbjct: 114 NLDHGVVLVGYGSAGYAPIRFKEKPYWIIKNSWGESWGE 152
>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 361
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 92/158 (58%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGC GGLM+ AFE + GGL+ EKDYPY G N CH +K
Sbjct: 179 QLVDCDHECDPVEKNDCDAGCNGGLMTTAFEYTLK--AGGLQLEKDYPYTGRNGKCHFDK 236
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
I + ++ V DE ++A L+K+GP+AV INA MQ Y GVS PL +C
Sbjct: 237 SRIAASVSNFSVVGLDEDQIAANLLKHGPLAVGINAAWMQTYVRGVSCPL--IC---FKR 291
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 292 QDHGVLLVGYGSEGFAPIRLKNKPYWIIKNSWGKTWGE 329
>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
Length = 476
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/143 (50%), Positives = 87/143 (60%), Gaps = 10/143 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D C GGL SNA+E I GGLE E DY Y G + C E++ I S
Sbjct: 314 ELVDCDGLDHACRGGLPSNAYEAIEGL--GGLEAENDYTYSGHKQKCSFATEKVAAYINS 371
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + SDE EMA +L +NGP++VA+NA AMQFY GVSHP LC M +DH VL+VG
Sbjct: 372 SVELPSDENEMAAWLAENGPVSVALNAFAMQFYKKGVSHPWMILCNPWM--IDHAVLLVG 429
Query: 182 YGVHKTKFTHKIQPYWIIKNSWG 204
YG P+W IKNSWG
Sbjct: 430 YGERNGI------PFWAIKNSWG 446
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 2 EATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLK---KIQIRGEGT 57
EA+ + + +E + +F F+ K+NK Y+++EE +RL+IF+ NLK KIQ EG+
Sbjct: 161 EASTRQPLKESVELLGLFKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGS 219
>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 367
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/158 (44%), Positives = 97/158 (61%), Gaps = 16/158 (10%)
Query: 62 KLVDCDK---------VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD+ D GCGGGLM+NA+E ++ GGLE E+ YPY G C +
Sbjct: 188 QLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLME--AGGLEEERSYPYTGKRGHCKFDP 245
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
E++ V++ ++ + DE ++A LV++GP+AV +NA MQ Y GGVS PL +C N
Sbjct: 246 EKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPL--ICS--KRN 301
Query: 173 LDHGVLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGE 209
++HGVL+VGYG +PYWIIKNSWG WGE
Sbjct: 302 VNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGE 339
>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
Length = 379
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 91/146 (62%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL S+A+ I K GGLE E DY Y+G +AC + E+ +V I V
Sbjct: 220 DCDKIDKACMGGLPSSAYSAI--KNLGGLETEDDYSYRGHMQACSFSPEKAKVYINDSVE 277
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 278 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 334
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 335 -----NRSDIPFWAIKNSWGTDWGEK 355
>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
Length = 302
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 143 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVE 200
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 201 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 257
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 258 -----NRSDVPFWAIKNSWGTDWGEK 278
>gi|302794759|ref|XP_002979143.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
gi|300152911|gb|EFJ19551.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
Length = 227
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 98/156 (62%), Gaps = 20/156 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--------RACHLNKE 113
+LVDCD++D GC GG M NA+E I +K GLE E+DYPY+ N CH
Sbjct: 60 QLVDCDRMDGGCKGGDMLNAYEYIKAK---GLEAEEDYPYQEENYKEYMFPHHRCHFRPS 116
Query: 114 EIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNL 173
++ I +Y VS DE ++A LVKNGP+++A+NAN + Y GGV+ P +C GG DN+
Sbjct: 117 KVAATIANYSTVSEDEDQIAANLVKNGPLSIALNANYIMDYMGGVACPR--ICPGG-DNM 173
Query: 174 DHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+H VL+VGYG+ K PYWI+KNSW ++GE
Sbjct: 174 NHAVLLVGYGMDGDK------PYWILKNSWSENYGE 203
>gi|1619905|gb|AAB16997.1| thiol protease isoform A, partial [Glycine max]
Length = 318
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/158 (49%), Positives = 97/158 (61%), Gaps = 19/158 (12%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE + S GG++ EKD PY G + C +K
Sbjct: 139 QLVDCDHVCDPEEYGACDSGCNGGLMNNAFEILQS---GGVQKEKDIPYTGRDGTCKFDK 195
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C +
Sbjct: 196 TKV-AATDLIKRVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--YICG---KH 249
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG + K +PYWIIKNSWG WGE
Sbjct: 250 LDHGVLLVGYGEGRYAPIRFKNKPYWIIKNSWGESWGE 287
>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
Length = 548
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 389 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVV 446
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 447 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 503
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 504 -----NRSDVPFWAIKNSWGTDWGEK 524
>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
Length = 313
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 101/159 (63%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL EKDYPY G++ +C L+
Sbjct: 135 QLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT--GGLMREKDYPYTGTDGGSCKLD 192
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I + ++ VS +E ++A L+KNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 193 RSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP--YICS---R 247
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG ++ K +PYWIIKNSWG WGE
Sbjct: 248 RLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGE 286
>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 365
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 99/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE I+ GG+ E+DYPY G++ C +
Sbjct: 184 QLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYILKS--GGVMREEDYPYSGADSGTCKFD 241
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 242 KTKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCP--YVCS---R 296
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG K +P+WIIKNSWG +WGE
Sbjct: 297 RLNHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWGE 335
>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
Length = 361
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 101/159 (63%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL EKDYPY G++ +C L+
Sbjct: 183 QLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT--GGLMREKDYPYTGTDGGSCKLD 240
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I + ++ VS +E ++A L+KNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP--YICS---R 295
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG ++ K +PYWIIKNSWG WGE
Sbjct: 296 RLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGE 334
>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
Length = 373
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 77/161 (47%), Positives = 98/161 (60%), Gaps = 22/161 (13%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G + C L+
Sbjct: 191 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGLMREEDYPYTGKDGPTCKLD 248
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ +S DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 249 KSKIVASVSNFSVISIDEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCP--YIC---AR 303
Query: 172 NLDHGVLIVGYGV---HKTKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG +F K PYWIIKNSWG WGE
Sbjct: 304 RLNHGVLLVGYGSAGYAPARFKEK--PYWIIKNSWGESWGE 342
>gi|40806502|gb|AAR92156.1| putative cysteine protease 3 [Iris x hollandica]
Length = 292
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 76/159 (47%), Positives = 101/159 (63%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
++VDCD D GC GGLM+ AF+ + K+GG LE EKDYPY G++R C +
Sbjct: 109 QMVDCDHECDAEEPDDCDQGCNGGLMNTAFQ-YLQKVGG-LESEKDYPYTGTDRGTCKFD 166
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I+ + ++ VS DE ++A LVK+GP+A+AINA MQ Y GGVS P ++C
Sbjct: 167 ESKIKASVHNFSVVSIDEEQIAANLVKHGPLAIAINAVFMQTYIGGVSCP--YICG---K 221
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
+LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 222 HLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGETWGE 260
>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
Length = 379
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 220 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVV 277
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 278 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 334
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 335 -----NRSDVPFWAIKNSWGTDWGEK 355
>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
Length = 362
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/158 (47%), Positives = 99/158 (62%), Gaps = 15/158 (9%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE ++ GG+ E+DY Y G + +C +K
Sbjct: 182 QLVDCDHVCDPDEYNSCDSGCNGGLMNNAFEYLLQS--GGVVREQDYSYTGRDGSCKFDK 239
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ VS DE ++A LVKNGP+AVAINA MQ Y GVS P ++C
Sbjct: 240 SKIAASVSNFSVVSVDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCP--YIC--AKSR 295
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VG+G K +PYWIIKNSWG +WGE+
Sbjct: 296 LDHGVLLVGFGNGFAPIRLKEKPYWIIKNSWGQNWGEE 333
Score = 38.9 bits (89), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 5/48 (10%)
Query: 8 HHHDKL---EHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQI 52
H D+L EH F F K +KSYATKEE+ R +F++NLKK ++
Sbjct: 35 HEDDQLLNAEH--HFTTFKSKFSKSYATKEEHDYRFGVFKSNLKKAKL 80
>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
Length = 485
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 92/146 (63%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I +
Sbjct: 325 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSME 382
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 383 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 439
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 440 -----NRSDVPFWAIKNSWGTDWGEK 460
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/148 (45%), Positives = 86/148 (58%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD++D GC GGL NAF I + GGLE E YPYK N CHL + I V I
Sbjct: 310 ELIDCDRIDKGCNGGLPINAFREI--QRMGGLEPEDQYPYKARNGTCHLIRSAIAVTIDD 367
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + +ET M ++V+ GP++V I+A + +Y G+ HP + C +DHGVLI G
Sbjct: 368 AVEIPRNETVMKAWIVQRGPLSVGIDAKLLAYYKSGILHPSRSRCPPS--GIDHGVLITG 425
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YGV PYW IKNSWG WGE
Sbjct: 426 YGVENG------LPYWTIKNSWGDQWGE 447
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/148 (45%), Positives = 86/148 (58%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD++D GC GGL NAF I + GGLE E YPYK N CHL + I V I
Sbjct: 275 ELIDCDRIDKGCNGGLPINAFREI--QRMGGLEPEDQYPYKARNGTCHLIRSAIAVTIDD 332
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + +ET M ++V+ GP++V I+A + +Y G+ HP + C +DHGVLI G
Sbjct: 333 AVEIPRNETVMKAWIVQRGPLSVGIDAKLLAYYKSGILHPSRSRCPPS--GIDHGVLITG 390
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YGV PYW IKNSWG WGE
Sbjct: 391 YGVENG------LPYWTIKNSWGDQWGE 412
>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
Length = 377
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LV+CD D+GC GGLM+ AFE + GGL E+DYPY G++R +C +
Sbjct: 196 QLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLK--AGGLMKEEDYPYTGTDRGSCKFD 253
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ +S DE ++A LVK GP+AVAINA MQ Y GGVS P ++C
Sbjct: 254 KTKIAASVSNFSVISLDEDQIAANLVKIGPLAVAINAVFMQTYVGGVSCP--YICS---K 308
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 309 RLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWGE 347
>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
[Cucumis sativus]
Length = 381
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 78/160 (48%), Positives = 98/160 (61%), Gaps = 19/160 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G +R C+ +
Sbjct: 189 QLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLK--AGGLMKEQDYPYAGIDRNTCNFD 246
Query: 112 KEEIRVKIQSYVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGM 170
K +I I S+ V+S DE ++A LVKNGP+A+AINA MQ Y GGVS P F+C
Sbjct: 247 KSKIAASIASFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCP--FICS--- 301
Query: 171 DNLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG + + YWIIKNSWG WGE
Sbjct: 302 KRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGE 341
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 29/44 (65%), Gaps = 2/44 (4%)
Query: 8 HHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
HH EH F+ F + KSYAT+EE+ +R +IF+AN+++ +
Sbjct: 50 HHALGAEH--HFSLFKRRFGKSYATEEEHDRRFKIFKANMRRAE 91
>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
Length = 465
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 80/186 (43%), Positives = 101/186 (54%), Gaps = 25/186 (13%)
Query: 63 LVDCDK----------VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
LVDCD D GC GGL NA+ II GG++ E YPY+G + C
Sbjct: 166 LVDCDHECMEYLGDNVCDQGCNGGLQPNAYTYIIK--NGGIDTEASYPYQGVDGTCSFKA 223
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
I KI ++ VSS+ET+MA YLV NGP+A+A +A QFY GGV F G +
Sbjct: 224 ANIGAKISNWTYVSSNETQMAAYLVANGPLAIAADAVEWQFYLGGV-----FDVPCG-NT 277
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKS 232
LDHG+LIVGY T F HK + YWI+KNSWG WGE+ + N GE V+K+
Sbjct: 278 LDHGILIVGYSAENTIF-HKDKAYWIVKNSWGATWGEQGYIYISRGN------GECVSKT 330
Query: 233 IYSSAP 238
+ P
Sbjct: 331 TSTPTP 336
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 69/149 (46%), Positives = 92/149 (61%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD D+GC GG + II GGLE ++DYPY G + C L++ ++ KI S
Sbjct: 163 QLVDCDVQDSGCDGGYPPTTYGEIIRM--GGLEAQRDYPYVGREQPCKLDESKLLAKINS 220
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + ++E + A Y+ ++GPM+ INA +QFY G+SHP K C+ D L+HGVL VG
Sbjct: 221 SIVLEANEKKQAAYIAEHGPMSSGINAVTLQFYQSGISHPSKSQCQ--PDWLNHGVLSVG 278
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG T PYWIIKNSWG WGEK
Sbjct: 279 YG------TEDGVPYWIIKNSWGTGWGEK 301
>gi|255585361|ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
gi|223526784|gb|EEF29008.1| cysteine protease, putative [Ricinus communis]
Length = 381
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 99/159 (62%), Gaps = 16/159 (10%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD+V D GCGGGLM+NA+ +I GGLE E YPY G C ++
Sbjct: 203 QLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIE--AGGLEDEISYPYTGKPGKCKFDE 260
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++I V++ ++ ++ DE ++A +LV +GP+A+ +NA MQ Y GGVS PL +C G
Sbjct: 261 KKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQTYIGGVSCPL--IC--GKKW 316
Query: 173 LDHGVLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGEK 210
++HGVL+VGYG +PYWIIKNSWG WGE+
Sbjct: 317 INHGVLLVGYGAKGFSILRLGYKPYWIIKNSWGKRWGEE 355
>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
Length = 335
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 98/158 (62%), Gaps = 18/158 (11%)
Query: 62 KLVDCDKV---------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD V D+GC GGLM+NAFE I+ GG++ E+DYPY G +R ++
Sbjct: 156 QLVDCDHVCDPEQYGACDSGCNGGLMNNAFEYILES--GGVQREEDYPYTGRDRGPAID- 212
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
E + ++ VS DE +++ LVKNGP+A+ INA MQ Y GGVS P ++C N
Sbjct: 213 EANAASVSNFSVVSLDEDQISANLVKNGPLAIGINAVFMQTYIGGVSCP--YICG---KN 267
Query: 173 LDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG K +PYWIIKNSWG WGE
Sbjct: 268 LDHGVLLVGYGKAGYAPIRLKEKPYWIIKNSWGESWGE 305
>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 367
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM+NAFE + GGLE E+DYPY G++ C +
Sbjct: 186 QLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALK--AGGLEREEDYPYTGTDGGTCKFD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K ++ + ++ VS DE ++A LVK+GP++VAINA MQ Y GGVS P ++C
Sbjct: 244 KSKVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCP--YICS---K 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG K +P+WIIKNSWG +WGE
Sbjct: 299 RQDHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNWGE 337
>gi|260830531|ref|XP_002610214.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
gi|229295578|gb|EEN66224.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
Length = 274
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 62/119 (52%), Positives = 81/119 (68%), Gaps = 8/119 (6%)
Query: 92 GLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAM 151
GLE EK YPY+ + CH++ +++V I S VN+S DE +MA +L +NGP+++ INA M
Sbjct: 140 GLESEKAYPYEAKDEQCHMDYSKVQVYINSSVNISKDENDMASWLAENGPISIGINAFPM 199
Query: 152 QFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
QFY GG+SHP + C + LDHGVLIVGYG T PYWIIKNSWG +WGE+
Sbjct: 200 QFYMGGISHPWRIFCN--PEELDHGVLIVGYG------TKDETPYWIIKNSWGKNWGEE 250
>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 75/158 (47%), Positives = 94/158 (59%), Gaps = 17/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD DAGCGGG + AFE + GGL+ EKDYPY G + CH +K
Sbjct: 181 QLVDCDHECDPEQQDACDAGCGGGHYATAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDK 238
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+I + ++ + DE ++A LVK+GP+AV INA MQ Y GGVS PL +C
Sbjct: 239 SKICAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPL--IC---FKR 293
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG H K + YWIIKNSWG +WGE
Sbjct: 294 QDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGE 331
>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM+NAFE + GGLE E+DYPY G++ C +
Sbjct: 186 QLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALK--AGGLEREEDYPYTGTDGGTCKFD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K ++ + ++ VS DE ++A LVK+GP++VAINA MQ Y GGVS P ++C
Sbjct: 244 KSKVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCP--YICS---K 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG K +P+WIIKNSWG +WGE
Sbjct: 299 RQDHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNWGE 337
>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
Length = 367
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM+NAFE + GGLE E+DYPY G++ C +
Sbjct: 186 QLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALK--AGGLEREEDYPYTGTDGGTCKFD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K ++ + ++ VS DE ++A LVK+GP++VAINA MQ Y GGVS P ++C
Sbjct: 244 KSKVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCP--YICS---K 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG K +P+WIIKNSWG +WGE
Sbjct: 299 RQDHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNWGE 337
>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
Length = 360
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 99/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGS-NRACHLN 111
+LVDCD D+GC GGLM++AFE I++ GG+ E+DYPY G+ C +
Sbjct: 179 QLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNN--GGVMREEDYPYSGTAGGTCKFD 236
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I + ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 237 QTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCP--YVCS---K 291
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 292 KLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGE 330
>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
Length = 361
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 75/160 (46%), Positives = 99/160 (61%), Gaps = 19/160 (11%)
Query: 62 KLVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGS-NRACHL 110
+LVDCD D+GC GGLM++AFE I++ GG+ E+DYPY G+ C
Sbjct: 179 QLVDCDHEQCDPEEAGSCDSGCKGGLMNSAFEYILNN--GGVMREEDYPYSGTAGGTCKF 236
Query: 111 NKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGM 170
++ +I + ++ VS DE ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 237 DQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCP--YVCS--- 291
Query: 171 DNLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 292 KKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGE 331
>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 387
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 98/160 (61%), Gaps = 19/160 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G +R C+ +
Sbjct: 195 QLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLK--AGGLMKEQDYPYAGIDRNTCNFD 252
Query: 112 KEEIRVKIQSYVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGM 170
K +I I ++ V+S DE ++A LVKNGP+A+AINA MQ Y GGVS P F+C
Sbjct: 253 KSKIAASIANFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCP--FICS--- 307
Query: 171 DNLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVL+VGYG + + YWIIKNSWG WGE
Sbjct: 308 KRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGE 347
Score = 38.5 bits (88), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 29/44 (65%), Gaps = 2/44 (4%)
Query: 8 HHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
HH EH F+ F + KSYAT+EE+ +R +IF+AN+++ +
Sbjct: 50 HHALGAEH--HFSLFKRRFGKSYATEEEHDRRFKIFKANMRRAE 91
>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
Length = 338
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 91/146 (62%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE DY Y+G ++C+ + E+ +V I V
Sbjct: 179 DCDKMDKACMGGLPSNAYSAI--KNLGGLETVDDYSYQGHMQSCNFSAEKAKVYINDSVE 236
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 237 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYG- 293
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGEK
Sbjct: 294 -----NRSDVPFWAIKNSWGTDWGEK 314
>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
Length = 459
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/146 (47%), Positives = 90/146 (61%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y G + C + ++ +V I V
Sbjct: 300 DCDKVDKACLGGLPSNAYLAI--KNLGGLETEDDYSYSGHLQTCSFSAKKAKVYINDSVE 357
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+SHPL+ LC + +DH VL+VGYG
Sbjct: 358 LSQNEQKLAAWLAKKGPISVAINAFGMQFYRRGISHPLRPLCSPWL--IDHAVLLVGYG- 414
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 415 -----NRSGIPFWAIKNSWGTDWGEE 435
>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
Length = 367
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 97/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM+NAFE + GGLE E DYPY G++ C +
Sbjct: 186 QLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALK--AGGLEREADYPYTGTDGGTCKFD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K ++ + ++ VS DE ++A LVK+GP++VAINA MQ Y GGVS P ++C
Sbjct: 244 KSKVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCP--YICS---K 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG K +P+WIIKNSWG +WGE
Sbjct: 299 RQDHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNWGE 337
>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
Length = 368
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 96/159 (60%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM+ AFE + GGLE E+DYPY G++R C +
Sbjct: 186 QLVDCDHECDPEEYGACDSGCNGGLMTTAFEYTLK--AGGLEREEDYPYTGNDRGPCKFD 243
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I + ++ VS DE ++A LVK+GP+AV INA MQ Y GGVS P ++C
Sbjct: 244 RNKIVASVSNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQTYMGGVSCP--YICS---K 298
Query: 172 NLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
DHGVL+VGYG K +P+WIIKNSWG WGE
Sbjct: 299 RQDHGVLLVGYGSAGYAPIRLKDKPFWIIKNSWGESWGE 337
>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 101/159 (63%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC GGLM++AFE + GGL E+DYPY G++ +C L+
Sbjct: 182 QLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGLMREEDYPYTGTDGGSCKLD 239
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I + ++ VS +E ++A LVKNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 240 RSKIVASVSNFSVVSINEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCP--YICS---R 294
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL++GYG ++ K +PYWIIKNSWG WGE
Sbjct: 295 RLNHGVLLMGYGSSGYSQARLKEKPYWIIKNSWGESWGE 333
>gi|77628008|ref|NP_001029282.1| cathepsin F precursor [Rattus norvegicus]
gi|71681040|gb|AAH99780.1| Cathepsin F [Rattus norvegicus]
gi|149062007|gb|EDM12430.1| cathepsin F, isoform CRA_a [Rattus norvegicus]
gi|159895422|gb|ABX09995.1| cathepsin F [Rattus norvegicus]
Length = 462
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 90/146 (61%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G +AC+ + + +V I V
Sbjct: 303 DCDKMDKACMGGLPSNAYTAI--KNLGGLETEDDYGYQGHVQACNFSTQMAKVYINDSVE 360
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S DE ++A +L + GP++VAINA MQFY G++HP + LC +DH VL+VGYG
Sbjct: 361 LSRDENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWF--IDHAVLLVGYGN 418
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
PYW IKNSWG WGE+
Sbjct: 419 RSNI------PYWAIKNSWGRDWGEE 438
>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/159 (44%), Positives = 95/159 (59%), Gaps = 17/159 (10%)
Query: 62 KLVDCDK----------VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+LVDCD+ D GCGGGLM+NA+E ++ GGLE E+ YPY G C +
Sbjct: 188 QLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLME--AGGLEEERSYPYTGKRGHCKFD 245
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
E++ V++ ++ + DE ++A LV+ GP+AV +NA MQ Y GGVS PL +C
Sbjct: 246 PEKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPL--ICS--KR 301
Query: 172 NLDHGVLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGE 209
++HGVL+VGYG +PYWIIKNSWG WGE
Sbjct: 302 KVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGE 340
>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 97/158 (61%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + GC GGLM+NA++ +I GGLE E YPY G + C+
Sbjct: 229 QLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQS--GGLEEESSYPYTGRSGQCNFQS 286
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++I VK+ ++ + DE ++A +LV++GP+AV +NA MQ Y GGVS PL +C G
Sbjct: 287 DKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIGGVSCPL--IC--GKRF 342
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
++HGVL+VGYG + + PYW+IKNSWG WGE
Sbjct: 343 VNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWGE 380
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 85/148 (57%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD +D GC GGL NAF I K GGLE E YPY+ N CHL + +I V I
Sbjct: 299 ELIDCDVIDKGCNGGLPINAFREI--KRMGGLEPEDQYPYEAKNGTCHLVRAQIAVSIDD 356
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + +ET M ++ + GP++V I+A + +Y G+ HP K C ++HGVLI G
Sbjct: 357 AVEIPRNETVMKAWIAQRGPLSVGIDAELLSYYKSGILHPSKSRCPPS--KINHGVLITG 414
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG+ PYW IKNSWG WGE
Sbjct: 415 YGIENN------LPYWTIKNSWGEQWGE 436
>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 97/158 (61%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + GC GGLM+NA++ +I GGLE E YPY G + C+
Sbjct: 229 QLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQS--GGLEEESSYPYTGRSGQCNFQS 286
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++I VK+ ++ + DE ++A +LV++GP+AV +NA MQ Y GGVS PL +C G
Sbjct: 287 DKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIGGVSCPL--IC--GKRF 342
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
++HGVL+VGYG + + PYW+IKNSWG WGE
Sbjct: 343 VNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWGE 380
>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 100/159 (62%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLN 111
+LVDCD D+GC G LM++AFE + GGL EKDYPY G++ +C L+
Sbjct: 183 QLVDCDHECDPEEEGSCDSGCNGRLMNSAFEYTLKT--GGLMREKDYPYTGTDGGSCKLD 240
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ +I + ++ VS +E ++A L+KNGP+AVAINA MQ Y GGVS P ++C
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP--YICS---R 295
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
L+HGVL+VGYG ++ K +PYWIIKNSWG WGE
Sbjct: 296 RLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGE 334
>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
Length = 356
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 95/159 (59%), Gaps = 17/159 (10%)
Query: 62 KLVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+LVDCD +AGC GGLM ++FE II GGL E+ YPY+ + C N
Sbjct: 182 QLVDCDHNCVVWEGEKTCNAGCNGGLMWSSFEHIIKT--GGLVTEESYPYEAVDNRCRFN 239
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
VKI ++ VSS+E EMA +L NGP+A+AINA+ +Q+Y G+ +P + +
Sbjct: 240 VSNAVVKISNWTFVSSNEDEMAAWLANNGPIAIAINADYLQYYRKGILNP----SRCDPE 295
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+HGVLIVGYG K K++ YWI+KNSW WGEK
Sbjct: 296 ELNHGVLIVGYGEEKAA-NGKVEKYWIVKNSWSASWGEK 333
>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
Length = 352
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/167 (44%), Positives = 90/167 (53%), Gaps = 22/167 (13%)
Query: 56 GTHLALK---LVDCDKV----------DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK 102
GT + L LVDCD +AGC GGL NA+ II GG++ E YPY
Sbjct: 167 GTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKN--GGIQTEATYPYT 224
Query: 103 GSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPL 162
+ C N ++ KI S+ V +ET++A YL NGP+A+A +A QFY GGV
Sbjct: 225 AVDGECKFNSAQVGAKISSFTMVPQNETQIASYLFNNGPLAIAADAEEWQFYMGGV---F 281
Query: 163 KFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
F C LDHG+LIVGYG T K PYWIIKNSWG WGE
Sbjct: 282 DFPCG---QTLDHGILIVGYGAQDT-IVGKNTPYWIIKNSWGADWGE 324
>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
Length = 397
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 92/158 (58%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D GC GGLM+ AF +I GG+E E YPY G C N
Sbjct: 219 QLVDCDHMCDLKEKDDCDDGCSGGLMTTAFNYLIE--AGGIEEEVTYPYTGKRGECKFNP 276
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
E++ VK++++ + DE+++A +V NGP+A+ +NA MQ Y GGVS PL +C
Sbjct: 277 EKVAVKVRNFAKIPEDESQIAANVVHNGPLAIGLNAVFMQTYIGGVSCPL--ICD--KKR 332
Query: 173 LDHGVLIVGYGVHKTKFTH-KIQPYWIIKNSWGPHWGE 209
++HGVL+VGYG +PYWIIKNSWG WGE
Sbjct: 333 INHGVLLVGYGSRGFSILRLGYKPYWIIKNSWGKRWGE 370
>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
Length = 364
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/150 (45%), Positives = 90/150 (60%), Gaps = 12/150 (8%)
Query: 60 ALKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
A +L+DCD VD GC GG +A++ I+ GGLE E YPY+ C L +I V I
Sbjct: 202 AQQLLDCDVVDEGCNGGFPLDAYKEIVRM--GGLEPEDKYPYEAKAEQCRLVPSDIAVYI 259
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
V + DE +M +LVK GP+++ I + +QFY GGVS P C+ + ++ HG L+
Sbjct: 260 NGSVELPHDEEKMRAWLVKKGPISIGITVDDIQFYKGGVSRPTT--CR--LSSMIHGALL 315
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYGV K PYWIIKNSWGP+WGE
Sbjct: 316 VGYGVEKN------IPYWIIKNSWGPNWGE 339
>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
Length = 325
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 69/148 (46%), Positives = 91/148 (61%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GCGGG +NA+ I+ GGLE + DYPY G + C+LNKE++ KI
Sbjct: 163 QLVDCDVMDYGCGGGWPTNAYMEIMRM--GGLELQSDYPYVGVQQQCYLNKEKLLAKIDD 220
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + + E E A YL ++GP++ A+NA +QFY G+SHP C +L+H VL VG
Sbjct: 221 LIVLGAYEEEHAAYLAEHGPLSSALNAGYLQFYQSGISHPSYEECSPA--SLNHAVLTVG 278
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
Y T PYWIIKNSWG WGE
Sbjct: 279 YD------TENGVPYWIIKNSWGTGWGE 300
>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
Length = 364
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 76/159 (47%), Positives = 96/159 (60%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLN 111
+LVDCD D+GC GGLM+ AF +K GGL E+DY Y G +R C +
Sbjct: 183 QLVDCDHECDPDLNDACDSGCNGGLMTTAFG--YTKKAGGLVREEDYLYTGRDRGPCKFD 240
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VS DE ++A LVKNGP++V INA MQ Y GGVS P F+C
Sbjct: 241 KSKIAASVSNFSVVSLDEDQIAANLVKNGPLSVGINAVYMQTYIGGVSCP--FICG---K 295
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
+LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 296 HLDHGVLLVGYGAGGYAPIRFKEKPYWIIKNSWGENWGE 334
>gi|4826565|emb|CAB42884.1| cathepsin F [Mus musculus]
Length = 462
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 68/146 (46%), Positives = 89/146 (60%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y+G + C+ + + +V I V
Sbjct: 303 DCDKVDKACLGGLPSNAYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVE 360
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L + GP++VAINA MQFY G++HP + LC +DH VL+VGYG
Sbjct: 361 LSRNENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWF--IDHAVLLVGYGN 418
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
PYW IKNSWG WGE+
Sbjct: 419 RSNI------PYWAIKNSWGSDWGEE 438
>gi|9845246|ref|NP_063914.1| cathepsin F precursor [Mus musculus]
gi|12643321|sp|Q9R013.1|CATF_MOUSE RecName: Full=Cathepsin F; Flags: Precursor
gi|6467384|gb|AAF13147.1|AF136280_1 cathepsin F precursor [Mus musculus]
gi|7141165|gb|AAF37228.1|AF217224_1 cathepsin F [Mus musculus]
gi|26344728|dbj|BAC36013.1| unnamed protein product [Mus musculus]
gi|37589148|gb|AAH58758.1| Cathepsin F [Mus musculus]
gi|148701127|gb|EDL33074.1| cathepsin F, isoform CRA_b [Mus musculus]
Length = 462
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 68/146 (46%), Positives = 89/146 (60%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y+G + C+ + + +V I V
Sbjct: 303 DCDKVDKACLGGLPSNAYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVE 360
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L + GP++VAINA MQFY G++HP + LC +DH VL+VGYG
Sbjct: 361 LSRNENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWF--IDHAVLLVGYGN 418
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
PYW IKNSWG WGE+
Sbjct: 419 RSNI------PYWAIKNSWGSDWGEE 438
>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 377
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 72/159 (45%), Positives = 96/159 (60%), Gaps = 20/159 (12%)
Query: 62 KLVDCDK---------VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D GC GGLM+NA++ ++ GGLE E YPY G+ C +
Sbjct: 189 QLVDCDSQCDITESTTCDNGCMGGLMTNAYKYLLQS--GGLEEESSYPYTGAKGECKFDP 246
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ V+I ++ N+ DE ++A YLVK+GP+AV +NA MQ Y GGVS PL +C
Sbjct: 247 GKVAVRITNFTNIPVDENQIAAYLVKHGPLAVGLNAIFMQTYIGGVSCPL--ICSKKW-- 302
Query: 173 LDHGVLIVGY---GVHKTKFTHKIQPYWIIKNSWGPHWG 208
L+HGVL+VGY G + +K PYWIIKNSWG WG
Sbjct: 303 LNHGVLLVGYRAKGFSILRLGNK--PYWIIKNSWGKRWG 339
Score = 38.5 bits (88), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 22/31 (70%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKK 49
FN F+E + K Y+T+EEY +RL IF N+ +
Sbjct: 53 FNVFMENYGKKYSTREEYLQRLEIFAGNMLR 83
>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
Length = 332
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 68/146 (46%), Positives = 89/146 (60%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y+G + C+ + + +V I V
Sbjct: 173 DCDKVDKACLGGLPSNAYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVE 230
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L + GP++VAINA MQFY G++HP + LC +DH VL+VGYG
Sbjct: 231 LSRNENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWF--IDHAVLLVGYGN 288
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
PYW IKNSWG WGE+
Sbjct: 289 RSNI------PYWAIKNSWGSDWGEE 308
>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
Length = 347
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 98/158 (62%), Gaps = 17/158 (10%)
Query: 62 KLVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+LVDCD D+GC GGLM +AF+ +I GGL+ E YPY+G + C N
Sbjct: 173 QLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQYVIKN--GGLDTEDSYPYEGVDDTCRFN 230
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K + I S+ ++SSDE +MA +L NGP+++AINA +Q+Y G+S P + C
Sbjct: 231 KSNVAATISSWTSISSDENQMAAWLAANGPISIAINAEWLQYYTSGISDP--WFCN--PQ 286
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+LDHGVLIVGYGV K+ + + YWI+KNSWG WGE
Sbjct: 287 DLDHGVLIVGYGVGKSWLGSE-ENYWIVKNSWGSDWGE 323
>gi|11066228|gb|AAG28508.1|AF197480_1 cathepsin F [Mus musculus]
Length = 462
Score = 130 bits (326), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 68/146 (46%), Positives = 89/146 (60%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y+G + C+ + + +V I V
Sbjct: 303 DCDKVDKACLGGLPSNAYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVE 360
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L + GP++VAINA MQFY G++HP + LC +DH VL+VGYG
Sbjct: 361 LSRNENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWF--IDHAVLLVGYGN 418
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
PYW IKNSWG WGE+
Sbjct: 419 RSNI------PYWAIKNSWGSDWGEE 438
>gi|4972585|gb|AAD34707.1|AF071801_1 cysteine proteinase [Paragonimus westermani]
Length = 229
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 69/149 (46%), Positives = 91/149 (61%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GCGGG +NA+ I+ GGLE + DYPY G + C+LNKE++ KI
Sbjct: 67 QLVDCDVMDYGCGGGWPTNAYMEIMRM--GGLELQSDYPYVGVQQQCYLNKEKLLAKIDD 124
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + + E E A YL ++GP++ A+NA +QFY G+SHP C +L+H VL VG
Sbjct: 125 LIVLGAYEEEHAAYLAEHGPLSSALNAGYLQFYQSGISHPSYEECSPA--SLNHAVLTVG 182
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y T PYWIIKNSWG WGE
Sbjct: 183 YD------TENGVPYWIIKNSWGTGWGEN 205
>gi|194352748|emb|CAQ00102.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 129 bits (325), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 94/158 (59%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + ++GC GGLM+NA+ ++S GGL + YPY G+ C ++
Sbjct: 191 QLVDCDHTCDAVAKTECNSGCSGGLMTNAYRYLMSS--GGLMEQAAYPYTGAQGPCRFDR 248
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ V++ ++ V DE +M LV+ GP+AV +NA MQ Y GGVS PL +C M N
Sbjct: 249 GKVAVRVANFTAVPLDEDQMRAALVRGGPLAVGLNAAFMQTYVGGVSCPL--ICPRAMVN 306
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
HGVL+VGYG + +PYW+IKNSWG WGE
Sbjct: 307 --HGVLLVGYGARGFSALRLGYRPYWLIKNSWGAQWGE 342
Score = 36.6 bits (83), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 20/34 (58%)
Query: 10 HDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIF 43
H L A F F+ +H K Y+ EEY +RLR+F
Sbjct: 41 HPGLLPEAQFAAFVRRHGKEYSGPEEYARRLRVF 74
>gi|148701126|gb|EDL33073.1| cathepsin F, isoform CRA_a [Mus musculus]
Length = 417
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 68/146 (46%), Positives = 89/146 (60%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y+G + C+ + + +V I V
Sbjct: 258 DCDKVDKACLGGLPSNAYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVE 315
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L + GP++VAINA MQFY G++HP + LC +DH VL+VGYG
Sbjct: 316 LSRNENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWF--IDHAVLLVGYGN 373
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
PYW IKNSWG WGE+
Sbjct: 374 RSNI------PYWAIKNSWGSDWGEE 393
>gi|371781479|emb|CCA95098.1| putative responsive to dehydration 19, partial [Liriodendron
tulipifera]
Length = 150
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/147 (50%), Positives = 90/147 (61%), Gaps = 8/147 (5%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLNKEEIRVKIQSYV 123
D DAGC GGLM++AF+ + GGLE E+DYPY G + A C K +I +Y
Sbjct: 1 DPSSCDAGCNGGLMTSAFKYTLKS--GGLEKEEDYPYTGKDGATCKFEKSKIAASALNYT 58
Query: 124 NVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG 183
VS DE ++A LVK GP+AV INA MQ Y GGVS P ++C + LDHGVL+VGYG
Sbjct: 59 VVSIDEDQIAANLVKFGPLAVGINAVFMQTYIGGVSCP--YICSKRL--LDHGVLLVGYG 114
Query: 184 VHK-TKFTHKIQPYWIIKNSWGPHWGE 209
K +PYWIIKNSWG WGE
Sbjct: 115 AAGYAPIRFKDKPYWIIKNSWGESWGE 141
>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
Length = 293
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/158 (44%), Positives = 95/158 (60%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+L+DCD D+GC GGL SNA E I+ GG++ EK YPY G C ++
Sbjct: 116 QLLDCDVGCDPDVPNACDSGCNGGLPSNAMEYIVEH--GGIDTEKSYPYVGEKGECKADE 173
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+ ++++ VSSDE +MA LVK+GP+++ INA MQ Y GGV+ P +LC +
Sbjct: 174 GTLGATLKNFSYVSSDEKQMAAALVKHGPLSIGINAAWMQTYIGGVACP--WLCD--SEA 229
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVLIVGYG + +PYWI+KNSW P WGE
Sbjct: 230 LDHGVLIVGYGSSGFAPVRWQQEPYWIVKNSWSPAWGE 267
>gi|2253415|gb|AAB62937.1| stress-induced cysteine proteinase [Lavatera thuringiaca]
Length = 175
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/152 (46%), Positives = 96/152 (63%), Gaps = 14/152 (9%)
Query: 65 DCD-----KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLNKEEIRVK 118
+CD +AGC GGLM++AFE + GGLE E++YPY G +R C +K +I
Sbjct: 1 ECDPQQYGACNAGCSGGLMTSAFEYTLKA--GGLEREEEYPYTGIDRGGCKFDKTKIAAS 58
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+ ++ +S DE ++A +VK+GP+AV INA MQ Y GGVS P ++C +LDHGVL
Sbjct: 59 VSNFSVISVDEDQIAANMVKHGPLAVGINAAFMQTYIGGVSCP--YIC---FRSLDHGVL 113
Query: 179 IVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
+VGYG K +P+WIIKNSWG +WGE
Sbjct: 114 LVGYGAAGYAPVRFKEKPFWIIKNSWGANWGE 145
>gi|402585860|gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]
Length = 242
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 85/148 (57%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD +D GC GGL NAF I K GGLE E YPYK N CHL + +I V I
Sbjct: 80 ELIDCDVIDNGCNGGLPINAFREI--KRMGGLEPEDQYPYKAKNGTCHLVRAQIAVTIDD 137
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + +ET M ++ + GP++V I+A + +Y G+ HP K C ++HGVLI G
Sbjct: 138 AIEIPRNETVMKAWIAQRGPLSVGIDAELLAYYKSGILHPSKSRCP--PSKINHGVLITG 195
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG+ PYW IKNSWG WGE
Sbjct: 196 YGIENG------LPYWTIKNSWGEEWGE 217
>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 272
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 93/158 (58%), Gaps = 16/158 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGL SNA E I+ GG++ EK YPY G C K
Sbjct: 95 QLVDCDVGCDPDVPNACDSGCNGGLPSNAMEYIVEH--GGIDTEKSYPYVGEKGECKAKK 152
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ ++++ VS DE +MA LVK GP+++ INA MQ Y GGV+ P +LC ++
Sbjct: 153 GKLGATLKNFSFVSDDEKQMAAALVKYGPLSIGINAAWMQSYIGGVACP--WLCD--AES 208
Query: 173 LDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVLIVGYG +PYWI+KNSW P WGE
Sbjct: 209 LDHGVLIVGYGSSGFAPVRWAPEPYWIVKNSWSPAWGE 246
>gi|351710879|gb|EHB13798.1| Cathepsin F [Heterocephalus glaber]
Length = 482
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/146 (45%), Positives = 90/146 (61%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GG SNA+ I S GGLE E DY Y+G +AC+ + ++ +V I V
Sbjct: 323 DCDKMDKACMGGFPSNAYLAIKSL--GGLETEDDYSYQGHMKACNFSAKKAKVYINDSVE 380
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L GP++VAINA MQFY G++HPL+ LC +DH +L+VGYG
Sbjct: 381 LSKNEQKLAAWLAVKGPISVAINAFGMQFYRHGIAHPLRPLCSPWF--IDHAMLVVGYGN 438
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 439 RSN------VPFWAIKNSWGTDWGEE 458
>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
Length = 373
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLN 111
+LVDCD D+GC GGLM+NAFE + GGL E+DYPY G N AC +
Sbjct: 191 QLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALK--AGGLMKEEDYPYTGRDNTACKFD 248
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VSSDE ++A LVK+GP+A+AINA MQ Y GGVS P ++C
Sbjct: 249 KSKIAASVSNFSVVSSDEDQIAANLVKHGPLAIAINAMWMQTYIGGVSCP--YVCS---K 303
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
+ DHGVL+VG+G K +PYWIIKNSWG WGE
Sbjct: 304 SQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGE 342
Score = 38.9 bits (89), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 25/33 (75%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
F+ F K+ K+YAT+EE+ R R+F+ANL++ +
Sbjct: 55 FSLFKSKYEKTYATQEEHDHRFRVFKANLRRAR 87
>gi|348564702|ref|XP_003468143.1| PREDICTED: cathepsin F-like [Cavia porcellus]
Length = 462
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 88/146 (60%), Gaps = 10/146 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL NA+ I S GGLE E DY Y+G AC+ + ++ +V I V
Sbjct: 303 DCDKVDKACMGGLPINAYSAIKSL--GGLETEDDYSYQGHMEACNFSAKKAKVYINDSVE 360
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E +A +L GP+++AINA MQFY G++HPL+ LC +DH +LIVGYG
Sbjct: 361 LSKNEQYLAAWLAVKGPISIAINAFGMQFYRHGIAHPLQPLCSPWF--IDHAMLIVGYGK 418
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+W IKNSWG WGE+
Sbjct: 419 RSG------VPFWAIKNSWGTDWGEE 438
>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
distachyon]
Length = 373
Score = 126 bits (316), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 68/159 (42%), Positives = 92/159 (57%), Gaps = 17/159 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + D+GC GGLM+NA+ +I GGL + YPY G+ C +
Sbjct: 192 QLVDCDHTCDAVAKNECDSGCSGGLMTNAYTYLIRA--GGLMEQAAYPYTGAQGTCRFDA 249
Query: 113 EEIRVKIQSYVNVS-SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
++ V++ S+ V DE ++ LV+ GP+AV +NA MQ Y GGVS PL LC +
Sbjct: 250 NKVAVRVTSFTAVPPDDEDQIRASLVRAGPLAVGLNAAFMQTYLGGVSCPL--LCPRKL- 306
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
++HGVL+VGYG +PYWIIKNSWG WGE
Sbjct: 307 -INHGVLLVGYGARGLAPLRLGYRPYWIIKNSWGKEWGE 344
>gi|13774176|gb|AAK38775.1| cysteine proteinase [Dermatophagoides farinae]
Length = 127
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 61/115 (53%), Positives = 77/115 (66%), Gaps = 3/115 (2%)
Query: 53 RGEGTHLA-LKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
R E L+ +LVDCD +D GCGGGL +NAF+T+I GG+E E DYPY N C+
Sbjct: 15 RNESVSLSEQELVDCDTLDNGCGGGLPTNAFKTVIQL--GGIESETDYPYDAENEKCNFK 72
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLC 166
K VKI SYV + +ET + YL NGP+++ INANAMQFYFGG+SHP +LC
Sbjct: 73 KSLSHVKIDSYVELPKNETYIKNYLYHNGPISIGINANAMQFYFGGISHPPNWLC 127
>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
Length = 321
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 95/160 (59%), Gaps = 20/160 (12%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGLM+NA++ +I GGLE E YPY G + C
Sbjct: 143 QLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIE--AGGLEEESSYPYTGKHGECKFKP 200
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+ + V++ ++ V BE ++A LV +GP+AV +NA MQ Y GGVS PL +C
Sbjct: 201 DRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXFMQTYIGGVSCPL--ICP--KRW 256
Query: 173 LDHGVLIVGYGVHK---TKFTHKIQPYWIIKNSWGPHWGE 209
++HGVL+VGYG +F +K PYWIIKNSWG WGE
Sbjct: 257 INHGVLLVGYGAKGYSILRFGYK--PYWIIKNSWGXRWGE 294
>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 292 VGYNDNSNP------PYWIIKNSWSNMWGE 315
>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 450
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 292 VGYNDNSNP------PYWIIKNSWSNMWGE 315
>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 451
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 292 VGYNDNSNP------PYWIIKNSWSNMWGE 315
>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 292 VGYNDNSNP------PYWIIKNSWSNMWGE 315
>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 373
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 98/159 (61%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLN 111
+LVDCD D+GC GGLM+NAFE + GGL E+DYPY G + AC +
Sbjct: 191 QLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALK--AGGLMKEEDYPYTGRDHTACKFD 248
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +I + ++ VSSDE ++A LV++GP+A+AINA MQ Y GGVS P ++C
Sbjct: 249 KSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGVSCP--YVCS---K 303
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
+ DHGVL+VG+G K +PYWIIKNSWG WGE
Sbjct: 304 SQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGE 342
>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
Length = 403
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 95/159 (59%), Gaps = 18/159 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+L+DCD D GC GGLM+NA+ ++ GG+E K+YPY G C N
Sbjct: 226 QLIDCDHKCDPLNTKACDNGCHGGLMTNAYNYLME--AGGIEEAKNYPYTGVQGDCKFNP 283
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+ VK ++ V+ DE ++A LVK+GP+AV +NA MQ Y GGVS PL +C
Sbjct: 284 DLAAVKAINFTTVNLDEKQIAANLVKHGPLAVGLNAAFMQTYIGGVSCPL--ICSKRF-- 339
Query: 173 LDHGVLIVGYGVHKTKFTHKI--QPYWIIKNSWGPHWGE 209
++HGVL+VGYG HK ++ +PYWIIKNSWG WGE
Sbjct: 340 INHGVLLVGYG-HKGFALLRLGYRPYWIIKNSWGKRWGE 377
Score = 41.2 bits (95), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 25/42 (59%)
Query: 8 HHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKK 49
H L +F+ F+ +H K Y+T EEY +RLRIF NL K
Sbjct: 79 EHLLNLRSKTLFDKFIVEHGKVYSTIEEYVRRLRIFEKNLLK 120
>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
Length = 450
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 83/150 (55%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWIIKNSW WGE
Sbjct: 292 VGYNDSSNP------PYWIIKNSWSNMWGE 315
>gi|261328618|emb|CBH11596.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
gambiense DAL972]
Length = 404
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 132 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 191
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 192 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 245
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 246 VGYNDNSNP------PYWIIKNSWSNMWGE 269
>gi|323713082|gb|ADY04295.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 91/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVSMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|323713208|gb|ADY04358.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGTSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 64/150 (42%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D+GC GGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 292 VGYNDNSNP------PYWIIKNSWSNMWGE 315
>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 64/150 (42%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D+GC GGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 292 VGYNDNSNP------PYWIIKNSWSNMWGE 315
>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 64/150 (42%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D+GC GGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 292 VGYNDNSNP------PYWIIKNSWSNMWGE 315
>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 64/150 (42%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D+GC GGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 292 VGYNDNSNP------PYWIIKNSWSNMWGE 315
>gi|323713210|gb|ADY04359.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDRWGEE 131
>gi|12330248|gb|AAG52661.1| cysteine proteinase [Metagonimus yokogawai]
Length = 148
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 62/123 (50%), Positives = 77/123 (62%), Gaps = 4/123 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCDK D C GGL A+E+I+ GGL EKDYPY+ CH I S
Sbjct: 24 QLLDCDKRDEACNGGLPEWAYESIVKM--GGLMSEKDYPYEAEKEVCHWKPSNASAYINS 81
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V +S +ETE+A +L NGP++V +NAN +QFYFGGVSHP LC LDH VL+VG
Sbjct: 82 SVVLSKNETELAAWLTDNGPISVGMNANFLQFYFGGVSHPPHMLCS--ESGLDHAVLLVG 139
Query: 182 YGV 184
YGV
Sbjct: 140 YGV 142
>gi|323713472|gb|ADY04490.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713476|gb|ADY04492.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713480|gb|ADY04494.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713482|gb|ADY04495.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713484|gb|ADY04496.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713486|gb|ADY04497.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713488|gb|ADY04498.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713490|gb|ADY04499.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713492|gb|ADY04500.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 138
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|412992445|emb|CCO18425.1| unknown [Bathycoccus prasinos]
Length = 500
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 75/161 (46%), Positives = 93/161 (57%), Gaps = 19/161 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLN 111
+L+DCD D+GC GGL SNA E I+ GGL+ EK YPYK C
Sbjct: 323 QLLDCDVGCAPDIPNACDSGCNGGLPSNAMEYIVEH--GGLDTEKSYPYKAYKEDTCRAK 380
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ ++ I +Y V +ET MA LVK GP+++ INA MQ Y GGV+ P +LC D
Sbjct: 381 EGKLGATISNYTFVGKNETHMAHALVKYGPLSIGINAAWMQSYVGGVACP--WLCN--KD 436
Query: 172 NLDHGVLIVGYGVH--KTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVLIVGYG HK +PYW+IKNSWG WGE+
Sbjct: 437 ALDHGVLIVGYGEEGFAPARLHK-EPYWVIKNSWGMGWGEE 476
>gi|323713016|gb|ADY04262.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713018|gb|ADY04263.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713020|gb|ADY04264.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713022|gb|ADY04265.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713024|gb|ADY04266.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713026|gb|ADY04267.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713030|gb|ADY04269.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713032|gb|ADY04270.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713034|gb|ADY04271.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713036|gb|ADY04272.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713038|gb|ADY04273.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713040|gb|ADY04274.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713042|gb|ADY04275.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713044|gb|ADY04276.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713046|gb|ADY04277.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713048|gb|ADY04278.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713050|gb|ADY04279.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713052|gb|ADY04280.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713054|gb|ADY04281.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713056|gb|ADY04282.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713058|gb|ADY04283.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713060|gb|ADY04284.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713062|gb|ADY04285.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713064|gb|ADY04286.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713066|gb|ADY04287.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713068|gb|ADY04288.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713070|gb|ADY04289.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713072|gb|ADY04290.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713074|gb|ADY04291.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713076|gb|ADY04292.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713080|gb|ADY04294.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713084|gb|ADY04296.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713088|gb|ADY04298.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713090|gb|ADY04299.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713092|gb|ADY04300.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713094|gb|ADY04301.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713096|gb|ADY04302.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713098|gb|ADY04303.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713100|gb|ADY04304.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713102|gb|ADY04305.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713104|gb|ADY04306.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713106|gb|ADY04307.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713108|gb|ADY04308.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713110|gb|ADY04309.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713112|gb|ADY04310.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713114|gb|ADY04311.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713116|gb|ADY04312.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713118|gb|ADY04313.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713120|gb|ADY04314.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713122|gb|ADY04315.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713124|gb|ADY04316.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713126|gb|ADY04317.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713128|gb|ADY04318.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713130|gb|ADY04319.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713132|gb|ADY04320.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713134|gb|ADY04321.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713136|gb|ADY04322.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713138|gb|ADY04323.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713140|gb|ADY04324.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713142|gb|ADY04325.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713144|gb|ADY04326.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713146|gb|ADY04327.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713148|gb|ADY04328.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713150|gb|ADY04329.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713152|gb|ADY04330.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713154|gb|ADY04331.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713156|gb|ADY04332.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713158|gb|ADY04333.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713160|gb|ADY04334.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713162|gb|ADY04335.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713166|gb|ADY04337.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713168|gb|ADY04338.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713170|gb|ADY04339.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713172|gb|ADY04340.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713174|gb|ADY04341.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713180|gb|ADY04344.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713182|gb|ADY04345.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713184|gb|ADY04346.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713186|gb|ADY04347.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713188|gb|ADY04348.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713190|gb|ADY04349.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713192|gb|ADY04350.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713194|gb|ADY04351.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713196|gb|ADY04352.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713198|gb|ADY04353.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713200|gb|ADY04354.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713202|gb|ADY04355.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713204|gb|ADY04356.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713206|gb|ADY04357.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713212|gb|ADY04360.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713216|gb|ADY04362.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713218|gb|ADY04363.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713220|gb|ADY04364.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713222|gb|ADY04365.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713224|gb|ADY04366.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713226|gb|ADY04367.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713230|gb|ADY04369.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713232|gb|ADY04370.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713234|gb|ADY04371.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713236|gb|ADY04372.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713238|gb|ADY04373.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713240|gb|ADY04374.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713246|gb|ADY04377.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713248|gb|ADY04378.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713250|gb|ADY04379.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713252|gb|ADY04380.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713254|gb|ADY04381.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713256|gb|ADY04382.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713258|gb|ADY04383.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713260|gb|ADY04384.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713262|gb|ADY04385.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713264|gb|ADY04386.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713266|gb|ADY04387.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713268|gb|ADY04388.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713270|gb|ADY04389.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713274|gb|ADY04391.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713276|gb|ADY04392.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713278|gb|ADY04393.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713280|gb|ADY04394.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713282|gb|ADY04395.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713284|gb|ADY04396.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713286|gb|ADY04397.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713288|gb|ADY04398.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713290|gb|ADY04399.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713292|gb|ADY04400.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713294|gb|ADY04401.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713296|gb|ADY04402.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713298|gb|ADY04403.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713300|gb|ADY04404.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713302|gb|ADY04405.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713304|gb|ADY04406.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713306|gb|ADY04407.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713308|gb|ADY04408.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713310|gb|ADY04409.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713312|gb|ADY04410.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713314|gb|ADY04411.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713316|gb|ADY04412.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713318|gb|ADY04413.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713322|gb|ADY04415.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713324|gb|ADY04416.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713326|gb|ADY04417.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713328|gb|ADY04418.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713330|gb|ADY04419.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713332|gb|ADY04420.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713334|gb|ADY04421.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713336|gb|ADY04422.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713338|gb|ADY04423.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713340|gb|ADY04424.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713342|gb|ADY04425.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713344|gb|ADY04426.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713346|gb|ADY04427.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713348|gb|ADY04428.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713350|gb|ADY04429.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713352|gb|ADY04430.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713354|gb|ADY04431.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713356|gb|ADY04432.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713358|gb|ADY04433.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713360|gb|ADY04434.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713362|gb|ADY04435.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713364|gb|ADY04436.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713366|gb|ADY04437.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713368|gb|ADY04438.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713370|gb|ADY04439.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713372|gb|ADY04440.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713374|gb|ADY04441.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713376|gb|ADY04442.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713378|gb|ADY04443.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713380|gb|ADY04444.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713382|gb|ADY04445.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713384|gb|ADY04446.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713386|gb|ADY04447.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713388|gb|ADY04448.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713390|gb|ADY04449.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713392|gb|ADY04450.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713394|gb|ADY04451.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713396|gb|ADY04452.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713398|gb|ADY04453.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713400|gb|ADY04454.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713402|gb|ADY04455.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713404|gb|ADY04456.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713408|gb|ADY04458.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713410|gb|ADY04459.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713412|gb|ADY04460.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713414|gb|ADY04461.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713416|gb|ADY04462.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713418|gb|ADY04463.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713420|gb|ADY04464.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713422|gb|ADY04465.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713424|gb|ADY04466.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713426|gb|ADY04467.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713428|gb|ADY04468.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713430|gb|ADY04469.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713432|gb|ADY04470.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713434|gb|ADY04471.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713436|gb|ADY04472.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713438|gb|ADY04473.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713440|gb|ADY04474.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713442|gb|ADY04475.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713444|gb|ADY04476.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713448|gb|ADY04478.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713454|gb|ADY04481.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713458|gb|ADY04483.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713460|gb|ADY04484.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713462|gb|ADY04485.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713464|gb|ADY04486.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713466|gb|ADY04487.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713468|gb|ADY04488.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713470|gb|ADY04489.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713474|gb|ADY04491.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713478|gb|ADY04493.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713494|gb|ADY04501.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713496|gb|ADY04502.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713498|gb|ADY04503.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713500|gb|ADY04504.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713502|gb|ADY04505.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713504|gb|ADY04506.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713506|gb|ADY04507.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713508|gb|ADY04508.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713510|gb|ADY04509.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713512|gb|ADY04510.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713514|gb|ADY04511.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713516|gb|ADY04512.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713518|gb|ADY04513.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713520|gb|ADY04514.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713522|gb|ADY04515.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713524|gb|ADY04516.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713526|gb|ADY04517.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713528|gb|ADY04518.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|323713164|gb|ADY04336.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713178|gb|ADY04343.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLKT--GGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|323713452|gb|ADY04480.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVKMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|330792958|ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
Length = 346
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 66/159 (41%), Positives = 88/159 (55%), Gaps = 20/159 (12%)
Query: 63 LVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLN 111
LVDCD DAGC GGL NA+ +I GGL+ E YPY + +C
Sbjct: 170 LVDCDHQCMEYDGQKSCDAGCDGGLQPNAYRYVIEN--GGLDSENSYPYLAVTGDSCKFK 227
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ KI ++ + +ET+MA YL +GP+A+A +A QFY GGV C
Sbjct: 228 SGNVAAKISNFTMIPQNETQMAGYLATHGPLAIAADAAEWQFYIGGV---FDLPCG---Q 281
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+LDHG+LIVG+ K F H ++PYWI+KNSWG WGE+
Sbjct: 282 SLDHGILIVGFSAEKNIFGH-LKPYWIVKNSWGASWGEQ 319
>gi|323713176|gb|ADY04342.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLKT--GGLMKEEDYPYTGTDKGSCKFEKSKIAAAVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|323713320|gb|ADY04414.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSHDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
Length = 322
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 66/149 (44%), Positives = 85/149 (57%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD GC GG S+++ I+ GGLE E DYPY G + C LNKE++ KI
Sbjct: 159 QLVDCDMAAEGCNGGWPSSSYLEIMDM--GGLESENDYPYVGVEQTCALNKEKLVAKIDD 216
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + + E E YL ++GP++ +NA A+Q Y G+ HP C D+L+H VL VG
Sbjct: 217 AVVLGASENEHVDYLAEHGPLSTLLNAVALQHYQSGILHPSHKDCPD--DDLNHAVLTVG 274
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y PYWIIKNSWG WGEK
Sbjct: 275 YDREGD------MPYWIIKNSWGTDWGEK 297
>gi|323713078|gb|ADY04293.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713086|gb|ADY04297.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRLK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|323713228|gb|ADY04368.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713242|gb|ADY04375.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713244|gb|ADY04376.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713272|gb|ADY04390.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713446|gb|ADY04477.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713450|gb|ADY04479.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGNKWGEE 131
>gi|323713406|gb|ADY04457.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIVASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
Length = 325
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 87/148 (58%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GG ++ I K GGLE + DYPY G C L++ ++ KI
Sbjct: 163 QLVDCDTVDNGCYGGYPPYTYKEI--KRMGGLELQSDYPYTGWGHGCRLDRSKLFAKIDD 220
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + +DE + A +L ++GPM+ +NA +QFY G+ HP K +C + L+H VL VG
Sbjct: 221 SIVLEADEEKQAAWLAEHGPMSTCLNAKYLQFYQSGILHPSKAMCS--PEGLNHAVLTVG 278
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
Y H I PYWIIKNSWG WGE
Sbjct: 279 YDTK-----HGI-PYWIIKNSWGTSWGE 300
>gi|323713456|gb|ADY04482.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRVK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 64/150 (42%), Positives = 83/150 (55%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D+GC GGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 178 LVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L LDHGVL+
Sbjct: 238 TDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGI------LTSCTSKQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY + PYWIIKNSW WGE
Sbjct: 292 VGYNDNSNP------PYWIIKNSWSNMWGE 315
>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
Length = 375
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 95/160 (59%), Gaps = 20/160 (12%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGLM+NA++ +I GGLE E YPY G + C
Sbjct: 197 QLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIE--AGGLEEESSYPYTGKHGECKFKP 254
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+ + V++ ++ V +E ++A LV +GP+AV +NA MQ Y GGVS PL +C
Sbjct: 255 DRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTYIGGVSCPL--ICP--KRW 310
Query: 173 LDHGVLIVGYGVHK---TKFTHKIQPYWIIKNSWGPHWGE 209
++HGVL+VGYG +F +K PYWIIKNSWG WGE
Sbjct: 311 INHGVLLVGYGAKGYSILRFGYK--PYWIIKNSWGKRWGE 348
>gi|296085959|emb|CBI31400.3| unnamed protein product [Vitis vinifera]
Length = 257
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 95/160 (59%), Gaps = 20/160 (12%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D+GC GGLM+NA++ +I GGLE E YPY G + C
Sbjct: 68 QLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIE--AGGLEEESSYPYTGKHGECKFKP 125
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+ + V++ ++ V +E ++A LV +GP+AV +NA MQ Y GGVS PL +C
Sbjct: 126 DRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTYIGGVSCPL--ICP--KRW 181
Query: 173 LDHGVLIVGYGVHK---TKFTHKIQPYWIIKNSWGPHWGE 209
++HGVL+VGYG +F +K PYWIIKNSWG WGE
Sbjct: 182 INHGVLLVGYGAKGYSILRFGYK--PYWIIKNSWGKRWGE 219
>gi|20301809|gb|AAM15728.1| cysteine protease [Pagumogonimus skrjabini]
Length = 165
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 86/142 (60%), Gaps = 10/142 (7%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCDKVD GC GG + I K GGLE ++DYPY G + C ++K ++ KI
Sbjct: 34 QLVDCDKVDHGCNGGWPPYTYGEI--KRLGGLETQQDYPYIGRQQTCRMDKSKLLTKIDG 91
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + DE + A +L ++GPMA +NAN +Q+Y G+SHP ++ C L+HGVL VG
Sbjct: 92 SIVLERDEYKQAAWLAEHGPMASTLNANYLQYYRSGISHPSRYECNPA--RLNHGVLTVG 149
Query: 182 YGVHKTKFTHKIQPYWIIKNSW 203
YG T PYWI+KNSW
Sbjct: 150 YG------TENGIPYWIVKNSW 165
>gi|170784978|pdb|2P7U|A Chain A, The Crystal Structure Of Rhodesain, The Major Cysteine
Protease Of T. Brucei Rhodesiense, Bound To Inhibitor
K777
gi|171848756|pdb|2P86|A Chain A, The High Resolution Crystal Structure Of Rohedsain, The
Major Cathepsin L Protease From T. Brucei Rhodesiense,
Bound To Inhibitor K11002
Length = 215
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 83/151 (54%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 53 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 113 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE
Sbjct: 167 VGYNDASNP------PYWIIKNSWSNMWGED 191
>gi|17384029|emb|CAD12392.1| cysteine proteinase [Leishmania infantum]
Length = 354
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 88/151 (58%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A + II+ G + E YPY G+ CH N + KI
Sbjct: 181 LVSCDNIDDGCNGGLMQQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDNGT-VGAKI 239
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ Y+++ DE E+A Y+ KNGP+AVA++A Q YFGGV LC G +L+HGVL+
Sbjct: 240 KGYMSLPHDEEEIAAYVGKNGPVAVAVDATTRQLYFGGVVT----LCFG--LSLNHGVLV 293
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ PYWI+KNSWG WGEK
Sbjct: 294 VGFNRQAKP------PYWIVKNSWGSSWGEK 318
>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
Length = 343
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/159 (42%), Positives = 86/159 (54%), Gaps = 19/159 (11%)
Query: 63 LVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLN 111
LVDCD D GC GGL NA+ II GG++ E YPY + C+ N
Sbjct: 170 LVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETGTQCNFN 227
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
I KI ++ + +ET MA Y+V GP+A+A +A QFY GGV F +
Sbjct: 228 SANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGV-----FDIPCNPN 282
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+LDHG+LIVGY T F K PYWI+KNSWG WGE+
Sbjct: 283 SLDHGILIVGYSAKNTIF-RKNMPYWIVKNSWGADWGEQ 320
>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
Length = 343
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/159 (42%), Positives = 86/159 (54%), Gaps = 19/159 (11%)
Query: 63 LVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLN 111
LVDCD D GC GGL NA+ II GG++ E YPY + C+ N
Sbjct: 170 LVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETGTQCNFN 227
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
I KI ++ + +ET MA Y+V GP+A+A +A QFY GGV F +
Sbjct: 228 SANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGV-----FDIPCNPN 282
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+LDHG+LIVGY T F K PYWI+KNSWG WGE+
Sbjct: 283 SLDHGILIVGYSAKNTIF-RKNMPYWIVKNSWGADWGEQ 320
>gi|339244637|ref|XP_003378244.1| cathepsin F [Trichinella spiralis]
gi|316972865|gb|EFV56511.1| cathepsin F [Trichinella spiralis]
Length = 317
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/149 (43%), Positives = 90/149 (60%), Gaps = 7/149 (4%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCDK++ GC GG A+ II G++ E DYPY G + +C LNKE+I+V I
Sbjct: 113 QIIDCDKINRGCRGGQPLKAYHEIIRM--SGVQAESDYPYTGLHGSCKLNKEKIKVYIND 170
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + +ET +A YL ++GP+AV +NA+ + Y G+ P K C L+HG I+G
Sbjct: 171 TVLLHKNETTIANYLYEHGPVAVRMNADILMLYRKGIIKPTKSSCNPNF--LNHGATIIG 228
Query: 182 YGVHKTKFTH-KIQPYWIIKNSWGPHWGE 209
YG K + H PYWIIKNSWG WGE
Sbjct: 229 YG--KESWLHWWSNPYWIIKNSWGVDWGE 255
>gi|323713214|gb|ADY04361.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/138 (48%), Positives = 89/138 (64%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + G L E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGALMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+PYWIIKNSWG WGE+
Sbjct: 114 EKPYWIIKNSWGDKWGEE 131
>gi|146084829|ref|XP_001465113.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
gi|134069209|emb|CAM67356.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
Length = 354
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 88/151 (58%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A + II+ G + E YPY G+ CH N + KI
Sbjct: 181 LVSCDNIDDGCNGGLMQQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDNGT-VGAKI 239
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ Y+++ DE E+A Y+ KNGP+AVA++A Q YFGGV LC G +L+HGVL+
Sbjct: 240 KGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVT----LCFG--LSLNHGVLV 293
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ PYWI+KNSWG WGEK
Sbjct: 294 VGFNRQAKP------PYWIVKNSWGSSWGEK 318
>gi|8468607|gb|AAF75547.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/150 (42%), Positives = 86/150 (57%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVEIPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW HWGE
Sbjct: 289 VGYN------DSAAVPYWVIKNSWTTHWGE 312
>gi|118488886|gb|ABK96252.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 156
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/134 (49%), Positives = 88/134 (65%), Gaps = 9/134 (6%)
Query: 78 MSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMAKYL 136
M++AFE + GGL E+DYPY G++R AC +K ++ ++ ++ VS DE ++A L
Sbjct: 1 MNSAFEYTLK--AGGLMREEDYPYTGTDRGACKFDKNKVAARVANFSVVSLDEDQIAANL 58
Query: 137 VKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG-VHKTKFTHKIQP 195
VKNGP+AVAINA MQ Y GGVS P ++C LDHGVL+VGYG + K +P
Sbjct: 59 VKNGPLAVAINAVFMQTYIGGVSCP--YICS---RRLDHGVLLVGYGSAGYSPVRMKEKP 113
Query: 196 YWIIKNSWGPHWGE 209
+WIIKNSWG WGE
Sbjct: 114 FWIIKNSWGEKWGE 127
>gi|323713028|gb|ADY04268.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/138 (48%), Positives = 90/138 (65%), Gaps = 9/138 (6%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMA 133
GGLM++AFE + GGL E+DYPY G+++ +C K +I + ++ VS DE ++A
Sbjct: 1 GGLMNSAFEYTLK--AGGLMKEEDYPYTGTDKGSCKFEKSKIAASVANFSVVSLDEDQIA 58
Query: 134 KYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHK 192
LVKNGP+A+AINA MQ Y GGVS P ++C LDHGVL+VGYG + K
Sbjct: 59 ANLVKNGPLAIAINAVFMQTYMGGVSCP--YICS---KRLDHGVLLVGYGSSGYSPVRMK 113
Query: 193 IQPYWIIKNSWGPHWGEK 210
+P+WIIKNSWG WGE+
Sbjct: 114 EKPHWIIKNSWGDKWGEE 131
>gi|14349349|gb|AAC38833.2| cysteine protease [Leishmania chagasi]
Length = 353
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 87/151 (57%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A + II+ G + E YPY G+ CH N + KI
Sbjct: 180 LVSCDNIDDGCNGGLMQQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDNGT-VGAKI 238
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y+++ DE E+A Y+ KNGP+AVA++A Q YFGGV LC G +L+HGVL+
Sbjct: 239 AGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVT----LCFG--LSLNHGVLV 292
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ PYWI+KNSWG WGEK
Sbjct: 293 VGFNRQAKP------PYWIVKNSWGSSWGEK 317
>gi|15824704|gb|AAL09448.1| cysteine protease [Leishmania donovani]
Length = 353
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 87/151 (57%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A + II+ G + E YPY G+ CH N + KI
Sbjct: 180 LVSCDNIDDGCNGGLMEQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDNGT-VGAKI 238
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y+++ DE E+A Y+ KNGP+AVA++A Q YFGGV LC G +L+HGVL+
Sbjct: 239 AGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVT----LCFG--LSLNHGVLV 292
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ PYWI+KNSWG WGEK
Sbjct: 293 VGFNRQAKP------PYWIVKNSWGSSWGEK 317
>gi|398014254|ref|XP_003860318.1| cysteine peptidase A (CBA) [Leishmania donovani]
gi|13518086|gb|AAK27384.1| cysteine proteinase-like protein [Leishmania donovani]
gi|322498538|emb|CBZ33611.1| cysteine peptidase A (CBA) [Leishmania donovani]
Length = 354
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 87/151 (57%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A + II+ G + E YPY G+ CH N + KI
Sbjct: 181 LVSCDNIDDGCNGGLMEQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDNGT-VGAKI 239
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y+++ DE E+A Y+ KNGP+AVA++A Q YFGGV LC G +L+HGVL+
Sbjct: 240 AGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVT----LCFG--LSLNHGVLV 293
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ PYWI+KNSWG WGEK
Sbjct: 294 VGFNRQAKP------PYWIVKNSWGSSWGEK 318
>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
Length = 709
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 96/169 (56%), Gaps = 28/169 (16%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + D+GCGGGLM+NA+ ++S GGL + YPY G+ AC +
Sbjct: 192 QLVDCDHTCDAEKKTECDSGCGGGLMTNAYAYLMSS--GGLMEQSAYPYTGAQGACRFDA 249
Query: 113 EEIRVKIQSYVNVS--------SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKF 164
+ V++ ++ V+ + +M LV++GP+AV +NA MQ Y GGVS PL
Sbjct: 250 NRVAVRVANFTVVAPAAGPGGNDGDAQMRAALVRHGPLAVGLNAAYMQTYVGGVSCPL-- 307
Query: 165 LCKGGMDNLDHGVLIVGY---GVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+C N HGVL+VGY G + H+ PYWIIKNSWG WGE+
Sbjct: 308 VCPRAWVN--HGVLLVGYGERGFAALRLGHR--PYWIIKNSWGKAWGEQ 352
>gi|157868354|ref|XP_001682730.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
gi|68126185|emb|CAJ07238.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
Length = 354
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 87/151 (57%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A E II G + EK YPY G++ CH +K E +I
Sbjct: 181 LVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPTEKSYPYASAGGTSPPCH-DKGEFGARI 239
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y+++ DE +A Y+ K GP+AVA++A Q YFGGV LC G +L+HGVL+
Sbjct: 240 SGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQLYFGGVVT----LCFG--LSLNHGVLV 293
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ PYWI+KNSWG WGEK
Sbjct: 294 VGFNKRAKP------PYWIVKNSWGTSWGEK 318
>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
Length = 381
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 91/160 (56%), Gaps = 17/160 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + + GC GGLM+NA+ ++ GGL + YPY G+ C +
Sbjct: 201 QLVDCDHTCSAVAQNECNNGCAGGLMTNAYSYLMES--GGLMEQSAYPYTGAAGPCRFDP 258
Query: 113 EEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
++ V++ ++ V + DE ++ LV+ GP+AV +NA MQ Y GGVS PL +C
Sbjct: 259 TQVAVRVANFTAVPAGDEAQIRAALVRRGPLAVGLNAAFMQTYVGGVSCPL--ICPRAWV 316
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
N HGVL+VGYG +PYWIIKNSWG WGE+
Sbjct: 317 N--HGVLLVGYGARGFAALRLGYRPYWIIKNSWGKQWGEQ 354
>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
Length = 358
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 67/135 (49%), Positives = 88/135 (65%), Gaps = 9/135 (6%)
Query: 77 LMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMAKY 135
LM++AFE I++ GG+ E+DYPY G+N C +K +I + ++ VS DE ++A
Sbjct: 203 LMNSAFEYILNN--GGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIAAN 260
Query: 136 LVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHKIQ 194
LVKNGP+AVAINA MQ Y GGVS P ++C L+HGVL+VGYG K +
Sbjct: 261 LVKNGPLAVAINAVYMQTYVGGVSCP--YVCS---KKLNHGVLLVGYGSESYAPIRMKQK 315
Query: 195 PYWIIKNSWGPHWGE 209
PYWIIKNSWG +WGE
Sbjct: 316 PYWIIKNSWGENWGE 330
>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
Length = 353
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 94/158 (59%), Gaps = 17/158 (10%)
Query: 62 KLVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
+LVDCD D GC GGL +A++ ++ GG+ EKDYPY C +
Sbjct: 178 QLVDCDNNCVTFENQQSCDDGCNGGLQWSAYQYLMK--AGGVVTEKDYPYYAERYKCEVK 235
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K+ ++ +S++ETEMA +L +NGP+AVA+NA+ +Q Y G++ P C
Sbjct: 236 PANFVAKLSNWTMLSTNETEMANWLAENGPIAVALNADFLQNYNNGIADPA--WCDP--T 291
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
LDHGVLIVGYG+ +T + K QPYWI+KNSWG +GE
Sbjct: 292 QLDHGVLIVGYGL-ETFWFGKPQPYWIVKNSWGYDFGE 328
>gi|4902840|emb|CAB43538.1| cysteine proteinase A [Leishmania major]
Length = 229
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 87/151 (57%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A E II G + EK YPY G++ CH +K E +I
Sbjct: 56 LVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPTEKSYPYASAGGTSPPCH-DKGEFGARI 114
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y+++ DE +A Y+ K GP+AVA++A Q YFGGV LC G +L+HGVL+
Sbjct: 115 SGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQLYFGGVVT----LCFG--LSLNHGVLV 168
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ PYWI+KNSWG WGEK
Sbjct: 169 VGFNKRAKP------PYWIVKNSWGTSWGEK 193
>gi|343472975|emb|CCD15017.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 293
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 68/150 (45%), Positives = 90/150 (60%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CDK + GC GGLM AF+ I+S G + E+ YPY G AC+++ + + KI
Sbjct: 110 LVSCDKKNYGCEGGLMDRAFQWIVSSNKGNVFTEQSYPYDSSWGDVPACNMSGKVVGAKI 169
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
SYV++ DE +A++L KNGP+A+A++A + + Y GGV L LDHGVL+
Sbjct: 170 SSYVDLPQDENAIAEWLAKNGPVAIAVDATSFRSYTGGV------LTSCISRRLDHGVLL 223
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY T K PYWIIKNSWG WGE
Sbjct: 224 VGY-----DDTSK-PPYWIIKNSWGKGWGE 247
>gi|242045644|ref|XP_002460693.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
gi|241924070|gb|EER97214.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
Length = 373
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 66/160 (41%), Positives = 91/160 (56%), Gaps = 17/160 (10%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + + GC GGLM+NA+ ++ GGL ++ YPY G+ C +
Sbjct: 193 QLVDCDHTCSAVAQNECNNGCAGGLMTNAYAYLMKS--GGLMEQRAYPYTGAPGPCRFDP 250
Query: 113 EEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ V++ ++ V + DE ++ LV+ GP+AV +NA MQ Y GGVS PL LC
Sbjct: 251 AKAAVRVANFTAVPAGDEAQIRAALVRRGPLAVGLNAAFMQTYVGGVSCPL--LCPRAWV 308
Query: 172 NLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGEK 210
N HGVL+VGYG +PYWIIKNSWG WGE+
Sbjct: 309 N--HGVLLVGYGARGFAALRLGYRPYWIIKNSWGERWGEQ 346
>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 63/150 (42%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWIIKNSW WGE
Sbjct: 289 VGYN------DSAAVPYWIIKNSWTAQWGE 312
>gi|194462412|gb|ACF72674.1| cysteine proteinase type I [Leishmania tarentolae]
Length = 218
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 91/155 (58%), Gaps = 18/155 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD VD+GC GGLMS AFE +++ G + E YPY +N C N +E+ V
Sbjct: 53 QLVSCDDVDSGCSGGLMSQAFEWLLNNTNGNVYTEDSYPYLSANGYAPECS-NSDELAVG 111
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S+E EMA +L KNGP+A+A++A A Y GGV C G + L+HG
Sbjct: 112 AQIDGHVVIESNEDEMAAWLAKNGPIAIAVDATAFMSYEGGV----LTACNG--EQLNHG 165
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKT 211
VL+V Y T PYW+IKNSWG WGE+
Sbjct: 166 VLLVAYN------TTGELPYWVIKNSWGASWGEEA 194
>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
Length = 322
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 66/149 (44%), Positives = 86/149 (57%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD+V GC GG +++ I K GGLE E DYPY G+ + C LNKE++ KI
Sbjct: 159 QLVDCDRVAEGCNGGWPVSSYLEI--KHMGGLESESDYPYVGAEQTCALNKEKLLAKIDD 216
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + + E E A YL ++GP++ +NA A+Q Y GV +P C L+H VL VG
Sbjct: 217 LIVLGAYEEEHAAYLAEHGPLSTLLNAVALQHYQSGVLNPTYEECPD--TELNHAVLTVG 274
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y PYWIIKNSWG WGEK
Sbjct: 275 YDKEGD------MPYWIIKNSWGTDWGEK 297
>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 363
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 90/151 (59%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRVKI 119
L+ CD + GCGGGLM AF+ I+S G + E+ YPY ++ C+ + + + KI
Sbjct: 178 LLSCDTREDGCGGGLMDRAFQWIVSSNKGNVFTEQSYPYASTDGDVPRCNKSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +A++L KNGP+A+A+ A ++Q Y GGV L + LDHGVL+
Sbjct: 238 SDYVDLPQDENAIAEWLAKNGPVAIAVEATSLQRYTGGV------LTSCISEQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSWG WGE+
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWGKGWGEE 316
>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 376
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 95/168 (56%), Gaps = 27/168 (16%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD + D+GCGGGLM+NA+ ++S GGL + YPY G+ C +
Sbjct: 189 QLVDCDHTCDAEKKTECDSGCGGGLMTNAYAYLMSS--GGLMEQSAYPYTGAQGTCRFDA 246
Query: 113 EEIRVKIQSYVNVS-------SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFL 165
+ V++ ++ V+ + +M LV++GP+AV +NA MQ Y GGVS PL +
Sbjct: 247 NRVAVRVANFTVVAPPGGNDGDGDAQMRAALVRHGPLAVGLNAAYMQTYVGGVSCPL--V 304
Query: 166 CKGGMDNLDHGVLIVGY---GVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C N HGVL+VGY G + H+ PYWIIKNSWG WGE+
Sbjct: 305 CPRAWVN--HGVLLVGYGERGFAALRLGHR--PYWIIKNSWGKAWGEQ 348
>gi|71084306|gb|AAZ23598.1| cysteine protease [Leishmania major]
Length = 327
Score = 119 bits (299), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 67/151 (44%), Positives = 87/151 (57%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A E II G + E+ YPY G++ CH +K E +I
Sbjct: 154 LVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPTEESYPYASAGGTSPPCH-DKGEFGARI 212
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y+++ DE +A Y+ K GP+AVA++A Q YFGGV LC G +L+HGVL+
Sbjct: 213 SGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQLYFGGVVT----LCFGW--SLNHGVLV 266
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ PYWI+KNSWG WGEK
Sbjct: 267 VGFNKRAKP------PYWIVKNSWGTSWGEK 291
>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
Length = 347
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 93/177 (52%), Gaps = 10/177 (5%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLNKEEIRVKIQS 121
LVDC + GC GGLM A++ I+ G++ E YPY + C N I KI
Sbjct: 171 LVDCSTKNDGCNGGLMPLAYDYIVEN--NGIDTEASYPYLAIQQKNCQFNPANIGAKIDG 228
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
Y NVSS+ET+M LV NGP+++A +A Q+Y G+ + +C NLDHG+LIVG
Sbjct: 229 YYNVSSNETQMQINLVNNGPLSIAADAAEWQYYKKGIFSGIFGICG---KNLDHGILIVG 285
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIYSSAP 238
YG T+F ++ +WIIKNSW WG F +IK G S Y P
Sbjct: 286 YGQQTTEFGTEL--FWIIKNSWSTDWG--LSGFMLIKRGTGECGINLAVTSAYVDTP 338
>gi|118483347|gb|ABK93575.1| unknown [Populus trichocarpa]
Length = 157
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/134 (49%), Positives = 87/134 (64%), Gaps = 9/134 (6%)
Query: 78 MSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMAKYL 136
M+NAFE + GGLE EKDYPY G++R AC K ++ + ++ VS DE ++A L
Sbjct: 1 MNNAFEYALK--AGGLEREKDYPYTGNDRGACKFEKSKVAASVSNFSVVSLDEDQIAANL 58
Query: 137 VKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHKIQP 195
VK+GP++VAINA MQ Y GGVS P ++C + DHGVL+VGYG K +P
Sbjct: 59 VKHGPLSVAINAVFMQTYIGGVSCP--YICS---KHQDHGVLLVGYGAAGYAPIRFKEKP 113
Query: 196 YWIIKNSWGPHWGE 209
+WIIKNSWG +WGE
Sbjct: 114 FWIIKNSWGENWGE 127
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 91/153 (59%), Gaps = 8/153 (5%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC K + GC GG M NAFE ++ G G + EKDYPYKG + C + + +R I
Sbjct: 173 QLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSADGVRATI 232
Query: 120 QSYVNV-SSDETEMAKYLVKNGPMAVAINAN-AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V +ET++ + GP++VAI+A A+QFY GV + + C G L+HGV
Sbjct: 233 SGYNDVKQGNETDLLDAVANVGPVSVAIHAGAALQFYLRGVFNGVAGTCFG---PLNHGV 289
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG +F K+ YWIIKNSWG WGEK
Sbjct: 290 TAVGYGTASLRFGRKMD-YWIIKNSWGMGWGEK 321
>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW WGE
Sbjct: 289 VGYN------DSAAVPYWVIKNSWTTQWGE 312
>gi|1136312|gb|AAB41118.1| cruzipain [Trypanosoma cruzi]
Length = 383
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW WGE
Sbjct: 289 VGYN------DSAAVPYWVIKNSWTTQWGE 312
>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
Length = 325
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 87/148 (58%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD++D GC GG ++ I K GGLE + YPY +AC +++ ++ KI
Sbjct: 163 QLVDCDRLDHGCSGGYPPYTYKEI--KRMGGLELQSAYPYTSWKQACRIDRSKLVAKIDD 220
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + +DE + A +L ++GPM+ +NA +QFY G+ HP K +C + L+H VL VG
Sbjct: 221 SIVLETDEEKQAAWLAEHGPMSTCLNAGPLQFYQSGILHPSKAMCSP--EGLNHAVLTVG 278
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
Y H + PYW ++NSWG WGE
Sbjct: 279 YDTE-----HGV-PYWTVRNSWGTRWGE 300
>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
Length = 322
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 85/149 (57%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD+ GC GG ++++ I+ GGLE E DYPY G + C LNKE++ KI
Sbjct: 159 QLVDCDRAAQGCNGGWPASSYLEIMYM--GGLESESDYPYVGVEQTCALNKEKLVAKIDD 216
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + +E + A YL ++GP++ +NA A+Q+Y GV P C L+H VL VG
Sbjct: 217 SIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPTFEECPD--TELNHAVLTVG 274
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y PYWIIKNSWG WGEK
Sbjct: 275 YDKEGD------MPYWIIKNSWGTDWGEK 297
>gi|237637246|gb|ACR07923.1| cysteine proteinase [Trypanosoma cruzi]
Length = 345
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 53 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 113 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE
Sbjct: 167 VGYN------DSAAVPYWIIKNSWTAQWGED 191
>gi|237637240|gb|ACR07920.1| cysteine proteinase [Trypanosoma cruzi]
Length = 345
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 53 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 113 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE
Sbjct: 167 VGYN------DSAAVPYWIIKNSWTAQWGED 191
>gi|204307630|gb|ACI00341.1| cysteine proteinase [Trypanosoma cruzi]
Length = 339
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/150 (42%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 47 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 106
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 107 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 160
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWIIKNSW WGE
Sbjct: 161 VGYN------DSAAVPYWIIKNSWTAQWGE 184
>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
Length = 325
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 87/148 (58%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD++D GC GG ++ I K GGLE + YPY G +AC L++ ++ KI
Sbjct: 163 QLVDCDRLDHGCSGGYPPYTYKEI--KRMGGLELQSAYPYTGWEQACRLDRSKLFAKIDD 220
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + +E + A +L ++GPM+ +NA +QFY G+ HP ++ C + L+H VL VG
Sbjct: 221 SIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPSEYACS--PEGLNHAVLTVG 278
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
Y T + PYW ++NSWG WGE
Sbjct: 279 YD------TERGVPYWTVRNSWGTRWGE 300
>gi|11359985|pir||T46294 hypothetical protein DKFZp434F0610.1 - human (fragment)
gi|6808322|emb|CAB70900.1| hypothetical protein [Homo sapiens]
Length = 308
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/131 (46%), Positives = 84/131 (64%), Gaps = 4/131 (3%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDK+D C GGL SNA+ I K GGLE E DY Y+G ++C+ + E+ +V I V
Sbjct: 165 DCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVE 222
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L K GP++VAINA MQFY G+S PL+ LC + +DH VL+VGYG
Sbjct: 223 LSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWL--IDHAVLLVGYGN 280
Query: 185 HKTKFTHKIQP 195
+ + IQP
Sbjct: 281 REFRCLSCIQP 291
>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW WGE
Sbjct: 289 VGYN------DSAAVPYWVIKNSWTTQWGE 312
>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 344
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 93/159 (58%), Gaps = 15/159 (9%)
Query: 54 GEGTHLALK-LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY-PYKGSNRACHLN 111
GE H + + L+DCD ++ GC GGLM++A++ + + GG++ Y YK C+ +
Sbjct: 173 GELLHFSEQMLLDCDNINQGCRGGLMTDAYQFL--QQSGGIQTADTYGDYKNKKDICNFD 230
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
K +++ K+ + + +E + + LVKNGP+AV INA +QFY GG+ P K D
Sbjct: 231 KAKVKAKVVDWYQIPENEETIRRELVKNGPVAVGINARTLQFYEGGIVDP-----KNCDD 285
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
++H VLIVGYGV + PYW+IKN WG WG K
Sbjct: 286 KINHAVLIVGYGVEEGI------PYWLIKNQWGAEWGIK 318
>gi|371781445|emb|CCA95082.1| putative responsive to dehydration 19, partial [Ginkgo biloba]
Length = 130
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/121 (49%), Positives = 80/121 (66%), Gaps = 5/121 (4%)
Query: 91 GGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANA 150
GGLE E+DYPY G++ C + +++ + ++ VS DE ++A LVKNGP++V INA
Sbjct: 9 GGLEKEEDYPYTGTDGTCKFDDKKVVAAVSNFSVVSIDEDQIAANLVKNGPLSVGINAVF 68
Query: 151 MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG-VHKTKFTHKIQPYWIIKNSWGPHWGE 209
MQ Y GGVS P ++C NLDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 69 MQTYIGGVSCP--YICS--KRNLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGANWGE 124
Query: 210 K 210
+
Sbjct: 125 Q 125
>gi|19747207|gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]
Length = 500
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 208 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 267
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 268 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 321
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 322 VGYN------DSAAVPYWIIKNSWTTQWGEE 346
>gi|11464864|gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 289 VGYN------DSAAVPYWIIKNSWTTQWGEE 313
>gi|118157|sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName:
Full=Major cysteine proteinase; Flags: Precursor
gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi]
gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi]
Length = 467
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 289 VGYN------DSAAVPYWIIKNSWTTQWGEE 313
>gi|71663163|ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 289 VGYN------DSAAVPYWIIKNSWTTQWGEE 313
>gi|358339355|dbj|GAA47435.1| cathepsin F [Clonorchis sinensis]
Length = 1157
Score = 118 bits (295), Expect = 3e-24, Method: Composition-based stats.
Identities = 59/149 (39%), Positives = 87/149 (58%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD+ GCGGG +++I + GGLE E DY Y G + CH N + + S
Sbjct: 763 QLVDCDRSSRGCGGGYPPATYDSI--RRIGGLEIELDYRYTGRDGVCHQNPRKFVAYVNS 820
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V ++ DE +A++L +GP+++A+NA +QFY G+ HP C + ++ H VL VG
Sbjct: 821 SVALTKDENTIAEWLSYHGPISMALNARLLQFYVSGIMHPPAAYCP--VKDISHAVLSVG 878
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+G T P+WI+KNSWG WGE+
Sbjct: 879 FG------TKGNVPFWIVKNSWGTLWGEE 901
Score = 117 bits (294), Expect = 3e-24, Method: Composition-based stats.
Identities = 63/149 (42%), Positives = 82/149 (55%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD++D GC GG AFE I + GGLE E DYPY G C N V I
Sbjct: 514 QLVDCDRIDQGCAGGTPYGAFEGI--QQLGGLELEADYPYLGHQDNCQSNPLRFVVSING 571
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE ++A+YL +GP++V IN +Q+Y G+ PL C ++H L VG
Sbjct: 572 SVQLPKDEDQIAQYLFDHGPLSVGINGALLQYYSSGIMQPLWDNCNPA--EMNHAGLAVG 629
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+G F + PYW IKNSWG WGE+
Sbjct: 630 FG-----FEQDV-PYWTIKNSWGMLWGEE 652
Score = 107 bits (266), Expect = 6e-21, Method: Composition-based stats.
Identities = 57/154 (37%), Positives = 85/154 (55%), Gaps = 10/154 (6%)
Query: 59 LALKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK 118
++ ++VDCD D GC GG +A+E + +LGG LE YPY G + C +
Sbjct: 245 ISAEVVDCDHADHGCSGGFPIHAYECV-QRLGG-LELAVRYPYVGYQQYCQADPRYFVAY 302
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
I V + D ++AK+L GP++V ++A +Q+Y G+ +P C + L+H VL
Sbjct: 303 INGSVALPKDSEQIAKFLATFGPLSVVLDARLLQYYRSGILNPSVAYCNP--EELNHAVL 360
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTM 212
VG+G T + PYWIIKNSWG WGE+ +
Sbjct: 361 SVGFG------TEQGIPYWIIKNSWGEQWGEQHL 388
Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats.
Identities = 56/153 (36%), Positives = 87/153 (56%), Gaps = 10/153 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GG +AF + +LGG L+ DYPY S +AC N ++ +
Sbjct: 24 QLVDCDHVDRGCEGGFPLDAF-MAVQRLGG-LQLSIDYPYIASRQACQFNPKQAVAFVTG 81
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + +E +A+YL +NGP++V +N+ ++FY G+ + C + L+H L VG
Sbjct: 82 FAALPRNELLIAEYLHRNGPLSVGLNSRTLKFYNSGILNLAAEQCDP--EALNHAALAVG 139
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPF 214
+G T + P+WIIKN++G WGE+ F
Sbjct: 140 FG------TDESTPFWIIKNTFGKDWGEQLDEF 166
Score = 75.5 bits (184), Expect = 2e-11, Method: Composition-based stats.
Identities = 37/105 (35%), Positives = 59/105 (56%), Gaps = 6/105 (5%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD VD GCGGG + + I+ GGLE DYPY ++ C + + + R +
Sbjct: 1050 QLIDCDSVDDGCGGGYPPDTYGDIVKM--GGLELNADYPYIAADGVCKMERSKFRAYVNK 1107
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQ----FYFGGVSHPL 162
+ + + E + A +L KNGP++ INA+ +Q FY V+ P+
Sbjct: 1108 SLVLPTKEDQQAVWLSKNGPLSAGINADYLQVVILFYERSVNGPI 1152
>gi|13625989|gb|AAK35220.1|AF362769_1 pre-procathepsin L [Paragonimus westermani]
Length = 235
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 87/148 (58%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD++D GC GG ++ I K GGLE + YPY G +AC L++ ++ KI
Sbjct: 73 QLVDCDRLDHGCSGGYPPYTYKEI--KRMGGLELQSAYPYTGWEQACRLDRSKLFAKIDD 130
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + +E + A +L ++GPM+ +NA +QFY G+ HP ++ C + L+H VL VG
Sbjct: 131 SIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPSEYACS--PEGLNHAVLTVG 188
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
Y T + PYW ++NSWG WGE
Sbjct: 189 YD------TERGVPYWTVRNSWGTRWGE 210
>gi|204307632|gb|ACI00342.1| cysteine proteinase [Trypanosoma cruzi]
gi|204307636|gb|ACI00344.1| cysteine proteinase [Trypanosoma cruzi]
gi|204307638|gb|ACI00345.1| cysteine proteinase [Trypanosoma cruzi]
Length = 339
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 47 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 106
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 107 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 160
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW WGE
Sbjct: 161 VGYN------DSAAVPYWVIKNSWTTQWGE 184
>gi|71402717|ref|XP_804236.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
gi|70867097|gb|EAN82385.1| cruzipain precursor, putative [Trypanosoma cruzi]
Length = 247
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 25 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGRTVGATI 84
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + LDHGVL+
Sbjct: 85 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTSCV------SEQLDHGVLL 138
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE
Sbjct: 139 VGYN------DSAAVPYWIIKNSWTAQWGED 163
>gi|237637238|gb|ACR07919.1| cysteine proteinase [Trypanosoma cruzi]
Length = 345
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 53 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 113 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW WGE
Sbjct: 167 VGYN------DSAAVPYWVIKNSWTTQWGE 190
>gi|290999038|ref|XP_002682087.1| predicted protein [Naegleria gruberi]
gi|284095713|gb|EFC49343.1| predicted protein [Naegleria gruberi]
Length = 349
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 90/170 (52%), Gaps = 29/170 (17%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-------------------- 101
+LVDCD ++ GC GG A + I + GGL E YPY
Sbjct: 160 QLVDCDNLNCGCFGGFPFIAMQYIQKR--GGLATESSYPYCIPPLGNCFPCNTNKTYCPS 217
Query: 102 -KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSH 160
+ NR C + ++ K+ Y NVS +E ++A YLVKNGP+++ +NA +QFY G+S
Sbjct: 218 GEYCNRTCSVQNYQLVAKVAGYENVSQNEDDIAAYLVKNGPLSICLNAMWLQFYHSGISD 277
Query: 161 PLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
P+ C ++DH VL+VG+G H K YWI+KNSWG WGEK
Sbjct: 278 PM--YCP---PDIDHAVLLVGFGTHTNWLGEKTN-YWIVKNSWGESWGEK 321
Score = 37.0 bits (84), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 27/41 (65%)
Query: 10 HDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
+D+ E + F HF + + K YAT+EE+H+R +IF N+ +
Sbjct: 6 YDEKEALNYFQHFKKLYLKRYATEEEHHRRWKIFYDNINLV 46
>gi|340504799|gb|EGR31212.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 250
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/148 (41%), Positives = 90/148 (60%), Gaps = 14/148 (9%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY-PYKGSNRACHLNKEEIRVKIQ 120
+L+DCD ++ GC GGLM++A++ I + GGLE +DY Y S C ++ ++ K+
Sbjct: 90 QLIDCDSINDGCRGGLMTDAYKAI--QEMGGLETSEDYGEYLNSKGQCKIDSNKVSAKVI 147
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
++ +S DE + + LV+NGP+AV +NA +QFY GG+ P LC D+++H VLIV
Sbjct: 148 NWYQISEDEEAIRRELVQNGPIAVGVNARFLQFYQGGILDPK--LCD---DSINHAVLIV 202
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
GYG K YWIIKN WG WG
Sbjct: 203 GYGEENGK------KYWIIKNQWGKSWG 224
>gi|146335576|gb|ABQ23397.1| cathepsin L [Trypanosoma carassii]
Length = 456
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/152 (44%), Positives = 94/152 (61%), Gaps = 19/152 (12%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNRA--CHLNKEEIRVKI 119
LV CD D GC GGLM NAFE I+++ G + E+ YPY GS A C + ++ I
Sbjct: 172 LVSCDARDYGCSGGLMDNAFEWIVNQNDGFVFTEESYPYASGSGDAPLCDVGGRKVGATI 231
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ +V + +DE +MA +L NGP+++A++A++ + Y GGV C+ G LDHGVL+
Sbjct: 232 KGHVGLPNDEEKMAAWLAANGPISIAVDADSFKAYKGGVLTG----CEEGQ--LDHGVLL 285
Query: 180 VGYGVHKTKFTHKIQ--PYWIIKNSWGPHWGE 209
VGY +K+ PYWIIKNSWGP+WGE
Sbjct: 286 VGY--------NKVANPPYWIIKNSWGPNWGE 309
>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
Length = 367
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 81/151 (53%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ + G + EK YPY G C ++ I
Sbjct: 171 LVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHKVGATI 230
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 231 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 284
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGEK
Sbjct: 285 VGYN------DSSKPPYWIIKNSWSSSWGEK 309
>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 452
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 80/151 (52%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGG M NAFE I+ + G + EK YPY G C E+ I
Sbjct: 171 LVSCDSKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHEVGATI 230
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 231 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 284
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGEK
Sbjct: 285 VGYN------DSSKPPYWIIKNSWSSSWGEK 309
>gi|237637242|gb|ACR07921.1| cysteine proteinase [Trypanosoma cruzi]
Length = 345
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 53 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 113 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW WGE
Sbjct: 167 VGYN------DSAAVPYWVIKNSWTTQWGE 190
>gi|71406896|ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 426
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 289 VGYN------DSAAVPYWIIKNSWTTQWGEE 313
>gi|237637236|gb|ACR07918.1| cysteine proteinase [Trypanosoma cruzi]
Length = 345
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 53 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 113 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW WGE
Sbjct: 167 VGYN------DSAAVPYWVIKNSWTTQWGE 190
>gi|394331805|gb|AFN27125.1| cysteine protease [Leishmania major]
Length = 348
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVSTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|332326581|gb|AEE42614.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 89/153 (58%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIR-V 117
+LV CD D+GCGGGLM+ AFE ++ + G + E YPY G AC + + +
Sbjct: 177 QLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMXTEDSYPYVSSTGDVPACTNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S ET MA +L K+GP+++A++A++ Y GV L L+HGV
Sbjct: 237 RIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYXSGV------LTSCAGKXLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 291 LLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum]
Length = 347
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 66/159 (41%), Positives = 85/159 (53%), Gaps = 20/159 (12%)
Query: 63 LVDCD----------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLN 111
LVDCD D GC GGL NAF+ II GG++ E YPY + C
Sbjct: 168 LVDCDHHCMTYDGQQSCDDGCNGGLQPNAFQYIIGN--GGIDTETSYPYLAVAQDKCQFK 225
Query: 112 KEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
I KI ++ +S++ET++A YL NGP+++A +A QFY GGV C
Sbjct: 226 ASNIGAKISNWQMLSTNETQIAAYLALNGPVSIAADAAEWQFYIGGV---FDLPCGKA-- 280
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHG+LIVGY F H +PYW +KNSWG WGE+
Sbjct: 281 -LDHGILIVGYDTETNIFGHA-KPYWWVKNSWGASWGEQ 317
>gi|8468605|gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAF I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCGGGLMNNAFGWIVQENNGAVYTENSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWIIKNSW WGE
Sbjct: 289 VGYN------DSAAVPYWIIKNSWTAQWGE 312
>gi|204307634|gb|ACI00343.1| cysteine proteinase [Trypanosoma cruzi]
Length = 339
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 47 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATI 106
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + LDHGVL+
Sbjct: 107 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTSCV------SEQLDHGVLL 160
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW WGE
Sbjct: 161 VGYN------DSAAVPYWVIKNSWTTQWGE 184
>gi|255088003|ref|XP_002505924.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226521195|gb|ACO67182.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 291
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 98/171 (57%), Gaps = 20/171 (11%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
+LVDCD D GC GGL NA + GL+ E +YPYKG + C +
Sbjct: 113 QLVDCDHTCDPSAPRNCDYGCNGGLPLNAMRYVQKH---GLDTESNYPYKGVDGKCASAR 169
Query: 113 E-EIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
+ S+ VS++ET++A L+K+GP+++ I+A MQ Y GGV+ P ++C
Sbjct: 170 HGPAAASVSSFNLVSTNETQIAAALLKHGPLSIGIDAAWMQTYVGGVACP--WICNKA-- 225
Query: 172 NLDHGVLIVGYGVHKTKFT---HKIQPYWIIKNSWGPHWGEKTMPFWIIKN 219
LDHGVLIVGYGV+ T H+ Q YWI+KNSWGP+WG + + I K+
Sbjct: 226 GLDHGVLIVGYGVNGTAPARPWHRRQDYWIVKNSWGPNWGVEGGYYHICKD 276
>gi|378943060|gb|AFC76271.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYTSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 454
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 81/151 (53%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ + G + EK YPY G C ++ I
Sbjct: 171 LVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHKVGATI 230
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 231 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 284
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGEK
Sbjct: 285 VGYN------DSSKPPYWIIKNSWSSSWGEK 309
>gi|71663165|ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi]
Length = 467
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDFGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 289 VGYN------DSAAVPYWIIKNSWTTQWGEE 313
>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
Length = 327
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 84/149 (56%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD+ GC GG ++++ I+ GGLE E DYPY G + C LNKE++ KI
Sbjct: 164 QLVDCDRAAQGCNGGWPASSYLEIMYM--GGLESESDYPYVGVEQTCALNKEKLVAKIDD 221
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + +E + A YL ++GP++ +NA A+Q Y GV P C L+H VL VG
Sbjct: 222 SIVLGPEEEDHAAYLAEHGPLSTLLNAVALQHYQSGVLKPTFDECPD--TELNHAVLTVG 279
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y PYWIIKNSWG WGEK
Sbjct: 280 YDKEGD------MPYWIIKNSWGTDWGEK 302
>gi|633096|dbj|BAA04664.1| prepro NTP [Paragonimus westermani]
Length = 245
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 84/149 (56%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD GC GG ++++ I+ GGLE E DYPY G + C LNKE++ KI
Sbjct: 82 QLVDCDMAAEGCNGGWPASSYLEIMYM--GGLESESDYPYVGVEQTCALNKEKLVAKIDD 139
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + +E + A YL ++GP++ +NA A+Q+Y GV P C L+H VL VG
Sbjct: 140 SIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPTFEECPD--TELNHAVLTVG 197
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y PYWIIKNSWG WGEK
Sbjct: 198 YDKEGD------MPYWIIKNSWGTDWGEK 220
>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 441
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 81/151 (53%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ + G + EK YPY G C ++ I
Sbjct: 171 LVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHKVGATI 230
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 231 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 284
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGEK
Sbjct: 285 VGYN------DSSKPPYWIIKNSWSSSWGEK 309
>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
partial [Trypanosoma vivax Y486]
Length = 323
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 81/151 (53%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGG M NAFE I+ + G + EK YPY GS C E+ I
Sbjct: 171 LVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVSEDGSKPFCIPYGHEVGATI 230
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 231 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 284
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGEK
Sbjct: 285 VGYN------DSSKPPYWIIKNSWSSSWGEK 309
>gi|378943048|gb|AFC76265.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|8569514|pdb|1EWP|A Chain A, Cruzain Bound To Mor-Leu-Hpq
gi|9955113|pdb|1F29|A Chain A, Crystal Structure Analysis Of Cruzain Bound To A Vinyl
Sulfone Derived Inhibitor (I)
gi|9955114|pdb|1F29|B Chain B, Crystal Structure Analysis Of Cruzain Bound To A Vinyl
Sulfone Derived Inhibitor (I)
gi|9955115|pdb|1F29|C Chain C, Crystal Structure Analysis Of Cruzain Bound To A Vinyl
Sulfone Derived Inhibitor (I)
gi|9955116|pdb|1F2A|A Chain A, Crystal Structure Analysis Of Cruzain Bound To A Vinyl
Sulfone Derived Inhibitor (Ii)
gi|9955117|pdb|1F2B|A Chain A, Crystal Structure Analysis Of Cruzain Bound To Vinyl
Sulfone Derived Inhibitor (Iii)
gi|9955118|pdb|1F2C|A Chain A, Crystal Structure Analysis Of Cryzain Bound To Vinyl
Sulfone Derived Inhibitor (Iv)
gi|27573887|pdb|1ME3|A Chain A, High Resolution Crystal Structure Analysis Of Cruzain Non-
Covalently Bound To A Hydroxymethyl Ketone Inhibitor
(Ii)
gi|27573888|pdb|1ME4|A Chain A, High Resolution Crystal Structure Analysis Of Cruzain
Non-Covalently Bound To A Hydroxymethyl Ketone Inhibitor
(I)
gi|33356864|pdb|1EWL|A Chain A, Crystal Structure Of Cruzain Bound To Wrr-99
gi|33356865|pdb|1EWM|A Chain A, The Cysteine Protease Cruzain Bound To Wrr-112
gi|33356866|pdb|1EWO|A Chain A, The Cysteine Protease Cruzain Bound To Wrr-204
gi|62738070|pdb|1U9Q|X Chain X, Crystal Structure Of Cruzain Bound To An Alpha-Ketoester
gi|157834543|pdb|2AIM|A Chain A, Cruzain Inhibited With
Benzoyl-Arginine-Alanine-Fluoromethylketone
gi|168988657|pdb|2OZ2|A Chain A, Crystal Structure Analysis Of Cruzain Bound To Vinyl
Sulfone Derived Inhibitor (K11777)
gi|168988658|pdb|2OZ2|C Chain C, Crystal Structure Analysis Of Cruzain Bound To Vinyl
Sulfone Derived Inhibitor (K11777)
gi|281307097|pdb|3I06|A Chain A, Crystal Structure Of Cruzain Covalently Bound To A Purine
Nitrile
gi|300193186|pdb|3KKU|A Chain A, Cruzain In Complex With A Non-Covalent Ligand
gi|309319934|pdb|3LXS|A Chain A, Crystal Structure Analysis Of Cruzain Bound To Vinyl
Sulfone Derived Inhibitor (Wrr483)
gi|309319935|pdb|3LXS|C Chain C, Crystal Structure Analysis Of Cruzain Bound To Vinyl
Sulfone Derived Inhibitor (Wrr483)
Length = 215
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 53 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 113 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 167 VGYN------DSAAVPYWIIKNSWTTQWGEE 191
>gi|394331743|gb|AFN27094.1| cysteine protease [Leishmania major]
Length = 348
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|378943046|gb|AFC76264.1| cathepsin L-like protease [Leishmania major]
gi|378943056|gb|AFC76269.1| cathepsin L-like protease [Leishmania major]
gi|394331745|gb|AFN27095.1| cysteine protease [Leishmania major]
Length = 348
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|394331735|gb|AFN27090.1| cysteine protease [Leishmania major]
Length = 348
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|157864851|ref|XP_001681134.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124428|emb|CAJ02284.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|378943050|gb|AFC76266.1| cathepsin L-like protease [Leishmania major]
gi|378943052|gb|AFC76267.1| cathepsin L-like protease [Leishmania major]
gi|378943054|gb|AFC76268.1| cathepsin L-like protease [Leishmania major]
gi|378943058|gb|AFC76270.1| cathepsin L-like protease [Leishmania major]
gi|394331737|gb|AFN27091.1| cysteine protease [Leishmania major]
gi|394331741|gb|AFN27093.1| cysteine protease [Leishmania major]
gi|394331747|gb|AFN27096.1| cysteine protease [Leishmania major]
gi|394331749|gb|AFN27097.1| cysteine protease [Leishmania major]
gi|394331751|gb|AFN27098.1| cysteine protease [Leishmania major]
gi|394331753|gb|AFN27099.1| cysteine protease [Leishmania major]
gi|394331755|gb|AFN27100.1| cysteine protease [Leishmania major]
gi|394331757|gb|AFN27101.1| cysteine protease [Leishmania major]
gi|394331759|gb|AFN27102.1| cysteine protease [Leishmania major]
gi|394331761|gb|AFN27103.1| cysteine protease [Leishmania major]
gi|394331763|gb|AFN27104.1| cysteine protease [Leishmania major]
gi|394331765|gb|AFN27105.1| cysteine protease [Leishmania major]
gi|394331767|gb|AFN27106.1| cysteine protease [Leishmania major]
gi|394331769|gb|AFN27107.1| cysteine protease [Leishmania major]
gi|394331771|gb|AFN27108.1| cysteine protease [Leishmania major]
gi|394331773|gb|AFN27109.1| cysteine protease [Leishmania major]
gi|394331775|gb|AFN27110.1| cysteine protease [Leishmania major]
gi|394331777|gb|AFN27111.1| cysteine protease [Leishmania major]
gi|394331779|gb|AFN27112.1| cysteine protease [Leishmania major]
gi|394331781|gb|AFN27113.1| cysteine protease [Leishmania major]
gi|394331783|gb|AFN27114.1| cysteine protease [Leishmania major]
gi|394331785|gb|AFN27115.1| cysteine protease [Leishmania major]
gi|394331787|gb|AFN27116.1| cysteine protease [Leishmania major]
gi|394331789|gb|AFN27117.1| cysteine protease [Leishmania major]
gi|394331791|gb|AFN27118.1| cysteine protease [Leishmania major]
gi|394331793|gb|AFN27119.1| cysteine protease [Leishmania major]
gi|394331795|gb|AFN27120.1| cysteine protease [Leishmania major]
gi|394331797|gb|AFN27121.1| cysteine protease [Leishmania major]
gi|394331799|gb|AFN27122.1| cysteine protease [Leishmania major]
gi|394331801|gb|AFN27123.1| cysteine protease [Leishmania major]
gi|394331803|gb|AFN27124.1| cysteine protease [Leishmania major]
Length = 348
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|157864849|ref|XP_001681133.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124427|emb|CAJ02283.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGKDWGEK 317
>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 380
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 91/151 (60%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D GC GGLM +AF+ I+S G + E+ YPY G+ AC + + + KI
Sbjct: 178 LVSCDTNDFGCEGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPACDKSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ +V++ DE +A++L KNGP+A+A++A + Q Y GGV L ++LDHGVL+
Sbjct: 238 RDHVDLPEDENAIAEWLAKNGPVAIAVDATSFQSYTGGV------LTSCISEHLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGE+
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWSKGWGEE 316
>gi|290560248|pdb|3IUT|A Chain A, The Crystal Structure Of Cruzain In Complex With A
Tetrafluorophenoxymethyl Ketone Inhibitor
Length = 221
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 90/158 (56%), Gaps = 16/158 (10%)
Query: 57 THLALK-LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNK 112
T+LA + LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C +
Sbjct: 46 TNLAEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSG 105
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+ I +V + DE ++A +L NGP+AVA++A++ Y GGV + +
Sbjct: 106 HTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQ 159
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGY PYWIIKNSW WGE+
Sbjct: 160 LDHGVLLVGYN------DGAAVPYWIIKNSWTTQWGEE 191
>gi|260656302|pdb|3HD3|A Chain A, High Resolution Crystal Structure Of Cruzain Bound To The
Vinyl Sulfone Inhibitor Smdc-256047
gi|260656303|pdb|3HD3|B Chain B, High Resolution Crystal Structure Of Cruzain Bound To The
Vinyl Sulfone Inhibitor Smdc-256047
Length = 215
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 90/158 (56%), Gaps = 16/158 (10%)
Query: 57 THLALK-LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNK 112
T+LA + LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C +
Sbjct: 46 TNLAEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSG 105
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+ I +V + DE ++A +L NGP+AVA++A++ Y GGV + +
Sbjct: 106 HTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQ 159
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGY PYWIIKNSW WGE+
Sbjct: 160 LDHGVLLVGYN------DGAAVPYWIIKNSWTTQWGEE 191
>gi|394331739|gb|AFN27092.1| cysteine protease [Leishmania major]
Length = 348
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|343473085|emb|CCD14932.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 225
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/151 (44%), Positives = 89/151 (58%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY--KGSN-RACHLNKEEIRVKI 119
LV CD D GC GGLM NAF+ I+S + E+ YPY KG N C ++ + + KI
Sbjct: 2 LVSCDTEDLGCAGGLMDNAFKWIVSSNKHNVFTEQSYPYASKGGNVPPCRMSGKVVGAKI 61
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ +V++ DE +A++L KNGP+A+A++A + Q Y GGV L LDHGVL+
Sbjct: 62 RDHVDLPKDENAIAEWLAKNGPVAIAVDATSFQDYTGGV------LTSCISKQLDHGVLL 115
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGE+
Sbjct: 116 VGY-----DDTSK-PPYWIIKNSWSEKWGEE 140
>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 447
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 81/151 (53%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGG M NAFE I+ + G + EK YPY GS C E+ I
Sbjct: 163 LVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVSEDGSKPFCIPYGHEVGATI 222
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 223 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 276
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGEK
Sbjct: 277 VGYN------DSSKPPYWIIKNSWSSSWGEK 301
>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 83/150 (55%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AV ++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAVNGPVAVGVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWIIKNSW WGE
Sbjct: 289 VGYN------DSAAVPYWIIKNSWTTQWGE 312
>gi|157864845|ref|XP_001681131.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124425|emb|CAJ02281.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 89/154 (57%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVSTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E M +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMTAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|204307640|gb|ACI00346.1| cysteine proteinase [Trypanosoma cruzi]
Length = 339
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 47 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 106
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 107 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 160
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYW+IKNSW WGE
Sbjct: 161 VGYN------DSAAVPYWVIKNSWTAQWGE 184
>gi|157829894|pdb|1AIM|A Chain A, Cruzain Inhibited By
Benzoyl-Tyrosine-Alanine-Fluoromethylketone
Length = 215
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 53 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 113 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEALDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 167 VGYN------DSAAVPYWIIKNSWTTQWGEE 191
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/172 (41%), Positives = 100/172 (58%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R GT ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 155 LEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 212
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+G + +CH NK I + V++ DE +MA+ + GP++VAI+A+ + QFY G+
Sbjct: 213 EGIDDSCHFNKATIGATDRGSVDIPQGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGI 272
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VGYG ++ Q YW++KNSWG WG+K
Sbjct: 273 YNEPQ--CD--PQNLDHGVLVVGYGTDESG-----QDYWLVKNSWGTTWGDK 315
>gi|237637248|gb|ACR07924.1| cysteine proteinase [Trypanosoma cruzi]
Length = 345
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 53 LVSCDKTDFGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 113 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 167 VGYN------DSAAVPYWIIKNSWTTQWGEE 191
>gi|157864847|ref|XP_001681132.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124426|emb|CAJ02282.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|237637244|gb|ACR07922.1| cysteine proteinase [Trypanosoma cruzi]
Length = 345
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 53 LVSCDKTDFGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 112
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 113 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 166
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 167 VGYN------DSAAVPYWIIKNSWTTQWGEE 191
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 90/154 (58%), Gaps = 11/154 (7%)
Query: 62 KLVDC-DKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC DK + GC GGLM NAF+ I + GG++ E YPY+ N C + +
Sbjct: 159 QLVDCSDKYGNHGCQGGLMDNAFKYI--EANGGIDSEASYPYEAKNGKCRFQQSAVAATC 216
Query: 120 QSYVNVSSDETEMAKYLVKN-GPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHG 176
Y ++ D+ + + V N GP++VA++A+ + Q Y GV PL LC LDHG
Sbjct: 217 TGYKDIPHDDIDGLQDAVANVGPISVAMDASHSSFQLYAAGVYDPL--LCSS--TRLDHG 272
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL VGYG + H+ +PYW++KNSWGP WG++
Sbjct: 273 VLAVGYGTEPSGLFHEEKPYWLVKNSWGPDWGQQ 306
>gi|157864843|ref|XP_001681130.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124424|emb|CAJ02280.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY G C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYTSTFGYVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGKDWGEK 317
>gi|1848231|gb|AAB48120.1| cathepsin L-like protease [Leishmania major]
Length = 443
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|157864853|ref|XP_001681135.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|157864857|ref|XP_001681137.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124429|emb|CAJ02285.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124431|emb|CAJ02287.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 90/154 (58%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|375073984|gb|AFA34859.1| cathepsin L-like protein [Trypanosoma rangeli]
Length = 467
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 83/150 (55%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GC GGLM NAF+ I+ K G + E Y Y G+++ C ++ + I
Sbjct: 177 LVSCDNADNGCDGGLMDNAFDWIVGKNNGTVYTEASYSYVSGGGNSQKCDMSGHVVGAVI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++A + Y GGV L D LDHGV++
Sbjct: 237 SGHVDLPKDEDKMAAWLAANGPLAIAVDATSFMSYTGGV------LTNCISDQLDHGVVL 290
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWIIKNSWG WGE
Sbjct: 291 VGYNDSSNP------PYWIIKNSWGADWGE 314
>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
Length = 467
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D+GC GGLM++AFE I+ G + E+ Y Y G + C + + I
Sbjct: 175 LVSCDTMDSGCDGGLMNSAFEWIVEHHNGTVYTEESYRYASGDGIAQPCRTSGRTVGAVI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE +MA +L NGP+AVA++A++ FY GGV L + LDHGVL+
Sbjct: 235 TGHVKLPPDEAKMATWLAANGPLAVAVDASSWMFYTGGV------LTSCVSNELDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWI+KNSWG WGE
Sbjct: 289 VGYN------DSAAPPYWIVKNSWGTLWGE 312
>gi|375073980|gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei]
Length = 467
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM++AFE I+ + G + E+ YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCSGGLMNDAFEWIVQENDGAVYTEESYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A + Y GGV + + LDHGVL+
Sbjct: 235 TGHVELPQDEAQIAAWLAANGPVAVAVDATSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWIIKNSW WGE
Sbjct: 289 VGYN------DSAPVPYWIIKNSWTTLWGE 312
>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 88/156 (56%), Gaps = 20/156 (12%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
+LVDCD K D GC GGLM NAF + S LE E YPY + +C N+ V +
Sbjct: 164 QLVDCDTKEDQGCNGGLMDNAFTYLES---AKLETESAYPYTAVDGSCKYNQSLGVVGVA 220
Query: 121 SYVN------VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLD 174
S+V+ V+ E M L GP++VAINAN +QFY GG+S+PL +C + L+
Sbjct: 221 SFVDIEQGKTVADTENTMGVALDNIGPLSVAINANNLQFYAGGISNPL--ICN--PNGLN 276
Query: 175 HGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
HGVLIVG G K +W +KNSWG WGEK
Sbjct: 277 HGVLIVGLGSENGK------DFWKVKNSWGASWGEK 306
Score = 40.8 bits (94), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 26/37 (70%)
Query: 16 VAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQI 52
A F F + +NK Y+++E Y+ RL IF+ NL++I++
Sbjct: 27 AAAFKKFTQTYNKKYSSEEHYNARLSIFKENLRRIEL 63
>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 89/151 (58%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY--KGSN-RACHLNKEEIRVKI 119
LV CD D GC GGLM NAF+ I+S + E+ YPY KG N C ++ + + KI
Sbjct: 178 LVSCDTEDLGCAGGLMDNAFKWIVSSNRHNVFTEESYPYASKGGNVPPCRMSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ +V++ DE +A++L KNGP+A+A+++ + Q Y GGV L LDHGVL+
Sbjct: 238 RDHVDLPKDENAIAEWLAKNGPVAIAVDSTSFQSYTGGV------LTSCISKQLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGE+
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWSKGWGEE 316
>gi|15617524|ref|NP_258322.1| cathepsin-like cysteine proteinase [Spodoptera litura NPV]
gi|37077642|sp|Q91BH1.1|CATV_NPVST RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15553260|gb|AAL01738.1|AF325155_50 cathepsin-like cysteine proteinase [Spodoptera litura NPV]
Length = 337
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/150 (42%), Positives = 88/150 (58%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ- 120
+L+DCD+VD GC GGLM AF+ II GG+E E DYPY+G AC L ++ V++
Sbjct: 177 QLLDCDRVDQGCDGGLMHLAFQEIIRI--GGVEHEIDYPYQGIEYACRLAPSKLAVRLSH 234
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y DE ++ + L KNGP+AVAI+ + Y G++ +C + L+H VL+V
Sbjct: 235 CYQYDLRDERKLLELLYKNGPIAVAIDCVDIIDYRSGIAT----VCND--NGLNHAVLLV 288
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYG+ PYWI KNSWG +WGE
Sbjct: 289 GYGIENDT------PYWIFKNSWGSNWGEN 312
>gi|157864855|ref|XP_001681136.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124430|emb|CAJ02286.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 89/154 (57%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 235
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E M +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMTAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|1581745|prf||2117247A Cys protease:ISOTYPE=1
Length = 467
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 83/151 (54%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GC GGLM +AF+ I+ + G + E Y Y G ++ C ++ + I
Sbjct: 177 LVSCDNADNGCDGGLMDSAFDWIVEQNNGSVYTEASYSYVSGGGDSQTCDMSDHVVGAVI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++A + Y GGV L D LDHGV++
Sbjct: 237 SGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTGGV------LTNCVSDQLDHGVVL 290
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSWG WGE+
Sbjct: 291 VGYNDSSNP------PYWIIKNSWGADWGEE 315
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 71/173 (41%), Positives = 96/173 (55%), Gaps = 19/173 (10%)
Query: 46 NLKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYP 100
+L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YP
Sbjct: 153 SLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGVDTEKSYP 210
Query: 101 YKGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGG 157
Y+G + +CH NK + +V++ DE M K + GP+AVAI+A+ + Q Y G
Sbjct: 211 YEGIDDSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAIDASNESFQLYSEG 270
Query: 158 VSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V + C DNLDHGVL+VGYG K Q YW++KNSWG WG++
Sbjct: 271 VYNDPN--CSS--DNLDHGVLVVGYGTDKDG-----QDYWLVKNSWGTTWGDQ 314
>gi|71084302|gb|AAZ23596.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 91/153 (59%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIR-V 117
+LV CD +D+GCGGGLM+ AFE ++ + G + E YPY G C + + +
Sbjct: 177 QLVSCDDMDSGCGGGLMTQAFEWLLRNMNGTMFTEDSYPYVSTFGYVPECTNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L K+GP+++ ++A++ Y GGV L L+HGV
Sbjct: 237 RIDGYVMIESNETVMAAWLAKSGPISIGVDASSFMSYHGGV------LTSCAGKQLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG +WGEK
Sbjct: 291 LLVGY-----NMTGEV-PYWVIKNSWGENWGEK 317
>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
Length = 444
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 90/151 (59%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D GC GGLM +AF+ I+S G + E+ YPY G+ C + + + KI
Sbjct: 178 LVSCDTNDFGCEGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ +V++ DE +A++L KNGP+A+A++A + Q Y GGV L ++LDHGVL+
Sbjct: 238 RDHVDLPEDENAIAEWLAKNGPVAIAVDATSFQSYTGGV------LTSCISEHLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGE+
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWSKGWGEE 316
>gi|1581746|prf||2117247B Cys protease:ISOTYPE=2
Length = 467
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GC GGLM +AF+ I+ + G + E Y Y G ++ C+++ + I
Sbjct: 177 LVSCDNADNGCDGGLMDSAFDWIVGQNNGSVYTEASYSYVSGGGDSQTCNMSSHVVGAVI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++A + Y GGV L D LDHGV++
Sbjct: 237 SGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTGGV------LTNCVSDQLDHGVVL 290
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSWG WGE+
Sbjct: 291 VGYNDSSNP------PYWIIKNSWGADWGEE 315
>gi|394331814|gb|AFN27126.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 88/152 (57%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-V 117
+LV CD D+GCGGGLM AFE ++ + G + E YPY S+ C + + +
Sbjct: 177 QLVSCDDKDSGCGGGLMLQAFEWLLRNMNGTMFTEDSYPYVSSSGYVPECSNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I Y+ + S ET MA +L KNGP+++A++A++ Y GV L D L+HGV
Sbjct: 237 RIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYESGV------LTSCAGDTLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGY T ++ PYW+IKNSWG WGE
Sbjct: 291 LLVGY-----NMTGEV-PYWVIKNSWGEDWGE 316
>gi|1581747|prf||2117247C Cys protease:ISOTYPE=3
Length = 469
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 83/151 (54%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GC GGLM +AF+ I+ + G + E Y Y G ++ C ++ + I
Sbjct: 179 LVSCDNADNGCDGGLMDSAFDWIVEQNNGSVYTEASYSYVSGGGDSQTCDMSDHVVGAVI 238
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++A + Y GGV L D LDHGV++
Sbjct: 239 SGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTGGV------LTNCVSDQLDHGVVL 292
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSWG WGE+
Sbjct: 293 VGYNDSSNP------PYWIIKNSWGADWGEE 317
>gi|358364413|gb|AEU08937.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/142 (42%), Positives = 81/142 (57%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM NAF+ ++ GG + E YPY G AC N+ E+ K+
Sbjct: 30 LVSCDTLDYGCNGGLMDNAFKWLVESNGGNVYTEGSYPYVSGSGQTPACSTNQHEVGAKV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
SYVN+ DE +MA ++ NGP+AVA++AN+ Y GV L + L+HGVL+
Sbjct: 90 TSYVNLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSNQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY + PYWIIKN
Sbjct: 144 VGYDDSSSP------PYWIIKN 159
>gi|340503366|gb|EGR29962.1| hypothetical protein IMG5_145110 [Ichthyophthirius multifiliis]
Length = 1095
Score = 114 bits (284), Expect = 4e-23, Method: Composition-based stats.
Identities = 59/148 (39%), Positives = 86/148 (58%), Gaps = 13/148 (8%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY-PYKGSNRACHLNKEEIRVKIQ 120
+LVDCD ++ GC GGLM++A++ + + GGLE +DY YK C + +++ KI+
Sbjct: 934 QLVDCDDINDGCHGGLMTDAYKYL--QQSGGLEFAEDYGDYKNKKEKCKFDLNKVQAKIK 991
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+ + DE + K L +NGP+A +NA +QFY G+ P K +++H +LIV
Sbjct: 992 EWQQIDEDEEIIKKQLYQNGPIAAGVNARLLQFYKSGIFDP-----KECDSDINHAILIV 1046
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
GYGV K Q YWIIKN WG WG
Sbjct: 1047 GYGVEKDG-----QKYWIIKNQWGKDWG 1069
>gi|13625987|gb|AAK35219.1|AF362768_1 cysteine proteinase [Paragonimus westermani]
Length = 137
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 77/120 (64%), Gaps = 8/120 (6%)
Query: 91 GGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANA 150
GGLE ++DYPY G + C L++ ++ KI S + + ++E + A Y+ ++GPM+ INA
Sbjct: 2 GGLEAQRDYPYVGREQPCKLDESKLLAKINSSIVLEANEKKQAAYIAEHGPMSSGINAVT 61
Query: 151 MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+QFY G+SHP K C+ D L+HGVL VGYG T PYWIIKNSWG WGEK
Sbjct: 62 LQFYQSGISHPSKSQCQ--PDWLNHGVLSVGYG------TEDGVPYWIIKNSWGTGWGEK 113
>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
Length = 354
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 64/151 (42%), Positives = 85/151 (56%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM A I+ G + E YPY G+ CH ++ E+ KI
Sbjct: 181 LVSCDNVDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCH-DEGEVGAKI 239
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
++++ DE +A ++ K GP+AVA++A Q YFGGV LC +L+HGVLI
Sbjct: 240 TGFLSLPHDEERIADWVEKRGPVAVAVDATTWQLYFGGVVS----LCLAW--SLNHGVLI 293
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ + PYWI+KNSWG WGEK
Sbjct: 294 VGFNKNAKP------PYWIVKNSWGSSWGEK 318
>gi|343414950|emb|CCD20840.1| cysteine peptidase, putative [Trypanosoma vivax Y486]
Length = 285
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 64/151 (42%), Positives = 79/151 (52%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGG M NAFE I+ + G + EK YPY G C ++ I
Sbjct: 2 LVSCDTKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHKVGATI 61
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + L+HGVL+
Sbjct: 62 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVTSCT------SEALNHGVLL 115
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGEK
Sbjct: 116 VGYN------DSSKPPYWIIKNSWSSSWGEK 140
>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
Australia]
Length = 367
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 90/149 (60%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD+VD GC GGLM AF+ ++ L GG+E E DYPY+GS + C L+ +I VK+ S
Sbjct: 207 QLLDCDEVDLGCNGGLMHLAFQELL--LMGGVETEADYPYQGSEQMCTLDNRKIAVKLNS 264
Query: 122 YVNVS-SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
DE ++ + + GP+A+A++A + Y G+ L + + +L+H VL++
Sbjct: 265 CFKYDIRDENKLKELVYTTGPVAIAVDAMDIINYRRGI------LNQCHIYDLNHAVLLI 318
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
G+G+ PYWIIKNSWG WGE
Sbjct: 319 GWGIENNV------PYWIIKNSWGEDWGE 341
Score = 38.9 bits (89), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
F HFL+++NKSY +EY R +F+ NL KI
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKI 88
>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
Length = 367
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 90/149 (60%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD+VD GC GGLM AF+ ++ L GG+E E DYPY+GS + C L+ +I VK+ S
Sbjct: 207 QLLDCDEVDLGCNGGLMHLAFQELL--LMGGVETEADYPYQGSEQMCTLDNRKIAVKLNS 264
Query: 122 YVNVS-SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
DE ++ + + GP+A+A++A + Y G+ L + + +L+H VL++
Sbjct: 265 CFKYDIRDENKLKELVYTTGPVAIAVDAMDIINYRRGI------LNQCHIYDLNHAVLLI 318
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
G+G+ PYWIIKNSWG WGE
Sbjct: 319 GWGIENNV------PYWIIKNSWGEDWGE 341
Score = 38.9 bits (89), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
F HFL+++NKSY +EY R +F+ NL KI
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKI 88
>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
Length = 365
Score = 113 bits (283), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 90/149 (60%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD+VD GC GGLM AF+ ++ L GG+E E DYPY+GS + C L+ +I VK+ S
Sbjct: 205 QLLDCDEVDLGCNGGLMHLAFQELL--LMGGVETEADYPYQGSEQMCTLDNRKIAVKLNS 262
Query: 122 YVNVS-SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
DE ++ + + GP+A+A++A + Y G+ L + + +L+H VL++
Sbjct: 263 CFKYDIRDENKLKELVYTTGPVAIAVDAMDIINYRRGI------LNQCHIYDLNHAVLLI 316
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
G+G+ PYWIIKNSWG WGE
Sbjct: 317 GWGIENNV------PYWIIKNSWGEDWGE 339
Score = 38.9 bits (89), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
F HFL+++NKSY +EY R +F+ NL KI
Sbjct: 55 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKI 86
>gi|126021|sp|P25775.1|LMCPA_LEIME RecName: Full=Cysteine proteinase A; Flags: Precursor
gi|9573|emb|CAA44094.1| cysteine proteinase [Leishmania mexicana]
Length = 354
Score = 113 bits (283), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 86/151 (56%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A I+ G + E YPY G+ CH ++ E+ KI
Sbjct: 181 LVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCH-DEGEVGAKI 239
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
++++ DE +A+++ K GP+AVA++A Q YFGGV LC +L+HGVLI
Sbjct: 240 TGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVS----LCLAW--SLNHGVLI 293
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ + PYWI+KNSWG WGEK
Sbjct: 294 VGFNKNAKP------PYWIVKNSWGSSWGEK 318
>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
cysteine proteinase A-1; Flags: Precursor
gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 354
Score = 113 bits (283), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 86/151 (56%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD +D GC GGLM A I+ G + E YPY G+ CH ++ E+ KI
Sbjct: 181 LVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCH-DEGEVGAKI 239
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
++++ DE +A+++ K GP+AVA++A Q YFGGV LC +L+HGVLI
Sbjct: 240 TGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVS----LCLAW--SLNHGVLI 293
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+ + PYWI+KNSWG WGEK
Sbjct: 294 VGFNKNAKP------PYWIVKNSWGSSWGEK 318
>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 389
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 80/151 (52%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGG M NAFE I+ + G + K YPY GS C E+ I
Sbjct: 171 LVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTGKSYPYVSEDGSKPFCIPYGHEVGATI 230
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 231 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 284
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGEK
Sbjct: 285 VGYN------DSSKPPYWIIKNSWSSSWGEK 309
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 101/172 (58%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R GT ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 155 LEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 212
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+G + +CH NK+ + + + ++ +E +MA+ + GP++VAI+A+ + QFY G+
Sbjct: 213 EGIDDSCHFNKDSVGATDRGFADIPQGNEKKMAEAVATIGPVSVAIDASHESFQFYSEGI 272
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VGYG ++ + YW++KNSWG WG+K
Sbjct: 273 YNEPE--CNS--QNLDHGVLVVGYGTDESG-----KDYWLVKNSWGTTWGDK 315
>gi|332326587|gb|AEE42617.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 88/153 (57%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIR-V 117
+LV CD D+GC GGLM+ AFE ++ + G + E YPY G C + + +
Sbjct: 177 QLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSYPYVSSTGDVPECTNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S ET MA +L K+GP+++A++A++ Y GV L D L+HGV
Sbjct: 237 RIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYESGV------LTSCAGDALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 291 LLVGY-----NXTGEV-PYWVIKNSWGEDWGEK 317
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 62/163 (38%), Positives = 87/163 (53%), Gaps = 11/163 (6%)
Query: 54 GEGTHLALK-LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
GE HL+++ ++DCD VD GC GG + + GGL+ + DY YK + CH ++
Sbjct: 81 GELLHLSVQQVLDCDHVDHGCNGGYPPQVYRQVNQM--GGLQLDADYSYKAAVGKCHTDR 138
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
+ R + S V +S +E A L GP+A +NA +QFY G+ HP C G
Sbjct: 139 SKFRAYVNSSVILSQNEQFQANKLKTIGPLASTLNARTLQFYRKGIMHPTPSACNPG--Q 196
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFW 215
L+H VL VGYG T + PYWI+KNSW +GE+ W
Sbjct: 197 LNHAVLTVGYG------TEQGMPYWIVKNSWSRGFGEQVRAIW 233
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 80/148 (54%), Gaps = 10/148 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD VD GC GG + +I GGLE DYPYK CH+++++++V I
Sbjct: 441 QLIDCDNVDEGCNGGYPPKTYGAVIKM--GGLELNSDYPYKALAEKCHMDRQKLKVYIND 498
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V +E A+ L GP++ A+NAN ++FY G+ H C L+H VL VG
Sbjct: 499 SVVFPRNEHLQAEALKLMGPLSSALNANPLKFYKTGIMHLPVASCFP--RALNHAVLTVG 556
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
YG T PYW +KNSWG +GE
Sbjct: 557 YG------TENGLPYWTVKNSWGTAFGE 578
>gi|20301807|gb|AAM15727.1| cysteine protease [Pagumogonimus skrjabini]
Length = 166
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 84/143 (58%), Gaps = 11/143 (7%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLNKEEIRVKIQ 120
+L+DCDKVD GC GG +A++ + K GG+E + YPY G + C L+K +
Sbjct: 34 QLLDCDKVDEGCNGGYPMDAYKEL--KRMGGVESQSTYPYTGRQSSQCWLDKSLFVAYLN 91
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
V + DE + A +L NGP++VA+NA+ +QFY G+SHP + LC L+H VL V
Sbjct: 92 DSVMLPKDELKQAAWLADNGPLSVALNADQLQFYRRGISHPPESLCPA--SGLNHAVLSV 149
Query: 181 GYGVHKTKFTHKIQPYWIIKNSW 203
GYG + PYWI+KNSW
Sbjct: 150 GYG------SENGTPYWIVKNSW 166
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 65/161 (40%), Positives = 91/161 (56%), Gaps = 20/161 (12%)
Query: 56 GTHLALK---LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHL 110
GT ++L LVDC + GC GGLM +AFE +I G++ E YPY+ + C
Sbjct: 150 GTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKN--NGIDTEASYPYRAVDSTCKF 207
Query: 111 NKEEIRVKIQSYVNVSSD-ETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCK 167
N ++ I YV+V+ D E+++ + GP++VAI+A+ + QFY GV PL +C
Sbjct: 208 NTADVGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSSGVYDPL--ICS 265
Query: 168 GGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
NLDHGVL VGYG +K YW++KNSWG WG
Sbjct: 266 S--TNLDHGVLAVGYGTDGSK------DYWLVKNSWGASWG 298
>gi|407398899|gb|EKF28211.1| cysteine peptidase, putative, partial [Trypanosoma cruzi
marinkellei]
Length = 261
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D GC GGLM++AFE I+ + G + E+ YPY +G + C + + I
Sbjct: 2 LVSCDKTDTGCSGGLMNDAFEWIVQENNGAVYTEESYPYASGEGISPPCTTSGHTVGAMI 61
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A + Y GGV + + LDHGVL+
Sbjct: 62 TGHVELPQDEAQIAAWLAANGPVAVAVDATSWMTYTGGV------MTSCVSEQLDHGVLL 115
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 116 VGYN------DSAPVPYWIIKNSWTTLWGEE 140
>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 89/151 (58%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRVKI 119
LV CD +D GC GGLM A + I+S G + E+ YPY ++ C+++ + + KI
Sbjct: 178 LVSCDNMDDGCQGGLMDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCNMSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
++N+ DE +A++L KNGP+A+A++A++ Y GGV L D L+H VL+
Sbjct: 238 SGHINLPKDENAIAEWLAKNGPVAIAVDASSFLDYKGGV------LTSCSSDALNHDVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSWG WGE+
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWGKKWGEE 316
>gi|332326593|gb|AEE42620.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 88/153 (57%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIR-V 117
+LV CD D+GCGGGLM+ AFE ++ + G + E YPY G C + + +
Sbjct: 177 QLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMFTEDSYPYVSSXGDVPECTNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S ET MA +L K+GP+++ ++A++ Y GV L B L+HGV
Sbjct: 237 RIDGYVTIESSETVMAAWLAKSGPISIGVDASSFMSYESGV------LTSCAGBXLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 291 LLVGY-----NXTGEV-PYWVIKNSWGEDWGEK 317
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 87/152 (57%), Gaps = 16/152 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC V +AGC GGLM NAF I K GGLE EK YPY G + CH + I K+
Sbjct: 165 LVDCSAVYGNAGCNGGLMDNAFRFI--KDAGGLETEKSYPYTGKDGTCHFDARGIGAKLT 222
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V+V S DE + + GP++VAI+A+ QFY GV + C +LDHGV
Sbjct: 223 GFVDVPSRDEEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEIT--CSS--TSLDHGV 278
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGYG T + YW++KNSWG WG+
Sbjct: 279 LVVGYGT-----TRDGKDYWLVKNSWGSSWGQ 305
>gi|332326583|gb|AEE42615.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 88/153 (57%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIR-V 117
+LV CD D+GC GGLM+ AFE ++ + G + E YPY G C + + +
Sbjct: 177 QLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSYPYVSSTGDVPECTNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S ET MA +L K+GP+++A++A++ Y GV L D L+HGV
Sbjct: 237 RIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYESGV------LTSCAGDALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 291 LLVGY-----NRTGEV-PYWVIKNSWGEDWGEK 317
>gi|407394080|gb|EKF26779.1| cysteine peptidase, partial [Trypanosoma cruzi marinkellei]
Length = 179
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D GC GGLM++AFE I+ + G + E+ YPY +G + C + + I
Sbjct: 2 LVSCDKTDTGCSGGLMNDAFEWIVQENNGAVYTEESYPYASGEGISPPCTTSGHTVGAMI 61
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A + Y GGV + + LDHGVL+
Sbjct: 62 TGHVELPQDEAQIAAWLAANGPVAVAVDATSWMTYTGGV------MTSCVSEQLDHGVLL 115
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 116 VGYN------DSAPVPYWIIKNSWTTLWGEE 140
>gi|332326589|gb|AEE42618.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 88/153 (57%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIR-V 117
+LV CD D+GC GGLM+ AFE ++ + G + E YPY G C + + +
Sbjct: 177 QLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSYPYVSSTGDVPECTNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S ET MA +L K+GP+++A++A++ Y GV L D L+HGV
Sbjct: 237 RIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYESGV------LTSCAGDALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 291 LLVGY-----NRTGEV-PYWVIKNSWGEDWGEK 317
>gi|11464866|gb|AAG35358.1|AF314930_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 175 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 235 TGHVGLPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNS WGE+
Sbjct: 289 VGYN------DSAAVPYWIIKNSRTTQWGEE 313
>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 329
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/179 (38%), Positives = 92/179 (51%), Gaps = 34/179 (18%)
Query: 62 KLVDCD---------KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACH 109
+LVDCD D+GC GGL +NA ++ + GGL+ E YPY +G R
Sbjct: 150 QLVDCDHTCDPDSGTACDSGCDGGLPANAMAYVVKR--GGLDAEAAYPYLGARGDGRCKS 207
Query: 110 LNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGG 169
I +Y VS+DE+++A LVK+GP++V I+A MQ Y GV+ P + C
Sbjct: 208 KEDGPPAATITNYSFVSADESQIAAALVKHGPLSVGIDARWMQLYRRGVACP--WACD-- 263
Query: 170 MDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQ 228
LDHGVLIVG+G P G + PFW+IKNSWG RWGE+
Sbjct: 264 KTRLDHGVLIVGFGAEGR----------------APARGFRREPFWLIKNSWGARWGEE 306
>gi|398010921|ref|XP_003858657.1| cathepsin L-like protease, partial [Leishmania donovani]
gi|322496866|emb|CBZ31937.1| cathepsin L-like protease, partial [Leishmania donovani]
Length = 345
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 90/153 (58%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 177 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 237 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY +KT PYW+IKNSWG WGEK
Sbjct: 291 LLVGY--NKTGGV----PYWVIKNSWGEDWGEK 317
>gi|407398503|gb|EKF28112.1| cysteine peptidase, putative, partial [Trypanosoma cruzi
marinkellei]
Length = 222
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D GC GGLM++AFE I+ + G + E+ YPY +G + C + + I
Sbjct: 2 LVSCDKTDTGCSGGLMNDAFEWIVQENNGAVYTEESYPYASGEGISPPCTTSGHTVGAMI 61
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A + Y GGV + + LDHGVL+
Sbjct: 62 TGHVELPQDEAQIAVWLAANGPVAVAVDATSWMTYTGGV------MTSCVSEQLDHGVLL 115
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSW WGE+
Sbjct: 116 VGYN------DSAPVPYWIIKNSWTTLWGEE 140
>gi|209962695|gb|ACJ02142.1| cathepsin L-like protein [Trypanosoma brucei brucei]
Length = 159
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 30 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGSVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 90 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY + PYWIIKN
Sbjct: 144 VGYNDNSNP------PYWIIKN 159
>gi|54287314|emb|CAD54749.1| cysteine proteinase b [Leishmania major]
Length = 174
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 89/153 (58%), Gaps = 18/153 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-- 116
+LV CD VD GCGGGLM AFE ++ + G + EK YPY N C N E+
Sbjct: 35 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECS-NSSELAPG 93
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 94 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 147
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VL+VGY T ++ PYW+IKNSWG WGE
Sbjct: 148 VLLVGY-----NMTGEV-PYWVIKNSWGEDWGE 174
>gi|311698057|gb|ADQ00323.1| cathepsin L-like protein [Trypanosoma cyclops]
Length = 159
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 81/143 (56%), Gaps = 15/143 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVK 118
+LV CD +D GCGGGLM+NAF +++ GG + E YPY G C ++ + K
Sbjct: 29 QLVSCDTLDYGCGGGLMNNAFTWLVNSSGGNVYTEDSYPYVSGSGDEPTCSTSEHCVGAK 88
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
I YVN+ DE +MA ++ NGP+AVA++A++ +Y GGV L LDHGVL
Sbjct: 89 ITDYVNLPQDEDKMAAWVAANGPLAVAVDASSFSWYTGGV------LTNCASYQLDHGVL 142
Query: 179 IVGYGVHKTKFTHKIQPYWIIKN 201
+VGY PYWIIKN
Sbjct: 143 LVGYNDTNDP------PYWIIKN 159
>gi|44844204|emb|CAF32698.1| cysteine proteinase [Leishmania infantum]
Length = 443
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 90/153 (58%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 177 QLVSCDDKDNGCNGGLMLQAFEXLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 237 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY +KT PYW+IKNSWG WGEK
Sbjct: 291 LLVGY--NKTGGV----PYWVIKNSWGEDWGEK 317
>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 479
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 84/151 (55%), Gaps = 18/151 (11%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM A+ II G + E YPY GS +C L+ ++ +I
Sbjct: 181 LVSCDTVDMGCNGGLMDQAWAWIIKNHSGAVYTEVSYPYTSGDGSTASC-LSTGKVGARI 239
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGG-VSHPLKFLCKGGMDNLDHGVL 178
V++ DE + +L KNGP+++A++A Q YFGG VS+ + NL+HGVL
Sbjct: 240 SGQVSLPQDEDAIEAWLEKNGPISIAVDATTWQLYFGGVVSNCFAY-------NLNHGVL 292
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+VGY PYWI+KNSWG WGE
Sbjct: 293 LVGYNNSANP------PYWIVKNSWGTSWGE 317
>gi|209962691|gb|ACJ02140.1| cathepsin L-like protein [Trypanosoma evansi]
Length = 159
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 30 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 90 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY + PYWIIKN
Sbjct: 144 VGYNDNSNP------PYWIIKN 159
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/172 (40%), Positives = 97/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 157 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 214
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH NK I + + ++ DE +MA+ + GP+AVAI+A+ + QFY GV
Sbjct: 215 EAIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASHESFQFYSEGV 274
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VGYG ++ YW++KNSWG WG+K
Sbjct: 275 YNEPQ--CDA--QNLDHGVLVVGYGTDESG-----DDYWLVKNSWGTTWGDK 317
>gi|559532|emb|CAA57675.1| cysteine proteinase [Zea mays]
Length = 145
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/117 (48%), Positives = 76/117 (64%), Gaps = 6/117 (5%)
Query: 94 EGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF 153
E EKDYPY GS+ C +K +I +Q++ VS DE +++ +K+GP+A+ INA MQ
Sbjct: 1 ESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANRIKHGPLAIGINAAYMQT 60
Query: 154 YFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHK-TKFTHKIQPYWIIKNSWGPHWGE 209
Y GGVS P ++C +LDHGVL+VGYG K +PYWIIKNSWG +WGE
Sbjct: 61 YIGGVSCP--YICG---RHLDHGVLLVGYGASGFAPMRLKDKPYWIIKNSWGENWGE 112
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/172 (40%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R GT ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 156 LEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 213
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+G + +CH NK I + + ++ DE ++A+ + GP++VAI+A+ + QFY GV
Sbjct: 214 EGIDDSCHFNKGTIGATDRGFTDIPQGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGV 273
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ C NLDHGVL+VGYG + + YW++KNSWG WG+K
Sbjct: 274 YDEPQ--CD--PQNLDHGVLVVGYGTDENG-----KDYWLVKNSWGTTWGDK 316
>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
Length = 321
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 81/149 (54%), Gaps = 10/149 (6%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD+ GC GG ++++ I+ GGLE + DYPY G C + KE + KI
Sbjct: 159 QLVDCDRAADGCNGGWPASSYLEIMHM--GGLESQDDYPYAGVKEQCFMEKERLLAKIDD 216
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ + E + A YL ++GP++ +NA +Q+Y G+ HP C +L+H VL VG
Sbjct: 217 SIALGPSEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYEECSPV--DLNHAVLTVG 274
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
Y PYWIIKNSW WGEK
Sbjct: 275 YDKEGD------MPYWIIKNSWNVEWGEK 297
>gi|209962697|gb|ACJ02143.1| cathepsin L-like protein [Trypanosoma brucei gambiense]
Length = 159
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 30 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 90 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY + PYWIIKN
Sbjct: 144 VGYNDNSNP------PYWIIKN 159
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 86/152 (56%), Gaps = 17/152 (11%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC + GC GGLM AFE II+ GG+E E++YPY CH K E+
Sbjct: 182 QLVDCSGKFGNEGCNGGLMDQAFEYIITN--GGIETEEEYPYDARQERCHFKKSEVAATA 239
Query: 120 QSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHG 176
V+V S DET++ + + GP+++AI+A+ + Q Y GGV K C LDHG
Sbjct: 240 SGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVYDEPK--CSS--TELDHG 295
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
VL+VGYG T Q YW++KNSWG WG
Sbjct: 296 VLVVGYG------TDDGQDYWLVKNSWGTTWG 321
>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRVKI 119
LV CD +D GC GGLM A + I+S G + E+ YPY ++ C+ + + + KI
Sbjct: 178 LVSCDNMDDGCQGGLMDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCNKSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+N+ DE +A++L KNGP+A+A++A++ Y GGV L D L+H VL+
Sbjct: 238 SGLINLPKDENAIAEWLAKNGPIAIAVDASSFLDYTGGV------LTSCSSDALNHDVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSWG WGE+
Sbjct: 292 VGYD------DSSKPPYWIIKNSWGKKWGEE 316
>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
Length = 335
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LV CD VD GC GGLM AF+ +++ G + YPY N + E + I +
Sbjct: 169 ELVSCDDVDEGCNGGLMGQAFDWLLNNRNGAVYTGASYPYVSGNGSVPECSESSDLVIGA 228
Query: 122 YVN----VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y++ + S+E MA +L NGP+A+A++A+A Y GGV C G L+HGV
Sbjct: 229 YIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMSYTGGVLTS----CDG--KQLNHGV 282
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG +WGEK
Sbjct: 283 LLVGY-----NMTGEV-PYWVIKNSWGENWGEK 309
>gi|82659048|gb|ABB88697.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 87/152 (57%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-V 117
+LV CD D GCGGGLM AFE ++ + G + E YPY S+ C + + +
Sbjct: 177 QLVSCDDKDNGCGGGLMLQAFEWLLRNMNGTMFTEDSYPYVSSSGYVPECSNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I Y+ + S ET MA +L KNGP+++A++A++ Y GV L D L+HGV
Sbjct: 237 RIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQSGV------LTSCAGDALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGY T ++ PYW+IKNSWG WGE
Sbjct: 291 LLVGY-----NRTGEV-PYWVIKNSWGEDWGE 316
>gi|332326591|gb|AEE42619.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 88/153 (57%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIR-V 117
+LV CD D+GCGGGLM+ AFE ++ + G + E YPY G C + E +
Sbjct: 177 QLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMFTEDSYPYVSSTGDVPECTNSSELVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S ET MA +L K+GP+++A++A+ Y GV C G L+HGV
Sbjct: 237 RIDGYVMIESXETVMAAWLAKSGPISIAVDASPFMSYESGVLTS----CVG--KXLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 291 LLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC---HLNKEEIRVK 118
+LVDCD + GC GGLM NAF+ I S GG+ E YPY+ SN C + + V
Sbjct: 190 ELVDCDTAENGCQGGLMENAFDFIKSY--GGITTESAYPYRASNGTCDGMRARRGRVHVS 247
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I + V + + V P++VAI+A A QFY GV F G D LDHG
Sbjct: 248 IDGHQMVPTGSEDALAKAVARQPVSVAIDAGGQAFQFYSEGV-----FTGDCGTD-LDHG 301
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V +VGYGV T PYWI+KNSWGP WGE
Sbjct: 302 VAVVGYGVSDVDGT----PYWIVKNSWGPSWGE 330
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/172 (39%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 157 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 214
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH NK I + + ++ DE +MA+ + GP+AVAI+A+ + QFY GV
Sbjct: 215 EAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASHESFQFYSEGV 274
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VG+G ++ + YW++KNSWG WG+K
Sbjct: 275 YNEPQ--CDA--QNLDHGVLVVGFGTDESG-----EDYWLVKNSWGTTWGDK 317
>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
Length = 328
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 73/173 (42%), Positives = 97/173 (56%), Gaps = 21/173 (12%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ R G LAL +LVDCD +D GC GG + I K+GG LE DYPY
Sbjct: 146 GNVEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEI-EKMGG-LELASDYPY 203
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSS----DETEMAKYLVKNGPMAVAINANAMQFYFGG 157
G + C++N+ K +YVN S+ E A+ L + GP++ A+NA +QFY GG
Sbjct: 204 TGVDGICYMNQS----KFVAYVNESTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGG 259
Query: 158 VSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ P+ FLC L+H VL VGYG T+F PYWI+KNSWG +GEK
Sbjct: 260 IIFPIPFLCNP--HGLNHAVLTVGYG---TEFGI---PYWIVKNSWGVGFGEK 304
>gi|330842703|ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
gi|325076376|gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
Length = 352
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/147 (38%), Positives = 83/147 (56%), Gaps = 9/147 (6%)
Query: 64 VDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYV 123
VDCD +D GCGGG +N + IIS GG+ EKDYPY + C + + YV
Sbjct: 192 VDCDTMDGGCGGGDPANVYNYIIS--AGGVSTEKDYPYTAQDGTCFNTTRAVSITGFQYV 249
Query: 124 NVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG 183
+SDE + + +GP+++ ++A+ Q Y GG+ + G N+DH V +VG
Sbjct: 250 TQNSDEDTLITTIANHGPVSICVDASTWQSYTGGI------ITTGCEQNIDHCVQVVGLD 303
Query: 184 VHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ KT ++ I PY+II+NSWG WG+K
Sbjct: 304 IDKTDPSNPI-PYYIIRNSWGTSWGDK 329
Score = 37.0 bits (84), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 18 MFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
+F+H+ +++ K Y T EE+ KR F+ NLKKI+
Sbjct: 38 LFHHWTKQNGKIYETSEEFEKRFSNFKTNLKKIE 71
>gi|209962699|gb|ACJ02144.1| cathepsin L-like protein [Trypanosoma brucei rhodesiense]
Length = 159
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D+GC GGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 30 LVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 90 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY + PYWIIKN
Sbjct: 144 VGYNDNSNP------PYWIIKN 159
>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 83/151 (54%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR---ACHLNKEEIRVKI 119
LV CD + C GG M NAF IIS G + E+ YPY R AC+++ + + I
Sbjct: 178 LVSCDPTEYACEGGFMDNAFRWIISSNKGKVFTEQSYPYSSGGRNVPACNMSGKVVGANI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +A++L KNGP++V ++A + Q Y GGV L L+H VL+
Sbjct: 238 SDYVDLPQDENAIAEWLAKNGPVSVIVDATSFQSYTGGV------LTSCLSKILNHAVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGEK
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWSEKWGEK 316
>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 93/169 (55%), Gaps = 13/169 (7%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ R G LAL +LVDCD +D GC GG + I K+GG LE DYPY
Sbjct: 146 GNVEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEI-EKMGG-LELASDYPY 203
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
G + C++N+ + + + E A+ L + GP++ A+NA +QFY GG+ P
Sbjct: 204 TGVDGICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFP 263
Query: 162 LKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ FLC L+H VL VGYG T+F PYWI+KNSWG +GEK
Sbjct: 264 IPFLCNP--HGLNHAVLTVGYG---TEFGI---PYWIVKNSWGVGFGEK 304
>gi|58617840|gb|AAW80539.1| cathepsin L-like cysteine protease [Leishmania donovani]
Length = 225
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 90/155 (58%), Gaps = 20/155 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 18 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGFVFTEKSYPYTSGNGDVAECLNSSKLVPGA 77
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 78 RIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 131
Query: 178 LIVGYGVHKTKFTHKIQ--PYWIIKNSWGPHWGEK 210
L+VGY +KI PYW+IKNSWG WGEK
Sbjct: 132 LLVGY--------NKIGEVPYWVIKNSWGEDWGEK 158
>gi|339896953|ref|XP_003392238.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|14349351|gb|AAC38832.2| cysteine protease [Leishmania chagasi]
gi|17384031|emb|CAD12393.1| cysteine proteinase [Leishmania infantum]
gi|321398984|emb|CBZ08377.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 443
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 90/153 (58%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 177 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 237 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY +KT PYW+IKNSWG WGEK
Sbjct: 291 LLVGY--NKTGGV----PYWVIKNSWGEDWGEK 317
>gi|295971915|gb|ADG63164.1| cysteine protease F [Leishmania donovani]
Length = 240
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 90/153 (58%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 37 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 96
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 97 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 150
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY +KT PYW+IKNSWG WGEK
Sbjct: 151 LLVGY--NKTGEV----PYWVIKNSWGEDWGEK 177
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 91/153 (59%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM NAF I K GG++ EK YPY+ + +CH NK I +
Sbjct: 176 LVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDR 233
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ +E +MA+ + GP+AVAI+A+ + QFY GV + C NLDHGV
Sbjct: 234 GFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPA--CDA--QNLDHGV 289
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VG+G ++ Q YW++KNSWG WG+K
Sbjct: 290 LVVGFGTDESG-----QDYWLVKNSWGTTWGDK 317
>gi|15824691|gb|AAL09443.1| cysteine protease [Leishmania donovani]
Length = 443
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 90/153 (58%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 177 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 237 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY +KT PYW+IKNSWG WGEK
Sbjct: 291 LLVGY--NKTGGV----PYWVIKNSWGEDWGEK 317
>gi|432091112|gb|ELK24324.1| Cathepsin W [Myotis davidii]
Length = 370
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 64/163 (39%), Positives = 89/163 (54%), Gaps = 20/163 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLNKEEIRVKIQ 120
+L+DC + GC GG + +AF T+++ GL EKDYP++G+ RA C K + IQ
Sbjct: 180 ELLDCGRCGDGCSGGFVWDAFITVLNN--SGLASEKDYPFQGAVRAKCQAKKHKKVAWIQ 237
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
++ +S +E +A YL GP+ V IN +Q Y GV + C N+DH VL+V
Sbjct: 238 DFIMLSDNEQRIAWYLATEGPITVTINKKLLQQYQNGVIKATQTTCD--PQNVDHVVLLV 295
Query: 181 GYGVHKTKFTHKIQ-------------PYWIIKNSWGPHWGEK 210
G+G KTK Q PYWI+KNSWG +WGEK
Sbjct: 296 GFG--KTKSVEGRQAKGVPGHSRRRSTPYWILKNSWGANWGEK 336
>gi|358339045|dbj|GAA32724.2| cathepsin F, partial [Clonorchis sinensis]
Length = 271
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 93/169 (55%), Gaps = 13/169 (7%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ R G LAL +LVDCD +D GC GG + I K+GG LE DYPY
Sbjct: 89 GNVEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEI-EKMGG-LELASDYPY 146
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
G + C++N+ + + + E A+ L + GP++ A+NA +QFY GG+ P
Sbjct: 147 TGVDGICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFP 206
Query: 162 LKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ FLC L+H VL VGYG T+F PYWI+KNSWG +GEK
Sbjct: 207 IPFLCN--PHGLNHAVLTVGYG---TEFGI---PYWIVKNSWGVGFGEK 247
>gi|58617836|gb|AAW80537.1| cathepsin L-like cysteine protease [Leishmania donovani]
Length = 247
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 90/153 (58%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEI--RV 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 40 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGFVFTEKSYPYTSGNGDVAECLNSSKLVPGA 99
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 100 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 153
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY +KT PYW+IKNSWG WGEK
Sbjct: 154 LLVGY--NKTGGV----PYWVIKNSWGEDWGEK 180
>gi|267632797|gb|ACY78683.1| cysteine proteinase B [Leishmania donovani]
Length = 179
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 90/153 (58%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 25 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 84
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 85 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 138
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY +KT PYW+IKNSWG WGEK
Sbjct: 139 LLVGY--NKTGGV----PYWVIKNSWGEDWGEK 165
>gi|319976406|gb|ADV90878.1| cysteine proteinase B [Leishmania donovani]
Length = 332
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 90/153 (58%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEI--RV 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 101 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 160
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 161 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 214
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY +KT PYW+IKNSWG WGEK
Sbjct: 215 LLVGY--NKTGGV----PYWVIKNSWGEDWGEK 241
>gi|394331822|gb|AFN27130.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/152 (40%), Positives = 87/152 (57%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-V 117
+LV CD D+GCGGGLM AFE ++ + G + E YPY S+ C + + +
Sbjct: 177 QLVSCDDKDSGCGGGLMLQAFEWLLRNMNGTMFTEDSYPYVSSSGYVPECSNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I Y+ + S ET MA +L KNGP+++A++A++ Y GV L L+HGV
Sbjct: 237 RIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYESGV------LTSCAGITLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGY T ++ PYW+IKNSWG WGE
Sbjct: 291 LLVGY-----NMTGEV-PYWVIKNSWGEDWGE 316
>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 59/151 (39%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRVKI 119
LV CD +D GC GG + A + I+S G + E+ YPY ++ C+ + + + KI
Sbjct: 178 LVSCDNMDYGCRGGFLDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCNKSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+N+ DE +A++L KNGP+A+A++A++ Y GGV L D L+HGVL+
Sbjct: 238 SGLINLPKDENAIAEWLAKNGPIAIAVDASSFLDYTGGV------LTSCSSDALNHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY PYWIIKNSWG WGE+
Sbjct: 292 VGYD------DSSKPPYWIIKNSWGKKWGEE 316
>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
Length = 440
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 64/151 (42%), Positives = 87/151 (57%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D GCGGG AF+ I+S G + E+ YPY G+ C + + + KI
Sbjct: 178 LVSCDTNDFGCGGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ V++ DE +A++L K GP+A+A++A + Q Y GGV L ++LDHGVL+
Sbjct: 238 RDRVDLPRDENAIAEWLAKKGPVAIAVDATSFQSYTGGV------LTSCISEHLDHGVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSWG WGE+
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWGKGWGEE 316
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 67/152 (44%), Positives = 88/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ EKDYPYKG++ C +N++ + V I
Sbjct: 184 ELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEKDYPYKGTDGRCDVNRKNAKVVTI 241
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V +++ + + V N P++VAI A A Q Y G+ F G LDHGV
Sbjct: 242 DSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSSGI-----FTGSCGT-RLDHGV 295
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI+KNSWG WGE
Sbjct: 296 TAVGYGTENGK------DYWIVKNSWGSSWGE 321
>gi|394333028|gb|AFN27088.1| cysteine protease, partial [Leishmania infantum]
Length = 242
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 90/155 (58%), Gaps = 20/155 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 40 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 99
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 100 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 153
Query: 178 LIVGYGVHKTKFTHKIQ--PYWIIKNSWGPHWGEK 210
L+VGY +KI PYW+IKNSWG WGEK
Sbjct: 154 LLVGY--------NKIGGVPYWVIKNSWGEDWGEK 180
>gi|449139100|gb|AGE89905.1| cathepsin-like cysteine proteinase [Spodoptera littoralis NPV]
Length = 336
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 86/149 (57%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ- 120
+L+DCD++D GC GGLM AF+ I+ GG+E E DYPY+G AC + V++
Sbjct: 176 QLLDCDRIDQGCDGGLMHLAFQEIMRI--GGVEHEIDYPYQGIEYACRSAPSKFAVRLSH 233
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y DE ++ + L KNGP+AVAI+ + Y G++ +C + L+H VL+V
Sbjct: 234 CYQYDLRDERKLLELLYKNGPIAVAIDCRDIIDYRSGIAT----VCND--NGLNHAVLLV 287
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYG+ PYWI KNSWG +WGE
Sbjct: 288 GYGIENDT------PYWIFKNSWGSNWGE 310
>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
Length = 1039
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 88/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ EKDYPYKG++ C +N++ + V I
Sbjct: 742 ELVDCDTSYNQGCNGGLMDYAFEFIIN--NGGIDTEKDYPYKGTDGRCDVNRKNAKVVTI 799
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V +++ + + V N P++VAI A Q Y G+ F G LDHGV
Sbjct: 800 DSYEDVPANDEKSLQKAVANQPVSVAIEAAGTTFQLYSSGI-----FTGSCGT-ALDHGV 853
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+VGYG K YWI+KNSWG WGE
Sbjct: 854 TVVGYGTENGK------DYWIMKNSWGSSWGE 879
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 66/172 (38%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 187 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 244
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH NK + + + ++ DE +MA+ + GP++VAI+A+ + QFY GV
Sbjct: 245 EAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGV 304
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VG+G ++ + YW++KNSWG WG+K
Sbjct: 305 YNEPQ--CDA--QNLDHGVLVVGFGTDESG-----EDYWLVKNSWGTTWGDK 347
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 66/172 (38%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 191 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 248
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH NK + + + ++ DE +MA+ + GP++VAI+A+ + QFY GV
Sbjct: 249 EAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGV 308
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VG+G ++ + YW++KNSWG WG+K
Sbjct: 309 YNEPQ--CDA--QNLDHGVLVVGFGTDESG-----EDYWLVKNSWGTTWGDK 351
>gi|42564159|gb|AAS20591.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 90/152 (59%), Gaps = 13/152 (8%)
Query: 62 KLVDCDKV---DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK 118
+L+DC K D GGLMS AF+ ++ K G+E + YPYKG + C + ++ +K
Sbjct: 161 QLLDCSKPYGNDDCEHGGLMSFAFDYVLDK---GIEADSSYPYKGIDTPCQYDAKKTVLK 217
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
I+ Y NVS+ E E+ K + GP++VAI+A+ +Q YFGG+ L C NL+HGVL
Sbjct: 218 IKGYKNVSNSEEELKKAVGTVGPVSVAIDADPIQLYFGGILDGL--FC---THNLNHGVL 272
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG F K +W +KNSWG WGE+
Sbjct: 273 AVGYGEEDHLFGKK--KFWKVKNSWGKDWGEQ 302
>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 61/152 (40%), Positives = 86/152 (56%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-V 117
+LV CD D GC GGLM AFE ++ + G + E YPY S C + + +
Sbjct: 177 QLVSCDDKDNGCAGGLMLQAFEWLLRNMNGTMFTEDSYPYVSSTGYVPECSNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I Y+ + S ET MA +L KNGP+++A++A++ Y GV L D L+HGV
Sbjct: 237 RIDGYLTIESSETVMAAWLAKNGPISIAVDASSFMSYQSGV------LTSCAGDALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGY T ++ PYW+IKNSWG +WGE
Sbjct: 291 LLVGY-----NRTGEV-PYWVIKNSWGENWGE 316
>gi|58617832|gb|AAW80535.1| cathepsin L-like cysteine protease [Leishmania donovani]
gi|58617834|gb|AAW80536.1| cathepsin L-like cysteine protease [Leishmania donovani]
Length = 247
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 40 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 99
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+ ++A++ Y GV C G D L+HGV
Sbjct: 100 RIDGYVMIPSNETVMAAWLAENGPIAIGVDASSFMSYQSGVLTS----CAG--DALNHGV 153
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T PYW+IKNSWG WGEK
Sbjct: 154 LLVGYN------TTGGVPYWVIKNSWGEDWGEK 180
>gi|394333022|gb|AFN27085.1| cysteine protease, partial [Leishmania infantum]
Length = 247
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 90/155 (58%), Gaps = 20/155 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 40 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 99
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 100 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 153
Query: 178 LIVGYGVHKTKFTHKIQ--PYWIIKNSWGPHWGEK 210
L+VGY +KI PYW+IKNSWG WGEK
Sbjct: 154 LLVGY--------NKIGGVPYWVIKNSWGEDWGEK 180
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 86/152 (56%), Gaps = 15/152 (9%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE-EIRVKIQ 120
+LVDCD VD GC GGLM + FE II GG+ E +YPY + C +KE +I+
Sbjct: 178 ELVDCDSVDHGCDGGLMEDGFEFIIKN--GGISSEANYPYTAVDGTCDASKEASPAAQIK 235
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y V ++ E + V N P++V+I+A + QFY GV F + G LDHGV
Sbjct: 236 GYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGV-----FTGQCGT-QLDHGVT 289
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+VGYG TH+ YWI+KNSWG WGE+
Sbjct: 290 VVGYGTTDDG-THE---YWIVKNSWGTQWGEE 317
>gi|295922223|gb|ADG62368.1| cysteine protease [Leishmania donovani]
gi|295971913|gb|ADG63163.1| cysteine protease F [Leishmania donovani]
Length = 239
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 36 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 95
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+ ++A++ Y GV C G D L+HGV
Sbjct: 96 RIDGYVMIPSNETVMAAWLAENGPIAIGVDASSFMSYQSGVLTS----CAG--DALNHGV 149
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T PYW+IKNSWG WGEK
Sbjct: 150 LLVGYN------TTGGVPYWVIKNSWGEDWGEK 176
>gi|394333026|gb|AFN27087.1| cysteine protease, partial [Leishmania infantum]
Length = 242
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 90/155 (58%), Gaps = 20/155 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 40 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 99
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 100 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 153
Query: 178 LIVGYGVHKTKFTHKIQ--PYWIIKNSWGPHWGEK 210
L+VGY +KI PYW+IKNSWG WGEK
Sbjct: 154 LLVGY--------NKIGGVPYWVIKNSWGEDWGEK 180
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 66/172 (38%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 157 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 214
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH NK + + + ++ DE +MA+ + GP++VAI+A+ + QFY GV
Sbjct: 215 EAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGV 274
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VG+G ++ + YW++KNSWG WG+K
Sbjct: 275 YNEPQ--CDA--QNLDHGVLVVGFGTDESG-----EDYWLVKNSWGTTWGDK 317
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 89/153 (58%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM NAF I K GG++ EK YPY+G + +CH NK I
Sbjct: 174 LVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEGIDDSCHFNKATIGATDT 231
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ DE +M K + GP++VAI+A+ + Q Y GV + + C NLDHGV
Sbjct: 232 GFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPE--CD--EQNLDHGV 287
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYG ++ YW++KNSWG WGE+
Sbjct: 288 LVVGYGTDESGM-----DYWLVKNSWGTTWGEQ 315
>gi|354494740|ref|XP_003509493.1| PREDICTED: cathepsin W-like [Cricetulus griseus]
gi|344243260|gb|EGV99363.1| Cathepsin W [Cricetulus griseus]
Length = 376
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 92/169 (54%), Gaps = 8/169 (4%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG--SNRACHLNKEEIRVKI 119
+L+DC++ GC GG + +A+ T+++ GL EKDYP+KG + C N+ + I
Sbjct: 178 ELLDCERCGNGCDGGFVWDAYMTVLN--NSGLASEKDYPFKGYPNPHGCLANRYKKVAWI 235
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q + + DE +A YL +GP+ V IN +Q Y GV C +DH VL+
Sbjct: 236 QDFTMLGRDEQVIAGYLATHGPITVTINMKLLQGYQKGVIKATPTTCDP--QQVDHSVLL 293
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQ 228
VG+G K K IQ I+ + P +++P+WI+KNSWG WGE+
Sbjct: 294 VGFGKGKEK--EDIQSGTILSQTRKPRKPRRSVPYWILKNSWGAEWGEK 340
Score = 38.5 bits (88), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 24/39 (61%)
Query: 13 LEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
LE + +F F K+N+SYA EY +RL IF NL + Q
Sbjct: 34 LELIEVFKLFQIKYNRSYANPAEYARRLNIFAHNLAQAQ 72
>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LV CD VD GC GGLM AF+ +++ G + YPY N + E + I +
Sbjct: 177 ELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGASYPYVSGNGSVPECSESSDLVIGA 236
Query: 122 YVN----VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y++ + S+E MA +L NGP+A+A++A+A Y GGV C G L+HGV
Sbjct: 237 YIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMSYTGGVLTS----CDG--KQLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG +WGEK
Sbjct: 291 LLVGY-----NMTGEV-PYWLIKNSWGENWGEK 317
>gi|394333030|gb|AFN27089.1| cysteine protease, partial [Leishmania infantum]
Length = 236
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 90/155 (58%), Gaps = 20/155 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 34 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 93
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 94 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 147
Query: 178 LIVGYGVHKTKFTHKIQ--PYWIIKNSWGPHWGEK 210
L+VGY +KI PYW+IKNSWG WGEK
Sbjct: 148 LLVGY--------NKIGGVPYWVIKNSWGEDWGEK 174
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 88/153 (57%), Gaps = 18/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ EKDYPYKG++ C +N++ + V I
Sbjct: 186 ELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEKDYPYKGTDGRCDVNRKNAKVVTI 243
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V +++ + + V N P++VAI A A Q Y G+ F G LDHGV
Sbjct: 244 DSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSSGI-----FTGSCGT-ALDHGV 297
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG K YWI+KNSWG WGE
Sbjct: 298 TAVGYGTENGK------DYWIVKNSWGSSWGES 324
>gi|209962693|gb|ACJ02141.1| cathepsin L-like protein [Trypanosoma equiperdum]
Length = 159
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGL NAF I++ GG + E YPY G C +N EI I
Sbjct: 30 LVSCDTIDFGCGGGLTDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 90 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY + PYWIIKN
Sbjct: 144 VGYNDNSNP------PYWIIKN 159
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 92/154 (59%), Gaps = 18/154 (11%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM NAF I K GG++ EK YPY+ + +CH NK I +
Sbjct: 176 LVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEAIDDSCHFNKGSIGATDR 233
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV-SHPLKFLCKGGMDNLDHG 176
+V++ +E +MA+ + GP+AVAI+A+ + QFY GV + P C NLDHG
Sbjct: 234 GFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEP---ACDA--QNLDHG 288
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VG+G ++ + YW++KNSWG WG+K
Sbjct: 289 VLVVGFGTDESG-----EDYWLVKNSWGTTWGDK 317
>gi|295971911|gb|ADG63162.1| cysteine protease F [Leishmania infantum]
Length = 238
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 37 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 96
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+ ++A++ Y GV C G D L+HGV
Sbjct: 97 RIDGYVMIPSNETVMAAWLAENGPIAIGVDASSFMSYQSGVLTS----CAG--DALNHGV 150
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T PYW+IKNSWG WGEK
Sbjct: 151 LLVGYN------TTGGVPYWVIKNSWGEDWGEK 177
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 88/153 (57%), Gaps = 18/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ EKDYPYKG++ C +N++ + V I
Sbjct: 181 ELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEKDYPYKGTDGRCDVNRKNAKVVTI 238
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V +++ + + V N P++VAI A A Q Y G+ F G LDHGV
Sbjct: 239 DSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSSGI-----FTGSCGT-ALDHGV 292
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG K YWI+KNSWG WGE
Sbjct: 293 TAVGYGTENGK------DYWIVKNSWGSSWGES 319
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 97/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 157 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 214
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH NK I + + ++ DE +MA+ + GP++VAI+A+ + QFY GV
Sbjct: 215 EAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGV 274
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VG+G ++ YW++KNSWG WG+K
Sbjct: 275 YNEPQ--CDA--QNLDHGVLVVGFGTDESG-----DDYWLVKNSWGTTWGDK 317
>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
Length = 327
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 80/145 (55%), Gaps = 10/145 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCD+VD GC GG AF+ I+ GGL+ + DYPY+G C + +++V I
Sbjct: 168 DCDEVDEGCNGGTPQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKI 225
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+ DE A+ L + GP + A+NA ++QFY G+ HPL LC +L+H VL VGYG
Sbjct: 226 LPEDEQIQAQMLKETGPFSSALNALSLQFYTEGILHPLPALCDA--QSLNHAVLTVGYGK 283
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGE 209
PYW +KNSW +GE
Sbjct: 284 EGR------LPYWTVKNSWSTMFGE 302
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 97/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 157 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 214
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH NK I + + ++ DE +MA+ + GP++VAI+A+ + QFY GV
Sbjct: 215 EAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGV 274
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VG+G ++ YW++KNSWG WG+K
Sbjct: 275 YNEPQ--CDA--QNLDHGVLVVGFGTDESG-----DDYWLVKNSWGTTWGDK 317
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 87/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCDK + GC GGLM AF+ II GG++ E+DYPYKG + AC N++ + V I
Sbjct: 200 ELVDCDKSFNMGCNGGLMDYAFQFIIGN--GGIDTEEDYPYKGRDAACDPNRKNAKVVTI 257
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ K V N P++VAI A A Q Y GV F + G D LDHGV
Sbjct: 258 DGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQLYQSGV-----FTGRCGTD-LDHGV 311
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG T YWI++NSWG WGE
Sbjct: 312 VAVGYG------TDNGTDYWIVRNSWGKDWGE 337
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/152 (44%), Positives = 88/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ EKDYPYKG++ C +N++ + V I
Sbjct: 184 ELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEKDYPYKGTDGRCDVNRKNAKVVTI 241
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V +++ + + V N P++VAI A QF Y G+ F G LDHGV
Sbjct: 242 DSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQLYSSGI-----FTGSCGT-ALDHGV 295
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI+KNSWG WGE
Sbjct: 296 TAVGYGTENGK------DYWIVKNSWGSSWGE 321
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 86/152 (56%), Gaps = 15/152 (9%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE-EIRVKIQ 120
+LVDCD VD GC GGLM + FE II GG+ E +YPY + C +KE +I+
Sbjct: 172 ELVDCDSVDHGCDGGLMEDGFEFIIKN--GGISSEANYPYTAVDGTCDASKEASPAAQIK 229
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y V ++ E + V N P++V+I+A + QFY GV F + G LDHGV
Sbjct: 230 GYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGV-----FTGQCGT-QLDHGVT 283
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+VGYG TH+ YWI+KNSWG WGE+
Sbjct: 284 VVGYGTTDDG-THE---YWIVKNSWGTQWGEE 311
>gi|311698047|gb|ADQ00318.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698049|gb|ADQ00319.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VDAGC GGLM NAF+ ++ G + E YPY G AC ++ E+ I
Sbjct: 30 LVSCDTVDAGCNGGLMDNAFQWLVDSNKGKVYTESSYPYVSGSGQTPACSTSEHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|332326585|gb|AEE42616.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 88/153 (57%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIR-V 117
+LV CD D+GC GGLM+ AFE ++ + G + E YPY G C + E +
Sbjct: 177 QLVSCDDKDSGCXGGLMTQAFEWLLRXMNGTMFTEDSYPYVSSTGDVPECTNSSELVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L K+GP+++ ++A++ Y GV L +L+HGV
Sbjct: 237 RIDGYVMIESNETVMAAWLAKSGPISIGVDASSFMSYESGV------LTSCAGKHLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 291 LLVGY-----NMTGEV-PYWVIKNSWGEDWGEK 317
>gi|395852405|ref|XP_003798729.1| PREDICTED: cathepsin W [Otolemur garnettii]
Length = 367
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 86/156 (55%), Gaps = 11/156 (7%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC++ GC GG + +AF T+++ GL EKDYP+K S + C NK I
Sbjct: 180 ELLDCNRCGDGCQGGFVWDAFITVLN--NSGLASEKDYPFKASVKTHRCLANKYRKVAWI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + +E ++A+YL +GP+ V IN +Q Y GV C + N H VL+
Sbjct: 238 QDFIMLEDNEHKIAQYLATHGPITVTINMKLLQHYKKGVIKAKPTTCDPQLVN--HSVLL 295
Query: 180 VGYGVHKTKFT-----HKIQPYWIIKNSWGPHWGEK 210
VG+G H+ PYWI+KNSWG HWGE+
Sbjct: 296 VGFGAETVSSQSHLRPHRSTPYWILKNSWGAHWGEE 331
>gi|311697973|gb|ADQ00281.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697981|gb|ADQ00285.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697983|gb|ADQ00286.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697989|gb|ADQ00289.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697997|gb|ADQ00293.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697999|gb|ADQ00294.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VDAGC GGLM NAF+ ++ G + E YPY G AC ++ E+ I
Sbjct: 30 LVSCDTVDAGCNGGLMDNAFQWLVDSNKGKVYTENSYPYVSGSGQTPACSTSEHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
Length = 323
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 87/150 (58%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI-Q 120
+++DCD VD GC GGL+ AFE IIS GG++ E DYPY+ SN C ++ + V + Q
Sbjct: 163 QMIDCDSVDVGCEGGLLHTAFEAIISM--GGVQIENDYPYESSNNYCRMDPTKFVVGVKQ 220
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
++ E ++ L GP+ VAI+A+ + Y G+ +K+ G L+H VL+V
Sbjct: 221 CNRYITIYEEKLKDVLRLAGPIPVAIDASDILNYEQGI---IKYCANNG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV PYWI+KNSWG WGE+
Sbjct: 275 GYGVENNV------PYWILKNSWGTDWGEQ 298
>gi|358364417|gb|AEU08939.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364419|gb|AEU08940.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/142 (42%), Positives = 80/142 (56%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD +D GC GGLM NAF+ ++ GG + E YPY GS R AC + E+ K+
Sbjct: 30 LVSCDTLDQGCNGGLMDNAFKWLVDSNGGNVYTENSYPYVSGSGRTPACSTRQHEVGAKV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
SY+++ DE +MA +L NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TSYLDLPQDEDKMAAWLAANGPIAVAVDANSFLSYVSGV------LTNCESHQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
Length = 316
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 80/145 (55%), Gaps = 10/145 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCD+VD GC GG AF+ I+ GGL+ + DYPY+G C + +++V I
Sbjct: 157 DCDEVDEGCNGGTPQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKI 214
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+ DE A+ L + GP++ A+NA +QFY G+ HPL LC +L+H VL VGYG
Sbjct: 215 LPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYGK 272
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGE 209
PYW +KNSW +GE
Sbjct: 273 EGRL------PYWTVKNSWSTMFGE 291
>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
Length = 327
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 80/145 (55%), Gaps = 10/145 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCD+VD GC GG AF+ I+ GGL+ + DYPY+G C + +++V I
Sbjct: 168 DCDEVDEGCNGGTPQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKI 225
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+ DE A+ L + GP++ A+NA +QFY G+ HPL LC +L+H VL VGYG
Sbjct: 226 LPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYGK 283
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGE 209
PYW +KNSW +GE
Sbjct: 284 EGRL------PYWTVKNSWSTMFGE 302
>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LV CD VD GC GGLM AF+ +++ G + YPY N + E + I +
Sbjct: 177 ELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGVSYPYVSGNGSVPECSESSDLVIGA 236
Query: 122 YVN----VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y++ + S+E MA +L NGP+A+A++A+A Y GGV C G L+HGV
Sbjct: 237 YIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMSYTGGVLTS----CDG--KQLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG +WGEK
Sbjct: 291 LLVGY-----NMTGEV-PYWLIKNSWGENWGEK 317
>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
Length = 324
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 85/149 (57%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE +I GG++ EKDYPY+ +N C +N + VK++
Sbjct: 164 QMIDCDSVDAGCNGGLLHTAFEAVIKM--GGVQLEKDYPYEAANNNCRMNSNKFLVKVKD 221
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 222 CYRYIIVYEEKLKDLLRSVGPIPMAIDAADIVNYKQGI---IKYCLNSG---LNHAVLLV 275
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 276 GYGVENNI------PYWTFKNTWGTDWGE 298
>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
Length = 428
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 88/154 (57%), Gaps = 17/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEE--IR 116
+LV CD +D GC GGLM NAF +IS G + E +YPY N AC + E +
Sbjct: 150 ELVSCDPIDDGCNGGLMDNAFGWLISAHKGQIATEANYPYVSGNGIVPACSSSPESKPVG 209
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I ++ +++ E +MA ++ K+GP+++ ++A+ Q Y GG+ C D +DHG
Sbjct: 210 ATISAFQDIARTEEDMAAFVFKHGPLSIGVDASTWQSYAGGIMS----YCP--QDQIDHG 263
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VLIVG+ T T PYWIIKNSW +WGE+
Sbjct: 264 VLIVGF--DDTAST----PYWIIKNSWTANWGEE 291
>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
Length = 443
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 88/154 (57%), Gaps = 17/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEE--IR 116
+LV CD +D GC GGLM NAF +IS G + E +YPY N AC + E +
Sbjct: 165 ELVSCDPIDDGCNGGLMDNAFGWLISAHKGQIATEANYPYVSGNGIVPACSSSPESKPVG 224
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I ++ +++ E +MA ++ K+GP+++ ++A+ Q Y GG+ C D +DHG
Sbjct: 225 ATISAFQDIARTEEDMAAFVFKHGPLSIGVDASTWQSYAGGIMS----YCP--QDQIDHG 278
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VLIVG+ T T PYWIIKNSW +WGE+
Sbjct: 279 VLIVGF--DDTAST----PYWIIKNSWTANWGEE 306
>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LV CD VD GC GGLM AF+ +++ G + YPY N + E + I +
Sbjct: 177 ELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGVSYPYVSGNGSVPECSESSDLVIGA 236
Query: 122 YVN----VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y++ + S+E MA +L NGP+A+A++A+A Y GGV C G L+HGV
Sbjct: 237 YIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMSYTGGVLTS----CDG--KQLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG +WGEK
Sbjct: 291 LLVGY-----NMTGEV-PYWLIKNSWGKNWGEK 317
>gi|58617838|gb|AAW80538.1| cathepsin L-like cysteine protease [Leishmania donovani]
Length = 247
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 90/155 (58%), Gaps = 20/155 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + +K YPY N A LN ++
Sbjct: 40 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTDKSYPYTSGNGDVAECLNSSKLVPGA 99
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 100 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 153
Query: 178 LIVGYGVHKTKFTHKIQ--PYWIIKNSWGPHWGEK 210
L+VGY +KI PYW+IKNSWG WGEK
Sbjct: 154 LLVGY--------NKIGGVPYWVIKNSWGEDWGEK 180
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 92/153 (60%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM NAF I K GG++ EK YPY+ + +CH NK I +
Sbjct: 252 LVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEALDDSCHFNKGTIGATDR 309
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ +E ++A+ + GP++VAI+A+ + QFY GV ++ C NLDHGV
Sbjct: 310 GFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGVY--VEPACDA--QNLDHGV 365
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VG+G ++ Q YW++KNSWG WG+K
Sbjct: 366 LVVGFGTDESG-----QDYWLVKNSWGTTWGDK 393
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 95/174 (54%), Gaps = 19/174 (10%)
Query: 45 ANLKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
A L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK Y
Sbjct: 151 AALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSY 208
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
PY+G + +CH K + +V++ DE + K + GP++VAI+A+ + Q Y
Sbjct: 209 PYEGIDDSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAIDASHESFQLYSE 268
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GV + + C NLDHGVL+VGYG KT YW++KNSWG WG++
Sbjct: 269 GVYNEPE--CDA--QNLDHGVLVVGYGTDKTGL-----DYWLVKNSWGTTWGDQ 313
>gi|20301805|gb|AAM15726.1| cysteine protease [Pagumogonimus skrjabini]
Length = 165
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 82/142 (57%), Gaps = 10/142 (7%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD+ GC GG ++++ I+ + GGLE + DYPY G + C LNKE++ KI
Sbjct: 34 QLVDCDRAAEGCNGGWPVSSYQEIM--VMGGLESQDDYPYVGKEQQCALNKEKLVAKIDD 91
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + + E E A YL ++GP++ +NA A+Q Y GV P C D L+H VL VG
Sbjct: 92 LVVLGAYEEEHAAYLAEHGPLSTLLNAVALQHYQSGVLKPSYEDCPD--DVLNHAVLTVG 149
Query: 182 YGVHKTKFTHKIQPYWIIKNSW 203
Y T PYWI+KNSW
Sbjct: 150 YD------TEGDDPYWIVKNSW 165
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 84/149 (56%), Gaps = 13/149 (8%)
Query: 63 LVDCDKVDA-GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
LVDC K D GC GG M A E I + GG+ E DYPY+G + C + ++ KI +
Sbjct: 162 LVDCAKEDCYGCSGGYMDKALEYI--ETAGGIMSENDYPYEGIDDKCRFDSSKVAAKISN 219
Query: 122 YVNVS-SDETEMAKYLVKNGPMAVAINAN-AMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ + +DE ++ ++ GP++VAI+A+ Q Y G+ C ++L+HGVL+
Sbjct: 220 FTYIKKNDEDDLKNAVIAKGPISVAIDASFNFQLYDSGILDDSS--CYSDFNSLNHGVLV 277
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
VGYG T K Q YWI+KNSWG WG
Sbjct: 278 VGYG------TEKEQDYWIVKNSWGADWG 300
>gi|320543907|ref|NP_001188921.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
gi|318068589|gb|ADV37168.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
Length = 249
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/172 (38%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 65 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPY 122
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH NK + + + ++ DE +MA+ + GP++VAI+A+ + QFY GV
Sbjct: 123 EAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGV 182
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + C NLDHGVL+VG+G ++ + YW++KNSWG WG+K
Sbjct: 183 YNEPQ--CDA--QNLDHGVLVVGFGTDESG-----EDYWLVKNSWGTTWGDK 225
>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 90/154 (58%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC D + GC GGLM AF+ + + G++ E+ YPY+G +C + E +
Sbjct: 161 AQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGEYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ + C ++L+HG
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE-RCRCSNKREDLNHG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 87/153 (56%), Gaps = 17/153 (11%)
Query: 63 LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM AF I K GG++ E YPY GS+ C + ++ +
Sbjct: 166 LVDCSTSEGNQGCNGGLMDQAFTYI--KKNGGIDTEAAYPYTGSDGTCRFLENKVGATVS 223
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINANAM--QFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V+V S DE + + + GP++VAI+A+++ QFY GGV +P + C LDHGV
Sbjct: 224 GFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNP--WFCSS--TELDHGV 279
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYG K YW++KNSWG WG K
Sbjct: 280 LVVGYGTEGGK------DYWLVKNSWGSSWGLK 306
>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 90/154 (58%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC D + GC GGLM AF+ + + G++ E+ YPY+G +C + E +
Sbjct: 161 AQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGEYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ + C ++L+HG
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE-RCRCSNKREDLNHG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|16076439|emb|CAC94444.1| cysteine proteinase [Betula pendula]
Length = 133
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 76/116 (65%), Gaps = 8/116 (6%)
Query: 69 VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSS 127
D+GC GGLM+ AFE + GGLE EKDYPY G++R +C +K +I + ++ VS
Sbjct: 12 CDSGCSGGLMTTAFEYTLK--AGGLEREKDYPYTGTDRGSCKFDKSKIAASVSNFSVVSI 69
Query: 128 DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG 183
DE ++A LVKNGP+A+ INA MQ Y GVS P ++C LDHGVL+VGYG
Sbjct: 70 DEDQIAANLVKNGPLAIGINAAFMQTYMKGVSCP--YICG---RRLDHGVLLVGYG 120
>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 90/154 (58%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC D + GC GGLM AF+ + + G++ E+ YPY+G +C + E +
Sbjct: 161 AQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGEYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ + C ++L+HG
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE-RCRCSNKREDLNHG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|58617842|gb|AAW80540.1| cathepsin L-like cysteine protease [Leishmania donovani]
Length = 213
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 90/155 (58%), Gaps = 20/155 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + +K YPY N A LN ++
Sbjct: 40 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTDKSYPYTSGNGDVAECLNSSKLVPGA 99
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 100 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 153
Query: 178 LIVGYGVHKTKFTHKIQ--PYWIIKNSWGPHWGEK 210
L+VGY +KI PYW+IKNSWG WGEK
Sbjct: 154 LLVGY--------NKIGEVPYWVIKNSWGEDWGEK 180
>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 90/154 (58%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC D + GC GGLM AF+ + + G++ E+ YPY+G +C + E +
Sbjct: 161 AQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGEYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ + C ++L+HG
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE-RCRCSNKREDLNHG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/174 (40%), Positives = 91/174 (52%), Gaps = 19/174 (10%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
+L+ R G ++L +LVDC + GC GGLM NAFE I S GGLEGE DY
Sbjct: 174 GSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSI--GGLEGEDDY 231
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNVSS-DETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
PY CHL K + +V S DE + L GP++VAI+A+ + Q Y G
Sbjct: 232 PYTAKQGKCHLKKSLFKANDTGCTDVESGDEDALKDALASVGPISVAIDASHASFQSYDG 291
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GV + + NLDHGVL VGYG + YW++KNSWG WGE+
Sbjct: 292 GVYDEEECSSQ----NLDHGVLTVGYGTEENG-----GDYWLVKNSWGEMWGEE 336
>gi|6649569|gb|AAF21458.1|U56865_1 cysteine proteinase [Paragonimus westermani]
Length = 197
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 73/122 (59%), Gaps = 4/122 (3%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD+VD GC GG +A+ + K GG+E + YPY G C L++ +
Sbjct: 74 QLVDCDRVDEGCNGGYPMDAYNEL--KRMGGVEAQSTYPYTGRESQCRLDERRFVAYLND 131
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V + DE + A +L NGP++VA+NA+ +QFY G+SHP K+LC L+H VL VG
Sbjct: 132 SVMLPKDEVKQAAWLADNGPLSVALNADQLQFYRRGISHPPKYLCPAS--GLNHAVLSVG 189
Query: 182 YG 183
YG
Sbjct: 190 YG 191
>gi|332249835|ref|XP_003274061.1| PREDICTED: cathepsin W [Nomascus leucogenys]
Length = 403
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 89/165 (53%), Gaps = 20/165 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC + GC GG + +AF T+++ GL EKDYP++G RA CH K + I
Sbjct: 207 ELLDCSRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPFQGKVRAHRCHPKKYQKVAWI 264
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + + E +A+YL GP+ V IN +Q Y GV C + +DH VL+
Sbjct: 265 QDFIMLQNSEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATSTTCDPQL--VDHSVLL 322
Query: 180 VGYGVHKTK-------FTHKIQ-------PYWIIKNSWGPHWGEK 210
VG+G K++ + + Q PYWI+KNSWG WGEK
Sbjct: 323 VGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEK 367
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/173 (38%), Positives = 96/173 (55%), Gaps = 20/173 (11%)
Query: 45 ANLKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
+L+ R G ++L LVDC + GC GGLM NAF+ I K GG++ EK Y
Sbjct: 152 GSLEGQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYI--KANGGIDTEKSY 209
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
PY G++ CH K ++ +V++ +E + K + GP++VAI+A+ + QFY
Sbjct: 210 PYNGTDGTCHFKKSDVGATDTGFVDIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQ 269
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GV + C +NLDHGVL+VGYG T Q YW++KNSWG WG+
Sbjct: 270 GVYDEPE--CSS--ENLDHGVLVVGYG------TKDDQDYWLVKNSWGTTWGD 312
>gi|413956349|gb|AFW88998.1| hypothetical protein ZEAMMB73_678859 [Zea mays]
Length = 1140
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 87/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ EKDYPYKG++ C +N++ + V I
Sbjct: 809 ELVDCDTSYNQGCNGGLMDYAFEFIIN--NGGIDTEKDYPYKGTDGRCDVNRKNAKVVTI 866
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V +++ + + V N P++VAI A Q Y G+ F G LDHGV
Sbjct: 867 DSYEDVPANDEKSLQKAVANQPVSVAIEAAGTTFQLYSSGI-----FTGSCGT-ALDHGV 920
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI+KNSWG WGE
Sbjct: 921 TAVGYGTENGK------DYWIMKNSWGSSWGE 946
>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 90/154 (58%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC D + GC GGLM AF+ + + G++ E+ YPY+G +C + E +
Sbjct: 161 AQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGEYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ + C ++L+HG
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE-RCRCSNKREDLNHG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|449532567|ref|XP_004173252.1| PREDICTED: oryzain alpha chain-like [Cucumis sativus]
Length = 321
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 87/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCDK + GC GGLM AF+ II GG++ E+DYPYKG + AC N++ + V I
Sbjct: 42 ELVDCDKSFNMGCNGGLMDYAFQFIIGN--GGIDTEEDYPYKGRDAACDPNRKNAKVVTI 99
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ K V N P++VAI A A Q Y GV F + G D LDHGV
Sbjct: 100 DGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQLYQSGV-----FTGRCGTD-LDHGV 153
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ VGYG T YWI++NSWG WGE
Sbjct: 154 VAVGYG------TDNGTDYWIVRNSWGKDWGES 180
>gi|114679921|ref|YP_758371.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
gi|39598652|gb|AAR28838.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
Length = 359
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD++D GC GGLM AF+ I+ GGLE E YPY+G + AC LN + VK+
Sbjct: 199 QLVDCDQIDQGCSGGLMHLAFQEILQM--GGLESELVYPYQGVDYACRLNPRKFDVKLSD 256
Query: 122 YVNVS-SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
DE ++ + + GP+AVAI+ + Y G+ +C + L+H VL+V
Sbjct: 257 CHRYDLRDERKLRELVYTVGPIAVAIDCIDIIDYKSGIVS----MCNN--NGLNHAVLLV 310
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G+G+ PYWI+KNSWG WGEK
Sbjct: 311 GFGIEFDT------PYWILKNSWGNDWGEK 334
>gi|241062152|gb|ACS66748.1| cysteine protease [Leishmania guyanensis]
Length = 441
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKE-EIRV 117
+LV CD VD GC GGLM AF+ +++ G + YPY GS C + E +
Sbjct: 177 ELVSCDDVDEGCNGGLMLQAFDWLLNNKNGAVYTGASYPYVSGNGSVPECSESSELVVGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
I +V + S+E MA +L NGP+A+A++A+A Y GG+ C G L+HGV
Sbjct: 237 YIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMSYTGGILTS----CDG--RQLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG +WGEK
Sbjct: 291 LLVGY-----NMTGEV-PYWLIKNSWGENWGEK 317
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 73/180 (40%), Positives = 94/180 (52%), Gaps = 37/180 (20%)
Query: 54 GEGTHLALK-LVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
G+ L+++ LVDCDK + GC GGLM AF+ +I GG++ EKDYPY+G + C +N
Sbjct: 175 GDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQN--GGIDTEKDYPYQGYDGRCDVN 232
Query: 112 KEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKG 168
K RV I SY +V ++ E K V P++VAI A Q Y GGV F +
Sbjct: 233 KMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGV-----FTGRC 287
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQ 228
G D LDHGVL VGYG EK + +WI+KNSWG WGE
Sbjct: 288 GTD-LDHGVLAVGYG------------------------SEKGLDYWIVKNSWGEYWGES 322
>gi|161598418|gb|ABX74953.1| cysteine protease [Leishmania panamensis]
Length = 441
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKE-EIRV 117
+LV CD VD GC GGLM AF+ +++ G + YPY GS C + E +
Sbjct: 177 ELVSCDDVDEGCNGGLMLQAFDWLLNNKNGAVYTGASYPYVSGNGSVPECSESSELVVGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
I +V + S+E MA +L NGP+A+A++A+A Y GG+ C G L+HGV
Sbjct: 237 YIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMSYTGGILTS----CDG--RQLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG +WGEK
Sbjct: 291 LLVGY-----NMTGEV-PYWLIKNSWGENWGEK 317
>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
Length = 327
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 79/145 (54%), Gaps = 10/145 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCD VD GC GG AF+ I+ GGL+ + DYPY+G C + +++V I
Sbjct: 168 DCDGVDEGCNGGTPQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKI 225
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+ DE A+ L + GP++ A+NA +QFY G+ HPL LC +L+H VL VGYG
Sbjct: 226 LPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYGK 283
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGE 209
PYW +KNSW +GE
Sbjct: 284 EGR------LPYWTVKNSWSTMFGE 302
>gi|209962662|gb|ACJ02126.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ + G + EK YPY G C + E+ I
Sbjct: 30 LVSCDTKDGGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCQRDGHEVGAVI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A + Y GGV + L+HGVL+
Sbjct: 90 TGHVDIPQDEAAIAKYLADNGPVAVAVDATSFMSYSGGVVTSCT------SEQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY + PYWIIKN
Sbjct: 144 VGY------YDSSKPPYWIIKN 159
>gi|114638622|ref|XP_001170363.1| PREDICTED: cathepsin W [Pan troglodytes]
Length = 376
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 90/165 (54%), Gaps = 20/165 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC + GC GG + +AF T+++ GL EKDYP++G RA CH K + I
Sbjct: 180 ELLDCSRCGDGCQGGFVWDAFITVLNN--SGLASEKDYPFQGKVRAHRCHPKKYQKVAWI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + ++E +A+YL GP+ V IN ++ Y GV C + +DH VL+
Sbjct: 238 QDFIMLQNNEHRIAQYLATYGPITVTINMKPLRLYRKGVIKATPTTCDPQL--VDHSVLL 295
Query: 180 VGYGVHKT-------KFTHKIQ-------PYWIIKNSWGPHWGEK 210
VG+G K+ + + + Q PYWI+KNSWG WGEK
Sbjct: 296 VGFGSVKSEEGIWAERVSSQSQPQPPHPTPYWILKNSWGAQWGEK 340
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 88/164 (53%), Gaps = 18/164 (10%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
LVDC +AGC GGLM AF+ IIS G++ E YPY + C N + +
Sbjct: 159 NLVDCSSAQGNAGCNGGLMDQAFQYIISN--NGIDTESSYPYTAQDGTCQFNSANVGATV 216
Query: 120 QSYVNVSS-DETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHG 176
SY +++S E+++ + GP++VAI+A+ + QFY GV + + C LDHG
Sbjct: 217 ASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGVYN--EPACSSSQ--LDHG 272
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNS 220
VL VGYG T YW++KNSWG WG+ W+ +NS
Sbjct: 273 VLAVGYG------TSGSSDYWLVKNSWGTSWGQSGY-IWMTRNS 309
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 84/149 (56%), Gaps = 14/149 (9%)
Query: 63 LVDCDKVDA-GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
LVDC K GCGGG M A E I GG+ EKDYPY+G + C + ++ KI +
Sbjct: 162 LVDCAKDTCYGCGGGWMDKALEYIEK---GGIMSEKDYPYEGVDDNCRFDISKVAAKISN 218
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ + +DE ++ + GP++VAI+A+A Q Y G+ + C D+L+HGVL+
Sbjct: 219 FTYIKKNDEEDLKNAVAAKGPISVAIDASATFQLYVSGILDDTE--CSNEFDSLNHGVLV 276
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
VGYG K YWIIKNSWG +WG
Sbjct: 277 VGYGTENGK------DYWIIKNSWGVNWG 299
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 94/167 (56%), Gaps = 19/167 (11%)
Query: 52 IRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR 106
R GT ++L LVDC + GC GGLM NAF + K GG++ EK Y Y+G +
Sbjct: 162 FRKSGTLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYV--KDNGGIDTEKSYAYEGIDD 219
Query: 107 ACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLK 163
+CH +K I + + ++ +E ++A+ + GP++VAI+A+ + QFY GV
Sbjct: 220 SCHFDKNSIGATDRGFADIPQGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPN 279
Query: 164 FLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
C +NLDHGVL+VGYG K YW++KNSWG WG+K
Sbjct: 280 --CSA--ENLDHGVLVVGYGTEKDG-----SDYWLVKNSWGTTWGDK 317
>gi|133777889|gb|AAI15439.1| Ctsf protein [Mus musculus]
Length = 174
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/123 (45%), Positives = 78/123 (63%), Gaps = 4/123 (3%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCDKVD C GGL SNA+ I K GGLE E DY Y+G + C+ + + +V I V
Sbjct: 56 DCDKVDKACLGGLPSNAYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVE 113
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+S +E ++A +L + GP++VAINA MQFY G++HP + LC +DH VL+VGYG
Sbjct: 114 LSRNENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWF--IDHAVLLVGYGN 171
Query: 185 HKT 187
++
Sbjct: 172 RRS 174
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 91/155 (58%), Gaps = 20/155 (12%)
Query: 62 KLVDCD--KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LVDCD +D GC GGLM NAFE II GGL E +YPY G++ +C+ NKE V
Sbjct: 248 ELVDCDVDGMDQGCEGGLMDNAFEFIIDN--GGLTTEGNYPYTGTDDSCNSNKESNDVAS 305
Query: 119 IQSYVNV-SSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDH 175
I+ Y +V S+DET + K + P+++A++ N +FY GGV L C LDH
Sbjct: 306 IKGYEDVPSNDETSLLKAVAAQ-PVSIAVDGGDNLFRFYKGGV---LSGACG---TELDH 358
Query: 176 GVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G+ VGYG+ T +W++KNSWG WGEK
Sbjct: 359 GIAAVGYGI-----TSDGTKFWLMKNSWGTSWGEK 388
>gi|323451241|gb|EGB07119.1| hypothetical protein AURANDRAFT_54023 [Aureococcus anophagefferens]
Length = 377
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 84/160 (52%), Gaps = 18/160 (11%)
Query: 63 LVDC---DKVDAG------CGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE 113
LVDC D++D G C GGLM NAF+ II GG++ E Y Y G + C +K
Sbjct: 200 LVDCVKKDQIDGGDECCMGCSGGLMDNAFDYIIKNQDGGIDTEASYGYTGKDGTCAFDKA 259
Query: 114 EIRVKIQSYVNVS-SDETEMAKYLVKNGPMAVAINANAM-QFYFGGVSHPLKFL-CKGGM 170
+ I ++ +V+ DE +A L GP+++A++A+ Q Y GG+ P L C
Sbjct: 260 NVGATISNWTDVAVGDEVALADALANAGPVSIALDASKQWQLYSGGILKPRSILGCSSDP 319
Query: 171 DNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ DHGV IVGYG T YW I+NSWG WGE
Sbjct: 320 THADHGVAIVGYG------TDDGVDYWWIRNSWGTTWGES 353
>gi|259016196|sp|P56202.2|CATW_HUMAN RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
Precursor
Length = 376
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 20/165 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC + GC GG + +AF T+++ GL EKDYP++G RA CH K + I
Sbjct: 180 ELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPFQGKVRAHRCHPKKYQKVAWI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + ++E +A+YL GP+ V IN +Q Y GV C + +DH VL+
Sbjct: 238 QDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQL--VDHSVLL 295
Query: 180 VGYGVHKTK-------FTHKIQ-------PYWIIKNSWGPHWGEK 210
VG+G K++ + + Q PYWI+KNSWG WGEK
Sbjct: 296 VGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEK 340
>gi|23110964|ref|NP_001326.2| cathepsin W preproprotein [Homo sapiens]
gi|29476894|gb|AAH48255.1| Cathepsin W [Homo sapiens]
gi|119594870|gb|EAW74464.1| cathepsin W (lymphopain), isoform CRA_b [Homo sapiens]
Length = 376
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 20/165 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC + GC GG + +AF T+++ GL EKDYP++G RA CH K + I
Sbjct: 180 ELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPFQGKVRAHRCHPKKYQKVAWI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + ++E +A+YL GP+ V IN +Q Y GV C + +DH VL+
Sbjct: 238 QDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQL--VDHSVLL 295
Query: 180 VGYGVHKTK-------FTHKIQ-------PYWIIKNSWGPHWGEK 210
VG+G K++ + + Q PYWI+KNSWG WGEK
Sbjct: 296 VGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEK 340
>gi|2582045|gb|AAB82449.1| lymphopain [Homo sapiens]
gi|2582181|gb|AAB82457.1| lymphopain [Homo sapiens]
gi|3033547|gb|AAC32181.1| cathepsin W [Homo sapiens]
Length = 376
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 20/165 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC + GC GG + +AF T+++ GL EKDYP++G RA CH K + I
Sbjct: 180 ELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPFQGKVRAHRCHPKKYQKVAWI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + ++E +A+YL GP+ V IN +Q Y GV C + +DH VL+
Sbjct: 238 QDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQL--VDHSVLL 295
Query: 180 VGYGVHKTK-------FTHKIQ-------PYWIIKNSWGPHWGEK 210
VG+G K++ + + Q PYWI+KNSWG WGEK
Sbjct: 296 VGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEK 340
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 96/176 (54%), Gaps = 20/176 (11%)
Query: 53 RGEGTHLALK---LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA 107
R GT ++L LVDC + GC GGLM +AFE II GG++ E YPY +
Sbjct: 147 RKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKN--GGIDTEASYPYTATTGT 204
Query: 108 CHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINANAM--QFYFGGVSHPLKF 164
C N I + SY ++ + E+++ + GP++VAI+A+ + QFYF GV + K
Sbjct: 205 CKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVYNEKK- 263
Query: 165 LCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNS 220
C LDHGVL VGYG + + + YW++KNSWG WG K W+ +N+
Sbjct: 264 -CS--TTQLDHGVLAVGYGT-----STEGKDYWLVKNSWGATWG-KAGYIWMSRNA 310
>gi|311697991|gb|ADQ00290.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VDAGC GGLM +AF+ ++ G + E YPY G AC ++ E+ I
Sbjct: 30 LVSCDTVDAGCNGGLMDDAFQWLVDSNKGKVYTENSYPYVSGSGQTPACSTSEHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|397516975|ref|XP_003828695.1| PREDICTED: cathepsin W [Pan paniscus]
Length = 376
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 90/165 (54%), Gaps = 20/165 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC + GC GG + +AF T+++ GL EKDYP++G RA CH K + I
Sbjct: 180 ELLDCSRCGDGCQGGFVWDAFITVLNN--SGLASEKDYPFQGKVRAHRCHPKKYQKVAWI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + ++E +A+YL GP+ V IN ++ Y GV C + +DH VL+
Sbjct: 238 QDFIMLQNNEHRIAQYLATYGPITVTINMKPLRLYRKGVIKATPTTCDPQL--VDHSVLL 295
Query: 180 VGYGVHKTK-------FTHKIQ-------PYWIIKNSWGPHWGEK 210
VG+G K++ + + Q PYWI+KNSWG WGEK
Sbjct: 296 VGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEK 340
>gi|375073954|gb|AFA34844.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073956|gb|AFA34845.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
Length = 159
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 81/142 (57%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY T PYWIIKN
Sbjct: 144 VGYNDSATV------PYWIIKN 159
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKIQ 120
+LVDCD + GC GG M AFE +I+ GG++ E DYPY G C++ KEE + V I
Sbjct: 175 ELVDCDTTNDGCEGGYMDYAFEWVINN--GGIDTEADYPYIGVGGTCNVTKEETKVVTID 232
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V+ ++ + VK P++V I+ + + F Y GG+ C D++DH VL
Sbjct: 233 GYTDVTQSDSALFCATVKQ-PISVGIDGSTLDFQLYTGGIYDG---DCSSNPDDIDHAVL 288
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
IVGYG Q YWI+KNSWG WG
Sbjct: 289 IVGYGSDGN------QDYWIVKNSWGTSWG 312
>gi|357439999|ref|XP_003590277.1| Cysteine protease [Medicago truncatula]
gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula]
Length = 514
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKIQ 120
+LVDCD + GC GG M AFE +I+ GG++ E DYPY G C++ KEE + V I
Sbjct: 235 ELVDCDTTNDGCEGGYMDYAFEWVINN--GGIDTEADYPYIGVGGTCNVTKEETKVVTID 292
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V+ ++ + VK P++V I+ + + F Y GG+ C D++DH VL
Sbjct: 293 GYTDVTQSDSALFCATVKQ-PISVGIDGSTLDFQLYTGGIYDG---DCSSNPDDIDHAVL 348
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
IVGYG Q YWI+KNSWG WG
Sbjct: 349 IVGYGSDGN------QDYWIVKNSWGTSWG 372
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 87/152 (57%), Gaps = 17/152 (11%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM NAF+ I K G++ EK YPY+G + C K I
Sbjct: 170 LVDCSGSYGNNGCEGGLMDNAFQYI--KENHGIDTEKSYPYEGEDETCRFRKTSIGATDS 227
Query: 121 SYVNVSS-DETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V+++ DE + + + GP++VAI+A+ + QFY GV + + C +NLDHGV
Sbjct: 228 GFVDITQGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEPE--CSS--ENLDHGV 283
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGYGV Q YW++KNSWG WG+
Sbjct: 284 LVVGYGVEDN------QKYWLVKNSWGTQWGD 309
>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
Length = 326
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 92/166 (55%), Gaps = 23/166 (13%)
Query: 52 IRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY G C
Sbjct: 153 FRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTY-TAIQKMGG-LELASDYPYTGVGGIC 210
Query: 109 HLNKEEIRVKIQSYVNVSS----DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKF 164
H++K K +YVN S+ E A+ L GP++ A+NA+ +Q Y GG+ P K+
Sbjct: 211 HMDKS----KFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP-KW 265
Query: 165 LCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G ++HGVL VGYGV K PYWI+KNSWG +GE+
Sbjct: 266 CDPAG---VNHGVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 302
>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
Length = 442
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 83/155 (53%), Gaps = 19/155 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEE--IR 116
+LV CD VD GC GGLM NAF ++S G + E YPY N AC N +
Sbjct: 165 ELVSCDTVDDGCSGGLMDNAFGWLLSAHNGQITTEASYPYVSGNGIVPACTFNSNSNPVG 224
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGV-SHPLKFLCKGGMDNLDH 175
I S+ ++ E +MA ++ K GP+++ ++A++ Q Y GG+ SH +DH
Sbjct: 225 ATITSFHDIPKTERDMAAFVFKYGPLSIGVDASSWQSYIGGILSHCSDV-------QIDH 277
Query: 176 GVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GVLIVG+ + PYWIIKNSW WGE+
Sbjct: 278 GVLIVGFDDTAST------PYWIIKNSWSSMWGEQ 306
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 86/152 (56%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AF+ II+ GG++ E DYPYKG + C +N++ + V I
Sbjct: 180 ELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTEDDYPYKGKDERCDVNRKNAKVVTI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V+ + + V+N P++VAI A A Q Y G+ F K G LDHGV
Sbjct: 238 DSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSSGI-----FTGKCGT-ALDHGV 291
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI++NSWG WGE
Sbjct: 292 AAVGYGTENGK------DYWIVRNSWGKSWGE 317
>gi|119594869|gb|EAW74463.1| cathepsin W (lymphopain), isoform CRA_a [Homo sapiens]
Length = 262
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 20/165 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC + GC GG + +AF T+++ GL EKDYP++G RA CH K + I
Sbjct: 66 ELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPFQGKVRAHRCHPKKYQKVAWI 123
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + ++E +A+YL GP+ V IN +Q Y GV C + +DH VL+
Sbjct: 124 QDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQL--VDHSVLL 181
Query: 180 VGYGVHKTK-------FTHKIQ-------PYWIIKNSWGPHWGEK 210
VG+G K++ + + Q PYWI+KNSWG WGEK
Sbjct: 182 VGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEK 226
>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
Length = 443
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 85/153 (55%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RAC--HLNKEEIR 116
+LV CD D GC GGLM NAF +IS GG + E YPY N AC +L+ + +
Sbjct: 165 ELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVG 224
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I ++ +++ E +MA ++ GP+++ ++A+ Q Y GG+ + +DHG
Sbjct: 225 ATISNFQDITGTEEDMAAFVFNYGPLSIGVDASTWQSYAGGIITYCPDV------QIDHG 278
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VLIVGY T T PYWIIKNSW +WGE
Sbjct: 279 VLIVGY--DDTAPT----PYWIIKNSWTANWGE 305
>gi|431910254|gb|ELK13327.1| Cathepsin W [Pteropus alecto]
Length = 210
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/162 (38%), Positives = 86/162 (53%), Gaps = 18/162 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+LVDC + GC GG + +AF T+++ GL EKDYPY+G R C K + I
Sbjct: 16 ELVDCTRCGNGCEGGFIWDAFITVLNN--SGLASEKDYPYQGKVRTHKCQAKKHKNVAWI 73
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + E ++A+YL GP+ V IN +Q Y GV C + +DH VL+
Sbjct: 74 QDFIMLPDCEMKIARYLATEGPITVTINMKLLQQYQTGVIKATSNTCDPHL--VDHSVLL 131
Query: 180 VGYGVHK-----------TKFTHKIQPYWIIKNSWGPHWGEK 210
VG+G K +K H I PYWI+KNSWG WGEK
Sbjct: 132 VGFGKSKSVEGRRAEAVSSKSRHSI-PYWILKNSWGASWGEK 172
>gi|311697915|gb|ADQ00252.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697925|gb|ADQ00257.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ GG + E YPY GS R AC + E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDDAFKWLVDSNGGKVYTEDSYPYVSGSGRTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCLSDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|154415085|ref|XP_001580568.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121914787|gb|EAY19582.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 81/149 (54%), Gaps = 9/149 (6%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC GCGGGLMS A++ I GG EKDYPY + C NK + KI SY
Sbjct: 140 LVDCVTSCDGCGGGLMSAAYDYAIQYQGGKFMLEKDYPYTALDGTCKFNKAKATSKIVSY 199
Query: 123 VN-VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+N V DE ++A + GP +VAI+A+ + F F + C +LDHGV VG
Sbjct: 200 INVVEGDEKDLAAKVSAYGPSSVAIDASQISFQFYSQGIYDEPYCSS--YSLDHGVGCVG 257
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YG TK YWI++NSWG WG++
Sbjct: 258 YGTEGTK------NYWIVRNSWGLGWGDQ 280
>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
Length = 367
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 60/157 (38%), Positives = 87/157 (55%), Gaps = 13/157 (8%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+++DCD+ GC GG + +AF T+++ GL E+DYPYKG+ + C + I
Sbjct: 180 QVLDCDRCGNGCNGGFVWDAFLTVLNT--SGLASEQDYPYKGTVKTHRCLAKQHRKVAWI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + E +A+YL GP+ V INA +Q Y GV C + N H VL+
Sbjct: 238 QDFLMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVN--HSVLL 295
Query: 180 VGYGVHKT------KFTHKIQPYWIIKNSWGPHWGEK 210
VG+G K+ + H I PYWI+KNSWGP WGE+
Sbjct: 296 VGFGKSKSVEGRRPRPGHSI-PYWILKNSWGPDWGEE 331
>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
Length = 326
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 89/169 (52%), Gaps = 15/169 (8%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY
Sbjct: 146 GNVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTY-TAIQKMGG-LELASDYPY 203
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
G C+++K + I + E A+ L GP++ A+NA+ +Q Y GG+ P
Sbjct: 204 TGVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP 263
Query: 162 LKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LC ++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 264 R--LCDPA--GVNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 302
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 84/158 (53%), Gaps = 19/158 (12%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG----SNRACHLNKEEI 115
+LVDC K + GC GGLM++AFE + + G++ E YPY N C N I
Sbjct: 199 QLVDCSKSYGNNGCSGGLMNSAFEYV--RDNEGIDSEISYPYVSGDGTENNRCLFNASNI 256
Query: 116 RVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDN 172
++ YVN+ DE + + GP++VAINA F Y G+ C+G +D
Sbjct: 257 LAQVTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFSMYKSGIYSDTD--CEGTLDA 314
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG + YW+IKNSWG WGEK
Sbjct: 315 LDHGVLVVGYGEENGR------SYWLIKNSWGEEWGEK 346
>gi|311698045|gb|ADQ00317.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ GG + E YPY GS R AC + E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDNAFQRLVDSNGGKVYTEDSYPYVSGSGRTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
Length = 443
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 85/153 (55%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RAC--HLNKEEIR 116
+LV CD D GC GGLM NAF +IS GG + E YPY N AC +L+ + +
Sbjct: 165 ELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVG 224
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I ++ +++ E +MA ++ GP+++ ++A+ Q Y GG+ + +DHG
Sbjct: 225 ATISNFQDITGTEEDMAAFVFNYGPLSIGVDASTWQSYAGGIITYCPDV------QIDHG 278
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VLIVGY T T PYWIIKNSW +WGE
Sbjct: 279 VLIVGY--DDTAPT----PYWIIKNSWTANWGE 305
>gi|394331826|gb|AFN27132.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 86/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-V 117
+LV CD D GC GGLM AFE ++ + G + E YPY S+ C + + +
Sbjct: 177 QLVSCDDKDNGCSGGLMLQAFEWLLRNMNGTMFTEDSYPYVSSSGYVPECSNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I+ Y+ + S ET +L KNGP+++A++A++ Y GV L D L+HGV
Sbjct: 237 RIEGYMTIESSETVKGAWLAKNGPISIAVDASSFMSYQSGV------LTSCAGDALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGEK
Sbjct: 291 LLVGY-----NRTGEV-PYWVIKNSWGEDWGEK 317
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 67/152 (44%), Positives = 83/152 (54%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKIQ 120
+LVDCD VD GC GGLM + FE II GG+ E +YPY N C NKE +I+
Sbjct: 177 ELVDCDSVDHGCDGGLMEHGFEFIIKN--GGISSEANYPYTAVNGTCDTNKEASPGAQIK 234
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y V + E + V N P++V+I+A +A QFY GV F + G LDHGV
Sbjct: 235 GYETVPVNCEEELQKAVANQPVSVSIDAGGSAFQFYSSGV-----FTGQCGT-QLDHGVT 288
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG T YWI+KNSWG WGE+
Sbjct: 289 AVGYGS-----TDDGIQYWIVKNSWGTQWGEE 315
>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
Length = 344
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 86/149 (57%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GC GGL+ A+E I++ GGLE E+DYPY+ C L ++ V + +
Sbjct: 184 QLVDCDTIDMGCAGGLLHTAYEEIMAM--GGLEYEEDYPYRSVQGPCRLQSDKFEVSVDN 241
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V E ++ L + GP+AVA++A + Y+GG+ CK L+H VL+V
Sbjct: 242 CYRYVLYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITS----CKNY--GLNHAVLLV 295
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYG+ P+W++KNSWG +GE
Sbjct: 296 GYGIENGV------PFWVLKNSWGSDYGE 318
>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
Length = 326
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 91/166 (54%), Gaps = 23/166 (13%)
Query: 52 IRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY G C
Sbjct: 153 FRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTY-TAIQKMGG-LELASDYPYTGVGGIC 210
Query: 109 HLNKEEIRVKIQSYVNVSS----DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKF 164
H++K K +YVN S+ E A+ L GP++ A+NA+ +Q Y GG+ P K+
Sbjct: 211 HMDKS----KFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP-KW 265
Query: 165 LCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G ++H VL VGYGV K PYWI+KNSWG +GEK
Sbjct: 266 CDPAG---VNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEK 302
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 87/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ E+DYPYKG++ C +N++ + V I
Sbjct: 186 ELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEEDYPYKGTDGRCDVNRKNAKVVTI 243
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V ++ + + V N P++VAI A A Q Y G+ F G LDHGV
Sbjct: 244 DSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYNSGI-----FTGTCGT-ALDHGV 297
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG K YWI+KNSWG WGE
Sbjct: 298 TAVGYGTENGK------DYWIVKNSWGSSWGES 324
>gi|283046734|ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum]
gi|270001247|gb|EEZ97694.1| cathepsin L precursor [Tribolium castaneum]
Length = 328
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 66/168 (39%), Positives = 92/168 (54%), Gaps = 19/168 (11%)
Query: 49 KIQIRGEG-THLALK-LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGS 104
++ I G G T L+ + LVDC +AGC GG M +AF+ I G+ E YPY S
Sbjct: 150 QLAISGRGLTSLSEQNLVDCSSAYGNAGCNGGWMDSAFDYIHDN---GIMSESAYPYTAS 206
Query: 105 NRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINA-NAMQFYFGGVSHPL 162
+C N E +Q Y ++ S DE + + NGP+AVA++A + +QFY GGV +
Sbjct: 207 EGSCRFNPSESVTSLQGYYDLPSGDENALKSAVANNGPIAVALDATDELQFYSGGVLYDT 266
Query: 163 KFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ L+HGVL+VGYG + Q YWI+KNSWG WGE+
Sbjct: 267 TCSAQA----LNHGVLVVGYG------SEGGQDYWIVKNSWGSGWGEQ 304
>gi|426369199|ref|XP_004051582.1| PREDICTED: cathepsin W [Gorilla gorilla gorilla]
Length = 376
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 90/165 (54%), Gaps = 20/165 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC + GC GG + +AF T+++ GL EKDYP++G RA CH K + I
Sbjct: 180 ELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPFQGKVRAHSCHPKKYQKVAWI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + ++E +A+YL GP+ V IN ++ Y GV C + +DH VL+
Sbjct: 238 QDFIMLQNNEHRIAQYLATYGPITVTINMKPLRLYRKGVIKATPITCDPQL--VDHSVLL 295
Query: 180 VGYGVHKTK-------FTHKIQ-------PYWIIKNSWGPHWGEK 210
VG+G K++ + + Q PYWI+KNSWG WGEK
Sbjct: 296 VGFGSIKSEEGILAETVSSQSQPQPPHPTPYWILKNSWGAQWGEK 340
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 87/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ E+DYPYKG++ C +N++ + V I
Sbjct: 186 ELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEEDYPYKGTDGRCDVNRKNAKVVTI 243
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V ++ + + V N P++VAI A A Q Y G+ F G LDHGV
Sbjct: 244 DSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYNSGI-----FTGTCGT-ALDHGV 297
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG K YWI+KNSWG WGE
Sbjct: 298 TAVGYGTENGK------DYWIVKNSWGSSWGES 324
>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
Length = 340
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL+ A+E I+ GG+E E DYPYK + C L + +++
Sbjct: 180 QLVDCDFVDMGCDGGLIHTAYEQIMRM--GGVEQEFDYPYKAERQPCALKPHKFAAGVRN 237
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V +E + L GP+A+A++A + Y+GG+ CK + L+H VL+V
Sbjct: 238 CYRYVLMNEERLEDLLRYVGPIAIAVDAVDLTDYYGGIVS----FCKN--NGLNHAVLLV 291
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYWIIKNSWG +GE
Sbjct: 292 GYGVENNV------PYWIIKNSWGSDYGE 314
>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
Length = 339
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL+ A+E I+ GG+E E DYPYK + C L + +++
Sbjct: 179 QLVDCDFVDMGCDGGLIHTAYEQIMRM--GGVEQEFDYPYKAERQPCALKPHKFAAGVRN 236
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V +E + L GP+A+A++A + Y+GG+ CK + L+H VL+V
Sbjct: 237 CYRYVLMNEERLEDLLRYVGPIAIAVDAVDLTDYYGGIVS----FCKN--NGLNHAVLLV 290
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYWIIKNSWG +GE
Sbjct: 291 GYGVENNV------PYWIIKNSWGSDYGE 313
>gi|311698033|gb|ADQ00311.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ G + E YPY G AC ++ E+ +
Sbjct: 30 LVSCDTVDEGCNGGLMDNAFQWLVDSNKGKVYTESSYPYVSGSGQTPACSTSEHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|15824693|gb|AAL09444.1| cysteine protease [Leishmania donovani]
Length = 394
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 86/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD D GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 177 QLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+ ++A++ Y GV C G D L+HGV
Sbjct: 237 RIDGYVMIPSNETVMAAWLAENGPIAIGVDASSFMSYQSGVLTS----CAG--DALNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T PY +IKNSWG WGEK
Sbjct: 291 LLVGYN------TTGGVPYCVIKNSWGEDWGEK 317
>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 88/150 (58%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI-Q 120
+L+DCD VDAGC GGL+ AFE +++ GG++ E DYPY+ +N C N + VK+ +
Sbjct: 164 QLIDCDFVDAGCDGGLLHTAFEAVMNM--GGIQAESDYPYEANNGDCRANAAKFVVKVKK 221
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y ++ E ++ L GP+ VAI+A+ + Y G+ +K+ G L+H VL+V
Sbjct: 222 CYRYITVFEEKLKDLLRSVGPIPVAIDASDIVNYKRGI---MKYCANHG---LNHAVLLV 275
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GY V P+WI+KN+WG WGE+
Sbjct: 276 GYAVENGV------PFWILKNTWGADWGEQ 299
Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 27/41 (65%)
Query: 10 HDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
+D L+ F FL K NKSY+++ E +R +IFR NL++I
Sbjct: 19 YDVLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI 59
>gi|30141461|emb|CAD54747.1| cysteine proteinase a [Leishmania guyanensis]
Length = 222
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 81/150 (54%), Gaps = 16/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM A+ II G + E +PY GS +C L+ ++ +I
Sbjct: 56 LVSCDTVDMGCNGGLMDQAWAWIIKNHSGAVYTEVSHPYTSGDGSTASC-LSTGKVGARI 114
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
V++ DE + +L KNGP+A+A++A Q YFGGV L L+HGVL+
Sbjct: 115 SGQVSLPQDEDAIEAWLEKNGPIAIAVDATTWQLYFGGV--VLNCFAY----QLNHGVLL 168
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWI+KNSWG WGE
Sbjct: 169 VGYN------NSAKPPYWIVKNSWGTSWGE 192
>gi|394333024|gb|AFN27086.1| cysteine protease, partial [Leishmania infantum]
Length = 237
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 89/155 (57%), Gaps = 20/155 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN--RACHLNKEEIR--V 117
+LV CD GC GGLM AFE ++ + G + EK YPY N A LN ++
Sbjct: 30 QLVSCDDKHNGCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGA 89
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I YV + S+ET MA +L +NGP+A+A++A++ Y GV C G D L+HGV
Sbjct: 90 QIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLTS----CAG--DALNHGV 143
Query: 178 LIVGYGVHKTKFTHKIQ--PYWIIKNSWGPHWGEK 210
L+VGY +KI PYW+IKNSWG WGEK
Sbjct: 144 LLVGY--------NKIGGVPYWVIKNSWGEDWGEK 170
>gi|1093503|prf||2104214A Cys protease
Length = 255
Score = 106 bits (265), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 71 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFPYI--KDNGGIDTEKSYPY 128
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH N+ ++ + + ++ DE +M + + GP++VAI+A+ + QFY GV
Sbjct: 129 EAIDDSCHFNRAQVGATDRGFTDIPQGDEKKMPEAVATVGPVSVAIDASHESFQFYSEGV 188
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + + NLDHGVL+VG+G ++ + YW++KNSWG WG+K
Sbjct: 189 YNEPQCDAQ----NLDHGVLVVGFGTDESG-----EDYWLVKNSWGTTWGDK 231
>gi|123438675|ref|XP_001310117.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121891873|gb|EAX97187.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 106 bits (265), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 83/151 (54%), Gaps = 13/151 (8%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC GC GGLM A++ +I+ G E DYPY + +C N + +I+SY
Sbjct: 140 LVDCVTTCYGCNGGLMDAAYDYVINHQSGKFMTEADYPYTARDGSCKFNAAKGTSQIKSY 199
Query: 123 VNVSS-DETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
VNV+ DE ++A + GP A+AI+A+A Q Y G+ + C NLDHGV
Sbjct: 200 VNVAEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYD--ESACSS--YNLDHGVGC 255
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG +K YWI++NSWG WGEK
Sbjct: 256 VGYGTEGSK------NYWIVRNSWGTSWGEK 280
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 82/153 (53%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+++DC + GC GGLM+N+FE II+ GGL+ E YPY+G C NK I I
Sbjct: 164 QILDCSGSEGNNGCDGGLMTNSFEYIIAV--GGLDTEASYPYEGVVGKCKFNKANIGATI 221
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y NV S + V P++VAI+A N+ Q Y GV + C LDHGV
Sbjct: 222 TGYKNVKSGSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPA--CSS--TQLDHGV 277
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L VGYG + Q YWI+KNSWG WGEK
Sbjct: 278 LAVGYG------SQSGQDYWIVKNSWGADWGEK 304
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 69/153 (45%), Positives = 86/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVKI 119
+L+DCD ++GC GGLM AF+ IIS GGL E DYPY C KE++ RV I
Sbjct: 188 ELIDCDTTFNSGCNGGLMDYAFQYIIST--GGLHKEDDYPYLMEEGICQEQKEDVERVTI 245
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ E + + P++VAI A+ QFY GGV F K G D LDHGV
Sbjct: 246 SGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGV-----FNGKCGTD-LDHGV 299
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG + K Y I+KNSWGP WGEK
Sbjct: 300 AAVGYG------SSKGSDYVIVKNSWGPRWGEK 326
>gi|395514296|ref|XP_003761355.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 262
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 89/153 (58%), Gaps = 15/153 (9%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC ++GC GGLM NAFE + K GG++ E+ YPY G + CH N + +
Sbjct: 96 LVDCSTAQGNSGCQGGLMDNAFEYV--KKNGGIDTEESYPYVGKDGTCHYNSQCSGANVT 153
Query: 121 SYVNVSSD-ETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
YV++ + E +AK + GP++VAI+A ++ QFY GV + + + LDHGV
Sbjct: 154 GYVDIPAGVERALAKAVATVGPISVAIDAGHSSFQFYRSGVYYEPEC----SSEELDHGV 209
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VG+GV + YWI+KNSWG WG++
Sbjct: 210 LVVGFGVEGKNG----KKYWIVKNSWGEEWGDR 238
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 84/149 (56%), Gaps = 14/149 (9%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
+LVDC ++ GC GG + + F I + GLE E DYPY G + C ++ K+
Sbjct: 165 QLVDCTTDLNYGCDGGYLDDTFPYIQTN---GLELESDYPYTGYDGYCSYESSKVVTKVS 221
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
SYV+V ++E + + + GP+A+AINA+ +QFYF G+ K+ + LDHGVL V
Sbjct: 222 SYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSGIIDD-KYC---DPEYLDHGVLAV 277
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GY + YW+IKNSWG WGE
Sbjct: 278 GYDSENGR------DYWLIKNSWGADWGE 300
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 73/154 (47%), Positives = 90/154 (58%), Gaps = 20/154 (12%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDK D GC GGLM AF +I GGL+ E DYPYKG C +K +V I
Sbjct: 203 ELVDCDKGEDEGCNGGLMDYAFGFVIKN--GGLDTEADYPYKGYGTRCDRSKMNAKVVTI 260
Query: 120 QSYVNVS-SDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
Y +V +DET + K V + P++VAI+A ++MQFY G+ F + G D LDHG
Sbjct: 261 DGYEDVPVNDETALLK-AVAHQPVSVAIDAGGSSMQFYRSGI-----FTGRCGTD-LDHG 313
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYG K YWIIKNSWG +WGEK
Sbjct: 314 VTNVGYGKEDGK------AYWIIKNSWGSNWGEK 341
>gi|311697933|gb|ADQ00261.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697939|gb|ADQ00264.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697945|gb|ADQ00267.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697949|gb|ADQ00269.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697951|gb|ADQ00270.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697957|gb|ADQ00273.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697959|gb|ADQ00274.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697963|gb|ADQ00276.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697965|gb|ADQ00277.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697967|gb|ADQ00278.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697969|gb|ADQ00279.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698035|gb|ADQ00312.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698037|gb|ADQ00313.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364397|gb|AEU08929.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364399|gb|AEU08930.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364401|gb|AEU08931.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364403|gb|AEU08932.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364405|gb|AEU08933.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ GG + E YPY GS R AC + E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDNAFQWLVDSNGGKVYTEDSYPYVSGSGRTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|311698055|gb|ADQ00322.1| cathepsin L-like protein [Trypanosoma sp. D30]
Length = 159
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ GG + E YPY G AC ++ E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDNAFKWLVDSNGGKVYTEDSYPYVSGSGQTPACSTSEHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLV 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|311697955|gb|ADQ00272.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 76/142 (53%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ GG + E YPY G AC + E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDNAFQWLVDSNGGKVYTEDSYPYVSGSGQTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|2352469|gb|AAC00067.1| cysteine protease [Trypanosoma cruzi]
Length = 471
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 58/144 (40%), Positives = 79/144 (54%), Gaps = 15/144 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 174 LVSCDKTDFGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 233
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A + NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 234 TGHVELPQDEAQIAACVAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 287
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSW 203
VGY PYWIIKNSW
Sbjct: 288 VGYN------DSAAVPYWIIKNSW 305
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 81/152 (53%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVKIQ 120
+LVDCD VD GC GG M FE II GG+ E +YPY + C NKE +I+
Sbjct: 172 ELVDCDSVDHGCDGGYMEGGFEFIIKN--GGISSEANYPYTAVDGTCDANKEASPAAQIK 229
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y V ++ + + V N P++V I+A +A QFY GV F + G LDHGV
Sbjct: 230 GYETVPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGV-----FTGQCGT-QLDHGVT 283
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG T YWI+KNSWG WGE+
Sbjct: 284 AVGYGS-----TDDGTQYWIVKNSWGTQWGEE 310
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 81/152 (53%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE-EIRVKIQ 120
+LVDCD VD GC GG M FE II GG+ E +YPY + C NKE +I+
Sbjct: 172 ELVDCDSVDHGCDGGYMEGGFEFIIKN--GGISSEANYPYTAVDGTCDANKEASPAAQIK 229
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y V ++ + + V N P++V I+A +A QFY GV F + G LDHGV
Sbjct: 230 GYETVPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGV-----FTGQCGT-QLDHGVT 283
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG T YWI+KNSWG WGE+
Sbjct: 284 AVGYGS-----TDDGTQYWIVKNSWGTQWGEE 310
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 86/149 (57%), Gaps = 14/149 (9%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
+LVDC ++ GC GG + + F I + GLE E DYPY G + +C + ++ K+
Sbjct: 165 QLVDCTTDLNYGCDGGYLDDTFPYIQTN---GLELESDYPYTGYDGSCSYDSSKVVTKVS 221
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
SYV+V ++E + + + GP+A+AINA+ +QFYF G+ K+ + LDHGVL V
Sbjct: 222 SYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSGIIDD-KYC---DPEWLDHGVLAV 277
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GY + YW+IKNSWG WGE
Sbjct: 278 GYN------SENGLDYWLIKNSWGADWGE 300
>gi|6649595|gb|AAF21471.1|U85984_1 cysteine proteinase [Clonorchis sinensis]
Length = 217
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 79/145 (54%), Gaps = 10/145 (6%)
Query: 65 DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVN 124
DCD VD GC GG AF+ I+ GGL+ + DYPY+G C + +++V I
Sbjct: 58 DCDGVDEGCNGGTPQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKI 115
Query: 125 VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGV 184
+ DE A+ L + GP++ A+NA +QFY G+ HPL LC +L+H VL VGYG
Sbjct: 116 LPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYGK 173
Query: 185 HKTKFTHKIQPYWIIKNSWGPHWGE 209
PYW +KNSW +GE
Sbjct: 174 EGRL------PYWTVKNSWSTMFGE 192
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 73/154 (47%), Positives = 90/154 (58%), Gaps = 20/154 (12%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDK D GC GGLM AF +I GGL+ E DYPYKG C +K +V I
Sbjct: 203 ELVDCDKGEDEGCNGGLMDYAFGFVIKN--GGLDTEADYPYKGYGTRCDRSKMNAKVVTI 260
Query: 120 QSYVNVS-SDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
Y +V +DET + K V + P++VAI+A ++MQFY G+ F + G D LDHG
Sbjct: 261 DGYEDVPVNDETALLK-AVAHQPVSVAIDAGGSSMQFYRSGI-----FTGRCGTD-LDHG 313
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYG K YWIIKNSWG +WGEK
Sbjct: 314 VTNVGYGKEDGK------AYWIIKNSWGSNWGEK 341
>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
Length = 388
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 88/153 (57%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + GC GGLM AFE I K G++ E YPYKG CH NK+ + + +
Sbjct: 223 LVDCSRKYGNNGCNGGLMDYAFEYI--KDNHGVDTEASYPYKGKEMKCHFNKKTVGAEDE 280
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
YV++ DE ++ + GP++VAI+A + Q Y GV + + C ++LDHGV
Sbjct: 281 GYVDLPEGDEEKLKIAVATQGPISVAIDAGHPSFQMYRKGVYYEPQ--CSS--ESLDHGV 336
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYG + YWI+KNSWGP WGEK
Sbjct: 337 LVVGYGTDEID-----GDYWIVKNSWGPGWGEK 364
>gi|358364409|gb|AEU08935.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ G + E YPY G AC +K E+ K+
Sbjct: 30 LVSCDTVDHGCNGGLMDNAFKWLVDSNDGKVYTEDSYPYVSGSGQTPACSTSKHEVGAKV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 84/151 (55%), Gaps = 17/151 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KIQ 120
+L+DCD VDAGC GGLM AF+ ++ GG+ E YPY GS +C+ NK + +V +I
Sbjct: 174 QLMDCDTVDAGCDGGLMETAFKFVVKN--GGVTTEAAYPYTGSVGSCNANKAKNKVAEIT 231
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGVL 178
+ V+ D + V P+ V+I + F Y G+ L C D+LDHGVL
Sbjct: 232 GFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGI---LSGKCD---DSLDHGVL 285
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
++GYG T PYWIIKNSWG WGE
Sbjct: 286 LIGYG------TEGGMPYWIIKNSWGTSWGE 310
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 85/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AF+ II+ GG++ E DYPYKG + C +N++ + V I
Sbjct: 180 ELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTEDDYPYKGKDERCDVNRKNAKVVTI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V+ + + V N P++VAI A A Q Y G+ F K G LDHGV
Sbjct: 238 DSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGI-----FTGKCGT-ALDHGV 291
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI++NSWG WGE
Sbjct: 292 AAVGYGTENGK------DYWIVRNSWGKSWGE 317
>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 90/154 (58%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC + + GC GGLM AF+ + + G++ E+ YPY+G +C + + +
Sbjct: 161 AQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGDYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ K C ++L+HG
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE-KCRCSNKREDLNHG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 88/153 (57%), Gaps = 18/153 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD+ + GC GGLM +AF+ II+ GG++ + DYPY G + C ++ +V I
Sbjct: 183 ELVDCDRSYNEGCNGGLMDDAFQFIINN--GGIDSDADYPYTGRDGQCDQYRKNAKVVTI 240
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V + + + N P++VAI A+ QFY G+ F K G D LDHGV
Sbjct: 241 DSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYDSGI-----FTGKCGTD-LDHGV 294
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
++VGYG K YWI++NSWG WGEK
Sbjct: 295 VVVGYGTENGK------DYWIVRNSWGADWGEK 321
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/153 (45%), Positives = 85/153 (55%), Gaps = 20/153 (13%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKIQ 120
+LVDCD GC GG MS AFE +++ GL E YPYKG N AC K E V I
Sbjct: 262 ELVDCDAEAVGCAGGFMSWAFEFVMAN--HGLTTEASYPYKGINGACQTAKLNESSVSIT 319
Query: 121 SYVNVS-SDETEMAKYLVKNGPMAVAINANAM--QFYFGGV-SHPLKFLCKGGMDNLDHG 176
YVNV+ + E E+ K P++VA++A Q Y GGV S P C ++HG
Sbjct: 320 GYVNVTVNSEAELLKVAAVQ-PVSVAVDAGGFLFQLYAGGVFSGP----CTA---QINHG 371
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V +VGYG T K + YWI+KNSWGP WGE
Sbjct: 372 VTVVGYGE-----TDKAEKYWIVKNSWGPEWGE 399
>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 90/154 (58%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC + + GC GGLM AF+ + + G++ E+ YPY+G +C + + +
Sbjct: 161 AQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGDYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ K C ++L+HG
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE-KCRCSNKREDLNHG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/163 (41%), Positives = 94/163 (57%), Gaps = 19/163 (11%)
Query: 52 IRGEGTHLA-LKLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ GE L+ +LVDCD+ D GC GGLM AF+ II GG++ E+DYPY+G + C
Sbjct: 165 VTGEFVSLSEQELVDCDREYDEGCNGGLMDYAFQFIIQN--GGIDTEEDYPYQGIDGTCD 222
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLC 166
K++ +V +I Y +V S+ K V + P++VAI A+ A+Q Y GV F
Sbjct: 223 QTKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGV-----FTG 277
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
K G LDHGV++VGYG T YW+++NSWG WGE
Sbjct: 278 KCGT-ALDHGVVVVGYG------TENGVDYWLVRNSWGTGWGE 313
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 84/151 (55%), Gaps = 17/151 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKIQ 120
+++DCD D GC GG M NAF+ +I+ GG++ E DYPY G++ AC N+ R V I
Sbjct: 193 EIIDCDTQDGGCNGGEMQNAFQFVINN--GGIDTEADYPYLGTDAACDANRVNERVVTID 250
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGVL 178
+V+V+++ + V N P++VAI+A+ +F Y G+ C LDHGV
Sbjct: 251 GFVSVATENETALQEAVANQPVSVAIDASGRKFQHYTSGI---FNGPCG---TQLDHGVT 304
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI+KNSW WGE
Sbjct: 305 AVGYGSENGK------DYWIVKNSWSSSWGE 329
>gi|311698003|gb|ADQ00296.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698009|gb|ADQ00299.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698013|gb|ADQ00301.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698015|gb|ADQ00302.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698019|gb|ADQ00304.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698021|gb|ADQ00305.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364395|gb|AEU08928.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ G + E YPY G AC ++ E+ +
Sbjct: 30 LVSCDTVDEGCNGGLMDNAFKWLVDSNKGKVYTESSYPYVSGSGQTPACSTSEHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/170 (39%), Positives = 88/170 (51%), Gaps = 36/170 (21%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDK + GC GGLM AFE II GG++ E DYPY+ S+ C N++ V I
Sbjct: 193 ELVDCDKAYNQGCNGGLMDYAFEFIIKN--GGIDSEADYPYRASDNMCDSNRKNAHVVTI 250
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ E K V N P++VAI A +F Y GV F + G NLDHGV
Sbjct: 251 DGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGV-----FTGRCGT-NLDHGV 304
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGE 227
+ VGYG E + +WI++NSWGP+WGE
Sbjct: 305 VAVGYGT------------------------ENGIDYWIVRNSWGPKWGE 330
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 87/152 (57%), Gaps = 14/152 (9%)
Query: 63 LVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KIQ 120
LVDCD+ D GC GG M +AF+ I++ GG++ E DYPY+ + C N+ V I
Sbjct: 188 LVDCDREYDTGCRGGFMDSAFDFIVNN--GGIDTEDDYPYRAEDGICQDNRTRRHVVTID 245
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V ++ V + P++VAI A+ A Q Y GGV F + G LDH VL
Sbjct: 246 GYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGV-----FDAECGT-ALDHAVL 299
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+VGYG + TH + PYW++KNSWG WGEK
Sbjct: 300 VVGYGT-ASNGTHNL-PYWLVKNSWGAEWGEK 329
>gi|209962668|gb|ACJ02129.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962670|gb|ACJ02130.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962680|gb|ACJ02135.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962682|gb|ACJ02136.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ K G + EK YPY G AC + E+ I
Sbjct: 30 LVSCDSKDNGCGGGLMDNAFEWIVKKNSGKVYTEKSYPYVSGGGEEPACKPHGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|311697935|gb|ADQ00262.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698039|gb|ADQ00314.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698041|gb|ADQ00315.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698043|gb|ADQ00316.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ GG + E YPY GS R AC + E+ +
Sbjct: 30 LVSCDTVDHGCNGGLMDNAFQWLVDSNGGKVYTEDSYPYVSGSGRTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|297688135|ref|XP_002821545.1| PREDICTED: cathepsin W [Pongo abelii]
Length = 376
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/169 (35%), Positives = 91/169 (53%), Gaps = 10/169 (5%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC + GC GG + +AF T+++ GL EKDYP++G RA CH K + I
Sbjct: 180 ELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPFQGKVRAHRCHPKKYQKVAWI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + ++E +A+YL GP+ V IN +Q Y GV C + +DH VL+
Sbjct: 238 QDFIMLQNNEHRIAQYLATYGPITVTINMKLLQLYRKGVIKATPTTCDPQL--VDHSVLL 295
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQ 228
VG+G K++ + PH P+WI+KNSWG +WGE+
Sbjct: 296 VGFGNVKSEEGIWAETVLSQSQPQPPH----PTPYWILKNSWGAQWGEK 340
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 85/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AF+ II+ GG++ E DYPYKG + C +N++ + V I
Sbjct: 180 ELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTEDDYPYKGKDERCDVNRKNAKVVTI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V+ + + V N P++VAI A A Q Y G+ F K G LDHGV
Sbjct: 238 DSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGI-----FTGKCGT-ALDHGV 291
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI++NSWG WGE
Sbjct: 292 AAVGYGTENGK------DYWIVRNSWGKSWGE 317
>gi|55735421|gb|AAV59468.1| cathepsin [Bombyx mori NPV]
Length = 323
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y ++ E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFDSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 275 GYGVENNI------PYWTFKNTWGTDWGE 297
>gi|6649534|gb|AAF21449.1|U38176_1 cysteine proteinase, partial [Acanthamoeba culbertsoni]
Length = 122
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 55/123 (44%), Positives = 75/123 (60%), Gaps = 9/123 (7%)
Query: 81 AFETIISKLGGGLEGEKDYPYKG-SNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKN 139
A++ I+ GGL EKDYPY+ ++CHL + I I + SDE ++ +LV+N
Sbjct: 7 AYDEIVKM--GGLMSEKDYPYEAMKEQSCHLRRPNISAYINGSATLPSDEAKLXAWLVQN 64
Query: 140 GPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWII 199
GP++V +NAN +QFY GG+SHP LC LDH VL+VGYGV T +PYWI+
Sbjct: 65 GPISVGVNANFLQFYLGGISHPPHMLCSEA--GLDHAVLLVGYGVS----TFLRRPYWIV 118
Query: 200 KNS 202
K S
Sbjct: 119 KFS 121
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 85/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AF+ II+ GG++ E DYPYKG + C +N++ + V I
Sbjct: 181 ELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTEDDYPYKGKDERCDVNRKNAKVVTI 238
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V+ + + V N P++VAI A A Q Y G+ F K G LDHGV
Sbjct: 239 DSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGI-----FTGKCGT-ALDHGV 292
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI++NSWG WGE
Sbjct: 293 AAVGYGTENGK------DYWIVRNSWGKSWGE 318
>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/162 (40%), Positives = 86/162 (53%), Gaps = 15/162 (9%)
Query: 52 IRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY G C
Sbjct: 153 FRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTY-TAIQKMGG-LELASDYPYTGVGGIC 210
Query: 109 HLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKG 168
+++K + I + E A+ L GP++ A+NA+ +Q Y GG+ P LC
Sbjct: 211 YMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPR--LCDP 268
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 269 A--GVNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 302
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 85/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AF+ II+ GG++ E DYPYKG + C +N++ + V I
Sbjct: 180 ELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTEDDYPYKGKDERCDVNRKNAKVVTI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V+ + + V N P++VAI A A Q Y G+ F K G LDHGV
Sbjct: 238 DSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGI-----FTGKCGT-ALDHGV 291
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI++NSWG WGE
Sbjct: 292 AAVGYGTENGK------DYWIVRNSWGKSWGE 317
>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/162 (40%), Positives = 86/162 (53%), Gaps = 15/162 (9%)
Query: 52 IRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY G C
Sbjct: 153 FRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTY-TAIQKMGG-LELASDYPYTGVGGIC 210
Query: 109 HLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKG 168
+++K + I + E A+ L GP++ A+NA+ +Q Y GG+ P LC
Sbjct: 211 YMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPR--LCDP 268
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 269 A--GVNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 302
>gi|31981819|ref|NP_034115.2| cathepsin W preproprotein [Mus musculus]
gi|341940311|sp|P56203.2|CATW_MOUSE RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
Precursor
gi|26353368|dbj|BAC40314.1| unnamed protein product [Mus musculus]
gi|44890089|gb|AAS48498.1| cathepsin W precursor [Mus musculus]
gi|148701190|gb|EDL33137.1| cathepsin W, isoform CRA_b [Mus musculus]
gi|162317774|gb|AAI56226.1| Cathepsin W [synthetic construct]
gi|162318342|gb|AAI56999.1| Cathepsin W [synthetic construct]
Length = 371
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC++ GC GG + +A+ T+++ GL EKDYP++G + C K + I
Sbjct: 178 ELLDCERCGNGCNGGFVWDAYLTVLN--NSGLASEKDYPFQGDRKPHRCLAKKYKKVAWI 235
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q + +S++E +A YL +GP+ V IN +Q Y GV C +DH VL+
Sbjct: 236 QDFTMLSNNEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKATPSSCDP--RQVDHSVLL 293
Query: 180 VGYGVHK------TKFTHKIQ-----PYWIIKNSWGPHWGEK 210
VG+G K T +H + PYWI+KNSWG HWGEK
Sbjct: 294 VGFGKEKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWGEK 335
>gi|118197532|ref|YP_874244.1| cathepsin [Ectropis obliqua NPV]
gi|113472527|gb|ABI35734.1| cathepsin [Ectropis obliqua NPV]
Length = 299
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK-IQ 120
+++DCD VD GC GGL+ AFE +I GG++ E +YPY+G N C LN + VK I
Sbjct: 138 QMIDCDYVDMGCDGGLLHTAFEQMIEM--GGVKHEHEYPYEGINMNCRLNDDNFAVKIIG 195
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + E ++ L GP+ +AI+A+ + Y+ GV + C+ L+H VL+V
Sbjct: 196 CYRYIVLQEEKLKDLLRAVGPIPIAIDASGIANYYQGVIN----YCEN--HGLNHAVLLV 249
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW IKN+WG WGE
Sbjct: 250 GYGVENNI------PYWTIKNTWGEDWGE 272
>gi|1705639|sp|Q10991.1|CATL1_SHEEP RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
Length = 217
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 94/174 (54%), Gaps = 21/174 (12%)
Query: 41 RIFRANLKKIQIRGEGTHLALKLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKD 98
++FR K + + + LVD + + GC GGLM NAF+ I K GGL+ E+
Sbjct: 37 QMFRKTGKLVSLSEQ------NLVDSSRPQGNQGCNGGLMDNAFQYI--KENGGLDSEES 88
Query: 99 YPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFG 156
YPY+ ++ +C+ E K +V++ E + K + GP++VAI+A ++ QFY
Sbjct: 89 YPYEATDTSCNYKPEYSAAKDTGFVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKS 148
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G+ + K +LDHGVL+VGYG T +WI+KNSWGP WG K
Sbjct: 149 GIYYDPDCSSK----DLDHGVLVVGYGFEGTN-----NKFWIVKNSWGPEWGNK 193
>gi|23577865|ref|NP_703114.1| viral cathepsin [Rachiplusia ou MNPV]
gi|37077115|sp|Q8B9D5.1|CATV_NPVR1 RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|23476510|gb|AAN28057.1| viral cathepsin [Rachiplusia ou MNPV]
Length = 323
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 85/150 (56%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNTNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y ++ E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFNSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV PYW KN+WG WGE+
Sbjct: 275 GYGVENNI------PYWTFKNTWGTDWGEE 298
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/163 (41%), Positives = 94/163 (57%), Gaps = 19/163 (11%)
Query: 52 IRGEGTHLA-LKLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ GE L+ +LVDCD+ D GC GGLM AF+ II GG++ E+DYPY+G + C
Sbjct: 165 VTGEFVSLSEQELVDCDREYDEGCNGGLMDYAFQFIIQN--GGIDTEEDYPYQGIDGTCD 222
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLC 166
K++ +V +I Y +V S+ K V + P++VAI A+ A+Q Y GV F
Sbjct: 223 ETKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGV-----FTG 277
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
K G LDHGV++VGYG T YW+++NSWG WGE
Sbjct: 278 KCGT-ALDHGVVVVGYG------TENGVDYWLVRNSWGTGWGE 313
>gi|407396649|gb|EKF27516.1| cysteine peptidase, partial [Trypanosoma cruzi marinkellei]
Length = 247
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/157 (38%), Positives = 84/157 (53%), Gaps = 16/157 (10%)
Query: 57 THLALK-LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR---ACHLNK 112
T L+++ LV CDK + GCGGGL SNAFE I+ + G + E+ YPYK R C
Sbjct: 67 TRLSVQMLVSCDKTNDGCGGGLTSNAFEWIVQENNGNVYTEESYPYKSCMRITPPCIKVG 126
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDN 172
++ +I+ +V + DE + +L GP+AVA++A + FY+ G+ L
Sbjct: 127 RKVGARIKGHVELPKDEDRITGWLANKGPVAVAVDATSWMFYWSGI------LTNCVSKK 180
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+H VL+VGY PYW IKNSW WGE
Sbjct: 181 LNHAVLLVGYN------DSAAVPYWTIKNSWSRLWGE 211
>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
Length = 323
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y ++ E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFNSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 275 GYGVENNI------PYWTFKNTWGTDWGE 297
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 87/154 (56%), Gaps = 17/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N C + EE+ V
Sbjct: 177 QLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSEELVVG 236
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 237 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVLTA----CIG--KQLNHG 290
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 291 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 318
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 89/149 (59%), Gaps = 15/149 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDCDK D GC GGLM+ AF+ I + G++ E+ YPYK N C K++I ++ +
Sbjct: 166 LVDCDKKDHGCQGGLMTTAFKYI--EENKGIDTEESYPYKAKNGRCEFKKDDIGATVERH 223
Query: 123 VNVSSDETE-MAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
V++ + + E + K + + GP++VA++A ++ Q Y G+ P +C LDHGVL+
Sbjct: 224 VSILTTDCEALKKAVAEIGPISVAMDASHSSFQLYKSGIYDPK--ICSS--RKLDHGVLV 279
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
VGYG + YW++KNSWG +WG
Sbjct: 280 VGYGKEDG------EEYWLVKNSWGKNWG 302
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 87/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCD--KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEE-IRVK 118
+LVDCD + GCGGGLM NAFE I++ GGL+ E DYPY G++ C+ NKE I
Sbjct: 171 ELVDCDVGMQNKGCGGGLMDNAFEFIVNN--GGLDTEADYPYTGADGTCNSNKESNIAAS 228
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I+ Y +V +++ + V P+++A++ + +FY GGV L C LDHG
Sbjct: 229 IKGYEDVPANDEASLQKAVAAQPVSIAVDGGDDLFRFYKGGV---LTGACG---TELDHG 282
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V VGYGV YW++KNSWG WGE
Sbjct: 283 VAAVGYGVAGDG-----TKYWLVKNSWGTSWGE 310
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 88/154 (57%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VK 118
+LVDCD VD GC GGLM +AF+ II G L E +YPY+G + C+ NK I V
Sbjct: 176 ELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG--LNTEANYPYQGVDGTCNANKGSINAVT 233
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 234 ITGYEDVPTNNEQALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHG 287
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV ++ YW++KNSWG WGE+
Sbjct: 288 VTAVGYGV-----SNDGTKYWLVKNSWGTEWGEE 316
>gi|311697943|gb|ADQ00266.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 76/142 (53%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ GG + E YPY G AC + E+ +
Sbjct: 30 LVSCDTVDHGCNGGLMDNAFQWLVDSNGGKVYTEDSYPYVSGSGQTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|30141463|emb|CAD54748.1| cysteine proteinase b [Leishmania guyanensis]
Length = 174
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 85/152 (55%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKE-EIRV 117
+LV CD VD GC GGLM AF+ ++ G + YPY GS C + E +
Sbjct: 35 ELVSCDDVDEGCNGGLMLQAFDWLLBNKNGAVYTGASYPYVSGNGSVPECSESSELVVGA 94
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
I +V + S+E MA +L NGP+A+A++A+A Y GG+ C G L+HGV
Sbjct: 95 YIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMSYTGGILTS----CDG--RQLNHGV 148
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGY T ++ PYW+IKNSWG +WGE
Sbjct: 149 LLVGY-----NMTGEV-PYWLIKNSWGENWGE 174
>gi|311697971|gb|ADQ00280.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697975|gb|ADQ00282.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697977|gb|ADQ00283.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697979|gb|ADQ00284.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697985|gb|ADQ00287.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697987|gb|ADQ00288.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697993|gb|ADQ00291.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698001|gb|ADQ00295.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364381|gb|AEU08921.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364383|gb|AEU08922.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364385|gb|AEU08923.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364387|gb|AEU08924.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364389|gb|AEU08925.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364391|gb|AEU08926.1| cathepsin L-like protein [Trypanosoma theileri]
gi|358364393|gb|AEU08927.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ G + E YPY G AC ++ E+ I
Sbjct: 30 LVSCDTVDEGCNGGLMDDAFQWLVDSNKGKVYTENSYPYVSGSGQTPACSTSEHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 86/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVKI 119
+L+DCD ++GC GGLM AF+ IIS GGL E DYPY C KE++ RV I
Sbjct: 188 ELIDCDTTFNSGCNGGLMDYAFQYIIST--GGLHKEDDYPYLMEEGICQEQKEDVERVTI 245
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ E + + P++VAI A+ QFY GGV F + G D LDHGV
Sbjct: 246 SGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGV-----FNGQCGTD-LDHGV 299
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG + K Y I+KNSWGP WGEK
Sbjct: 300 AAVGYG------SSKGSDYVIVKNSWGPRWGEK 326
>gi|375073904|gb|AFA34819.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073914|gb|AFA34824.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073920|gb|AFA34827.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073922|gb|AFA34828.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073924|gb|AFA34829.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073936|gb|AFA34835.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073938|gb|AFA34836.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073940|gb|AFA34837.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073942|gb|AFA34838.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073944|gb|AFA34839.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073946|gb|AFA34840.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073948|gb|AFA34841.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073950|gb|AFA34842.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073952|gb|AFA34843.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|389566208|gb|AFK83567.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|389566210|gb|AFK83568.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|389566212|gb|AFK83569.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|389566216|gb|AFK83571.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|400234865|gb|AFP74096.1| cathepsin L- like protein, partial [Trypanosoma cruzi]
gi|400234867|gb|AFP74097.1| cathepsin L- like protein, partial [Trypanosoma cruzi]
gi|400234871|gb|AFP74099.1| cathepsin L- like protein, partial [Trypanosoma cruzi]
gi|400234873|gb|AFP74100.1| cathepsin L- like protein, partial [Trypanosoma cruzi]
Length = 159
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 80/142 (56%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAVPYWIIKN 159
>gi|311697911|gb|ADQ00250.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697913|gb|ADQ00251.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697917|gb|ADQ00253.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697929|gb|ADQ00259.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697931|gb|ADQ00260.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ GG + E YPY GS R AC + E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDDAFKWLVDSNGGKVYTEDSYPYVSGSGRTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++A++ Y GV L D L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDASSFLSYVSGV------LTNCLSDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|262093296|gb|ACY25972.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 159
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 80/142 (56%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAVPYWIIKN 159
>gi|42564161|gb|AAS20592.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 88/152 (57%), Gaps = 13/152 (8%)
Query: 62 KLVDCDKV---DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK 118
+L+DC K D GGLMS AF+ ++ K G+E + YPYKG + C + ++ +K
Sbjct: 161 QLLDCSKPYGNDDCEHGGLMSFAFDYVLDK---GIEADSSYPYKGIDTPCQYDAKKTVLK 217
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
I+ Y NVS E E+ K + GP++VAI+A+ +Q Y GG+ L C NL+HGVL
Sbjct: 218 IKGYRNVSISEEELKKAVGTVGPVSVAIDADPIQLYSGGILDGL--FC---THNLNHGVL 272
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG F K +W +KNSWG WGE+
Sbjct: 273 AVGYGEEDHLFGKK--KFWKVKNSWGKDWGEQ 302
>gi|311697937|gb|ADQ00263.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697947|gb|ADQ00268.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697953|gb|ADQ00271.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697961|gb|ADQ00275.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ GG + E YPY GS R AC + E+ +
Sbjct: 30 LVSCDTVDHGCNGGLMDNAFQWLVDSNGGKVYTEDSYPYVSGSGRTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
Length = 294
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 91/154 (59%), Gaps = 17/154 (11%)
Query: 62 KLVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC D + GCGGGLM +AF+ I + GG++ E+ YPY+ + C + I K
Sbjct: 129 QLVDCSGDYGNMGCGGGLMDSAFKYI--QENGGIDTEESYPYEAEDGKCRFKPQNIGAKC 186
Query: 120 QSYVNVSS-DETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
YV+V++ DE + + + GP++VAI+A ++ Q Y GV L+ C ++LDHG
Sbjct: 187 TGYVDVTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDELE--CSS--EDLDHG 242
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL VGYG T Q YW++KNSWG WG+K
Sbjct: 243 VLAVGYG------TDNGQDYWLVKNSWGLGWGQK 270
>gi|358364415|gb|AEU08938.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 80/142 (56%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD +D GC GGLM NAF+ ++ GG + E YPY GS R AC ++ E+ K+
Sbjct: 30 LVSCDTLDQGCNGGLMDNAFKWLVDSNGGNVYTENSYPYVSGSGRTPACSTSEHEVGAKV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y+++ DE ++A +L NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYLDLPQDEDKVAAWLAANGPIAVAVDANSFLSYVSGV------LTNCESHQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|311697919|gb|ADQ00254.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311697921|gb|ADQ00255.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ GG + E YPY GS R AC + E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDDAFKWLVDSNGGKVYTEDSYPYVSGSGRTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++A++ Y GV L D L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDASSFLSYMSGV------LTNCLSDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 86/153 (56%), Gaps = 19/153 (12%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE-EIRVK 118
+LVDCD D GC GGLM +AFE I K GGL E +YPY+G++ C+ NK K
Sbjct: 175 ELVDCDTSGEDQGCEGGLMDDAFEFI--KQNGGLTTEANYPYQGTDGTCNTNKAGNDAAK 232
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + V + P++VAI+A+ A QFY GGV F G + LDHG
Sbjct: 233 ITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGGV-----FTGDCGTE-LDHG 286
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V VGYG T YW++KNSWG WGE
Sbjct: 287 VTAVGYG------TSDGTKYWLVKNSWGTSWGE 313
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 88/154 (57%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VK 118
+LVDCD VD GC GGLM +AF+ II G L E YPY+G + C+ NK ++ V
Sbjct: 176 ELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG--LSTEAQYPYEGVDGTCNANKASVQAVT 233
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 234 ITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHG 287
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV ++ YW++KNSWG WGE+
Sbjct: 288 VTAVGYGV-----SNDGTKYWLVKNSWGTDWGEE 316
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 88/151 (58%), Gaps = 16/151 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLNKEEIR-VKI 119
+LVDCD + GC GG M AFE +I+ GG++ E +YPY G ++ C+ KEEI+ V I
Sbjct: 197 ELVDCDTTNEGCDGGYMDYAFEWVINN--GGIDSEANYPYTGQADSVCNTTKEEIKVVSI 254
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V++ E+ + V+ P++V I+ +++ F Y GG+ C G D++DH V
Sbjct: 255 DGYEDVATSESALLCAAVQQ-PVSVGIDGSSLDFQLYAGGI---YDGDCSGNPDDIDHAV 310
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
L+VGYG YWI+KNSWG WG
Sbjct: 311 LVVGYGQQGGT------DYWIVKNSWGTDWG 335
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 87/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDK + GC GG M AFE I+ GG++ E DYPYKG + C N++ +V I
Sbjct: 144 ELVDCDKGFNQGCNGGFMDYAFEFIVKN--GGIDTEDDYPYKGVDGQCDQNRKNAKVVTI 201
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ +V ++ + K V + P++VAI A A Q Y G+ LC G D LDHGV
Sbjct: 202 NGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGI---FNGLC--GTD-LDHGV 255
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG K YWI++NSWGP+WGE
Sbjct: 256 VAVGYGTEDGK------DYWIVRNSWGPNWGE 281
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 85/152 (55%), Gaps = 14/152 (9%)
Query: 63 LVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KIQ 120
LVDCD+ D GC GGLM AFE I+ GG++ E DYPY C NK V I
Sbjct: 177 LVDCDRERDNGCHGGLMDFAFEFIMKN--GGIDTEDDYPYTAEEGMCQDNKMRRHVVTID 234
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V ++ V N P++VAI A+ A Q Y GGV F + G LDHGVL
Sbjct: 235 DYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGV-----FDAECGT-ALDHGVL 288
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+VGYG + TH + PYW++KNSWG WG+K
Sbjct: 289 VVGYGT-ASNGTHHL-PYWLVKNSWGAEWGDK 318
>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
Length = 323
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y ++ E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFNSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 275 GYGVENNI------PYWTFKNTWGTDWGE 297
>gi|26245871|gb|AAN77411.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 200
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 89/152 (58%), Gaps = 13/152 (8%)
Query: 62 KLVDCDKV---DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK 118
+L+DC K D GGLMS AF+ ++ K G+E + YPYKG++ C + ++ +K
Sbjct: 35 QLLDCSKPYGNDDCEHGGLMSFAFDYVLDK---GIEADSSYPYKGTDTPCQYDAKKTVLK 91
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
I+ Y NVS E E+ K + GP++VAI+A+ +Q Y GG+ L C NL+HGVL
Sbjct: 92 IKGYKNVSISEEELKKAVGTVGPVSVAIDADPIQLYSGGILDGL--FC---THNLNHGVL 146
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG F K +W +KNSWG WGE+
Sbjct: 147 AVGYGEEDHLFGKK--KFWKVKNSWGKDWGEQ 176
>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
Length = 337
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL+ A+E I+ GG+E E DYPY+ + C L + ++S
Sbjct: 177 QLVDCDSVDMGCDGGLIHTAYEQIMHM--GGVEQEFDYPYRAERQPCALKPHKFAAGVRS 234
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V +E + L GP+A+A++A + Y+GG+ C+ + L+H VL+V
Sbjct: 235 CYRYVLLNEERLEDLLRYVGPIAIAVDAVDLTDYYGGIVS----FCEN--NGLNHAVLLV 288
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV P+WIIKNSWG +GE
Sbjct: 289 GYGVENNV------PFWIIKNSWGSDYGE 311
>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
Length = 326
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 90/169 (53%), Gaps = 15/169 (8%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY
Sbjct: 146 GNVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTY-TAIQKMGG-LELASDYPY 203
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
G C+++K + I + E A+ L GP++ A+NA+ +Q Y GG+ P
Sbjct: 204 TGVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP 263
Query: 162 LKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
K+ G ++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 264 -KWCDPAG---VNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 302
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 93/163 (57%), Gaps = 19/163 (11%)
Query: 52 IRGEGTHLA-LKLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ GE L+ +LVDCD+ DAGC GGLM AF+ I+ GG++ EKDYPY G N C
Sbjct: 175 VSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDN--GGIDTEKDYPYLGFNNQCD 232
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLC 166
K+ +V I Y +V ++E + K V + P+++AI A A Q Y GV F
Sbjct: 233 PTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVSIAIEAGGRAFQLYESGV-----FNG 286
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ G+ LDHGV+ VGYG Q YWI++NSWG +WGE
Sbjct: 287 ECGL-ALDHGVVAVGYGTDDNG-----QDYWIVRNSWGSNWGE 323
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKIQ 120
+LV+CD + GC GG M AFE +I+ GG++ E DYPY G + C+ KEE + V I
Sbjct: 191 ELVECDTSNYGCEGGYMDYAFEWVINN--GGIDSESDYPYTGVDGTCNTTKEETKVVSID 248
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V ++ + V P++V I+ +A+ F Y GG+ C D++DH VL
Sbjct: 249 GYQDVEQSDSALL-CAVAQQPVSVGIDGSAIDFQLYTGGIYDG---SCSDDPDDIDHAVL 304
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
IVGYG + + YWI+KNSWG WG
Sbjct: 305 IVGYGSEDS------EEYWIVKNSWGTSWG 328
>gi|2582055|gb|AAB82455.1| lymphopain [Mus musculus]
Length = 371
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DC++ GC GG + +A+ T+++ GL EKDYP++G + C K + I
Sbjct: 178 ELLDCERCGNGCNGGFVWDAYLTVLN--NSGLASEKDYPFQGDRKPHRCLAKKYKKVAWI 235
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q + +S++E +A YL +GP+ V IN +Q Y GV C +DH VL+
Sbjct: 236 QDFTMLSNNEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKATPSSCDP--RQVDHSVLL 293
Query: 180 VGYGVHK------TKFTHKIQ-----PYWIIKNSWGPHWGEK 210
VG+G K T +H + PYWI+KNSWG HWGEK
Sbjct: 294 VGFGKKKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWGEK 335
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 89/170 (52%), Gaps = 36/170 (21%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDK + GC GGLM AFE II+ GG++ E+DYPY+ ++ C N++ RV I
Sbjct: 193 ELVDCDKSYNQGCNGGLMDYAFEFIINN--GGIDSEEDYPYRAADTTCDPNRKNARVVSI 250
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ K V N P++VAI A A Q Y GV F + G LDHGV
Sbjct: 251 DGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGV-----FTGQCGT-QLDHGV 304
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGE 227
+ VGYG E ++ +WI++NSWGP WGE
Sbjct: 305 VAVGYGT------------------------ENSVDYWIVRNSWGPNWGE 330
>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 91/166 (54%), Gaps = 23/166 (13%)
Query: 52 IRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY G C
Sbjct: 153 FRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTY-TAIQKMGG-LELASDYPYTGVGGIC 210
Query: 109 HLNKEEIRVKIQSYVNVSS----DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKF 164
H++K K +YVN S+ E A+ L GP++ A+NA+ +Q Y GG+ P K+
Sbjct: 211 HMDKS----KFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP-KW 265
Query: 165 LCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G ++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 266 CDPAG---VNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 302
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 83/154 (53%), Gaps = 19/154 (12%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM NAF+ I K GG++ EK YPY + CH NK I K
Sbjct: 167 LVDCSGKYGNNGCEGGLMDNAFQYI--KENGGIDTEKSYPYLAKDGVCHYNKSAIGAKDT 224
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYF---GGVSHPLKFLCKGGMDNLDHG 176
+V++ + DE + + L GP+++AI+A+ F+F G P LDHG
Sbjct: 225 GFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYDDP-----DCSSTRLDHG 279
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL VGYG K YW++KNSWGP WGE+
Sbjct: 280 VLAVGYGTDDGK------DYWLVKNSWGPSWGEE 307
>gi|16076437|emb|CAC94443.1| cysteine proteinase [Betula pendula]
Length = 133
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/117 (47%), Positives = 78/117 (66%), Gaps = 8/117 (6%)
Query: 68 KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLNKEEIRVKIQSYVNVS 126
D+GC GGLM++AFE + GGL E+DYPY G++R+ C +K +I + ++ +S
Sbjct: 11 SCDSGCSGGLMNSAFEYTLK--AGGLMREEDYPYTGTDRSTCKFDKSKIAASVSNFSVIS 68
Query: 127 SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG 183
DE ++A LVKNGP+AVAINA MQ + GGVS P ++C LDHGVL+VG+G
Sbjct: 69 LDEDQIAANLVKNGPLAVAINAVFMQTHVGGVSCP--YICS---RRLDHGVLLVGFG 120
>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
Length = 326
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 91/166 (54%), Gaps = 23/166 (13%)
Query: 52 IRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY G C
Sbjct: 153 FRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTY-TAIQKMGG-LELASDYPYTGVGGIC 210
Query: 109 HLNKEEIRVKIQSYVNVSS----DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKF 164
H++K K +YVN S+ E A+ L GP++ A+NA+ +Q Y GG+ P K+
Sbjct: 211 HMDKS----KFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP-KW 265
Query: 165 LCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G ++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 266 CDPAG---VNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 302
>gi|432114312|gb|ELK36240.1| Aryl hydrocarbon receptor nuclear translocator [Myotis davidii]
Length = 897
Score = 105 bits (262), Expect = 2e-20, Method: Composition-based stats.
Identities = 60/151 (39%), Positives = 85/151 (56%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC + GCGGG M+NAF+ + + G++ E YPY G + +C N K + Y
Sbjct: 735 LVDCVSENDGCGGGYMTNAFQYV--QRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGY 792
Query: 123 VNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ +E + K + + GP++VAI+A+ + QFY GV + C DNL+H VL
Sbjct: 793 KEIPEGNEKALKKAVARVGPISVAIDASLSSFQFYSKGVYYDEN--CNS--DNLNHAVLA 848
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG+ K K +WIIKNSWG +WG K
Sbjct: 849 VGYGIQKGK------KHWIIKNSWGENWGNK 873
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 85/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LVDCD D GC GGLM +AF+ II GGL E YPY+G + C+ N+E V
Sbjct: 175 ELVDCDTSGADQGCQGGLMDDAFKFIIQN--GGLNTEAQYPYQGVDGTCNTNEEATHVAT 232
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V S+ + + V N P+++AI+A+ F Y GV F G LDHG
Sbjct: 233 ITGYEDVPSNNEQALQQAVANQPISIAIDASGSDFQNYQSGV-----FTGSCGT-QLDHG 286
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V +VGYGV + YW++KNSWG WGE+
Sbjct: 287 VAVVGYGV-----SDDGTKYWLVKNSWGADWGEE 315
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 94/165 (56%), Gaps = 16/165 (9%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKIQ 120
+LVDCD + GC GG M AFE +I+ GG++ E +YPY G + C+ KEEI+ V I
Sbjct: 192 ELVDCDTTNYGCEGGYMDYAFEWVINN--GGIDTEANYPYTGVDGTCNTTKEEIKVVSID 249
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V ++ + V+ P++V ++ +A+ F Y GG+ C +++DH VL
Sbjct: 250 GYTDVDETDSALLCATVQQ-PISVGMDGSALDFQLYTGGI---YDGDCSDDPNDIDHAVL 305
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGP 223
IVGYG + + YWI+KNSWG WG + F+I +N+ P
Sbjct: 306 IVGYG------SENGEDYWIVKNSWGTEWGMEGY-FYIKRNTDLP 343
>gi|56567186|gb|AAV98582.1| cathepsin L-like cysteine proteinase precursor [Trichomonas
vaginalis]
Length = 305
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 82/151 (54%), Gaps = 13/151 (8%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC GC GGLM A++ ++ G E DYPY + +C N + +I+SY
Sbjct: 140 LVDCVTTCYGCNGGLMDAAYDYVVKHQSGKFMTEADYPYTARDGSCKFNAAKGTSQIKSY 199
Query: 123 VNVSS-DETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
VNV+ DE ++A + GP A+AI+A+A Q Y G+ + C NLDHGV
Sbjct: 200 VNVAEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYD--ESACSS--YNLDHGVGC 255
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG +K YWI++NSWG WGEK
Sbjct: 256 VGYGTEGSK------NYWIVRNSWGTTWGEK 280
>gi|386364446|emb|CCH03784.1| Clan CA, family C1, cathepsin L-like cysteine peptidase, partial
[Trichomonas gallinae]
Length = 260
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 80/151 (52%), Gaps = 13/151 (8%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC GC GGLM AF+ +I+ G E DYPY + +C + + K+ Y
Sbjct: 113 LVDCVTSCYGCNGGLMDAAFDYVIASQNGQFNTEADYPYTAVDGSCKYSAAKATSKVTGY 172
Query: 123 VN-VSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
VN V DE ++A + GP A+AI+A+A Q Y G+ + C NLDHGV
Sbjct: 173 VNVVEGDEADLATKVSTLGPAAIAIDASAWSFQLYSSGIYD--ESACSS--YNLDHGVGC 228
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG +K YWI++NSWG WGEK
Sbjct: 229 VGYGTEGSK------NYWIVRNSWGTSWGEK 253
>gi|123492185|ref|XP_001326005.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121908913|gb|EAY13782.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 82/151 (54%), Gaps = 13/151 (8%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC GC GGLM A++ ++ G E DYPY + +C N + +I+SY
Sbjct: 140 LVDCVTTCYGCNGGLMDAAYDYVVKHQSGKFMTEADYPYTARDGSCKFNAAKGTSQIKSY 199
Query: 123 VNVSS-DETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
VNV+ DE ++A + GP A+AI+A+A Q Y G+ + C NLDHGV
Sbjct: 200 VNVAEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYD--ESACSS--YNLDHGVGC 255
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG +K YWI++NSWG WGEK
Sbjct: 256 VGYGTEGSK------NYWIVRNSWGTAWGEK 280
>gi|454890|emb|CAA54438.1| cysteine proteinase, putative [Trichomonas vaginalis]
Length = 292
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 82/151 (54%), Gaps = 13/151 (8%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC GC GGLM A++ ++ G E DYPY + +C N + +I+SY
Sbjct: 127 LVDCVTTCYGCNGGLMDAAYDYVVKHQSGKFMTEADYPYTARDGSCKFNAAKGTSQIKSY 186
Query: 123 VNVSS-DETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
VNV+ DE ++A + GP A+AI+A+A Q Y G+ + C NLDHGV
Sbjct: 187 VNVAEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYD--ESACSS--YNLDHGVGC 242
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG +K YWI++NSWG WGEK
Sbjct: 243 VGYGTEGSK------NYWIVRNSWGTAWGEK 267
>gi|1498185|dbj|BAA06738.1| cysteine proteinase-1 precursor [Drosophila melanogaster]
Length = 254
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 70 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFPYI--KDNGGIDTEKSYPY 127
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH N+ ++ + + ++ DE +M + + GP++VAI+A+ + QFY GV
Sbjct: 128 EAIDDSCHFNRAQVGATDRGFTDIPQGDEKKMPEPVPTVGPVSVAIDASHESFQFYSEGV 187
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + + NLDHGVL+VG+G ++ + YW++KNSWG WG+K
Sbjct: 188 YNEPQCDAQ----NLDHGVLVVGFGTDESG-----EDYWLVKNSWGTTWGDK 230
>gi|311697927|gb|ADQ00258.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ GG + E YPY GS R AC + E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDDAFKWLVDSNGGKVYTEDSYPYVSGSGRTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++A++ Y GV L D L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDASSFLSYVSGV------LTNCLSDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|308082013|ref|NP_001183396.1| uncharacterized protein LOC100501813 [Zea mays]
gi|238011208|gb|ACR36639.1| unknown [Zea mays]
Length = 291
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 87/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ E+DYPYKG++ C +N++ + V I
Sbjct: 8 ELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEEDYPYKGTDGRCDVNRKNAKVVTI 65
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V ++ + + V N P++VAI A A Q Y G+ F G LDHGV
Sbjct: 66 DSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYNSGI-----FTGTCGT-ALDHGV 119
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG K YWI+KNSWG WGE
Sbjct: 120 TAVGYGTENGK------DYWIVKNSWGSSWGES 146
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 85/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDK + GC GGLM AF+ II GG++ E+DYPYK + C N++ RV I
Sbjct: 165 ELVDCDKTYNLGCNGGLMDYAFDFIIEN--GGIDTEEDYPYKAIDSMCDPNRKNARVVTI 222
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ + K V N P++VAI A Q Y GV F G LDHGV
Sbjct: 223 DGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQLYQSGV-----FTGSCGT-QLDHGV 276
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG H + YWI++NSWGP WGE
Sbjct: 277 VTVGYGTE-----HGVD-YWIVRNSWGPAWGE 302
>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 91/166 (54%), Gaps = 23/166 (13%)
Query: 52 IRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY G C
Sbjct: 153 FRETGHLLALSGQQLVDCDYLDDGCDGGYPPQTY-TAIQKMGG-LELASDYPYTGVGGIC 210
Query: 109 HLNKEEIRVKIQSYVNVSS----DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKF 164
H++K K +YVN S+ E A+ L GP++ A+NA+ +Q Y GG+ P K+
Sbjct: 211 HMDKS----KFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP-KW 265
Query: 165 LCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G ++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 266 CDPAG---VNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 302
>gi|91085671|ref|XP_971698.1| PREDICTED: similar to cathepsin L-like protein; cysteine proteinase
[Tribolium castaneum]
gi|270011034|gb|EFA07482.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 86/161 (53%), Gaps = 15/161 (9%)
Query: 53 RGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+G T +++ +LVDCD GCGGG M++AF I GG++ E YPYKG + +CH
Sbjct: 162 KGANTDISVSEQQLVDCDTAADGCGGGWMTDAFTYIAQT--GGIDSESSYPYKGVDESCH 219
Query: 110 LNKEEIRVKIQSYVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKG 168
+++ K++ Y ++ DE +A + GP++VA +A FG S + +
Sbjct: 220 FMSDKVAAKLKGYAYLTGPDENMLADMVSSKGPVSVAFDAEGD---FGSYSGGVYYNPNC 276
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ H VLIVGYG Q YW++KNSWG WGE
Sbjct: 277 ATNKFTHAVLIVGYG------NENGQDYWLVKNSWGDGWGE 311
>gi|311698023|gb|ADQ00306.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698031|gb|ADQ00310.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ G + E YPY G AC ++ E+ +
Sbjct: 30 LVSCDTVDEGCNGGLMDDAFQWLVDSNKGKVYTESSYPYVSGSGQTPACSTSEHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 86/153 (56%), Gaps = 16/153 (10%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVK 118
+L+DCD D GC GGLM +AFE I+ GGL E YPY SN C+ N+ + V+
Sbjct: 187 ELIDCDTGGDDNGCQGGLMESAFE-FIAHSAGGLATEAAYPYHASNGTCNANRGSSVSVR 245
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I + +V + E V + P++VAI+A A QFY GV F G + LDHG
Sbjct: 246 IDGHQSVPAGNEEALAKAVAHQPVSVAIDAGGQAFQFYSEGV-----FTGDCGSE-LDHG 299
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V +VGYGV + + YWI+KNSWGP WGE
Sbjct: 300 VAVVGYGVAE----EDGKEYWIVKNSWGPGWGE 328
>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/173 (41%), Positives = 96/173 (55%), Gaps = 21/173 (12%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
N++ R G LAL +LVDCD ++ GC GG + I K+GG LE DYPY
Sbjct: 146 GNVEGQWFRKTGDLLALSEQQLVDCDHLEKGCNGGYPPKTYGEI-EKMGG-LELASDYPY 203
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVSS----DETEMAKYLVKNGPMAVAINANAMQFYFGG 157
G + C++N+ K +YVN S+ E A+ L + GP++ A+NA +QFY GG
Sbjct: 204 TGVDGICYMNQS----KFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGG 259
Query: 158 VSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ P+ FLC L+H VL VGYG T+F PYWI+KNS G +GEK
Sbjct: 260 IIFPIPFLCNP--HGLNHAVLTVGYG---TEFGI---PYWIVKNSLGVGFGEK 304
>gi|375073962|gb|AFA34848.1| cathepsin L-like protein, partial [Trypanosoma dionisii]
Length = 159
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 80/142 (56%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D+GC GGLMSNAFE I+ + G + E Y Y+ G+ C + + I
Sbjct: 30 LVSCDTADSGCDGGLMSNAFEWIVERHNGTVYTEDSYRYESGDGTAPPCRTSGRAVGAVI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+VN+ DE ++A++L NGP+AVA++A++ Y GGV L + LDHGVL+
Sbjct: 90 SGHVNLPPDEDKLAEWLAANGPLAVAVDASSWMSYTGGV------LTNCYSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY T PYWIIKN
Sbjct: 144 VGYNDSATP------PYWIIKN 159
>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 327
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/150 (40%), Positives = 82/150 (54%), Gaps = 13/150 (8%)
Query: 62 KLVDCD--KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+L+DC + D GCGGG+M AF+ I L GG+E E DYPY+ N C + I +
Sbjct: 161 QLMDCSFKEGDEGCGGGIMDYAFDYIF--LAGGVESEADYPYEARNDHCRFDNSSIAATL 218
Query: 120 QSYVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
V+V+S ET++ K + GP++VAI+A+ + F G + +C LDHGVL
Sbjct: 219 TGCVDVTSGSETQLEKAVGSIGPVSVAIDASHISFQLYGSGVNYEPMCS--TTTLDHGVL 276
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
VGYG YWI+KNSWG WG
Sbjct: 277 AVGYGADNG------NEYWIVKNSWGEGWG 300
>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
Length = 344
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 87/149 (58%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD +D GC GGL+ A+E I+S GG+E E+DYPY+ C + ++ +V + +
Sbjct: 184 QLVDCDTIDMGCAGGLLHTAYEEIMSM--GGVEYEEDYPYRSVQGPCRIENDKFQVSVDN 241
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + E ++ L + GP+AVA++A + Y+GG+ CK L+H VL+V
Sbjct: 242 CYRYILYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITS----CKNY--GLNHAVLLV 295
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYG T P+W++KNSWG +GE
Sbjct: 296 GYG------TENGIPFWVLKNSWGTDYGE 318
>gi|2146900|pir||S67481 cathepsin L-like cysteine proteinase (EC 3.4.22.-) CP1 [similarity]
- fruit fly (Drosophila melanogaster) (fragment)
Length = 218
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 47 LKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ R G ++L LVDC + GC GGLM NAF I K GG++ EK YPY
Sbjct: 34 LEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFPYI--KDNGGIDTEKSYPY 91
Query: 102 KGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV 158
+ + +CH N+ ++ + + ++ DE +M + + GP++VAI+A+ + QFY GV
Sbjct: 92 EAIDDSCHFNRAQVGATDRGFTDIPQGDEKKMPEPVPTVGPVSVAIDASHESFQFYSEGV 151
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ + + NLDHGVL+VG+G ++ + YW++KNSWG WG+K
Sbjct: 152 YNEPQCDAQ----NLDHGVLVVGFGTDESG-----EDYWLVKNSWGTTWGDK 194
>gi|262093298|gb|ACY25973.1| cathepsin L-like protein [Trypanosoma cruzi]
gi|375073848|gb|AFA34791.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073850|gb|AFA34792.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073852|gb|AFA34793.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073854|gb|AFA34794.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073856|gb|AFA34795.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073858|gb|AFA34796.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073860|gb|AFA34797.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073862|gb|AFA34798.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
Length = 159
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 80/142 (56%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAVPYWIIKN 159
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 88/154 (57%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VK 118
+LVDCD VD GC GGLM +AF+ II G L E YPY+G + C+ NK ++ V
Sbjct: 177 ELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG--LSTEAQYPYEGVDGTCNANKASVQAVT 234
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 235 ITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGACGTE-LDHG 288
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV ++ YW++KNSWG WGE+
Sbjct: 289 VTAVGYGV-----SNDGTKYWLVKNSWGTDWGEE 317
>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 370
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 84/158 (53%), Gaps = 19/158 (12%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG----SNRACHLNKEEI 115
+LVDC K + GC GGLM++AFE + + G++ E YPY N C N I
Sbjct: 199 QLVDCSKSYGNNGCSGGLMNSAFEYV--RDNEGIDSEISYPYVSGDGTENNRCLFNASNI 256
Query: 116 RVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDN 172
++ YVN+ DE + + GP++VAINA F Y G+ C+G +D
Sbjct: 257 LAQVTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFSMYKSGIYSDTD--CEGTLDA 314
Query: 173 LDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGVL+VGYG + YW+IKNSWG WGEK
Sbjct: 315 LDHGVLVVGYGEENGR------SYWLIKNSWGEEWGEK 346
>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 89/154 (57%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC D + GC GGLM AF+ + + G++ E+ YPY+G +C + E +
Sbjct: 161 AQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGEYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ + C ++L+ G
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE-RCRCSNKREDLNPG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 97/174 (55%), Gaps = 20/174 (11%)
Query: 45 ANLKKIQIRGEGTHLALK---LVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
+L+ R G+ ++L LVDC D + GC GGLM NAF+ I + G++ EK Y
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYI--RANKGIDTEKSY 207
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
PY G++ CH K + +V++ ET++ K + GP++VAI+A+ + QFY
Sbjct: 208 PYNGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSD 267
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GV + C ++LDHGVL+VGYG T YW++KNSWG WG++
Sbjct: 268 GVYDEPE--CDS--ESLDHGVLVVGYG------TLNGTDYWLVKNSWGTTWGDE 311
>gi|311697995|gb|ADQ00292.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ G + E YPY G AC ++ E+ I
Sbjct: 30 LVSCDTVDEGCNGGLMDDAFKWLVDSNKGKVYTENSYPYVSGSGQTPACSTSEHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 92/167 (55%), Gaps = 20/167 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKIQ 120
+LV CD + GC GG M AF +I GG++ EKDY Y G + C+ NKE + V I
Sbjct: 193 ELVACDATNYGCEGGDMDYAFTWVIQN--GGIDTEKDYSYTGVDSTCNTNKEAKKIVSID 250
Query: 121 SYVNVSSDETEMAKYLVKNG--PMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHG 176
Y +VS D++ + L G P++V I+ +A+ F Y GG+ C G D++DH
Sbjct: 251 GYTDVSPDDSAL---LCAAGSQPVSVGIDGSAIDFQLYTGGIYDG---DCSGNPDDIDHA 304
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGP 223
VL+VGY K YWI+KNSWG WG + F+I++N+ P
Sbjct: 305 VLVVGYSAKNGK------DYWIVKNSWGTDWGLEGY-FYILRNTELP 344
Score = 39.7 bits (91), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 30/48 (62%)
Query: 16 VAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGEGTHLALKL 63
V +F+ +L +H K Y + EE +RL+IFR NL+ I + ++ + +L
Sbjct: 40 VRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRL 87
>gi|66730453|ref|NP_001019413.1| cathepsin W precursor [Rattus norvegicus]
gi|62531092|gb|AAH93401.1| Cathepsin W [Rattus norvegicus]
gi|149062072|gb|EDM12495.1| cathepsin W [Rattus norvegicus]
Length = 371
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DCD+ GC GG + +A+ T+++ GL E+DYP++G + C +K I
Sbjct: 178 ELLDCDRCGNGCNGGFVWDAYITVLN--NSGLASEEDYPFQGHQKPHRCLADKYRKVAWI 235
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q + +SS+E +A YL +GP+ V IN +Q+Y GV C + N H VL+
Sbjct: 236 QDFTMLSSNEQVIAGYLAIHGPITVTINMKLLQYYQKGVIKATPSTCDPHLVN--HSVLL 293
Query: 180 VGYGVHK------TKFTH-----KIQPYWIIKNSWGPHWGEK 210
VG+G K T +H + PYWI+KNSWG WGEK
Sbjct: 294 VGFGKEKGGMQTGTLLSHSRKPRRSTPYWILKNSWGAEWGEK 335
>gi|209962676|gb|ACJ02133.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962678|gb|ACJ02134.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/142 (42%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ + G + EK YPY G AC + E+ I
Sbjct: 30 LVSCDSKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPACKPHGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|288804650|ref|YP_003429335.1| cathepsin [Pieris rapae granulovirus]
gi|270161225|gb|ACZ63497.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 88/149 (59%), Gaps = 16/149 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
L++CD ++ GCGGGLM A ETI+ + GG+ EKD PY G + C ++ V I
Sbjct: 182 LINCDSINNGCGGGLMHWALETILQQ--GGIVSEKDEPYYGLDAVCK--PKQFNVSISGC 237
Query: 123 VN-VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V +E ++ + L+ NGP+++A++ + Y G++ +C+ M+ L+H VL+VG
Sbjct: 238 TRYVLKNENKLRELLIANGPISMAVDIIDVIDYKEGITD----ICEN-MNGLNHAVLLVG 292
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YGVH PYWI+KNSWG WGEK
Sbjct: 293 YGVHNN------IPYWIMKNSWGEEWGEK 315
Score = 36.6 bits (83), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 27/44 (61%)
Query: 18 MFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGEGTHLAL 61
+F F++K+NKSYAT +E + F+ NLK I + G+ A+
Sbjct: 35 IFEDFIKKYNKSYATDQERAIKYENFKNNLKMINDKNNGSKYAV 78
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 86/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE-EIRVK 118
+LVDCD D GC GGLM +AFE I K GGL E +YPY+G++ C+ NK K
Sbjct: 175 ELVDCDTSGEDQGCEGGLMDDAFEFI--KQNGGLTTEANYPYQGTDGTCNTNKAGNDAAK 232
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + V + P++VAI+A+ A QFY GGV F G + LDHG
Sbjct: 233 ITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGGV-----FTGDCGTE-LDHG 286
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V VGYG + YW++KNSWG WGE
Sbjct: 287 VTAVGYGT-----SDDGTKYWLVKNSWGTSWGE 314
>gi|311698051|gb|ADQ00320.1| cathepsin L-like protein [Trypanosoma sp. D30]
Length = 159
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ GG + E YPY G AC ++ E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDDAFKWLVDSNGGKVYTEDSYPYVSGSGQTPACSTSEHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLV 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 82/153 (53%), Gaps = 19/153 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI---RVK 118
+L+DCD VDAGC GGLM AF+ ++ GG+ E YPY GS +C+ NK I +
Sbjct: 178 QLMDCDTVDAGCDGGLMETAFKFVVKN--GGVTTEASYPYTGSVGSCNANKVAIINKVAE 235
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHG 176
I + V+ D + V P+ V+I + F Y G+ L C D+LDHG
Sbjct: 236 ITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGI---LSGQCG---DSLDHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VL++GYG T PYWIIKNSWG WGE
Sbjct: 290 VLLIGYG------TEGGMPYWIIKNSWGTSWGE 316
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 85/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LVDCD D GC GGLM +AF+ II GGL E YPY+G + C+ N+E V
Sbjct: 175 ELVDCDTSGADQGCQGGLMDDAFKFIIQN--GGLNTEAQYPYQGVDGTCNTNEEVTHVAT 232
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V S+ + + V N P++VAI+A+ F Y GV F G LDHG
Sbjct: 233 ITGYEDVPSNNEQALQQAVANQPISVAIDASGSDFQNYQSGV-----FTGSCGT-QLDHG 286
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V +VGYGV + YW++KNSWG WGE+
Sbjct: 287 VAVVGYGV-----SDDGTKYWLVKNSWGEDWGEE 315
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 69/173 (39%), Positives = 95/173 (54%), Gaps = 21/173 (12%)
Query: 46 NLKKIQIRGEGTHLAL---KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYP 100
+L+ R G ++L +LVDC + + GC GGLM NAF+ I K GGLE E+DYP
Sbjct: 175 SLEGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYI--KSVGGLESEEDYP 232
Query: 101 YKGSNRACHLNKEEIRVKIQSYVNVSS-DETEMAKYLVKNGPMAVAINAN--AMQFYFGG 157
YK C + ++ V+V S E+ + K + + GP++VAI+A+ + Q Y GG
Sbjct: 233 YKPKQGTCKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGG 292
Query: 158 V-SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V P + + LDHGVL VGYG + Q YWI+KNSWG WGE
Sbjct: 293 VYDEP-----ECSSEQLDHGVLCVGYGTD-----DQGQDYWIVKNSWGAEWGE 335
>gi|358364411|gb|AEU08936.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 76/142 (53%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ G + E YPY G AC +K E+ K
Sbjct: 30 LVSCDTVDHGCNGGLMDNAFKWLVDSNDGKVYTEDSYPYVSGSGQTPACSTSKHEVGAKA 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|309752918|gb|ADO85436.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 88/149 (59%), Gaps = 16/149 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
L++CD ++ GCGGGLM A ETI+ + GG+ EKD PY G + C ++ V I
Sbjct: 182 LINCDSINNGCGGGLMHWALETILQQ--GGIVSEKDEPYYGLDAVCK--PKQFNVSISGC 237
Query: 123 VN-VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V +E ++ + L+ NGP+++A++ + Y G++ +C+ M+ L+H VL+VG
Sbjct: 238 TRYVLKNENKLRELLIANGPISMAVDIIDVIDYKEGITD----ICEN-MNGLNHAVLLVG 292
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YGVH PYWI+KNSWG WGEK
Sbjct: 293 YGVHNN------IPYWIMKNSWGEEWGEK 315
>gi|311697941|gb|ADQ00265.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNRA--CHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ GG + E YPY GS R C + E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDNAFQWLVDSNGGKVYTEDSYPYVSGSGRTPVCSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/163 (42%), Positives = 93/163 (57%), Gaps = 19/163 (11%)
Query: 52 IRGEGTHLA-LKLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ GE L+ +LVDCD+ DAGC GGLM AF+ II GG++ EKDYPY G N C
Sbjct: 176 VSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDN--GGIDTEKDYPYLGFNNQCD 233
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLC 166
K+ +V I Y +V ++E + K V + P+++AI A A Q Y GV F
Sbjct: 234 PTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVSIAIEAGGRAFQLYESGV-----FNG 287
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ G+ LDHGV+ VGYG Q YWI++NSWG +WGE
Sbjct: 288 ECGLA-LDHGVVAVGYGSDDNG-----QDYWIVRNSWGGNWGE 324
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 85/152 (55%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC + GCGGG M++AF+ I K GG++ E YPY+ +R+C + I
Sbjct: 149 ELVDCSTEYGNDGCGGGWMTSAFDYI--KDNGGIDTESSYPYEAQDRSCRFDANSIGATC 206
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V V E + + + GP++VAI+A+ + QFY GV + K C NLDHGV
Sbjct: 207 TGFVEVQHTEEALHEAVSDIGPISVAIDASHFSFQFYSSGVYYEKK--CS--PTNLDHGV 262
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L VGYG T + YW++KNSWG WG+
Sbjct: 263 LAVGYGTEST------EDYWLVKNSWGSGWGD 288
>gi|22549430|ref|NP_689203.1| cath gene product [Mamestra configurata NPV-B]
gi|215401259|ref|YP_002332563.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|22476609|gb|AAM95015.1| putative cysteine proteinase [Mamestra configurata NPV-B]
gi|198448759|gb|ACH88549.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|390165231|gb|AFL64878.1| cathepsin [Mamestra brassicae MNPV]
gi|401665635|gb|AFP95747.1| putative cysteine proteinase [Mamestra brassicae MNPV]
Length = 341
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/149 (41%), Positives = 83/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL+ A+E I+ GG+E E DYPYK C + + V +++
Sbjct: 181 QLVDCDFVDMGCDGGLIHTAYEQIMHI--GGVEQEYDYPYKAVRLPCAVKPHKFAVGVRN 238
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V E + L GP+A+A++A + Y+GGV + F G L+H VL+V
Sbjct: 239 CYRYVLLSEERLEDLLRHVGPIAIAVDAVDLTDYYGGV---ISFCENNG---LNHAVLLV 292
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW IKNSWGP +GE
Sbjct: 293 GYGVENNV------PYWTIKNSWGPDYGE 315
>gi|343472970|emb|CCD15012.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 382
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 83/151 (54%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D GCGGG AF+ I+S G + E+ YPY G+ C + + + KI
Sbjct: 178 LVSCDTNDFGCGGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ V++ DE +A++L KNGP+A+A++A + Q Y GGV L ++ VL+
Sbjct: 238 RDRVDLPRDENAIAEWLAKNGPVAIAVDATSFQSYTGGV------LTSCISKEMNSAVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGEK
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWSKGWGEK 316
>gi|358364407|gb|AEU08934.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 76/142 (53%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM NAF+ ++ G + E YPY G AC +K E+ +
Sbjct: 30 LVSCDTVDHGCNGGLMDNAFKWLVDSNDGKVYTEDSYPYVSGSGQTPACSTSKHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|389566206|gb|AFK83566.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
Length = 159
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 80/142 (56%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GCGGGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYW+IKN
Sbjct: 144 VGYN------DSAAVPYWVIKN 159
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 83/153 (54%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KIQ 120
+LVDCD VD GC GGLM + FE II GG+ E +YPY N C NKE V +I
Sbjct: 177 ELVDCDSVDHGCDGGLMEHGFEFIIKN--GGISSEANYPYTAVNGTCDTNKEASPVAQIT 234
Query: 121 SYVNVSSD-ETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y V + E E+ K + M+V+I+A +A QFY GV F + G LDHGV
Sbjct: 235 GYETVPVNCEEELQKAVANQLTMSVSIDAGGSAFQFYPSGV-----FTGQCGT-QLDHGV 288
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG T YWI+KNSWG WGE+
Sbjct: 289 TAVGYGS-----TDYGTQYWIVKNSWGTQWGEE 316
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 89/161 (55%), Gaps = 19/161 (11%)
Query: 54 GEGTHLA-LKLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
GE L+ +LVDCD + + GC GGLM AF+ I+ GG++ E DYPYKG + C N
Sbjct: 172 GEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILEN--GGIDTENDYPYKGLDGRCDNN 229
Query: 112 KEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKG 168
K+ V I Y +V ++ E K V P++VAI A Q Y GGV F +
Sbjct: 230 KKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGV-----FTGEC 284
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
G D LDHGVL VGYG + YWI+KNSWG +WGE
Sbjct: 285 GTD-LDHGVLAVGYGSEGS------LDYWIVKNSWGEYWGE 318
>gi|311698005|gb|ADQ00297.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698007|gb|ADQ00298.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698011|gb|ADQ00300.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698017|gb|ADQ00303.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698025|gb|ADQ00307.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ G + E YPY G AC ++ E+ +
Sbjct: 30 LVSCDTVDEGCNGGLMDDAFKWLVDSNKGKVYTESSYPYVSGSGQTPACSTSEHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L D L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESDQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 89/170 (52%), Gaps = 36/170 (21%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDK + GC GGLM AFE II+ GG++ E+DYPY+ ++ C N++ RV I
Sbjct: 110 ELVDCDKSYNQGCNGGLMDYAFEFIINN--GGIDSEEDYPYRAADTTCDPNRKNARVVSI 167
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ K V N P++VAI A A Q Y GV F + G LDHGV
Sbjct: 168 DGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGV-----FTGQCGT-QLDHGV 221
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGE 227
+ VGYG E ++ +WI++NSWGP WGE
Sbjct: 222 VAVGYGT------------------------ENSVDYWIVRNSWGPNWGE 247
>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 83/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFDSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 275 GYGVENNV------PYWTFKNTWGTDWGE 297
>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 83/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFNSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 275 GYGVENNI------PYWTFKNTWGTDWGE 297
>gi|311698053|gb|ADQ00321.1| cathepsin L-like protein [Trypanosoma sp. D30]
Length = 159
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ GG + E YPY G AC ++ E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDDAFKWLVDSNGGKVYTEDSYPYVSGSGQRPACSTSEHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLV 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|1666270|emb|CAA49713.1| envelope glycoprotein [Autographa californica nucleopolyhedrovirus]
Length = 208
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ- 120
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 48 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 105
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y ++ E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 106 CYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFNSG---LNHAVLLV 159
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 160 GYGVENNI------PYWTFKNTWGTDWGE 182
>gi|16612118|gb|AAL27459.1|AF430838_1 cysteine protease [Pagumogonimus skrjabini]
Length = 166
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 81/140 (57%), Gaps = 11/140 (7%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA-CHLNKEEIRVKIQ 120
+L+DCDKVD GC GG +A++ + + GG+E + YPY G + C L+K +
Sbjct: 34 QLLDCDKVDEGCNGGYPMDAYKEL--QRMGGVESQSTYPYTGRQSSQCWLDKNLFVAYLN 91
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
V + DE + A +L NGP++VA+NA+ +QFY G+SHP + LC L+H VL V
Sbjct: 92 DSVMLPKDELKQAAWLADNGPLSVALNADQLQFYRRGISHPPESLCPA--SGLNHAVLSV 149
Query: 181 GYGVHKTKFTHKIQPYWIIK 200
GYG + PYWI+K
Sbjct: 150 GYG------SENGTPYWIVK 163
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD +D GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 177 QLVSCDDMDNGCSGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECS-NSSELVVG 235
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 236 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVLTA----CIG--KQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 290 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 317
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 84/153 (54%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVKI 119
+LVDCD VD GC GGLM AF+ I K GG+ E +YPY R+C+ KE V I
Sbjct: 189 ELVDCDDVDNQGCNGGLMDYAFQYI--KRNGGITTESNYPYLAEQRSCNKAKERSHDVTI 246
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ + + V N P+++AI A+ QFY GV F G + LDHGV
Sbjct: 247 DGYEDVPANNEDALQKAVANQPVSIAIEASGQDFQFYSEGV-----FTGSCGTE-LDHGV 300
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG+ T YWI+KNSWG WGE+
Sbjct: 301 AAVGYGI-----TRDGTKYWIVKNSWGEDWGER 328
>gi|33333714|gb|AAQ11975.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 323
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 90/154 (58%), Gaps = 15/154 (9%)
Query: 60 ALKLVDCDKV---DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC + GC GGLM AF+ + + G++ E+ YPYK C +N E +
Sbjct: 158 AQELVDCATEYYGNEGCNGGLMGQAFDFVEDE---GIQTEESYPYKAKRSICQMNGEYV- 213
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++Y ++ +E E+A+ + GP+AVAI+A+ + FY G+ K C ++L+HG
Sbjct: 214 TKVKTY-HLLLNEQEIARAVSAKGPVAVAIDASQLSFYDQGIVDE-KCKCSKKREDLNHG 271
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 272 VLVVGYG------SENGVDYWIVKNSWGADWGEK 299
>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 89/154 (57%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC + + GC GGLM AF+ + + G++ E+ YPY+G +C + + +
Sbjct: 161 AQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGDYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ C ++L+HG
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDET-CRCSNKREDLNHG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
Length = 323
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 83/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFNSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 275 GYGVENNI------PYWTFKNTWGTDWGE 297
>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 83/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFDSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 275 GYGVENNI------PYWTFKNTWGTDWGE 297
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/150 (40%), Positives = 84/150 (56%), Gaps = 13/150 (8%)
Query: 63 LVDC-DKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC DK + GCGGGLM NAF I K G++ E+ YPY+ N C N + + +
Sbjct: 161 LVDCSDKYGNFGCGGGLMDNAFRYI--KDNNGIDTEESYPYEAKNGPCRFNSDNVGATLS 218
Query: 121 SYVNVS-SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
SYV++ E ++ K + + GP++VAI+A+ F+F S + + K LDHGVL
Sbjct: 219 SYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTFHF--YSRGIYYDEKCSSSFLDHGVLA 276
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG T YW++KNSW WG+
Sbjct: 277 VGYG------TDDSSDYWLVKNSWNETWGD 300
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 84/152 (55%), Gaps = 17/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
+LVDCD K +AGC GGLM AF+ I GG+ E YPY+ +C + + V I
Sbjct: 299 QLVDCDTKANAGCNGGLMDYAFQYIAKH--GGVAAEDAYPYRARQASCKKSPAPV-VTID 355
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V +++ K V + P++VAI A+ QFY GV F + G + LDHGV
Sbjct: 356 GYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGV-----FSGRCGTE-LDHGVA 409
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYGV T YW++KNSWGP WGEK
Sbjct: 410 AVGYGV-----TADGTKYWLVKNSWGPEWGEK 436
>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
Length = 343
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 85/155 (54%), Gaps = 21/155 (13%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-------KGSNRACHLNKEE 114
+L+DCD D GC GG M +A+E + ++ GLE E+DYPY K C +
Sbjct: 179 QLIDCDYKDNGCEGGDMLSAYEYVKAR---GLEAEEDYPYEELGYRHKPVRGPCRYQPSK 235
Query: 115 IRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLD 174
+ I +Y VS DE ++A LVKNGP+++A+ N + Y GGV+ P +C G ++
Sbjct: 236 VVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYEGGVACPR--ICPG---EIN 290
Query: 175 HGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
HGVL+VGYGV YW KN+W +GE
Sbjct: 291 HGVLLVGYGVENG------LRYWTFKNTWTDEFGE 319
Score = 39.7 bits (91), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
F HF++K K Y T EEY RL++F+ANL +
Sbjct: 46 FKHFMQKFGKVYGTTEEYVHRLKVFQANLAHV 77
>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
Length = 323
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 83/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++DCD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYIIVYEEKLKDLLPLVGPIPMAIDAADIVNYKQGI---IKYCFDSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 275 GYGVENNI------PYWTFKNTWGTDWGE 297
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 71/154 (46%), Positives = 86/154 (55%), Gaps = 19/154 (12%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK-I 119
+LVDCD K + GC GGLM AFE II GG++ EKDYPYK + C + +V I
Sbjct: 182 ELVDCDRKQNQGCNGGLMDYAFEFIIKN--GGIDTEKDYPYKARDGRCDEGRRNSKVVVI 239
Query: 120 QSYVNV-SSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHG 176
Y +V + E+ + K L KN P++VAI A F Y GGV F G + LDHG
Sbjct: 240 DDYQDVPTQSESALMKALTKN-PVSVAIEAGGRDFQHYQGGV-----FTGPCGSE-LDHG 292
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL VGYG YWI+KNSWGP WGEK
Sbjct: 293 VLAVGYGTDDDGVN-----YWIVKNSWGPGWGEK 321
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 90/164 (54%), Gaps = 19/164 (11%)
Query: 52 IRGEGTHLA-LKLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ G T L+ +LVDCD + GC GGLM AF+ IIS GGL+ E DYPYK +N +C
Sbjct: 172 VTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIISN--GGLDSEDDYPYKANNGSCD 229
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLC 166
++ V I Y +V ++ + K N P++VAI A+ A QFY GV F
Sbjct: 230 AYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGV-----FTS 284
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G LDHGV +VGYG + YW++KNSWG WGEK
Sbjct: 285 NCGT-QLDHGVTLVGYG------SESGIDYWLVKNSWGNSWGEK 321
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 96/166 (57%), Gaps = 17/166 (10%)
Query: 62 KLVDCDKVDA-GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GG M +AF+ +I GG++ E DYPY G + C+ KEE +V I
Sbjct: 188 ELVDCDTTNNYGCEGGDMDSAFQWVIGN--GGIDTEADYPYTGVDGTCNTAKEEKKVVSI 245
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
+ YV+V ++ + V+ P++V ++ +A+ F Y GG+ C G +++DH +
Sbjct: 246 EGYVDVDPSDSALLCATVQQ-PISVGMDGSALDFQLYTGGI---YDGDCSGDPNDIDHAI 301
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGP 223
LIVGYG + + YWI+KNSWG WG + F+I +N+ P
Sbjct: 302 LIVGYG------SENDEDYWIVKNSWGTEWGMEGY-FYIRRNTSKP 340
>gi|6649593|gb|AAF21470.1|U85983_1 cysteine proteinase [Clonorchis sinensis]
Length = 259
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/162 (40%), Positives = 87/162 (53%), Gaps = 15/162 (9%)
Query: 52 IRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY G C
Sbjct: 86 FRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTY-TAIQKMGG-LELASDYPYTGVGGIC 143
Query: 109 HLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKG 168
H++K + + + E A+ L GP++ A+NA+ +Q Y GG+ P K+
Sbjct: 144 HMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP-KWCDPA 202
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G ++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 203 G---VNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 235
>gi|21483194|gb|AAL49964.1| cathepsin L [Ascaris suum]
Length = 169
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 88/153 (57%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + GC GGLM AFE I K G++ E YPYKG CH NK+ + + +
Sbjct: 4 LVDCSRKFGNNGCNGGLMDYAFEYI--KDNHGVDTEASYPYKGKEMKCHFNKKTVGAEDE 61
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
YV++ DE ++ + GP++VAI+A + Q Y GV + + C ++LDHGV
Sbjct: 62 GYVDLPEGDEEKLKVAVATQGPISVAIDAGHPSFQMYRKGVYYEPQ--CS--SESLDHGV 117
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYG + YWI+KNSWGP WGEK
Sbjct: 118 LVVGYGTDEID-----GDYWIVKNSWGPGWGEK 145
>gi|311697923|gb|ADQ00256.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-KGSNR--ACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ GG + E YPY GS R AC + E+ +
Sbjct: 30 LVSCDTVDQGCNGGLMDDAFKWLVDSNGGKVYTEDSYPYVSGSGRTPACSTSNHEVGATV 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV++ DE +MA ++ NGP+AVA++AN+ Y GV L L+HGVL+
Sbjct: 90 TGYVDLPQDEDKMAAWVAANGPLAVAVDANSFLSYVSGV------LTNCQSYQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 89/154 (57%), Gaps = 15/154 (9%)
Query: 60 ALKLVDC---DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR 116
A +LVDC + + GC GGLM AF+ + + G++ E+ YPY+G +C + + +
Sbjct: 161 AQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPYEGRRSSCKKSGDYV- 216
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
K+++YV DE EMA+ + GP+AVAI A+ + FY G+ C ++L+HG
Sbjct: 217 TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDET-CRCSNKREDLNHG 274
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG + YWI+KNSWG WGEK
Sbjct: 275 VLVVGYG------SENGVDYWIVKNSWGADWGEK 302
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 87/154 (56%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VK 118
+LVDCD VD GC GGLM +AF+ II G L E YPY+G + C+ NK I+
Sbjct: 177 ELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG--LNTEAQYPYQGVDGTCNANKASIQATT 234
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 235 ITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHG 288
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV ++ YW++KNSWG WGE+
Sbjct: 289 VTAVGYGV-----SNDGTKYWLVKNSWGTDWGEE 317
>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
Length = 443
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD +D GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 177 QLVSCDDMDNGCSGGLMLQAFDWLLQNTNGHLYTEDSYPYVSGNGYVPECS-NSSELVVG 235
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 236 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVLTA----CIG--KQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 290 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 317
>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
Length = 332
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 91/172 (52%), Gaps = 19/172 (11%)
Query: 41 RIFRANLKKIQIRGEGTHLALKLVDCD--KVDAGCGGGLMSNAFETIISKLGGGLEGEKD 98
++FR K I + + LVDC + + GC GGLM NAF+ I K GGL+ E+
Sbjct: 150 QMFRKTSKLISLSEQ------NLVDCSWPEGNEGCNGGLMDNAFQYI--KDNGGLDSEES 201
Query: 99 YPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
YPY G + +C + YV++ E + K + GP++V I+A+ + QFY
Sbjct: 202 YPYFGKDGSCKYKPQSSAANDTGYVDIPKQEKALMKAVATVGPISVGIDASHESFQFYST 261
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
G+ F + ++LDHGVL+VGYGV H YW++KNSWG WG
Sbjct: 262 GI----YFEPQCSSEDLDHGVLVVGYGVEG---AHSNNKYWLVKNSWGNTWG 306
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 86/152 (56%), Gaps = 18/152 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD+ + GC GGLM AFE I+ GG++ E+DYPY + C N++ RV I
Sbjct: 190 ELVDCDRGYNMGCNGGLMDYAFEFIVQN--GGIDTEEDYPYHAKDNTCDPNRKNARVVTI 247
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V +++ + V N P++VAI A M+F Y GV F + G NLDHGV
Sbjct: 248 DGYEDVPTNDEKSLMKAVANQPVSVAIEAGGMEFQLYQSGV-----FTGRCGT-NLDHGV 301
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG T YW+++NSWG WGE
Sbjct: 302 VAVGYG------TENGTDYWLVRNSWGSAWGE 327
>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
Length = 364
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL+ A+E I+ GG+E + DYPY+ + C L + ++S
Sbjct: 204 QLVDCDHVDMGCDGGLIHTAYEEIMRM--GGVEQDFDYPYRAERQPCALKPHKFAAGVRS 261
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V +E + L GP+A+A++A + Y+GG+ C+ + L+H VL+V
Sbjct: 262 CYRYVLLNEERLEDLLRHVGPIAIAVDAVDITDYYGGIVS----FCEN--NGLNHAVLLV 315
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYWI+KNSWG +GE
Sbjct: 316 GYGVENNV------PYWILKNSWGSDYGE 338
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/162 (40%), Positives = 87/162 (53%), Gaps = 17/162 (10%)
Query: 63 LVDCD--KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + GC GGLM AF+ II GG++ E+ YPYK + CH K I +
Sbjct: 170 LVDCSGKEGNEGCDGGLMDQAFQYIIK--AGGIDTEESYPYKAVDGECHFKKANIGATVT 227
Query: 121 SYVNVSSD-ETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V+SD ET + K + GP++VAI+A+ M F Y GV + C + LDHGV
Sbjct: 228 GYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPD--CSSTL--LDHGV 283
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKN 219
L VGYG T YWI+KNSW WG W+ +N
Sbjct: 284 LAVGYGT-----TSDGTDYWIVKNSWAETWGMNGY-LWMSRN 319
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 87/154 (56%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LVDCD VD GC GGLM +AF+ II GL+ E YPY+G + C+ N+ I
Sbjct: 176 ELVDCDTKGVDQGCEGGLMDDAFKFIIQN--HGLDTEAKYPYQGVDGTCNANEASINAAT 233
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I SY +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 234 ITSYEDVPTNNEQALQKAVANQPISVAIDASGSDFQFYTSGV-----FTGSCGTE-LDHG 287
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV + YW++KNSWG WGE+
Sbjct: 288 VTAVGYGV-----SDDGTKYWLVKNSWGTSWGEE 316
>gi|5881566|dbj|BAA84280.1| Cysteine proteinase [Clonorchis sinensis]
Length = 232
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/162 (40%), Positives = 87/162 (53%), Gaps = 15/162 (9%)
Query: 52 IRGEGTHLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL +LVDCD +D GC GG + T I K+GG LE DYPY G C
Sbjct: 59 FRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTY-TAIQKMGG-LELASDYPYTGVGGIC 116
Query: 109 HLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKG 168
H++K + I + E A+ L GP++ A+NA+ +Q Y GG+ P K+
Sbjct: 117 HMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP-KWCDPA 175
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G ++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 176 G---VNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 208
>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 332
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 177 QLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECS-NSSELVVG 235
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 236 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVLTA----CIG--KQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 290 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 317
>gi|298709635|emb|CBJ31444.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
Length = 475
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 84/153 (54%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI--RVKI 119
+LVDCD D GC GGLM +F I + GG+ E+DYPY + C + ++ +
Sbjct: 308 ELVDCDTYDMGCNGGLMDYSFHWI--QQNGGICSEEDYPYTAAGDLCKKSTCDVVEGTMV 365
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
+V+V+SD+ + V P+++AI A+ M F Y GGV L C NLDHGV
Sbjct: 366 DKWVDVASDDEQALMEAVAQQPVSIAIEADQMSFQLYSGGV---LTAACG---TNLDHGV 419
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYGV + YW +KNSWGP WG +
Sbjct: 420 LLVGYGVSEDGVK-----YWKVKNSWGPEWGAE 447
>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/162 (40%), Positives = 84/162 (51%), Gaps = 15/162 (9%)
Query: 52 IRGEGTHLALK---LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
R G LAL LVDCD +D GC GG T I K+GG LE DYPY G C
Sbjct: 153 FRKTGHLLALSEQPLVDCDYLDGGCDGGYPPQT-NTAIQKMGG-LELASDYPYTGVGGIC 210
Query: 109 HLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKG 168
+++K + I + E A+ L GP++ A+NA+ +Q Y GG+ P LC
Sbjct: 211 YMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPR--LCDP 268
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
++H VL VGYGV K PYWI+KNSWG +GE+
Sbjct: 269 A--GVNHAVLTVGYGVQNGK------PYWIVKNSWGEDFGEE 302
>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
Length = 324
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 87/150 (58%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI-Q 120
+L+DCD VD GC GGL+ A+E +++ GG++ E DYPY+ +N C LN + VK+ +
Sbjct: 164 QLIDCDFVDMGCDGGLLHTAYEAVMNM--GGIQAENDYPYEANNGDCRLNAAKFVVKVKK 221
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V E ++ L GP+ VAI+A+ + Y GV +++ G L+H VL+V
Sbjct: 222 CYRYVLMFEEKLKDLLRIVGPLPVAIDASDIVNYKRGV---IRYCANHG---LNHAVLLV 275
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GY V P+WI+KN+WG WGE+
Sbjct: 276 GYAVENGV------PFWILKNTWGTDWGEQ 299
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 86/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GGLM AFE I+ GG++ E DYPYKG + C N++ +V I
Sbjct: 201 ELVDCDNGYNQGCNGGLMDYAFEFIVKN--GGIDTEDDYPYKGVDGLCDQNRKNAKVVTI 258
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ + K V + P++VAI A A Q Y GV F + G + LDHGV
Sbjct: 259 NGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGV-----FTGQCGTE-LDHGV 312
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ VGYG K YWI++NSWGP WGE
Sbjct: 313 VAVGYGSENGK------DYWIVRNSWGPDWGES 339
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 88/155 (56%), Gaps = 20/155 (12%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR--VK 118
+LVDCD+ + GC GGLM AF+ II GG++ E+DYPYK ++ C ++E V
Sbjct: 188 ELVDCDRGQNQGCNGGLMDYAFDFIIKN--GGIDTEEDYPYKATDGQCDEARKETSKVVV 245
Query: 119 IQSYVNV-SSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDH 175
I Y +V + E+ + K + KN P++VAI A F Y GGV F G D LDH
Sbjct: 246 IDDYQDVPTKSESSLLKAVSKN-PVSVAIEAGGRDFQHYQGGV-----FTGPCGTD-LDH 298
Query: 176 GVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GVL VGYG YWI+KNSWGP WGEK
Sbjct: 299 GVLAVGYGTDDDGVN-----YWIVKNSWGPSWGEK 328
>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 366
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 177 QLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLYTEDSYPYVSGNGYVPECS-NSSELVVG 235
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 236 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVLTA----CIG--KQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 290 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 317
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 89/153 (58%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
L+DC + GCGGGLM NAF I K G++ E+ YPY+G C +KE+ +
Sbjct: 171 LIDCSTSYGNNGCGGGLMDNAFTYI--KENHGIDTEESYPYEGKQGKCRYHKEDSAGRDT 228
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ S +E +AK L GP++VAI+A+ + QFY GV +P C +LDHGV
Sbjct: 229 GFVDIPSGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPD--CDS--HSLDHGV 284
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L VGYG T Q Y+IIKNSWG WG++
Sbjct: 285 LAVGYGT-----TDDGQDYYIIKNSWGERWGQE 312
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 84/153 (54%), Gaps = 18/153 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEE-IRVK 118
+LVDCD +D GC GGLM AFE II+ GGL E +YPYKG + C+ NK I V
Sbjct: 177 ELVDCDTKGIDHGCEGGLMDTAFEFIINN--GGLTTESNYPYKGEDGTCNFNKTNPIAVS 234
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V +++ + V + P++VAI A QFY GV F + G + LDH
Sbjct: 235 ITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGV-----FTGECGTE-LDHA 288
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V VGYG + YWI+KNSWG WGE
Sbjct: 289 VTAVGYGESEDG-----SKYWIVKNSWGTKWGE 316
>gi|170579559|ref|XP_001894882.1| cathepsin F-like cysteine proteinase [Brugia malayi]
gi|158598358|gb|EDP36268.1| cathepsin F-like cysteine proteinase, putative [Brugia malayi]
Length = 137
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 67/119 (56%), Gaps = 8/119 (6%)
Query: 91 GGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANA 150
GGLE E YPY+ N CHL + +I V I V + +ET M ++ + GP++V I+A
Sbjct: 2 GGLEPEDQYPYEAKNGTCHLVRAQIAVSIDDAVEIPRNETVMKAWIAQRGPLSVGIDAEL 61
Query: 151 MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ +Y G+ HP K C ++HGVLI GYG+ PYW IKNSWG WGE
Sbjct: 62 LSYYKSGILHPSKSRCP--PSKINHGVLITGYGIEDN------LPYWTIKNSWGEQWGE 112
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 82/150 (54%), Gaps = 12/150 (8%)
Query: 63 LVDCD--KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + GC GG M+NAF+ + K GGL+ E YPY + +C E
Sbjct: 166 LVDCSHPQGNQGCNGGFMNNAFQYV--KENGGLDSEASYPYVAKDGSCKYKPENSVANDT 223
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+V + + E E+ K + GP++VA++A ++ QFY G+ F NLDHGVL
Sbjct: 224 GFVVIPAHEKELMKAVATVGPISVAVDASHSSFQFYKSGI----YFEQDCSSKNLDHGVL 279
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+VGYG T + YW+IKNSWGP WG
Sbjct: 280 VVGYGFEGTNSNNNN--YWLIKNSWGPEWG 307
>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
Length = 343
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 85/155 (54%), Gaps = 21/155 (13%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY-------KGSNRACHLNKEE 114
+L+DCD D GC GG M +A+E + ++ GLE ++DYPY K C +
Sbjct: 179 QLIDCDYKDNGCEGGDMLSAYEYVKAR---GLEADEDYPYEELGYRHKPVRGPCRYQPSK 235
Query: 115 IRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLD 174
+ I +Y VS DE ++A LVKNGP+++A+ N + Y GGV+ P +C G ++
Sbjct: 236 VVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYEGGVACPR--ICPG---EIN 290
Query: 175 HGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
HGVL+VGYGV YW KNSW +GE
Sbjct: 291 HGVLLVGYGVENG------LRYWTFKNSWTDEFGE 319
Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 21/29 (72%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANL 47
F HF++K K Y T EEY RL++F+ANL
Sbjct: 46 FKHFMQKFGKVYGTTEEYVHRLKVFQANL 74
>gi|161016200|gb|ABX56032.1| cytotoxic cysteine proteinase [Trichomonas vaginalis]
Length = 305
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 80/151 (52%), Gaps = 13/151 (8%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC GC GGLM A++ ++ GG E DYPY + +C + + K+ Y
Sbjct: 140 LVDCVTTCYGCNGGLMDAAYDYVVKHQGGKFMTEADYPYTAQDGSCKFSAAKGTSKVTGY 199
Query: 123 VN-VSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
VN V DE ++A + GP A+AI+A+A Q Y G+ + C NLDHGV
Sbjct: 200 VNVVEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYD--ESACSS--YNLDHGVGC 255
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG +K YWI++NSWG WGEK
Sbjct: 256 VGYGTEGSK------NYWIVRNSWGTSWGEK 280
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 81/152 (53%), Gaps = 17/152 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH--LNKEEIRVKI 119
+L+DCD + GC GGLM NAFE I S GG+ E YPY SN C + V I
Sbjct: 182 ELIDCDTDENGCQGGLMENAFEFIKSH--GGITTESAYPYHASNGTCDGARARRGRVVAI 239
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ V + + V + P++VAI+A A+QFY GV F G D LDHGV
Sbjct: 240 DGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQALQFYSEGV-----FTGDCGTD-LDHGV 293
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYGV + PYWI+KNSWGP WGE
Sbjct: 294 AAVGYGV-----SDDGTPYWIVKNSWGPSWGE 320
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 89/153 (58%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
L+DC + GCGGGLM NAF I K G++ E+ YPY+G C +KE+ +
Sbjct: 166 LIDCSTSYGNNGCGGGLMDNAFTYI--KENHGIDTEESYPYEGKQGKCRYHKEDSAGRDT 223
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ S +E +AK L GP++VAI+A+ + QFY GV +P C +LDHGV
Sbjct: 224 GFVDIPSGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPD--CDS--HSLDHGV 279
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L VGYG T Q Y+IIKNSWG WG++
Sbjct: 280 LAVGYGT-----TDDGQDYYIIKNSWGERWGQE 307
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 84/150 (56%), Gaps = 12/150 (8%)
Query: 63 LVDCDKVD--AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + AGC GGLM NAF + K GGL+ E+ YPY + C E+
Sbjct: 166 LVDCSRAEGNAGCNGGLMDNAFRYV--KDNGGLDSEESYPYLAQDGRCKYKPEQSAANDT 223
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+ ++ DE + + GP++VAI+A + +FY+ G+ + C ++LDHGVL
Sbjct: 224 GFADIHQDEESLMLSVATVGPISVAIDASLDTFRFYYKGIYYDPN--CSS--EDLDHGVL 279
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+VGYG + + +K YWI+KNSWG WG
Sbjct: 280 VVGYGSDEREAENK--NYWIVKNSWGTQWG 307
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 82/152 (53%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+++DC + GC GGLM+N+FE II+ GGL+ E YPY G C NK+ I I
Sbjct: 162 QILDCSGSEGNNGCDGGLMTNSFEYIIAV--GGLDTEASYPYTGEVGKCKFNKKNIGATI 219
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y NV S + V P++VAI+A ++ Q Y GV + + C LDHGV
Sbjct: 220 TGYKNVESGSESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPE--CSS--TQLDHGV 275
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L VGYG + Q YWI+KNSWG WGE
Sbjct: 276 LAVGYG------SQSGQDYWIVKNSWGADWGE 301
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 85/156 (54%), Gaps = 20/156 (12%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK----EEIR 116
+L+DCD D GC GGLM NAFE I K GGL E YPY+ +N C K +
Sbjct: 189 ELIDCDTADNDGCEGGLMDNAFEYI--KKNGGLTTEAAYPYRAANGTCKAAKVAKSSPMV 246
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLD 174
V I + +V ++ E V N P++V I+A+ A FY GV F + G + LD
Sbjct: 247 VHIDGHQDVPANSEEALAKAVANQPVSVGIDASGKAFMFYSEGV-----FTGECGTE-LD 300
Query: 175 HGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
HGV +VGYGV + + YW +KNSWGP WGEK
Sbjct: 301 HGVAVVGYGVAEDG-----KAYWTVKNSWGPSWGEK 331
>gi|375073906|gb|AFA34820.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073910|gb|AFA34822.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073912|gb|AFA34823.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073916|gb|AFA34825.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073918|gb|AFA34826.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073926|gb|AFA34830.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073928|gb|AFA34831.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073930|gb|AFA34832.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073932|gb|AFA34833.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073934|gb|AFA34834.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|400234869|gb|AFP74098.1| cathepsin L- like protein, partial [Trypanosoma cruzi]
Length = 159
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAVPYWIIKN 159
>gi|375073880|gb|AFA34807.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073882|gb|AFA34808.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073884|gb|AFA34809.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073886|gb|AFA34810.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073888|gb|AFA34811.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073890|gb|AFA34812.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
Length = 159
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAVPYWIIKN 159
>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 87/150 (58%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI-Q 120
+L+DCD VD GC GGL+ A+E +++ GG++ E DYPY+ +N C N + VK+ +
Sbjct: 164 QLIDCDFVDMGCDGGLLHTAYEAVMNM--GGIQAENDYPYEANNGDCRANAAKFVVKVKK 221
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y ++ E ++ L GP+ VAI+A+ + Y G+ +K+ G L+H VL+V
Sbjct: 222 CYRYITVFEEKLKDLLRSVGPIPVAIDASDIVNYKRGI---MKYCANHG---LNHAVLLV 275
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GY V P+WI+KN+WG WGE+
Sbjct: 276 GYAVQNGV------PFWILKNTWGADWGEQ 299
>gi|357614049|gb|EHJ68876.1| hypothetical protein KGM_22410 [Danaus plexippus]
Length = 251
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 80/161 (49%), Gaps = 2/161 (1%)
Query: 49 KIQIRGEGTHLALKLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC 108
KI + E L+DC + GC + F TI++ +GG L PY+ + C
Sbjct: 68 KIHLSSEEILSEQFLIDCAPGNIGCNSTSVLKTFGTIVNDIGGVLRDLDYKPYEAKQKKC 127
Query: 109 HLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKG 168
+ + + + Y V DE MA Y+V GP++ AIN+ +M Y GG+ P LC
Sbjct: 128 SWDPLKRPIPVVGYRRVKPDEQIMALYVVNVGPLSAAINSASMAKYNGGIDEPTDKLCSP 187
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
N H VLIVG+ ++ + PYWIIKNSWG WG+
Sbjct: 188 RQTN--HAVLIVGFSFYEDPQSKTYVPYWIIKNSWGTSWGD 226
>gi|26245869|gb|AAN77410.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 200
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 88/152 (57%), Gaps = 13/152 (8%)
Query: 62 KLVDCDKV---DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK 118
+L+DC K D GGLMS AF+ ++ K G+E + Y YKG + C + ++ +K
Sbjct: 35 QLLDCSKPYGNDDCEHGGLMSFAFDYVLDK---GIEADSSYLYKGIDTPCQYDAKKTVLK 91
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
I+ Y NVS E E+ K + GP++VAI+A+ +Q YFGG+ L C NL+HGVL
Sbjct: 92 IKGYKNVSISEEELKKAVGTVGPVSVAIDADPIQLYFGGILDGL--FC---THNLNHGVL 146
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG F K +W +KNSWG WGE+
Sbjct: 147 AVGYGEEDHLFGKK--KFWKVKNSWGKDWGEQ 176
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 89/153 (58%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GG+M AF+ I K GG++ EK YPY+ + CH N + + +
Sbjct: 174 LVDCSGKYGNNGCNGGMMDYAFQYI--KDNGGIDTEKSYPYEAIDDTCHFNPKAVGATDK 231
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
YV++ DE + K L GP+++AI+A+ + QFY GV + + C +NLDHGV
Sbjct: 232 GYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQ--CDS--ENLDHGV 287
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L VGYG + + + YW++KNSWG WG++
Sbjct: 288 LAVGYGT-----SEEGEDYWLVKNSWGTTWGDQ 315
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 82/152 (53%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LV+CD + GC GGLM AFE II GG++ E+DYPY G + C NK+ +V I
Sbjct: 191 ELVNCDTSYNQGCNGGLMDYAFEFIIKN--GGIDTEEDYPYTGKDGKCDKNKKNAKVVTI 248
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V ++ K V N P+AVAI A QFY G+ F G LDHGV
Sbjct: 249 DSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGI-----FTGSCGT-ALDHGV 302
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L GYG K YW++KNSWG WGE
Sbjct: 303 LAAGYGTEDGK------DYWLVKNSWGAEWGE 328
>gi|123457373|ref|XP_001316414.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121899120|gb|EAY04191.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 80/151 (52%), Gaps = 13/151 (8%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC GC GGLM A++ ++ GG E DYPY + +C + + K+ Y
Sbjct: 140 LVDCVTTCYGCNGGLMDAAYDYVVKHQGGKFMTEADYPYTAQDGSCKFSAAKGTSKVTGY 199
Query: 123 VN-VSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
VN V DE ++A + GP A+AI+A+A Q Y G+ + C NLDHGV
Sbjct: 200 VNVVEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYD--ESACSS--YNLDHGVGC 255
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG +K YWI++NSWG WGEK
Sbjct: 256 VGYGTEGSK------NYWIVRNSWGTSWGEK 280
>gi|149725427|ref|XP_001494683.1| PREDICTED: cathepsin W-like [Equus caballus]
Length = 373
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 19/163 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DCD+ GC GG + +AF T+++ GL EKDYP++G + C K ++ I
Sbjct: 180 QLLDCDRCGNGCKGGFVWDAFLTVLNN--SGLASEKDYPFRGDAKPHRCQAKKPKV-AWI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + DE ++A+YL +GP+ V IN +Q Y GV C +LDH VL+
Sbjct: 237 QDFIRLPEDEQKIAEYLATHGPITVTINMKLLQQYQKGVIKATPTTCD--PQHLDHSVLL 294
Query: 180 VGYG------------VHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+G V + YWI+KNSWG WGE+
Sbjct: 295 VGFGGGKSVEGRRPGAVSSQSRPRRSSSYWILKNSWGAKWGEE 337
>gi|209962684|gb|ACJ02137.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/142 (42%), Positives = 76/142 (53%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCG GLM NAFE I+ K G + EK YPY G AC + E+ I
Sbjct: 30 LVSCDSKDNGCGXGLMDNAFEWIVKKNSGKVYTEKSYPYVSGGGEEPACKPHGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 84/153 (54%), Gaps = 17/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVKI 119
+L+DC+ + GC GGLM AF+ I + GG+ E YPY+G +C +KE V I
Sbjct: 183 ELMDCNIGENDGCNGGLMDVAFQFI--QQNGGITTEASYPYQGEQNSCDQSKENSHDVSI 240
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V +++ + V N P++VAI+A N QFY GV F GG D LDHGV
Sbjct: 241 DGYEDVPANDESALQKAVANQPVSVAIDASGNDFQFYSEGV-----FTTDGGTD-LDHGV 294
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG T YWI+KNSWG WGEK
Sbjct: 295 AAVGYGT-----TRDGTKYWIVKNSWGEDWGEK 322
>gi|209962666|gb|ACJ02128.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ + G + EK YPY G C + E+ I
Sbjct: 30 LVSCDTKDGGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCQRDGHEVGAVI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ D +AKYL NGP+AVA++A + Y GGV + L+HGVL+
Sbjct: 90 TGHVDIPQDGAAIAKYLADNGPVAVAVDATSFMSYSGGVVTSCT------SEQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYNDSSKP------PYWIIKN 159
>gi|209962636|gb|ACJ02113.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D+GC GGLM NAFE I+ + G + EK YPY G C E+ I
Sbjct: 30 LVSCDTTDSGCSGGLMDNAFEWIVEENSGKVYTEKSYPYVSGGGEEPPCKPRGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYNDSSKP------PYWIIKN 159
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 87/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD+ DAGC GGLM AFE II+ GG++ E+DYPY+G + C ++ + V I
Sbjct: 187 ELVDCDRTYDAGCNGGLMDYAFEFIINN--GGIDTEEDYPYRGVDGTCDPERKNTKVVSI 244
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V + K V + P++VAI A+ A Q Y GV F + G LDHGV
Sbjct: 245 NDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAFQLYLSGV-----FTGECGR-ALDHGV 298
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
++VGYG T +WI++NSWG WGE
Sbjct: 299 VVVGYG------TDNGADHWIVRNSWGTSWGE 324
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 89/169 (52%), Gaps = 18/169 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKIQ 120
+LVDCD D GC GGLM AF I+ GGL E +YPYK +N C+ NK ++I I+
Sbjct: 179 ELVDCDTNDGGCMGGLMDTAFNYTITI--GGLTSESNYPYKSTNGTCNFNKTKQIATSIK 236
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+ +V +++ + V + P+++ I QFY GV C +LDHGV
Sbjct: 237 GFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGV---FSGEC---TTHLDHGVT 290
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGE 227
VGYG K YWI+KNSWGP WGE+ + IK P+ G+
Sbjct: 291 AVGYGRSKNGL-----KYWILKNSWGPKWGER--GYMRIKKDIKPKHGQ 332
>gi|209962672|gb|ACJ02131.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 76/142 (53%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGG M NAFE I+ + G + EK YPY G AC + E+ I
Sbjct: 30 LVSCDSKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPACKPHGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|209962652|gb|ACJ02121.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D+GC GGLM NAFE I+ + G + EK YPY G C E+ I
Sbjct: 30 LVSCDTTDSGCSGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 68/164 (41%), Positives = 90/164 (54%), Gaps = 19/164 (11%)
Query: 52 IRGEGTHLA-LKLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ G T L+ +LVDCD + GC GGLM AF+ II+ GGL+ E DYPYK ++ +C
Sbjct: 172 VTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIINN--GGLDSEDDYPYKANDGSCD 229
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLC 166
++ V I Y +V ++ + K N P++VAI A+ A QFY GV F
Sbjct: 230 AYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGV-----FTS 284
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G LDHGV +VGYG + YWI+KNSWG WGEK
Sbjct: 285 TCGT-QLDHGVTLVGYG------SESGTDYWIVKNSWGKSWGEK 321
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 83/154 (53%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH---LNKEEIRVK 118
+LVDC ++GC GGLM AF+ II+ GG+ E +YPY C +N + RV
Sbjct: 184 QLVDCSTENSGCNGGLMDTAFQYIINN--GGIVTEDNYPYTAEATECSSTKINSQTTRVV 241
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I + +V ++ + K V + P++VAI A+ QFY GV F K G LDHG
Sbjct: 242 IDGFEDVPANNEQALKEAVAHQPVSVAIEASGQDFQFYSTGV-----FTGKCGT-ALDHG 295
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V+ VGYG YWI++NSWGP WGE+
Sbjct: 296 VVAVGYGTSPEGIN-----YWIVRNSWGPKWGEE 324
>gi|440907378|gb|ELR57532.1| Cathepsin W [Bos grunniens mutus]
Length = 382
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 90/169 (53%), Gaps = 10/169 (5%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DCD+ GC GG + +AF T+++ GL EKDYP+ GS + C K + I
Sbjct: 186 ELLDCDRCGNGCRGGFVWDAFLTVLNN--SGLASEKDYPFDGSGKTHRCLAKKYKKVAWI 243
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + + E MA++L GP+ V IN +Q Y GV C +DH VL+
Sbjct: 244 QDFIILQACEQSMARHLATEGPITVTINMTLLQQYQKGVIKATPTTCD--PTQVDHSVLL 301
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQ 228
VG+G KTK Q S+ ++M +W +KNSWGP+WGE+
Sbjct: 302 VGFG--KTKSGEGRQGKAASFGSYARP--RRSMAYWTLKNSWGPQWGEE 346
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 89/153 (58%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GG+M AF+ I K GG++ EK YPY+ + CH N + + +
Sbjct: 174 LVDCSGKYGNNGCNGGMMDYAFQYI--KDNGGIDTEKSYPYEAIDDTCHFNPKAVGATDK 231
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
YV++ DE + K L GP+++AI+A+ + QFY GV + + C +NLDHGV
Sbjct: 232 GYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQ--CDS--ENLDHGV 287
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L VGYG + + + YW++KNSWG WG++
Sbjct: 288 LAVGYGT-----SEEGEDYWLVKNSWGTTWGDQ 315
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 81/152 (53%), Gaps = 17/152 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH--LNKEEIRVKI 119
+L+DCD + GC GGLM NAFE I S GG+ E YPY+ SN C ++ V I
Sbjct: 183 ELIDCDTDENGCQGGLMENAFEFIKSY--GGVTTESAYPYRASNGTCDSVRSRRGQIVSI 240
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ V + + V N P++VAI+A A QFY GV F G D LDHGV
Sbjct: 241 DGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGV-----FTGDCGTD-LDHGV 294
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYGV + YWI+KNSWGP WGE
Sbjct: 295 AAVGYGV-----SDDGTAYWIVKNSWGPSWGE 321
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 89/169 (52%), Gaps = 18/169 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKIQ 120
+LVDCD D GC GGLM AF I+ GGL E +YPYK +N C+ NK ++I I+
Sbjct: 173 ELVDCDTNDGGCMGGLMDTAFNYTITI--GGLTSESNYPYKSTNGTCNFNKTKQIATSIK 230
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+ +V +++ + V + P+++ I QFY GV C +LDHGV
Sbjct: 231 GFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGV---FSGEC---TTHLDHGVT 284
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGE 227
VGYG K YWI+KNSWGP WGE+ + IK P+ G+
Sbjct: 285 AVGYGRSKNGL-----KYWILKNSWGPKWGER--GYMRIKKDIKPKHGQ 326
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 90/163 (55%), Gaps = 19/163 (11%)
Query: 52 IRGEGTHLA-LKLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ GE L+ +LVDCD+ V+AGC GGLM AFE II+ GG++ ++DYPY+G + C
Sbjct: 185 VTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINN--GGIDSDEDYPYRGVDGKCD 242
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLC 166
K+ RV I Y V + + K V N P++VAI A +F Y G+ F
Sbjct: 243 QYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGI-----FTG 297
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
K G LDHGV VGYG T YWI++NSWG WGE
Sbjct: 298 KCGT-ALDHGVTAVGYG------TENGVDYWIVRNSWGKSWGE 333
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 84/150 (56%), Gaps = 12/150 (8%)
Query: 63 LVDCDKVD--AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + AGC GGLM NAF + K GGL+ E+ YPY + C E+
Sbjct: 166 LVDCSRAEGNAGCNGGLMDNAFRYV--KDNGGLDSEESYPYLAQDGRCKYKPEQSAANDT 223
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+ ++ DE + + GP++VAI+A + +FY+ G+ + C ++LDHGVL
Sbjct: 224 GFADIHQDEESLMLSVATVGPISVAIDASLDTFRFYYKGIYYDPN--CSS--EDLDHGVL 279
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+VGYG + + +K YWI+KNSWG WG
Sbjct: 280 VVGYGSDEREAENK--NYWIVKNSWGTQWG 307
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 20/94 (21%)
Query: 123 VNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGM--------DNLD 174
VNV E + + GP++ AI A+ F F CK G+ ++LD
Sbjct: 398 VNVPQQEEAVMLAVAAGGPVSAAIRASLGSFQF----------CKEGIYYDPNCSSEDLD 447
Query: 175 HGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
HGVL+VGYG + + +K YWI+KNSWG WG
Sbjct: 448 HGVLVVGYGSDEREAENK--NYWIVKNSWGTDWG 479
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 89/161 (55%), Gaps = 19/161 (11%)
Query: 54 GEGTHLA-LKLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
GE L+ +LVDCD + + GC GGLM AF+ II GG++ EKDYPYKG + C +
Sbjct: 179 GEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQN--GGIDTEKDYPYKGFDGRCDNS 236
Query: 112 KEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKG 168
K+ V I Y +V ++ E K V P++VAI A Q Y GV F +
Sbjct: 237 KKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYAQGV-----FSGEC 291
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
G D LDHGVL VGYG T YWI+KNSWG +WGE
Sbjct: 292 GTD-LDHGVLAVGYG------TEDGVDYWIVKNSWGEYWGE 325
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 267 QLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECS-NSSELVVG 325
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 326 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGV----LTACIG--KQLNHG 379
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 380 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 407
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 87/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCDK + GC GGLM AFE II+ GG++ E+DYPYK S+ C N++ + V I
Sbjct: 189 ELVDCDKSYNQGCNGGLMDYAFEFIINN--GGIDTEEDYPYKASDNICDPNRKNAKVVTI 246
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ K V + P++VAI A A Q Y GV F + G + LDHGV
Sbjct: 247 DGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQLYKSGV-----FTGRCGTE-LDHGV 300
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG T YWI++NSWG WGE
Sbjct: 301 VAVGYG------TENGVNYWIVRNSWGSAWGE 326
>gi|209962674|gb|ACJ02132.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 76/142 (53%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGG M NAFE I+ + G + EK YPY G AC + E+ I
Sbjct: 30 LVSCDSKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPACKPHGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
Length = 356
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 83/150 (55%), Gaps = 16/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK--I 119
+L+DCD VD GC GGL+ AFE I+ GG++ E DYP+ G NR C L++ V +
Sbjct: 195 QLIDCDSVDMGCNGGLLHTAFEEIMRM--GGVQTELDYPFVGRNRRCGLDRHRPYVVSLV 252
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y V +E ++ L GP+ +AI+A + Y+ GV C+ + L+H VL+
Sbjct: 253 GCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVISS----CEN--NGLNHAVLL 306
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYGV PYW+ KN+WG WGE
Sbjct: 307 VGYGVENGV------PYWVFKNTWGDDWGE 330
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 84/154 (54%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVK 118
+LVDCD VD GC GGLM +AF+ II G L E YPY+G + C NK I V
Sbjct: 178 ELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG--LNTEAQYPYQGVDGTCSANKASIHAVT 235
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 236 ITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV YW++KNSWG WGE+
Sbjct: 290 VTAVGYGVGNDG-----TKYWLVKNSWGTDWGEE 318
>gi|209962632|gb|ACJ02111.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962634|gb|ACJ02112.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962654|gb|ACJ02122.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D+GC GGLM NAFE I+ + G + EK YPY G C E+ I
Sbjct: 30 LVSCDTTDSGCSGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 85/152 (55%), Gaps = 13/152 (8%)
Query: 63 LVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
L+DC ++ + GC GGLM AF+ + ++ GG++ E+ YPY+G+N C E
Sbjct: 177 LIDCSTEEGNNGCNGGLMDQAFQYV--RINGGIDTERSYPYEGNNDVCRYEPENSGAIDT 234
Query: 121 SYVNVS-SDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V DE + + GP++VAI+A+ + Q Y GV CK ++LDHGV
Sbjct: 235 GYTDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPN--CKNEPESLDHGV 292
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGYG + Q YW++KNSWG WGE
Sbjct: 293 LVVGYGTDE----ETQQDYWLVKNSWGDSWGE 320
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 83/154 (53%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEE---IRVK 118
+LVDC K +AGC GGLM NAF+ II GG+ E +YPY C K E I
Sbjct: 187 QLVDCSKENAGCNGGLMDNAFQYIIDN--GGIVTEDEYPYTAEAGECSTTKIESKSIATI 244
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I + +V ++ K V + P+++AI A+ QFY GV F K G + LDHG
Sbjct: 245 IDGFEDVPANNEGALKKAVAHQPVSIAIEASGHDFQFYSTGV-----FTGKCGTE-LDHG 298
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V++VGYG YWI++NSWGP WGE+
Sbjct: 299 VVVVGYGKSPEGIN-----YWIVRNSWGPEWGEQ 327
>gi|311698027|gb|ADQ00308.1| cathepsin L-like protein [Trypanosoma theileri]
gi|311698029|gb|ADQ00309.1| cathepsin L-like protein [Trypanosoma theileri]
Length = 159
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD VD GC GGLM +AF+ ++ G + E YPY G AC ++ E+ I
Sbjct: 30 LVSCDTVDEGCNGGLMDDAFKWLVDSNKGKVYTESSYPYVSGSGQTPACSTSEHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++AN+ Y GV L + L+HGVL+
Sbjct: 90 TGFVDLPKDEDKMAAWLATNGPIAIAVDANSFLSYVSGV------LTNCESNQLNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYDDSSNP------PYWIIKN 159
>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
Length = 359
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 86/149 (57%), Gaps = 16/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DC++V+AGC GG++S A + + S GL E +YPYK N C+ + + +
Sbjct: 164 QLLDCNRVNAGCDGGVLSYALQYVES---AGLTTEDEYPYKAWNGTCNSTHKPVAAYTKG 220
Query: 122 YVNV-SSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + + E+++ K V GP+AVA+NA+ +Q+Y G+ +P ++HG L+V
Sbjct: 221 YTLIYTRSESDLMK-AVAEGPVAVALNADLLQYYSKGIFNP-----SACSSTVNHGGLVV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GY + T PYWIIKNSWG WGE
Sbjct: 275 GYEENATL------PYWIIKNSWGATWGE 297
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 88/153 (57%), Gaps = 17/153 (11%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + GC GGLM NAF+ I K G++ E YPY ++ CH N+ ++
Sbjct: 171 LVDCSRSFGNNGCEGGLMDNAFKYI--KSNKGIDTEWSYPYNATDGVCHFNRSDVGATDT 228
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ DE ++ K + GP++VAI+A+ + QFY GV + C + LDHGV
Sbjct: 229 GFVDIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYSEGVYDEPE--CSS--EQLDHGV 284
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYG T Q YW++KNSWG WG++
Sbjct: 285 LVVGYG------TKDGQDYWLVKNSWGTTWGDE 311
>gi|351694995|gb|EHA97913.1| Cathepsin L1 [Heterocephalus glaber]
Length = 278
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 83/148 (56%), Gaps = 8/148 (5%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + GC GGLM AFE + K GLE EK YPY+G + +C E
Sbjct: 111 LVDCSQPQGNQGCNGGLMDFAFEYV--KENKGLESEKSYPYEGKDGSCRYKPELSAANDT 168
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+V++ E + K + + GP++VA++A M F F + F + +L+HGVL+V
Sbjct: 169 GFVDIPQREKALMKAVAEKGPISVAVDAGLMSFQF--YKDGIYFDPECSSKDLNHGVLVV 226
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
GYG + T K + YW++KNSWGP WG
Sbjct: 227 GYGYEEVD-TEKNE-YWLVKNSWGPEWG 252
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/173 (39%), Positives = 94/173 (54%), Gaps = 21/173 (12%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
A ++ I GT ++L +LVDCD +D GC GGLM +AFE II GL E +Y
Sbjct: 157 AAMEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIEN--NGLTTEANY 214
Query: 100 PYKGSNRACHLNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFG 156
PY+G + +C+ K KI Y NV + + E + V N P++VAI+A +A Q Y
Sbjct: 215 PYEGVDGSCNTRKAANHAAKITGYENVPAYDEEALRKAVANQPVSVAIDAGESAFQHYSS 274
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
G+ F G + LDHGV +VGYG + YW++KNSWG WGE
Sbjct: 275 GI-----FTGDCGTE-LDHGVTVVGYGT-----SDDGTKYWLVKNSWGTSWGE 316
>gi|209962656|gb|ACJ02123.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962658|gb|ACJ02124.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ + G + EK YPY G C E+ I
Sbjct: 30 LVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 81/151 (53%), Gaps = 18/151 (11%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC DAGC GG M AF+ II GG++ E YPYK + CH K + + Y
Sbjct: 170 LVDCSGRDAGCDGGFMDRAFQYIID--AGGIDTEASYPYKAVDGKCHFKKANVGATVTGY 227
Query: 123 VNVSS-DETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDN--LDHGV 177
+V+S E + K + GP++VAI+A+ M F Y GV + + G D+ LDHGV
Sbjct: 228 TDVTSGSEKALQKAVAHVGPISVAIDASHMSFQHYKSGVYN------EPGCDSTVLDHGV 281
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
L VGYG + YWI+KNSW WG
Sbjct: 282 LAVGYGT-----SSDGTDYWIVKNSWAETWG 307
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 81/152 (53%), Gaps = 17/152 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KIQ 120
+LVDCD VD GC GG M + FE II GG+ E +YPYKG + C+ V +I+
Sbjct: 178 ELVDCDSVDDGCEGGFMEDGFEFIIKN--GGITSETNYPYKGVDGTCNTTIAASPVAQIK 235
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y V S E K V N P++V+I+A FY G+ + + G D LDHGV
Sbjct: 236 GYEIVPSYSEEALKKAVANQPVSVSIHATNATFMFYSSGI-----YNGECGTD-LDHGVT 289
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG T YWI+KNSWG WGEK
Sbjct: 290 AVGYG------TENGTDYWIVKNSWGTQWGEK 315
>gi|5764411|gb|AAD51292.1|AF165115_1 evansain cysteine protease [Trypanosoma evansi]
Length = 151
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 71/123 (57%), Gaps = 9/123 (7%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 28 LVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 87
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +NGP+A+A++A + Y GG+ L + LDHGVL+
Sbjct: 88 TDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLL 141
Query: 180 VGY 182
VGY
Sbjct: 142 VGY 144
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 83/153 (54%), Gaps = 17/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKI 119
+LVDCD K +AGC GGLM +AFE I K GG+ E +YPY + C +K ++ V I
Sbjct: 179 ELVDCDTKKNAGCNGGLMESAFEFIKQK--GGITTESNYPYTAQDGTCDASKANDLAVSI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAM--QFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ NV +++ V N P++VAI+A QFYF GV L+HGV
Sbjct: 237 DGHENVPANDENALLKAVANQPVSVAIDAGGFDFQFYFEGV------FTGDCSTELNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
IVGYG T YW ++NSWGP WGE+
Sbjct: 291 AIVGYGT-----TVDGTNYWTVRNSWGPEWGEQ 318
>gi|209962630|gb|ACJ02110.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/142 (42%), Positives = 76/142 (53%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ + G + EK YPY GS C E+ I
Sbjct: 30 LVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVSEDGSKPFCIPYGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/152 (40%), Positives = 88/152 (57%), Gaps = 16/152 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
L+DC + GC GGLM NAF+ I K GG++ EK YPY+G + C N + +
Sbjct: 177 LIDCSSTYGNNGCNGGLMDNAFKYI--KDNGGIDTEKTYPYEGVDDKCRYNPKNSGAEDV 234
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ S DE ++ + + GP++VAI+A N+ QFY GGV + + C +LDHGV
Sbjct: 235 GFVDIPSGDEEKLMQAVATVGPVSVAIDASQNSFQFYSGGVYYDTE--CSS--TDLDHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGYG + YW++KNSW WGE
Sbjct: 291 LVVGYGTDEAG-----GDYWLVKNSWSRTWGE 317
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 84/154 (54%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVK 118
+LVDCD VD GC GGLM +AF+ II G L E YPY+G + C NK I V
Sbjct: 178 ELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG--LNTEAQYPYQGVDGTCSANKASIHAVT 235
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 236 ITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV YW++KNSWG WGE+
Sbjct: 290 VTAVGYGVGNDG-----TKYWLVKNSWGTDWGEE 318
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/170 (39%), Positives = 89/170 (52%), Gaps = 28/170 (16%)
Query: 63 LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLNKEEIRVKI 119
LVDC + GC GGLM AFE II+ G++ E YPY S+ C NK I
Sbjct: 318 LVDCSTSEGNMGCNGGLMDYAFEYIITN--NGIDTESSYPYTASSGTTCKYNKANSGATI 375
Query: 120 QSYVNVSS-DETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
SY N+++ E+++A + GP++VAI+A N+ Q Y SH + + NLDHG
Sbjct: 376 SSYKNITAGSESDLADAVKNAGPVSVAIDASHNSFQLY----SHGIYYDASCSSVNLDHG 431
Query: 177 VLIVGYG---------VHK-------TKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGYG VHK T + YWI+KNSWG WG+K
Sbjct: 432 VLVVGYGSGTPDSDSRVHKGSQVRVKVPKTDDTKNYWIVKNSWGTSWGDK 481
>gi|162044|gb|AAA30180.1| cysteine protease, partial [Trypanosoma cruzi]
Length = 165
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 81/144 (56%), Gaps = 16/144 (11%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + GG+ E YPY +G + C + + I
Sbjct: 35 LVSCDKTDSGCSGGLMNNAFEWIVQENNGGVYTEDSYPYASGEGISPPCTTSGHTVGATI 94
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA +A++ Y GGV + + LDHG+L+
Sbjct: 95 TGHVELPQDEAQIAAWLAVNGPVAVA-HASSWMTYTGGV------MTSCVSEQLDHGLLL 147
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSW 203
VGY PYWIIKNSW
Sbjct: 148 VGYN------DSAAVPYWIIKNSW 165
>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
Length = 324
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 89/150 (59%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI-Q 120
+L+DCD VD GC GGL+ A+E +++ GG++ E DYPY+ +N C +N + V++ +
Sbjct: 164 QLIDCDFVDVGCDGGLLHTAYEAVMNM--GGIQAENDYPYEANNGPCRVNAAKFVVRVKK 221
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V+ E ++ L GP+ VAI+A+ + Y G+ +++ G L+H VL+V
Sbjct: 222 CYRYVTLFEEKLKDLLRIVGPIPVAIDASDIVGYKRGI---IRYCENHG---LNHAVLLV 275
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV P+WI+KN+WG WGE+
Sbjct: 276 GYGVENGI------PFWILKNTWGADWGEQ 299
Score = 36.6 bits (83), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 10 HDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
+D L+ + F FL K NK+Y+++ E +R +IF+ NL++I
Sbjct: 19 YDLLKAPSYFEEFLHKFNKNYSSESEKLRRFKIFQHNLEEI 59
>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
Length = 467
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 76/150 (50%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD ++GCGGGL S AFE I+ + G + E YPY G C + + I
Sbjct: 175 LVSCDNTNSGCGGGLSSKAFEWIVQENNGAVYTEDSYPYHSCIGIKLPCKDSDRTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A GP++VA++A++ FY GGV L L H VL+
Sbjct: 235 TGHVELPQDEAQIAASGAVKGPLSVAVDASSWFFYTGGV------LTNCVSKRLSHAVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWIIKNSW HWGE
Sbjct: 289 VGYN------DSAAVPYWIIKNSWTTHWGE 312
>gi|375073960|gb|AFA34847.1| cathepsin L-like protein, partial [Trypanosoma cruzi marinkellei]
Length = 159
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM++AFE I+ + G + E+ YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCSGGLMNDAFEWIVQENNGAVYTEESYPYASGEGISPPCTTSGHTVGAMI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A + Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQVAAWLAANGPVAVAVDATSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAVPYWIIKN 159
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 84/152 (55%), Gaps = 17/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
+LVDCD K +AGC GGLM AF+ I GG+ E YPY+ +C + + V I
Sbjct: 191 QLVDCDTKANAGCNGGLMDYAFQYIAKH--GGVAAEDAYPYRARQASCKKSPAPV-VTID 247
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V +++ K V + P++VAI A+ QFY GV F + G + LDHGV
Sbjct: 248 GYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGV-----FSGRCGTE-LDHGVT 301
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYGV T YW++KNSWGP WGEK
Sbjct: 302 AVGYGV-----TADGTKYWLVKNSWGPEWGEK 328
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 88/153 (57%), Gaps = 20/153 (13%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGS-NRACHLNKEEIR-VK 118
+LVDCD + GCGGGLM AF+ IIS GG++ E+DYPY + + C+ +K+ R V
Sbjct: 174 ELVDCDTSYNNGCGGGLMDYAFQFIISN--GGIDTEEDYPYTATDDNICNTDKKNTRVVT 231
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V +E + K L N P++VAI A Q Y GV F G LDHG
Sbjct: 232 IDGYEDVPENENSLKKALA-NQPISVAIEAGGRGFQLYKSGV-----FTGTCGT-ALDHG 284
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V+ VGYG T + Q YWII+NSWG +WGE
Sbjct: 285 VVAVGYG------TSEGQDYWIIRNSWGSNWGE 311
>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
Length = 325
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 82/149 (55%), Gaps = 14/149 (9%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ- 120
+LVDCD + GC GGL+ A E II+ GGG+ E+DYPYKG ++ C+L V++
Sbjct: 164 QLVDCDTSNMGCAGGLLHTALEQIINA-GGGVLQEEDYPYKGVDKQCNLPHNNFAVQVLG 222
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + +E ++ L GP+ VAI+A ++ Y G+ + L+H VL+V
Sbjct: 223 CYRYIVMNEEKLKDVLRAVGPIPVAIDAASIVDYSRGIIRTCTYY------GLNHAVLLV 276
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW +KN+WG WGE
Sbjct: 277 GYGVQDGV------PYWTLKNTWGDDWGE 299
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 84/152 (55%), Gaps = 17/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
+LVDCD K +AGC GGLM AF+ I GG+ E YPY+ +C + + V I
Sbjct: 192 QLVDCDTKANAGCNGGLMDYAFQYIAKH--GGVAAEDAYPYRARQASCKKSPAPV-VTID 248
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V +++ K V + P++VAI A+ QFY GV F + G + LDHGV
Sbjct: 249 GYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGV-----FSGRCGTE-LDHGVA 302
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYGV T YW++KNSWGP WGEK
Sbjct: 303 AVGYGV-----TADGTKYWLVKNSWGPEWGEK 329
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 84/152 (55%), Gaps = 19/152 (12%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDCD D GC GGLM +AF+ II G L E +YPY+G++ AC N + KI
Sbjct: 141 ELVDCDTSGEDQGCNGGLMDDAFDFIIQNKG--LTTEANYPYQGADGAC--NSGKAAAKI 196
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ V N P++VAI+A +A QFY GV F G D LDHGV
Sbjct: 197 TGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYSSGV-----FTGDCGTD-LDHGV 250
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG+ + YW++KNSWG WGE
Sbjct: 251 TAVGYGM-----SDDGTKYWLVKNSWGTSWGE 277
>gi|394331820|gb|AFN27129.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 85/152 (55%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-V 117
+LV CD D GC GGLM AFE ++ + G + E YPY S C + + +
Sbjct: 177 QLVSCDDKDNGCRGGLMLQAFEWLLRNMNGTMFTEDSYPYVSSTGYVPECSNSSQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I Y+ + S ET MA +L KNGP+++A++A++ Y GV GM L+HGV
Sbjct: 237 RIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQSGV-----LTSCAGMP-LNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+V Y T ++ PYW+IKNSWG +WGE
Sbjct: 291 LLVWY-----NRTGEV-PYWVIKNSWGENWGE 316
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 87/153 (56%), Gaps = 18/153 (11%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC + GC GG M NAF+ + K GG+E E DYPYK R C +K ++ +
Sbjct: 217 QLVDCSGSFGNEGCNGGFMENAFKYV--KSVGGIESESDYPYKARQRTCAFDKTKVIATV 274
Query: 120 QSYVNVSS-DETEMAKYLVKNGPMAVAINA--NAMQFYFGGV-SHPLKFLCKGGMDNLDH 175
V+V S E+ + + + + GP++VAI+A ++ Q Y GGV P LC L+H
Sbjct: 275 SGCVDVESGSESSLKEVVSEVGPVSVAIDAGHSSFQLYAGGVYDEP---LCS--TSRLNH 329
Query: 176 GVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
GVL VGYG + + + YWI+KNSWG WG
Sbjct: 330 GVLCVGYGT-----SLQGKDYWIVKNSWGVRWG 357
>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 503
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 237 QLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSYPYVSGNGYVPECS-NSSELVVG 295
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 296 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGV----LTACIG--KQLNHG 349
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 350 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 377
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/163 (36%), Positives = 88/163 (53%), Gaps = 18/163 (11%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC K + GC GGLM +AF+ II+ G ++ E YPY + C N + +
Sbjct: 170 LVDCSKAQGNQGCNGGLMDDAFQYIITNKG--IDTEASYPYTAKDGTCKFNAANVGATLS 227
Query: 121 SYVNVS-SDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
S+ +++ E+++ + GP++VAI+A N+ Q Y GV + K C +LDHGV
Sbjct: 228 SFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTSGVYNEKK--CSS--TSLDHGV 283
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNS 220
L GYG T PYW++KNSWG WG+ W+ +N+
Sbjct: 284 LAAGYG------TSNGTPYWLVKNSWGSSWGQAGY-IWMSRNA 319
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 177 QLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECS-NSSELVVG 235
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 236 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVLTA----CIG--KQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 290 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 317
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 85/152 (55%), Gaps = 17/152 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKI 119
+LVDCDK + GC GGLM +AFE I K GG+ E +YPYK C +K ++ V I
Sbjct: 179 ELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYKAQEGTCDASKVNDLAVSI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ NV +++ + V N P++VAI+A + QFY GV F D L+HGV
Sbjct: 237 DGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGDCSTD-LNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
IVGYG T YWI++NSWGP WGE
Sbjct: 291 AIVGYGT-----TVDGTNYWIVRNSWGPEWGE 317
>gi|375073958|gb|AFA34846.1| cathepsin L-like protein, partial [Trypanosoma cruzi marinkellei]
Length = 159
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 79/142 (55%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM++AFE I+ + G + E+ YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCSGGLMNDAFEWIVQENDGAVYTEESYPYASGEGISPPCTTSGHTVGAMI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A + Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAANGPVAVAVDATSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAPVPYWIIKN 159
>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D GC GGLM + + I+S G + + YPY G C+ + + + KI
Sbjct: 178 LVSCDTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
++N+ DE +A++L KNGP+A+A++A + Y GGV + KG LDH VL+
Sbjct: 238 SGHINLPKDENAIAEWLAKNGPVAIAVDATSFLGYKGGVLTSC--ISKG----LDHDVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGE+
Sbjct: 292 VGY-----NDTSK-PPYWIIKNSWSKGWGEE 316
>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
Length = 450
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/172 (39%), Positives = 90/172 (52%), Gaps = 25/172 (14%)
Query: 41 RIFRANLKKIQIRGEGTHLALKLVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKD 98
++FR K I + + LVDC + GC GGLM NAF+ I K GGL+ E+
Sbjct: 274 QMFRKTGKLISLSEQ------NLVDCSRRQGNLGCQGGLMDNAFQYI--KDNGGLDSEES 325
Query: 99 YPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
YPYKG + C E + N + E + K + GP++VAI+A + QFY
Sbjct: 326 YPYKGMDGTCQYKAE------WAVANDTGFEKALMKAVASVGPISVAIDAGHASFQFYKD 379
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
G+ + C +NLDHGVL+VGYGV K K YW+IKNSWG WG
Sbjct: 380 GIYYEPD--CSS--ENLDHGVLVVGYGVEKRNSNDK---YWLIKNSWGEQWG 424
>gi|444724527|gb|ELW65130.1| Cathepsin W [Tupaia chinensis]
Length = 491
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/160 (36%), Positives = 88/160 (55%), Gaps = 20/160 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK--- 118
+L+DC + GC GG + +AF T+++ GL EKDYPY+ +++ + RVK
Sbjct: 305 ELLDCGRCGDGCKGGWVWDAFITVLNN--SGLASEKDYPYQS-----NVDPQRCRVKRNK 357
Query: 119 ---IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDH 175
IQ ++ + +E +A+YL +GP+ V IN ++ Y GV C + +DH
Sbjct: 358 VAWIQDFIMLQDNEQIIAQYLASHGPITVTINMKPLKQYRKGVFEATPATCDPWL--VDH 415
Query: 176 GVLIVGYGVHKT-----KFTHKIQPYWIIKNSWGPHWGEK 210
VL+VG+G K+ T +PYWI+KNSWG WGEK
Sbjct: 416 SVLLVGFGSSKSVKGMRAGTASSKPYWILKNSWGAKWGEK 455
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 88/154 (57%), Gaps = 20/154 (12%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+L+DCD + GC GGLM AFE I+ GGL E+DYPY C + K+E V I
Sbjct: 189 ELIDCDTTYNNGCNGGLMDYAFEYIVKN--GGLRKEEDYPYSMEEGTCEMQKDESETVTI 246
Query: 120 QSYVNV-SSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
+ +V ++DE + K L P++VAI+A+ QFY GGV F + G+D LDHG
Sbjct: 247 NGHQDVPTNDEKSLLKALAHQ-PLSVAIDASGREFQFYSGGV-----FDGRCGVD-LDHG 299
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYG + K Y I+KNSWGP WGEK
Sbjct: 300 VAAVGYG------SSKGSDYIIVKNSWGPKWGEK 327
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 177 QLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECS-NSSELVVG 235
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 236 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVLTA----CIG--KQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 290 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 317
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 85/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GGLM AFE II GG++ E+DYPYKG + C ++ +V I
Sbjct: 177 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V ++ E K + + P++VAI A Q Y G+ +C G D LDHGV
Sbjct: 235 DSYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGI---FDGIC--GTD-LDHGV 288
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG K YWI+KNSWG WGE
Sbjct: 289 VAVGYGTENGK------DYWIVKNSWGTSWGE 314
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 85/152 (55%), Gaps = 17/152 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKI 119
+LVDCDK + GC GGLM +AFE I K GG+ E +YPYK C +K ++ V I
Sbjct: 178 ELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYKAQEGTCDASKVNDLAVSI 235
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ NV +++ + V N P++VAI+A + QFY GV F D L+HGV
Sbjct: 236 DGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGDCSTD-LNHGV 289
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
IVGYG T YWI++NSWGP WGE
Sbjct: 290 AIVGYGT-----TVDGTNYWIVRNSWGPEWGE 316
>gi|9634237|ref|NP_037776.1| ORF16 cathepsin [Spodoptera exigua MNPV]
gi|37077857|sp|Q9J8B9.1|CATV_NPVSE RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|6960476|gb|AAF33546.1|AF169823_16 ORF16 cathepsin [Spodoptera exigua MNPV]
Length = 337
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 83/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL+ A+E I+ GG+E E DY YK + C L + +++
Sbjct: 177 QLVDCDFVDMGCDGGLIHTAYEQIMKM--GGVEQEFDYSYKAERQPCALKPHKFATGVRN 234
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V +E + L GP+A+A++A + Y+GG+ + F G L+H VL+V
Sbjct: 235 CYRYVILNEERLEDLLRYVGPIAIAVDAVDLTDYYGGI---VSFCENNG---LNHAVLLV 288
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYWIIKNSWG +GE
Sbjct: 289 GYGVENNV------PYWIIKNSWGSDYGE 311
>gi|426252044|ref|XP_004019728.1| PREDICTED: cathepsin W [Ovis aries]
Length = 375
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 85/165 (51%), Gaps = 20/165 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA--CHLNKEEIRVKI 119
+L+DCD+ GC GG + +AF T++ GL E DYP+ GS + C K + I
Sbjct: 179 ELLDCDRCGNGCKGGFVWDAFLTVLKNR--GLASETDYPFDGSGKTHRCLAEKHKKVAWI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + + E +A++L GP+ V IN +Q Y GV C ++DH VL+
Sbjct: 237 QDFIMLQACEQSIARHLATQGPITVTINVKLLQQYQKGVIKATPTTCD--PRHVDHSVLL 294
Query: 180 VGYGVHKT---------KFTHKIQP-----YWIIKNSWGPHWGEK 210
VG+G K+ F +P YW +KNSWGPHWGE+
Sbjct: 295 VGFGKTKSVEGRQGKAASFRSYTRPRRSMAYWTLKNSWGPHWGEE 339
>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D GC GGLM + + I+S G + + YPY G C+ + + + KI
Sbjct: 178 LVSCDTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGKVVGAKI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
++N+ DE +A++L KNGP+A+A++A + Y GGV + KG LDH VL+
Sbjct: 238 SGHINLPKDENAIAEWLAKNGPVAIAVDATSFLGYKGGVLTSC--ISKG----LDHDVLL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGE+
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWSKGWGEE 316
>gi|262093294|gb|ACY25971.1| cathepsin L-like protein [Trypanosoma cruzi]
gi|375073892|gb|AFA34813.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073894|gb|AFA34814.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073896|gb|AFA34815.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073898|gb|AFA34816.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073900|gb|AFA34817.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073902|gb|AFA34818.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073908|gb|AFA34821.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|389566214|gb|AFK83570.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|400234875|gb|AFP74101.1| cathepsin L- like protein, partial [Trypanosoma cruzi]
Length = 159
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 30 LVSCDKTDFGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AVA++A++ Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAVPYWIIKN 159
>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
Length = 442
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D GC GGLM + + I+S G + + YPY G C+ + + + KI
Sbjct: 173 LVSCDTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGKVVGAKI 232
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
++N+ DE +A++L KNGP+A+A++A + Y GGV + KG LDH VL+
Sbjct: 233 SGHINLPKDENAIAEWLAKNGPVAIAVDATSFLGYKGGVLTSC--ISKG----LDHDVLL 286
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGE+
Sbjct: 287 VGY-----DDTSK-PPYWIIKNSWSKGWGEE 311
>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 330
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 90/153 (58%), Gaps = 19/153 (12%)
Query: 62 KLVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVK 118
+L+DC D + GC GGLM A + +I++ GGL+ E+ YPY S+ C N I K
Sbjct: 165 QLMDCSRDYGNEGCNGGLMDAAMKYVIAQ--GGLDTEESYPYTMSDSYTCKFNPANIGAK 222
Query: 119 IQSYVNVS-SDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDH 175
I SY++V ET++A L K GP++VAI+A+ + Q Y GV + + C NLDH
Sbjct: 223 ISSYIDVQRGSETDLAAKLNK-GPVSVAIDASHSSFQLYKSGVYY--EPACSS--YNLDH 277
Query: 176 GVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
GVL VGYG T YWI+KNSWGP+WG
Sbjct: 278 GVLAVGYG------TEGSSNYWIVKNSWGPNWG 304
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 84/152 (55%), Gaps = 17/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
+LVDCD K +AGC GGLM AF+ I GG+ E YPY+ +C + + V I
Sbjct: 94 QLVDCDTKANAGCNGGLMDYAFQYIAKH--GGVAAEDAYPYRARQASCKKSPAPV-VTID 150
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V +++ K V + P++VAI A+ QFY GV F + G + LDHGV
Sbjct: 151 GYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGV-----FSGRCGTE-LDHGVA 204
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYGV T YW++KNSWGP WGEK
Sbjct: 205 AVGYGV-----TADGTKYWLVKNSWGPEWGEK 231
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 83/152 (54%), Gaps = 16/152 (10%)
Query: 62 KLVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC D + GCGGG M++AF+ I K GG++ E YPY+ +R+C + I
Sbjct: 156 QLVDCSTDYGNDGCGGGWMTSAFDYI--KDNGGIDTESSYPYEAEDRSCRFDANSIGAIC 213
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
V V E + + + GP++VAI+A+ + QFY GV + C LDHGV
Sbjct: 214 TGSVEVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQN--CSPTF--LDHGV 269
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L VGYG TK YW++KNSWG WG+
Sbjct: 270 LAVGYGTESTK------DYWLVKNSWGSSWGD 295
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 85/154 (55%), Gaps = 18/154 (11%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM NAF+ I K+ GG++ EK YPY+ + C N +
Sbjct: 177 LVDCSSKFGNNGCNGGLMDNAFQYI--KVNGGIDTEKSYPYEAEDEPCRYNPANAGADDR 234
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGV-SHPLKFLCKGGMDNLDHG 176
+V+V +E + K + GP++VAI+A ++ QFY GV S P +NLDHG
Sbjct: 235 GFVDVREGNENALKKAIATIGPVSVAIDASQDSFQFYQHGVYSDP-----DCSAENLDHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL VGYG T Q YW++KNSW WG++
Sbjct: 290 VLAVGYGT-----TEDGQDYWLVKNSWSKSWGDQ 318
>gi|281206749|gb|EFA80934.1| counting factor associated protein [Polysphondylium pallidum PN500]
Length = 530
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 98/174 (56%), Gaps = 19/174 (10%)
Query: 46 NLKKIQIRGEGTHLAL---KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYP 100
+L+ + G ++L +LVDC + GC GG S+AF+ I++ GG+ E YP
Sbjct: 343 SLEGVSCLATGKLVSLSEQQLVDCAYLGQSQGCNGGFASDAFQYIMNF--GGIAYESTYP 400
Query: 101 YKGSNRACHLNKEEI-RVKIQSYVNVSS-DETEMAKYLVKNGPMAVAINANA--MQFYFG 156
Y N C + ++ +K++SYVNV+S E + + GP+A+AI+A+A +FY
Sbjct: 401 YLMQNGYCKDSSSQLSNIKVKSYVNVTSFSEPALQNAVATVGPVAIAIDASAPDFRFYSS 460
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GV + +CK G+D+LDH VL VGYG T YWI+KNSW H+G +
Sbjct: 461 GVYY--SSVCKNGLDDLDHEVLAVGYG------TLNGADYWIVKNSWSTHYGAE 506
>gi|394331818|gb|AFN27128.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/151 (38%), Positives = 86/151 (56%), Gaps = 16/151 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIR-VK 118
LV C ++GC GGLM AFE ++ + G + E YPY S+ C + + + +
Sbjct: 178 LVSCHDKNSGCTGGLMLQAFEWLLRNMNGTMFTEDSYPYVSSSGYVPECSNSSQLVPGAR 237
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
I Y+ + S ET MA +L KNGP+++A++A++ Y GV L +L+HGVL
Sbjct: 238 IDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQSGV------LTSCAGISLNHGVL 291
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+VGY T ++ PYW+IKNSWG +WGE
Sbjct: 292 LVGY-----NRTGEV-PYWVIKNSWGENWGE 316
>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
Length = 321
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 83/152 (54%), Gaps = 16/152 (10%)
Query: 62 KLVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC D + GCGGG M++AF+ I K GG++ E YPY+ +R+C + I
Sbjct: 157 QLVDCSTDYGNDGCGGGWMTSAFDYI--KDNGGIDTESSYPYEAEDRSCRFDANSIGAIC 214
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
V V E + + + GP++VAI+A+ + QFY GV + C LDHGV
Sbjct: 215 TGSVEVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQN--CSPTF--LDHGV 270
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L VGYG TK YW++KNSWG WG+
Sbjct: 271 LAVGYGTESTK------DYWLVKNSWGSSWGD 296
>gi|402502150|ref|YP_006607808.1| cathepsin [Apocheima cinerarium nucleopolyhedrovirus]
gi|284431240|gb|ADB84400.1| cathepsin [Apocheima cinerarium nucleopolyhedrovirus]
Length = 160
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 83/150 (55%), Gaps = 17/150 (11%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ-S 121
++DCD VD GC GGL+ AFE +I GG++ E +YPY G N C L + +K++
Sbjct: 1 MIDCDYVDMGCDGGLLHTAFEQMIQM--GGVKSEIEYPYVGYNDNCRLTDDNFAIKVKGC 58
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAM-QFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + + E ++ L GP+ +AI+A+ + +Y G V+H + L+H VL+V
Sbjct: 59 YRYIVTREEKLKDLLRAVGPIPIAIDASGIVNYYRGIVNHCENY-------GLNHAVLLV 111
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYG+ PYW IKN+WG WGE
Sbjct: 112 GYGIENN------VPYWTIKNTWGKDWGEN 135
>gi|228861649|ref|YP_002854669.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
gi|226425097|gb|ACO53509.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
Length = 334
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 89/158 (56%), Gaps = 19/158 (12%)
Query: 57 THLAL---KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE 113
HL L +++DCD VD GC GGL+ AFE +I GG+E E+ YPY+G N C L +
Sbjct: 165 VHLDLSEQQMIDCDYVDMGCYGGLLHTAFEQMIQM--GGVEEERQYPYEGVNNNCRLKSD 222
Query: 114 E-IRVKIQ-SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMD 171
E VK++ Y + E ++ L GP+ +AI+A+++ Y+ GV + C G +
Sbjct: 223 ERFVVKVKGCYRYLVMREEKLKDLLRAVGPLPMAIDASSIFNYYRGVIN----YC--GNN 276
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+H VL+VGYGV P+W KN+WG WGE
Sbjct: 277 GLNHAVLLVGYGVENGV------PFWTFKNTWGDDWGE 308
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 93/172 (54%), Gaps = 18/172 (10%)
Query: 41 RIFRANLKKIQIRGEGTHLALKLVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKD 98
++FR K + + + LVDC + + GC GGLM NAF+ ++ GGL+ E+
Sbjct: 150 QMFRKTGKLVSLSEQ------NLVDCSQPEGNRGCHGGLMDNAFQYVLDV--GGLDSEES 201
Query: 99 YPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
YPY G C+ N + +V++ E + K + GP++VA++A+ + QFY
Sbjct: 202 YPYTGLVGTCNYNPKNSAANETGFVDLPKQENALMKAVATLGPISVAVDASNPSFQFYKS 261
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
G+ + K CK +++DHGVL+VGYG YW++KNSWG HWG
Sbjct: 262 GIYYEPK--CKS--ESVDHGVLVVGYGFEGAD--SDDNKYWLVKNSWGKHWG 307
>gi|123391522|ref|XP_001300085.1| Clan CA, family C1, cathepsin L or K-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121881065|gb|EAX87155.1| Clan CA, family C1, cathepsin L or K-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 285
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 82/151 (54%), Gaps = 13/151 (8%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC GC GGLM+ A++ +I G E DYPY + +C + ++ + SY
Sbjct: 120 LVDCVTECYGCNGGLMTAAYDYVIRNQKGKFMLEDDYPYTARDGSCKFDSKKGTSNVASY 179
Query: 123 VNVSS-DETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
V V+ DE ++AK + GP A+AI+A+A Q Y G+ + C NLDHGV
Sbjct: 180 VTVNEGDEKDLAKKVSTLGPAAIAIDASAWSFQLYSSGIYD--ESACSS--VNLDHGVGC 235
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG +K YWI++NSWG WGEK
Sbjct: 236 VGYGTQGSK------NYWIVRNSWGESWGEK 260
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 92/172 (53%), Gaps = 18/172 (10%)
Query: 41 RIFRANLKKIQIRGEGTHLALKLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKD 98
++FR K I + + LVDC + + GC GGLM AF+ I K GGL+ E+
Sbjct: 150 QMFRKTGKLISLSEQ------NLVDCSRPQGNEGCDGGLMDYAFQYI--KENGGLDSEES 201
Query: 99 YPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFG 156
YPY + +C E +V++ +E + K + GP++VAI+A + QFY
Sbjct: 202 YPYDAMDESCKYRPEYSVANDTGFVDIPKEEKALMKAVATVGPISVAIDAGHESFQFYKE 261
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
GV F + DN+DHGVL+VGYG +T+ + +W++KNSWG WG
Sbjct: 262 GVY----FEPECSSDNVDHGVLVVGYGYEETESDN--NKFWLVKNSWGEEWG 307
>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
Length = 337
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 83/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD + C GGLM AFE +++ GGL E DYPY+G+ C ++ ++ + + S
Sbjct: 177 QLIDCDSANMACDGGLMHTAFEQLMN--AGGLMEEIDYPYQGTKGICKIDNKKFALSVSS 234
Query: 122 YVN-VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+ +E + K L+ GP+A+AI+A ++ Y G+ H + L L+H VL+V
Sbjct: 235 CKRYIFQNEENLKKELITTGPIAMAIDAASISTYSKGIIHFCENL------GLNHAVLLV 288
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYG T YW +KNSWG WGE
Sbjct: 289 GYG------TEGGVSYWTLKNSWGSDWGE 311
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 82/152 (53%), Gaps = 18/152 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK--EEIRVKI 119
+++DCD D+GC GG M NAF+ +I GG++ E DYP+ ++ C NK +E I
Sbjct: 214 EIIDCDTQDSGCNGGQMENAFQFVIDN--GGIDSEADYPFIATDGTCDANKANDEKVAAI 271
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V V+S+ + V P++VAI+A A Q Y G+ F G NLDHGV
Sbjct: 272 DGFVEVASNNETALQEAVAIQPVSVAIDAGGRAFQHYSSGI-----FNGPCGT-NLDHGV 325
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+VGYG K YWI+KNSW WGE
Sbjct: 326 TVVGYGSENGK------AYWIVKNSWSDSWGE 351
>gi|14041143|emb|CAA71554.1| cathepsin [Geodia cydonium]
Length = 322
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 81/151 (53%), Gaps = 17/151 (11%)
Query: 63 LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGL +AF+ +I GG++ E YPY + CH + I
Sbjct: 155 LVDCSSAEGNEGCNGGLPDDAFKYVIKN--GGIDTEASYPYVARDEKCHYSSANIGSTCS 212
Query: 121 SYVNVSS-DETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SYV++ S E ++ GP+ V I+A+ Q Y GGV H LC LDHGV
Sbjct: 213 SYVDIESKSEAQLQVASATVGPIPVGIDASHLGFQLYDGGVYH--SDLCS--QTRLDHGV 268
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
L+VGYGV+K K YW++KNSWG +WG
Sbjct: 269 LVVGYGVYKEK------DYWMVKNSWGTNWG 293
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 87/153 (56%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM NAF+ + K G++ EK YPY+ + CH N + I +
Sbjct: 179 LVDCSTKYGNNGCNGGLMDNAFQYV--KDNKGIDTEKAYPYEAIDDECHYNPKAIGATDK 236
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ DE + K L GP++VAI+A+ + QFY GV + + C + LDHGV
Sbjct: 237 GFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSEGVYYEPQ--CDS--EQLDHGV 292
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L VGYG T + YW++KNSWG WG++
Sbjct: 293 LAVGYGT-----TEDGEDYWLVKNSWGTTWGDQ 320
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 81/152 (53%), Gaps = 17/152 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KIQ 120
+LVDCD VD GC GG M + FE II GG+ E +YPYKG + C+ V +I+
Sbjct: 178 ELVDCDSVDDGCEGGFMEDGFEFIIKN--GGITSETNYPYKGVDGTCNTTIAASPVAQIK 235
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y V S E + V N P++V+I+A FY G+ + + G D LDHGV
Sbjct: 236 GYEIVPSYSEEALQKAVANQPVSVSIHATNATFMFYSSGI-----YNGECGTD-LDHGVT 289
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG T YWI+KNSWG WGEK
Sbjct: 290 AVGYG------TENGTDYWIVKNSWGTQWGEK 315
>gi|394331824|gb|AFN27131.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 84/152 (55%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV---- 117
+LV CD D+GC LM AFE ++ + G + E YPY S I++
Sbjct: 177 QLVSCDDKDSGCRARLMLQAFEWLLRNMNGTMFTEDSYPYVSSTGYVPECSNSIQLVPGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I Y+ + S ET MA +L KNGP+++A++A++ Y GV GM L+HGV
Sbjct: 237 RIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQRGVVTSC-----AGMP-LNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGY T ++ PYW+IKNSWG +WGE
Sbjct: 291 LLVGY-----NRTGEV-PYWVIKNSWGENWGE 316
>gi|375073864|gb|AFA34799.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073866|gb|AFA34800.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073868|gb|AFA34801.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073870|gb|AFA34802.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073872|gb|AFA34803.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073874|gb|AFA34804.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073876|gb|AFA34805.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
gi|375073878|gb|AFA34806.1| cathepsin L-like protein, partial [Trypanosoma cruzi]
Length = 159
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CDK D+GC GGLM+NAFE I+ + G + E YPY +G + C + + I
Sbjct: 30 LVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE ++A +L NGP+AV ++A++ Y GGV + + LDHGVL+
Sbjct: 90 TGHVELPQDEAQIAAWLAVNGPVAVGVDASSWMTYTGGV------MTSCVSEQLDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAVPYWIIKN 159
>gi|401430127|ref|XP_003886478.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491231|emb|CBZ41048.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 375
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 109 QLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECS-NSSELVVG 167
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G L+HG
Sbjct: 168 AQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGV----LTACIG--KQLNHG 221
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 222 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 249
>gi|337255596|gb|AEI61876.1| cathepsin K [Gadus morhua]
Length = 331
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 83/149 (55%), Gaps = 10/149 (6%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC ++GCGGG M+NAF ++ GG++ ++ YPY G ++ C N + + + Y
Sbjct: 168 LVDCVTENSGCGGGYMTNAFSYVMQN--GGIDSDESYPYVGQDQQCGFNVSGVAAECKGY 225
Query: 123 VNVS-SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
+ DE +A L K GP++V I+A F F H + + ++++H VL VG
Sbjct: 226 KQIPVGDERALAVALFKAGPVSVGIDAGLGTFQF--YQHGVYYDRNCNAEDINHAVLAVG 283
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+GV T K + YWIIKNSWG WG K
Sbjct: 284 FGV-----TAKGKKYWIIKNSWGEDWGHK 307
>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 365
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 86/152 (56%), Gaps = 16/152 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC ++GC GGLM NAFE + K GG++ E+ YPY ++ C + I
Sbjct: 200 LVDCSTAQGNSGCQGGLMDNAFEYV--KENGGIDTEESYPYIAADDTCQYKPQYSGANIT 257
Query: 121 SYVNVSSD-ETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
YV++ S E + K + GP++VAI+A ++ QFY GV + + C ++LDHGV
Sbjct: 258 GYVDIPSRMEKALEKAVATVGPISVAIDAGHSSFQFYRSGVYYEPE--CSS--EDLDHGV 313
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L VGYGV K YWI+KNSWG WG+
Sbjct: 314 LAVGYGVQG-----KNGKYWIVKNSWGEEWGD 340
>gi|401416326|ref|XP_003872658.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|14348750|emb|CAC41275.1| CPB2 protein [Leishmania mexicana]
gi|322488882|emb|CBZ24132.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 359
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN---RACHLNKEEIRV- 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N C N E+ V
Sbjct: 177 QLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSYPYVSGNGYLPECS-NSSELVVG 235
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I S+V + S E MA +L KNGP+A+A++A++ Y GV C G ++H
Sbjct: 236 AQIDSHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVLTA----CIG--KEVNHA 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 290 VLLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 317
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 82/154 (53%), Gaps = 20/154 (12%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC DAGC GGLM AFE II+ G + E YPYKG C + ++ V I
Sbjct: 177 QLVDCSTSYGDAGCNGGLMDYAFEYIIANKG--ICAESAYPYKGVGGLCQKSCTKV-VTI 233
Query: 120 QSYVNVSS-DETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHG 176
Y +V+S DE + + GP++VAI A+ QFY GV C NLDHG
Sbjct: 234 SGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV---FSGTCG---HNLDHG 287
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL VGYG T Q YWI+KNSWG WGE
Sbjct: 288 VLAVGYG------TTGSQDYWIVKNSWGTSWGES 315
>gi|407844617|gb|EKG02042.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
C1, cathepsin L-like, putative [Trypanosoma cruzi]
Length = 294
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 75/150 (50%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD ++GCGGG AF+ I+ + G + E YPY G C + + I
Sbjct: 2 LVSCDNTNSGCGGGSPFRAFKWIVDRNNGAVYTEDSYPYHSCIGIKLPCKDSDRTVGATI 61
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV + SDE +A L GP++VA++A++ Y GGV L LDH VL+
Sbjct: 62 SGYVTIPSDEKRIAAVLAVKGPLSVAVDASSWMPYTGGV------LTNCVSKKLDHAVLL 115
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY PYWIIKNSW HWGE
Sbjct: 116 VGYN------DSAAVPYWIIKNSWTTHWGE 139
>gi|538255|gb|AAA21470.1| cysteine proteinase, partial [Trypanosoma rangeli]
Length = 166
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/144 (38%), Positives = 79/144 (54%), Gaps = 15/144 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GC GGLM +AF+ I+ + G + E Y Y G ++ C+++ + I
Sbjct: 35 LVSCDNADNGCDGGLMDDAFDWIVGQNNGSVYTEASYSYVSGGGDSQTCNMSDHVVGGVI 94
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++A + Y GGV L D LDHGV++
Sbjct: 95 SGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTGGV------LTNCISDQLDHGVVL 148
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSW 203
VGY PYWI+KNSW
Sbjct: 149 VGYNDSSNP------PYWIVKNSW 166
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 95/173 (54%), Gaps = 20/173 (11%)
Query: 45 ANLKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
+L+ R G ++L LVDC + GC GGLM NAF+ I K GG++ E Y
Sbjct: 173 GSLEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYI--KANGGIDTELSY 230
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
PY G++ CH K ++ +V++ +E + K + GP++VAI+A+ + QFY
Sbjct: 231 PYNGTDGICHFEKSDVGATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQ 290
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GV + C ++LDHGVL+VGYG T Q YW++KNSWG WG+
Sbjct: 291 GVYDEPE--CSS--ESLDHGVLVVGYG------TKDGQDYWLVKNSWGTTWGD 333
>gi|209962640|gb|ACJ02115.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D+GC GGLM NAFE I+ + G + EK YPY G C ++ I
Sbjct: 30 LVSCDTTDSGCSGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHKVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 86/152 (56%), Gaps = 18/152 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD+ + GC GGLM AFE II+ GG++ + DYPY G + C ++ +V I
Sbjct: 184 ELVDCDRSYNEGCNGGLMDYAFEFIINN--GGIDTDVDYPYTGRDGKCDQYRKNAKVVTI 241
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V + + K N P++VAI A+ QFY G+ F K G+ LDHGV
Sbjct: 242 DSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGI-----FTGKCGI-ALDHGV 295
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
++VGYG K YWI++NSWG WGE
Sbjct: 296 VVVGYGTENGK------DYWIVRNSWGADWGE 321
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 82/152 (53%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD + GC GGLM AFE II+ GG++ E+DYPYK + C NK+ + V I
Sbjct: 183 ELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDSEEDYPYKERDNRCDANKKNAKVVTI 240
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V + + + V N P++VAI A A Q Y G+ F G LDHGV
Sbjct: 241 DGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKSGI-----FTGTCGT-ALDHGV 294
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YW+++NSWG WGE
Sbjct: 295 AAVGYGTENGK------DYWLVRNSWGSVWGE 320
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 84/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDK + GC GGLM FE II+ GG++ +KDYPY G + C ++ +V I
Sbjct: 187 ELVDCDKSYNEGCDGGLMDYGFEFIINN--GGIDTDKDYPYLGRDARCDQYRKNAKVVTI 244
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V + E K V + P++V I A QFY G+ F K G LDHGV
Sbjct: 245 DSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGI-----FTGKCGT-ALDHGV 298
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+VGYG K K YWI++NSWG WGE
Sbjct: 299 NVVGYGTEKGK------DYWIVRNSWGSSWGE 324
>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
Length = 333
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 80/150 (53%), Gaps = 12/150 (8%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + GC GG M+ AF + K GGL+ E YPY+ + C E
Sbjct: 166 LVDCSRPQGNQGCNGGFMNYAFRYV--KENGGLDSEASYPYEAKDGICKYKPENSVANDT 223
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+V + + E E+ K + GP++VA++A ++ QFY G+ F K NLDHGVL
Sbjct: 224 GFVVIPTHEKELMKAVATVGPISVAVDASHSSFQFYKSGIY----FEKKCSSKNLDHGVL 279
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+VGYG K YW+IKNSWGP WG
Sbjct: 280 VVGYGFEGA--NSKDNKYWLIKNSWGPEWG 307
>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
Length = 337
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 82/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD VD GC GGL+ A+E I+ GG+E E DYPYK C + + V +++
Sbjct: 177 QLVDCDFVDMGCDGGLIHTAYEQIMHI--GGVEQEYDYPYKAVRLPCAVKPHKFAVGVRN 234
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y V E + L GP+A+A++A + Y+GGV + F G L+H VL+V
Sbjct: 235 CYRYVLLSEERLEDLLRHVGPIAIAVDAVDLTDYYGGV---ISFCENNG---LNHAVLLV 288
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYG+ PYW IKNSWG +GE
Sbjct: 289 GYGIENNV------PYWTIKNSWGSDYGE 311
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KIQ 120
+LVDCD+ + GC GG M AFE ++ GG++ E +YPY G++ C++ KEE +V I
Sbjct: 188 ELVDCDRTNDGCDGGHMDYAFEWVMHN--GGIDTETNYPYSGADGTCNVAKEETKVIGID 245
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y NV + + VK P++ I+ ++ Q Y GG+ C D++DH +L
Sbjct: 246 GYYNVEQSDRSLLCATVKQ-PISAGIDGSSWDFQLYIGGIYDG---DCSSDPDDIDHAIL 301
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+VGYG + + YWI+KNSWG WG
Sbjct: 302 VVGYG------SEGDEDYWIVKNSWGTSWG 325
>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 78/150 (52%), Gaps = 15/150 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD ++GCGGG AF+ I+ + G + E+ YPY G + C + + I
Sbjct: 175 LVSCDNTNSGCGGGWPLVAFKWIVDRNNGTVYTEESYPYHSCIGISPPCTTSGHTVGATI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YV + DE +A +L NGP+AV ++A++ FY GGV + L H VL+
Sbjct: 235 TGYVTIPRDENGIAAWLAVNGPVAVVVDASSWIFYTGGV------MTSCVSKQLSHAVLL 288
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGY T P+WIIKNSW HWGE
Sbjct: 289 VGYNDSATV------PHWIIKNSWTTHWGE 312
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 84/154 (54%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVK 118
+LVDCD VD GC GGLM +AF+ II GL E YPY+G + C+ NK I
Sbjct: 178 ELVDCDTKGVDQGCEGGLMDDAFKFIIQN--HGLSTEAAYPYQGVDGTCNANKASIHAAT 235
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 236 ITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGV-----FSGSCGTE-LDHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV YW++KNSWG WGE+
Sbjct: 290 VTAVGYGVGNDG-----TKYWLVKNSWGTDWGEE 318
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 83/153 (54%), Gaps = 20/153 (13%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH--LNKEEIRVK 118
+LVDCD K ++GC GG M AF+ I+S GG++ E DYPYKG C NK +I V
Sbjct: 179 ELVDCDNKYNSGCNGGSMDYAFQFIVSN--GGIDSESDYPYKGVGAVCDPVRNKAKI-VS 235
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V + V + P++V I A+ A Q Y GV L NLDHG
Sbjct: 236 IDGYEDVPPMNEKALMKAVAHQPVSVGIEASGRAFQLYTSGV------LTGSCGTNLDHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V++VGYG K YWI++NSWGP WGE
Sbjct: 290 VVVVGYGSENGK------DYWIVRNSWGPEWGE 316
>gi|403345257|gb|EJY71991.1| Cysteine protease [Oxytricha trifallax]
Length = 249
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 91/161 (56%), Gaps = 14/161 (8%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDC + GC GGLM+ A++ + + LE DYPY G + C NK R+++Q
Sbjct: 82 QLVDCSPQNTGCNGGLMTLAYQYL--EGNQLLEFWSDYPYIGYTQRCKYNKNFGRIRLQG 139
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
YVNV+ + + V+ P+++AI +++ +QFY G+ ++ C +DHGVLI
Sbjct: 140 YVNVAKYDANELQKAVQQQPISIAIESDSIYIQFYNSGIVQDVR--CG---TQVDHGVLI 194
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNS 220
VGYG + YW++KNSWG WGE F I+KN+
Sbjct: 195 VGYGYD----VFYGEEYWLVKNSWGADWGENGY-FRILKNA 230
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 86/152 (56%), Gaps = 17/152 (11%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
L+DC + + GC GGLM AF+ I K G++ E+ YPY ++ CH NK +
Sbjct: 167 LIDCSRSFGNNGCEGGLMDYAFKYI--KANKGIDTEQSYPYNATDGVCHFNKSAVGATDT 224
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ DE ++ K + GP++VAI+A+ + QFY GV + C + LDHGV
Sbjct: 225 GFVDIPEGDENKLKKAVATVGPVSVAIDASHESFQFYSEGVYDEPE--CDS--EQLDHGV 280
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGYG T Q YW++KNSWG WG+
Sbjct: 281 LVVGYG------TKDGQDYWLVKNSWGTTWGD 306
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 89/153 (58%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+L+DCD + GC GGLM AFE I+ GGL E+DYPY C + K+E V I
Sbjct: 189 ELIDCDTTYNNGCNGGLMDYAFEYIVKN--GGLRKEEDYPYSMEEGTCEMQKDESETVTI 246
Query: 120 QSYVNV-SSDETEMAKYLVKNGPMAVAINANAMQF-YFGGVSHPLKFLCKGGMDNLDHGV 177
+ +V ++DE + K L P++VAI+A+ +F ++ GVS F + G+D LDHGV
Sbjct: 247 DGHQDVPTNDEKSLLKALAHQ-PLSVAIDASGREFQFYSGVS---VFDGRCGVD-LDHGV 301
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG + K Y I+KNSWGP WGEK
Sbjct: 302 AAVGYG------SSKGSDYIIVKNSWGPKWGEK 328
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 87/154 (56%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LVDCD VD GC GGLM +AF+ +I GL E +YPYKG + C+ N+ V
Sbjct: 723 ELVDCDTKGVDQGCEGGLMDDAFKFVIQ--NHGLNTEANYPYKGVDGKCNANEAANDVVT 780
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 781 ITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHG 834
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV ++ YW++KNSWG WGE+
Sbjct: 835 VTAVGYGV-----SNDGTEYWLVKNSWGTEWGEE 863
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/155 (41%), Positives = 83/155 (53%), Gaps = 25/155 (16%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVK 118
+LVDC + GC GGLM NAF+ IIS GGL+ E+DYPY + C +KE V
Sbjct: 167 QLVDCSGSFGNQGCNGGLMDNAFKYIISN--GGLDTEQDYPYTARDGVCDKSKESKHAVS 224
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGV-SHPLKFLCKGGMDNLDH 175
I Y +V + + V+ GP++VAI A+ + Q Y GV S P NLDH
Sbjct: 225 ISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCG-------TNLDH 277
Query: 176 GVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GVL+VGY YWI+KNSWG WG++
Sbjct: 278 GVLVVGY----------TSDYWIVKNSWGASWGDQ 302
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 70/170 (41%), Positives = 96/170 (56%), Gaps = 24/170 (14%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIR- 116
+LVDCD + GC GGLM +AF+ ++ GG++ E+DY Y G C+ K+ R
Sbjct: 184 QLVDCDTASNMGCSGGLMDDAFKYVLDN--GGIDTEEDYSYWSGYGFGFWCNKRKQTDRP 241
Query: 117 -VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLD 174
V I Y +V + E + K V P+AVAI A+A MQFY GV + C+G L+
Sbjct: 242 AVSIDGYEDVPTSEPALLK-AVAGQPVAVAICASANMQFYSSGV---INSCCEG----LN 293
Query: 175 HGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPR 224
HGVL VGY + K QPYWI+KNSWG WGE+ ++ +K GP+
Sbjct: 294 HGVLAVGYDT-----SDKAQPYWIVKNSWGGSWGEQG--YFRLKMGEGPK 336
>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
Length = 337
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 83/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+L+DCD + C GGLM AFE +++ GGL E DYPY+G+ C ++ ++ + + S
Sbjct: 177 QLIDCDSANMACDGGLMHTAFEQLMN--AGGLMEEIDYPYQGTKGVCKIDNKKFALSVSS 234
Query: 122 YVN-VSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+ +E + K L+ GP+A+AI+A ++ Y G+ H + L L+H VL+V
Sbjct: 235 CKRYIFQNEENLKKELITMGPIAMAIDAASISTYSKGIIHFCENL------GLNHAVLLV 288
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYG T YW +KNSWG WGE
Sbjct: 289 GYG------TEGGVSYWTLKNSWGSDWGE 311
>gi|209962646|gb|ACJ02118.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGGLM NAFE I+ + G + EK YPY G C ++ I
Sbjct: 30 LVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVSGGGEEPPCKPRGHKVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/152 (40%), Positives = 82/152 (53%), Gaps = 17/152 (11%)
Query: 63 LVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC D+ + GC GGLM NAF I +G ++ EK YPY+ + C K +
Sbjct: 160 LVDCSRDEGNMGCSGGLMDNAFTYIKKNMG--IDSEKSYPYEAVDGECRYKKSDSVTTDS 217
Query: 121 SYVNVS-SDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ DET + + GP++VAI+A+ + QFY GV C LDHGV
Sbjct: 218 GFVDIPHGDETALRTAVASVGPVSVAIDASHTSFQFYKTGVYTEAN--CSS--TQLDHGV 273
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGYGV Q YW++KNSWG WGE
Sbjct: 274 LVVGYGVENG------QDYWLVKNSWGASWGE 299
>gi|432881828|ref|XP_004073923.1| PREDICTED: counting factor associated protein D-like [Oryzias
latipes]
Length = 563
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/149 (42%), Positives = 81/149 (54%), Gaps = 10/149 (6%)
Query: 63 LVDCD--KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GG A+E I+ K GG E Y G N CH+N E+ I+
Sbjct: 396 LVDCSWGFGNNGCDGGEEWRAYEWIM-KHGGIATTETYGSYMGMNGFCHMNSSELVAPIK 454
Query: 121 SYVNVSSDETEMAKY-LVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
SY NV+S + E K L KNGP+AV+I+A+ F F G + C D+LDH VL
Sbjct: 455 SYTNVTSGDAEALKLALFKNGPVAVSIDASHRSFVFYGYGVYYEPACGNTTDDLDHAVLA 514
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
VGYG T +PYW+IKNSW +WG
Sbjct: 515 VGYG------TLNGEPYWLIKNSWSTYWG 537
>gi|355681666|gb|AER96819.1| cathepsin W [Mustela putorius furo]
Length = 373
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 18/163 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNR--ACHLNKEEIRVKI 119
+L+DC++ GC GG + +AF T+++ GL EKDYP++GS + C + + I
Sbjct: 181 ELLDCNRCGDGCKGGFVWDAFVTVLN--NSGLASEKDYPFRGSLKRHKCLASNYKKVAWI 238
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Q ++ + ++E MA YL +GP+ V IN +Q Y GV C + N H VL+
Sbjct: 239 QDFIMLQNNEQTMANYLATHGPITVTINMKLLQQYKKGVIKATPATCDPYLVN--HSVLL 296
Query: 180 VGYGV------------HKTKFTHKIQPYWIIKNSWGPHWGEK 210
VG+G H H+ PYWI+KNSWG WGE+
Sbjct: 297 VGFGKTNSSERRRAKGGHFWPHPHRPIPYWILKNSWGAEWGEE 339
>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
Length = 331
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 79/149 (53%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
++VDC D GCGGG S A++ +I GL+ +YPY +C + ++ KI S
Sbjct: 171 QIVDCSWWDDGCGGGFPSYAYDYVID--APGLDALANYPYTAVGGSCAFKESQVVAKISS 228
Query: 122 --YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y S+E +MA YL ++GP++V ++A + Y GGV + ++DH VL
Sbjct: 229 WTYTTTDSNEHQMANYLAQHGPISVCVDAESWPSYTGGV-----YRASACGTSIDHCVLA 283
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
VGY + PYWII+NSWG WG
Sbjct: 284 VGYNLTANP------PYWIIRNSWGTSWG 306
>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
Length = 323
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 90/160 (56%), Gaps = 16/160 (10%)
Query: 54 GEGTHLALK-LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHL 110
G T L+++ LVDC ++GC GGLM+ AF+ I + GLE + YPY G++ +C
Sbjct: 152 GSKTPLSVQQLVDCSTEGGNSGCNGGLMNGAFDYIKAN---GLESDAKYPYTGTDDSCKA 208
Query: 111 NKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGM 170
+K VK+ Y V+S E + + + GP++VA+ A+ + Y GG+ + + LC G
Sbjct: 209 DKSSSLVKLTGYKKVASSEASLKEAVGTVGPISVAVYADLWRSYGGGIFNNI--LCLGF- 265
Query: 171 DNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
LDHGV VGYG K YW +KNSWG WGE+
Sbjct: 266 -GLDHGVTAVGYGTDNGK------KYWPVKNSWGESWGEE 298
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/163 (40%), Positives = 95/163 (58%), Gaps = 19/163 (11%)
Query: 52 IRGEGTHLA-LKLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ GE L+ +LVDCD+ +AGC GGLM NAF+ II+ GG++ +KDYPY+ + C
Sbjct: 178 VTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQFIINN--GGIDTDKDYPYQAVDGKCD 235
Query: 110 LNKEEIR-VKIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLC 166
K + + V I + +V + + + V + P++VAI A+ A+QFY GV F
Sbjct: 236 TTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGV-----FTG 290
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ G LDHGV+IVGYG T YW+++NSWG WGE
Sbjct: 291 ECG-SALDHGVVIVGYG------TEDGIDYWLVRNSWGRDWGE 326
>gi|27819101|gb|AAO23117.1| cysteine proteinase [Bombyx mori NPV]
Length = 323
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 82/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+++ CD VDAGC GGL+ AFE II GG++ E DYPY+ N C +N + V+++
Sbjct: 163 QMIGCDFVDAGCNGGLLHTAFEAIIKM--GGVQLESDYPYEADNNNCRMNSNKFLVQVKD 220
Query: 122 -YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y + E ++ L GP+ +AI+A + Y G+ +K+ G L+H VL+V
Sbjct: 221 CYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGI---IKYCFDSG---LNHAVLLV 274
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV PYW KN+WG WGE
Sbjct: 275 GYGVENNI------PYWTFKNTWGTDWGE 297
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 68/173 (39%), Positives = 91/173 (52%), Gaps = 21/173 (12%)
Query: 47 LKKIQIRGEGTHLAL---KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY 101
L+ + G +++L +LVDC + GC GGL S AFE I K GGL+ E+ YPY
Sbjct: 178 LEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI--KYNGGLDTEESYPY 235
Query: 102 KGSNRACHLNKEEIRVKIQSYVNVS---SDETEMAKYLVKNGPMAVAINA-NAMQFYFGG 157
KG N CH E V++ VN++ DE + A LV+ P++VA N + Y G
Sbjct: 236 KGVNGVCHYKPENAAVQVLDSVNITLNAEDELQNAVGLVR--PVSVAFEVINGFRQYKSG 293
Query: 158 VSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V C D+++H VL VGYGV PYW+IKNSWG WG+K
Sbjct: 294 VYTSDH--CGTTPDDVNHAVLAVGYGVENGT------PYWLIKNSWGESWGDK 338
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 84/154 (54%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVK 118
+L+DCD VD GC GGLM +AF+ II G L E YPY+G + C+ NK I V
Sbjct: 125 ELIDCDTKGVDQGCEGGLMDDAFKFIIQNHG--LSTEVQYPYEGVDGTCNANKASIHAVT 182
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 183 ITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGV-----FTGSCGTE-LDHG 236
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV YW++KNSWG WGE+
Sbjct: 237 VTAVGYGVGNDG-----TKYWLVKNSWGADWGEE 265
>gi|162042|gb|AAA30179.1| cysteine protease, partial [Trypanosoma brucei]
Length = 165
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/144 (41%), Positives = 77/144 (53%), Gaps = 16/144 (11%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD + GCGGGLM NAF I++ GG + E YPY G C +N EI I
Sbjct: 35 LVYCDPL-IGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAI 93
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +A YL +N P+A+A+ A QFY H L + LDHGVL+
Sbjct: 94 TDHVDLPQDEDAIAAYLAENRPLAIAV--EAPQFY----GHNGGILTSCTSEQLDHGVLL 147
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSW 203
VGY + PYWI+KNSW
Sbjct: 148 VGYNDNSNP------PYWIVKNSW 165
>gi|392922426|ref|NP_001256718.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
gi|3879367|emb|CAB07275.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
Length = 337
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 86/153 (56%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM AFE I + G++ E+ YPYKG + CH NK+ + +
Sbjct: 172 LVDCSTKYGNHGCNGGLMDQAFEYI--RDNHGVDTEESYPYKGRDMKCHFNKKTVGADDK 229
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
YV+ DE ++ + GP+++AI+A + Q Y GV + + C + LDHGV
Sbjct: 230 GYVDTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEE--CSS--EELDHGV 285
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYG T H YWI+KNSWG WGEK
Sbjct: 286 LLVGYG---TDPEHG--DYWIVKNSWGAGWGEK 313
>gi|13928758|ref|NP_113748.1| cathepsin K precursor [Rattus norvegicus]
gi|12585195|sp|O35186.1|CATK_RAT RecName: Full=Cathepsin K; Flags: Precursor
gi|2305208|gb|AAB65743.1| cathepsin K [Rattus norvegicus]
gi|50927597|gb|AAH78793.1| Cathepsin K [Rattus norvegicus]
gi|149030667|gb|EDL85704.1| cathepsin K, isoform CRA_a [Rattus norvegicus]
Length = 329
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 88/161 (54%), Gaps = 18/161 (11%)
Query: 56 GTHLALK---LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK 112
G LAL LVDC + GCGGG M+ AF+ + + GG++ E YPY G + +C N
Sbjct: 157 GKLLALSPQNLVDCVSENYGCGGGYMTTAFQYV--QQNGGIDSEDAYPYVGQDESCMYNA 214
Query: 113 EEIRVKIQSYVNVS-SDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGG 169
K + Y + +E + + + + GP++V+I+A+ + QFY GV + C
Sbjct: 215 TAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDASLTSFQFYSRGVYYDEN--CD-- 270
Query: 170 MDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
DN++H VL+VGYG T K YWIIKNSWG WG K
Sbjct: 271 RDNVNHAVLVVGYG------TQKGNKYWIIKNSWGESWGNK 305
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVKI 119
+LVDCD VD GC GGLM AF+ I + GG+ E +YPY R+C+ KE V I
Sbjct: 190 ELVDCDDVDNQGCDGGLMDYAFQYI--QRNGGVTTESNYPYLAEQRSCNKAKERSHDVTI 247
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ + + V + P+AVAI A+ QFY GV F G D LDHGV
Sbjct: 248 DGYEDVPANNEDALQKAVASQPVAVAIEASGQDFQFYSEGV-----FTGSCGTD-LDHGV 301
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG T YW +KNSWG WGE+
Sbjct: 302 AAVGYGT-----TGDGTKYWTVKNSWGEDWGER 329
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 81/153 (52%), Gaps = 17/153 (11%)
Query: 63 LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM +AF+ I K G++ E YPY+ N C N +
Sbjct: 166 LVDCSQKQGNHGCQGGLMDDAFQYI--KDNNGIDTESSYPYEAKNGKCRFNAANVGATDS 223
Query: 121 SYVNVSS-DETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
+ ++ S E+++ + GP+AVAI+A+ M F Y GV H +F C LDHGV
Sbjct: 224 GFTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSGVYH--EFFCS--ETRLDHGV 279
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L VGYG K YW++KNSWG WG+K
Sbjct: 280 LAVGYGTESGK------DYWLVKNSWGESWGQK 306
>gi|375073966|gb|AFA34850.1| cathepsin L-like protein, partial [Trypanosoma dionisii]
Length = 159
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 76/142 (53%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D+GC GGLM++AFE I+ G + EK Y Y G C + + I
Sbjct: 30 LVSCDTMDSGCDGGLMNSAFEWIVEHHNGTVYTEKSYRYASGDGIAPPCRTSGRTVGAVI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V V DE +MA +L NGP+AVA++A++ Y GGV L D LDHGVL+
Sbjct: 90 TGHVKVPPDEAKMATWLAANGPLAVAVDASSWMSYTGGV------LTSCVSDELDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAPPYWIIKN 159
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 88/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD+ + GC GGLM AFE I+ GG++ E+DYPYKG C ++ +V I
Sbjct: 176 ELVDCDRAFNEGCNGGLMDYAFEFIVEN--GGIDTEQDYPYKGFEGRCDPTRKNAKVVSI 233
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V + K V + P++VAI A A+Q Y GV F + G NLDHGV
Sbjct: 234 DGYEDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGV-----FTGRCGT-NLDHGV 287
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
++VGYG F + + YW+++NSWG +WGE
Sbjct: 288 VVVGYG-----FENGVD-YWLVRNSWGTNWGE 313
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 69/161 (42%), Positives = 86/161 (53%), Gaps = 21/161 (13%)
Query: 56 GTHLAL---KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN 111
G +AL +LVDCD + GC GGLM AFE II+ GG++ E+DYPYK + C N
Sbjct: 173 GDMIALSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDSEEDYPYKERDNRCDAN 230
Query: 112 KEEIR-VKIQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKG 168
K+ + V I Y +V + K V N P++VAI A A Q Y G+ F +
Sbjct: 231 KKNAKVVTIDGYEDVPVNSELSLKKAVANQPISVAIEAGGRAFQLYKSGI-----FTGRC 285
Query: 169 GMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
G LDHGV VGYG K YWI+KNSWG WGE
Sbjct: 286 GT-ALDHGVTAVGYGSENGK------DYWIVKNSWGTVWGE 319
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 89/154 (57%), Gaps = 20/154 (12%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+L+DCD+ ++GCGGGLM A++ +I GG++ E+DYPY+ ++ C+ NK + RV I
Sbjct: 188 ELIDCDRSYNSGCGGGLMDYAYKFVIKN--GGIDTEEDYPYREADGTCNKNKLKKRVVTI 245
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAI--NANAMQFYFGGV-SHPLKFLCKGGMDNLDHG 176
Y +V S++ ++ V P++V I +A A Q Y+ G+ P +LDH
Sbjct: 246 DGYTDVPSNKEDLLLQAVAQQPVSVGICGSARAFQLYYQGIFDGPCP-------TSLDHA 298
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VLIVGYG K YWI+KNSWG WG K
Sbjct: 299 VLIVGYGSEGGK------DYWIVKNSWGESWGMK 326
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 82/153 (53%), Gaps = 18/153 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHL--NKEEIRVK 118
+L+DCD D +GC GGLM NAFE I K GG+ E YPY+ +N C + V
Sbjct: 186 ELIDCDTADNSGCQGGLMENAFEYI--KHSGGITTESAYPYRAANGTCDAVRARRAPLVV 243
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I + NV ++ V N P++VAI+A + QFY GV F G D LDHG
Sbjct: 244 IDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQFYSDGV-----FAGDCGTD-LDHG 297
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V +VGYG T+ YWI+KNSWG WGE
Sbjct: 298 VAVVGYGE-----TNDGTEYWIVKNSWGTAWGE 325
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 87/152 (57%), Gaps = 18/152 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD+ + GC GGLM AFE II GG++ +KDYPY+G + C K+ +V I
Sbjct: 179 ELVDCDRAYNEGCNGGLMDYAFEFIIQN--GGIDTDKDYPYRGFDGICDPTKKNAKVVNI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V + K V + P++VAI A+ A+Q Y GV F K G +LDHGV
Sbjct: 237 DGYEDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGV-----FTGKCGT-SLDHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
++VGYG + YW+++NSWG WGE
Sbjct: 291 VVVGYG------SENGVDYWLVRNSWGTGWGE 316
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 87/170 (51%), Gaps = 36/170 (21%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDKV + GC GGLM AFE I+ GG++ E+DYPYK + C N++ RV I
Sbjct: 189 ELVDCDKVYNQGCNGGLMDYAFEFIMKN--GGIDTEEDYPYKAVDSMCDPNRKNARVVTI 246
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ + + V N P++VAI A A Q Y GV F G LDHGV
Sbjct: 247 DGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGV-----FTGSCGT-QLDHGV 300
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGE 227
+ VGYG E + +W+++NSWGP WGE
Sbjct: 301 VAVGYGT------------------------ENGVDYWVVRNSWGPAWGE 326
>gi|401430108|ref|XP_003879535.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491914|emb|CBZ40911.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 359
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 84/153 (54%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN----RACHLNKEEIRV 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N + +K +
Sbjct: 177 QLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSYPYVSGNGYLPECSNSSKLVVGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G ++H V
Sbjct: 237 QIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVLTA----CIG--KQVNHAV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 291 LLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 317
>gi|375073968|gb|AFA34851.1| cathepsin L-like protein, partial [Trypanosoma dionisii]
Length = 159
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD +D+GC GGLM++AFE I+ G + E+ Y Y G + C + + I
Sbjct: 30 LVSCDTMDSGCDGGLMNSAFEWIVEHHNGTVYTEESYRYASGDGIAQPCRTSGRTVGAVI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V + DE +MA +L NGP+AVA++A++ FY GGV L + LDHGVL+
Sbjct: 90 TGHVKLPPDEAKMATWLAANGPLAVAVDASSWMFYTGGV------LTSCVSNELDHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSAAPPYWIIKN 159
>gi|186688053|gb|ACC86112.1| cathepsin K [Paralichthys olivaceus]
Length = 330
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 87/150 (58%), Gaps = 14/150 (9%)
Query: 64 VDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYV 123
VDC + GCGGG M+NAF+ + + GG++ E+ YPY G +++C N + + + Y
Sbjct: 168 VDCVTENNGCGGGYMTNAFQYV--QENGGIDSEEAYPYVGEDQSCRYNSSGMAAQCKGYK 225
Query: 124 NVS-SDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
V DE +A L K GP++V I+A+ + QFY GV + D+++H VL V
Sbjct: 226 EVPVGDEHALAVALFKVGPVSVGIDASQSSFQFYQRGVYYDRNC----NKDDINHAVLAV 281
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYG+ + K + YWIIKNSW +WG+K
Sbjct: 282 GYGI-----SSKGKKYWIIKNSWSENWGKK 306
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 86/156 (55%), Gaps = 20/156 (12%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK----EEIR 116
+L+DCD D GC GGLM NAFE I K GGL E YPY+ + C++ + +
Sbjct: 185 ELIDCDTADNDGCQGGLMDNAFEYI--KNNGGLITEAAYPYRAARGTCNVARAAQNSPVV 242
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLD 174
V I + +V ++ E V N P++VA+ A+ A FY GV F + G + LD
Sbjct: 243 VHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGV-----FTGECGTE-LD 296
Query: 175 HGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
HGV +VGYGV + + YW +KNSWGP WGE+
Sbjct: 297 HGVAVVGYGVAEDG-----KAYWTVKNSWGPSWGEQ 327
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 84/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD D GC GGLM AF II GG++ EKDY YK + C++ KE+ V I
Sbjct: 187 ELVDCDVTQDHGCHGGLMDFAFSFIIRN--GGIDTEKDYKYKAQDGVCNIAKEKRHVVTI 244
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V ++ K N P++VAI A+ +F Y GGV F G LDHGV
Sbjct: 245 DSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGV-----FDAPCGT-ALDHGV 298
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGYG + YWI+KNSWG WG+
Sbjct: 299 LVVGYG------SDNGTDYWIVKNSWGDFWGD 324
>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
Length = 335
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 82/150 (54%), Gaps = 16/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVK--I 119
+L+DCD VD GC GGL+ AFE II GG++ E DYP+ G +R C +++ V +
Sbjct: 174 QLIDCDSVDMGCNGGLLHTAFEEIIRM--GGVQAELDYPFVGRDRRCGVDRHRPYVVSLV 231
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y V +E ++ L GP+ +AI+A + Y+ GV C+ + L+H VL+
Sbjct: 232 GCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVISS----CEN--NGLNHAVLL 285
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYGV PYW KN+WG WGE
Sbjct: 286 VGYGVENGV------PYWAFKNTWGDDWGE 309
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 83/153 (54%), Gaps = 17/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GGLM AFE I K GGL E YPYK S+ C NKE V I
Sbjct: 177 ELVDCDTNQNQGCNGGLMDLAFEFIKEK--GGLTSELVYPYKASDETCDTNKENAPVVSI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ +V + + V N P++VAI+A + QFY GV F + G + L+HGV
Sbjct: 235 DGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGRCGTE-LNHGV 288
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+VGYG T YWI+KNSWG WGEK
Sbjct: 289 AVVGYGT-----TIDGTKYWIVKNSWGEEWGEK 316
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVKI 119
+LVDCD D GC GGLM AF+ I K GG+ E +YPY+ C+ K V I
Sbjct: 194 ELVDCDTGDNQGCDGGLMDYAFQFI--KRNGGITTESNYPYRAEQGRCNKAKASSHDVTI 251
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V +++ + V N P+AVA+ A+ QFY GV F + G D LDHGV
Sbjct: 252 DGYEDVPANDESALQKAVANQPVAVAVEASGQDFQFYSEGV-----FTGECGTD-LDHGV 305
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG+ T YWI+KNSWG WGE+
Sbjct: 306 AAVGYGI-----TRDGTKYWIVKNSWGEDWGER 333
>gi|209962644|gb|ACJ02117.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGG M NAFE I+ + G + EK YPY GS C E+ I
Sbjct: 30 LVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVSEDGSKPFCIPYGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|209962638|gb|ACJ02114.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962642|gb|ACJ02116.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962648|gb|ACJ02119.1| cathepsin L-like protein [Trypanosoma vivax]
gi|209962650|gb|ACJ02120.1| cathepsin L-like protein [Trypanosoma vivax]
Length = 159
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GCGGG M NAFE I+ + G + EK YPY GS C E+ I
Sbjct: 30 LVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVSEDGSKPFCIPYGHEVGATI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +AKYL NGP+AVA++A Y GGV + + L+HGVL+
Sbjct: 90 TGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV------VTSCTSEALNHGVLL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYN------DSSKPPYWIIKN 159
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 83/150 (55%), Gaps = 12/150 (8%)
Query: 63 LVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC D+ + GC GGLM AF+ I K GGL+ E+ YPY+ + +C E
Sbjct: 166 LVDCSHDQGNQGCNGGLMDFAFQYI--KENGGLDSEESYPYEAKDGSCKYRAEYAVANDT 223
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+V++ E + K + GP++VA++A+ ++QFY G+ + K +LDHGVL
Sbjct: 224 GFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSK----DLDHGVL 279
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+VGYG T YW++KNSWG WG
Sbjct: 280 VVGYGYEGTDSNK--DKYWLVKNSWGKEWG 307
>gi|341878328|gb|EGT34263.1| CBN-CPL-1 protein [Caenorhabditis brenneri]
Length = 336
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 86/153 (56%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM AFE I + G++ E+ YPYKG + CH NK+ I +
Sbjct: 171 LVDCSTKYGNHGCNGGLMDQAFEYI--RDNHGVDTEESYPYKGRDMKCHFNKKTIGADDK 228
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
YV+ DE ++ + GP+++AI+A + Q Y GV + + C + LDHGV
Sbjct: 229 GYVDTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEE--CSS--EELDHGV 284
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYG T H YW++KNSWG WGEK
Sbjct: 285 LLVGYG---TDPEHG--DYWLVKNSWGTGWGEK 312
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVKI 119
+LVDCD D GC GGLM AF+ I K GG+ E +YPY+ C+ K V I
Sbjct: 194 ELVDCDTGDNQGCDGGLMDYAFQFI--KRNGGITTESNYPYRAEQGRCNKAKASSHDVTI 251
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V +++ + V N P+AVA+ A+ QFY GV F + G D LDHGV
Sbjct: 252 DGYEDVPANDESALQKAVANQPVAVAVEASGQDFQFYSEGV-----FTGECGTD-LDHGV 305
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG+ T YWI+KNSWG WGE+
Sbjct: 306 AAVGYGI-----TRDGTKYWIVKNSWGEDWGER 333
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 85/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GGLM AFE II+ GG++ E+DYPYKG + C ++ +V I
Sbjct: 183 ELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 240
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ E K + + P++VAI A Q Y G+ +C G D LDHGV
Sbjct: 241 DLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGI---FDGIC--GTD-LDHGV 294
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG K YWI+KNSWG WGE
Sbjct: 295 VAVGYGTENGK------DYWIVKNSWGTSWGE 320
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 84/153 (54%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKI 119
+LVDCDK + GC GGLM +AFE I K GG+ E +YPYK C +K ++ V I
Sbjct: 179 ELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYKAQEGTCDESKVNDLAVSI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ NV ++ V N P++VAI+A + QFY GV F D L+HGV
Sbjct: 237 DGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGDCNTD-LNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
IVGYG T YWI++NSWGP WGE+
Sbjct: 291 AIVGYGT-----TVDGTNYWIVRNSWGPEWGEQ 318
>gi|348542138|ref|XP_003458543.1| PREDICTED: counting factor associated protein D-like [Oreochromis
niloticus]
Length = 551
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 85/151 (56%), Gaps = 14/151 (9%)
Query: 63 LVDCD--KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
L+DC + GC GG A+E I+ K GG E Y G N CH++ E+ +IQ
Sbjct: 384 LIDCSWGFGNNGCDGGEEWRAYEWIM-KHGGIATTETYGAYMGMNGFCHVDSSELTARIQ 442
Query: 121 SYVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKF--LCKGGMDNLDHGV 177
SY NV+S D+ + L KNGP+AV+I+A+ F F SH + + C +D+LDH V
Sbjct: 443 SYTNVTSGDQLALKMALFKNGPVAVSIDASHRSFVF--YSHGVYYEPACGNTVDDLDHAV 500
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
L VGYG T +PYW+IKNSW +WG
Sbjct: 501 LAVGYG------TLSGEPYWLIKNSWSTYWG 525
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 82/152 (53%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC-HLNKEEIRVKI 119
+LVDCD + GC GGLM AFE II GG++ +KDYPYKG + C + K V I
Sbjct: 181 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 238
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V + E K V + P++VAI A A Q Y G+ F G LDHGV
Sbjct: 239 DSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGI-----FDGTCGT-QLDHGV 292
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG K YWI++NSWG WGE
Sbjct: 293 VAVGYGTENGK------DYWIVRNSWGKSWGE 318
>gi|1749812|emb|CAA90237.1| cysteine proteinase LmCPB1 [Leishmania mexicana]
Length = 359
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 84/153 (54%), Gaps = 16/153 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN----RACHLNKEEIRV 117
+LV CD ++ GC GGLM AF+ ++ G L E YPY N + +K +
Sbjct: 177 QLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSYPYVSGNGYLPECSNSSKLVVGA 236
Query: 118 KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+I +V + S E MA +L KNGP+A+A++A++ Y GV C G ++H V
Sbjct: 237 QIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGV----LTACIG--KQVNHAV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGY T ++ PYW+IKNSWG WGE+
Sbjct: 291 LLVGY-----DMTGEV-PYWVIKNSWGGDWGEQ 317
>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
Length = 343
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 82/149 (55%), Gaps = 15/149 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ- 120
+LVDCD VD GC GGL+ A+E I+ GG+E E DYPY+ + C L + ++
Sbjct: 183 QLVDCDFVDMGCDGGLIHTAYEQIMQM--GGVEQEFDYPYRAERQPCALKPHKFAAGVRK 240
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
+ V +E + L GP+A+A++A + Y+GG+ + F G L+H VL+V
Sbjct: 241 CFRYVLRNEERLEDLLRHVGPIAIAVDAVDLTDYYGGI---VSFCENNG---LNHAVLLV 294
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
GYGV P+W +KNSWG +GE
Sbjct: 295 GYGVENNV------PFWTLKNSWGSDYGE 317
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 81/153 (52%), Gaps = 17/153 (11%)
Query: 63 LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM +AF+ I K G++ E YPY+ N C N +
Sbjct: 166 LVDCSQKQGNHGCQGGLMDDAFQYI--KDNSGIDTESSYPYEAKNGKCRFNAANVGATDS 223
Query: 121 SYVNVSS-DETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
+ ++ S E+++ + GP++VAI+A+ M F Y GV H +F C LDHGV
Sbjct: 224 GFTDIKSKSESDLQSAVATVGPISVAIDASHMSFQLYRSGVYH--EFFCS--ETRLDHGV 279
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L VGYG K YW++KNSWG WG+K
Sbjct: 280 LAVGYGTESGK------DYWLVKNSWGESWGQK 306
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 93/170 (54%), Gaps = 36/170 (21%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+LVDCD+ +AGC GGLM AF+ II+ GGL+ EKDYPY G++ C +K + + V I
Sbjct: 168 ELVDCDRFYNAGCNGGLMDYAFQFIINN--GGLDTEKDYPYLGNDDTCDRDKMKTKAVSI 225
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ +V + + + V + P++VAI A+ A+QFY GV F + G LDHGV
Sbjct: 226 DGFEDVLPFDEKALQKAVAHQPVSVAIEASGMALQFYQSGV-----FTGECGT-ALDHGV 279
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGE 227
++VGYG EK + +W+++NSWG WGE
Sbjct: 280 VVVGYGT------------------------EKGLDYWLVRNSWGTEWGE 305
>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 86/150 (57%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH-LNKEEIRVKIQ 120
+L+DCD VDAGC GGL+ A+E ++ GG++ E DYPY+GS+ C + + +
Sbjct: 164 QLIDCDYVDAGCNGGLLHTAYEAVMQM--GGVQAENDYPYEGSDGNCRVDVAKFVVKVKK 221
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIV 180
Y ++ E ++ L GP+ VAI+A+ + Y G+ +++ G L+H VL+V
Sbjct: 222 CYRYIAVFEEKLKDLLRIVGPIPVAIDASDIVNYRRGI---MRYCSNYG---LNHAVLLV 275
Query: 181 GYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GYGV PYWI+KN+WG WGE+
Sbjct: 276 GYGVENNV------PYWILKNTWGEDWGEQ 299
Score = 37.7 bits (86), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 32/58 (55%)
Query: 3 ATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGEGTHLA 60
A +D L+ + F FL K NK Y+++ E +R +IF+ NL++I I+ + A
Sbjct: 12 CVAHSAAYDLLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTA 69
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 83/152 (54%), Gaps = 19/152 (12%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKI 119
+L+DCD +AGC GGLM +AFE IIS GG++ ++DYPYK N +C NK + V I
Sbjct: 185 ELMDCDTSYNAGCDGGLMDDAFEFIISN--GGIDTDEDYPYKARNDSCDANKRNRKAVTI 242
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y ++ +E + K V N P++VAI A Q Y G+ F G D LDH
Sbjct: 243 DDYEDLRMNEKSLQK-AVSNQPVSVAIEAGGRDFQLYKSGI-----FTGTCGTD-LDHAT 295
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
IVGYG + YWI+K S+G WGE
Sbjct: 296 TIVGYG------SENGTDYWIVKESYGTSWGE 321
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 87/154 (56%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LVDCD VD GC GGLM +AF+ +I GL E +YPYKG + C+ N+ V
Sbjct: 194 ELVDCDTKGVDQGCEGGLMDDAFKFVIQN--HGLNTEANYPYKGVDGKCNANEAANDVVT 251
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 252 ITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHG 305
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV ++ YW++KNSWG WGE+
Sbjct: 306 VTAVGYGV-----SNDGTEYWLVKNSWGTEWGEE 334
>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 58/151 (38%), Positives = 84/151 (55%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D GC G M AF+ I+S G + E+ YPY G+ AC+ + + + I
Sbjct: 178 LVSCDTNDLGCRAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPACNKSGKVVGANI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ +V++ +E +A++L KNGP+A+A++A + Q Y GGV L ++ L+
Sbjct: 238 RDHVHILDNENAIAEWLAKNGPVAIAVDATSFQRYTGGV------LTSCISKEVNSAALL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSWG WGE+
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWGKGWGEE 316
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 82/152 (53%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC-HLNKEEIRVKI 119
+LVDCD + GC GGLM AFE II GG++ +KDYPYKG + C + K V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V + E K V + P+++AI A A Q Y G+ F G LDHGV
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI-----FDGSCGT-QLDHGV 299
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG K YWI++NSWG WGE
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGE 325
>gi|330842502|ref|XP_003293216.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
gi|325076482|gb|EGC30264.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
Length = 376
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 71/186 (38%), Positives = 96/186 (51%), Gaps = 27/186 (14%)
Query: 63 LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLNKEEIRVKI 119
LVDC + GC GGLM NAF II G ++ E YPYK S C I +
Sbjct: 174 LVDCSGAEGNLGCDGGLMDNAFIYIIQNKG--IDTESSYPYKAQSGTKCLFKPTSIGATL 231
Query: 120 QSYVNVSS-DETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
YVN+++ E+++ + KNGP++VAI+A N+ Q Y GV + K C LDHG
Sbjct: 232 SGYVNITAGSESQLETAVAKNGPVSVAIDASHNSFQLYSSGVYYEPK--CS--PTELDHG 287
Query: 177 VLIVGYGVHK------TKFTHKIQPY--------WIIKNSWGPHWGEKTMPFWIIKNSWG 222
VL+VGYGV K + H+I+ I+ +S G KT +W++KNSWG
Sbjct: 288 VLVVGYGVAKKDENNASPNKHQIRIRHNDDFGIDEIVTDSSSDD-GRKTSQYWLVKNSWG 346
Query: 223 PRWGEQ 228
WG Q
Sbjct: 347 VSWGMQ 352
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 85/153 (55%), Gaps = 18/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GGLM AFE II+ GG++ E+DYPYKG + C ++ +V I
Sbjct: 177 ELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ E K + + P++VAI A Q Y G+ +C G D LDHGV
Sbjct: 235 DLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGI---FDGIC--GTD-LDHGV 288
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+ VGYG K YWI+KNSWG WGE
Sbjct: 289 VAVGYGTENGK------DYWIVKNSWGTSWGES 315
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 83/152 (54%), Gaps = 16/152 (10%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
+L+DCD + GC GGLM AF+ I K GG+ E +YPY + C K+ V I
Sbjct: 182 ELIDCDTDENNGCNGGLMDYAFDFI--KKNGGISSEAEYPYAAEDSYCATEKKSHVVSID 239
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+ +V +++ + V N P+++AI A+ QFY GV F + G + LDHGV
Sbjct: 240 GHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGV-----FTGRSGTE-LDHGVA 293
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
IVGYG T + YWI++NSWG WGEK
Sbjct: 294 IVGYGK-----TQQGTKYWIVRNSWGAEWGEK 320
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 82/152 (53%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC-HLNKEEIRVKI 119
+LVDCD + GC GGLM AFE II GG++ +KDYPYKG + C + K V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V + E K V + P+++AI A A Q Y G+ F G LDHGV
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI-----FDGSCGT-QLDHGV 299
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG K YWI++NSWG WGE
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGE 325
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 83/153 (54%), Gaps = 17/153 (11%)
Query: 63 LVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC D + GC GGLM NAF I K GG++ E YPY+G + C +K I
Sbjct: 161 LVDCSTDYGNNGCNGGLMDNAFSYI--KANGGIDTETGYPYEGQDGTCRYSKSSIGADDT 218
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINANAM--QFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ DE + + + GP++VAI+A+ M QFY GV + C LDHGV
Sbjct: 219 GFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQ--CSPSA--LDHGV 274
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYG K YW++KNSWG WG +
Sbjct: 275 LVVGYGTDNGK------DYWLVKNSWGTGWGTE 301
>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 524
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 83/151 (54%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD + CGGG AF+ I+S G + E+ YPY G C+ + + + KI
Sbjct: 257 LVSCDTTEDNCGGGFADRAFKWIVSSNKGNVFTERSYPYASIDGYVPPCNKSGKVVGAKI 316
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
++N+ DE +A++L +NGP+A+A++A+ Y GGV L +++H VL+
Sbjct: 317 SGHINLPKDENAIAEWLARNGPVAIAVDASTFLDYKGGV------LTSCSSKHVNHEVLL 370
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSW WGE+
Sbjct: 371 VGYND-----TSK-PPYWIIKNSWDKEWGEE 395
>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
Length = 334
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 93/191 (48%), Gaps = 37/191 (19%)
Query: 41 RIFRANLKKIQIRGEGTHLALKLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKD 98
++FR K + + + LVDC + + GC GGLM NAF+ I K GGL+ E+
Sbjct: 150 QMFRKTGKLVSLSEQ------NLVDCSRAQGNQGCNGGLMDNAFQYI--KDNGGLDSEES 201
Query: 99 YPYKGSN-RACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYF 155
YPY ++ +C+ E +V++ E + K + GP++VAI+A + QFY
Sbjct: 202 YPYLATDTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHTSFQFYK 261
Query: 156 GGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFW 215
G+ + CK +LDHGVL+VGYG T + FW
Sbjct: 262 SGIYYDPDCSCK----DLDHGVLVVGYGFEGTDSNNN--------------------KFW 297
Query: 216 IIKNSWGPRWG 226
I+KNSWGP WG
Sbjct: 298 IVKNSWGPEWG 308
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 84/152 (55%), Gaps = 17/152 (11%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + GC GGLM NAF+ I K GG++ EK YPY+ + C K+ +
Sbjct: 169 LVDCSETFGNHGCEGGLMDNAFQYI--KANGGIDTEKSYPYEAEDGECRFKKQNVGATDT 226
Query: 121 SYVNVSS-DETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ E ++ K + GP++VAI+A ++ Q Y GV + C + LDHGV
Sbjct: 227 GFVDIEQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETE--CSS--EQLDHGV 282
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGYGV K YW++KNSW WG+
Sbjct: 283 LVVGYGVEDGK------KYWLVKNSWAESWGD 308
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 63/150 (42%), Positives = 77/150 (51%), Gaps = 15/150 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
+LVDCD ++GC GGLM AF+ I K GGL E YPY ++C V I
Sbjct: 180 QLVDCDTKNSGCNGGLMDYAFDFI--KNNGGLSSEDSYPYLAEQKSCGSEANSAVVTIDG 237
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y +V + V N P++VAI A+ A QFY GV F G + LDHGV
Sbjct: 238 YQDVPRNNEAALMKAVANQPVSVAIEASGYAFQFYSQGV-----FSGHCGTE-LDHGVAA 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYGV + YWI+KNSWG WGE
Sbjct: 292 VGYGVDDDG-----KKYWIVKNSWGEGWGE 316
>gi|262093268|gb|ACY25958.1| cathepsin L-like protein [Trypanosoma rangeli]
gi|262093270|gb|ACY25959.1| cathepsin L-like protein [Trypanosoma rangeli]
gi|262093274|gb|ACY25961.1| cathepsin L-like protein [Trypanosoma rangeli]
gi|262093276|gb|ACY25962.1| cathepsin L-like protein [Trypanosoma rangeli]
Length = 159
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GC GGLM NAF+ I+ K G + E Y Y G+++ C+++ + I
Sbjct: 30 LVSCDNADNGCDGGLMDNAFDWIVGKNNGTVYTEASYSYVSGGGNSQKCNMSGHVVGAVI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++A + Y GGV L D LDHGV++
Sbjct: 90 SGHVDLPKDEDKMAAWLAANGPLAIAVDATSFMSYTGGV------LTNCISDQLDHGVVL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYNDSSNP------PYWIIKN 159
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 82/152 (53%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC-HLNKEEIRVKI 119
+LVDCD + GC GGLM AFE II GG++ +KDYPYKG + C + K V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V + E K V + P+++AI A A Q Y G+ F G LDHGV
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI-----FDGSCGT-QLDHGV 299
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG K YWI++NSWG WGE
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGE 325
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 85/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN-KEEIRVKI 119
+LVDCD + D GC GGLM AF+ II GGL+ EKDYPY G + C+L+ K V I
Sbjct: 186 ELVDCDTEYDMGCNGGLMDYAFDFIIKN--GGLDTEKDYPYTGFDGECNLSGKSSKVVSI 243
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V + + + V + P++VA+ A A+Q Y G+ F + G LDHG+
Sbjct: 244 DGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRALQLYVSGI-----FTGECGT-ALDHGI 297
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG T YWI++NSWG WGE
Sbjct: 298 VAVGYG------TENGTDYWIVRNSWGSSWGE 323
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 84/153 (54%), Gaps = 18/153 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC-HLNKEEIRVKI 119
+LVDCD + GC GGLM AFE IIS GG++ ++DYPY G + +C K V I
Sbjct: 191 ELVDCDTYYNQGCNGGLMDYAFEFIISN--GGIDTDEDYPYTGRDGSCDQYRKNAHVVTI 248
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V ++ + + V N P++VAI A A Q Y G+ F G + LDHGV
Sbjct: 249 DSYEDVPINDEKSLQKAVANQPVSVAIEAGGRAFQLYESGI-----FTGYCGTE-LDHGV 302
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+GYG K+ YWI+KNSWG WGE
Sbjct: 303 TAIGYGSENGKY------YWIVKNSWGSDWGES 329
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 86/152 (56%), Gaps = 18/152 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GGLM AF+ IIS GG++ E+DYPYK + C N++ +V I
Sbjct: 181 ELVDCDTAYNEGCNGGLMDYAFQFIISN--GGIDTEEDYPYKERDGLCDPNRKNAKVVSI 238
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V ++ K V + P++VAI + Q Y G+ F + G+D LDHGV
Sbjct: 239 DSYEDVLENDEHALKTAVAHQPVSVAIEGGGRSFQLYKSGI-----FDGRCGID-LDHGV 292
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG K YWI++NSWG WGE
Sbjct: 293 VAVGYGTESGK------DYWIVRNSWGKSWGE 318
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 82/152 (53%), Gaps = 17/152 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH-LNKEEIRVKI 119
+L+DCD D +GC GGLM NAFE I K GG+ E YPY+ +N C + V I
Sbjct: 184 ELIDCDTADNSGCQGGLMENAFEYI--KHSGGITTESAYPYRAANGTCDAVRARGGLVVI 241
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ NV ++ V N P++VAI+A + QFY GV F G D LDHGV
Sbjct: 242 DGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQFYSDGV-----FAGDCGTD-LDHGV 295
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+VGYG T+ YWI+KNSWG WGE
Sbjct: 296 AVVGYGE-----TNDGTEYWIVKNSWGTAWGE 322
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 87/154 (56%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE-EIRVK 118
+LVDCD VD GC GGLM +AF+ +I GL E +YPYKG + C++N+
Sbjct: 176 ELVDCDTKGVDQGCEGGLMDDAFKFVIQN--HGLNTEANYPYKGVDGKCNVNEAANDAAT 233
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 234 ITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHG 287
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV ++ YW++KNSWG WGE+
Sbjct: 288 VTAVGYGV-----SNDGTEYWLVKNSWGTEWGEE 316
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVKI 119
+LVDCD + GC GGLM AFE I K GG+ E +YPY+ + C ++KE V I
Sbjct: 177 ELVDCDTDQNQGCNGGLMDYAFEFI--KQRGGITTEANYPYEAYDGTCDVSKENAPAVSI 234
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ NV ++ V N P++VAI+A QFY GV F G + LDHGV
Sbjct: 235 DGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGSCGTE-LDHGV 288
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
IVGYG T YW +KNSWGP WGEK
Sbjct: 289 AIVGYGT-----TIDGTKYWTVKNSWGPEWGEK 316
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 85/154 (55%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVK 118
+LVDCD VD GC GGLM NAF I + GL E +YPYKG + C+ NK+ I +
Sbjct: 173 ELVDCDTSGVDQGCEGGLMDNAFTFI--QHNHGLASEANYPYKGVDGTCNTNKQAIHAAE 230
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I + +V ++ E V + P++VAI+A + QFY GV F+ G LDHG
Sbjct: 231 INGFEDVPANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGV-----FIGACGT-QLDHG 284
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYG + YW++KNSWG WGE+
Sbjct: 285 VTAVGYGT-----SDDGTKYWLVKNSWGTQWGEE 313
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 84/150 (56%), Gaps = 14/150 (9%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GGLM AFE II+ GG++ E+DYPYK + C ++ +V I
Sbjct: 192 ELVDCDNGQNQGCNGGLMDYAFEFIINN--GGIDTEEDYPYKARDGKCDQYRKNAKVVSI 249
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y +V ++ + + V N P++VAI A +F + H F + G D LDHGV+
Sbjct: 250 DGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQ---LYHSGIFTGRCGTD-LDHGVVA 305
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG K YWI++NSWG WGE
Sbjct: 306 VGYGTENGK------DYWIVRNSWGGDWGE 329
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 86/156 (55%), Gaps = 20/156 (12%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK----EEIR 116
+L+DCD D GC GGLM NAFE I K GGL E YPY+ + C++ + +
Sbjct: 55 ELIDCDTADNDGCQGGLMDNAFEYI--KNNGGLITEAAYPYRAARGTCNVARAAQNSPVV 112
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLD 174
V I + +V ++ E V N P++VA+ A+ A FY GV F + G + LD
Sbjct: 113 VHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGV-----FTGECGTE-LD 166
Query: 175 HGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
HGV +VGYGV + + YW +KNSWGP WGE+
Sbjct: 167 HGVAVVGYGVAEDG-----KAYWTVKNSWGPSWGEQ 197
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 85/156 (54%), Gaps = 20/156 (12%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK----EEIR 116
+L+DCD D GC GGLM NAFE I K GGL E YPY+ + C++ + +
Sbjct: 185 ELIDCDTADNDGCQGGLMDNAFEYI--KNNGGLITEAAYPYRAARGTCNVARAAQNSPVV 242
Query: 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLD 174
V I + +V ++ E V N P++VA+ A+ A FY GV F G + LD
Sbjct: 243 VHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGV-----FTGDCGTE-LD 296
Query: 175 HGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
HGV +VGYGV + + YW +KNSWGP WGE+
Sbjct: 297 HGVAVVGYGVAEDG-----KAYWTVKNSWGPSWGEQ 327
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 82/152 (53%), Gaps = 17/152 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH-LNKEEIRVKI 119
+L+DCD D +GC GGLM NAFE I K GG+ E YPY+ +N C + V I
Sbjct: 184 ELIDCDTADNSGCQGGLMENAFEYI--KHSGGITTESAYPYRAANGTCDAVRARGGLVVI 241
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ NV ++ V N P++VAI+A + QFY GV F G D LDHGV
Sbjct: 242 DGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQFYSDGV-----FAGDCGTD-LDHGV 295
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+VGYG T+ YWI+KNSWG WGE
Sbjct: 296 AVVGYGE-----TNDGTEYWIVKNSWGTAWGE 322
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 70/164 (42%), Positives = 90/164 (54%), Gaps = 19/164 (11%)
Query: 52 IRGEGTHLA-LKLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ GE L+ +LVDCD D GC GGLM A++ II GGL+ E DYPY + C
Sbjct: 174 VTGELASLSEQELVDCDTDEDRGCSGGLMDYAYQWIIKN--GGLDTEDDYPYTAEDGVCV 231
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGV-SHPLKFL 165
K+ RV I YV++ ++ K + P+AVAI A+A Q Y GGV P
Sbjct: 232 AAKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPT--- 288
Query: 166 CKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
C +L+HGVL+VGYG K H YWI+KNSWGP WG+
Sbjct: 289 CG---TSLNHGVLVVGYG----KDPH-FGNYWIVKNSWGPEWGD 324
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 88/153 (57%), Gaps = 19/153 (12%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLNKEEIRV-K 118
+LVDCD + GCGGGLM AF+ II GG++ E+DYPY ++ C+ +K+ RV
Sbjct: 180 ELVDCDTSYNDGCGGGLMDYAFKFIIEN--GGIDTEEDYPYIATDVNVCNSDKKNTRVVT 237
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + K + N P++VAI A A Q Y GV F G +LDHG
Sbjct: 238 IDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGV-----FTGTCGT-SLDHG 291
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V+ VGYG + Q YWI++NSWG +WGE
Sbjct: 292 VVAVGYG------SEGGQDYWIVRNSWGSNWGE 318
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 84/152 (55%), Gaps = 18/152 (11%)
Query: 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GGLM AFE II+ GG++ E+DYPY+G + C ++ +V I
Sbjct: 218 ELVDCDTGYNEGCNGGLMDYAFEFIINN--GGIDSEEDYPYRGVDGRCDTYRKNAKVVSI 275
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V + + K V N P++VAI +F Y GV F + G LDHGV
Sbjct: 276 DDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGV-----FTGRCGT-ALDHGV 329
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG T YWI++NSWGP WGE
Sbjct: 330 VAVGYG------TANGHDYWIVRNSWGPSWGE 355
>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
Length = 334
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 83/150 (55%), Gaps = 12/150 (8%)
Query: 63 LVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC D+ + GC GGLM AF+ I K GGL+ E+ YPY+ + +C E
Sbjct: 166 LVDCSHDQGNQGCNGGLMDFAFQYI--KENGGLDSEESYPYEAKDGSCKYRAEYAVANDT 223
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+V++ E + K + GP++VA++A+ ++QFY G+ + K +LDHGVL
Sbjct: 224 GFVDIPQQEKALMKPVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSK----DLDHGVL 279
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+VGYG T YW++KNSWG WG
Sbjct: 280 VVGYGYEGTDSNK--DKYWLVKNSWGKEWG 307
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 83/152 (54%), Gaps = 12/152 (7%)
Query: 63 LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + + GC GGLM AF+ + K GGL+ E+ YPY+ + +C E+
Sbjct: 166 LVDCSQAEGNEGCSGGLMDYAFQYV--KDNGGLDSEESYPYRAQDESCKYKPEQSAANDT 223
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
++++ +E + + GP++ AI+A + QFY G+ + C +NLDHG+L
Sbjct: 224 GFMDIHPEEESLKLAVATVGPISAAIDASLSTFQFYHKGIYYDPD--CSS--ENLDHGIL 279
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+VGYG + Q YWI+KNSWG WG +
Sbjct: 280 VVGYGSQGED--SEKQKYWIVKNSWGTDWGTQ 309
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 87/156 (55%), Gaps = 15/156 (9%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIR-VKIQ 120
+LVDCD + GC GG M AFE ++S GG++ E DYPY G + C+ KEE + V I
Sbjct: 198 ELVDCDSTNDGCEGGYMDYAFEWVMSN--GGIDTETDYPYTGEDGTCNTTKEETKAVSID 255
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V+ +E+ + ++K P++V I+ A+ F Y GG+ D++DH VL
Sbjct: 256 GYEDVAEEESALFCAVLKQ-PISVGIDGGAIDFQLYTGGIYDGDCSD---DPDDIDHAVL 311
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPF 214
+VGYG + YWIIKNSWG WG K +
Sbjct: 312 VVGYGAESG------EEYWIIKNSWGTDWGMKGYAY 341
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 83/153 (54%), Gaps = 17/153 (11%)
Query: 62 KLVDCDKVD-AGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKI 119
+LVDCDK + GC GGLM +AFE I K GG+ E +YPY C +K ++ V I
Sbjct: 179 ELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYTAQEGTCDASKVNDLAVSI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ NV ++ V N P++VAI+A + QFY GV L +L+HGV
Sbjct: 237 DGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGV------LTGDCNTDLNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
IVGYG T YWI++NSWGP WGE+
Sbjct: 291 AIVGYGT-----TVDGTNYWIVRNSWGPEWGEQ 318
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 83/152 (54%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRAC-HLNKEEIRVKI 119
+LVDCD + GC GGLM AFE II+ GG++ E+DYPY+ +++ C K V I
Sbjct: 189 ELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDSEEDYPYRAADQKCDQYRKNANVVSI 246
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V ++ K V P++VAI A A Q Y GV F K G +LDHGV
Sbjct: 247 DGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGV-----FTGKCGT-SLDHGV 300
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
VGYG T Q YWI+ NSWG +WGE
Sbjct: 301 AAVGYG------TENGQDYWIVGNSWGKNWGE 326
>gi|268560858|ref|XP_002638172.1| C. briggsae CBR-CPL-1 protein [Caenorhabditis briggsae]
Length = 336
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 86/153 (56%), Gaps = 16/153 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM AFE I + G++ E+ YPYKG + CH NK+ + +
Sbjct: 171 LVDCSTKYGNHGCNGGLMDQAFEYI--RDNHGVDTEESYPYKGRDMKCHFNKKTVGADDK 228
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
YV+ DE ++ + GP+++AI+A + Q Y GV + + C + LDHGV
Sbjct: 229 GYVDTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEE--CSS--EELDHGV 284
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
L+VGYG T H YW++KNSWG WGEK
Sbjct: 285 LLVGYG---TDPEHG--DYWLVKNSWGTGWGEK 312
>gi|255635439|gb|ACU18072.1| unknown [Glycine max]
Length = 142
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 51/106 (48%), Positives = 69/106 (65%), Gaps = 6/106 (5%)
Query: 78 MSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLV 137
M+NA+ ++ GGLE E YPY G C + E+I VKI ++ N+ +DE ++A YLV
Sbjct: 1 MTNAYNYLLES--GGLEEESSYPYTGERGECKFDPEKIAVKITNFTNIPADENQIAAYLV 58
Query: 138 KNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG 183
KNGP+A+ +NA MQ Y GGVS PL +C L+HGVL+VGYG
Sbjct: 59 KNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKRLNHGVLLVGYG 100
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 95/174 (54%), Gaps = 20/174 (11%)
Query: 45 ANLKKIQIRGEGTHLALK---LVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
+L+ R G+ ++L LV C D + GC GGLM +AF+ I + G++ EK Y
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYI--RANKGIDTEKSY 207
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
PY G++ CH K + +V++ ET++ K + GP++VAI+A+ + QFY
Sbjct: 208 PYNGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSD 267
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GV + C ++LDHGVL+VGYG T YW +KNSWG WG++
Sbjct: 268 GVYDEPE--CDS--ESLDHGVLVVGYG------TLNGTDYWFVKNSWGTTWGDE 311
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 82/152 (53%), Gaps = 18/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD + GC GGLM AFE II GG++ E+DYPYK ++ C ++ +V I
Sbjct: 188 ELVDCDTSYNQGCNGGLMDYAFEFIIKN--GGIDTEEDYPYKAADGRCDQTRKNAKVVTI 245
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+Y +V + K + N P++VAI A A Q Y GV +C LDHGV
Sbjct: 246 DAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGV---FDGICG---TELDHGV 299
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
+ VGYG K YWI++NSWG WGE
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGGSWGE 325
>gi|123502829|ref|XP_001328382.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121911324|gb|EAY16159.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 95/186 (51%), Gaps = 23/186 (12%)
Query: 29 SYATKEEYHKRLRIFRANLKKIQIRGEGTHLALKLVDCDKVDAGCGGGLMSNAFETIISK 88
+++ + + I +NL+K+ + LVDC GC GGLM++A++ +I+
Sbjct: 114 AFSAIQAQESQYAITYSNLQKLSEQ--------NLVDCVSTCYGCNGGLMTSAYDYVINH 165
Query: 89 LGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSS-DETEMAKYLVKNGPMAVAIN 147
G EKDY Y + +C + KI SY+ V+ DE ++A + GP AVAI+
Sbjct: 166 QNGKFMLEKDYSYTAAEGSCKFEATKAVSKITSYIPVAEGDEKDLAVKIATYGPAAVAID 225
Query: 148 ANA--MQFYFGGV-SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWG 204
A+A Q Y G+ P NLDHGV VG+G +K YWI++NSWG
Sbjct: 226 ASAWSFQVYSSGIYDEP-----SCSSYNLDHGVGCVGFGKEGSK------NYWIVRNSWG 274
Query: 205 PHWGEK 210
+WGEK
Sbjct: 275 EYWGEK 280
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 88/154 (57%), Gaps = 19/154 (12%)
Query: 62 KLVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LV+C + ++GC GGLM +AF+ II GG++ E DYPYK + C +N+E +V
Sbjct: 191 ELVECSTNGQNSGCNGGLMDDAFDFIIKN--GGIDTEDDYPYKAVDGKCDINRENAKVVS 248
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHG 176
I + +V ++ + + V + P++VAI A +F Y GV F + G +LDHG
Sbjct: 249 IDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGV-----FSGRCGT-SLDHG 302
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V+ VGYG K YWI++NSWGP WGE
Sbjct: 303 VVAVGYGTDNGK------DYWIVRNSWGPKWGES 330
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/164 (40%), Positives = 89/164 (54%), Gaps = 14/164 (8%)
Query: 52 IRGEGTHLA-LKLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ GE L+ +LVDCDK + GC GGLM +AFE II GGL+ E DYPYK + +C
Sbjct: 158 VTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQN--GGLDSEADYPYKAVSGSCD 215
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLC 166
++ V I + +V ++ V N P++VAI A+ Q Y GGV +
Sbjct: 216 ESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGV-----YTG 270
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
G + LDHGV+ VGYG KT YWI++NSWG WGE
Sbjct: 271 HCGYE-LDHGVVAVGYGTSKTP-DGVATDYWIVRNSWGDAWGES 312
>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/159 (38%), Positives = 84/159 (52%), Gaps = 21/159 (13%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPY----KGSNRACHLNKEEI 115
+L+DC K + GC GGLM AF+ + + G++ E YPY N C N I
Sbjct: 201 QLIDCSKSYGNNGCEGGLMDLAFQYV--RDNEGIDSEISYPYISGDGDENVRCLFNSTNI 258
Query: 116 RVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINANAMQF--YFGGV-SHPLKFLCKGGMD 171
++ Y+N+ DE + + GP++VAINA F Y G+ S P C +
Sbjct: 259 MAQVTGYINIHEGDERALMNAVATIGPVSVAINAGLSSFSMYKSGIYSDPE---CASASE 315
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+LDHGVL+VGYG+ K PYW+IKNSWG WG+K
Sbjct: 316 DLDHGVLLVGYGIEDGK------PYWLIKNSWGEDWGDK 348
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 88/152 (57%), Gaps = 15/152 (9%)
Query: 62 KLVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LV+C + ++GC GGLM++AF+ II GG++ E DYPYK + C +N+E +V
Sbjct: 191 ELVECSTNGQNSGCNGGLMADAFDFIIKN--GGIDTEDDYPYKAVDGKCDINRENAKVVS 248
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
I + +V ++ + + V + P++VAI A +F + H F + G +LDHGV+
Sbjct: 249 IDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQ---LYHSGVFSGRCGT-SLDHGVV 304
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG K YWI++NSWGP WGE
Sbjct: 305 AVGYGTDNGK------DYWIVRNSWGPKWGES 330
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 81/152 (53%), Gaps = 17/152 (11%)
Query: 63 LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
L+DC + GC GGLM AF+ I K+ GG++ E YPY+ + C N +
Sbjct: 176 LIDCSTPEGNDGCNGGLMDQAFKYI--KIQGGIDTEAYYPYEAKDDTCRFNITDSGATDT 233
Query: 121 SYVNVSSDETEMAKYLVKN-GPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+V++ S + EM K GP++VAI+A+ + QFY GV C M LDHGV
Sbjct: 234 GFVDIKSGDEEMLKEAAATVGPISVAIDASHTSFQFYSNGVYSETA--CSSTM--LDHGV 289
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
L+VGYG K YW++KNSWG WGE
Sbjct: 290 LVVGYGTENGK------DYWLVKNSWGEGWGE 315
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/152 (37%), Positives = 84/152 (55%), Gaps = 16/152 (10%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKIQ 120
+LVDCD D GC GG M++AF ++ GGL E +YPYK ++ C++NK ++I I+
Sbjct: 178 ELVDCDTNDDGCMGGYMNSAFNYTMTT--GGLTSESNYPYKSTDGTCNINKTKQIATSIK 235
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAI--NANAMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+ +V +++ + V + P+++ I QFY GV C +LDHGV
Sbjct: 236 GFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGV---FSGECS---THLDHGVA 289
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+VGYG YWI+KNSWGP WGE+
Sbjct: 290 VVGYGKSSNG-----SKYWILKNSWGPKWGER 316
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 82/154 (53%), Gaps = 20/154 (12%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
+LVDC +AGC GGLM AFE II+ G + E YPYKG C + ++ V I
Sbjct: 177 QLVDCSTSYGNAGCNGGLMDYAFEYIIANKG--ICAESAYPYKGVGGLCQKSCTKV-VTI 233
Query: 120 QSYVNVSS-DETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHG 176
Y +V+S DE + + GP++VAI A+ QFY GV C NLDHG
Sbjct: 234 SGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV---FSGTCG---HNLDHG 287
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL VGYG T Q YWI+KNSWG WGE
Sbjct: 288 VLAVGYG------TTGSQDYWIVKNSWGTSWGES 315
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 88/153 (57%), Gaps = 19/153 (12%)
Query: 62 KLVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LV+C + ++GC GGLM +AF+ II GG++ E DYPYK + C +N+E +V
Sbjct: 187 ELVECSTNGQNSGCNGGLMDDAFDFIIKN--GGIDTEDDYPYKAVDGKCDINRENAKVVS 244
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHG 176
I + +V ++ + + V + P++VAI A +F Y GV F + G +LDHG
Sbjct: 245 IDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGV-----FSGRCGT-SLDHG 298
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V+ VGYG K YWI++NSWGP WGE
Sbjct: 299 VVAVGYGTDNGK------DYWIVRNSWGPKWGE 325
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK-EEIRVKI 119
+LVDCD K +AGC GGLM +AFE I K GG+ E +YPY + C +K ++ V I
Sbjct: 179 ELVDCDTKKNAGCNGGLMESAFEFIKQK--GGITTESNYPYTAQDGTCDASKANDLAVSI 236
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+ NV +++ V N P++VAI+A QFY GV L+HGV
Sbjct: 237 DGHENVPANDENALLKAVANQPVSVAIDAGGSDFQFYSEGV------FTGDCSTELNHGV 290
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
IVGYG T YW ++NSWGP WGE+
Sbjct: 291 AIVGYGT-----TVDGTNYWTVRNSWGPEWGEQ 318
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 89/163 (54%), Gaps = 14/163 (8%)
Query: 52 IRGEGTHLA-LKLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACH 109
+ GE L+ +LVDCDK + GC GGLM +AFE II GGL+ E DYPYK + +C
Sbjct: 158 VTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQN--GGLDSEADYPYKAVSGSCD 215
Query: 110 LNKEEIRV-KIQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLC 166
++ V I + +V ++ V N P++VAI A+ Q Y GGV +
Sbjct: 216 ESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGV-----YTG 270
Query: 167 KGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
G + LDHGV+ VGYG KT YWI++NSWG WGE
Sbjct: 271 HCGYE-LDHGVVAVGYGTSKTP-DGVATDYWIVRNSWGDAWGE 311
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/157 (38%), Positives = 83/157 (52%), Gaps = 17/157 (10%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC D GC GG M AF+ +I G ++ E YPYK + +C + I I
Sbjct: 173 LVDCSAAEGDMGCSGGWMDYAFKYVIQNRG--IDTEASYPYKAIDESCEFKRNSIGATIH 230
Query: 121 SYVNV-SSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
S+V+V + DE+ + + GP++VAI+A+ + QFY GV + C + LDHGV
Sbjct: 231 SFVDVKTGDESALQNAVASIGPISVAIDASQPSFQFYSSGVYNEPD--CSTEI--LDHGV 286
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPF 214
VGYG T PYW +KNSWG WG+K F
Sbjct: 287 TAVGYG------TLNGVPYWKVKNSWGTSWGQKGYIF 317
>gi|246148|gb|AAB21516.1| Cyclic Protein-2 [Rattus sp.]
Length = 247
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 83/150 (55%), Gaps = 12/150 (8%)
Query: 63 LVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC D+ + GC GGLM AF+ I K GGL+ E+ YPY+ + +C E
Sbjct: 79 LVDCSHDQGNQGCNGGLMDFAFQYI--KENGGLDSEESYPYEAKDGSCKYRAEYAVANDT 136
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+V++ E + K + GP++VA++A+ ++QFY G+ + K +LDHGVL
Sbjct: 137 GFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSK----DLDHGVL 192
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+VGYG T YW++KNSWG WG
Sbjct: 193 VVGYGYEGTDSNK--DKYWLVKNSWGKEWG 220
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 92/174 (52%), Gaps = 19/174 (10%)
Query: 45 ANLKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
+L+ R G ++L LVDC + + GC GGLM NAF I K GG++ E+ Y
Sbjct: 151 GSLEGQHFRQSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYI--KANGGIDTEQAY 208
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNVSS-DETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
PYK + CH + + YV++ S +E ++ + GP++VAI+A+ + Q Y G
Sbjct: 209 PYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSG 268
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GV + C LDHGVL+VGYG YW++KNSWG WG++
Sbjct: 269 GVYYEPD--CSASQ--LDHGVLVVGYGTEDDG-----TDYWLVKNSWGKSWGDQ 313
>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/151 (38%), Positives = 83/151 (54%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYK---GSNRACHLNKEEIRVKI 119
LV CD D GC G M AF+ I+S G + E+ YPY G+ AC+ + + + I
Sbjct: 178 LVSCDTNDLGCRAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPACNKSGKVVGANI 237
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ +E +A++L KNGP+A+A++A + Q Y GGV L ++ L+
Sbjct: 238 DDHVHILDNENAIAEWLAKNGPVAIAVDATSFQRYTGGV------LTSCISKEVNSAALL 291
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGY T K PYWIIKNSWG WGE+
Sbjct: 292 VGY-----DDTSK-PPYWIIKNSWGKGWGEE 316
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 86/152 (56%), Gaps = 19/152 (12%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCD+ + GC GGLM AFE II GG++ ++DYPY G R C K+ +V I
Sbjct: 143 ELVDCDRAFNEGCNGGLMDYAFEFIIRN--GGIDTDQDYPYNGFERKCDPTKKNAKVVSI 200
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
Y +V S + K V + P++VAI A+Q Y GV F K G D LDHGV
Sbjct: 201 DGYEDVPSYMNALKK-AVAHQPVSVAIAGLGRALQLYQSGV-----FTGKCGTD-LDHGV 253
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
++VGYG + YW+++NSWG +WGE
Sbjct: 254 VVVGYG------SENGVDYWLVRNSWGTNWGE 279
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 85/153 (55%), Gaps = 18/153 (11%)
Query: 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-KI 119
+LVDCDK + GC GGLM AF+ II GG++ EKDYPY + C ++ +V I
Sbjct: 92 ELVDCDKTYNDGCNGGLMDYAFQFIIDN--GGIDTEKDYPYTEQDGRCDSYRKNAKVVSI 149
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
SY +V ++ + K + P+AVAI+ + Q Y G+ F K G +LDHGV
Sbjct: 150 NSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQLYNSGI-----FTGKCGT-SLDHGV 203
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
+VGYG K YWI++NSWG WGEK
Sbjct: 204 TVVGYGSESGK------DYWIVRNSWGESWGEK 230
>gi|395535911|ref|XP_003769964.1| PREDICTED: cathepsin K [Sarcophilus harrisii]
Length = 332
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/151 (39%), Positives = 83/151 (54%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC + GCGGG M+NAF+ + + G++ E YPY G + +C N K + Y
Sbjct: 170 LVDCVSKNDGCGGGYMTNAFQYV--QENRGIDSEDAYPYIGQDESCMYNPTGKAAKCRGY 227
Query: 123 VNV-SSDETEMAKYLVKNGPMAVAINA--NAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+ E + + + + GP+AVAI+A ++ QFY GV + C G DNL+H VL
Sbjct: 228 REIPEGSEKALKRAVARVGPVAVAIDASLSSFQFYSKGVYYDEN--CNG--DNLNHAVLA 283
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG+ + +WIIKNSWG WG K
Sbjct: 284 VGYGIQRG------TKHWIIKNSWGEEWGNK 308
>gi|356984263|gb|AET43955.1| cathepsin L2, partial [Reishia clavigera]
Length = 278
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 91/174 (52%), Gaps = 20/174 (11%)
Query: 45 ANLKKIQIRGEGTHLALK---LVDCDKVDA--GCGGGLMSNAFETIISKLGGGLEGEKDY 99
+L+ + GT ++L LVDC K + GC GGLM AFE I K G++ E+ Y
Sbjct: 116 GSLEGQHFKKTGTLVSLSEQNLVDCSKKEGNEGCEGGLMDQAFEYI--KRNKGIDTEQSY 173
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNV-SSDETEMAKYLVKNGPMAVAINA--NAMQFYFG 156
PY+ + C ++ ++ Y ++ E ++ + GP++VAI+A ++ Q Y
Sbjct: 174 PYRAVDEKCRFSRADVGATDTGYTDIHKGSEKDLQSAVATVGPISVAIDASRDSFQLYKS 233
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GV + K C M LDHGVL VGYG +K YWI+KNSWG WG K
Sbjct: 234 GVYYEPK--CSSTM--LDHGVLAVGYGTTDSK------DYWIVKNSWGTQWGMK 277
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 83/153 (54%), Gaps = 19/153 (12%)
Query: 62 KLVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRV-K 118
+LV+C D ++GC GGLM AF II GG++ E DYPYK + C +N+ +V
Sbjct: 196 ELVECSTDGGNSGCNGGLMDAAFNFIIKN--GGIDTEDDYPYKAVDGKCDINRRNAKVVS 253
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF--YFGGVSHPLKFLCKGGMDNLDHG 176
I ++ +V ++ + + V + P++VAI A QF Y GV NLDHG
Sbjct: 254 IDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGRQFQLYKSGV------FSGSCTTNLDHG 307
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
V+ VGYG K YWI++NSWGP WGE
Sbjct: 308 VVAVGYGTENGK------DYWIVRNSWGPKWGE 334
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 92/174 (52%), Gaps = 19/174 (10%)
Query: 45 ANLKKIQIRGEGTHLALK---LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
+L+ R G ++L LVDC + + GC GGLM NAF I K GG++ E+ Y
Sbjct: 151 GSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYI--KANGGIDTEQAY 208
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNVSS-DETEMAKYLVKNGPMAVAINAN--AMQFYFG 156
PYK + CH + + YV++ S +E ++ + GP++VAI+A+ + Q Y G
Sbjct: 209 PYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSG 268
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GV + C LDHGVL+VGYG YW++KNSWG WG++
Sbjct: 269 GVYYEPD--CSASQ--LDHGVLVVGYGTEDDG-----TDYWLVKNSWGKSWGDQ 313
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 91/174 (52%), Gaps = 20/174 (11%)
Query: 45 ANLKKIQIRGEGTHLAL---KLVDC--DKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY 99
+L+ R GT ++L +LVDC D + GC GGLM AF+ I + GG++ E+ Y
Sbjct: 149 GSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYI--QANGGIDTEESY 206
Query: 100 PYKGSNRACHLNKEEIRVKIQSYVNVSS-DETEMAKYLVKNGPMAVAINANAM--QFYFG 156
PY+ N C N + I Y VS DE + + + GP++V I+A+ M QFY
Sbjct: 207 PYEAENGKCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSFQFYES 266
Query: 157 GVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GV + C LDHGVL VGYG T YW++KNSWG WG+K
Sbjct: 267 GVYNEPD--CSSL--ELDHGVLAVGYG------TEDGNDYWLVKNSWGLEWGDK 310
>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
str. Neff]
Length = 330
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 83/151 (54%), Gaps = 13/151 (8%)
Query: 62 KLVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKI 119
++VDCD+ D GC GG A+E +I GGL+ E+ YPY + C + KI
Sbjct: 164 QIVDCDQGNGDYGCDGGDPPTAYEYVIK--AGGLDTEESYPYTAEDGQCAFKPSAVGAKI 221
Query: 120 Q--SYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGV 177
+Y+ + +ETEM L GP+++ ++A++ Q+Y GGV + LC+ D+LDH V
Sbjct: 222 SNWTYITTTKNETEMQYGLASRGPLSICVDASSWQYYIGGV---ITSLCE---DSLDHCV 275
Query: 178 LIVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+I GY V + K W I+NSWG WG
Sbjct: 276 MITGYSVQEGWDFMKYD-VWNIRNSWGEDWG 305
>gi|442736236|gb|AGC65593.1| cathepsin [Achaea janata granulovirus]
Length = 338
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 82/149 (55%), Gaps = 15/149 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
L+DCD+++ GC GGLM AFE II GG+ E DYPY G C N + I
Sbjct: 180 LLDCDQLNYGCDGGLMHWAFEEIIRM--GGVVLEYDYPYTGVESFCA-NNVNMYTTISGC 236
Query: 123 VNVS-SDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
V DE ++ + LV NGP+AVA++ + Y GV C G + L+H VL+VG
Sbjct: 237 VQYDLRDEEKLRELLVTNGPIAVALDIVDIVDYKSGVVS----FC-GTNNGLNHAVLLVG 291
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
YGV KT YW++KNSWG WGE+
Sbjct: 292 YGVDKTI------EYWLLKNSWGTDWGEE 314
>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
Length = 329
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 87/151 (57%), Gaps = 15/151 (9%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LVDC + GCGGG M+NAF+ + + G++ E YPY G + +C N K + Y
Sbjct: 167 LVDCVSENDGCGGGYMTNAFQYV--QQNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGY 224
Query: 123 VNVS-SDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
V +E + + + + GP++VAI+A+ + QFY GV + C G DNL+H VL
Sbjct: 225 REVPVGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDES--CDG--DNLNHAVLA 280
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG+ + HK +WI+KNSWG +WG K
Sbjct: 281 VGYGIQR---GHK---HWILKNSWGENWGNK 305
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 82/152 (53%), Gaps = 17/152 (11%)
Query: 62 KLVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
+LVDCD K +AGC GGLM AF+ I GG+ E YPYK +C + V I
Sbjct: 187 QLVDCDTKGNAGCDGGLMDYAFQYIAKH--GGVAAEDAYPYKARQASCKKSPAPA-VTID 243
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V +++ K V + P++VAI A+ QFY GV F + G + LDHGV
Sbjct: 244 GYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGV-----FAGRCGTE-LDHGVT 297
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYGV YW++KNSWGP WGEK
Sbjct: 298 AVGYGVAADG-----TKYWVVKNSWGPEWGEK 324
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 84/154 (54%), Gaps = 18/154 (11%)
Query: 62 KLVDCDK--VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI-RVK 118
+L+DCD VD GC GGLM +AF+ II G L E YPY+G + C+ N+ I V
Sbjct: 105 ELIDCDTKGVDQGCEGGLMDDAFKFIIQNHG--LSTEVQYPYEGVDGTCNTNEASIHAVT 162
Query: 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHG 176
I Y +V ++ + V N P++VAI+A+ QFY GV F G + LDHG
Sbjct: 163 ITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGV-----FTGSCGTE-LDHG 216
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
V VGYGV YW++KNSWG WGE+
Sbjct: 217 VTAVGYGVGNDG-----TKYWLVKNSWGADWGEE 245
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 81/150 (54%), Gaps = 12/150 (8%)
Query: 63 LVDCDKV--DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQ 120
LVDC + GC GGLM AF+ I K GGL+ E+ YPY+ + +C E
Sbjct: 166 LVDCSHAQGNQGCNGGLMDYAFQYI--KENGGLDSEESYPYEAKDGSCKYRAEFAVANDT 223
Query: 121 SYVNVSSDETEMAKYLVKNGPMAVAINAN--AMQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
+V++ E + K + GP++VA++A+ ++QFY G+ + K NLDHGVL
Sbjct: 224 GFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSK----NLDHGVL 279
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWG 208
+VGYG T YW++KNSWG WG
Sbjct: 280 LVGYGYEGTDSNKN--KYWLVKNSWGSEWG 307
>gi|262093272|gb|ACY25960.1| cathepsin L-like protein [Trypanosoma rangeli]
Length = 159
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 77/142 (54%), Gaps = 15/142 (10%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRVKI 119
LV CD D GC GGLM NAF+ I+ K G + E Y Y G+++ C ++ + I
Sbjct: 30 LVSCDNADNGCDGGLMDNAFDWIVGKNNGTVYTEASYSYVSGGGNSQKCDMSGHVVGAVI 89
Query: 120 QSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
+V++ DE +MA +L NGP+A+A++A + Y GGV L D LDHGV++
Sbjct: 90 SGHVDLPKDEDKMAAWLAANGPLAIAVDATSFMSYTGGV------LTNCISDQLDHGVVL 143
Query: 180 VGYGVHKTKFTHKIQPYWIIKN 201
VGY PYWIIKN
Sbjct: 144 VGYNDSSNP------PYWIIKN 159
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.430
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,032,884,624
Number of Sequences: 23463169
Number of extensions: 172091351
Number of successful extensions: 390064
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2008
Number of HSP's successfully gapped in prelim test: 4314
Number of HSP's that attempted gapping in prelim test: 367599
Number of HSP's gapped (non-prelim): 13660
length of query: 240
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 102
effective length of database: 9,121,278,045
effective search space: 930370360590
effective search space used: 930370360590
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)