BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022276
(300 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
Length = 363
Score = 409 bits (1050), Expect = e-113, Method: Compositional matrix adjust.
Identities = 196/262 (74%), Positives = 228/262 (87%), Gaps = 7/262 (2%)
Query: 18 LASAVA--VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
+A+AV N+DD +IRQVV + EDHLLNAEHHF+ FKSKFSK+YAT+EEHDYRF
Sbjct: 15 VATAVTDDTNNDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFG 70
Query: 76 VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
VFK+NL +AK Q DPTA HG+TKFSDLT SEFRRQFLGL +RLRLPA AQKAPILPT
Sbjct: 71 VFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPTT 130
Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
+LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLSEQQLVDCDH
Sbjct: 131 NLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHV 190
Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFDKSK+ A+VSN
Sbjct: 191 CDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGRD-GSCKFDKSKVVASVSN 249
Query: 256 FSVISSDEDQMAANLVKHGPLA 277
FSV++ DEDQ+AANLVK+GPLA
Sbjct: 250 FSVVTLDEDQIAANLVKNGPLA 271
>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
SV=1
Length = 368
Score = 395 bits (1015), Expect = e-109, Method: Compositional matrix adjust.
Identities = 196/279 (70%), Positives = 227/279 (81%), Gaps = 6/279 (2%)
Query: 1 MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
M+RL L S+ +L V S+ VND DD +IRQVV +E +L +E HFSLFK
Sbjct: 1 MDRLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56
Query: 59 KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116
Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
+LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
G+LVSLSEQQLVDCDHECDPEE+ SCDSGCNGGLMNSAFEY LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD 236
Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275
>sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabidopsis thaliana
GN=At2g21430 PE=2 SV=2
Length = 361
Score = 391 bits (1004), Expect = e-108, Method: Compositional matrix adjust.
Identities = 186/272 (68%), Positives = 218/272 (80%), Gaps = 4/272 (1%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
L L + L V S D+D +IRQVV +++E +L++E HF+LFK KF K Y
Sbjct: 5 LRVLFSVSLIFVFVSVSVCGDEDVLIRQVV----DETEPKVLSSEDHFTLFKKKFGKVYG 60
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+ LG+ +LP D
Sbjct: 61 SIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKGGFKLPKD 120
Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
A +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LVSLS
Sbjct: 121 ANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLS 180
Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
EQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK GG+ REKDYPYTGTDGGSCK D
Sbjct: 181 EQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLD 240
Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
+SKI A+VSNFSV+S +EDQ+AANL+K+GPLA
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLA 272
>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
Length = 371
Score = 349 bits (896), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 173/278 (62%), Positives = 207/278 (74%), Gaps = 12/278 (4%)
Query: 27 DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
+D +IRQVVP G D LNAE HF F +F K+Y +EH YR VFK NLRRA+R
Sbjct: 24 EDPLIRQVVP--GGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARR 81
Query: 87 RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
QLLDP+A HGVTKFSDLTP+EFRR +LGL + R L A +AP+LPT+ LP DF
Sbjct: 82 HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDF 141
Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
DWRDHGAV VK+QG+CGSCWSFSA+GALEGAH+L+TG+L LSEQQ VDCDHECD E
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEP 201
Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+D G CKFDKSKI A+V NFSV+S
Sbjct: 202 DSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSD-GKCKFDKSKIVASVQNFSVVSV 260
Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
DE Q++ANL+KHGPLA + + + +++ VS P
Sbjct: 261 DEAQISANLIKHGPLAIGINAAYMQ----TYIGGVSCP 294
>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium discoideum GN=cprA PE=1 SV=2
Length = 343
Score = 215 bits (548), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 111/239 (46%), Positives = 145/239 (60%), Gaps = 10/239 (4%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFS 102
L + F F+ KF+K Y + EE+ RF +FK+NL + + L+ GV KF+
Sbjct: 23 LEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81
Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACG 159
DL+ EF+ +L N+ D A L N +PT FDWR GAVT VK+QG CG
Sbjct: 82 DLSSDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG 140
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFE 218
SCWSFS TG +EG HF+S +LVSLSEQ LVDCDHEC + E +CD GCNGGL +A+
Sbjct: 141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYN 200
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YI+K GG++ E YPYT G C F+ + I A +SNF++I +E MA +V GPLA
Sbjct: 201 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLA 259
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 179 bits (455), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 105/258 (40%), Positives = 141/258 (54%), Gaps = 24/258 (9%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKF 101
L+ E H +K + K YA + E +R ++F N + AK QL V G+ K+
Sbjct: 23 LIKEEWH--TYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKY 80
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN------DLPTDFDWRDHGAVTGVKDQ 155
+D+ EF+ G N LR + + T +P DWR+HGAVTGVKDQ
Sbjct: 81 ADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQ 140
Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
G CGSCW+FS+TGALEG HF G LVSLSEQ LVDC + ++GCNGGLM++
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDN 193
Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHG 274
AF YI GG++ EK YPY G D SC F+K+ I A + F I DE++M + G
Sbjct: 194 AFRYIKDNGGIDTEKSYPYEGID-DSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMG 252
Query: 275 PLAGNVASIELPHISFSF 292
P++ +I+ H SF
Sbjct: 253 PVS---VAIDASHESFQL 267
>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
PE=3 SV=1
Length = 337
Score = 179 bits (454), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 97/265 (36%), Positives = 148/265 (55%), Gaps = 29/265 (10%)
Query: 30 MIRQVVPSDGEQSEDHLL----NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
MI ++ Q E HL +A+H+F F ++K Y + +YRF++FK NL
Sbjct: 5 MIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNLEDIN 64
Query: 86 RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK------------APILP 133
+ L+ +A++ + KFSDL+ +E ++ GL + P++ + AP
Sbjct: 65 EKNKLNDSAIYNINKFSDLSKNELLTKYTGLTSKK--PSNMVRSTSNFCNVIHLDAPPDV 122
Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
++LP +FDWR + +T VKDQGACGSCW+ +A G LE + + L++LSEQQL+DCD
Sbjct: 123 HDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD 182
Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
S + C+GGLM++AFE ++ AGG+ E DYPY GT G CK D K A +V
Sbjct: 183 ---------SANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTK-GVCKIDNKKFALSV 232
Query: 254 SNFS-VISSDEDQMAANLVKHGPLA 277
S+ I +E+ + L+ GP+A
Sbjct: 233 SSCKRYIFQNEENLKKELITMGPIA 257
>sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium discoideum GN=cprB PE=2 SV=1
Length = 376
Score = 177 bits (448), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 99/247 (40%), Positives = 140/247 (56%), Gaps = 16/247 (6%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
F+ + KF++ Y++ E + R+ +FK+N+ D V G+ F+D+T E+R+
Sbjct: 36 FTEWTLKFNRQYSSSEFSN-RYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRK 94
Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+LG +L DL P DWR AVT +KDQG CGSCWSFS TG
Sbjct: 95 TYLGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTG 154
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
+ EGAH L T +LVSLSEQ LVDC PEE + GC+GGLMN+AF+YI+K G++
Sbjct: 155 STEGAHALKTKKLVSLSEQNLVDCS---GPEE----NFGCDGGLMNNAFDYIIKNKGIDT 207
Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
E YPYT G +C F+KS I A + + I++ + N +HGP++ +I+ H
Sbjct: 208 ESSYPYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVS---VAIDASHN 264
Query: 289 SFSFLFT 295
SF L+T
Sbjct: 265 SFQ-LYT 270
>sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei brucei PE=1 SV=1
Length = 450
Score = 176 bits (447), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 109/288 (37%), Positives = 151/288 (52%), Gaps = 43/288 (14%)
Query: 1 MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
M R + ++LL +++ LAS V E+ L E F+ FK K+
Sbjct: 6 MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48
Query: 61 SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
K Y +E +RFR F+ N+ +AK + +P A GVT FSD+T EFR + F
Sbjct: 49 GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108
Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
+RLR K + T P DWR+ GAVT VK QG CGSCW+FS G +EG
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQ 162
Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
++ LVSLSEQ LV CD + DSGCNGGLM++AF +I+ + G V E
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213
Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
YPY +G C+ + +I AA+++ + DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 171 bits (434), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 92/222 (41%), Positives = 134/222 (60%), Gaps = 15/222 (6%)
Query: 62 KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
K Y E + RF++FK NL+ + + D T G+T+F+DLT EFR +L +++
Sbjct: 53 KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110
Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
D+ K + D LP + DWR +GAV VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
TGEL+SLSEQ+LVDCD G ++GC+GG+MN AFE+I+K GG+E ++DYPY
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLA 277
D G C DK+ V+ + + D+++ V H P++
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS 265
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 169 bits (428), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/274 (35%), Positives = 152/274 (55%), Gaps = 20/274 (7%)
Query: 10 LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
+L L ++SAV ++ + V + G +SE +++ + L K +++ + E
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68
Query: 70 HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
D RF +FK NLR + + G+T+F+DLT E+R ++LG RR L
Sbjct: 69 KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128
Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
+A+ ++LP DWR GAV VKDQG CGSCW+FS GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
LSEQ+LVDCD S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235
Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K+ + ++ + + ++ V H P++
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPIS 269
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 169 bits (428), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 104/252 (41%), Positives = 138/252 (54%), Gaps = 23/252 (9%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
+ FK + K Y + E +R ++F N + AK Q V V K++DL E
Sbjct: 59 WHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 118
Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
FR+ G N ++LR ++ K I P + LP DWR GAVT VKDQG CGSC
Sbjct: 119 FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 178
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS+TGALEG HF +G LVSLSEQ LVDC + ++GCNGGLM++AF YI
Sbjct: 179 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDNAFRYIK 231
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
GG++ EK YPY D SC F+K + A F+ I DE +MA + GP++
Sbjct: 232 DNGGIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 287
Query: 281 ASIELPHISFSF 292
+I+ H SF F
Sbjct: 288 VAIDASHESFQF 299
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 169 bits (427), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 132/253 (52%), Gaps = 17/253 (6%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK K+ + Y EE YR +F+ N + K+ + + T + KF
Sbjct: 13 LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
D+T EF G R P P T T+ DWR GAVT VKDQG CGSC
Sbjct: 73 GDMTLEEFNAVMKGNIPRRSAPVSV-FYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSC 131
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+FS TG+LEG HFL TG L+SL+EQQLVDC P+ GCNGG MN AF+YI
Sbjct: 132 WAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQ-------GCNGGWMNDAFDYIK 184
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNV 280
G++ E YPY D GSC+FD + +AA S + I+S + V+ GP++
Sbjct: 185 ANNGIDTEAAYPYEARD-GSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPIS--- 240
Query: 281 ASIELPHISFSFL 293
+I+ H SF F
Sbjct: 241 VTIDAAHSSFQFY 253
>sp|Q26534|CATL_SCHMA Cathepsin L OS=Schistosoma mansoni GN=CL1 PE=2 SV=1
Length = 319
Score = 166 bits (421), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/234 (38%), Positives = 136/234 (58%), Gaps = 17/234 (7%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTP 106
N + + FK K+ K Y E+ + RF +FK+N+ +A+ Q+ + +A++GVT +SDLT
Sbjct: 15 NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTT 73
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
EF R L +P+ P N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 74 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 131
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FS TG +E F TG+L+SLSEQQLVDCD D GCNGGL ++A+E I+K
Sbjct: 132 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 182
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
GG+ E +YPY + C +A +++ ++ DE ++AA L + ++
Sbjct: 183 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTIS 235
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 166 bits (420), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 97/257 (37%), Positives = 143/257 (55%), Gaps = 22/257 (8%)
Query: 44 DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
+HL N + LF+S + SK Y + EE +RF VF+ NL +R + G+ +
Sbjct: 39 EHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNE 98
Query: 101 FSDLTPSEFRRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
F+DLT EF+ ++LGL + R R P+ + + DLP DWR GAV VKDQG
Sbjct: 99 FADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQG 156
Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
CGSCW+FS A+EG + ++TG L SLSEQ+L+DCD + +SGCNGGLM+ A
Sbjct: 157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDT--------TFNSGCNGGLMDYA 208
Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGP 275
F+YI+ GG+ +E DYPY + G C+ K + +S + + ++D+ + H P
Sbjct: 209 FQYIISTGGLHKEDDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQP 267
Query: 276 LAGNVASIELPHISFSF 292
++ +IE F F
Sbjct: 268 VS---VAIEASGRDFQF 281
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 165 bits (417), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 128/234 (54%), Gaps = 16/234 (6%)
Query: 62 KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
K+Y T EE R+ +FKAN+ ++ V G+ F+D+T E+R +LG
Sbjct: 39 KSY-TSEEFGARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYRNTYLGTKFDAS 97
Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
Q+ + T+ + DWR GAVT VK+QG CG CWSFS TG+ EGAHF S GEL
Sbjct: 98 SLIGTQEEKVFTTSSAASK-DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGEL 156
Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
VSLSEQ L+DC E +SGC+GGLM AFEYI+ G++ E YPY + G
Sbjct: 157 VSLSEQNLIDCSTE---------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKA-ENGK 206
Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
C++ A +S++ +++ + + V P++ +I+ H SF L+T
Sbjct: 207 CEYKSENSGATLSSYKTVTAGSESSLESAVNVNPVS---VAIDASHQSFQ-LYT 256
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 164 bits (416), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 97/231 (41%), Positives = 123/231 (53%), Gaps = 23/231 (9%)
Query: 56 FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
FK+KF K YA EE +R VF L+ +R + T + FSDLT E
Sbjct: 23 FKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEVLA 82
Query: 112 QFLGLNRRLR----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
G+ RR LP A PT + D DWR+ GAVT VKDQG CGSCW+FSA
Sbjct: 83 TKTGMTRRRHPLSVLPKSA------PTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAV 136
Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
ALEGAHFL TG+LVSLSEQ LVDC S + GCNGG A++YI+ G++
Sbjct: 137 AALEGAHFLKTGDLVSLSEQNLVDC-------SSSYGNQGCNGGWPYQAYQYIIANRGID 189
Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
E YPY D +C++D I A VS++ S DE + + GP++
Sbjct: 190 TESSYPYKAID-DNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVS 239
>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
Length = 467
Score = 163 bits (412), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)
Query: 52 HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ FK K + Y + E +R VF+ NL A+ +P A GVT FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
+ F R R+P + P DWR GAVT VKDQG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150
Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
SA G +E FL+ L +LSEQ LV CD DSGC+GGLMN+AFE+I++
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201
Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G V E YPY +G S C + A ++ + DE Q+AA L +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258
>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
GN=CG12163 PE=2 SV=2
Length = 614
Score = 162 bits (410), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 135/232 (58%), Gaps = 15/232 (6%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSE 108
+H F F+ +F + Y + E R R+F+ NL+ + + +A +G+T+F+D+T SE
Sbjct: 305 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 364
Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
++ + GL +R A A ++P +LP +FDWR AVT VK+QG+CGSCW+FS
Sbjct: 365 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 423
Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
TG +EG + + TGEL SEQ+L+DCD + DS CNGGLM++A++ I GG+
Sbjct: 424 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 474
Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
E E +YPY C F+++ V+ F + +E M L+ +GP++
Sbjct: 475 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPIS 525
>sp|Q94504|CYSP7_DICDI Cysteine proteinase 7 OS=Dictyostelium discoideum GN=cprG PE=1 SV=1
Length = 460
Score = 161 bits (408), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 122/217 (56%), Gaps = 22/217 (10%)
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EE + R+ +FKAN+ V G+ F+D++ E+R +LG P D
Sbjct: 42 SSEEFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEYRATYLGT------PFD 95
Query: 126 AQKAPILPTN---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE-- 180
A + ++ D DWR GAVT +K+QG CG CWSFS TGA EGA +L+ G+
Sbjct: 96 ASSLEMTESDKIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKN 155
Query: 181 LVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
LVSLSEQ L+DC SGS ++GC GGLM AFEYI+ G++ E YPYT DG
Sbjct: 156 LVSLSEQNLIDC--------SGSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDG 207
Query: 240 GSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGP 275
CKF+ +AA +S++ +V S E +AA V GP
Sbjct: 208 KKCKFNPKNVAAQLSSYVNVTSGSESDLAAK-VTQGP 243
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 160 bits (406), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 91/255 (35%), Positives = 138/255 (54%), Gaps = 22/255 (8%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
+PSDG+ D + + + + ++ KT + D RF +FK NLR
Sbjct: 33 LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNED 90
Query: 91 DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
+ A + G+TKF+DLT E+R+ +LG RR+ + + N ++P D
Sbjct: 91 NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150
Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
WR GAV +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202
Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
S + GCNGGLM+ AF++I+K GG+ EKDYPY G G F K+ ++ + + +
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262
Query: 263 EDQMAANLVKHGPLA 277
++ + + P++
Sbjct: 263 DETALKKAISYQPVS 277
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 160 bits (404), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/253 (37%), Positives = 133/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D NA+ H +KS + Y T EE ++R V++ N+R + HG T
Sbjct: 22 DQTFNAQWH--QWKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H+ + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 159 bits (403), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 102/257 (39%), Positives = 137/257 (53%), Gaps = 22/257 (8%)
Query: 26 DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
D+ IR V SDG E+S +L H F+ F ++ K Y EE RF +FK
Sbjct: 27 DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84
Query: 79 ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
NL + + GV +F+DLT EF+R LG + A + + + LP
Sbjct: 85 ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142
Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
DWR+ G V+ VKDQG CGSCW+FS TGALE A+ + G+ +SLSEQQLVDC +
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201
Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
+ GCNGGL + AFEYI GG++ EK YPYTG D +CKF + V N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254
Query: 256 FSVISSDEDQMAANLVK 272
++ + DE + A LV+
Sbjct: 255 ITLGAEDELKHAVGLVR 271
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 159 bits (402), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 120/209 (57%), Gaps = 16/209 (7%)
Query: 35 VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
+ S GE+SE+ A ++ +K++ K+Y E + R+ F+ NLR
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 95 VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
VH G+ +F+DLT E+R +LGL + R + N+ LP DWR GAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
+KDQG CGSCW+FSA A+EG + + TG+L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
GGLM+ AF++I+ GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 159 bits (402), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 96/255 (37%), Positives = 139/255 (54%), Gaps = 19/255 (7%)
Query: 28 DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
D I P D E S D L+ F + S F K Y T EE RF VFK NL+
Sbjct: 30 DYSIVGYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDET 85
Query: 88 QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWR 144
+ G+ +F+DL+ EF++ +LGL + + + D+ P DWR
Sbjct: 86 NKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWR 145
Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
GAV VK+QG+CGSCW+FS A+EG + + TG L +LSEQ+L+DCD +
Sbjct: 146 KKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT--------TY 197
Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--DKSKIAAAVSNFSVISSD 262
++GCNGGLM+ AFEYI+K GG+ +E+DYPY+ + G+C+ D+S+ + V ++D
Sbjct: 198 NNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKDESETVTINGHQDVPTND 256
Query: 263 EDQMAANLVKHGPLA 277
E + L H PL+
Sbjct: 257 EKSLLKALA-HQPLS 270
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 159 bits (401), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 106/279 (37%), Positives = 152/279 (54%), Gaps = 20/279 (7%)
Query: 3 RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
+L LSS +LL+L + AS D+ I+ V + + E + +L H FS F
Sbjct: 4 KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63
Query: 58 SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
++ K Y + EE RF VFK NL + + + +F+DLT EF+R LG
Sbjct: 64 HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123
Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
+ A + + + +P DWR+ G V+ VK+QG CGSCW+FS TGALE A+ +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
G+ +SLSEQQLVDC +G+ ++ GC+GGL + AFEYI GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233
Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVK 272
DGG CKF I V N ++ + DE + A LV+
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR 271
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 158 bits (399), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 93/253 (36%), Positives = 136/253 (53%), Gaps = 23/253 (9%)
Query: 50 EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
+HH++L+K +SK Y + E R +++ NL+ L +H G+ D+T
Sbjct: 25 DHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 84
Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
E + L LR+P+ Q+ +N LP DWR+ G VT VK QG+CG+CW
Sbjct: 85 GEEV----ISLMGSLRVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 140
Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
+FSA GALE L TG+LVSLS Q LVD C E+ G + GCNGG M +AF+YI+
Sbjct: 141 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTTAFQYIID 194
Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
G++ E YPY + G C++D K AA S ++ + ED + + GP++
Sbjct: 195 NNGIDSEASYPYKAMN-GKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVS---V 250
Query: 282 SIELPHISFSFLF 294
+I+ H SF FL+
Sbjct: 251 AIDASHYSF-FLY 262
>sp|P54639|CYSP4_DICDI Cysteine proteinase 4 OS=Dictyostelium discoideum GN=cprD PE=2 SV=2
Length = 442
Score = 158 bits (399), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 128/234 (54%), Gaps = 13/234 (5%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L + F+ + +TY++ EE + R+++FK+N+ + V G+ F+D+T
Sbjct: 24 LQYRNAFTNWMQAHQRTYSS-EEFNARYQIFKSNMDYVHQWNSKGGETVLGLNVFADITN 82
Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
E+R +LG ++ I T PT DWR GAVT +K+QG CG CWSFS
Sbjct: 83 QEYRTTYLGTPFDGSALIGTEEEKIFST-PAPT-VDWRAQGAVTPIKNQGQCGGCWSFST 140
Query: 167 TGALEGAHFLSTG---ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
TG+ EGAHF+++G +LVSLSEQ L+DC ++GC GGLM AFEYI+
Sbjct: 141 TGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYG-------NNGCEGGLMTLAFEYIINN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
G++ E YPYT DG CKF S I A + ++ ++S + + + P++
Sbjct: 194 KGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQSASNNAPVS 247
>sp|Q8QLK1|CATV_NPVMC Viral cathepsin OS=Mamestra configurata nucleopolyhedrovirus
GN=VCATH PE=3 SV=1
Length = 337
Score = 157 bits (398), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 158/290 (54%), Gaps = 35/290 (12%)
Query: 6 LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
++ +L+LLL L SAV + D QVV + + ++ +A +F F S+++K Y+
Sbjct: 1 MNKILILLL---LVSAVLTSHD-----QVVAVTIKPNLYNINSAPLYFEKFISQYNKQYS 52
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+++E YR+ +F+ N+ + + +AV+ + +F+D+T +E +NR L +
Sbjct: 53 SEDEKKYRYNIFRHNIESINAKNSRNDSAVYKINRFADMTKNEV------VNRHTGLASG 106
Query: 126 AQKAPILPT--------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
A T P +FDWR++ VT VKDQG CG+CW+F+ GALE + +
Sbjct: 107 DIGANFCETIVVDGPGQRQRPANFDWRNYNKVTSVKDQGMCGACWAFAGLGALESQYAIK 166
Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
L+ L+EQQLVDCD D GC+GGL+++A+E I+ GGVE+E DYPY
Sbjct: 167 YDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMHIGGVEQEYDYPYKAV 217
Query: 238 DGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
C K A V N + + E+++ +L++H GP+A V +++L
Sbjct: 218 R-LPCAVKPHKFAVGVRNCYRYVLLSEERL-EDLLRHVGPIAIAVDAVDL 265
>sp|Q8V5U0|CATV_NPVHZ Viral cathepsin OS=Heliothis zea nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 367
Score = 157 bits (397), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 28/254 (11%)
Query: 49 AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDPTAVH 96
+E +F F +++K+Y +E+ YR+ VFK NL + ++ R+ L +A
Sbjct: 53 SEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQF 112
Query: 97 GVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
GV KFSD TP E FL L++ L + + P LP +DWRD VT +
Sbjct: 113 GVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPDIRLPDYYDWRDTNKVTPI 171
Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
KDQG CGSCW+F A G +E + + +L+ LSEQQL+DCD D GCNGGL
Sbjct: 172 KDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCNGGL 222
Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLV 271
M+ AF+ +L GGVE E DYPY G++ C D KIA + S F DE+++ +
Sbjct: 223 MHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLKELVY 281
Query: 272 KHGPLAGNVASIEL 285
GP+A V ++++
Sbjct: 282 TTGPVAIAVDAMDI 295
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 157 bits (397), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 83/217 (38%), Positives = 122/217 (56%), Gaps = 16/217 (7%)
Query: 69 EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
+ D RF +FK NLR + A + G+T F++LT E+R +LG RR+
Sbjct: 24 QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83
Query: 123 PADA--QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
+ + + + +++P DWR GAV +KDQG CGSCW+FS A+EG + + TGE
Sbjct: 84 AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143
Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
LVSLSEQ+LVDCD S + GCNGGLM+ AF++I+K GG+ EKDYPY GT+G
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGK 195
Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
K+ + + + S ++ V + P++
Sbjct: 196 CNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVS 232
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 156 bits (395), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
D +AE H +KS + Y T EE ++R +++ N+R + HG +
Sbjct: 22 DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P++ +P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSA+G LEG FL TG+L+SLSEQ LVDC H + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
I + GG++ E+ YPY D GSCK+ A + F I E + + GP++
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246
Query: 280 VASIELPHISFSF 292
+++ H S F
Sbjct: 247 -VAMDASHPSLQF 258
>sp|Q94503|CYSP6_DICDI Cysteine proteinase 6 OS=Dictyostelium discoideum GN=cprF PE=2 SV=1
Length = 434
Score = 156 bits (394), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 90/235 (38%), Positives = 128/235 (54%), Gaps = 26/235 (11%)
Query: 66 TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
+ EE + RF +FKAN+ V G+ F+D+T E+R +LG P D
Sbjct: 42 SSEEFNGRFNIFKANMDYINEWNTKGSETVLGLNVFADITNEEYRATYLGT------PFD 95
Query: 126 AQKAPILPTNDL-----PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG- 179
A + P+ + DWR GAVT +K+QG CG CWSFSATGA EGA +++ G
Sbjct: 96 ASSLEMTPSEKVFGGVQANSVDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGD 155
Query: 180 -ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
+L S+SEQQL+DC SGS ++GC GGLM AFEYI+ GG++ E YP+T
Sbjct: 156 SDLTSVSEQQLIDC--------SGSYGNNGCEGGLMTLAFEYIINNGGIDTESSYPFT-A 206
Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
+ CK++ S I A +S++ ++S + A V GP + +I+ SF F
Sbjct: 207 NTEKCKYNPSNIGAELSSYVNVTSGSESDLAAKVTQGPTS---VAIDASQPSFQF 258
>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
Length = 484
Score = 156 bits (394), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 101/238 (42%), Positives = 137/238 (57%), Gaps = 14/238 (5%)
Query: 42 SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
S+D + F F +++TY ++EE +R VF N+ RA++ Q LD TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235
Query: 101 FSDLTPSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
FSDLT EFR +L N LR P + K + P ++DWR GAVT VKDQG CG
Sbjct: 236 FSDLTEEEFRTIYL--NTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCG 293
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FS TG +EG FL+ G L+SLSEQ+L+DCD D C GGL ++A+
Sbjct: 294 SCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSA 344
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
I GG+E E DY Y G SC F K +++ +S +E ++AA L K GP++
Sbjct: 345 IKNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 401
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 155 bits (392), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 132/258 (51%), Gaps = 24/258 (9%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK KF + Y EE YR VF NL+ K+ + + T + +F
Sbjct: 13 LAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQF 72
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP--TDFDWRDHGAVTGVKDQGACG 159
SD+T +F G + R PA A T+ P T+ DWR GAVT VKDQG CG
Sbjct: 73 SDMTNEKFNAVMKGYKKGPR-PA----AVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCG 127
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFE 218
SCW+FS TG +EG HFL TG LVSLSEQQLVDC GS + GCNGG + A
Sbjct: 128 SCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC-------AGGSYYNQGCNGGWVERAIM 180
Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLA 277
Y+ GGV+ E YPY D +C+F+ + I A + + I+ + + GP++
Sbjct: 181 YVRDNGGVDTESSYPYEARD-NTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPIS 239
Query: 278 GNVASIELPHISFSFLFT 295
+I+ H SF +T
Sbjct: 240 ---VAIDASHRSFQSYYT 254
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 154 bits (390), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 108/285 (37%), Positives = 153/285 (53%), Gaps = 29/285 (10%)
Query: 1 MERLILSSLLLLLLSSVLASAVA---VNDDDAMIRQVVPSDGEQSEDHLLNAEHH----- 52
M RL SL+L+L++ + A+A+A D IRQVV D + E+ +L
Sbjct: 1 MSRL---SLVLILVAGLFATALAGPATFADKNPIRQVVFPD--ELENGILQVVGQTRSAL 55
Query: 53 -FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
F+ F + K Y + EE RF +F NL+ + + G+ +F+DLT EFR+
Sbjct: 56 SFARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFRK 115
Query: 112 QFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
LG ++ + K + TN LP DWR G V+ VK QG CGSCW+FS TGAL
Sbjct: 116 HKLGASQNC---SATTKGNLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGAL 172
Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
E A+ + G+ +SLSEQQLVDC + + GCNGGL + AFEYI GG++ E+
Sbjct: 173 EAAYAQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKFNGGLDTEE 225
Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
YPYTG + G CKF ++ I V N ++ + E + A LV+
Sbjct: 226 AYPYTGKN-GICKFSQANIGVKVISSVNITLGAEYELKYAVALVR 269
>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
Length = 334
Score = 154 bits (389), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 92/249 (36%), Positives = 126/249 (50%), Gaps = 17/249 (6%)
Query: 48 NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
N + H+ +K+ + Y EE ++R V++ N + HG + F D
Sbjct: 24 NLDAHWHQWKATHRRLYGMNEE-EWRRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGD 82
Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
+T EFR+ G + P+L D+P DW G VT VK+QG CGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKHKKGKLFHEPLLV--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140
Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF+YI
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNQGCNGGLMDNAFQYIKDN 193
Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
GG++ E+ YPY TD SC + AA + F I E + + GP++ +I
Sbjct: 194 GGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250
Query: 284 ELPHISFSF 292
+ H SF F
Sbjct: 251 DAGHTSFQF 259
>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
Length = 333
Score = 154 bits (388), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 91/253 (35%), Positives = 130/253 (51%), Gaps = 19/253 (7%)
Query: 44 DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
D LNA+ + +K+ + Y EE +R V++ N++ + HG T
Sbjct: 22 DQSLNAQWY--QWKATHRRLYGMNEE-GWRRAVWEKNMKMIELHNREYSQGKHGFTMAMN 78
Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
F D+T EFR+ G + + P+ ++P DWR+ G VT VK+QG CG
Sbjct: 79 AFGDMTNEEFRQVMNGFQNQKHKKGKMFQEPLFA--EIPKSVDWREKGYVTPVKNQGQCG 136
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
SCW+FSATGALEG F TG+LVSLSEQ LVDC + GCNGGLM++AF Y
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNEGCNGGLMDNAFRY 189
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
+ GG++ E+ YPY G D +C + AA + F + E + + GP++
Sbjct: 190 VKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQREKALMKAVATLGPIS-- 247
Query: 280 VASIELPHISFSF 292
+I+ H SF F
Sbjct: 248 -VAIDAGHQSFQF 259
>sp|Q9YMP9|CATV_NPVLD Viral cathepsin OS=Lymantria dispar multicapsid nuclear
polyhedrosis virus GN=VCATH PE=3 SV=1
Length = 356
Score = 153 bits (387), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 45 HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLD-PTAVHGVTKF 101
+L A +F F ++K Y + E + R+ +FK NL AK D PTA + + KF
Sbjct: 48 NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107
Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACG 159
SDL+ SE +F GL+ R+ ++ K IL P + P FDWR+ VT +K+QGACG
Sbjct: 108 SDLSKSELIAKFTGLSIPERV-SNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACG 166
Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
+CW+F+ ++E + L+ LSEQQL+DCD S D GCNGGL+++AFE
Sbjct: 167 ACWAFATLASVESQFAMRHNRLIDLSEQQLIDCD---------SVDMGCNGGLLHTAFEE 217
Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPL 276
I++ GGV+ E DYP+ G + C D+ + + + V + + +E+++ L GP+
Sbjct: 218 IMRMGGVQTELDYPFVGRN-RRCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPI 275
>sp|P41721|CATV_NPVBM Viral cathepsin OS=Bombyx mori nuclear polyhedrosis virus GN=VCATH
PE=1 SV=1
Length = 323
Score = 153 bits (386), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/236 (36%), Positives = 133/236 (56%), Gaps = 21/236 (8%)
Query: 47 LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
L A ++F F +F+K Y+++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80
Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+C
Sbjct: 81 DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136
Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
W+F+ G+LE + EL++LSEQQ++DCD D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187
Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
K GGV+ E DYPY D +C+ + +K V + + I E+++ L GP+
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPI 242
>sp|Q9WGE0|CATV_NPVHC Viral cathepsin OS=Hyphantria cunea nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 324
Score = 153 bits (386), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 130/236 (55%), Gaps = 18/236 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F KF+K Y+++ E RF++F+ NL + D TA + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGL----ALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQL+DCD+ D+GCNGGL+++A+E +
Sbjct: 137 CWAFATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
++ GGV+ E DYPY G+DG + + I+ E+++ L GP+
Sbjct: 188 MQMGGVQAENDYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPI 243
>sp|P35591|CYSP1_LEIPI Cysteine proteinase 1 OS=Leishmania pifanoi GN=CYS1 PE=2 SV=2
Length = 354
Score = 153 bits (386), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 101/275 (36%), Positives = 143/275 (52%), Gaps = 27/275 (9%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
LL + V+ V A+I Q P D+ + A H+ FK + K + E
Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPP-----VDNFV-ASAHYGSFKKRHGKAFGGDAEEG 60
Query: 72 YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
+RF FK N++ A +P A + V+ KF+DLTP EF + +L + R D K
Sbjct: 61 HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYARHLKD-HKED 119
Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + P+ DWRD GAVT VK+QG CGSCW+FSA G +EG S LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
LV CD + D GCNGGLM+ A +I+++ G V E YPY T GG +
Sbjct: 180 MLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228
Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
D+ ++ A ++ F + DE+++A + K GP+A
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVA 263
>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata multicapsid polyhedrosis
virus GN=VCATH PE=3 SV=1
Length = 324
Score = 153 bits (386), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/237 (36%), Positives = 132/237 (55%), Gaps = 20/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F KF+K Y+++ E +RF++F+ NL + D TA + + KFSDL+
Sbjct: 21 LLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQNDSTAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + IL P + P +FDWR VT VK+QG CG+
Sbjct: 81 KEEAISKYTGLS----LPHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + L++LSEQQ +DCD ++GC+GGL+++AFE
Sbjct: 137 CWAFATLGSLESQFAIKYNRLINLSEQQFIDCDR---------VNAGCDGGLLHTAFESA 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPL 276
++ GGV+ E DYPY T G C+ + ++ V S I E+++ L GP+
Sbjct: 188 MEMGGVQMESDYPYE-TANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPI 243
>sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana defective polyhedrosis
virus GN=Vcath PE=3 SV=1
Length = 324
Score = 153 bits (386), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 132/237 (55%), Gaps = 20/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A +F F F+K Y+++ E +RF++F+ NL + L D +A + + KFSDL+
Sbjct: 21 LLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGLS----LPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + +L++LSEQQL+DCD D GC+GGL+++A+E +
Sbjct: 137 CWAFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
+ GG++ E DYPY + G C+ + +K V + + E+++ L GPL
Sbjct: 188 MNMGGIQAENDYPYEANN-GDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPL 243
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 152 bits (384), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 95/253 (37%), Positives = 128/253 (50%), Gaps = 19/253 (7%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
L A + FK+++ + Y +E YR RVF+ N + K+ + + T + +F
Sbjct: 13 LATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQF 72
Query: 102 SDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
D+T EF G + R P A P + D DWR VT VKDQ CGS
Sbjct: 73 GDMTNEEFNAVMKGYKKGSRGEPKAVFTAEAGP---MAADVDWRTKALVTPVKDQEQCGS 129
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+FSATGALEG HFL ELVSLSEQQLVDC + + GC GG M SAF+YI
Sbjct: 130 CWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYG-------NDGCGGGWMTSAFDYI 182
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
GG++ E YPY D SC+FD + I A + + E+ + + GP++
Sbjct: 183 KDNGGIDTESSYPYEAED-RSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPIS--- 238
Query: 281 ASIELPHISFSFL 293
+I+ H SF F
Sbjct: 239 VAIDASHFSFQFY 251
>sp|P41715|CATV_NPVCF Viral cathepsin OS=Choristoneura fumiferana nuclear polyhedrosis
virus GN=Vcath PE=3 SV=1
Length = 324
Score = 152 bits (384), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 82/237 (34%), Positives = 134/237 (56%), Gaps = 20/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
+L A ++F F KF+K+Y+++ E RF++F+ NL + D TA + + KF+DL+
Sbjct: 21 VLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHNDSTAQYEINKFADLS 80
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q + +L P + P +FDWR VT VK+QG CG+
Sbjct: 81 KDETISKYTGLS----LPLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGA 136
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ G+LE + + ++LSEQQL+DCD D+GC+GGL+++AFE +
Sbjct: 137 CWAFATLGSLESQFAIKHNQFINLSEQQLIDCDF---------VDAGCDGGLLHTAFEAV 187
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
+ GG++ E DYPY + G C+ + +K V + I+ E+++ L GP+
Sbjct: 188 MNMGGIQAESDYPYEANN-GDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPI 243
>sp|P36400|LMCPB_LEIME Cysteine proteinase B OS=Leishmania mexicana GN=LMCPB PE=2 SV=2
Length = 443
Score = 152 bits (383), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262
>sp|P25783|CATV_NPVAC Viral cathepsin OS=Autographa californica nuclear polyhedrosis
virus GN=VCATH PE=1 SV=1
Length = 323
Score = 152 bits (383), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 86/237 (36%), Positives = 133/237 (56%), Gaps = 21/237 (8%)
Query: 46 LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
LL A ++F F +F+K Y ++ E RF++F+ NL + D +A + + KFSDL+
Sbjct: 21 LLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLS 79
Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
E ++ GL+ LP Q K +L P P +FDWR VT VK+QG CG+
Sbjct: 80 KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGA 135
Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
CW+F+ +LE + +L++LSEQQ++DCD D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186
Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
+K GGV+ E DYPY D +C+ + +K V + + I+ E+++ L GP+
Sbjct: 187 IKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPI 242
>sp|Q05094|CYSP2_LEIPI Cysteine proteinase 2 OS=Leishmania pifanoi GN=CYS2 PE=1 SV=1
Length = 444
Score = 151 bits (382), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/235 (39%), Positives = 123/235 (52%), Gaps = 19/235 (8%)
Query: 53 FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
F FK + + Y T E R F+ NL + Q +P A G+TKF DL+ +EF +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
+L R A + + +P DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
+EG +L+ ELVSLSEQQLV CD D GC+GGLM AF+++L+ G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 227 EREKDYPYTGTDGGSCKFDKSK----IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
E YPY +G + S + A + +I S E MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 263
>sp|P25775|LMCPA_LEIME Cysteine proteinase A OS=Leishmania mexicana GN=LMCPA PE=2 SV=1
Length = 354
Score = 151 bits (382), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 100/275 (36%), Positives = 143/275 (52%), Gaps = 27/275 (9%)
Query: 12 LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
LL + V+ V A+I Q P D+ + A H+ FK + K + E
Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPP-----VDNFV-ASAHYGSFKKRHGKAFGGDAEEG 60
Query: 72 YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
+RF FK N++ A +P A + V+ KF+DLTP EF + +L + R + K
Sbjct: 61 HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYARHLKN-HKED 119
Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
+ + P+ DWRD GAVT VK+QG CGSCW+FSA G +EG S LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179
Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
LV CD + D GCNGGLM+ A +I+++ G V E YPY T GG +
Sbjct: 180 MLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228
Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
D+ ++ A ++ F + DE+++A + K GP+A
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVA 263
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.133 0.391
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 109,964,099
Number of Sequences: 539616
Number of extensions: 4597424
Number of successful extensions: 10531
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 195
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 9923
Number of HSP's gapped (non-prelim): 231
length of query: 300
length of database: 191,569,459
effective HSP length: 117
effective length of query: 183
effective length of database: 128,434,387
effective search space: 23503492821
effective search space used: 23503492821
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (28.1 bits)
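
Note on the summary statistics: the effective lengths and the bit scores printed above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the BLAST output. It assumes the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, and it assumes that the second Lambda/K pair in the report is the gapped one. The function names are hypothetical, and Lambda and K are used as printed (rounded), so the bit scores agree only to rounding.

```python
import math

# Values copied from the summary block of this report.
QUERY_LEN = 300            # "length of query"
DB_LETTERS = 191_569_459   # "length of database"
DB_SEQS = 539_616          # "Number of sequences in database"
HSP_LEN = 117              # "effective HSP length"

# Karlin-Altschul parameters as printed (rounded values).
UNGAPPED = (0.317, 0.133)  # first Lambda/K pair
GAPPED = (0.267, 0.0410)   # second Lambda/K pair (assumed to be the gapped one)


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    # The HSP length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits with the Karlin-Altschul formula."""
    return (lam * raw_score - math.log(k)) / math.log(2)


eff_query, eff_db, search_space = effective_lengths(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", search_space)

# Raw score 399 is the CYSP4_DICDI hit; 61 and 41 are the S2 and S1 cutoffs.
print("bits for raw 399 (gapped):  %.1f" % bit_score(399, *GAPPED))
print("bits for S2 = 61 (gapped):  %.1f" % bit_score(61, *GAPPED))
print("bits for S1 = 41 (ungapped): %.1f" % bit_score(41, *UNGAPPED))
```

The three integer results reproduce the reported effective query length (183), effective database length (128,434,387), and effective search space (23503492821). The bit scores come out close to the printed 158, 28.1, and 21.7 bits, with small differences expected from the rounded parameters.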